Web and Wireless Geographical Information Systems: 9th International Symposium, W2GIS 2009, Maynooth, Ireland, December 7-8, 2009. Proceedings ... Applications, incl. Internet Web, and HCI)
This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
Lecture Notes in Computer Science Commenced Publication in 1973 Founding and Former Series Editors: Gerhard Goos, Juris Hartmanis, and Jan van Leeuwen
Editorial Board David Hutchison Lancaster University, UK Takeo Kanade Carnegie Mellon University, Pittsburgh, PA, USA Josef Kittler University of Surrey, Guildford, UK Jon M. Kleinberg Cornell University, Ithaca, NY, USA Alfred Kobsa University of California, Irvine, CA, USA Friedemann Mattern ETH Zurich, Switzerland John C. Mitchell Stanford University, CA, USA Moni Naor Weizmann Institute of Science, Rehovot, Israel Oscar Nierstrasz University of Bern, Switzerland C. Pandu Rangan Indian Institute of Technology, Madras, India Bernhard Steffen TU Dortmund University, Germany Madhu Sudan Microsoft Research, Cambridge, MA, USA Demetri Terzopoulos University of California, Los Angeles, CA, USA Doug Tygar University of California, Berkeley, CA, USA Gerhard Weikum Max-Planck Institute of Computer Science, Saarbruecken, Germany
5886
James D. Carswell A. Stewart Fotheringham Gavin McArdle (Eds.)
Web and Wireless Geographical Information Systems 9th International Symposium, W2GIS 2009 Maynooth, Ireland, December 7-8, 2009 Proceedings
13
Volume Editors James D. Carswell Digital Media Centre Dublin Institute of Technology Aungier Street Dublin 2, Ireland E-mail: [email protected] A. Stewart Fotheringham National University of Ireland Maynooth National Centre for Geocomputation John Hume Building Maynooth, Co. Kildare, Ireland E-mail: [email protected] Gavin McArdle University College Dublin School of Computer Science and Informatics Belfield, Dublin 4, Ireland E-mail: [email protected]
Library of Congress Control Number: 2009939279 CR Subject Classification (1998): H.2.8, J.2, J.7, K.6, H.2, H.3 LNCS Sublibrary: SL 3 – Information Systems and Application, incl. Internet/Web and HCI ISSN ISBN-10 ISBN-13
0302-9743 3-642-10600-5 Springer Berlin Heidelberg New York 978-3-642-10600-2 Springer Berlin Heidelberg New York
This volume contains the extended papers selected for presentation at the ninth edition of the International Symposium on Web & Wireless Geographical Information Systems (W2GIS 2009) hosted by the National Centre for Geocomputation in NUI Maynooth (Ireland). W2GIS 2009 was the ninth in a series of successful events beginning with Kyoto 2001, and alternating locations between East Asia and Europe. We invited submissions that provided an up-to-date review of advances in theoretical, technical, and practical issues of W2GIS and Intelligent GeoMedia. Reports on ongoing implementations and real-world applications research were particularly welcome at this symposium. Now in its ninth year, the scope of W2GIS has expanded to include continuing advances in wireless and Internet technologies that generate ever increasing interest in the diffusion, usage, and processing of geo-referenced data of all types - geomedia. Spatially aware wireless and Internet devices offer new ways of accessing and analyzing geo-spatial information in both real-world and virtual spaces. Consequently, new challenges and opportunities are provided that expand the traditional GIS research scope into the realm of intelligent media – including geomedia with context-aware behaviors for self-adaptive use and delivery. Our common aim is research-based innovation that increases the ease of creating, delivering, and using geomedia across different platforms and application domains that continue to have dramatic effect on today’s society. For the first time, W2GIS was pleased to invite 12 high-quality journal-length papers plus two shorter keynote papers for final presentation and inclusion in the Springer LNCS Volume 5886 proceedings. All papers submitted that were directly in scope with the symposium received at least three reviews and ranked accordingly. The final accepted papers cover a wide range of topics, including geospatial analysis, personalization and Semantic geospatial Web, W2GIS case studies, and Web and mobile applications and prototypes. We had the privilege of having two distinguished invited talks by Hanan Samet from the Department of Computer Science, University of Maryland in the USA, and Peter Fröhlich from FTW Telecommunications Research Center in Vienna, Austria. The best paper(s) from the symposium was selected by the Steering Committee and invited for journal publication in Transactions in GIS. We wish to thank the authors that contributed to this symposium for the high quality of their papers and presentations and the support of Springer LNCS. We would also like to thank once again the Program Committee for the quality and timeliness of their evaluations. Finally, many thanks to Christophe Claramunt and the entire W2GIS Steering Committee for providing continuous advice.
October 2009
James D. Carswell A. Stewart Fotheringham Gavin McArdle
W2GIS 2009 Organization
Symposium Chairs J.D. Carswell A.S. Fotheringham G. McArdle
Dublin Institute of Technology, Ireland National Centre for Geocomputation, NUIM, Ireland University College Dublin, Ireland
Local Chair E. Mac Aoidh
National Centre for Geocomputation, NUIM, Ireland
Steering Committee M. Bertolotto J.D. Carswell C. Claramunt M. Egenhofer K.J. Li T. Tezuka C. Vangenot
University College Dublin, Ireland Dublin Institute of Technology, Ireland Naval Academy Research Institute, France NCGIA, The University of Maine, USA Pusan National University, Korea Kyoto University, Japan EPFL, Switzerland
Program Committee A. Abdelmoty T. Badard A. Bouju T. Brinkhoff E. Camossi M.L. Damiani R.A. de By U. Demsar M. Duckham A. Edwardes P. Fröhlich K. Gardiner R. Güting B. Huang Y. Ishikawa B. Jiang H.A. Karimi
Cardiff University, UK Laval University, Canada University of La Rochelle, France IAPG, Germany University College Dublin, Ireland DICO - University of Milan, Italy ITC, The Netherlands National Centre for Geocomputation, Ireland University of Melbourne, Australia University of Zurich, Switzerland FTW, Austria Dublin Institute of Technology, Ireland Fernuniversität Hagen, Germany Chinese University of Hong Kong Nagoya University, Japan University of Gävle, Sweden University of Pittsburgh, USA
VIII
Organization
B. Kobben J. Komarkova Y.J. Kwon D. Li S. Li X. Li S. Liang M.R. Luaces E. MacAoidh H. Martin P. Muro-Medrano S. Mustiere D. Pfoser C. Ray P. Rigaux J. Sankaranarayanan M. Schneider C. Shahabi K. Sumiya Y. Theodoridis M. Tomko A. Winstanley S. Winter O. Wolfson A. Zipf
ITC, The Netherlands University of Pardubice, Czech Republic Hankuk Aviation University, Korea Hong Kong University of Science and Engineering Ryerson University, Canada East China Normal University, China University of Calgary, Canada University of Coruna, Spain National University of Ireland Maynooth University of Grenoble, France Universidad de Zaragoza, Spain IGN, France IMIS-R.C. Athens, Greece Naval Academy Research Institute, France University of Paris-Dauphine, France University of Maryland, USA University of Florida, USA University of Southern California, USA University of Hyogo, Japan University of Piraeus, Greece Universtiy of Zurich, Switzerland National University of Ireland Maynooth The University of Melbourne, Australia University of Illinois at Chicago, USA University of Bonn, Germany
Organization
Sponsors Science Foundation Ireland (SFI)
National University of Ireland, Maynooth (NUIM)
National Centre for Geocomputation (NCG), Ireland
Strategic Research in Advanced Geotechnologies (StratAG)
Session 1 – Geospatial Analysis Moving Phenomenon: Aggregation and Analysis of Geotime-Tagged Contents on the Web . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Kyoung-Sook Kim, Koji Zettsu, Yutaka Kidawara, and Yasushi Kiyoki GeoPW: Towards the Geospatial Processing Web . . . . . . . . . . . . . . . . . . . . Peng Yue, Jianya Gong, Liping Di, Jie Yuan, Lizhi Sun, and Qian Wang
7
25
Session 2 – Personalization and Semantic Geo-spatial Web Towards an Integration of Space and Accessibility in Web Personalization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Mohamed Ramzi Haddad, Hajer Baazaoui, Marie Aude Aufaure, Christophe Claramunt, Yves Lechevallier, and Henda Ben Ghezala
39
Tiled Vectors: A Method for Vector Transmission over the Web . . . . . . . . Vyron Antoniou, Jeremy Morley, and Mordechai (Muki) Haklay
56
Semantic Challenges for Sensor Plug and Play . . . . . . . . . . . . . . . . . . . . . . . Arne Br¨ oring, Krzysztof Janowicz, Christoph Stasch, and Werner Kuhn
A Study of Spatial Interaction Behaviour for Improved Delivery of Web-Based Maps . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Eoin Mac Aoidh, David C. Wilson, and Michela Bertolotto
120
Session 4 – Web and Mobile Applications and Prototypes EgoViz – A Mobile Based Spatial Interaction System . . . . . . . . . . . . . . . . . Keith Gardiner, Junjun Yin, and James D. Carswell Wired Fisheye Lens: A Motion-Based Improved Fisheye Interface for Mobile Web Map Services . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Daisuke Yamamoto, Shotaro Ozeki, and Naohisa Takahashi Automatic Generation of an Adaptive WebGIS . . . . . . . . . . . . . . . . . . . . . . Sergio Di Martino, Filomena Ferrucci, Gavin McArdle, and Giacomo Petillo Campus Guidance System for International Conferences Based on OpenStreetMap . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Ricky Jacob, Jianghua Zheng, Bla˙zej Ciepluch, Peter Mooney, and Adam C. Winstanley Author Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
135
153 171
187
199
Database and Representation Issues in Geographic Information Systems (GIS) Hanan Samet Computer Science Department, Center for Automation Research, Institute for Advanced Computer Studies, University of Maryland, College Park, Maryland 20742 [email protected] www.cs.umd.edu/~hjs Abstract. A review is provided of some database and representation issues involved in the implementation of geographic information systems (GIS).
The increasing popularity of web-based mapping systems such as Microsoft Virtual Earth and Google Earth and Maps, as well as other software offerings that are coupled with portable devices, such as the iPhone, has led to a proliferation of services that are characterized as being location-based. The data provided by these services is differentiated from other offerings by the presence of a locational component. In the past, this type of data was found primarily in geographic information systems (GIS). The available technology led to a focus on the paper map as the output device for responses. Since anything is better than drawing by hand, there was little emphasis on efficiency measures such as minimization of execution time. However, the emergence of of display devices has changed the mode of operation to one of expecting answers relatively quickly. This has had a number of effects. First, the paper medium supports relatively high resolution output while the display screen is usually of a more limited resolution, thereby enabling the use of less precise algorithms. For example, a spatial range query such as a buffer/corridor (e.g., a query that seeks all cities of population in excess of 20,000 within 100 miles of the Mississippi River) often takes quite a bit of time to compute (i.e., the extent of the spatial region in question) when using the Euclidean distance. Instead, we can use a Chessboard distance (L∞ ) to approximate the Euclidean distance coupled with a quadtree representation to yield algorithms that are several orders of magnitude faster [1] but, of course, may not be as precise. Second, the ability to obtain answers quickly led to an increase in the volume of data that is input to the system. Such high volumes of data need to be organized and the obvious next step was to make use of databases, which meant
This work was supported in part by the National Science Foundation under Grants IIS-08-12377 CCF-08-30618, and IIS-07-13501, as well as NVIDIA Corporation, Microsoft Research, Google, the E.T.S. Walton Visitor Award of the Science Foundation of Ireland, and the National Center for Geocomputation at the National University of Ireland at Maynooth.
J.D. Carswell et al. (Eds.): W2GIS 2009, LNCS 5886, pp. 1–6, 2009. c Springer-Verlag Berlin Heidelberg 2009
2
H. Samet
incorporating this technology in the GIS thereby transitioning the field to one that is also known as spatial databases [2,3,4,5,6,7]. A natural outgrowth of this transition, which will eventually emerge, is the use of spatial spreadsheets [8, 9] as they traditionally enable users to ask what-if questions and see the results instantaneously. There are several possible innovations here. One innovation lies in the fact that the map plays the role of the spreadsheet with the the different attributes being the columns and the rows playing the roles of the individual tuples. The map can be used to visualize spatial variability by using sliders to select ranges of values of the attributes and show the tuples that satisfy them. A second innovation lies in the fact that as compositions of operations are performed, data is generated with a locational component [10, 11]. The results can be viewed as maps and are stored as relations containing the tuples that satisfy the queries. For example, suppose that we want to find the locations of bridges and tunnels. One way to do so, assuming the existence of a roads and a rivers relation, is to take the spatial join of the two relations. The result would be a set of pairs of the form (roadi,riverj ) where the spatial attribute would consist of the location(s) that roadi and riverj have in common. Notice that the input spatial attribute of both the roads and rivers relation is a line segment or a collection of piecewise linear line segments. In addition to speed, there has also been much activity in expanding the range of queries for which answers are expected. In particular, spatial queries can be broken down into the following two classes: location-based and feature-based. A location-based query takes a location, traditionally specified using lat/long coordinate values, as an argument, and returns a set of features that are associated with the location, while a feature-based query takes a feature as an argument and returns the set of locations with which the feature is associated. The queries can also be characterized as functions where one function is viewed as the inverse of the other. Feature-based queries have also become known as spatial data mining [12, 13]. Although features are usually properties of the data such as crops, soil types, zones, etc., they can be much more diversified. In particular, they can correspond to collections of unstructured data such as documents in which case the queries reduce to finding all documents that mention location X, and possibly others, or finding all locations mentioned in document Y , and possibly others, that could be related in some manner. STEWARD [14] is an example of a system that supports such queries on a collection of documents on the hidden web, while NewsStand [15] is an example of a system that supports such queries on a collection of news articles where the relationship between the documents is that they are on the same or related set of topics. The two systems are also distinguished by the fact that the collection of documents in the former is relatively static, while in the latter it is very dynamic in the sense that its composition is constantly changing as new articles are processed and old ones fade away in importance.
Database and Representation Issues in GIS
3
The inclusion of documents in the range of features reminds us that although spatial data is usually specified geometrically, in this case it can also be specified using collections of words of text that can be (but are not required to be) interpreted as the names of locations Textual data that corresponds to spatial data are called toponyms and its specification invariably involves some ambiguity. This ambiguity has both advantages and disadvantages. The advantage of the ambiguity is that, from a geometric standpoint, the textual specification captures both point and spatial extent interpretations of the data (analogous to a polymorphic type in parameter transmission which serves as the cornerstone of inheritance in object-oriented programming languages). For example, geometrically, a city can be specified by either a point such as its centroid, or a region corresponding to its boundary, the choice of which depends on the level of zoom with which the query interface is activated. On the other hand, the disadvantage of the ambiguity is that we are not always sure if a term is a geographic location or not (e.g., does “Jordan” refer to a country or is it a surname as in “Michael Jordan”?). Moreover, if it is a geographic location, then which, if any, of the possibly many instances of geographic locations with the same name is meant (e.g., does “London” refer to an instance in the UK, Ontario, Canada, or one of many more others?). The examples that we have outlined serve to show many of the database and representation issues involved in geographic information systems. Some additional issues include: 1. How to integrate spatial with nonspatial data in a seamless manner. 2. Retrieval is facilitated by building an index (e.g., [16,17,18]). There is a need to find a way to sort the data [19]. The index should be compatible with the data being stored. We need an implicit rather than an explicit index as it is impossible to foresee all of the possible queries in advance. For example, if we sort all cities with respect to their distance from Chicago, we can find the nearest city to Chicago with population in excess of 200,000. However, this sort will not help in finding the closest city to Denver with population in excess of 200,000. 3. There is a need to identify the possible queries and to find their analogs in a conventional database [20, 21, 22]. For example, a map in a spatial database is like a relation in a conventional database. However, the difference is the presence of input spatial attributes and also the presence of output spatial attributes as in the rives/roads example. Another example, is the combination of ranking and the distance semi-join [23] to yield a discrete Voronoi diagram (e.g., [24]) and the ability to do clustering. A recent example, is the ability to perform queries on spatial networks using simple SQL commands [25]. 4. How to interact with the database. SQL may not always be easy to adapt. It may be desirable to make use of a graphical query language. 5. Determining what functionality users really desire and need, and providing it. 6. How to ensure the spatial integrity of the data such as that the edges of a polygon link to form a complete object, that line segments do not intersect except at vertices, that contour lines do not cross, etc.
4
H. Samet
7. Develop a strategy for answering a query that mixes spatial data with nonspatial data. This implies a need for query optimization strategies [26], which in turn calls for the definition of selectivity factors. This depends on whether or not an index exists on the spatial data. If not, then select on the nonspatial data first. Otherwise, the situation is more complex as we perform the spatial selection first only if there is high spatial selectivity (e.g., the range in a spatial range query is small). 8. How to incorporate time-varying data as well as deal with the fact that temporal data, as well as spatial data, is also continuous rather than being restricted to discrete which is the case when the valid time and transaction time primitives are being used. This will enable the handling of rates. 9. Processing data lying on a spatial network using network distance [25, 27, 28, 29, 30, 31]. 10. How to incorporate imagery into the database. 11. Interoperability. 12. How to make use of advanced computing architectures such as GPUs (e.g., [32]). 13. Resolving ambiguities in the textual specification of spatial data with no errors (or almost none). 14. Determining the geographic focus of a set of documents on a related topic. As the above show, many database and representation issues are involved in geographic information systems that need resolution, thereby forming a vibrant area of research.
References 1. Ang, C.H., Samet, H., Shaffer, C.A.: A new region expansion for quadtrees. IEEE Transactions on Pattern Analysis and Machine Intelligence 12, 682–686 (1990); Also see Proceedings of the Third International Symposium on Spatial Data Handling, Sydney, Australia, pp. 19–37 (August 1988) 2. G¨ unther, O.: Environmental Information Systems. Springer, Berlin (1998) 3. G¨ uting, R.H.: An introduction to spatial database systems. VLDB Journal 3, 401– 444 (1994) 4. Laurini, R., Thompson, D.: Fundamentals of Spatial Information Systems. Academic Press, San Diego (1992) 5. Rigaux, P., Scholl, M., Voisard, A.: Spatial Databases with Application to GIS. Morgan Kaufmann, San Francisco (2002) 6. Shekhar, S., Chawla, S.: Spatial Databases: A Tour. Prentice-Hall, Englewood Cliffs (2003) 7. Worboys, M.: GIS A Computing Perspective. Taylor & Francis, London (1995) 8. Iwerks, G.S., Samet, H.: The spatial spreadsheet. In: Huijsmans, D.P., Smeulders, A.W.M. (eds.) VISUAL 1999. LNCS, vol. 1614, pp. 317–324. Springer, Heidelberg (1999) 9. Iwerks, G.S., Samet, H.: The internet spatial spreadsheet: enabling remote visualization of dynamic spatial data and ongoing query results over a network. In: Hoel, E., Rigaux, P. (eds.) Proceedings of the 11th ACM International Symposium on Advances in Geographic Information Systems, New Orleans, LA, pp. 154–160 (2003)
Database and Representation Issues in GIS
5
10. Samet, H., Alborzi, H., Brabec, F., Esperan¸ca, C., Hjaltason, G.R., Morgan, F., Tanin, E.: Use of the SAND spatial browser for digital government applications. Communications of the ACM 46, 63–66 (2003) 11. Brabec, F., Samet, H.: Client-based spatial browsing on the world wide web. IEEE Internet Computing 11, 52–59 (2007) 12. Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kaufmann, San Francisco (2000) 13. Aref, W.G., Samet, H.: Efficient processing of window queries in the pyramid data structure. In: Proceedings of the 9th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems (PODS), Nashville, TN, pp. 265–272 (1990); Also in Proceedings of the Fifth Brazilian Symposium on Databases, Rio de Janeiro, Brazil, April 1990, pp. 15–26 (1990) 14. Lieberman, M.D., Samet, H., Sankaranarayanan, J., Sperling, J.: STEWARD: architecture of a spatio-textual search engine. In: Samet, H., Schneider, M., Shahabi, C. (eds.) Proceedings of the 15th ACM International Symposium on Advances in Geographic Information Systems, Seattle, WA, pp. 186–193 (2007) 15. Teitler, B., Lieberman, M.D., Panozzo, D., Sankaranarayanan, J., Samet, H., Sperling, J.: NewsStand: A new view on news. In: Aref, W.G., Mokbel, M.F., Samet, H., Schneider, M., Shahabi, C., Wolfson, O. (eds.) Proceedings of the 16th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, Irvine, CA, pp. 144–153 (2008) (2008 ACM SIGSPATIAL (ACMGIS 2008) Best Paper Award) 16. Samet, H.: Applications of Spatial Data Structures: Computer Graphics, Image Processing, and GIS. Addison-Wesley, Reading (1990) 17. Samet, H.: The Design and Analysis of Spatial Data Structures. Addison-Wesley, Reading (1990) 18. Samet, H.: Foundations of Multidimensional and Metric Data Structures. MorganKaufmann, San Francisco (2006) 19. Samet, H.: A sorting approach to indexing spatial data. International Journal on Shape Modeling 14, 15–37 (2008) 20. Aref, W.G., Samet, H.: An approach to information management in geographical applications. In: Proceedings of the 4th International Symposium on Spatial Data Handling, Zurich, Switzerland, vol. 2, pp. 589–598 (1990) 21. Aref, W.G., Samet, H.: Extending a DBMS with spatial operations. In: G¨ unther, O., Schek, H.J. (eds.) SSD 1991. LNCS, vol. 525, pp. 299–318. Springer, Heidelberg (1991) 22. Samet, H., Aref, W.G.: Spatial data models and query processing. In: Kim, W. (ed.) Modern Database Systems, The Object Model, Interoperability and Beyond, pp. 338–360. ACM Press/Addison-Wesley, New York (1995) 23. Hjaltason, G.R., Samet, H.: Incremental distance join algorithms for spatial databases. In: Hass, L., Tiwary, A. (eds.) Proceedings of the ACM SIGMOD Conference, Seattle, WA, pp. 237–248 (1998) 24. Samet, H., Phillippy, A., Sankaranarayanan, J.: Knowledge discovery using the SAND spatial browser. In: Proceedings of the 7th National Conference on Digital Government Research, Philadelphia, PA, pp. 284–285 (2007) 25. Sankaranarayanan, J., Samet, H.: Distance oracles for spatial networks. In: Proceedings of the 25th IEEE International Conference on Data Engineering, Shanghai, China, pp. 652–663 (2009)
6
H. Samet
26. Aref, W.G., Samet, H.: Optimization strategies for spatial query processing. In: Lohman, G.M., Sernadas, A., Camps, R. (eds.) Proceedings of the 17th International Conference on Very Large Databases (VLDB), Barcelona, Spain, pp. 81–90 (1991) 27. Sankaranarayanan, J., Samet, H., Alborzi, H.: Path oracles for spatial networks. In: Proceedings of the VLDB Endowment PVDB: Proceedings of the 35th International Conference on Very Large Data Bases (VLDB), Lyon, France, vol. 2, pp. 1210–1221 (2009) 28. Samet, H., Sankaranarayanan, J., Alborzi, H.: Scalable network distance browsing in spatial databases. In: Proceedings of the ACM SIGMOD Conference, Vancouver, Canada, pp. 43–54 (2008); Also see University of Maryland Computer Science Technical Report TR–4865, April 2007 (2008 ACM SIGMOD Best Paper Award) 29. Sankaranarayanan, J., Alborzi, H., Samet, H.: Efficient query processing on spatial networks. In: Proceedings of the 13th ACM International Symposium on Advances in Geographic Information Systems, Bremen, Germany, pp. 200–209 (2005) 30. Sankaranarayanan, J., Alborzi, H., Samet, H.: Enabling query processing on spatial networks. In: Proceedings of the 22nd IEEE International Conference on Data Engineering, Atlanta, GA, p. 163 (2006) 31. Sankaranarayanan, J., Alborzi, H., Samet, H.: Distance join queries on spatial networks. In: Proceedings of the 14th ACM International Symposium on Advances in Geographic Information Systems, Arlington, VA, pp. 211–218 (2006) 32. Lieberman, M.D., Sankaranarayanan, J., Samet, H.: A fast similarity join algorithm using graphics processing units. In: Proceedings of the 24th IEEE International Conference on Data Engineering, Cancun, Mexico, pp. 1111–1120 (2008)
Moving Phenomenon: Aggregation and Analysis of Geotime-Tagged Contents on the Web Kyoung-Sook Kim1 , Koji Zettsu1 , Yutaka Kidawara1, and Yasushi Kiyoki1,2 1
Knowledge Creating Communication Research Center National Institute of Information and Communications Technology 3-5 Hikaridai, Seika-cho, Sora-gun, Kyoto 619-0289 Japan {ksookim,zettsu,kidawara}@nict.go.jp 2 Faculty of Environmental Information Keio University 5322 Endo, Fujisawa-shi, Kanagawa 252-8520, Japan [email protected]
Abstract. The analysis of movement of people, vehicles, and other objects is important for carrying out research in social and scientific domains. The study of movement behavior of spatiotemporal entities helps enhance the quality of service in decision-making in real applications. However, the spread of certain entities such as diseases or rumor is difficult to observe compared to the movement of people, vehicles, or animals. We can only infer their locations in a certain region of spacetime on the basis of observable events. In this paper, we propose a new model, called as moving phenomenon, to represent time-varying phenomena over geotime-tagged contents on the Web. The most important feature of this model is the integration of thematic dimension into an event-based spatiotemporal data model. By using the proposed model, a user can aggregate relevant contents relating to an interesting phenomenon and perceive its movement behavior; further, the model also enables a user to navigate the spatial, temporal, and thematic information of the contents along all the three-dimensions. Finally, we present an example of typhoons to illustrate moving phenomena and draw a comparison between the movement of the moving phenomenon created using information from news articles on the Web and that of the actual typhoon.
1
Introduction
The development of mobile positioning systems and wireless communication technologies has enabled people to investigate and analyze the movements of various entities involving objects and phenomena such as animals, humans, vehicles, and hurricanes. Many researchers have developed various tools and techniques to analyze trajectory information and acquire useful knowledge about movement behaviors or activities of moving entities. In particular, moving-object J.D. Carswell et al. (Eds.): W2GIS 2009, LNCS 5886, pp. 7–24, 2009. c Springer-Verlag Berlin Heidelberg 2009
8
K.-S. Kim et al.
data models have been proposed to represent the trajectory data of moving objects as time-varying spatial objects. In contrast to spatiotemporal data models, which are represented in a discrete manner, moving-object data models indicate the movements of entities continuously through time. However, we meet certain challenges when representing many kinds of phenomena using movingobject models, such as the spread of an infectious disease, circulation of rumor, and effects of global warming, because most of the phenomena are modeled by adopting an object-oriented approach. From a high-level ontological perspective, we can make a distinction between continuants (objects) and occurrents (events). While a continuant is an entity in the world that endures through time, such as people, airplanes, and volcanos, an occurrent represents an entity that happens or occurs and unfolds itself through time such as a human life, a flight, and an eruption. In [9], Grenon and Smith introduced SNAP ontology for continuants and SPAN ontology for occurrents as upper-level ontologies to represent the dynamic aspects of the world in spatiotemporal domains. According to this point of view, we can think of movement as a collection of events whose location changes over time. In this paper, we propose a new model, called as moving phenomenon, to represent the continuous movement of a phenomenon from a user-centric perspective as a collection of occurrences of the world. We particularly take Web contents relevant to real-world occurrences into account. Nowadays, people create, distribute, and share their information on maps through Geospatial Web (in short, Geoweb) technologies [19]. Goodchild defines volunteered geographic information (VGI) as a special case of user-generated geospatial content on the Geoweb and discusses the role of people as sensors in monitoring the world [8]. For example, users can share their opinions or activities with friends and visitors using blogs, photos, music, or videos with spatial coordinates and time metadata on Web communities. Further, news articles that usually deal with events and accidents occurring in our lives contain spatial and temporal metadata created by geo-parsing and geo-coding. As a result, the amount of spatial and temporal data, large parts of which describe information related to facts, events, activities, and experiences, on the Web is increasing rapidly. In this paper, we call such contents geotime-tagged contents, and the moving phenomenon aggregates them with respect to a user query along three dimensions: space, time, and theme. By transforming the contents into a moving phenomenon, we acquire knowledge of certain phenomenon on the basis of continuous movement information and semantic relevance. The remainder of the paper is organized as follows: In section 2, we review related Geoweb technologies and spatiotemporal data models and address the motivation for our work. Section 3 introduces the concepts of geo-events and moving phenomena. In Section 4, we describe a framework to create moving phenomena over the contents on the Web, and in Section 5, we present a proto-system for implementing the moving phenomenon framework. Finally, we conclude this paper with future research issues in Section 6.
Aggregation and Analysis of Geotime-Tagged Contents on the Web
2
9
Motivation and Related Work
Everything happens somewhere, sometime. We are always interested in question such as “when”, “where”, “what happened”, “what is happening”, or “what is going to happen.” Hence, we read newspapers, communicate with other people, and gather information pertaining to our daily lives. In recent years, Geoweb technologies have made it easier for people to receive information and create and publish contents about events or phenomena occurring around the world. In particular, the geo-tagging processes and map-based APIs of various Geoweb services such as Google Maps, Yahoo Maps, Bing Maps, and OpenLayers affix spatial and temporal metadata to Web contents such as photos, videos, web pages, or RSS feeds. The Web-a-Where system proposed in [4] associates Web pages with locations by using the names of the places appearing on the pages. In addition, there is an increase in the number of geo-tagged pictures that are automatically or manually generated by digital cameras with GPS or by users on the Web [22]. GeoTracker presented in [5] is a spatiotemporal navigator of Really Simple Syndication (RSS) feeds. It performs location mining from the text of RSS feeds and presents them on the world map. Moreover, it supports a temporal navigation for feeds with time sliding bar. NewsStand proposed in [21] is also a good example of a navigator to display news articles using a map interface. When compared to GeoTracker, in which content is described along spatial and temporal dimensions, NewsStand focuses on the space and topic. In [25], a method to automatically extract events from blog entries that describe events as experienced in the real world is presented. We can find a large amount of contents related to events of the real world described along the three dimensions—spatial, temporal, and thematic. Furthermore, some contents discuss the same topics even though they are generated at different locations and times and by different users. Therefore, we need a data model to handle and analyze such data on the basis of space, time, and topic [18]. In particular, we are interested in the representation of time-varying phenomena occurring in the world by means of Web contents. GIS researchers have developed a number of spatiotemporal data models, which are reviewed in [17]. Their aim was to represent dynamic geographic objects or temporal changes in geographic phenomena. While earlier models were concerned with the representation of changes in a discrete manner, current spatiotemporal models are trying to capture not only the discrete changes but also the continuous changes in moving entities. Moving-object data models represent spatial objects as points, lines, areas, or volumes that change continuously over time. They are conceptually defined as function mobject : time → spatial-object; this means that we can estimate the position of an object at any time during its lifetime. Among the moving-object data models developed, the models of moving points and moving regions introduced in [6,10] have provided a basis for analyzing past trajectories of moving objects. It is based on the sliced representation with sampling positions and a moving function which estimates the position at any time. However, we are presented with two challenges while trying to directly apply the moving-object data model to the phenomena described in the Web
10
K.-S. Kim et al.
(a) A time-varying spatial region of a ty- (b) the thematic information of a typhoon phoon; Fig. 1. A typhoon phenomenon
contents. First, it is designed by using an object-oriented approach. Within GIS communities, an object represents a discrete and independent entity having a geographical boundary. In the case of many phenomena, however, it is very difficult to identify the domain area of their locations and track their movements. We can only infer their location in a certain region of space-time on the basis of observed independent occurrences. Second, it focuses on the existence of objects over time to answer the question “Where is the object at a specific time?” that is, it dose not describe the semantics of the movement of objects. In [11,24], event-based approaches are presented to capture the dynamic aspects of moving objects with descriptions of the semantics of real-world occurrences. For example, locations and descriptions of specialized events such as meetings, business trips, or lunches represent specific semantics of the movement of objects. However, their integration into Web contents is still not possible because they do not support complex thematic analysis operations. In addition, events are specified by a given object, region, and application domain. Therefore, we need another event-oriented approach that can be adapted to Web contents with respect to space, time, and topic.
3
From Geo-events to Moving Phenomena
In this section, we describe the feature of our model that enables the representation of spatiotemporal and thematic information by geotime-tagged contents. Geotime-tagging is performed to add the metadata of location and time to the contents. Our previous study [13] was focused on the retrieval of content related to phenomena by carrying out a spatiotemporal operation. Figure 1(a) shows a time-varying spatial region that is created from the trajectory of a typhoon by
Aggregation and Analysis of Geotime-Tagged Contents on the Web
11
Theme (What) Photos
Sensor data
Blogs
News
Geo-Event Real world
Web
Space (Where)
Time (When)
Fig. 2. Representation of geo-events on the basis of the cognitive systems of what, where, and when t
Space-time Point y Space-time Line
Space-time Region
Space-time Volume
x
Fig. 3. Geometric representation of geo-events in the space-time space
carrying out a buffering operation. However, it is difficult to estimate the effects of the typhoon by only considering its geometric shape. If it contains thematic information, as shown in Figure 1(b), we can easily capture the effects of the typhoon phenomenon and assess the events or situations caused by it. Moreover, we do not have any information about the spatiotemporal region even though we have conceptual information in the case of most of the phenomena. Hence, in our model, we focus on the construction of time-varying phenomena that contain information about the spatiotemporal region and thematic information based on geotime-tagged contents. In this study, we assume that a geotime-tagged content expresses a certain phenomenon occurring in the world around us, for example, a new article that reports on the serious damages caused by hurricanes, a web page that reveals changes in the weather in Japan, and a blog entry that describes the activities undertaken during summer vacation. In a number of occurrences, we design moving phenomena to represent the movements of processes related to interesting topics. First of all, we explain geo-events, which describe “what happened at a specific location at a specific time?” In order to transform a geotime-tagged content into a geo-event, we employ the well-known systems of what, where, and when, as shown in Figure 2. Thus, a geo-event is defined as follows: Definition 1. Geo-Events A geo-event corresponding to a geotime-tagged content is defined as a triple of (S, T, K), where S is the location with (x, y) coordinates (longitude, latitude),
(a) An instance of GeoRSS feeds time (t) LC: (45.256, -71.92) T: 2005-08-17T07:02:32Z K: {(Mona,1), (Passage,1), (big, 1)}
categories by keyword vectors longitude (y)
latitude (x)
(b) Spatiotemporal and thematic representation Fig. 4. Example of geo-events encoded from instances of GeoRSS feeds
T is the time instance (or interval), and K is a vector of keywords with associated weights (w1 , w2 , ..., wd ) in d-dimensional vector-space representing the importance of the keywords in the event. On the basis of the location and time information, each geo-event is depicted by a three-dimensional geometry—a space-time point, line, region, or volume—in the space-time cube in which the two horizontal dimensions represent geographic space and the vertical dimension represents time, as shown in Figure 3. In this paper, we have restricted the geometry to space-time points for the sake of simplification, but our model of moving phenomena is compatible with the other types of spatiotemporal geometries. Figure 4(a) show GeoRSS feeds as an example of geotime-tagged contents represented by space-time points. From such instances, we can encode geo-events and represent them spatiotemporally and thematically, as shown in Figure 4(b). Now, we explain moving phenomena as the abstraction of objects from individual geo-events. As mentioned in [7], we may obtain useful information by assigning a set of points to a region. In Figure 5(a), for example, there are two types of points. In the figure, it is a slightly difficult to recognize spatial relationships between point-set A and point-set B. However, if we generalize them as two regions, a new topological relationship appears, as shown in Figure 5(b). By aggregating geo-events relevant to a phenomenon that varies continuously across
Aggregation and Analysis of Geotime-Tagged Contents on the Web y
13
y
A B
A B
x
(a) Two clusters of points
(b) Intersection relationship between two regions occupied by clusters
Fig. 5. Deriving a new spatial relationship from the individual 2D points
a region of space and time but whose position and shape is hard to track, we can understand its processes over time. Hence, we intuitively define a moving phenomenon as a group of geo-events and represent its boundary shape on the basis of the spatiotemporal volume occupied by these events in a three-dimensional space-time domain. The shape information helps users identify the movement behavior of the phenomenon. If we only consider the geometric information of moving phenomena, we can use the data model of moving object databases to represent moving phenomena, such as moving points or moving regions. However, semantic factors also play a very important role in defining geo-events and moving phenomena. For example, let us consider phenomena related to global warming. Global warming refers to an increase in the average temperature of Earth; however, people have their own perceptions about global warming. Hence, we use the keyword vector in our definition in order to connect specific events to a certain phenomenon such as global warming. The vector-space model based on term weights has been widely used for text-based information retrieval [15]. Furthermore, we need to respond to another challenge in order to design the moving phenomena—spatially and temporally reconstructing continuous phenomena from a set of discrete sampling geo-events. In other words, given a set of samples, we should be able to estimate aspects of a phenomenon at an unsampled location. For reconstructing continuous phenomena, we introduce the concept of coverage, which is commonly used in GIS and geoscientific applications. A coverage function is proposed for representing continuous geographic phenomena within a bounded spatiotemporal domain. It assigns a value or a set of values of defined data types to a position within its domain [12]. Conceptually, a coverage function is denoted as a mapping function f of a given domain D to a set of values V , i.e., f : D → V . Figure 6 shows the difference between moving objects with movement functions and moving phenomena. We need coverage functions to map spatiotemporal locations with topic keywords in order to construct a moving phenomenon over the geo-events. Therefore, the moving phenomena are defined as follows:
14
K.-S. Kim et al. time (t)
Coverage function
Movement function
longitude (y) latitude (x)
Fig. 6. Trajectories of moving phenomena with movement and coverage functions
Definition 2. Moving phenomena A moving phenomenon of a set of geo-events GE = {e1 , e2 , .., en } is a function mp to return a keyword vector from its domain for any location within a bounded spatiotemporal volume, i.e., mp : (x, y, t) → k such that ∀ (x, y, t) ∈ D(GE), where x and y are coordinates of longitude and latitude, t is a timestamp, D(GE) is the three-dimensional volume representing the collective location of geo-events in a spatiotemporal domain, K(GE) is the collection of keyword vectors of geo-events, and k is a keyword vector derived from K(GE). According to our definition, a moving phenomenon may be regarded as a type of geo-event. By integrating space, time, and theme into moving phenomena, we can understand the movement behavior and the topic trend of an interesting phenomenon occurring in the real world.
4
Framework of Moving Phenomena
Here, we explain the implementation of the framework required to create moving phenomena over the Web contents. The purpose of our framework is to help users understand the movement (or shape) and topic information of the geoevents relevant to a given user query. In brief, a user can input keywords to describe a moving phenomenon according to his/her point of view and can restrict the geo-events to a particular area of interest by entering a specific location and time. The system retrieves geo-events related to the user query, and constructs a moving phenomenon by using the representation data model. The user can navigate through the created user-centric moving phenomena with the interfaces of a three-dimensional space-time viewer that combines geographic space and time. Figure 7 illustrates the overall system architecture of our moving phenomenon framework. The framework consists of three main parts—geo-event convertor, moving phenomenon engine, and GeoT3D visualization tool—which are discussed in the following subsections.
Aggregation and Analysis of Geotime-Tagged Contents on the Web
GeoT3D VT
GeoNames
15
Moving Phenomenon Engine
rssToGeo Geo-Event Convertor
search
geo-events {(S1,T1,K1), }
…
Geo-Event DB
RSS feeds, blog entries, news articles, photos, etc.
Fig. 7. Overview of moving phenomenon framework
4.1
Geo-event Convertor
The geo-event convertor generates geo-events from Web contents; however, these geo-events are generated after geotime-tagging if the convertor requires metadata about the location and time of the contents. In order to realize the framework, we perform geotime-tagging by employing a simple method that uses Geonames service [2] for geo-parsing and geo-coding. We mainly collect GeoRSS feeds obtained via the RSS-to-GeoRSS convertor of Geonames service from various sites. In the case of GeoRSS, we use tags of location and time on the contents for the generation of geo-events. However, we need to carry out geotime-tagging on the basis of named entity recognition for certain contents that do not contain geographic or temporal tags. The named entity recognition is used to classify text elements into categories such as location, organization, and times by natural language processing and to put tags to each element. Among the above mentioned categories, the entities that indicate location and organization are used to derive positions, and time entities are used for the time stamping of geoevents. We need more sophisticated methods in order to determine the location and time information from the named entities; however, we can eliminate the requirement of such methods by using a heuristic and simple method and by concentrating upon the problem to capture the movements of phenomena, rather than geotime-tagging. Next, the convertor encodes the keyword vectors of pairs (term, weight) to represent thematic information of geo-events, excluding stop words. The weight of a term denotes the importance of a term in the content of a geo-event. Even though there are several variants of term-weighting for the vector space model, such as a well-known method used in information retrieval (IR) systems [20], we apply the tf-idf weighting scheme that is based on the number of occurrences of each term. This scheme is given by w(t, e) = tf t,e × ief t
(1)
where w(t, e) is the weight value of a term t in a geo-event vector e, tft,e is the number of occurrences of t in the geo-event vector, and ieft is the inverse event frequency that indicates the general importance of the term t by considering
Fig. 8. Data model for the representation of moving phenomena
the total number of events. The last parameter is computed by the expression ieft = logN/eft, where N is the total number of geo-events and eft is the number of geo-events that contain t. Further, it is very important to assign a weight to each term in each content to improve the search effectiveness of an IR system. However, this study does not aim to describe high-precision IR system, but rather seeks to present a model to aggregate and analyze the geo-events with spatiotemporal dimensions. Finally, the converted geo-events are stored in a database. 4.2
Moving Phenomenon Engine
The moving phenomenon engine plays a key role in our framework. It searches geo-events on the basis of a user query based by measuring the similarity between the query vector and the keyword vector of a geo-event. In the vector space model, the cosine of the angle between two vectors (dot product) is typically used as a measure of similarity. If e is a geo-event vector, q is the query vector, and t is a term, then the similarity can be calculated as t∈q,e wt,q · wt,e Sim(e, q) = , (2) 2 · 2 w w t,q t,e t∈q,e t∈q,e where wt,q is the weight value of the term t in the query vector and wt,e is the event vector; the weight values are given by eq. (1). We select the geo-events whose scores are greater than a user-controlled threshold, computed by eq. (2) and transform them into as a moving phenomenon by using the data model shown in Figure 8. In Section 3, we defined a moving phenomenon as an aggregation of geo-events. However, we define the class of UnitPhenomenon between the classes of MovingPhenomenon and GeoEvent in order to realize the coverage functions that fill whole space among sample events inside of a moving phenomenon. This data model is motivated from
Aggregation and Analysis of Geotime-Tagged Contents on the Web t
17
t
y
y
x
x
(a) Delaunay tetrahedralization with geo-events e1 (x1, y1, t1, k1)
e (x, y, t)
k
e4 (x4, y4, t4, k4)
4
e2 (x2, y2, t2, k2)
f (e) =
∑V i =1 4
i
∑V
(e)k i i
= k
(e )
i =1
e3 (x3, y3, t3, k3)
e1
e1
e1
e e4 e2
e
e4
V2
V1
e V3 e
e2
V4
e4 e3
e3
e2
e3
(b) Interpolation by barycentric coordinates Fig. 9. Example of moving phenomenon creation using Delaunay tetrahedralization
the sliced representation of moving-object data models. We consider several coverage functions that are based on interpolation methods in order to estimate the value of the attributes at an unsampled location when a set of sample data is given. The movement/shape of a unit phenomenon depends on the type of coverage functions, and this movement/shape remains constant for a certain period of time. Eventually, an instance of MovingPhenomenon consists of instances of UnitPhenomenon, which comprises sample elements of geo-events. In order to define the coverage functions of moving phenomena, we adopt the data model that is used for representing the variations in an attribute value of continuous spatial fields within a spatial domain in GIS applications. Various data structures for spatial fields are based on the vector and raster model, such as digital elevation models (DEMs), triangular irregular networks (TINs), contour lines, and coverage maps [23]. In this study, we adopt a TIN model that is based on Delaunay triangulation in order to create moving phenomena with geo-event,
18
K.-S. Kim et al.
t
t
y
y
geo-space
geo-space
potential path area space-time path
space-time volume
space-time prism x
(a) Space-time path and prism
x
(b) Space-time volume
Fig. 10. Space-time cube models
although such models have certain disadvantages too. The TIN model has been used for digital terrain modeling in GIS. It divides the domain area of discrete points into a set of non-overlapping triangles in a two-dimension space. On the Web, it is difficult to construct regular grid cells or tessellations to cover the domain space because of variations in the distribution of discrete geo-events, and the Delaunay triangulation is particularly suitable when we do not want to force any constraints on the set of points that are to be connected. We thus extend the TIN model to a three-dimensional space-time using Delaunay tetrahedralization; in this paper, we refer to this model as ST-TIN. In three-dimensional space-time, we produce an ST-TIN over more than four selected geo-events to represent a moving phenomenon, as shown in Figure 9(a). If we can construct a tetrahedron by connecting four geo-events, then we can estimate the keyword vector at an arbitrary location. Figure 9(b) shows an example of a barycentric interpolation inside the tetrahedron [14]. Moreover, the engine can search for the geo-events in a particular moving phenomena of interest on the basis of not only spatiotemporal operations but also semantic relevance. Fundamentally, the proposed moving phenomena are combination of spatial, temporal, and thematic information of various geo-events, and our model enables the handling of such information. 4.3
GeoT3D Visualization Tool
The GeoT3D visualization tool enables users to navigate geo-events and moving phenomena in a three-dimensional space-time domain. The main component is a three-dimensional viewer whose x-axis and y-axis represent geographic space and z-axis (height) represents time. We use a space-time viewer to show the movements/changes through space over time; this is crucial since our model of moving phenomena is based on three dimensional space-time geometries. A three-dimensional space-time cube is often used to represent the movements of objects since H¨ agerstand developed time geography. In time-geography, the space-time path and prism are used to trace the physical movement of an individual object in the space-time cube, as shown in Figure10(a) [16]. The timegeography model contains information about the space and time of the activities
Aggregation and Analysis of Geotime-Tagged Contents on the Web
19
of an object and provides a framework for representing moving objects over various levels of temporal granularity. However, such a model is inappropriate for our framework because our model represents an aggregation object with several individual events. Thus, we employ space-time volumes to represent a moving phenomenon in the space-time cube, as shown in Figure10(b). In addition, a user can select geo-events and create a query region by using the visualization tool. In this study, the shape types are restricted to boxes and tetrahedrons; however, in the future, we plan to include other shapes for representing query regions and moving phenomena with respect to interpolation methods. The query region is passed to the moving phenomenon engine and the geo-events inside the query region are displayed on the cube. We can also access the original contents of geo-events by using the visualization tool. That is, our tool helps users to interactively explore geo-events and moving phenomena.
5
Implementation
We developed and implemented a prototype system by using CGAL [1] for threedimensional Delaunay triangulation, LingPipe [3] for named entity recognition, and PostgreSQL for creating a database. For the experiments, we gathered Web news articles published in 2005, especially those featuring typhoons, because we wanted to compare the movements of a real typhoon and a user-centric typhoon consisting of geo-events generated on the basis of user keywords. A screenshot of the system is shown in Figure 11. As already mentioned, the system comprises a three-dimensional space-time viewer and several control boxes such as create, delete, edit, and select to manipulate moving phenomena. The viewer browses
Controller
3D Viewer
Fig. 11. Screenshot of the moving phenomenon framework with individual geo-events on the space-time cube
20
K.-S. Kim et al.
thematic condition temporal condition spatial condition 3D shape condition
(a) Definition of a moving phenomenon
Geoevent
Moving phenomenon
(b) Mapping the moving phenomenon using tetrahedrons
(c) Browsing semantic information of contents Fig. 12. Screenshot taken during the creation of a moving phenomenon on the basis of user keywords
individual geo-events, which are represented using different colors corresponding to the category of the contents, on the space-time cube. In the experiment, we classified three groups of contents as thematic information.
Aggregation and Analysis of Geotime-Tagged Contents on the Web
21
We first present the method to create a moving phenomenon on the framework. Figure 12 shows an example of a moving phenomenon that is created on the basis of user query keywords. We type the keyword “Typhoon Nabi,” which is a high-intensity typhoon that hit Japan in 2005, and set the time period to define “NABI” phenomenon in the query dialog, as shown in Figure 12(a). The default coverage method used is barycentric interpolation. The system performs Delaunay tetrahedralization to construct the shape of the user-centric moving phenomenon. The red portion in Figure 12(b) depicts “NABI” phenomenon. Moreover, the system displays list view of relevant geo-events and the top-k keywords related to the query keywords (Figure 12(c)). By clicking on an event in the list, a user can access the Web page in which the geo-event is described. Hence, our moving phenomenon framework helps a user to intuitively understand various moving phenomena with spatial, temporal, and thematic information. Figure 13 shows a screenshot taken during event retrieval using a minimum bounding box, which is created from the geo-events occurring in the area of interest specified by the user. While navigating through the geo-events by using
query box
query box
(a) Figures illustrating the changes in the position and size of a query box
(b) Query box showing the results of the geo-events corresponding to the top-k keywords Fig. 13. Screenshot taken during the retrieval of geo-events from the query box generated on the basis of the area of interest specified by the user
22
K.-S. Kim et al.
User-centric typhoon phenomenon
Real typhoon phenomenon
Fig. 14. Comparison of the movement behaviours between real phenomena and usercentric phenomena
the space-time viewer, a user can select the geo-events on the basis of space, time, and theme and create a query box using them to retrieve relevant geoevents to them. During retrieval, the geo-events corresponding t topics that are different from query events are excluded from the results. Further, the user can also restart the search procedure by changing the position or size of the query box. Finally, we compare the actual movement of typhoon “Nabi” and the movement of our moving phenomenon reconstructed from the news articles on the Web. Figure 14 shows a comparison between the movement behavior of the actual typhoon and the constructed moving phenomenon. The red portion corresponds to the user-centric moving phenomenon, and the blue portion corresponds to the trajectory of the typhoon eye constructed by using information form digital typhoon database. Contrary to our expectations, we could not establish any relationship between the two, such as similar shapes; this is because our model is based on a three-dimensional volume, while the current trajectory model is based on a three-dimensional polyline. In addition, we used inaccurate methods for geotime-tagging, term weighting, and searching similar contents. By improving the accuracy of these methods, we can establish a relationship between the two. Furthermore, the coverage function that represent realistic phenomena should be taken into account in a future study.
6
Conclusion
The Geoweb environment produces a large amount of geotime-tagged contents that describe various phenomena occurring in the world, such as facts, events,
Aggregation and Analysis of Geotime-Tagged Contents on the Web
23
activities, or situations. In this study, we developed a spatiotemporal model to combine spatial, temporal, and thematic information about phenomena over the geotime-contents. We defined geo-events to represent primitive information and proposed moving phenomena to aggregate them. In particular, the proposed moving phenomenon is obtained by integrating two models—a vector space model used for information retrieval and a geographic coverage model used in GIS applications. By aggregating geo-events into moving phenomena, we provided continuous movement information and semantic relevance to analyze geotime-tagged contents corresponding to the movement behaviors of phenomena occurring in the world. Furthermore, we presented a simple implementation framework to handle moving phenomena. By using this framework, users can create their own moving phenomena with respect to keywords and regions of interest and navigate through the phenomenon along all the three dimensions— space, time, and theme. In particular, users can intuitively perceive their movement behavior and spatiotemporal relationships as visual geo-events and moving phenomena by using three-dimensional geometries (boxes, tetrahedrons, etc.) in a space-time cube. In the future, we intend discussing the use of intelligent methods for geotimetagging, term-weighting, and searching similar contents to improve the accuracy of retrieval. We also intend extending our framework to support functionalities for creating moving phenomena using various types of coverage functions and using predicates about the distance or spatiotemporal relationships among geoevents. Finally, we shall continue to develop the moving phenomenon database. Acknowledgment. The authors thank Mr. Toru Kamada for his help with the design implementation of the prototype used in this paper.
References 1. 2. 3. 4.
5.
6.
7.
8.
CGAL, Computational Geometry Algorithms Library, http://www.cgal.org GeoNames, http://www.geonames.org Lingpipe, http://alias-i.com/lingpipe/ Amitay, E., Har’El, N., Sivan, R., Soffer, A.: Web-a-where: geotagging web content. In: Proc. of the 27th annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR), pp. 273–280 (2004) Chen, Y.-F., Fabbrizio, G.D., Gibbon, D., Jana, R., Jora, S.: Geotracker: Geospatial and temporal rss navigation. In: Proc. of the International World Wide Web Conference (WWW), pp. 41–50 (2007) Forlizzi, L., G¨ uting, R.H., Nardelli, E., Schneider, M.: A data model and data structures for moving objects databases. In: Proc. of the 2000 ACM SIGMOD international conference on Management of data (SIGMOD), pp. 319–330 (2000) Galton, A., Duckham, M.: What is the region occupied by a set of points? In: Raubal, M., Miller, H.J., Frank, A.U., Goodchild, M.F. (eds.) GIScience 2006. LNCS, vol. 4197, pp. 81–98. Springer, Heidelberg (2006) Goodchild, M.F.: Citizens as sensors: The world of volunteered geography. GeoJournal 69(4), 211–221 (2007)
24
K.-S. Kim et al.
9. Grenon, P., Smith, B.: SNAP and SPAN: Towards dynamic spatial ontology. Spatial Cognition & Computation: An Interdisciplinary Journal 4(1), 69–104 (2004) 10. G¨ uting, R.H., B¨ ohlen, M.H., Erwig, M., Jensen, C.S., Lorentzos, N.A., Schneider, M., Vazirgiannis, M.: A foundation for representing and querying moving objects. ACM Transactions on Database Systems 25(1), 1–42 (2000) 11. Hornsby, K.S., Cole, S.: Modeling moving geospatial objects from an event-based perspective. Transactions in GIS 11(4), 555–573 (2007) 12. ISO/TC211. ISO 19123 Geographic information - schema for overage geometry and functions (2005) 13. Kim, K.-S., Zettsu, K., Kidawara, Y., Kiyoki, Y.: Sticker: Searching and aggregating user-generated contents along with trajectories of moving phenomena. In: Proc. of the 10th International Conference on Mobile Data Management (MDM), pp. 365–366 (2009) 14. Ledoux, H., Gold, C.M.: Modelling three-dimensional geoscientific fields with the voronoi diagram and its dual. International Journal of Geographical Information Science 22(5), 547–574 (2008) 15. Lee, D.L., Chuang, H., Seamons, K.: Document ranking and the vector-space model. IEEE Software 14(2), 67–75 (1997) 16. Miller, H.J.: A measurement theory for time geography. Geographical Analysis 37, 17–45 (2005) 17. Pelekis, N., Theodoulidis, B., Kopanakis, I., Theodoridis, Y.: Literature review of spatio-temporal database models. The Knowledge Engineering Review 19(3), 235–274 (2004) 18. Perry, M., Hakimpour, F., Sheth, A.: Analyzing theme, space and time: An ontology-based approach. In: Proc. of the 14th annual ACM international symposium on Advances in geographic information systems (ACM-GIS), pp. 147–154 (2006) 19. Scharl, A., Tochtermann, K. (eds.): The Geospatial Web: How Geobrowsers, Social Software and the Web 2.0 are Shaping the Network Society. Springer, Heidelberg (2007) 20. Singhal, A.: Modern information retrieval: A brief overview. IEEE Data Engineering Bulletin 24(4), 35–43 (2001) 21. Teitler, B.E., Lieberman, M.D., Panozzo, D., Sankaranarayanan, J., Samet, H., Sperling, J.: Newsstand: a new view on news. In: Proc. of the 16th ACM SIGSPATIAL international conference on Advances in geographic information systems (ACM-GIS), pp. 1–10 (2008) 22. Torniai, C., Battle, S., Cayzer, S.: Sharing, discovering and browsing geotagged pictures on the world wide web. In: The Geospatial Web, pp. 159–170 (2007) 23. Worboys, M., Duckham, M. (eds.): GIS: A Computing Perspective, 2nd edn. CRC Press, Boca Raton (2004) 24. Worboys, M.F.: Event-oriented approaches to geographic phenomena. International Journal of Geographical Information Science 19(1), 1–28 (2005) 25. Yasumura, S., Ikezaki, M., Watanabe, T., Ushiama, T.: Event-centralized management of geographic information collected from blog. In: Apolloni, B., Howlett, R.J., Jain, L. (eds.) KES 2007, Part II. LNCS (LNAI), vol. 4693, pp. 769–776. Springer, Heidelberg (2007)
GeoPW: Towards the Geospatial Processing Web Peng Yue1, Jianya Gong1, Liping Di2, Jie Yuan1, Lizhi Sun1, and Qian Wang1 1
Abstract. The growth of the Web has resulted in the Web-based sharing of distributed geospatial data and computational resources. Web Service technologies, especially those standards-based interoperable geospatial services, make geoprocessing functionalities easily accessible to software programs and users. GeoPW is a set of services aiming to provide a large amount of geoprocessing utilities over the Web. These services are provided by middleware wrapping legacy GIS analysis components to provide the OGC WPS interface. These legacy components include analysis utilities of GIS software systems GRASS and GeoStar. Currently, GeoPW has provided over one hundred services and we are still adding more. Geoprocessing functions provided by these services include GIS vector and raster analysis, network analysis, data processing, geostatistical analysis, and hydrological analysis. The services are open and accessible to the public, and provide building blocks and a solid technological foundation for the Geospatial Processing Web. Keywords: Geoprocessing, Web Processing Service: Geospatial Processing Web, OGC, GIS, Spatial Analysis.
Geoscientists are experiencing a data-rich yet analysis-poor period. Existing Webbased GIS and geospatial service developments have a more focus on data sharing and interoperability. As a result, while we can discover many geospatial data services such as Web Feature Service (WFS), Web Map Service (WMS), and Web Coverage Service (WCS) on the Web, few geoprocessing services are available. In this work, a Web Service toolkit to support geoprocessing over the Web, named GeoPW (http://geopw.whu.edu.cn), has been implemented. It leverages Web Services, legacy GIS software, and OGC specifications to make traditional desktop analysis functions configurable, accessible, and interoperable over the Web. This paper describes design, implementation, and potential of GeoPW. The rest of the paper is organized as follows. Section 2 introduces the background on geoprocessing over the Web and the vision of the Geospatial Processing Web. In particular, the role and potential of GeoPW in the Geospatial Processing Web are identified. Section 3 presents the system architecture of GeoPW and describes its components. In Section 4, implementation details are provided. The work is compared with related work in Section 5, and conclusions and pointers to future work are given in Section 6.
2 Geoprocessing over the Web The term “geoprocessing” has a broader meaning than traditional analysis functionalities in GIS and can refer to any sort of geospatial processing or analysis functionality. For example, it can include computational models such as an Earth system prediction model that is not available in traditional GIS. In the real world applications, a single analysis function is not enough to solve a complex problem. Therefore, geoprocessing models, which compose multiple analysis functionalities together to solve application-specific problems, are needed. The Environmental Systems Research Institute (ESRI) has provided a geoprocessing framework since its release of ArcGIS 9 software product in 2004. A number of analysis functions (or called processes) are included in this framework. In addition to these basic functional analysis units, a ModelBuilder tool is provided to help users visually combine different analysis processes together and generate geoprocessing models (or called process chains). The spatial modeling tool in the ERDAS IMAGINE software has a similar process chaining function. It can combine GIS analysis functions with ERDAS’s commercial image processing functions. Other traditional GIS software systems support common analysis functionalities such as overlay analysis, buffer analysis, network analysis, terrain analysis, and geostatistical analysis. Nevertheless, these legacy GIS software systems cannot support geoprocessing over the Web. The analysis functions and even the process modeling capability can only be used in their own proprietary environments. From the viewpoint of SOA, geoprocessing over the Web should follow the “publish-find-bind” paradigm. The individual geoprocessing services are published into the service registry. The geoprocessing requestor is able to find geoprocessing services through the registry. When appropriate services are located, the requestor can bind and assign geoprocessing tasks. In a distributed environment, a complex geoprocessing task may include services coming from multiple service providers. A geoprocessing model that contains multiple geoprocessing steps will be bound to an executable service chain or concrete workflow.
GeoPW: Towards the Geospatial Processing Web
27
Fig. 1. The framework for Geospatial Processing Web
The Geospatial Processing Web named here refers to a distributed, integrated, and collaborated service-oriented geoscientific research environment enabled by geoprocessing over the Web. It is an information infrastructure where various geoprocessing functions are exposed as services and accessed through standard protocols. These services can be located and integrated intelligently for complex geoprocessing tasks. The Web is evolvable by accommodating new geoprocessing components and models, and maintainable through upgrade or replacement of existing geoprocessing components and models. The implementation can leverage the latest development of distributed computing technologies, Web technologies, and interoperability standards. Figure 1 illustrates a framework and components to support the Geospatial Processing Web. It is a three-layer architecture. The first layer is the legacy GIS platform that manages geospatial data and analysis functions in its own proprietary form. The second layer is the geospatial data, services, and models management platform that provides basic utilities for geoprocessing modeling and applications in the higher layer. Such basic utilities include geospatial data providing, processing, and visualization services. For process chaining, geoprocessing models and workflow management utilities are needed. The search service can be used to locate geoprocessing services or models, and inference engines may be involved for a semantic match when needed. In the third layer, a
28
P. Yue et al.
workbench or portal can be used to help users develop geoprocessing models and applications. The geoprocessing models, after going through a collaborative peer review, can be registered into the model warehouse as a type of knowledge. Since GeoPW provides a set of geoprocessing services, it serves as the building blocks for higher geoprocessing modeling and complex geoprocessing tasks, and is part of fundamental utilities for the implementation of the Geospatial Processing Web.
3 GeoPW: System Design Figure 2 shows the system architecture of GeoPW. The design of GeoPW not only supports the provision of a geoprocessing service module warehouse, but also enables new geoprocessing service development to be easily carried out. A geoprocessing service module warehouse can be provided by GeoPW through wrapping legacy or new geoprocessing functions. The following considerations are introduced when developing geoprocessing services.
Fig. 2. The system architecture of GeoPW
GeoPW: Towards the Geospatial Processing Web
29
3.1 Granularity of Geoprocessing Services The warehouse contains individual geoprocessing Web Services that are selfcontained, self-described, and modular. The service modules can be published, located, and dynamically invoked or chained together to form complex geoprocessing models for producing sophisticated geospatial information products on-demand. The granularity of individual service modules is an important factor affecting the flexibility, applicability, and reusability of service modules in different geospatial models. If the module’s functionality is too small, it needs a lot of modules to construct a complex geospatial model, hence reducing the system performance. If we aggregate too many functions into a service module, the module will not be easily plugged into other geospatial models. The flexibility, applicability, and reusability will be discounted. The International Organization for Standardization (ISO) 19119 geospatial information - service standards defines six categories of geospatial services: geographic human interaction services, geographic model/information management services, geographic workflow/task management services, geographic processing services, geographic communication services, and geographic system management services [3]. The geographic processing services can be further divided into four categories based on their partial modification of geospatial data in these aspects: spatial, thematic, temporal, and metadata. Most geoprocessing services can be classified into these four categories. However, ISO 19119 is a high-level service architecture standard and no fine-grained geoprocessing service taxonomy is provided in this standard currently. In the science keywords provided by the Global Change Master Directory (GCMD)1, a category of service keywords for Earth science is defined hierarchically. GIS functionality is classified as a sub-category in the Data Analysis And Visualization topic. The commercial GIS software system ArcGIS has provided over two hundred geoprocessing tools. These geoprocessing tools2 are grouped into over twenty toolboxes, each of them representing a kind of geoprocessing category. The geoprocessing functions in the open source GIS software system GRASS3 are grouped into vector and raster analysis categories. Over one hundred geoprocessing commands are provided in these two categories. The geoprocessing tools in ArcGIS and commands in GRASS are typical representations of the functional granularity in GIS. Therefore, these fundamental geoprocessing functions in GIS software systems, designed originally to be tailored to users’ preference, can provide a valuable reference for determining the granularity of geoprocessing services. 3.2 Reuse of Existing Components The service modules in GeoPW act as individual building blocks for dynamically constructing complex geoprocessing models. The richness of the service modules will, in certain extent, decides the capability of the in-house processing power of GeoPW system. Through many years development, existing GIS software systems 1
have accumulated various analysis functions and can be reused. Therefore, the key development effort for GeoPW resides in the middleware service layer, focusing on developing various geoprocessing processes that communicate with legacy analysis components through a range of modes including JAVA (Application Programming Interface) API call, Command call, or HTTP call. As shown in Figure 2, the Web application server hosts services with various geoprocessing processes that will be accessed by different clients. The request/response messages for geoprocessing services can be processed initially by either the servlet/JSP container or SOAP4 engine. SOAP is a Web Service standard established by the World Wide Web Consortium (W3C). The payload of the message, i.e. the content specific for geoprocessing services, is further processed by service request/response handlers. The invocation of a geoprocessing task is dealt with by the corresponding geoprocessing process. For example, if geoprocessing services are implemented using JAVA, there will be JAVA classes to implement request/response handlers. And geoprocessing operations will be implemented as JAVA methods. The existing geoprocessing component in a legacy GIS software system is a suite of software package sharing common runtime features (Fig. 2). For example, it has the same geoprocessing module and data management, API style for geoprocessing and data operation such as Dynamic-Link Library (DLL)-based API or Commandbased API, and environment variables settings. Therefore, once a geoprocessing process wrapping a certain analysis function of a software package is implemented, it also provides a skeleton for other geoprocessing processes wrapping the same software package. To avoid the duplication of the same steps for implementing various geoprocessing processes based on the same software package, the design pattern template method can be used [4]. The skeleton for the targeted legacy analysis component is defined, deferring some specific steps to subclasses or configuration files. As shown in Figure 2, GeoPW administrators can create, edit, and deploy their own XML-based configuration files to develop new geoprocessing processes, with no or little knowledge of changing, compiling, or building computer source codes. 3.3 Adherence with Standards The standard implemented in GeoPW allows the system interoperable, reusable, extensible, and evolvable. In an interoperable service environment, the service modules existing in different provider communities can and will be shared. Therefore, a complex geospatial model can be built by service modules in all member systems of the provider communities and can be executed across the member systems. This is the power of the standards and federation. The GeoPW service modules are developed based on the OGC WPS standard that is currently under development. OGC WPS specifies a standard interface and protocol to discover and execute distributed geoprocessing processes. A geoprocessing process could be any sort of GIS analysis functions. The three mandatory operations included in the standard WPS interface are GetCapabilities, DescribeProcess, and Execute [5]. GetCapabilities operation allows a client to request and receive back a service capabilities document that 4
Simple Object Access Protocol (SOAP) Version 1.2, http://www.w3.org/TR/2001/WD-soap12-part0-20011217/
GeoPW: Towards the Geospatial Processing Web
31
describes operations and processes of a specific WPS implementation. DescribeProcess operation allows a client to get detailed information about specific processes such as input and output parameter types. Execute operation allows the client to run a specific process in a WPS server. In addition to the service interface, more specific features are defined including encodings of request/response for process execution, embedded data and metadata in inputs/outputs for process execution, references to Web-accessible data inputs/outputs, long-running processes support, process status information and processing errors report, and storage of process outputs. More details please refer to the OGC WPS document [5]. Like other OGC service specifications, WPS use HTTP protocol, in particular GET and POST operations, to exchange XMLbased request and response messages.
4 Implementation The 52n WPS framework5 is used to develop all geospatial processing services in GeoPW. It runs as a Web application inside the Jakarta Tomcat server. The analysis functions provided by GIS software systems GRASS and GeoStar are wrapped as geoprocessing processes and plugged into the 52n WPS. 4.1 Wrapping GRASS and GeoStar through the 52n WPS Framework Geographic Resources Analysis Support System, commonly referred to as GRASS GIS, is one of the most widely used open source GIS software system. Originally developed by the U.S. Army Construction Engineering Research Laboratories (USACERL), GRASS has evolved into a powerful utility with over 350 programs and tools ranging from vector/raster analysis to data visualization and image processing. GRASS is integrated with GDAL/OGR6 libraries to support an extensive range of raster and vector formats, including OGC-conformal Simple Features7. A distinguished feature of GRASS is that it provides command line syntax for its large geoprocessing functions. GeoStar is a leading enterprise GIS software system in China. It has been developed by the State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing (LIESMARS), Wuhan University (formerly Wuhan Technical University of Surveying and Mapping) since 1992 [6]. The software is a suite of functional components for feature and image data management, processing, and visualization. The spatial analysis component in the current version of GeoStar is a DLL with C++ API running under Windows. The primary issue for implementation here is to bridge the legacy GIS analysis components and OGC WPS interface and protocol. The 52n WPS framework is a JAVA Web application. It provides servlets and general request and response handlers to process HTTP GET/POST request for WPS GetCapabilities, DescribeProcess, and Execute operations. WPS is a generic interface and does not specify any specific processes. Therefore, the implementation of WPS must define the specific 5
Fig. 3. Wrappers for GRASS and GeoStar in the 52n WPS framework
processes that it supports. The 52n WPS framework provides a pluggable framework to allow new processes to be added and easily accessed through OGC WPS interface and protocol. As shown in Figure 3, the framework defines two kinds of repositories. The first one is the Algorithms Repository. An abstract JAVA class AbstractAlgorithm is defined and must be inherited by all implementation classes for corresponding geoprocessing processes. These processes are registered in the algorithms repository through a configure file and loaded into the server during the initialization of a WPS server. Each process has its XML-based process description document and can be retrieved by the DescribeProcess operation. The second one is the Data Handler Repository. The input and output parameters for specific processes may be complex or simple. A simple parameter may describe itself with a simple string or integer, while a complex one needs an XML schema to describe its structure (e.g., a feature collection
GeoPW: Towards the Geospatial Processing Web
33
represented using GML). To generate or parse complex parameter values, the XML generator and parser classes must be registered in the data handler repository and loaded during the initialization of WPS server. Figure 3 illustrates the development of wrappers for GRASS and GeoStar so that legacy analysis functions can be wrapped as new processes in the 52n WPS framework. As mentioned in Section 3.2, the design pattern Template Method is used. An abstract class CMDProcesses is developed to invoke commands and communicate with the execution process of commands for logging status. GRASS offers commands and shell scripts that can be invoked through methods in this class. The use of GRASS commands requires the establishment of the working environment including DATABASE, LOCATION, and MAPSET. Therefore, a sub-class GrassProcesses inherited from CMDProcesses is defined to handle these specific details. Similarly, the class GeostarProcesses deriving from CMDProcesses is developed. As mentioned before, the spatial analysis component of GeoStar is a Windows DLL. Therefore, a Windows Console Application is developed to wrap the DLL as commands. The sequence that the geoprocessing service processes a WPS Execute operation is illustrated in Figure 4.
Fig. 4. Sequence diagram of requests execution
The wrappers provide a way to develop new processes using XML scripting. An XML-based process configuration file describes the sequence of internal command steps that are needed to complete a geoprocessing function defined by a process. For example, a single process using GRASS requires the steps including importing, processing, and exporting data. In addition, the process configuration files describe the data bindings between the command parameters and inputs/outputs of a process description. Therefore, the wrappers can generate executable commands or scripts based on these configuration files and deliver the output to the WPS response. These process configuration files, together with the process description documents and repository registration
34
P. Yue et al.
configuration file, define the processes developed and deployed. The development of new processes can then be simplified as adding new XML configuration files to compose and expose analysis functions. If a bug is detected or a new version of software or an algorithm is available, developers can update the configuration files or GIS software without changing service interfaces, thus eliminating the manipulation of original computer codes and software upgrades by service end users. 4.2 Services Provided Based on the functions provided by GRASS and GeoStar, GeoPW has already developed many geoprocessing services. These services provides geoprocessing functions including GIS vector and raster analysis, network analysis, data processing, geostatistical analysis, and hydrological analysis. We group them into ten categories: Vector Spatial Analysis, Vector Geometry Operation, Vector Network Analysis, Vector Data Processing, Raster Spatial Analysis, Raster Data Processing, Raster Aggregation, Hydrologic Analysis, Geostatistics, and Others. Figure 5 shows the number of services in different categories. 25
20
15
10
5
0
or ct Ve
l ia at Sp
is ys al An
or ct Ve
ry et om Ge
n io at er p O
is ys al An
or ct Ve
or ct Ve
k or tw Ne
ta Da
g in ss ce o Pr
is ys al An
er st Ra
er st Ra
l ia at Sp
ta Da
n s s si ic io ly st at na ti eg A a r t g ic os Ag og Ge r ol te r s d Ra Hy
g in ss ce o Pr
rs he Ot
Fig. 5. Services distribution in different categories of geoprocessing functions
All these services are deployed on the Hewlett-Packard (HP) blade servers (Intel® Xeon® 5110 1.6GHz, 4GB RAM) located at LIESMARS. GRASS software is installed in a server blade with UNIX operation system, and GeoStar is located in another server blade with Microsoft Windows operation system. All services are provided to the public through a single Web access point on one server blade that is public accessible. This Web entry point is implemented using servlets to redirect the request to processes deployed in internal server blades. Currently, services in GeoPW has wrapped most vector and raster analysis functions of GRASS, and demonstrated
GeoPW: Towards the Geospatial Processing Web
35
the wrapper for GeoStar through exposing its spatial buffer and overlay analysis functions. More processes are being developed, and after passing tests, will be deployed into distributed internal server blades. The latest news about GeoPW and all developed geoprocessing services are available at http://geopw.whu.edu.cn (Figure 6). The process description and one testing example for each process are provided through Web page links. There is a link on the home page that provides a guide for GeoPW. The guide provides Web addresses for geoprocessing services and process descriptions and testing examples for analysis functions provides by both GeoStar and GRASS. Information on how to invoke these services through HTML Form and compose services using Oracle BPEL software is also available on the guide page.
Fig. 6. Web access entry of GeoPW
4.3 Result Analysis The effectiveness of our approach is demonstrated through developing over one hundred geoprocessing services successfully based on GRASS and GeoStar. Both vector and raster analysis services are developed. In addition, customized geoprocessing services through composing internal commands can also be provided. Although GML has been widely used as the geospatial data encoding format over the Web, we add the support to the popular GIS file format such as ESRI Shapefile, since these file formats are common for geospatial data owned by users and therefore, practical and
36
P. Yue et al.
ready-to-use. As demonstrated in [7, 8], these standards-based services can be semantically described, published, discovered, and chained to support the Geospatial Processing Web mentioned in Section 2. Through the research and implementation experiment, we also identified several issues that are important for the practical use of geoprocessing services. (1) Long-time running process: A geoprocessing process may take a long time to execute, especially when such a processing is data-intensive or computing-intensive. For the data-intensive processing, although we install a FTP server for purpose of temporary dataset uploading and downloading, it is better to deploy GeoPW at a datarich environment, possibly data pools in data archiving centers, thus upgrading data archiving centers to data-rich learning and research centers for remote end users. For computing-intensive processing, asynchronous notification should be supported by WPS implementations. (2) Standardization of specific process descriptions: As mentioned before, WPS is a generic interface and does not specify the process descriptions of specific processes. The variety of process descriptions for different geoprocessing functions, such as vector, raster, text input data, or multiple output data files for a given process, should be regulated and captured semantically. For example, considering the buffer and overlay processes in GeoPW, services developed based on GRASS and GeoStar can share the same process descriptions. Semantic Web approaches or WPS profiles [5] provides prospect towards solutions for this issue. (3) Intellectual property: The scientists should be credited for their contributions of geospatial data products, algorithms, models, and services. These contributions need to be peer reviewed, rated, or honored in the open Web environment through some specifications. Intellectual property should be addressed so that scientists have incentives and feel comfortable to make contributions [1]. Only in this way geoprocessing services on the Web could be contributed by more volunteered persons or organizations. This community-involved, open, and accumulated approach will help drive our geoscience research environment into a cyber research service network.
5 Related Work The emergence of geoprocessing over the Web has made WPS specification and its related applications active research projects in recent years [9, 10, 11]. OGC has approved version 1.0 of WPS as a standard in 2008. Some open source implementations of WPS such as 52n WPS8, Python Web Processing Service (PyWPS)9, and Deegree Web Processing Service (WPS)10 exist and provide frameworks and examples for geoprocessing services. Michaelis and Ames [12] discuss implementation considerations for WPS and suggest some improvements. Our work focuses on providing a number of geoprocessing services as building blocks for geoprocessing over the Web, with a more emphasis on how to wrap legacy GIS components. There are some work focusing on integration and chaining of geoprocessing services [13] [14]. Stollberg and Zipf [13] propose several ways in WPS to support service 8
chaining. Brauner and Schäffer [14] demonstrate the use of BPEL [15] to chaining WPS services. Our previous work [7][8] also demonstrates the applicability of OWL-S11 and BPEL for chaining of WPS services. Therefore, the geoprocessing services provided in GeoPW can be chained with other standard-compliant services based on these approaches. GRASS has been used in scientific workflow systems such as the Kepler workflow system [16]. Providing geoprocessing services over the Web has been an active research effort in the GeoBrain project [17] since 2004. GRASS was encapsulated as W3C SOAP-based Web services in GeoBrain since OGC WPS specification was under initial discussion at the time to develop those services. Currently, OGC Web Services are not equivalent to the W3C SOAP-based Web Services [18]. Most of OGC Web Service implementations provide access via HTTP GET, HTTP POST and do not support SOAP. Providing WPS processes through GRASS is now a promising approach. PyWPS and 52n WPS [14] demonstrate the WPS interface for GRASS. From this aspect, our work share similarity with their work. In particular, we provide a flexible way to develop processes based on commands and provide a number of processes covering a large amount of analysis functions of GRASS instead of only several examples.
6 Conclusions Geoprocessing over the Web brings the demand for geoprocessing services. This paper describes the design and implementation of GeoPW. Over one hundred geoprocessing services are available now in GeoPW for geoscientific research. These services provide fundamental building blocks for the Geospatial Processing Web. The experience for developing processes helps guide the semantic description, discovery, and chaining of geoprocessing services. We are also adding more services and incorporating our previous work [7, 8] into GeoPW to support service discovery, chaining, and geoprocessing modeling, thus enabling GeoPW to be involved into a system embracing full aspects of the Geospatial Processing Web.
Acknowledgements This work was funded fully or partially by Project 40801153 supported by NSFC, 2009 ISPRS Scientific Initiative Project, 863 Program of China (2007AA120501, 2007AA12Z214), LIESMARS and SKLSE (Wuhan University) Special Research Fundings.
References 1. Foster, I.: Service-oriented science. Science 308(5723), 814–817 (2005) 2. Clery, D., Voss, D.: All for one and one for all. Science 308(5723), 809 (2005) 3. ISO/TC 211: ISO19119:2005, Geographic Information – Services (2005) 11
http://www.daml.org/services/owl-s
38
P. Yue et al.
4. Gamma, E., Helm, R., Johnson, R., Vlissides, J.: Design Patterns: Elements of Reusable Object-oriented Software, p. 325. Addison-Wesley Longman, Inc., Amsterdam (1995) 5. Schut, P.: OpengGIS® web processing service, Version 1.0.0, OGC 05-007r7, Open Geospatial Consortium, Inc., p. 87 (2007) 6. Li, D., Gong, J., Zhu, Q., Zhu, X.: GeoStar-—A China Made GIS Software for Digital Earth. In: Proceedings of the International Symposium on Digital Earth, Beijing, China, November 29-December 2, 1999, pp. 483–488 (1999) 7. Yue, P., Gong, J., Di, L.: Automatic Transformation from Semantic Description to Syntactic Specification for Geoprocessing Service Chains. In: Bertolotto, M., Ray, C., Li, X. (eds.) W2GIS 2008. LNCS, vol. 5373, pp. 50–62. Springer, Heidelberg (2008) 8. Yue, P., Di, L., Yang, W., Yu, G., Zhao, P., Gong, J.: Semantic web services based process planning for earth science applications. International Journal of Geographical Information Science 23(9), 1139–1163 (2009) 9. Kiehle, C.: Business logic for geoprocessing of distributed geodata. Computers and Geosciences 32(10), 1746–1757 (2006) 10. Foerster, T., Schäffer, B.: A client for distributed geoprocessing on the Web. In: Ware, J.M., Taylor, G.E. (eds.) W2GIS 2007. LNCS, vol. 4857, pp. 252–263. Springer, Heidelberg (2007) 11. Díaz, L., Costa, S., Granell, C., Gould, M.: Migrating geoprocessing routines to web services for water resource management applications. In: Proceedings of the 10th AGILE Conference on Geographic Information Science, Aalborg, Denmark, pp. 1–9 (2007) 12. Michaelis, C.D., Ames, D.P.: Evaluation and implementation of the ogc web processing service for use in client-side gis. Geoinformatica 13, 109–120 (2009) 13. Stollberg, B., Zipf, A.: OGC web processing service interface for web service orchestration - Aggregating geoprocessing services in a bomb threat scenario. In: Ware, J.M., Taylor, G.E. (eds.) W2GIS 2007. LNCS, vol. 4857, pp. 239–251. Springer, Heidelberg (2007) 14. Brauner, J., Schäffer, B.: Integration of GRASS functionality in web based SDI service chains. In: Proceedings of the FOSS4G 2008, Cape Town, South Africa, September 29October 4, p. 10 (2008) 15. OASIS: Web services business process execution language, version 2.0. Web Services Business Process Execution Language (WSBPEL) Technical Committee (TC), p. 264 (2007) 16. Zhang, J., Pennington, D.D., Michener, W.K.: Automatic transformation from geospatial conceptual workflow to executable workflow using GRASS GIS command line modules in Kepler. In: Alexandrov, V.N., van Albada, G.D., Sloot, P.M.A., Dongarra, J. (eds.) ICCS 2006. LNCS, vol. 3993, pp. 912–919. Springer, Heidelberg (2006) 17. Di, L.: GeoBrain-A Web Services based Geospatial Knowledge Building System. In: Proceedings of NASA Earth Science Technology Conference 2004, Palo Alto, CA, USA, June 22-24, p. 8 (2004) 18. Tu, S., Abdelguerfi, M.: Web services for geographic information systems. IEEE Internet Computing 10(5), 13–15 (2006)
Towards an Integration of Space and Accessibility in Web Personalization Mohamed Ramzi Haddad1, Hajer Baazaoui1 , Marie Aude Aufaure2,3, Christophe Claramunt4, Yves Lechevallier2, and Henda Ben Ghezala1 1
Riadi-Gdl Laboratory, Riadi-Gdl Laboratory, ENSI Tunis, Campus la Manouba, La Manouba, 2010, Tunisia [email protected], [email protected], [email protected] 2 INRIA-Rocquencourt, INRIA-Rocquencourt, Domaine de Voluceau. 78 153 Le Chesnay Cedex, France [email protected] 3 MAS Loboratory, Ecole Centrale Paris, MAS Laboratory. Chaire SAP BusinessObjects Grande Voie des Vignes 92 295 Chatenay-Malabry, France [email protected] 4 Naval Academy Research Institute, Naval Academy Research Institute, Lanvoc-Poulmic, BP 600, Brest naval, France [email protected]
Abstract. Web personalization can be seen as an interdisciplinary domain that facilitates interaction between web content and user needs. One of the peculiarities of Web information is that a significant part of the data is georeferenced although this is not completely taken into account by current search and personalization engines. This paper introduces a spatial personalization approach based on a user modeling technique and a measure of spatial accessibility. We develop a personalized accessibility measure whose objective is to predict and evaluate location relevancy, accessibility and associations at the user level. This measure favors delivery of location-based and personalized recommendations.
1 Introduction The main objective of personalization systems is to collect users’ preferences in order to better tailor services and information to their needs. The principal task of a personalization process is to model the user and/or her/his context to analyze data from an individual point of view, predict content relevancy and facilitate data exploration, visualization and filtering. When manipulating spatial data, an effective personalization approach should take into account not only spatial data properties (i.e., location attributes and characteristics), but also existing spatial constraints and/or associations derived from the user’s spatial context. Personalization is closely related to a set of concepts to be taken into consideration such as user preferences, constraints and needs. A user model can be considered as a necessary condition to make a bridge between the user’s point of view, and relevancy J.D. Carswell et al. (Eds.): W2GIS 2009, LNCS 5886, pp. 39–55, 2009. c Springer-Verlag Berlin Heidelberg 2009
40
M.R. Haddad et al.
of a Web personalization process. A user model should be derived from the information semantics, and in the context of location-based information, from the spatial distribution of the data which is likely to have a significant influence on user choices. For instance, several spatial criteria such as proximity and mobility constraints have an important role when planning a trip, or choosing a possible destination [1][2]. It appears clearly that integration of spatial measures in personalization approaches can improve information system usability, and facilitates spatial data exploration and visualization on the Web. This implies to filter the content of information deliveries using spatial properties and associations with respect to some user constraints. In particuler, the notion of accessibility is an important paramater to consider when designing services closely related to spatial mobility. Accessibility can be defined as the simplicity with which activities in the society can be reached, including needs of citizens as trade, industries and public services. Accessibility is also used to study the structural and functional properties of space [3]. Accessibility encompasses the notions of distance, cost of travel and interactions between locations and the structural properties of space. Accessibility measures are key elements to understand the way space is used, and the way users interact in space and time. These can be used to assess the impact of the locations distribution (e.g., services, shops etc) on users’ movements. Accessibility measures can be used to determine travel patterns, and then human behaviors in space and time. When providing location-based or travel-oriented services, the notion of context is preferably determined by the location of the user. Representing space and relationships between a user and nearby location-based data should be ideally predicted by the system. Such a prediction should be personalized in order to extract relevant information at the user scale. But current spatial and accessibility measures consider that the spatial distribution of location-based data has the same effect on users’ displacements, which is not the case as each user has some specific constraints, wishes and objectives. The spatial patterns determined by conventional accessibility measures are often global patterns of land use, which are implicitly the result of an aggregation of some structural anf functional properties, and land use micro-patterns. Consequently, spatial measures such as accessibility have to be personalized in order to reflect the user perception of its environment, and to extract personal knowledge at the user level. We believe that a close integration of personalization approaches and spatial analysis techniques may improve the quality and the relevancy of the delivered information, when considering location based services on the Web. This might also lead to discover and categorize individual user knowledge, this favoring the design of Web information systems. The remainder of the paper is organized as follows. Section 2 briefly introduces related work oriented to the measure of accessibility and personalization. Section 3 develops the personalization approach, the user modeling and semantic relevancy evaluation technique. Section 4 details the spatial analysis component and the proposed user-oriented accessibility measure that integrates the concept of personalization. Section 5 presents some experimental research results. Finally Section 6 concludes the paper and draws some research perspectives.
Towards an Integration of Space and Accessibility in Web Personalization
41
2 Related Work When considering spatial information on the Web, personalization can be categorised into two complementary approaches oriented to (1) the generation of a collection of information describing some user interests and (2) inference and filtering of some spatial information in order to deliver the closest content to the user expectations. The latter is used to determinate the most accessible and relevant locations, that is, the ones that are the more likely to be visited or searched by the user. This outlines the close interaction that should be modeled between the concept of personalization and the one of spatial accessibility. 2.1 Spatial Accessibility and Personalization Accessibility measures are used for the study and interpretation of the way space is structured and used. Accessibility measures allow to evaluate, quantify and visualize the impact of locations distribution (e.g., services, facilities etc. . . ) on the users’ navigational and traveling behaviors, and by identifying the most accessible destinations. These measures are of particular interest in the context of geographical information systems delivering spatial related services such as location based services and travel planning assistance. Accessibility can contribute to model and predict users travel patterns, and consequently to recommend potential destinations among the most accessible to the users. Accessibility measures so far developed generally encompass different points of view as space can be interpreted in different ways. As research objectives are varied, current approaches are influenced by their application purpose (e.g., urban planning, economy, social studies). For instance, accessibility can be evaluated at the location level [4][5][2]; meanwhile, Pirie [6] and Kwan [7] introduced the notion of individual accessibility that takes into account the user’s point of view. While also stating that the concept of accessibility depends on the scope and objectives of the study, Handy and Niemeyer introduced two concepts to consider when defining an accessibility measure [2]. The first one is the concept of destination attractiveness which is the factor that motivates a given person to visit a destination. The second concept is the destination travel impedance which is the repellent factor since distant destinations are less likely to be visited. The travel impedance is measured by several criteria such as distance and travel time, and by surveys where distance is apprehended as perceived as the users, thus taking into account the cognitive dimension. Functional or economicalbased properties such as displacement costs can enrich accessibility approaches [2]. However, impedance is often considered as a static and user-independent measure, that is, an accessibility opportunity with respect to a given destination. It is usually influenced by role of the location, that is, its importance with respect to the phenomenon studied. Attractiveness in this context is not personalized since it is evaluated using user-independent measures and statistics. This is where and why the concept of attractiveness can be revisited and studied with respect to user preferences and expectations. The simplest accessibility measures are distance-based measures. These are based on the assumption that the closer the location is, the more it is accessible. This type of measure evaluates average or weighted location distances using different Euclidean or
42
M.R. Haddad et al.
network based distances. In the simplest modelling approach, destinations are considered as equally attractive. Oberg [8], and Wachs and Kumagai [9] introduced cumulativeopportunity measures. The accessibility is here defined as the number or the proportion of realizable destinations within a certain travel impedance. Destinations are equally weighted without considering the concept of attractiveness, this being hardly the usual and practical case as observed by Black et Conroy works [10]. Gravity-based measures take into account the way the notion of accessibility often behaves and is evaluated in space [11]. Gravity-based measures are based on the assumption that locations have a certain amount of attractiveness with respect to their neighborhood. Place accessibility is evaluated by the measure of attractiveness attenuated by an impedance-based function. Geertman and van Eck [4] and Sonesson [12] proposed a gravity accessibility measure derived from a gravity model of trip distribution. For instance, attraction has been evaluated by exponential or inverse power functions [2] [13] [5]. Ingram [14] proposed a Gaussian formulation, the advantage being to have a slow decline close to the origin, and still a relatively slow decline at a greater distance when compared with negative exponential or inverse power functions. The closest type of accessibility measures to the personalization field is the individual accessibility. Accessibility is evaluated with respect to the user needs and resources. Kwan [7] and Pirie [6] considered accessibility as evolutive, and a function of the user experience, context and constraints [15]. The modeling of user interactions with spatial data on the Web has been recently considered. In [16], the authors presented an interactive system design framework for spatial personalization on the Web where the spatial dimension of the environment is integrated within a user-centred design approach. The notion of context is specified at the user, geographical and interface levels, the objective being to categorise similar behaviours and to derive user groups of similar interests. However, the approach is not directly centered towards the user and her/his knowledge and preferences. In [17], the authors proposed an implicit profiling approach at the interface level by tracking users’ mouse movements as they interact with spatial data on the Web. The approach is essentially constrained by space with no particular integration of some semantic properties of the data considered or the specific knowledge and experience of the users. 2.2 Discussion It appears that in most personalization approaches, the spatial aspect is not fully taken into consideration. However, personalization approaches should consider spatial measures to model spatial constraints and factors that influence user’s navigation process, and the search for a ”best” destination when the purpose is a navigation task (i.e., a displacement). When considering the search on the Web for a navigation process, and while filtering spatial information, users need to visualize not only the semantic relevancy of the items of interest, but also their spatial associations and characteristics in order to evaluate trip relevancy, and decide if these meet their current spatial constraints. The spatial aspect of the data is as important as its semantic one while evaluating the relevancy of the task. Accessibility should be also personalized since the spatial characteristics of spatial entities are evaluated differently by different users. Usual accessibility measures, when applied to navigation modeling, derive some general knowledge and land use patterns. Meanwhile, their interest for the evaluation
Towards an Integration of Space and Accessibility in Web Personalization
43
of users navigational patterns is mainly an abstract notion if not related to user background and objectives. Accessibility should be here the result of individual knowledge and experiences accumulation. While interacting in space, individuals enrich their experience and re-adapt their point of view on accessibility, depending on the evolution of their knowledge of the environment and their own needs. For example, two users in a same context may have different evaluations of travel impedances and/or some locations attractiveness, this consequently generating different decisions and behaviours. Moreover, even similar users in similar contexts may evaluate differently convergent navigation criteria. The work presented in this paper aims to take into account the factors that influence individual choices of destinations during a navigation, and particularly relevancy and spatial accessibility. We introduce a personalized accessibility measure whose objective is to identify the most relevant and accessible destinations for a given geo-located user. This approach can be used to study individual accessibility in a given space, or integrated into information systems offering personalized services related to mobility, such as travel planning or navigation assistance. It offers the possibility to visualize spatial relevancy and accessibility using conventional GIS and mapping tools.
3 User Interests Model One of the requirements of a personalization process is to determine user needs efficiently. Personalization approaches categorize information items into spatial categories (e.g., hotels, restaurants) and then attempt to predict the most relevant ones with respect to a given user. This is not completely efficient as there is no warranty that the recommended entities satisfy the user needs, and this because of the different characteristics of the items even if they belong to the same category of information. In fact, categorisation of aspatial and spatial concepts and attributes can contribute to a better representation of the data semantics. A spatial entity can be informally defined as an instantiation of a spatial category or class described by a set of symbolic values. This leads to model spatial categories but also towards all possible attribute characteristics within each category. Our modeling approach is preliminary oriented towards the design of a user model that starts by identifying her/his profile, taking into consideration her/his interests. This information is derived by identifying and evaluating user’s interest when visiting some locations on the Web. The concept of implicit information gathering is as follows: when the user interacts with an information item on the Web, she/he is implicitly voting for it. Based on this assumption, we introduce in the following section several measures of interest to quantify the user interests by browsing and analyzing all her/his navigation history for a long term profile, or only its current navigation session in the case of short term personalization. 3.1 User Interests Evaluation The user profile, in this context, is used to describe the user interests towards some visited items. Visits frequency and durations are two implicit interest indicators that are
44
M.R. Haddad et al.
used to predict the user interest during navigation without asking her/him to express it. Explicit items rating is also used since it gives the user the opportunity to evaluate the items’ relevancy. These parameters are defined as follows: – Entity visits frequency: As the user is surfing the Web and some informational content, the data gathering process counts the number of ”visits” to a given entity as this reflects her/his interest to it, and to similar entities. In fact the number of visits to each item can be used to determine the most interesting ones. – Entity visits duration: The visit duration is a significant and self-explanatory parameter as users spend more time reading information when perceived as relevant [18]. This will be approximated as the time that separates access to a given entity to the next action of the user, this being evaluated at the Web interface level. – Explicit entities rating: A user behavior does not always reflect her/his underlying interests. Taking into account ratings is important when a user spends a lot of time visiting an entity that is not of interest or when she/he visits frequently an item that she/he is not planning to visit. Rating an entity should reduce the influence of the duration and the frequency of those visits. Thus, the quality of preferences and interests elicitation increases when the user evaluates explicitly the visited entities. 3.2 Entities Interest The main measures used to derive the user interest towards a visited entity are the frequency and duration of the visits, and explicit entities rating when available. Tracking theses measures requires saving all user transactions and their duration. If the user gives an explicit evaluation on a visited entity, this is considered in the measure in order to refine the user interests evaluation. The interest degree Ie of a user U towards a visited entity x, considering only the semantic aspect of x, is given by 1 dv (U, x) nv (U, x) Ie (U, x) = ∗ + ∗ V (U, x) (1) 2 Dv (U) Nv (U) Where dv (U, x) and nv (U, x) denote the duration and the number of the user visits to the entity x in the considered period of study, respectively. Nv (U) and Dv (U) refer respectively to the total number of user visits and their duration. V (U, x) is the average rate assigned by the user U to x. This rate is bounded by the unit interval between 0 (not interesting) to 1 (very interesting). If the user does not rate that item, it is valued as 0.5. Thus, the more often an entity is rated and visited in term of frequency and duration, the more it is considered as interesting to the user. We consider the time and frequency of the user visits as equally weighted. Further studies on the user implicit feedbacks should be able to quantify their reliability and weights when reflecting user interests. 3.3 Categories Interest In order to classify spatial categories according to the user interest, we measure the user interest towards spatial entities that belong to a considered category. The notion of
Towards an Integration of Space and Accessibility in Web Personalization
45
relevancy is derived with consideration of the degree of interest she/he has shown towards the entities of this category. Thus, the more interesting entities a spatial category contains, the more relevant it is. Let us consider C a spatial category and xi one of the n items visited by the user U. We consider X(C) as the set of the items belonging to the category C. The normalized degree of interest of the user U towards this category, noted Ic (U,C) is Ic (U,C) =
∑xi ∈X(C) Ie (U, xi ) ∑ni=1 Ie (U, xi )
(2)
3.4 Entity Values Interest After classifying categories by degrees of interest, relevant entities should be ordered according to their attribute values. This is done by analogy to the evaluation of category interests, that is, the degree of interest to a given attribute value is calculated according to the interest shown to the entities that have a similar value. Considering the attribute/value model of the data, we introduced a set of similarity/dissimilarity measures in order to quantify the difference of attribute values according to their types (e.g., numeric, enumeration, set). A similarity measure between two given entities is valued according to an aggregation of the similarity degrees of their attribute values. For example, the user interest towards a value of 50 euros for the room price of an hotel is derived from the user interests towards hotels having similar room prices. This implies to give a more formal definition of a semantic similarity measure. Let x and y be two entities of a same category, γFi a predefined importance coefficient of the attribute Fi . n is the number of attributes describing x and y depending on the category to which they belong. αi and βi denote the values of the attribute F( i) shown by x and y. The semantic distance between them is given by n
Dsem (x, y) = ∑ γFi ∗ Dsem (αi , βi )
(3)
i=1
Let us consider Fi as an attribute of the category C and vk as one possible value of Fi . Sim(Fi , vk ) is the set of values of Fi that are similar to vk . X(Fi , Sim(Fi , vk )) is the set of all entities visited by the user U and that have a similar value to vk related to the attribute Fi depending on its predefined similarity threshold. The interest of the user U to the value vk is given by I f (U, Fi , vk ) =
∑
Ie (U, x j )
(4)
x j∈X(Fi ,Sim(Fi ,vk ))
The next step is to normalize this value to be independent of the number of elements. Let V (Fi ) be the set of all the possible values of the attribute Fi . The user’s degree of interest on the value vk of the attribute Fi is Iv (U, Fi , vk ) =
I f (U, Fi , vk ) ∑v j ∈V (Fi ) I f (U, Fi , v j )
(5)
46
M.R. Haddad et al.
3.5 Interest Prediction User preferences reflect degrees of interest to some spatial categories and semantic values. We assume that the higher the degree of interest of a user to a given category is, the more likely the entities of this category will be visited. This applies by analogy to the level of an entity whose likeliness to be visited will increase with the values of its calculated degrees of interest. We also consider that category attributes have different importance degrees noted αFi in order to reflect the impact of these attributed on user choices. αFi can be predefined and personalized by the user. Therefore, the degree of user satisfaction for an entity x belonging to the category C is defined as follows n
S(U, x) = Ic (U,C) ∗ ∑ αFi ∗ Iv (U, Fi , v)
(6)
i=1
Where n denotes the number of the category attributes, v the value of the attribute Fi presented by the entity x and Iv (U, Fi , v) is the degree of interest of the user U towards the value v. Ic (U,C) refers to the interest degree of the user U towards category C.
4 Personalized Accessibility Measure When combined, the concepts of attraction and impedance should reflect the spatial and semantic aspects of some accessibility information available on the Web. Indeed, this should provide a support to model the notion of user context. This can be used to filter the available amount of information to extract a relevant content. This section first identifies the concepts and the basic ideas that support our approach. The proposed measure of accessibility and its experimental results are then presented.
4.1 Modeling Principles The personalization approach quantifies the interest of a given user towards some visited entities on the Web, and predicts her/his interest to the others according to some similarity measures. This allows identification of the locations that may be of interest for a given user. The values of interest are considered as the values of attractiveness. Attractiveness is considered as a user-oriented concept unlike conventional accessibility measures which are static and user independent. This measure depends on the user behavior and experiences. Regarding the concept of impedance, it is considered as a combination of several factors related to traveling activity, and not just as a usual impedance such as travel distance or travel time [2]. This measure can be also customized according to user preferences in order to obtain a closer prediction to her/his personal evaluation of the travel impedance. By analogy to conventional accessibility measures, attraction of a location in a given point is weakened according to the impedance of the travel distance, and modeled by a Gaussian function.
Towards an Integration of Space and Accessibility in Web Personalization
47
4.2 Accessibility Measure Let A(xi ) be the accessibility value of a spatial entity xi , Pk (xi ) its attractiveness value calculated by predicting the degree of interest of the user k towards it. The parameter βk is related to the variance of the Gaussian function; it defines its full width at half maximum. The full width of the Gaussian function is considered as the neighborhood around the origin. It represents the personal parameter of the travel limit of the user k. Let V (xi ) be the set of items located in the immediate neighborhood of xi and Ii j the impedance value of the travel from xi to x j . As denoted in [2] integration of several factors within an impedance measure increase its value. The proposed accessibility measure retains the spatial distance as an impedance measure (while we plan in further work to consider additional impedance factors). The accessibility measure is then based on a generic impedance term as follows 2
A(xi ) =
∑
Pk (x j ) ∗ e
I − i2j β k
(7)
x j inV (xi )
Considering βk as the permissiveness to the cost of travel, the higher this value is, the more ”costly” navigations are. This gives the user a better control of the personalization and accessibility evaluation process by specifying the navigation limit constraint related to her/his current navigation. Integration of the permissiveness factor is necessary to handle the cases where the user, in the same context and with the same needs, can have different travel limit constraints. The permissiveness factor role here is to determinate the user’s neighborhood since it defines the width of the Gaussian curve. The use of the permissiveness parameter make the Gaussian formulation more personalized than the classic ones, since it considers the user’s personal travel constraints (e.g., distance, cost, etc).
5 Experimentations This section introduces the experimental data and the developed prototype used for validatation purpose. 5.1 Experimental Data The experimental data represent several tourist locations in an administrative district named Nievre in the center of France. This data was used in an online information system, whose main services are trip planning. We considered a set of spatial entities (i.e., 700 location points). Each entity is characterized by its spatial position, a set of metadata describing the semantics of the domain (i.e., attributes, features) and the spatial category to which it belongs. Moreover, we considered a set of users (know or unknown users) characterized by a set of visits to some given spatial entities. These users have been experimentally selected from a panel of 20 undergraduate students who have been asked to interact with the interface as potential tourists whose objective will be to make some visits in the Nievre, and then selecting some entities of interest. Each visit, named transaction, is characterized by its beginning time, duration and, when available, an explicit rate expressed by the user depending on her/his degree of interest.
48
M.R. Haddad et al.
5.2 Prototype The prototype is a web based information system presenting several spatial entities that the user can explore and select to build a trip plan. The prototype offers several facilities that allow to describe the different entities identified in order to assist the user while navigating. Spatial category exploration. Figure 1 shows the interface that allows the user to navigate among the entities of a selected category. Entities are presented by groups in the middle of the figure. The user in this case is not identified, thus, one column of recommendations is available (Figure 1 right) relatively to her/his short-term profile of the current navigation session.
Fig. 1. Spatial categories
Visiting an entity. When the user selects an entity to view its detailed description, the interface outlined in the figure 2 arises. This page has two columns of recommendations related to the user’s long-term and short-term profiles as she/he is considered as a known user in this case. In addition to the description of the entity, there is an interface component that allows the user to rate the presented entity and explicitely quantify
Towards an Integration of Space and Accessibility in Web Personalization
49
Fig. 2. Entity description
her/his interest. Finally, the user can continue her/his navigation using the map showing her/his hypothetical location (i.e., the centric circle), recommended entities in relation with the two facets of his profile (black and grey marks) and other irrelevant ones (white marks). Management of travel plan. While reading the description of an entity, the user can add it to her/his travel plan. The latter case is shown in figure 3. Adding a recomanded entity to the travel plan is one of several used personalization-quality values. 5.3 Spatial Accessibility Evaluation The personalization process delivers a set of recommended spatial entities assumed to be the most suitable to the user preferences with respect to the proposed accessibility measure. Therefore, the most accessible entities are recommended to the user since they present the best compromise between semantic and spatial relevancy.
50
M.R. Haddad et al.
Fig. 3. Travel planning
In order to report on the spatial accessibility experimentation, let us consider a random user modeled by the personalization approach. We consider only the user’s long term profile describing her/his long term preferences. In this case, her/his interest measures have shown that the user is mainly interested by several spatial categories like hotels, restaurants, historical sites and sport. Figure 4 shows the distribution of the considered spatial entities and the user interests degrees according to their properties. The most relevant entities, and consequently the most attractive ones, are colored with dark spots in the following figures. These measures of entities relevancy facilitate a visual search of the cities of interest as shown in figure 4.
Towards an Integration of Space and Accessibility in Web Personalization
51
Fig. 4. Locations spatial distribution
Fig. 5. User’s neighbourhood accessibility
Figure 5 and 6 shows the locations attractiveness and accessibility degrees as perceived by the user using the proposed measure. Space is disaggregated into zones, where each zone represents a location neighborhood. The darker the zones are, the more accessible their associated locations are. In figure 5, accessible zones (in grey) are located in the user’s neighbourhood with respect to his/her travel distance constraint. The more permissive the user is, the larger the accessible area around her/him is. In figure 5, accessibility is evaluated with regard to the user position whereas in the following maps, it is evaluated for every location in the space independently of the user location. It is worth noting that the cities of interest identified by the user-oriented place accessibilities generate a sort of corridor in space that is likely to influence the way the considered user is likely to behave in space, or at least to help her/him to organise her/his navigation. The 50% most accessible areas are delimited by the darkest zone as shown in figure 7. Without loss of generality we considered a permissiveness value that is equivalent to a 30KM travel distance limit. Consequently, places that are contained in the most accessible areas are within a 30KM diameter and then respect the user travel-distance
52
M.R. Haddad et al.
Fig. 6. Places accessibility
Fig. 7. Most accessible places (case 1)
constraint. In this case, the most accessible entities, which are recommended to the user, are mainly in towns like Clamecy, Nevers, Saulieu and Saint Benin d’Azy (these cities being well-known historical cities). Figures 8 and 9 show the 50% most accessible locations considering the same user with the same profile state but with different permissiveness values that are respectively equivalent to 15Km and 10Km. The user in those cases has additional spatial constraints related to the travel limit. Consequently, the recommended entities are relatively clustered in order to meet the user displacement capabilities. Here, small and isolated towns like Pouilly-en-Auxois, Dijon and Conse-cours-sur-loire, which were considered as spatially irrelevant in the first use case, may be recommended to the user since they are located in areas that fit her/his limited travel cost permissiveness. This measure is able to identify the regions where there is a high density level of relevant entities. The permissiveness factor has an important role in determining the most accessible regions since, as shown in figures 8 and 9, there may be a huge difference in the given results for different values of permissiveness. This increases the personalization
Towards an Integration of Space and Accessibility in Web Personalization
53
Fig. 8. Most accessible places (case 2)
Fig. 9. Most accessible places (case 3)
degree of the proposed measure taking into account the user preferences, not only at the attractiveness evaluation step, but also when considering the travel related spatial constraints.
6 Conclusion Personalization should facilitate exploration and control of informational content, by providing personalized information to the users, and taking into account their preferences and navigational behaviors. Current personalization approaches consider the semantic constraints of the information, but often neglect the spatial aspect and constraints of geo-referenced information. As almost 80% of the information on the Web is considered as georeferenced, integration of the spatial dimension within personalization approaches is necessary as this has an important impact on the extracted knowledge and results. However, current spatial and location based services delivered on the Web have not fully taken into account users interests, individual experiences and points of
54
M.R. Haddad et al.
view when manipulating and delivering spatial information. We believe that modeling individual knowledge and preferences can provide personalized data that are certainly different from those delivered by usual accessibility measures. The preliminary research presented in this paper introduces a modeling approach whose objective is to formalize and integrate the measure of spatial accessibility and users interests when accessing spatial information on the Web. This approach can be used to design personalized location based services such as travel assistance or trip planning, and taking into account the spatial and the semantic aspects of the available information on the Web. This should be integrated into a more general personalization service implementing several quality measures that can be used to evaluate personalization quality. Further work concerns the evaluation and tuning of the developed prototype by measuring the impact of the personalization approach at the user and application levels, and satisfaction evaluations. We also plan to approximate the complexity and computational time of the different accessibility measures. The optimization process should not only be treated at the implementation level, but also by imposing restrictions on the analysis, or by taking into account different levels of granularity when evaluating accessibility.
References 1. Makri, M.C., Folkesson, C.: Accessibility measures for analyses of land use and traveling with geographical information systems. In: Proceedings of the 2nd KFB-Research Conference, June 1999, pp. 1–17 (1999) 2. Handy, S.L., Niemeier, D.A.: Measuring accessibility: an exploration of issues and alternatives. Environment and Planning A 29(7), 1175–1194 (1997) 3. Batty, M.: A new theory of space syntax. UCL working papers series (75) (2004) 4. Stan, C., Geertman, M., van Eck, J.R.R.: GIS and models of accessibility potential: An application in planning. International Journal of Geographical Information Systems 9(1), 67–80 (1995) 5. Song, S.: Some tests of alternative accessibility measures: A population density approach. Land Economics 4, 474–482 (1996) 6. Pirie, G.H.: Measuring accessibility: a review and proposal. Environment and Planning A 11(3), 299–312 (1979) 7. Kwan, M.: Space-time and integral measures of individual accessibility: A comparative analysis using a point-based framework. Geographical Analysis 30(3), 191–216 (1998) 8. Oberg, S.: Methods of describing physical access to supply points. Lund Studies in Geography, Series B, Human Geography 42, 474–482 (1976) 9. Wachs, M., Kumagai, T.G.: Physical accessibility as a social indicator. Socio-Economic Planning Sciences 7(5), 437–456 (1973) 10. Black, J., Conroy, M.: Accessibility measures and the social evaluation of urban structure. Environment and Planning A 9, 1013–1031 (1977) 11. Burnett, P., Hanson, S.: An analysis of travel as an example of complex human behavior in spatially constrained situations. Transportation Research 16, 87–102 (1982) 12. Sonesson, C.: Evaluations of some exponentially weighted moving average methods. Journal of Applied Statistics 30(10), 1115–1133 (2003) 13. Kwan, M.P., Hong, X.D.: Network-based constraints-oriented choice set formation using GIS. Geographical Systems 5, 139–162 (1998)
Towards an Integration of Space and Accessibility in Web Personalization
55
14. Ingram, D.: The concept of accessibility: A search for an operational form. Regional Studies 5(2), 101–107 (1971) 15. Burnett, P.: Spatial constraints-oriented modelling as an alternative approach to movement, microeconomics theory and urban policy. Urban Geography 1, 53–67 (1980) 16. Petit, M., Claramunt, C., Ray, C., Calvary, G.: A design process for the development of an interactive and adaptive GIS. In: Bertolotto, M., Ray, C., Li, X. (eds.) W2GIS 2008. LNCS, vol. 5373, pp. 96–106. Springer, Heidelberg (2008) 17. Mac Aoidh, E., Bertolotto, M., Wilson, D.: Implicit profiling for contextual reasoning about users’ spatial preferences. In: Proceedings of 2nd Workshop on Case Based Reasoning and Context-Awareness, August 2007. CEUR Workshop Proceedings, vol. 271 (2007) ISSN 1613-0073 18. Kelly, D., Belkin, N.: Reading time, scrolling and interaction: Exploring implicit sources of user preferences for relevant feedback. ACM Special Interest Group on Information Retrieval, 408–409 (2001)
Tiled Vectors: A Method for Vector Transmission over the Web Vyron Antoniou1, Jeremy Morley2, and Mordechai (Muki) Haklay1 1
Department of Civil, Environmental & Geomatic Engineering University College London Gower St., WC1E 6BT, London, UK 2 Centre for Geospatial Science, University of Nottingham University Park, NG7 2RD, Nottingham, UK {v.antoniou,m.haklay}@ucl.ac.uk, [email protected]
Abstract. Transmitting vector data over the Web is a challenging issue. Existing methods for vector delivery over the Web focus on progressive transmission techniques. Long standing problems in the formalization of dynamic generalization and the time needed for complex algorithms does not allow their implementation in real life applications. We propose a new method for data transmission over the Web based on tiles. We show that in client-server architecture the coordination of all involved parts can create an efficient way to transmit vectors. We describe the methodology of how we can implement the successful for raster data, tile-based method to tackle the particularities of vector data transmission. Keywords: Vector data, Geographic Information, Web mapping, AJAX.
Tiled Vectors: A Method for Vector Transmission over the Web
57
Despite the need for mapping applications to be able to host vector data, vector maps suffered from setbacks that prevented wide implementation. These problems stem from the voluminous nature of vector data and range from limitations inherent in each of the formats introduced to intrinsic disadvantages of vector encoding (such as on-the-fly generalization and efficient transmission over the Web). In order to tackle the latter issue, many efforts introduced are trying to imitate, with limited success in real-life applications, the successful progressive raster data transmission methods to the vector transmission problem. This paper is presenting a new method for XML and text-encoded vector data transmission over the Web. Instead of creating a technique for progressive transmission, we suggest that sending asynchronously tiled parts of vector data from the server to the client can help to build interactive Web mapping applications. The tile-based method for raster data delivery has been successfully implemented by major mapping providers like Google, Yahoo! and Microsoft. In fact, the explosion of mapping applications on the Web and the consecutive phenomena of map mash-ups, neogeography [8] and VGI [9] have been based on the efficiency and easiness of raster data delivery using the tile-based technique. In this paper we present a tile-based method for data transmission, tailored to the particularities of vector data. The document is structured as follows: in Section 2 we present related work on the subject. The proposed methodology is explained in Section 3 followed by performance evaluation in Section 4. We discuss our findings and give indications for our future work in Section 5.
2 Related Work 2.1 Raster Data Transmission Many progressive transmission techniques for raster data have been introduced. [2], [10] and [3] offer reviews of those techniques and describe in brief their main characteristics. In general, highly sophisticated compression algorithms and interleaving transmission methods made raster formats suitable for data delivery over the Web. The efficiency of these methods enabled the Geographic Information (GI) community to build the majority of Web mapping applications using raster data and thus easily deliver spatial information to the users. The evolution of the Web and the pursuit of enhanced responsiveness and increased usability for Web applications lead to the development of new programming techniques like AJAX (Asynchronous Javascript and XML) and a new method of raster data transmission that uses tiled raster images and different levels of detail (LoD). According to this method the highest LoD of the mapping area is divided into a number of quadrants (tiles). Each of these quadrants is further sub-divided into new tiles that form the next LoD. This process continues until the lowest LoD is reached. Although this can lead to a huge number of tiles for a detailed or large dataset, the storage, indexing and handling of raster files is straightforward, especially as data storage becomes cheaper. When a map of a given LoD is requested a number of tiles are sent to the user. The tiles are loaded into the Web browser window as a matrix, and from the user’s perspective it seems to be continuous image. For any consequent request such as pan,
58
V. Antoniou, J. Morley, and M. (Muki) Haklay
zoom or managing layers a new request is made by the application that runs on the client’s Web browser and the server transmits to the client only the tiles that are needed in addition to the ones currently in the client’s cache. This method makes the application considerably faster and more responsive since the use of AJAX techniques reduces the application’s response time by communicating with the server without the user actually noticing it. 2.2 Vector Data Transmission The volume of the vector data and the difficulty of transmitting it over the Web has been a long standing problem for Web mapping and Web GIS. The success of the progressive transmission methods for raster data turned the focus of research towards the development of similar techniques, tailored to vector encoding. Efficient methods of progressive transmission have been introduced for a particular case of vector data: triangular meshes. Triangular meshes are usually used to describe digital terrain models or the surface of 3d objects. On the contrary, progressive transmission of cartographic vector data over the Web remains problematic despite numerous efforts (see [2], [3] and [11] for a review). According to progressive transmission methods a coarser map version is sent initially to the user, and depending on the user’s requirements, consecutive data packets are send to improve the map. The coarser map versions can either be generated dynamically at the time of the request (on-the-fly) or can be pre-calculated through the process of generalization. On-the-fly generalization is an unsolved problem for cartography. A number of researchers have focused on dynamic generalization ([12], [13], [14]) in order to enhance progressive vector transmission but since there are no formalized cartographic generalization principles and applicable generalizing operators [15], automated dynamic generalization still remains a challenge. Moreover, the existing generalization algorithms are time-demanding and thus not applicable for real-life Web application. Another disadvantage of dynamic generalization is that it produces inconsistent results in terms of retaining topology and geometry attributes and thus often need an a posteriori evaluation of their consistency. A topologic consistent approach has been proposed for polygons and lines by [11] with the exception of isolated polygons that have area smaller than a given threshold and lines that belong to the smallest category (for example first-order streams in a river network). On the other hand, off-line generalization and the creation of different LoDs is the norm for mapping agencies. In this case, generalization is time-insensitive and is usually performed interactively by expert cartographers with the help of specialized software [12]. [2] presented a method for pre-computing and storing multiple map representations suitable for progressive transmission to the user but without further implementation. Although the maintenance of different LoD is cumbersome, off-line generalization yields topologically and geometrically accurate products, and similar data management techniques to these used to maintain multi-LoD raster databases can be applied to such vector databases. The advantage of progressive data transmission relies on the fact that users can perform preliminary operation even on a coarser version of the map or they can assess the suitability of the map requested and possibly change their request without waiting
Tiled Vectors: A Method for Vector Transmission over the Web
59
for the whole dataset to be downloaded. The process is shown in Figure 1, where A, B, C, D1 and D2 are various time periods of the process. Period A is the time during mouse move, B is the time that the user waits for the coarser version of the map to be loaded, C is the time the server needs to extract the data. D1 is the time the user observes the map while it becomes more detailed and D2 is the time that the user observes the fully detailed map. Progressive transmission enables users to start observing the map before the whole map is downloaded.
Fig. 1. Steps and time periods in the progressive transmission of vector data
Recognizing the need for more efficient transmission methods, some early efforts on vector tiling have been introduced ([16], [17]). These methods are based on the off-line preparation of vector tiles which introduces an extra step in the preparation and maintenance of the map (i.e. to create and maintain the tiles for each LoD). The main disadvantage though is that the proposed methods mainly focus on the simple visualization of the map and do not provide a merging mechanism of the tiles at the client side. Thus, the user is presented with a map composed by segmented entities at the edges of the tiles which leads in semantic inconsistencies between the real entities and the map entities presented to the user. In contrast, in [18] we suggested that a mechanism that provides a solution for on-the-fly tiling of the data on the server and the merging of the tiles on the client will be more appropriate.
3 Methodology Instead of trying to implement a progressive transmission technique tailored to vector data needs, the method presented here follows the tile-based approach. In brief, according
60
V. Antoniou, J. Morley, and M. (Muki) Haklay
to the proposed methodology, asynchronous data requests are submitted to the server only if the data has not already been sent to the user, otherwise data are read from the browser’s cache. When a user’s request reaches the server the data are cut into tiles and then send to the user’s browser. At the user’s machine the tiles are merged and the final map is presented to the user (see Table 1 for the steps followed after a typical map request). This approach provides a method to transmit vector data to the client using AJAX, but it does not solve the problem of on-the-fly vector generalization. Thus, this approach is applicable when we have pre-prepared multi-resolution spatial databases. It is understandable that the effective implementation of a methodology based on the clientserver architecture needs the coordination of all engaged parts (spatial database, server, user’s browser and the map document itself). In what follows we describe the architecture of the methodology with an emphasis on the structure of the map document, the interaction of map document with the browser and the server and the map preparation (i.e. tile merging). 3.1 The Map Document The role of the map document’s structure is central for the methodology. The map document has three different layers (Figure 2). The first layer consists of a 5x5 grid of tiles (background area). This layer is not visible to the user but is used to hold the tiled data sent to the map document either by the server or the browser’s cache memory. Also, this layer provides data to the next layer of the map document. The second layer (viewable area) consists of a 3x3 grid of tiles. These tiles are cloned from the background area (first layer). Its role is to hold the data that are going to be merged and then assigned to the thematic layers at the next layer of the document. The third layer (map area) consists of one tile. This layer is the actual map requested by the user and is compiled from the thematic layers according to cartographic rules.
Fig. 2. The structure of the map document
Tiled Vectors: A Method for Vector Transmission over the Web
61
The cloning of data from one part of the document to another is triggered by the user’s actions. When the user requests to view (by panning the map) more data, the tiles that are already stored in the hidden area of the document are cloned to the merging area, where the merge of geometries takes place (see next paragraph), and finally are assigned to the correct thematic layers which are presented to the user as the final map. At the same time, the map document prepares itself for the next user actions by requesting new data either from the browser’s cache memory or from the server. 3.2 Merging An important step of the whole methodology is the merging process that takes place at the client. The geometries stored in each of the 9 tiles of the viewable area are merged before they get assigned to the correct thematic layer. This step is needed because the map entities presented to the user should be in logical accordance with the entities stored in the database. For example, if a single polygon is split into two polygons (each one stored in a different tile) during the extraction from the database, when presented to the user these polygons should be merged back into one entity. This will allow users to interact correctly with the elements of the map. Since thematic layers can hold either point, line or polygon geometry there needs to be a merging mechanism for every type of geometry. 3.2.1 Points The merging of points is trivial, since the only thing needed is to clone all points from the tiles to the final thematic layers. Javascript can easily parse the Document Object Model (DOM) of XML documents and clone data from one part of the document to another. 3.2.2 Lines The merging of lines is based on the use of the unique feature IDs. IDs are used as keys to search inside the 9 tiles of the viewable area. Line segment that have the same ID are grouped and then joined into single entities. Such a join may result either in a
Fig. 3. The merge of a line that resides in different tiles can lead to: (a) polyline element or (b)multi-polyline element
62
V. Antoniou, J. Morley, and M. (Muki) Haklay
polyline feature when the segments have common points or in a multi-polyline feature when there are no common points among segments that have the same ID, depending on how the original line, stored in the database, was split into tiles (Figure 3). 3.2.3 Polygons Unique IDs are also used for merging polygons. Once again, polygons that share the same ID are grouped and then merged either into polygon or multi-polygon entities. The case of polygon merging though, presents a greater degree of difficulty in order to disambiguate all possible cases. The main obstacle is the presence of unwanted border lines generated during the tiling process (see for example Ch. 3 of [19] for details about spatial operations and how new line segments are generated through the tilingintersection process). For example, Figure 4a and 4b show two different cases of a border line appearance. In Figure 4a a polygon border line generated during the tiling process is needed to achieve the correct coloring of the polygon presented to the user. In contrast, in Figure 4b that same border line is causing an unwanted visual effect. The correct way of rendering the merged parts of the polygon in 4b, is shown in Figure 4c.
Fig. 4. Dealing with border lines during the merge of polygons
Still, the border lines generated during the tiling process are necessary during the merging phase since they help elucidate the rendering of polygons. For example, Figure 5 shows that when border lines are absent there is no indication of how a polygon should be colored.
Fig. 5. A vague case of polygon coloring
Tiled Vectors: A Method for Vector Transmission over the Web
63
To tackle that problem we need to have both the border lines of the polygons and an indication regarding when each border line should be used in the merging process. Since border lines are generated by the intersection of the polygon and the tile it is obvious that these border lines will coincide geometrically with the outline of the tile. We need to record at which side (1, 2, 3 or 4) of the tile (see Figure 6) there was a border line created. This element allows us to create a set of rules regarding the inclusion or not of the border line in the merging process. For example, border lines that coincide with the number 1 side of a tile should be included in the process only if the tile is placed in the (0, j) places of the 3x3 grid. In any other case the border line should be excluded by the process.
Fig. 6. The method to assign an indicator to border lines
By doing so, in the case shown in Figure 4a the border line would have been used to form the outline of the polygon but in case shown in Figure 4b the border would have been excluded in the merging process. This approach has been successfully tested in all possible polygon cases like ring polygons, island polygons or multi-part polygons. Finally, Table 1 describes the steps that take place inside the map document. Table 1. The steps of map preparation
1. The method starts with a request by the user. A number of tiles is extracted from the database, covering a broader map area and send to the user’s browser.
2. At the client’s browser, for each layer, the geometry of the tiles is merged creating a continuous and correctly compiled map.
64
V. Antoniou, J. Morley, and M. (Muki) Haklay Table 1. (continued)
3. The user is presented with the requested map.
4. So, in fact, while the user sees a restricted map area (the one that originally requested), a much larger area is available to accommodate the next panning gesture without the need for further client-server communication.
5. Additionally, another set of tiles is also extracted from the server and send to the user. These tiles are not merged but are kept in the background section of the map which is not visible to the user. These tiles work as cache memory inside the document to accommodate the next panning gestures without making the user wait for new data.
6. When the user makes a panning gesture the map document is able to respond without delay. Nevertheless, the map document starts the process to prepare itself in order to accommodate the next user’s moves by re-arranging the tiles in the map document and by getting the missing data, either from browser’s cache memory (if the needed data are already there) or from the server (see how in the next steps).
7. First the column (or row) of tiles in the background section that is on the opposite from the pan direction is dropped from the document and stored in browser’s cache.
8. The tiles that exist in the viewable area and are not needed any more are dropped, but remain in the browser’s cache too.
Tiled Vectors: A Method for Vector Transmission over the Web
65
Table 1. (continued)
9. The tiles that reside in the background area and are needed to accommodate the new map are cloned to the viewable mapping area.
10. The geometry of the tiles is once again merged for each thematic layer.
11. At the same time, a new set of tiles, needed to prepare the map document, is requested from the browser. If not found there, the request is propagated to the server. These tiles will be stored in the background area of the map document and the process is ready to start all over again.
3.3 Map Document’s Interaction with Browser and Server Client side caching is a common programming technique. This technique allows browsers to hold locally for later use data that have been sent from the server without any further interaction. Thus, reduced network latency and enhanced user experience is achieved. In our case, browser’s cache memory is used to hold tiles of data sent from the server to the user but not any more stored inside the map document. Thus, whenever new tiles are required from the map document, the search starts in the browser’s cache. If the tiles needed are stored in the cache memory, then the data is inserted into the map document. If the data has not been already sent to the user, then the request for new tiles is propagated from the client to the server. This client-server interaction takes place asynchronously using AJAX requests. AJAX is a well known methodology that has played a key role in the evolution of Web 2.0 by helping programmers to build desktop-like applications over the Web. In Web mapping AJAX made possible to overcome long standing obstacles related with the transmission of raster data (due to their volume) or poor user interaction with the
66
V. Antoniou, J. Morley, and M. (Muki) Haklay
mapping applications. In our case we use AJAX to manage the client-server communication behind the scenes. Given the fact that vector tiles have fixed dimensions (defined by the map window of the application), the construction of spatial queries to retrieve the missing tiles is a straightforward process. Figure 7 shows the time periods involved in the whole process. Period A is the mouse movement; using AJAX we can trigger a new map request while the user is still panning in a direction. Period B is the time that the user observes the map. The tiled method coupled with AJAX requests provides data constantly to the user and thus there is no need for the user to wait for the map to be downloaded. Period C is the time the server needs to extract the data. The key point of the method is that it fully exploits the time period of user inactivity (i.e. does not interact with the map requesting new data). While the user observes or queries the map entities available, behind the scenes there is work in progress in order to prepare the map document to accommodate the next user’s moves. If time period B is very small (i.e. the user is quickly panning towards the same direction) the process of requesting new data from the server can be suspended with the help of a time counter. This process is implemented only for those requests that cannot be accommodated by data stored in browser’s cache.
Fig. 7. Steps and time periods in tile-based transmission of vector data
Figure 8 shows a screenshot of a prototype built in order to test and improve our algorithms. We used SVG as the format to build our map because it is an XML based format that supports scripting. Any practices and methods applied during the implementation can be implemented in other XML-based or text-based vector formats.
Tiled Vectors: A Method for Vector Transmission over the Web
67
Fig. 8. A screenshot of the prototype
4 Performance Usually the performance of progressive transmission methods for vector data are tested by examining the time needed to transmit a dataset of a certain size (for example see [20], [11], [3]). This type of performance evaluation is not applicable in our case since the method aims to the exact opposite target: to avoid the transmission of bulk sets of data and instead to split data in small packets and exploit the real-life user behavior to efficiently transmit vector data. Therefore, for the performance evaluation of our method a more user-oriented and thus more suitable method was used. We examined the time periods for panning (time period A in Figure 8) and map observation (time period B in Figure 8) that will allow our method to present to the user a ready to use map immediately (i.e. not fail to fulfill the condition tmouse up = tmap presentation). The experiments were performed over the Web using a server with 2.13 GHz CPU double-core processor and 1 GB of memory stationed in Athens (Greece), connected to the Internet with nominal value for download 24Mb/s and for uploading 1Mb/s (the actual average values recorded during the experiments were 3.1Mb/s and 280Kb/s respectively) and a client with 1.66 GHz CPU double-core processor and 1 GB memory stationed in London (UK), connected to the Internet with 8Mb/s download and 1Mb/s upload speed nominal values (the actual average values recorded during the experiments were 3.8Mb/s and 682.9Kb/s respectively). The average ping time between client and server during the experiments was 112 ms. In our evaluation we used two different sets of data: a world level dataset that contains countries, rivers and cities and one at a street level one that contains parcels, parcels’ registration points and street centerlines. Table 2 shows the analysis both of the entire datasets stored in a spatial database and of the data accessed by the user during the experiments. We examined the performance of our method in the worst case scenario: the user was panning constantly in the same direction and thus no help from the browser’s cache was provided. Instead, every user’s request was propagated to the server in order to retrieve data not yet sent to the client. We recorded the time for the panning
68
V. Antoniou, J. Morley, and M. (Muki) Haklay Table 2. The analysis of the datasets used and accessed during the experiments
World Level Data Geometry Type Polygons Polylines Points Total
Entire Dataset Num. Num. of of Entities Parts 147 249 98 177 606 606 851 1,032
Num. of Points 28,162 3,965 606 32,733
Data Accessed by the User Num. Num. Num. of of of Entities Parts Points 140 228 24,983 95 147 3,852 550 550 550 785 925 29,385
Street Level Data Geometry Type Polygons Polylines Points Total
Entire Dataset Num. Num. of of Entities Parts 5,879 5,879 15,186 15,186 6,245 6,245 27,310 27,310
Num. of Points 37,213 32,128 6,245 75,586
Data Accessed by the User Num. Num. Num. of of of Entities Parts Points 2,360 2,360 15,122 5,941 5,941 12,535 2,447 2,447 2,447 10,748 10,748 30,104
gestures and the minimum time that the user had to observe the requested map until the map document prepares itself for the next panning gesture without introducing further delays. In every step of our experiments the user was presented with a correctly composed and fully functional map. This means that all entities where styled and assigned to the correct thematic layers which were used to compose the map. Additionally, a link was assigned to every entity presented to the user so to enable further AJAX queries (for example, request the attributes of an entity from the server). Table 3 shows the time needed for the presentation of the map after the first request and the minimum, maximum and average map observation times in order the user to be constantly presented with such a map. Table 3. Performance results of the proposed method
First map presentation (sec)
Average Pan Time (sec)
Min - Max – Average Observation Time (sec)
World Level
7.8
0.7
0.4 - 3.5 - 1.3
Street Level
18.1
0.7
0.6 - 5.5 - 2.8
As expected, the slowest part of the method is the time needed for the presentation of the map after the first request. Subsequent map presentations though, need considerably less time which varies due to differences in the volume of data transmitted and
Tiled Vectors: A Method for Vector Transmission over the Web
69
the number of entities that need to be merged each time. As explained earlier, in a real life scenario (i.e. the user does not pan only in one direction) the observation time is expected to be further reduced by using data stored in browser’s cache memory.
5 Discussion and Future Work The proposed method provides a smooth user interaction with the map, overcoming the problem of long waits for the download of vector data since there is no need for the user to wait for such downloads. The efficiency of the method depends on the correct coordination and refinement of all the method’s steps. So, in addition to what has been discussed so far, a new database structure can be developed that will hold directly pre-calculated tiles instead of extracting them at the time of request, based on global gridding similar to existing tiled raster services. This will considerably improve server performance and server-side caching. The key point of the proposed method is that the map document should always have the necessary data to accommodate the next user’s move; the tiled vector data must reach the browser before the user requests them in order to eliminate waiting times. In other words the performance of the method is in close relationship with the user’s behavior as well as the efficiency of the method itself. The bigger the time periods the user remains inactive in terms of data request, the more time is available for the preparation of the map document for the next request. In contrast with what takes place in the case of raster tiles, this preparation time varies in the vector case. Raster tiles have a standard dimension which also results in having almost a fixed size. In contrast, there is no guarantee of the size of a tile that holds vector data and this can introduce delays in the process. A solution to that could be tiles of variable dimension (i.e. different spatial coverage) that will hold data that has size below a chosen threshold. In this case specialized algorithms for tile indexing and manipulation are needed. Alternatively, progressive transmission techniques could be implemented for the data that each tile holds. The proposed method has a number of advantages compared to the transmission methods available today. Following this strategy, a continuous vector map will be available to the user by sending only small pieces of data in order to achieve reduced network latency. The procedure does not need to interact with the server, and thus introduce network latency, for small panning gestures since the client will have enough data to accommodate such user actions. Additionally, when needed, the use of client caching and AJAX will avoid the unnecessary client-server transactions but will still allow the missing tiles to be embodied in the map behind the scene without the user noticing it. By testing our method over the Web we proved that it can be implemented in real life applications. The approach builds upon the architecture that most mapping agencies use: multi-scale databases. While the on-the-fly generalization problem remains unsolved, map providers serve maps from different LoDs on the Web. Exploiting that fact and in contrast with the existing limitations of progressive transmission techniques, this method disturbs neither the geometry nor the topology of the features presented on the map. Moreover, it has no restrictions in handling multi layered requests of any geometry and in applying all cartographic principles during map composition.
70
V. Antoniou, J. Morley, and M. (Muki) Haklay
Also, the method can be implemented to any XML or text-encoded format like KML or GeoJSON. It is interesting to note that, after the first map presentation, this method offers an efficient way so users can access unlimited volume of vector data with waiting/observation times that not differ from the typical waiting/observation times that occur when they browse any other Web application. Finally, this tiling approach makes more difficult to compromise intellectual property rights (IPRs) of vector data compared with methods that send the whole dataset to the client, progressively or not. The compromise of IPRs has been a major factor that made developers and mapping agencies reluctant to publish vector data on the Web. In the proposed method, while in the first step more data than requested are sent, this disadvantage is balanced by reduced client-server interaction and network latency during the subsequent steps. The combination of the evolution in the Web and the efficient mechanisms for raster delivery on the one hand and the need for vector data for enhanced interactivity and object manipulation on the client side on the other form a new environment for Web mapping. We have suggested that in such an environment the role of hybrid Web maps able to host both raster and vector data will be considerably increased [21]. Such maps will use only the strong points of raster and vector data and is likely to be the most efficient way to deliver spatial data for complex Web mapping applications. In that context, our future work will focus on researching methods that will enable the harmonious cooperation of the established raster delivery methods with our proposed method for vector data transmission.
References 1. Cartwright, W.E.: Mapping in a digital age. In: Wilson, P.J., Fotheringham, A.S. (eds.) The Handbook of Geographic Information Science, pp. 199–221. Blackwell Publishing Ltd., Malden (2008) 2. Bertolotto, M., Egenhofer, M.J.: Progressive Transmission of Vector Map Data over the World Wide Web. GeoInformatica 5(4), 345–373 (2001) 3. Bertolotto, M.: Progressive Techniques for Efficient Vector Map Data Transmission: An Overview. In: Belussi, A., Catania, B., Clementini, E., Ferrari, E. (eds.) Spatial Data on the Web: Modeling and Management, pp. 65–84. Springer, New York (2007) 4. Dykes, J.A.: Exploring spatial data representation with dynamic graphics. Computers & Geosciences 23(4), 345–370 (1997) 5. Cook, D., Symanzik, J., Majure, J.J., Cressie, N.: Dynamic graphics in a GIS: More examples using linked software. Computers & Geosciences 23, 371–385 (1997) 6. Andrienko, G.L., Andrienko, N.V.: Interactive maps for visual data exploration. International Journal of Geographical Information Science 13(4), 355–374 (1999) 7. Zhao, H., Shneiderman, B.: Colour-coded pixel-based highly interactive Web mapping for georeferenced data exploration. International Journal of Geographical Information Science 19(4), 413–428 (2005) 8. Turner, A.: Introduction to neogeography. O’Reilly Media, Sebastopol (2006) 9. Goodchild, M.: Citizens as sensors: the world of volunteered geography. GeoJournal 69(4), 211–221 (2007) 10. Yang, C., Wong, W.D., Yang, R., Kafatos, M., Li, Q.: Performance-improving techniques in web-based GIS. International Journal of Geographical Information Science 19(3), 319– 342 (2005)
Tiled Vectors: A Method for Vector Transmission over the Web
71
11. Yang, B., Purves, R., Weibel, R.: Efficient transmission of vector data over the Internet. International Journal of Geographical Information Science 21(2), 215–237 (2007) 12. Cecconi, A., Galanda, M.: Adaptive zooming in Web cartography. In: SVGOpen 2002 (2002), http://www.svgopen.org/2002/papers/ cecconi_galanda__adaptive_zooming/index.html 13. Lehto, L., Sarjakoski, L.T.: Real-time generalization of XML-encoded spatial data for the Web and mobile devices. International Journal of Geographical Information Science 19(8), 957–973 (2005) 14. Jones, C.B., Ware, J.M.: Map generalization in the Web age. International Journal of Geographical Information Science 19(8), 859–870 (2005) 15. Weibel, R., Dutton, G.: Generalizing spatial data and dealing with multiple representations. In: Longley, A.P., Goodchild, F.M., Maguire, J.D., Rhind, W.D. (eds.) Geographical information systems. John Wiley & Sons, Chichester (1999) 16. Campin, B.: Use of vector and raster tiles for middle-size Scalable Vector Graphics’ mapping applications. In: SVGOpen 2005 (2005), http://www.svgopen.org/2005/papers/ VectorAndRasterTilesForMappingApplications/ 17. Langfeld, D., Kunze, R., Vornberger, O.: SVG Web Mapping. Four-dimensional visualization of time- and geobased data. In: SVGOpen 2008 (2008), http://www.svgopen.org/2008/papers/92-SVG_Web_Mapping/ 18. Antoniou, V., Morley, J.: Web Mapping and WebGIS: do we actually need to use SVG? In: SVGOpen 2008 (2008), http://www.svgopen.org/2008/papers/ 82- Web_Mapping_and_WebGIS_do_we_actually_need_to_use_SVG/ 19. Rigaux, P., Scholl, M., Voisard, A.: Spatial Databases: With Application to GIS. Morgan Kaufmann Publishers, San Francisco (2002) 20. Yang, B.: A multi-resolution model of vector map data for rapid transmission over the Internet. Computers & Geosciences 31(5), 569–578 (2005) 21. Antoniou, V., Morley, J., Haklay, M.: Is your Web map fit for purpose? Drawing a line under raster mapping. In: AGI Geocommunity 2008 (2008), http://www.agi.org.uk/site/upload/document/Events/AGI2008/ Papers/VyronAntoniou.pdf
Semantic Challenges for Sensor Plug and Play Arne Br¨oring, Krzysztof Janowicz, Christoph Stasch, and Werner Kuhn Institute for Geoinformatics, University of Muenster, Germany {arneb,janowicz,staschc,kuhn}@uni-muenster.de
Abstract. The goal of the Sensor Web Enablement (SWE) initiative of the Open Geospatial Consortium (OGC) is the definition of web service interfaces and data encodings to make sensors discoverable, taskable and accessible on the World Wide Web. The SWE specifications enable a standardized communication and interaction with arbitrary types of sensors and sensor systems. The central concepts within OGC’s Sensor Web architecture are sensors, observations and features of interest. Sensors and their observations can be registered and stored through the Sensor Observation Service (SOS) to make them accessible for clients. So far, mechanisms are missing which support a semantic matching between features of interest stored in a database and referred to by an observation. The same applies for the matching between observations as sensor outputs and the properties of the features of interest. By taking a use case from disaster management, we outline the challenges and demonstrate how semantically annotated SWE data models and service interfaces support semantic matching. The result is a roadmap towards a semantically enabled sensor plug & play within the Sensor Web.
1
Introduction
Recent improvements in sensor technology and lower prices change the way we collect and process massive amounts of data in realtime. Thus, the usage of sensors increases in applications ranging from environmental monitoring over early warning systems and precision agriculture up to personal health and performance monitoring [1,2,3]. The Sensor Web Enablement initiative of the Open Geospatial Consortium (OGC) aims at standardizing the discovery, exchange, and processing of sensor data as well as their tasking. Therefore, the Sensor Web Enablement initiative defines a framework of data models and encodings for describing sensors and their observations as well as a suite of web service interfaces leveraging these models and encodings [4]. While the OGC has already done substantial work in defining protocols and service interfaces to enable syntactical interoperability, semantic enablement is still in an early stage [5]. Recently, Sheth et al. [6] coined the term Semantic Sensor Web to combine Sensor Web technology with the Semantic Web. A first step towards the realization of the Semantic Sensor Web has been presented by Henson et al. [7] by introducing a semantic enabled Sensor Observation Service called SemSOS which semantically annotates the service responses. J.D. Carswell et al. (Eds.): W2GIS 2009, LNCS 5886, pp. 72–86, 2009. c Springer-Verlag Berlin Heidelberg 2009
Semantic Challenges for Sensor Plug and Play
73
In contrast, our approach focuses on the semantic annotation of service requests for adding new sensors and observations to an Sensor Observation Service. The correct semantic matching from sensor inputs and outputs to the observed property of the features of interest as well as the matching between a real world entity observed by a sensor and the feature of interest have to be assured. So far, these matchings have to be established and maintained manually by the service provider. In particular, this problem appears when multiple observation suppliers publish their content via the same service instance. A sensor services can be set up for certain geographic regions and various sensors of different types can register at these services and upload their observations. Taking into account mobile sensors moving in and out of this region the problem becomes even more pressing. An automatic plug & play of sensors which realizes a correct mapping of the different Sensor Web concepts is needed. In this paper, we present a detailed analysis of the challenges of adding new sensors to the Sensor Web and publishing their gathered observations. The work will serve as a roadmap towards semantically enabled plug & play for the Sensor Web. The remaining paper is structured as follows. Section 2 describes the basis of this work by introducing the Sensor Web and the role of ontologies. The following section 3 describes an emergency scenario to illustrate our work. We then provide an in-depth analysis of existing challenges for registering sensors and publishing observations. The paper closes with conclusions and an outlook to future work.
2
Background
The idea of the Sensor Web is to standardize the web based discovery, exchange, and processing of sensor data as well as their tasking. The OGC has established a SWE working group which defines a framework of data models and encodings for describing sensors and sensor observations, as well as a suite of web service interfaces leveraging these models and encodings [4]. The Sensor Observation Service [8] is part of the SWE framework and provides a standardized interface for the pull-based access to archived and near-realtime, sensor observations and metadata. The service interface and its operations are divided into three profiles: Core, Transactional and Enhanced. The core profile includes the three mandatory operations, GetCapabilities for requesting a description of the service and the offered sensor data, DescribeSensor for retrieving sensor metadata, and GetObservation for querying observations of particular sensors or phenomena using any combination of temporal, spatial and value filters. The RegisterSensor operation of the optional transactional profile enables the registration of new sensors. Afterwards, the InsertObservation operation allows the integration of new observations produced by registered sensors. The enhanced profile offers optional operations such as the GetResult operation to retrieve only results of observations without their metadata. A service implements the entire profile, if it supports all operations. The SOS uses the Sensor Model Language (SensorML) specification [9] for the encoding of sensor metadata descriptions. SensorML provides models and
74
A. Br¨ oring et al.
Fig. 1. Basic observation model of O&M specification
encodings to describe any kind of process in sensor or post processing systems. Thus, the basic type of all SensorML descriptions is the process type containing input and output elements, as well as several additional parameters. Different subtypes of the process type are provided for various kinds of detectors, actuators, or aggregated systems. The Observations and Measurements (O&M) specification [10] is utilized by the service to encode the data gathered by sensors. It defines a model describing sensor observations as an act of observing a certain phenomenon. The basic observation model contains five components as shown in figure 1. The observation comprises a link to the procedure (usually a sensor, e.g., a water gauge), which generates the value for the observation, as well as a reference to the observed property (e.g., water level) representing the phenomenon which was observed. The feature Of interest refers to the real world entity (e.g., a river) which was target of the observation and has to carry the observed property as its feature property. The real world entity can also be a process such as the dispersion of a chemical cloud. The sampling time attribute indicates the time, when the observation was applied to the feature of interest. The observation value is contained in the result element. It depicts a symbol for the observed phenomenon during the sampling time located at a certain feature of interest. Thus, the type of the observation result must be consistent with the observed phenomenon and the observed property has to be a property of the feature of interest. If the measurement procedure represents a sampling of a spatially distributed entity, the features of interest refer to artifacts of the sampling strategy. Therefore, part 2 of the O&M specification [11] defines certain representation types for these artifacts, the so called sampling features. The specification also provides a link from the sampling features to ultimate features of interest which represent the spatially distributed real world entities. For example, when measuring the surface temperature of a lake, the concrete locations of the measurements are
Semantic Challenges for Sensor Plug and Play
75
represented through the sampling points. The real world entity, which carries the surface temperature property, is represented through a reference from the sampling points to the feature representation of the lake. In case of using sampling points as features of interest, the term feature of interest becomes ambiguous as it represents the artifacts of sampling as well as the real world entities which are observed. Recently, other approaches have tried to model the locations of the sampling points as part of the observation results and use the features of interest for representing the ultimate entities of interest. Consequently, in such an approach the features of interest are representations of real world entities. Besides modeling sensors, their observations, and features of interest using SensorML and O&M, ontologies are used to specify types of sensors, observations, and features in more detail [12,7,13]. In general, ontologies are applied at three stages: modeling, integration, and discovery. First, they allow to restrict the meaning of technical terms such as wind direction or pollution towards an intended interpretation [14,15]. As executable specification, ontologies can be checked for consistency, i.e., whether they are contradiction-free, and used to make implicit knowledge explicit [16]. Second, using various reasoning services, alignment, matching, and translation [17], ontologies play a crucial role in onthe-fly integration of heterogeneous information and hence assist in establishing semantic interoperability [18]. For instance, complex service chains of Sensor Observation Services, Web Processing Services, and Web Mapping Services require more knowledge about the exchanged data than just code lists. Finally, formal definitions of sensors, observations, and feature types support information retrieval beyond simple keyword search by using reasoning services such as subsumption and similarity [19,20]. To realize these goals various research groups started to specify sensor, stimuli, and observation ontologies [21], examples include the Semantic Web for Earth and Environmental Terminology (SWEET)1 and the sensor ontology developed as part of the W3C Semantic Sensor Network Incubator Group2 .
3
Scenario
Based on a use case of the SoKNOS project [22], this section introduces a fire scenario to illustrate the challenges for registering sensors and publishing observations. A fast extending blaze at the waste dump of Muenster in Germany causes a dispersion of pollutants into the air. The air pollutants threaten an important European bird reserve, the so called Rieselfelder, and the surrounding settlements. In our scenario, mobile sensors are deployed to monitor air pollutants, wind speed, and wind direction. We assume that a local Sensor Web is already in place and used by a disaster relief organization. The newly deployed sensors have to be made available within the Sensor Web on-the-fly. Applications can directly utilize the gathered observations to get an overview of the situation 1 2
and for dispersion simulations. The scenario definition contains three examples for the registration of sensors and access to their observations: 1. A service is already set up for certain features of interest. If new sensors are registered, it has to be checked, whether these sensors produce values for already existing properties of the monitored features of interest, or whether new properties have to be created. This example illustrates the matching between outputs of a sensor and properties of already existing features of interest. 2. A service instance is already deployed for specific meteorological phenomena. If the mobile sensors are registered, it has to be checked whether the sensor outputs comply with the wind phenomena offered by the service. Additionally, when a new feature of interest is inserted into the service, it has to be assured whether the properties of the feature correspond to the phenomena provided by the service. This example demonstrates the matching of sensor outputs as well as feature properties with the phenomena offered by the service. 3. If a new observation is inserted into the service, it has to be checked whether the observed property of the feature of interest complies with the input and output of the sensor which has been registered for this observation before. This example illustrates a consistency check between the InsertObservation request and the registered sensors. Such a scenario is typical for Sensor Web use cases as it covers two important tasks at the same time – device discovery (e.g., which sensors are necessary to monitor the gas plume) and data discovery (e.g., which data can be used to compute the dispersion of the gas plume).
4
Semantic Challenges for Sensor Registration
In the following, we analyze the challenges for registering sensors and publishing their observations on the Sensor Web. Different kinds of sensors are necessary to compute the plume of air pollutants introduced in the scenario. For the sake of readability, the following examples focus mostly on wind direction sensors. To avoid terminological confusion3 , in this work the term entity refers to particulars in the real world. This also includes processes such as the dispersion of pollutants. The term feature of interest (or feature for short) refers to the computational representation of real world entities, e.g., a polygon representation of the gas plume. Consequently, features can also represent processes. Sensors measure stimuli (observable phenomena) which are either directly or indirectly related to the real world entities [13,12]. The result is called an observation. For 3
The O&M specifications are not very clear about the exact meaning of the terms phenomenon, measurand, feature, and so forth, see for example [10, p.17-18], as well as the distinction between real world entities and their representation.
Semantic Challenges for Sensor Plug and Play
77
Fig. 2. Semantic Matching for Sensor Observation Services
instance, one characteristic of a gas plume is the concentration of a specific pollutant. Sensors can measure this concentration and convert it into an observation value. This value then refers to the observed property of a feature of interest associated with a Sensor Observation Service. If it is not clear from the context, we will explicitly distinguish between sampling features and ultimate features. A sensor can be added to the Sensor Web by using the RegisterSensor operation of a Sensor Observation Service. The metadata description passed along with the operation request defines input and output of the sensor. The semantic matching between inputs and outputs of sensors and the observed property of the features of interest have to be assured. Also, the real world entity observed by a sensor must match the feature of interest of an observation. The Sensor Web is missing a mechanism which ensures a meaningful matching without user interaction to support a semantically enabled sensor plug & play. In the following, three major mapping challenges are introduced and discussed in detail; an overview of the combined challenges is presented in figure 2. 4.1
Matching of Real World Entities and Features
Sensors are deployed to monitor certain entities by observing stimuli related to them [12]. Their computational representations (the features of interest) are stored in geodatabases or OGC services such as the SOS. When deploying and registering new sensors, it has to be assured that the real world entities which are observed by the sensors have their counterparts in features provided by the SOS. This challenge is depicted in figure 3.
78
A. Br¨ oring et al.
Fig. 3. Matching of real world entities and features
In our scenario, the sensor deployer has to define representations of real world entities (e.g., sampling points located within the Rieselfelder, or the Rieselfelder as an ultimate feature of interest) which shall be observed. After defining these features, the sensors are registered at an existing SOS instance. Since the service should only provide observations for particular features, it has to be determined whether the existing features correspond to the real world entities observed by the newly added sensors. In fact, this challenge relates to the so-called symbol grounding problem [23]. The definition of features of interest and their feature types (e.g., bird reserve) has to be grounded in a shared and commonly agreed upon reference system. Sampling points can be reduced to their spatial footprint and hence can be grounded using a spatial reference system. In contrast, the ultimate feature of interest cannot be reduced to its spatio-temporal footprint but also requires a thematic component. Therefore it has to be grounded in spatial, temporal, and semantic reference systems [24,25]. In case of the gas plume scenario, it is not clear whether the gas plume dispersion (which is a process), the waste dump, the Rieselfelder or the physical position of the wind direction sensor should be selected as feature of interest. The conceptualization of this feature also influences whether a 2D or a 3D sonic anemometer should be used as sensor (see also figure 4). 4.2
Matching of Stimuli to Sensor Inputs
The second challenge describes the matching between sensor inputs as specified in SensorML and stimuli related to real world entities. This challenge is depicted in figure 4. Sensors are used to gather information about specific characteristics of particular entities. These characteristics can only be observed by stimuli related to them [12]. Increasing temperature, for instance, can be observed by the volume expansion of mercury. Typically a single sensor is constructed to observe a single stimulus. However, a stimulus can be interpreted in different ways to learn about multiple characteristics of the observed entities. Additionally, single sensors can be combined to sensor systems. With respect to the gas plume scenario, a propeller anemometer is a combination of a wind speed and
Semantic Challenges for Sensor Plug and Play
79
Fig. 4. Matching of stimuli to sensor inputs
Fig. 5. Matching of sensor output and feature property
a wind direction sensor. Both sensors use the flow of air mass as stimulus. A sonic anemometer makes use of an indirect stimulus, namely the transit time of a sonic pulse between pairs of transducers to measure wind direction and speed (in 2D or 3D). Consequently, taking the vision of a sensor plug & play with minimal human intervention seriously the feature of interest has to be modeled based on the notions of observations and stimuli. This would allow to select appropriate sensors semi-automatically and register them at a Sensor Observation Service to gather their measurements. The stimulus to which a sensor reacts is the origin of its measurement procedure. Besides this basic characteristic other more technical properties of the sensor such as the transfer function, the sensitivity range and quality parameters describe the behavior of a sensor. All these different properties have to be considered to prove the suitability of a sensor to measure certain characteristics. 4.3
Matching of Sensor Output and Feature Property
As outlined in section 2, an observation acts as a property value provider for a feature of interest. For example, an observation provides a value (e.g., 20◦ ) generated by a sensor (e.g., an anemometer) for certain characteristics (e.g., wind direction) of a feature (e.g., the Rieselfelder) at a certain time-stamp.
80
A. Br¨ oring et al.
The challenge in this case is whether the symbol and the semantics of the output produced by the sensor comply with the symbol and semantics of the property of the feature of interest. So far, code lists are used for a syntactic matching. Consequently, it is up to the SOS provider to ensure that the semantics of WindDirection in a particular SOS matches to prevailing direction in a SensorML description. The example of wind direction shows that answering this question can be challenging. Wind direction can be defined as the direction from which the wind blows, or as the direction the wind is blowing to. The observation value for wind direction can be an angle, a textual value, or a more complex XSD type [26].
5
Towards Semantically Enabled Sensor Plug and Play
While the previous section introduced several challenges for registering sensors and retrieving their observations, this section outlines the role of semantic annotation and reasoning to realize the envisioned sensor plug & play. In general, plug & play aims at reducing or avoiding any manual configuration when plugging new components into a system. With respect to Sensor Observation Services, it should be possible to select and register sensors with minimal human interaction. In large scale real world applications, it is unlikely that the provider of a Sensor Observation Service is also responsible for modeling the various features of interest, observations, and sensors using O&M and SensorML, respectively. In fact, these components are provided by external sources. Features of interest can be retrieved from Web Feature Services using semantically enabled catalogues [5,27]. The definitions of observable characteristics can be taken from ontologies such as SWEET or extracted from statistical models, while the SensorML annotations can be provided by the sensor manufacturers. In this case, the SOS provider cannot simply assume a meaningful correspondence based on the name of a sensor output and the name of a property related to a feature of interest. Semantic matching needs to assure that both names point to the same domain concept, e.g., the shared conceptualization that wind direction is denoted as pointing to the compulsion, in degrees, and in compass direction; see also [26] for details. In the following, we assume that the sensors used in the gas plume scenario are accompanied by a SensorML self-description provided by its vendor or manufacturer. Consequently, an SOS provider does not have to create the SensorML description at runtime. Additionally, we assume that the Sensor Web infrastructure contains an SOS which offers sensor data for multiple sampling features located within the affected ultimate feature, i.e., the Rieselfelder, and it is set up for various observable properties such as wind speed, wind direction, and pollutant concentrations. A new sensor can be made available on the Sensor Web by adding it to the SOS. Therefore, the RegisterSensor operation is invoked whose request contains the SensorML description. Listing 1 shows a fragment of such a request which registers an anemometer. The sensor is modeled as a system which incorporates among other descriptive elements an input and an output.
Semantic Challenges for Sensor Plug and Play
81
... <sml : System> ... <sml : i n p u t s > <sml : I n p u t L i s t > <sml : i n p u t name=’air movement ’> <swe : O b s e r v a b l e P r o p e r t y d e f i n i t i o n = ’ urn : o g c : d e f : phenomenon :OGC: air movement ’/ > <sml : o u t p u t s > <sml : O u t p u t L i st> <sml : o u t p u t name=’ wind direction ’> <swe : Q u a n t i t y d e f i n i t i o n= ’ urn : o g c : d e f : phenomenon :OGC: wind direction ’> <swe : uom c o d e =’ deg ’ /> <swe : q u a l i t y > <swe : QuantityRange d e f i n i t i o n= ’ urn : o g c : d e f : phenomenon :OGC: tolerance ’> <swe : v a l u e >−0.5 0.5 v a l u e> ... R e g i s t e r S e n s o r >
Listing 1. Request to register a new sensor
The stimulus observed by the sensor, its input, is identified by the definition attribute whose value is a Unified Resource Name (URN)4 . It uniquely identifies the referenced concept, in this case air movement, by pointing to a description stored in a dictionary or code list. Similarly, the sensor output, the wind direction, is referenced by a URN. Sensor observations provide values for properties of particular features of interest, e.g., a gas plume, associated with the SOS. To enable sensor plug & play, we propose to refer to ontologies containing formal specifications for stimuli, observations, and functional aspects of a sensor (marked bold in the listing). One example for such an ontology is the sensor type ontology developed at the W3C Semantic Sensor Network Incubator Group. While this ontology provides definitions for sensors and their components, future ontologies have to define stimuli and observations [13]. Existing technologies for ontological alignment, matching [17], and similarity [20] can then be used to ensure that the specified output of a sensor produces appropriate values for properties of certain features of interest. For instance, if a feature property wind direction has been modeled as a 3-dimensional quality, trying to assign a 2D anemometer to it would produce an error or warning (see also [29] for an ontological investigation on the dimensionality of qualities). Whether the semantic annotation of SensorML documents is realized using RDFa [6,7] or other technologies such as SAWSDL [30,31], is an implementation decision not
4
The structure scheme for the OGC namespace is defined by Whiteside [28].
82
A. Br¨ oring et al.
discussed here5 . A similar approach was introduced by Hornsby and King [32] for the transportation domain. For our scenario we assume that the sensor invokes the InsertObservation operation of the SOS as soon as data is available. Listing 2 shows a fragment of such a request. ... ... <s a : S a m p l i n g P o i n t gml : i d =’ s a m p l i n g 0 1 ’> <s a : sa m p l e d F e a t u r e x l i n k : h r e f= ’ urn : o g c : d e f : f e a t u r e :OGC: R i e s e l f e l d e r ’/ > <s a : p o s i t i o n > 7.89 52.90 ... 52.0 ...
Listing 2. Request to insert new observations
Similar to the RegisterSensor operation, the InsertObservation request has to be semantically annotated. This way, the SOS can verify whether the output type defined by the sensor is semantically compliant with one of the observed properties associated with the SOS and corresponding to a property of an associated feature of interest, e.g., a sampling feature within the Rieselfelder. On the long term and based on fixed stimulus-observation alignments, one could automatically discover and select appropriate sensors using features of interest and observations as queries. For instance, in case of a query such as will the Rieselfelder be affected by a gas plume, appropriate sensors and processing services can be automatically selected and arranged (using a Sensor Planning Service [33]). The necessary inference can be performed based on the knowledge about types of features and observations provided in the ontologies. Gas plumes, for instance, can be modeled as processes which have a direction of dispersion, concentrations of different pollutants and so forth. Each of these characteristics can be aligned to stimuli in a stimulus ontology used to describe sensors. This, however, requires the integration of ontology repositories and reasoning services into spatial data infrastructures. First, a Web Reasoning Service (WRS) is needed to encapsulate the reasoning components developed as core parts of the Semantic Web. Such a service could be developed as a profile of the Web Processing Service specification [34]. Using a WRS, users could query a Web 5
An API for the semantic annotation of OGC services is under development and can be downloaded at https://www.assembla.com/spaces/dashboard/index/sapience
Semantic Challenges for Sensor Plug and Play
83
Feature Service for waterbodies and retrieve individual rivers, lakes, reservoirs, and so forth. Based on similarity reasoning, users can also query for specific features (e.g., canals) and get similar features back in addition (e.g., rivers). Second, a Web Ontology Service (WOS) has to be designed which acts as a catalogue to registered ontologies and enable semantics-based discovery of Sensor Web related concepts such as features, observations, and sensors. A WOS can be considered as a profile for the OGC Catalogue Service [35]. Introducing profiles instead of completely new service types enables the integration with existing SDI technologies and simplifies the service orchestration; see [5] for details.
6
Conclusions and Further Work
In this paper, we discussed the challenges related to registering new sensors and inserting their observations to a Sensor Observation Service. We argued that these matchings have to be established and maintained manually by the provider of the SOS and explained the difficulties in doing so. Starting with an abstract view on the semantic matching challenges, we described how the semantic annotation of RegisterSensor and InsertObservation requests can serve as a basis for reasoning-based consistency checking and hence improve the manual matching process. The long term vision underlying this research is to enable sensor plug & play with minimum human intervention. The main difficulty lies in the relationship between the different OGC constructs used to model sensors, observations, and features of interest on the Sensor Web. Three challenges can be distinguished. The first describes the relation of the real world entity and the corresponding feature of interest as the computational artifact. If two sensors of different type both deliver observations assigned to a particular feature of interest in an SOS, do they both refer to the same real world entity? This challenge relates to the symbol grounding problem and requires further work on reference systems [25]. The second challenge is related to the selection of an appropriate sensor which is capable of measuring characteristics of a particular feature - the sensor inputs, i.e., real world stimuli, and the entity’s observed characteristics have to match. Third, the sensor output has to comply with the property of the feature of interest stored in a Sensor Observation Service. Using the wind direction as an example, we discussed why a purely syntactic matching is not sufficient. While we focused on introducing these challenges as well as the role of semantic annotation and reasoning, the implementation of this work is part of the 52◦ N orth semantics community which aims at establishing a semanticenablement layer for OGC services6 . Besides introducing the idea of sensor plug & play, the paper also shows that two new OGC service types are being required to incorporate semantics-based information retrieval, on-the-fly integration, and the composition of sensors and services. However, the question how to represent perdurants, for example the dispersion of the gas plume, in services such as the WFS is still an open issue. 6
http://www.52north.org/semantics
84
A. Br¨ oring et al.
Acknowledgment Major parts of the presented research have been developed at a joint workshop of the Institute for Geoinformatics, University of M¨ unster, Germany, and the National Institute for Space Research (INPE), Brazil, which has been supported by the German Research Foundation (DFG), project no. 444 BRA 121/2/09 and by The State of S˜ ao Paulo Research (FAPESP), project no. 2008/116046. Partial funding came from the International Research Training Group on Semantic Integration of Geospatial Information (DFG GRK 1498).
References 1. Connaghan, D., Hughe, S., May, G., O’Brien, K., Kelly, P., Connaire, C.O., O’Connor, N., O’Gorman, D., Warrington, G., Smeaton, A.F., Moyna., N.: A Sensing Platform for Physiological and Contextual Feedback to Tennis Athletes. In: BSN 2009 - Body Sensor Networks Workshop (2009) 2. Hayes, J., O’Conor, E., Cleary, J., Kolar, H., McCarthy, R., Tynan, R., O’Hare, R., Smeaton, A., O’Connor, N., Diamond, D.: Views From the Coalface: ChemoSensors, Sensor Networks and the Semantic Sensor Web. In: SemSensWeb 2009 International Workshop on the Semantic Sensor Web (2009) 3. Shepherd, D., Kumar, S.: Microsensor Applications. In: Distributed Sensor Networks. Chapman and Hall, Boca Raton (2005) 4. Botts, M., Percivall, G., Reed, C., Davidson, J.: OGC Sensor Web Enablement: Overview And High Level Architecture. Technical report, Open Geospatial Consortium (2007) 5. Janowicz, K., Schade, S., Br¨ oring, A., Keßler, C., Stasch, C., Mau´e, P., Diekhof, T.: A transparent semantic enablement layer for the geospatial web. In: Terra Cognita 2009 Workshop In conjunction with the 8th International Semantic Web Conference, ISWC 2009 (forthcomming, 2009) 6. Sheth, A., Henson, C., Sahoo, S.: Semantic Sensor Web. IEEE Internet Computing, 78–83 (2008) 7. Henson, C.A., Pschorr, J.K., Sheth, A.P., Thirunarayan, K.: SemSOS: Semantic Sensor Observation Service. In: International Symposium on Collaborative Technologies and Systems, CTS 2009 (2009) 8. Na, A., Priest, M.: OGC Implementation Specification 06-009r6: OpenGIS Sensor Observation Service, SOS (2007) 9. Botts, M.: OGC Implementation Specification 07-000: OpenGIS Sensor Model Language, SensorML (2007) 10. Cox, S.: OGC Implementation Specification 07-022r1: Observations and Measurements - Part 1 - Observation schema (2007) 11. Cox, S.: OGC Implementation Specification 07-022r3: Observations and Measurements - Part 2 - Sampling Features. Technical report, Open Geospatial Consortium (2007) 12. Stasch, C., Janowicz, K., Broering, A., Reis, I., Kuhn, W.: A Stimulus-Centric Algebraic Approach to Sensors and Observations. In: Trigoni, N., Markham, A., Nawaz, S. (eds.) 3rd International Conference on Geosensor Networks. LNCS, vol. 5659, pp. 169–179. Springer, Heidelberg (2009)
Semantic Challenges for Sensor Plug and Play
85
13. Kuhn, W.: A functional ontology of observation and measurement (2009) (under review) The ontology, http://musil.uni-muenster.de/wp-content/uploads/Observation2.hs 14. Guarino, N.: Formal Ontology and Information Systems. In: Guarino, N. (ed.) International Conference on Formal Ontology in Information Systems (FOIS 1998), pp. 3–15. IOS Press, Trento (1998) 15. Kuhn, W.: Semantic Engineering. In: Navratil, G. (ed.) Research Trends in Geographic Information Science. Springer, Heidelberg (forthcoming, 2009) 16. Allemang, D., Hendler, J.: Semantic Web for the Working Ontologist: Modeling in RDF, RDFS and OWL. Morgan Kaufmann Elsevier, Amsterdam (2008) 17. Shvaiko, P., Euzenat, J.: Ten Challenges for Ontology Matching. In: Meersman, R., Tari, Z. (eds.) OTM 2008, Part II. LNCS, vol. 5332, pp. 1164–1182. Springer, Heidelberg (2008) 18. Harvey, F., Kuhn, W., Pundt, H., Bisher, Y., Riedemann, C.: Semantic Interoperability: A Central Issue for Sharing Geographic Information. The Annals of Regional Science 33, 213–232 (1999) 19. Janowicz, K., Keßler, C., Schwarz, M., Wilkes, M., Panov, I., Espeter, M., Baeumer, B.: Algorithm, Implementation and Application of the SIM-DL Similarity Server. In: Fonseca, F., Rodr´ıguez, M.A., Levashkin, S. (eds.) GeoS 2007. LNCS, vol. 4853, pp. 128–145. Springer, Heidelberg (2007) 20. Janowicz, K., Wilkes, M.: SIM-DLA: A Novel Semantic Similarity Measure for Description Logics Reducing Inter-concept to Inter-instance Similarity. In: Aroyo, L., Traverso, P., Ciravegna, F., Cimiano, P., Heath, T., Hyvoenen, E., Mizoguchi, R., Oren, E., Sabou, M., Simperl, E.P.B. (eds.) ESWC 2009. LNCS, vol. 5554, pp. 353–367. Springer, Heidelberg (2009) 21. Compton, M., Henson, C., Lefort, L., Neuhaus, H.: A survey of the semantic specification of sensors. Technical report (2009), http://lists.w3.org/Archives/Public/public-xg-ssn/2009Aug/ att-0037/SSN-XG StateOfArt.pdf 22. Stasch, C., Walkowski, A.C., Jirka, S.: A Geosensor Network Architecture for Disaster Management based on Open Standards. In: Ehlers, M., Behncke, K., Gerstengabe, F.W., Hillen, F., Koppers, L., Stroink, L., W¨ achter, J. (eds.) Digital Earth Summit on Geoinformatics 2008: Tools for Climate Change Research, pp. 54–59 (2008) 23. Harnad, S.: The Symbol Grounding Problem. Physica D 42, 335–346 (1990) 24. Kuhn, W.: Semantic Reference Systems. International Journal of Geographic Information Science 17(5), 405–409 (2003) 25. Scheider, S., Janowicz, K., Kuhn, W.: Grounding Geographic Categories in the Meaningful Environment. In: Stewart Hornby, K., et al. (eds.) COSIT 2009. LNCS, vol. 5756, pp. 69–87. Springer, Heidelberg (2009) 26. Probst, F., Lutz, M.: Giving Meaning to GI Web Service Descriptions. In: 2nd International Workshop on Web Services: Modeling, Architecture and Infrastructure (WSMAI 2004), Porto, Portugal (2004) 27. Stock, K., Small, M., Ou, Y., Reitsma, F.: OGC Discussion Paper 09-010 - OWL Application Profile of CSW. Technical report, Open Geospatial Consortium (2009) 28. Whiteside, A.: OGC Recommendation Paper 05-010: URNs of Definitions in OGC Namespace (2007)
86
A. Br¨ oring et al.
29. Probst, F., Espeter, M.: Spatial dimensionality as a classification criterion for qualities. In: Bennett, B., Fellbaum, C. (eds.) International Conference on Formal Ontology in Information Systems (FOIS 2006). Frontiers in Artificial Intelligence and Applications, vol. 150, pp. 77–88. IOS Press, Amsterdam (2006) 30. Farrell, J., Lausen, H.: Semantic annotations for WSDL and XML schema. W3C recommendation (2007), http://www.w3.org/TR/sawsdl/ 31. Mau´e, P., Schade, S., Duchesne, P.: OGC Discussion Paper 08-167r1: Semantic annotations in OGC standards. Technical report, OGC (2009) 32. Hornsby, K., King, K.: Linking geosensor network data and ontologies to support transportation modeling. In: Nittel, S., Labrinidis, A., Stefanidis, A. (eds.) GSN 2006. LNCS, vol. 4540, pp. 191–209. Springer, Heidelberg (2008) 33. Simonis, I.: OGC Implementation Specification 07-014r3: OpenGIS Sensor Planning Service. Technical report, Open Geospatial Consortium (2007) 34. Schut, P.: OGC Implementation Specification 05-007r7: OpenGIS Web Processing Service (2007) 35. Nebert, D., Whiteside, A., Vretanos, P.: OGC Implementation Specification 07006r1: OpenGIS Catalogue Services Specification (2007)
Providing Universal Access to History Textbooks: A Modified GIS Case Dieter Pfoser1,3, Alexandros Efentakis1, Thanasis Hadzilacos1,4, Sophia Karagiorgou1, and Giorgos Vasiliou2 1 RA Computer Technology Institute University Campus Patras, 26500 Rion, Greece {pfoser,efedakis,thh,karagior}@cti.gr 2 Talent SA, Karytsi Square 4Α, 10561 Athens, Greece [email protected] 3 RC “Athena”, Institute for the Management of Information Systems Bakou 17, Athens 11524, Greece [email protected] 4 Open University of Cyprus, 13-15, Digeni Akrita Avenue 1055 Nicosia, Cyprus [email protected]
Abstract. Integrating and accessing structured textual content from various sources is a challenging task and becomes even more so when dealing with multiple languages. The objective of this work is to showcase the technological efforts towards the creation of a digital European history textbook repository that integrates respective textbooks from various countries and publishers. The content integration is achieved by introducing language independent metadata based on space (locations), time (dates), and thematic categories (history gazetteer). Providing adequate interfaces such metadata can be used to provide language-independent access to Multilanguage history textbook content. The specific focus in this work will be on (i) presenting the metadata, (ii) the data management approach including indexing the history textbook content and (iii) the resulting textbook repository including its GIS-based interface allowing for a combination of map, timeline and keyword based search of the history content. Keywords: Content integration, spatiotemporal indexing, Multilanguage content, history content.
A digital European history textbook base will integrate and provide access to already existing digital material from various European publishers. The integration is achieved by introducing largely language independent metadata including (i) space (locations), (ii) time (dates), and (iii) history concepts. These three metadata aspects represent essential abstractions of the content and will be used to index and access the textbook content in a largely language-independent way. The aim is to use the history concepts to define a (near) language-independent (abstractions that can be easier translated than the entire texts) thematic categorization of content. In addition, to identify locations in arbitrary textbooks, a multilingual dataset that includes historic place names is needed. Finally, temporal identifiers are used to create a timeline for the respective text portions. While in theory, the potential as an indexing means is substantial, a considerable challenge poses the identification, collection and integration of said metadata. This work focuses on creating a thematic ontology for history concepts and a multilingual location corpus. The former will contain thematic categories used in country-specific curricula to arrive at a history ontology covering all of Europe. The location corpus will include place name identifiers that can be used to geographically reference text portions. Integration of metadata sources entails also the translation of each source to several target languages. To make this process as efficient as possible, an metadata translation tool is created that utilizes the Wikipedia encyclopedia. Subsequent manual translation and verification provide for an overall semi-automatic translation tool. The three metadata aspects will be used to tag the historic textbooks, i.e., relate metadata entries to the respective text portions. Essentially, all textbook content and metadata is stored by means of a relational database with the tagging software scanning the textbook content and relating it to the metadata. As a result a Multilanguage index based on temporal, spatial and thematic metadata is created that provides language-independent access to the textbooks, i.e., searching or content using Greek metadata will produce results also in other languages. Automatic translation tools will then be used to translate the content into the target language, e.g., Spanish text into Greek. The work presented in this paper touches several research aspects that all belong to the larger complex of digital library research. Foremost, the spatial and temporal metadata-based access to content is been exploited in several projects and products. The Centennia Historical Atlas [2] supports spatiotemporal exploration of history by means of interactive maps and displayed relevant content. In a similar fashion, HyperHistory Online [12] allows one to explore selected history concepts by means of a simple temporal categorization and image maps of concept visualizations. Work in the general area of Named Entity Recognition is manifold having resulted already in many research projects and products, many of them open-source. GATE [5] is a general framework for information extraction tasks from content. In our work, we will use this tool to perform the essential content tagging task, i.e., automatically relating metadata to content. MinorThird [4] is an open-source toolkit (collection of Java classes) for storing text, annotating text, and learning to extract entities and to categorize text. As such its functionality is limited with respect to GATE but would fit the purposes of our project, i.e., relating metadata to content in terms of annotations. However, a severe shortcoming of MinorThird is that it does no support Unicode character encoding. Clearforest Gnosis [3] is a browser extension that performs cate-
Providing Universal Access to History Textbooks: A Modified GIS Case
89
gorization of terms for Web pages by annotating (color coding) them. Gnosis is a free service of Clearforest, which provides commercial solutions for text-driven business intelligence, i.e., text categorization and text analysis. A similar product suite is LexiQuest [19] from SPSS providing text analysis functionality such as concept and relationship discovery combined with visualization interfaces for the results. The tool supports several languages including German, Italian and Spanish. LexiQuest has been evaluated and was found to provide limited flexibility with respect to storage of, both, metadata and content and provided no flexibility with respect to customizing the tagging solution. A limitation with most tools is that they focus on English as a working language and typically lack Unicode support. Further, available metadata for NER is limited and linked to supported languages. In addition to general text mining tools, specialized approaches for geocoding content are available. Metacarta [13] offers a tool for the geocoding of Web pages and texts in general. The tool recognizes geographic key words and relates them to coordinates. In connection with a spatial content browser, e.g., Google Earth [11], this technology can be used as an additional powerful means to improve the quality of keyword-based search results. In the scope of this project, we will develop similar techniques that have a narrower content focus and are based on a general NER approach. The remainder of this work is organized as follows. Section 2 describes the data scenario. Section 3 details how a spatiotemporal-thematic index is created from respective metadata for history textbooks. Section 4 describes the CITER tool for searching and accessing content. Finally, Section 5 provides conclusions and directions for future research.
2 The Data To better understand the motivation behind our work of using three distinguished metadata aspects to index content, the following sections briefly describe the nature of history textbook content and subsequently introduce the used metadata in more detail. 2.1 History Textbook Content Digital educational content and constructive educational software are two key “instruments” towards the essential introduction of new technologies in the learning process and the achievement of the anticipated goals. For the history discipline, the multilingual and multicultural digital content gains in particular importance as it allows comparative approaches to the discovery and interpretation of the historical facts and the development of a broader historic perspective by considering the existing multiple national views in a comparative teaching approach. The specific content used in this work comprises • • • •
55 history textbooks from a total of eight publishers from seven countries, representing six languages (English, Spanish, German, Italian, Greek and Slovenian).
90
D. Pfoser et al.
The type of content will be mostly history textbooks available in digital form, but also include content from history CDs and picture libraries. An important step towards the creation of the repository is the integration of content sources by means of an efficient storage mechanism. We achieved this by (i) defining an integrated data model and (ii) a methodology for transforming the textbooks according to this model and entering them in relational database (PostgreSQL DBMS in our case). While all “textbooks” are available in digital form, an elaborate process was designed to import books prepared using desktop-publishing software such as Adobe InDesign and Quark Xpress in the production process into the database. Here, a semi-automatic methodology was defined that includes manipulating the original content sources, i.e., improving the quality and correcting the original DTP files, translating them to (structured) XML documents and finally importing them to the database. 2.2 Metadata Identifying and authoring proper metadata and indexing content is a next-toimpossible task when making no restrictions on the scope of the content. Focusing on the subset of history textbooks one can exploit specific metadata aspects of the data that present adequate abstractions of the content and provide a simple indexing means. • • •
Space (locations), time (dates) and thematic metadata (history concepts)
are used to structure history content and to provide a general means for (largely) language independent access and integration of content, i.e., comparatively little effort is needed to translate the metadata. In our endeavor involving content in six different languages, we use a combined approach involving manual as well as machine-based metadata collection. Sources of metadata are (i) already existing metadata for textbooks (indices, TOC), (ii) existing metadata collections (spatial feature name collections, datasets from publishers, ontologies), and (iii) text mining tools that extract relevant concepts from text analysis. The overall objective is to integrate all three sources to produce a comprehensive metadata base. The two fundamental aspects for the study of historical facts are space and time. These two aspects have the advantage over classical semantic markups such as keywords in that they are non-ambiguous, i.e., there is no doubt as to what an x and y coordinate or a date mean! An important prerequisite to spatial metadata authoring is the identification of geographic feature names. Although simple at first sight, this task poses a challenge in that also Multilanguage issues and historical naming have to be considered. Thus, we have to include Multilanguage mappings of locations as well as historical to current mappings of locations. All these variations have to be captured in the geographic feature name corpus to afterwards identify place names in texts. Existing datasets for feature names that include geographic co-ordinates are the Geographic Names System (GNS) [8] and the Getty Thesaurus of Geographic Names (TGN) [9], with the latter comprising around 900k places and related information.
Providing Universal Access to History Textbooks: A Modified GIS Case
91
Place names may include names in the vernacular language, English, other languages, historical names, names and in natural order and inverted order. Besides these content sources, we will evaluate and add any dataset that is available from the partners in this project. Given the fact that large metadata sources do exist, identifying appropriate spatial metadata is simple when compared to the task of building a history ontology for thematic metadata. Accessing content based on location and time might not yield the desired results when trying to find information related to specific topics that cut across time and space. An example here would be “age of enlightenment,” which affected all European countries in different ways. Hence, a thematic component is needed to better discriminate and index history content. For the definition of a history ontology, we followed a combined top-down/bottom-up approach. The CITER history ontology has been defined by a historian and history textbook author. The history ontology consists of 300 classes, including three main categories (event, organization, and person), a qualitative category (source), a time class, a location class and an attribute class (properties to additionally style instances). The maximum depth for each of the main classes is three (e.g., three levels below the “event” class). At the same time, concepts (instances) for this ontology are collected in a bottom-up fashion, i.e., collecting and integrating existing textbook metadata as well as using text analysis techniques. Finally, the instances are related to the respective classes of the ontology. Hence, while in a top-down fashion candidate entries of the history ontology are defined by domain experts such as historians, machine translation techniques and existing metadata sources provide concepts at the instance level. 2.3 Metadata Translation - or - the Wikipedia Trick Space, time and thematic metadata were chosen since the represent (i) a powerful index for history information and (ii) need comparatively little effort to be translated. However, this translation task for metadata is still a considerable challenge and our goal was to provide automatic and high-quality means of translation as briefly outlined in the following. Each metadata entry represents a specific “label” in its respective language. Any automatic translation would require and be based on dedicated Multilanguage resources, i.e., dictionaries. In case no dictionary entry is available, a manual translation is needed. One such dictionary that is publicly available is Wikipedia [22]. In the present approach, the Wikipedia encyclopedia is viewed as a multi-language resource based on concepts. Each Wikipedia article page has a column that contains links to articles covering the same topic in another language. Figure 1 gives an example of an English article page on “World War II”, where one can follow the “in other languages” links, to find respective articles in, among others, German, Italian, Spanish, Greek, and Slovenian. What is of interest is not the actual content of the article but its “label.” The label is a human translation, i.e., by the article authors, of the respective concept into the various languages. Exploiting this fact, a Wikipedia translation tool was created that takes a label as input and tries to translate it into the various target languages. Two versions of the tool are publicly available [21], Java code for database-centered batch processing as well as an online interface.
92
D. Pfoser et al.
Original article (concept with English label))
Articles in other languages (concept label translation)
In-otherlanguage bar
Fig. 1. Wikipedia “term” translation Table 1. Concepts after translation Concepts incl. Organizations Orig.
German
Places
After WP Transl.
After Manual Transl.
Orig.
After WP Transl.
After Manual Transl.
3123
3317
3690
473
1313
Slovenian
611
733
3690
352
678
3215
Italian
191
1592
3690
327
1094
3215
English
327
Spanish Greek Total
4253
3215
3690
3690
1789
3215
3215
1451
3690
148
488
3215
464
3690
206
654
3215
11247
3690
3294
7443
3215
Using the Wikipedia translation tool and translating the metadata into all six languages, the complete amount of available metadata is shown in Table 1. The columns represent (i) the number of original metadata entries in their respective language as collected from the various sources, (ii) the number of automatically translated terms (Wikipedia tool) and (iii) the final number of metadata entries available in all languages. Essentially, using the Wikipedia tool, roughly half of all entries have been translated automatically.
3 Spatiotemporal-Thematic Index While creating spatial and thematic metadata involves a lot of work, tagging the actual content with this metadata is even more so challenging. Such a process is generally
Providing Universal Access to History Textbooks: A Modified GIS Case
93
labeled Named Entity Recognition (NER), i.e., assigning a (group of) words to a set of predefined categories. In our specific context it would mean to discover temporal, spatial and thematic identifiers that can be linked to a timeline, locations recorded as spatial metadata, and statements referring to concepts in the history ontology. 3.1 Software Framework When faced with a large amount of content, the tagging content with metadata has to be automated to the largest extent possible. Information extraction (IE) systems automate tagging process and can be integrated as software infrastructure. In the specific context, the software that is used is GATE (A General Architecture for Text Engineering) [5], a software framework for natural language processing and engineering. In the context of this work, we assume that terms with a common stem will usually have similar meanings. The performance of an information retrieval system will be improved if term groups are conflated into a single term, e.g., “connect”, “connected”, “connecting”, “connection”, and “connections” to the stem “connect” [17]. In the specific context, we used Snowball [18], a framework for writing stemming algorithms. Out of our six target languages, stemmers for English, German, Spanish and Italian were readily available. We developed a Greek stemmer based on [14]. A Slovenian stemmer has been developed based on [16]. The Snowball framework is available as a plugin for GATE. Spatial and Thematic Tagging. For tagging content with metadata and considering the natural language processing aspect, a workbench was created integrating GATE and specific Snowball stemming functionality. Our approach of text processing is (i) word level oriented, uses (ii) stop-words (articles, conjunctions, etc.) and (iii) stemming. In our framework, the textbook content and the metadata are stored by means of a relational DBMS, in our case PostgreSQL. This is in contrast to the original GATE implementation that uses files. The Content Tagger is used to relate terms found in the content to terms contained in the metadata. The tagging result is stored in tables as relationships between metadata and content. Figure 2 shows another tagging result. The concept “Second World War” was found in various texts. The visualization of the result directly reflects the result table. Text portions that were tagged are highlighted. Terms with the same number of words are highlighted using the same color. The highlighting is primarily used as a debugging tool to verify tagging results and improve the algorithm. Temporal Tagging. Temporal tagging, i.e., the discovery of temporal identifiers requires a different approach since a priori temporal metadata is limited! A temporal identifier in the respective languages is composed of (i) numbers that may represent everything dates, month, years and centuries, (ii) metadata comprising labels for weekdays, month, seasons, decades, centuries, etc. and (iii) the respective rules to from form valid temporal identifiers, e.g., dates. A parser was developed to detect date, month and year numbers from the textual representation of a temporal phrase. The implementation is based on [10].
94
D. Pfoser et al.
Fig. 2. Tagging example – visualization
3.2 Resulting Spatiotemporal-Thematic Index To assess the quality of the resulting spatiotemporal-thematic index, Table 2 shows tagging results for some exemplary books covering topics of the 20th century. The results given are in that order Words – number of words in book with stop words excluded, Found all – number of concepts found measured in terms of number of words, Found % - number of concepts found in terms of percentage of total words, and Found dist. – distinct number of concepts found. Given the prominent examples of German and Slovenian textbooks it is shown that a significant part of the content is essentially related to metadata. In the case of only considering thematic metadata, 15% of the content can be linked to metadata, while when also considering spatial and temporal metadata the actual words in the content that are linked to metadata increases to close to 30%! Table 2. Thematic index coverage Content
Lang.
Words
Found all
Found Found % dist.
Concepts and Organizations Geschichte und Geschehen 4 Britain World War II
EN
14794
OMNIA
IT
2714250
20. stoletje Ιστορία ΣΤ Δημοτικού
SLO GR
DE
38651
5986 15.5% 1739 11.8% 80332
921 185
3.0%
1188
24784
5102 20.6%
597
12833
1373 10.7%
223
Providing Universal Access to History Textbooks: A Modified GIS Case
95
Metadata Analysis, Content Overlap. A more detailed metadata analysis was conducted in order to realize the semantic overlap between books, i.e., given a specific metadata entry how often and in how many books does it appear. The analysis focuses on the same textbooks also given in Table 2, i.e., including German, English, Italian, Slovenian and Greek. All books focus on the 20th century, i.e., an a-priori overlap is given. What needs to be established is whether this overlap can be also found by examining the overlap in metadata as well. Figure 3 shows the overlap among the discovered metadata concepts for sets of textbooks. Consider the example of thematic metadata found for the German textbook in Figure 3(a), which is 921. This textbook shares 460 terms with the Italian book and 116 with the Greek book. The overlap of all three books is 91 terms. Considering the overall of all five textbooks results in 35 terms (center of Figure 3(a)). Similar figures are shown for geographic metadata. Temporal metadata differ in that here temporal overlaps are considered. For example if one books mentions March 1945 and the other March 25, 1945, this is considered an overlap. The temporal overlap is largely similar to the numbers reported for spatial metadata. DE
DE 921
IT
309
IT
GR
GR 460
1188
9
1324
223
3
5
7
238
116
19
101 7
EN
185
3 9
88 35
16
3 3
20
35
6
5
45
44
21
116
47 26
7 597
EN
SL
(a) Thematic metadata overlap
53
16
42
22 17 27
1324 SL
(b) Geographic metadata overlap
Fig. 3. Metadata overlap
Overall Content and Index Size. Did the previous section show some specific facts on the respective coverage of the index, so will we in the following give the overall statistics concerning the index and its coverage. History Textbook Content. The 55 history textbooks comprise a database consisting of • • •
a total of 82800 paragraphs, which constitute 8.5 million words, or an average paragraph size of 103 words.
Metadata. The collected metadata comprises a total of • •
The temporal metadata is individually discovered, i.e., the number of hits correspond to the actual size of the metadata. Temporal metadata is stored by means of time, i.e., temporal queries such as range are supported. Spatiotemporal-thematic index. An important measure for our spatiotemporalthematic index is its size. The following numbers give the total number of hits per thematic category, i.e., number of occurrences in content. • • •
Total thematic hits: 757,536 Total spatial hits: 195,546 Total temporal hits: 159,161
It can be observed that while spatial and temporal metadata produce roughly the same number of hits, the thematic metadata produces 4 times as many hits. The reason is simply that thematic metadata was derived from the content itself using indices, TOCs and text mining tools. The spatial metadata was derived from external sources and was then actually “discovered” in the content. As such the number of hits is astounding. Consequently, the total number of paragraphs that have been indexed is 75949 out of total 82791, corresponding to a coverage of the index of 92%. This means that at least 92% of all paragraphs have been indexed at least once. Elaborating further on the index characteristics, it can be observed that for •
•
the thematic metadata, on average each entry has o 223 hits, i.e., references to content o spread over 6.7 history textbooks o and 2.63 languages (out of 6). the spatial metadata, on average each entry has o 142 hits, i.e., references to content o spread over 5.7 history textbooks o and 2.6 languages (out of 6).
Again, considering the fact that spatial metadata was simple compiled using external sources, the actually hit rate when compared to thematic metadata is very good. Summary. Given the right metadata, a combined spatiotemporal-thematic index for history content becomes feasible. The metadata gazetteer includes roughly 10000 concepts. Using such a gazetteer to tag history content, results in up to 30% of the content (i.e., the actual words not considering stop words) being related to metadata! The textbooks further exhibit a considerable overlap with respect to the metadata they share. This overlap is important when we consider the objective of the metadata, namely to find content sources in Multilanguage textbooks related to the same thematic, geographic and temporal concept/metadata.
4 Accessing History Content – The Modified GIS Case Tagging content with spatial, temporal and thematic metadata creates a formidable index to access history textbook content. This section describes the CITER platform and interface specifically supporting content search based on the three distinguished metadata aspects.
Providing Universal Access to History Textbooks: A Modified GIS Case
97
The CITER platform is a client-server application resembling a somewhat peculiar GIS desktop application. The application is based on and uses the Cruiser platform [20]. The application has to be installed on a Windows XP/Vista computer with continuous and preferably broadband Internet connection. As such, the application is similar to, e.g., Google Earth. All the CITER platform data (books, photographs, indices etc.) are stored on a server. On the client side (desktop app), apart from the application, map data is installed to achieve fast response time and to avoid overloading network traffic. In this guide we will have a look at the most useful and exciting platform features. Figure 4 showcases the basic user interfaces to query the history textbooks. Properly indexed content based on space, time and thematic metadata is accessed using the three respective querying mechanisms. The temporal parameter is fixed using a temporal slider that allows for the selection of time points as well as ranges. The spatial scope is determined by a spatial range and thus selected landmarks and alternatively also based on keywords, e.g., Greece. This functionality is supported using a base map of Europe to show important geographical landmarks the user can query. A next step in the application will be to integrate historic maps (vector data with changing boundaries in time). The thematic range is determined by browsing/querying keywords in the respective language, thus effectively identifying concepts in the history ontology. Query results are displayed on the right side using an embedded Web-browser (see below for examples and explanation). The retrieved content is showed in various tabs,
¼ Fig. 4. Integration of new functionality and existing e-content portal solutions
98
D. Pfoser et al.
each representing the content in its original form or a respective machine translation. Each type of content is marked as such. The software used for the translation of the history content to the various target languages is the ESTEAM Translator [6]. The interface allows temporal, spatial and thematic parameters to be used simultaneously to query the textbook content. For example, given the spatial range of Greece, a temporal range of the year 1923 and the thematic concept of “war”, the respective metadata entries in the database are queried to follow their pointers to the respective text portions they index. In this case, texts relating to the Greek-Turkish war will be retrieved.
Fig. 5. Spatial search parameter
Fig. 6. Temporal search parameter
Providing Universal Access to History Textbooks: A Modified GIS Case
99
In the following, the various query parameters are described in more detail. Spatial parameter. The user can add spatial parameters to the query (cf. Figure 5), thus indicating that he/she wants to have results that are only related to the places that have been selected. Spatial data is selected by dragging a place from the map and dropping it to the reserved area. Range queries for selection are supported. Temporal parameter. The timeline (cf. Figure 6) is an active component in the application that can zoom or pan to select a respective timespan (point). It has a highlighted area, which can be moved, shrunk or expanded fitting the query needs. This highlighted area restricts the query results only to those that refer to the timeperiod represented by the area to retrieve content referenced by the temporal index. Adding thematic parameters. Thematic terms (history concepts) (cf. Figure 7) can be added by opening a dialog and allowing the user to select one or many entities either by browsing the history ontology or by using (keyword) search.
Fig. 7. Thematic search parameter
¼ Fig. 8. Result tree structure
100
D. Pfoser et al.
Results. The results are shown in a tree-like structure, in the panel at the bottom (cf. Figure 8). The results are divided into main texts (and photos) and sources of the texts. The results can be browsed freely and the user can see a paragraph (cf. Figure 9), or even a whole section (chapter) containing the paragraphs that match the query (cf. Figure 10).
¼ Fig. 9. Individual paragraph view
¼ Fig. 10. Book chapter view
5 Conclusions and Future Work Content integration involving multilingual sources poses an interesting challenge, especially when coupled with the ambition of language-independent access. This
Providing Universal Access to History Textbooks: A Modified GIS Case
101
work demonstrates that given the right content, specific metadata can be identified that (i) provides a sufficiently accurate abstraction of the content and can be used as an index to provide for sufficiently accurate search and (ii) at the same time is either language independent or requires a comparatively small effort to be translated. In our case of history textbooks, such metadata includes time, space and thematic concepts. This work showcased (i) the data – history textbook content, (ii) spatial, temporal, and thematic metadata resulting in a multilingual history ontology and spatial metadata gazetteer and (iii) the resulting largely language-independent spatiotemporal-thematic index. This index covers the content adequately and also shows sufficient overlap between books and languages. Directions for future work are manifold. As was expected for the geocoding of content, i.e., performing NER for geographic features, disambiguation of geographic terms is needed to avoid false hits. This task is challenging in that we have to deal with a multitude of languages, i.e., taking into account context and grammar at the individual language level. The objective however should be to provide a language independent approach for the disambiguation of such terms. An interesting approach when deriving metadata is automatic ontology creation [15]. Here, texts related to the same topic are analyzed to automatically extract classes and relationships to construct a domain ontology. While the history ontology has been created with the specific purpose of indexing content in mind and a large number of instances were available a priori, such an approach can be used to verify the current ontology structure as well as to produce additional instances. Using GATE as the basic software framework has led to the using Snowball-based stemming algorithms. However, promising work in the area of language independent stemming [1] needs to be evaluated in the present context as it can significantly reduce the effort needed for adding new languages to the history textbook repository. Besides technological improvements, an important effort will be to collect additional metadata to improve the coverage of the repository index. In the context of content tagging, we want to experiment with fuzzy matching of concepts to texts. Besides stemming, which relaxes the matching words, we want to experiment with relaxing the word sequences of phrases themselves. For example, instead of trying to identify the metadata concept “Treaty of Versailles” in the text, we would search for occurrences of such a word sequence within some proximity, thus also linking the text “the peace treaty, which was signed in Versailles, France” to the above concept. Finally, we want to develop this project into a Wiki-based annotation system that utilizes the history textbooks as core content and provides a means for students, teachers, and other interested third parties to re-structure content so as to develop individualized teaching materials as well as additions to the content itself. Acknowledgments. This work is partially supported by the CITER project (Creation of a European History Textbook Repository – http://citer.cti.gr) funded by European Commission, eContentplus programme, grant agreement number ECP-2005-EDU038193. The authors would like to thank the project partners for their contributions to this work.
102
D. Pfoser et al.
References [1] Bacchin, M., Ferro, N., Melucci, M.: A probabilistic model for stemmer generation. Information Processing and Management 41(1), 121–137 (2005) [2] The Centennia Atlas. Centennia Software, http://www.clockwk.com [3] ClearForest. ClearForest Gnosis. Product Web page, http://sws.clearforest.com/Blog/?page_id=32/ [4] Cohen, W.: Minorthird: Methods for Identifying Names and Ontological Relations in Text using Heuristics for Inducing Regularities from Data (2004), http://minorthird.sourceforge.net [5] Cunningham, H., Maynard, D., Bontcheva, K., Tablan, V.: GATE: A Framework and Graphical Development Environment for Robust NLP Tools and Applications. In: Proc. 40th Anniversary Meeting of the Association for Computational Linguistics, ACL 2002 (2002) [6] ESTeam AB: Automatic Translation Solutions. Company Web page, http://www.esteam.gr [7] Gamma, E., Helm, R., Johnson, R., Vlissides, J., Design Patterns, J.: Elements of Reusable Object-Oriented Software. Addison-Wesley, Reading (1995) [8] GEONet Names Server. Project Web page, http://earth-info.nga.mil/gns/html/index.html [9] Getty Trust, The J. Paul: Getty Thesaurus of Geographic Names, http://www.getty.edu/research/conducting_research/vocabulari es/tgn/index.html (Current as of June 2008) [10] Glowacki, D.: CalendarParser, a Java object class, http://icecube.wisc.edu/~dglo/software/calparse/index.html [11] Google Inc.: Google Earth. Product Web page, http://earth.google.com [12] HyperHistory Online. Project Web page, http://www.hyperhistory.com/ [13] Metacarta. Product Web page, http://www.metacarta.com [14] Ntais, G.: Development of a Stemmer for the Greek Language. Master Thesis, Department of Computer and System Sciences, Royal Institute of Technology, Sweden (2006), http://www.dsv.su.se/~hercules/papers/Ntais_greek_stemmer_th esis_final.pdf [15] Ogata, N., Collier, N.: Ontology Express: Statistical and Non-Monotonic Learning of Domain Ontologies from Text. In: Proc. ECAI 2004 Workshop on Ontology Learning and Population (2004) [16] Popovic, M., Willett, P.: The effectiveness of stemming for natural-language access to Slovene textual data. Journal of the American Society for Information Science 43(5), 384–390 (1992) [17] Porter, M.F.: An algorithm for suffix stripping. Program 14(3), 130–137 (1980) [18] Porter, M.F.: Snowball: A Language for Stemming Algorithms, http://snowball.tartarus.org/texts/introduction.html [19] SPSS. LexiQuest Mine. Product Web page, http://www.spss.com/lexiquest/lexiquest_mine.htm [20] Talent S.A.: Cruiser platform. Product home page, http://www.cruiser.gr [21] Wikipedia Translation Tool, http://citer.cti.gr/the-project/ wikipedia-phrase-translation-tool/ [22] Wikipedia Foundation: Wikipedia – The Free Encyclopedia, http://www.wikipedia.org
Data Integration GeoService: A First Proposed Approach Using Historical Geographic Data Eric Grosso1, Alain Bouju2 , and S´ebastien Musti`ere1 1
Institut G´eographique National, Laboratoire COGIT, Saint-Mand´e, France {eric.grosso,sebastien.mustiere}@ign.fr http://www.recherche.ign.fr/labos/cogit 2 Universit´e de la Rochelle, L3i, La Rochelle, France [email protected] http://l3i.univ-larochelle.fr/
Abstract. Geographical data users increasingly express the need to access a data integration process, notably to integrate historical data into a current frame of reference. However, easily accessible data integration treatments are not yet available for users. To tackle this problem, this paper focuses on proposing solutions to build a data integration geoservice. This paper focuses specifically on the historical data management, improving of geographical data adjustment process into a current frame of reference and finally on building of a data integration geoservice. Keywords: Data integration, geographical data, geo-processing services, historical data, georeferencing, old maps, metadata.
1
Introduction
The creation and diffusion of geographical information have considerably increased over the last few years; some geographical data users have even become geographical data producers. This general trend will be a priori confirmed in the next decade, partly due to the accessibility of that data in line with the recent application of the European directive INSPIRE (Infrastructure for Spatial Information in Europe). In order to benefit from this information as much as possible, either to carry out better analysis or to study the temporal evolution of georeferenced data, users have expressed a strong need to couple – or to integrate – their data with other data provided both by producers or other users. Moreover, the need to integrate historical data is particularly expressed. This need is perfectly illustrated in the environmental field, as seen in the georeferenced mapping project of old French forests [1] or the Mannahatta project1 [2]. However, easily accessible data integration treatments are not yet available for 1
http://themannahattaproject.org/
J.D. Carswell et al. (Eds.): W2GIS 2009, LNCS 5886, pp. 103–119, 2009. c Springer-Verlag Berlin Heidelberg 2009
104
E. Grosso, A. Bouju, and S. Musti`ere
users. In explaining how building a data integration geoservice is possible in the historical data context this paper aims to introduce a solution to this problem. This solution follows the INSPIRE recommendations concerning geographical information diffusion, in particular the diffusion of geo-processing services. This paper is structured as follows. Section 2 introduces related research in the context of geographical data integration, historical data and geo-processing Web services. Section 3 focuses on historical data management, section 4 on the proposed data adjustment process and section 5 on the building of a complex geo-processing service such as a data integration geoservice. This paper ends with concluding remarks in section 6. Preliminary remark: The notions of user and producer are in some cases merged, users being producers and vice versa. Throughout this paper, users are defined as persons whose role is to use geographical data, without taking into consideration their possible role of producer.
2
Context and Objectives
The work presented here lies within a triple context: firstly the geographical data integration context, secondly the historical data context, and finally the building of a complex geo-processing service – a data integration geoservice – context. 2.1
Geographical Data Integration
The availability of geographical information is mainly due to the growth in Internet technologies, GPS techniques, etc. However, this information is made up of a mixture of distributed, heterogeneous and autonomous data sources making data combination almost impossible. To tackle this problem a method of data integration which will effectively find its way through this maze and wealth of information is essential [3,4]. This can be done by integrating all user data into a common frame of reference which is usually the most detailed database available to the institutional geographical data producers. Data integration is notably useful for propagating updates in databases, for data enrichment [5,6] for enabling the detection and correction of inconsistencies [7,8] and also for the evaluation of data quality. It also enables multi-level analysis which increases the potentiality of applications development from this data and improves the study of geographical phenomenon at different scales. Users thus have a real need to access integration treatments. Producers attempt to answer this need but only have at present ad hoc solutions adapted to their databases and their constraints which are non-exploitable in their current form by users. Moreover, such solutions are not provided by the principal GIS software publishers. We can conclude from this that there does not yet exist an adequate geographical data integration treatment system that meets current user needs.
Data Integration GeoService
2.2
105
A Particular Context: Working with Historical Geographic Data
This paper focuses on a particular integration context which is the integration of historical data into a current georeferenced framework. Historical data is defined here as old data which has a spatial component. By definition, historical data is heterogeneous due to the diversity of its sources such as old maps, archaeological plans, catalogues, monographs, charts, etc. Historical data can be seen as having two spheres of interest to users. The first being that historical data (mainly old maps) is often visually attractive and can represent cultural and artistic information. The second is that historical data contains invaluable information which is often unmapped or not represented in current maps or data. Historical data is particularly of interest to ecologists [1,2] (study of forest evolutions, comparison of ground occupation on various dates, study of climate evolution, etc.), and also to archaeologists and historians [9]. It can also be useful in the field of simulation (research of evolution rules based on historical data). Therefore, the need for digitalisation of historical data for the purposes of its preservation and conservation has increased and has consequently contributed to its wider diffusion. In order to exploit, analyse and process historical data and to get a meaningful result, users have to integrate it into a recent georeferenced framework. A georeferencing process is thus essential. In view of this widely expressed interest, several projects already propose tools adapted to this data. For example, MapAnalyst [10] provides a tool to visualise planimetric accuracy of historical maps, and the Old Maps Online project aims to help with online publishing of scanned historical maps [11] based on the observation that the ability to use old maps is critically under-utilised in the current Internet environment. The projects previously described deal only with raster data. Based on the observation that more and more historical vector data are digitized from historical raster data [1,9], this paper deals with both raster and vector data. 2.3
Services
As already discussed above, the need has been expressed by the European INSPIRE directive and users to provide diffusion of geographical information and geographical data processing. In response to this need, a number of projects and studies have emerged. Firstly, geographical information diffusion services were developed, in particular the standards defined by the Open Geospatial Consortium2 (OGC), of which the most well known are the Web Feature Service (WFS) and the Web Map Service (WMS). The main objective of these standards3 is to widen the interoperability of both data (e.g. KML - Keyhole Markup Language - and GML - Geographic Markup Language -), its diffusion and the associated processing. 2 3
Once the services of data diffusion were standardized and implemented within the framework of the OGC-Web Services (OWS), the objective was to create a standardized framework with geo-processing services. Then much work was undertaken around geo-processing services. The University of Zurich worked on the study and implementation of standardized Web Services, respecting the recommendations promulgated by the World Wide Web Consortium4 (W3C) and the Organization for the Advancement of Structured Information Standards5 (OASIS). The main objective of this work is to propose the construction of a common research platform in generalisation based on geo-processing services [12,13]. Following this, the OGC worked on the Web Processing Service (WPS) standard (standard since February 2008) which aims at providing an open framework to any geo-processing service. A first implementation has been done by the 52◦ North project. This project made it possible to improve and propose a compatible implementation of the OGC-WPS standard [14] (WPS 0.4 then WPS 1.0). More recent work deals with improving the discovery, use and orchestration of geo-processing services in order to improve efficiency. One project, [15], proposes an approach to improving existing metadata models for web services, [16] approaches to acquire better descriptions of services and [17], a generic preprocessing service to improve the use of geo-processing services. Others focus specifically on the orchestration to test inter-standards compatibilities [18] (mainly between BPEL - Business Process Language Execution - and OGC-WPS) which provide an implementation [19] and then apply it to examples [20,21,22]. Finally, the 52◦ North project focuses partially on the grid computing for the purpose of complex geo-processing services. All these projects provide solutions to build complex geo-processing services. However, they do not propose to specifically develop a data integration service. 2.4
Objectives
The present proposal has three main objectives. The first objective is to provide users one solution for searching, finding and using historical data in an accessible way. The second objective is to propose a more precise solution to the classical georeferencing process (by taking into account the use of vector data) to enable the integration of historical data into a recent reference framework by using data matching tools. The third objective is to propose a framework to build a first data integration service. 2.5
A User-Centred Approach
The projects that use historical data generally regroup diverse actors such as historians, archaeologists, genealogists, ecologists, geomaticians, etc. The actors of these projects have two main needs as previously described. Firstly, they need 4 5
http://www.w3.org/ http://www.oasis-open.org
Data Integration GeoService
107
to find specific historical data, and secondly, they need to integrate this data into a recent reference framework using a georeferencing process. Let us introduce the example of a wind turbines installation project in France to illustrate the importance of these two needs. The actors of this project have to determine the potential of exploitable sites which will maximize the buoyancy pressure and thus the output of the wind turbines. There are two possible solutions in finding an appropriate site; firstly they could use a 3D GIS coupled to a model of wind simulation. Secondly they could locate the sites of old windmills by using historical data based on the logic that the old windmills would have been well placed to maximize wind performance. The latter solution appears simpler but in choosing this solution the actors will need to answer the following questions: – Within this framework which historical data should we use? – How can we integrate this historical data into a recent reference framework to enable us to use it? This example will be used to illustrate our proposal. The proposal made below is thus a user-centred approach which aims to answer these two questions by improving the historical geographic data management, improving the georeferencing process and providing the diffusion of this process through a data integration geo-processing service. This proposal is described in the next three sections.
3
Historical Geographic Data Management
As seen in section 2.2, historical data contains invaluable information which is often unmapped or unrepresented in current maps or data. We identify two major issues of non-accessibility to this information for users. One is that users are mostly unaware of the historical data content, especially old maps content. Indeed, it appears difficult for users to have an overall view and knowledge of this data and its contents. The second issue is that historical data is widely distributed, for example in national archives, national libraries, mapping agencies, private collections, on the Internet, etc., and thus hard to access easily. The question “How to find this information contained in this data?” – the windmills in our example – is therefore raised. Once this information is found, users will ask for a solution to link this information with a recent frame of reference, as historical data is generally only useful once it is possible to georeference the contents. Consequently, the question “What are the possible links between historical and current data?” is also raised. To answer these questions, two proposals are made in the following two sections: first to create a metadata catalogue of historical data and second to create a link between historical and recent data. 3.1
Towards an Historical Data Cataloguing System
Similarly to the problem of current distributed data, historical data are also disseminated. This makes it difficult to identify, access and use. To tackle this
108
E. Grosso, A. Bouju, and S. Musti`ere
problem, the creation of a metadata catalogue of historical data is essential. This catalogue would aim to enable users to research information within historical data and allow easier access to the historical data content. For example, to answer the question “Which map or plan contains a cartographic representation of windmills?”, the role of this catalogue would be to provide the relevant information by answering “Cassini maps provide the wood and stone windmills sites”. Moreover, this catalogue has to provide data quality information to enable the user to carry out more in-depth analysis. This catalogue is thus made up of two levels of metadata: the primary metadata which is a general data description – descriptive metadata –, and the secondary metadata which is more specific to the data – technical metadata –. The two levels of metadata are described below. Primary metadata – The first objective of primary metadata is to provide a description of historical data. This is done through giving a general description about the identification, global content (which enumerates roughly the different represented themes/objects in historical data), coverage, scale, spatial reference, global accuracy, global homogeneity (in terms of quality), date of edition, type (maps, archaeological plans, catalogues, monographs, etc.), source (URL or address – library, archive, etc. –) and the possible specifications (specifications used during the creation process) of that data. To fill this metadata structure with contents, numerous works of literature (books or studies) that deal with historical data would be used. These works contain both general [23] and specific information [24,25] about historical data. For example, Cassini metadata can be filled with the rough description given by [24]: “Cassini maps are made up of 181 sheets of 80km by 50km at a 1:86,400 scale ; Approximately 300 points per sheet are used for positioning (triangulation) ; Triangulation starts from high points, bell-towers, towers, etc., which ensure the mapping accuracy ; The roads and rivers are drawn and the relief is sketched”. Other characteristics can supplement these elements, e.g. the fact that Cassini maps are homogeneous, accurate, and offer for the first time in French history, a vision of space occupation. Cassini maps also provide the results of the first useful toponymic investigations. This description of Cassini maps illustrates that a part of the metadata can be easily filled. Secondary metadata – The secondary metadata can be considered as similar to thematic metadata. This metadata contains a description of a particular theme (e.g. windmills in Cassini maps) and the related information, in particular the estimation of its data accuracy and its homogeneity throughout its spatial extent. Presently, due to an approximate knowledge about the quality of historical data, the main objective of this metadata is to provide an estimatation of this quality, theme by theme. Let us use the following example based on the roads in Cassini maps to illustrate this lack of knowledge. [24,25] points out the inaccuracy in drawing roads and waterways due to the fact that engineers drew at sight certain elements (the
Data Integration GeoService
109
Fig. 1. Overlapping of a Cassini maps extract and the current roads network
roads, the heights, the small valleys, the wood contours, the footpaths direction, the rivers). This information suggests that roads are inaccurate in Cassini maps. A georeferencing of Cassini maps allows us to estimate visually this inaccuracy which varies from 100 to 400 meters as shown in figure 1. This metadata can very roughly be retrieved in a similar manner as that which is done for the primary metadata. Nevertheless, a very-time consuming complementary work would have to be done to study all the themes present in each data. Lastly,the secondary metadata can be useful for georeferencing processing (cf. 4.1), notably to evaluate the accuracy of control points. A method to build this catalogue – Projects which use historical data each have a different description structure of metadata. Resulting in a situation where the number of proposed solutions to describe historical data is almost equivalent to the number of projects themselves. This implies a lack of interoperability which does not allow a unique entry point through a metadata portal. To propose a unique way to describe historical data appears essential but at the same time presents a major challenge given the existed amount of historical data. To define this structure, the primary and secondary metadata previously described can be used. The whole of this metadata can be described by the ISO19115 norm [26]. Indeed, this norm is dedicated and applicable to the cataloguing and full description of datasets. Contrary to the descriptive metadata which can be directly described by the ISO-19115, a specific profile to describe the secondary metadata has to be added to the ISO-19115 norm (which allows such a profile creation). Finally, to build this catalogue, studies of historical data content and quality are required. This work could be partly done in a collaborative way, using a metadata portal usable by all users.
110
3.2
E. Grosso, A. Bouju, and S. Musti`ere
Which Possible Link between Historical and Current Data?
Once users find the historical data they are looking for, they need to know the possible link between historical and current data in order to integrate it. Therefore, users need to have a way to create correspondences between historical and current objects. A pivot data schema is therefore necessary. We propose to use the grid defined by the French National Center of Urban Archaeology. This grid is composed by seven main urban functions: Public transmission lines, city planning ; Defensive and military structures ; Civil constructions ; Religious buildings ; Funerary ; Production ; Environment. Each main urban function is made up of headings. For example, the function “Public transmission lines, city planning” is made up of the following headings: Roads ; Clear spaces ; Management of the banks and water ways ; Relief management ; Crossings ; Water conveyances ; Storm drains, evacuations. This grid is based on concepts which describe both historical and current data. Therefore, it seems well adapted to our problem.
4
Data Adjustment Process
Once users have found adapted data which fit their need, they have to adjust this data. Indeed, it is the only solution to compute the inconsistencies between several states (different epoch dates), to compute global and local accuracies, and moreover to have historical data consistent with a frame of reference. The classical process consists in a control points selection, a georeferencing process and a quick evaluation (Fig. 2.A). Our proposal (Fig. 2.B) is not only to create additional steps but also to improve the usual georeferencing process at all steps. Each step of our approach is now described.
Fig. 2. The different steps of the data adjustment process. A: the classical process, B; our proposal.
Data Integration GeoService
4.1
111
Control Points Selection
To georeference data, users have first to define at least three control points. Control points are traditionally described as a couple of planimetric coordinates which enable to compute a spatial transformation from a source frame (here the historical data frame) to a target frame (here the recent frame of reference). A couple of control points loosely consists of a source control point (xsource , ysource ) and a target control point (xtarget , ytarget ). A common approach would be that all control points are identically weighted. However, this does not reflect reality. To improve this existing solution, we propose an alternative control points definition by taking into account the theme with which a control point is linked. Thanks to the cataloguing process previously described, in particular the description of the quality and the accuracy of data, it is now possible to estimate the adapted weight of each individual control point. This parameter will then be introduced into a calculation making it possible to find the most adequate spatial transformation. To implement this solution, we define an “adjustment link” as the association between two control points, through the following XML Schema Description: <x s : s c h e m a x m l n s : x s=” h t t p : //www. w3 . o r g /2001/XMLSchema”> <x s : e l e m e n t name=” a d j u s t m e n t l i n k ”> <xs:complex Ty pe> <x s : s e q u e n c e> <x s : e l e m e n t name=”name” type=” x s : s t r i n g ” /> <x s : e l e m e n t name=” x s o u r c e ” type=” x s : d o u b l e ” /> <x s : e l e m e n t name=” y s o u r c e ” type=” x s : d o u b l e ” /> <x s : e l e m e n t name=” a c c u r a c y s o u r c e ” type=” x s : d o u b l e ” /> <x s : e l e m e n t name=” x t a r g e t ” type=” x s : d o u b l e ” /> <x s : e l e m e n t name=” y t a r g e t ” type=” x s : d o u b l e ” /> <x s : e l e m e n t name=” a c c u r a c y t a r g e t ” type=” x s : d o u b l e ” /> <x s : e l e m e n t name=” h i s t o r i c a l d a t a ” type=” x s : s t r i n g ” /> <x s : e l e m e n t name=” l a b e l ” type=” x s : s t r i n g ” /> x s : s e q u e n c e> x s:complexType> x s : e l e m e n t> x s : s c h e m a>
An “adjustment link” is therefore defined by a source control point (xsource , ysource ) and a target control point (xtarget , ytarget ), in association with a name which enables to identify a specific link. To improve this common definition, four additional parameters are considered: – accuracysource and accuracytarget . The accuracysource and accuracytarget are functions of the scan resolution, the zoom level and the spatial extent. The scan resolution is considered due to the fact that control points are selected on raster data.
112
E. Grosso, A. Bouju, and S. Musti`ere
– historicaldata and label. An historicaldata fits the historical data identification, e.g. “Cassini maps”, and a label fits the nature of a control point, e.g. a label can be “tower-bell”, “road (intersection)”, etc. A label is therefore linked to the secondary metadata. Consequently, the historicaldata and label parameters can provide information about the accuracy of a control point. Thanks to these parameters, the accuracy of a control point can be estimated and then a weight can be defined for each control point. 4.2
Global Spatial Adjustment or Georeferencing
Once control points have been selected, a global spatial adjustment (comparable to a georeferencing process) needs to be carried out. Users have a choice among several possible spatial transformations to achieve this task: – – – – – – –
Affine transformations, Helmert transformations (four or seven parameters), Transformations based on a gravitating model [27], Triangulation and rubbert sheeting [28], Second (or higher) order polynomial transformations, Thin-plate spline method, etc.
Consequently, users have to choose the spatial transformation adapted to their context. This choice is not easy because users are not necessarily expert in spatial transformations. The system has thus to propose the “most adapted” transformation in regard to user needs and the “most adapted” mathematical way to solve this problem (even if a least squares adjustment is traditionally used to solve this problem). “Most adapted” means here that the transformation has to satisfy different kinds of constraints. In our proposal, the objective is to minimize distortions and to take into account some possible user constraints. On one hand, the distortions have to be quantified to know how a transformation can accurately map all control points. This can be done by computing the Root Mean Square (RMS) error based on the residual errors (a residual error is the distance between the target control point and the associated transformed source point). This indicator gives a good assessment of the consistency and the accuracy of a transformation between the different control points. Nevertheless, even if the RMS error is low, some residual errors can be particularly significant (e.g. due to a misplaced control point). In this case, an adjustment link can be removed to improve the transformation. On the other hand, users sometimes need to use georeferenced data for other tasks than those which simply consists in overlapping several layers, e.g. to analyse the orientation of geographical features. The transformation has in consequence to minimize as much as possible length, angular or surface distortions. Knowledge has thus to be added to the system, such as “an affine transformation implies that straight lines remain straight, parallel lines remain parallel, rectangles may become parallelograms”.
Data Integration GeoService
113
Lastly, to improve the consistency of the transformation, the historicaldata and label parameters are used to manage transformations relative to datum and projections. 4.3
Local Spatial Adjustment
The georeferencing process is a classical process which is generally based on a global transformation without consideration of local distortions. To tackle this problem, solutions of local spatial adjustment are presented hereafter. A first solution based on a data matching approach can be defined. This kind of approach proposed by [29] aims at adjusting source geometries in order to have the same geometries as the target objects. To be automated, this approach involves some basic prerequisites: the semantic correspondences between objects must be known and then the types of objects must be identified. According to our approach (cf. 3.2 and 4.1), part of the semantic correspondences can be found through the pivot schema which links historical and recent data. The types of objects can then be found thanks to the label parameter. In order to increase and diversify matching possibilities at the same time, a set of rules is defined to characterise specific matching. One of these rules is for example “the contour of a forest can partly be defined by the layout of a road” (Fig. 3). Another solution would consist of defining new control points in local areas and then in having a new local scale transformation such as rubber sheeting [28]. 4.4
Connections Management
Historical data is often available by geographical area. Each area is represented by a map sheet, which can therefore present a problem of connections management between different map sheets in the georeferencing process. Our proposal does not yet bring forth solutions but raises the following questions:
Fig. 3. An example of matching rules: “The contour of a forest can partly be defined by the layout of a road”
114
E. Grosso, A. Bouju, and S. Musti`ere
– How to adjust historical data to preserve consistency at the borders of map sheets? – What is the impact of such a management for the adjustment process? – Does one have to apply a single spatial transformation to be applied to all map sheets? Experiments are currently in progress to answer these questions. 4.5
Evaluation
The evaluation step is the key point of the data adjustment process. Indeed, to carry out the best data analysis, users need information about the accuracy, consistency and quality of output data provided by the process. This information must be both a technical and a visual support. The RMS error and the residual errors are therefore computed to check the consistency and accuracy of the transformation. Numerous maps can then be drawn: map of displacement vectors, map of interpolated displacement vectors (to visualize the field of displacement vectors), map of residual errors (vectors), map of interpolated residual errors, distortion grid, isolines of scale and rotation (as introduced by [10]), map of distortion forces (isolines), map of Tissot’s indicatrix (or Tissot’s ellipse of distorsion). All these maps give an overall view of distortions. We propose to provide users this information through a vector format in order to exploit it. Finally, to improve the standard evaluation process, we propose to use some geostatistical tools (notably variograms) to detect a particular spatial structure of errors. Our final objective is now to build a complex geo-processing service based on our proposal.
5
Building a Complex Geo-process through Services: A Data Integration Geoservice
The data integration process is complex as this process requires many tools whose assemblage seems indivisible, e.g. the data matching tool. Consequently, the problem is to know how to build a geo-processing service which takes into account the complexity of the process. This section aims to give a first answer. 5.1
From Business Components to Geo-processing Services
The first objective was to decompose the data integration problem into business components in order to split the problem into sub-problems. This decomposition allows at the same time to create our architecture by creating a service linked with each business component. The data integration framework is therefore composed of the following services:
Fig. 4. Sequence diagram of our proposal
Data Integration GeoService 115
116
– – – – – – – –
E. Grosso, A. Bouju, and S. Musti`ere
Coordinate transformation service, Global data adjusting service or georeferencing service, Local data adjusting service, Connections management service, Evaluation service, Metadata service, Orchestration service, Data analysis service.
All these geoservices are described in the last sections, except for the orchestration and the analysis services. The orchestration service is introduced to manage all the other services. This service can be considered like a controlling or monitoring service which is able to dynamically create a geoservice chaining to fit the context defined by both data and users. Its role is illustrated in figure 4. Finally, a data analysis service is added to this architecture. Its role is described below. 5.2
Introduction of a Data Analysis Geoservice
In a framework of complex geo-processing services involving service chaining, the error management is crucial. Indeed, a geo-processing service which encounters
Fig. 5. Data analysis service principle
Data Integration GeoService
117
a problem, usually involves a global error for the entire process. Moreover due to an increase in data heterogeneity, the error management becomes evermore complex. To improve this management, geo-processing services have thus to be adapted to unspecified input data which are heterogeneous in contents and in quality. In response to this problem, we propose to introduce a data analysis service which is able to provide the maximum of contained information about data provided by users, such as geometric type, topological quality, reference coordinates system (if possible), dataset homogeneity, objects density, etc. This service can be seen as an example of geo-preprocessing service such as described in [17], thus introducing more flexibility into the process. The role of this service is to define an applicative context which can guide the entire process, and check data constraints (e.g. topological constraints) as illustrated in figure 5.B, to enable a data pre-processing which will avoid errors that are contrary to common cases (Fig. 5.A). 5.3
Building the Solution
To build the solution described in this paper, several strategies can be imagined. We have chosen to develop a solution based on open source components. A thin client based on OpenLayers – in order to privilege a user on-line access – will be used to develop the client interface, GeoServer for the WMS and WFS framework, and MDWeb for the metadata part. To develop geo-processing services, the WPS framekork provided by the 52◦ North project will be used, and the GeOxygene platform will provide integration tools.
6
Conclusion
This paper has presented an on-going work to build a first data integration service in the historical data context. This service aims at providing an access to data integration process in order to meet user needs at different levels. Our approach is based on the three following proposals: The first proposal is to create a metadata catalogue to improve the historical data management. The role of this catalogue is to provide users with an easy access to historical data content through global and technical metadata and provide users with the opportunity to share their knowledge thanks to a metadata portal. The second proposal is to improve the common data adjustment process by enhancing the weighting and the management of control points, by using both a global and a local approach, and by coupling differents techniques (standard, geostatistics) to evaluate the results. The last proposal consists in defining a first approach to build a data integration service. Data integration problem is decomposed into sub-problems. A geo-processing service is created to solve each sub-problem and an orchestration
118
E. Grosso, A. Bouju, and S. Musti`ere
of all the geo-processing is proposed. Lastly, a data analysis service is introduced to guide the orchestration service to dynamically create a geoservice chaining, and to improve error management.
References 1. Dupouey, J.-L., Bachacou, J., Cosserat, R., Aberdam, S., Vallauri, D., Chappart, G., Corvisier De Vill`ele, M.-A.: Vers la r´ealisation d’une carte g´eor´ef´erenc´ee des forˆets anciennes de France. In: Revue du Comit´e Fran¸cais de Cartographie (CFC), vol. 191, pp. 85–98 (2007) 2. Sanderson, E.W., Brown, M.: Mannahatta: An ecological first look at the Manhattan landscape prior to Henry Hudson. Northeastern Naturalist 14(4), 545–570 (2007) 3. Sheth, A., Larson, J.: Federated database systems for managing distributed, heterogeneous and autonomous databases. ACM Computing Surveys (22:3), 183–236 (1990) 4. Parent, C., Spaccapietra, S.: Database Integration: The Key to Data Interoperability. In: Papazoglou, M., Spaccapietra, S., Tari, Z. (eds.) Advances in ObjectOriented Data Modeling, pp. 221–253. MIT Press, Cambridge (2000) 5. Kilpel¨ ainen, T.: Maintenance of multiple representation databases for topographic data. The Cartographic Journal (37:2), 101–107 (2000) 6. Hampe, M., Sester, M.: Real-time integration and generalization of spatial data for mobile applications. In: Geowissenschaftliche Mitteilungen, Maps and the Internet, Wien, pp. 167–175 (2002) 7. Egenhofer, M., Clementini, E., Di Felice, P.: Evaluating inconsistencies among multiple representations. In: The Sixth International Symposium on Spatial Data Handling (SDH), Edinburgh, Scotland, pp. 901–920 (1994) 8. Sheeren, D., Musti`ere, S., Zucker, J.: Consistency Assessment Between Multiple Representations of Geographical Databases: a Specification-Based Approach. In: Developments in Spatial Data Handling, Proceedings of the 11th International Symposium on Spatial Data Handling (SDH), pp. 617–627. Springer, Heidelberg (2004) 9. Noizet, H.: Les plans d’lots Vasserot, support d’un syst`eme de l’information g´eographique historique de Paris. In: EAV, La revue de l’´ecole nationale sup´erieure d’architecture de Versailles, vol. 14, pp. 86–95 (2009) 10. Jenny, B., Adrian, W., Lorenz, H.: Visualising the Planimetric Accuracy of Historical Maps with MapAnalyst. Cartographica 42-1, 89–94 (2007) 11. Pridal, P., Zabicka, P.: Tiles as an approach to on-line publishing of scanned old maps, vedute and other historical documents. In: e-Perimetron, vol. 3(1), pp. 10–21 (2008); ISSN: 1790-3769 12. Neun, M., Burghardt, D.: Web Services for an Open Generalisation Research Platform. In: ICA, Workshop on Generalisation and Multiple Representation, A Coruna, Spain (2005) 13. Burghardt, D., Neun, M., Weibel, R.: Generalization Services on the Web - Classification and an Initial Prototype Implementation. Cartography and Geographic Information Science (32:4), 257–268 (2005) 14. Foerster, T., Stoter, J.: Establishing an OGC Web Processing Service for generalization processes. In: ICA, Workshop on Generalization and Multiple Representation, Portland, USA (2006)
Data Integration GeoService
119
15. Lemmens, R., Granell, C., Wytzisk, A., De By, R., Gould, M., Van Oosterom, P.: Semantic and syntactic service descriptions at work in geo-service chaining. In: Proceedings of the 9th AGILE International Conference on Geographic Information Science, Visegr´ ad, Hungary (2006) 16. Bucher, B., Jolivet, L.: Acquiring service oriented descriptions of GI processing software from experts. In: 11th AGILE Conference, Girona, Spain, August 4-8 (2008) 17. Bucher, B., Balley, S.: A generic preprocessing service for more usable data processing services. In: Proceedings of the 10th AGILE Conference, Aalborg, Danemark (2007) 18. Stollberg, B., Zipf, A.: OGC Web Processing Service Interface for Web Service Orchestration - Aggregating Geo-Processing services in a bomb threat scenario. In: Ware, J.M., Taylor, G.E. (eds.) W2GIS 2007. LNCS, vol. 4857, pp. 239–251. Springer, Heidelberg (2007) 19. Sch¨ affer, B., Foerster, T.: A client for distributed geo-processing and workflow design. Journal of location based services 2(3), 194–210 (2008) 20. Weiser, A., Zipf, A.: Web Service Orchestration of OGC Web Services for Disaster Management. In: Li, J., Sisi Zlatanova, A.F. (eds.) Geomatics Solutions for Disaster Management, GI4DM, 3rd International Symposium on Geoinformation for Disaster Management, pp. 239–254. Springer, Toronto (2007) 21. Nash, E., Bobert, J., Wenkel, K., Mirschel, W., Wieland, R.: Geocomputing Made Simple: Service-Chain Based Automated Geoprocessing for Precision Agriculture. In: 9th International Conference on Geocomputation, National University of Ireland, Maynooth, Ireland (2007) 22. Stollberg, B., Zipf, A.: Geoprocessing Services for Spatial Decision Support in the Domain of Housing Market Analyses. In: The 11th AGILE 2008 Conference on GI Science (2008) 23. Costa, L., Robert, S.: Guide de lecture des cartes anciennes. Editions errance, Paris (2009) 24. Pelletier, M.: La carte de Cassini, l’extraordinaire aventure de la carte de France. Presses de l’´ecole nationale des Ponts et Chauss´ees (1990) 25. Institut G´eographique National: La Nouvelle Carte de France au 20:000 - son utilit´e, son ex´ecution. Institut G´eographique National, Minist`ere des travaux publics, des transports et du tourisme (1951) 26. ISO: ISO-TC211 19115 Geographic Information, Metadata. International Standard (2003) 27. Langlois, P.: Une transformation ´elastique du plan bas´ee sur un mod`ele d’interaction spatiale, applications en g´eomatique. In: Journ´ees de la Recherche sur les SIG, GDR 1041 Cassini, INSA Lyon, pp. 241–250 (1994) 28. White, M.S., Griffin, P.: Piecewise linear rubber-sheet map transformations. The American Cartographer 12(3), 123–131 (1985) 29. Sester, M., von G¨ osseln, G., Kieler, B.: Identification and adjustment of corresponding objects in data sets of different origin. In: Proceedings of the 10th AGILE Conference, Aalborg, Danemark (2007)
A Study of Spatial Interaction Behaviour for Improved Delivery of Web-Based Maps Eoin Mac Aoidh1 , David C. Wilson2 , and Michela Bertolotto3 1
National Centre for Geocomputation, NUI Maynooth, Ireland 2 University of North Carolina at Charlotte, USA 3 School of Computer Science and Informatics, UCD, Dublin, Ireland [email protected], [email protected], [email protected]
Abstract. As access to large quantities of detailed spatial information becomes more commonplace through popular Web-based map browsing systems, the need for automatic spatial information filtering processes grows. Particulary in the case of wireless access, a suitable filtering mechanism can assist the user by personalising the map contents, giving priority to the most relevant content and eliding less relevant content. We have developed a generic solution for implicitly obtaining an insight into a user’s interests and intentions based on their mouse interactions with a Web-based map browsing interface. Results of a previously reported experiment showed visual patterns in interface interaction across multiple users. This has led us to develop a new categorisation of the user interactions that can serve as a basis for more accurately weighting user interest profiles. This paper proposes a classification scheme for user interactions according to their functional properties by describing a ’mouse movement hierarchy’. The value of such a weighting is demonstrated by employing the hierarchy to make a new experimental analysis of a dataset of previously logged user interactions. By properly understanding sequences of interactions, implicit inferences about the users interests can be made, leading to improved map personalisation, ultimately improving the user’s experience by delivering the most relevant content to the user in a timely fashion.
1
Introduction
Widely used interactive Web-based map browsing systems such as Google Maps [1], Yahoo! Local Maps [2], MapQuest [3], etc. all provide similar fundamental functionality. The map is embedded in a Web page, providing tools such as pan, zoom, search, and often offering the ability to toggle features (e.g. roads) on and off. Everyday use and generation of spatially referenced data, and the popularity of GPS enabled devices are major contributing factors to the rapidly growing availability of spatial data. Map data can be represented by raster or vector data. Typically, Web-based map applications operate a hybrid data representation approach, converting vector data to a series of rasterised tiles in standard image formats for easy transfer and rendering over the Web. The techniques described in this paper are best suited to vector based maps, however they are also J.D. Carswell et al. (Eds.): W2GIS 2009, LNCS 5886, pp. 120–134, 2009. c Springer-Verlag Berlin Heidelberg 2009
A Study of Spatial Interaction Behaviour for Improved Delivery
121
applicable to hybrid approaches, as such servers typically store an underlying vector representation from which the raster tiles are generated. Such a system, with a flexible mapping solution, and the ability to implicitly infer the user’s interests can also personalise the user’s map accordingly. Map objects (e.g. roads, lakes, buildings etc.) can be withheld from, or included in the user’s map at various levels of detail, providing personalised content to the user and reducing the download time for the most relevant data. Where the ability to pan and zoom an interactive map was once enough to easily locate the information required, the current, growing quantity of freely available high-resolution spatial data can swamp the user with information. This is exacerbated in the case of a Location Based Service (LBS) where the user typically accesses content over a wireless connection on a mobile device with a small screen, slower processor, and reduced bandwidth. More advanced filtering techniques are now required as a technological aid to personalise the spatial content for the user based on his/her preferences. By personalising map content, information likely to be of interest to the user can be prioritised, and less relevant information can be elided. Personalisation can enrich the user experience by improving access time to priority information specific to the user, and reducing information overload by eliding less relevant content. In order to personalise the content, the user’s preferences must be known. This is done by implicitly inferring the user’s interests and building a user profile. User interests can be automatically inferred based on their interactions [4,5,6,7,8]. Over the course of a session, a user may perform a huge amount of interaction. A weakness of our original approach is that in a case where the user performs a large amount of interaction, the interest determining process can become swamped with information. This is of particular concern in the Web and wireless domain where information transmission and processing is at a premium. The work presented in this paper addresses the problem of identifying and filtering the most salient interactions performed by a user. The approach adopted is to categorise interactions according to a hierarchy which can be applied to a weighting scheme in order to improve the accuracy of the interaction-based interest determining technique. Implicit profiling analyses user interaction behaviour to detect activity that indicates an interest, or alternatively, a disinterest. For instance, in the case of a Web page, the act of printing or book-marking the page indicates an interest. Conversely, navigating away from the page rapidly is likely to signify a disinterest. The corresponding user profile would indicate that content similar to that printed by the user should be given priority, while content similar to that navigated away from should be reduced in priority. In a spatial browser, zooming in on an object and clicking on it for additional information are operations which signify an interest on behalf of the user. Conversely, areas not viewed by the user are likely to be less important and can be weighted as such. Effectively, the user’s interactions can act as a proxy to reveal the user’s thoughts (positive or negative) in relation to a specific piece of information. We have developed an algorithm which employs the user’s mouse activity as a spatial implicit interest
122
E. Mac Aoidh, D.C. Wilson, and M. Bertolotto
indicator. A prototype system called GIIViz (Geospatial Interactions Interpretation and Visualisation) was developed to log user’s mouse activity with a typical Web-based map browsing system for subsequent visual and computational analysis. In a previous experiment with GIIViz, it was noted that some types of mouse activity were better suited to disclosing user interests than others [8]. This paper further investigates this notion by proposing a new mouse movement hierarchy, which classifies the types of mouse activities identified, and considers their appropriate use for interest indication in a Web-based spatial environment. The hierarchy allows us to interpret the user’s interactions in such a way that more precise indications of interest can be obtained with reduced amount of computation. The remainder of this paper is structured as follows: Section 2 provides a review of the most relevant literature to the work presented in this paper. Section 3 outlines the experiment carried out with the GIIViz system. Section 4 presents the mouse movement hierarchy, and describes each of its constituent components. Section 5 discusses the application of the hierarchy for implicit user profiling in the context of improving the delivery of Web-based maps for a Location Based Service. Section 6 provides a conclusion.
2
Background
The research documented in this paper is based on the provision of generic functionality such as pan and zoom, and a clickable/dragable map as offered by typical Web-based map browsing systems. A review of the functionality, design, and usability of such systems is provided by Nivala et al. in [9]. The research presented here is based on the premise that no additional hardware is required to gain an insight into the user’s interests. The mouse movements of a user are readily accessible and can be employed as implicit interest indicators in the case of non-spatial information, as documented by authors such as Claypool et al. [4], Arroyo et al. [5], Chen et al. [7], and Atterer et al. [10]. These, and indeed other authors [6,11,12], document and exploit the positive relationship between users’ mouse movements, eye movements and associated thought processing. Based on the work conducted in the non-spatial domain, we developed an adapted technique for spatial information which automatically calculates an interest score for each mapped object according to the user’s map navigational operations and their associated mouse movements. The technique is developed for use with vector-based data, where each individual feature (building, road section, lake etc.) is treated as a unique object. The interest determining technique is designed to be applicable to any map browsing application providing the generic map browsing functionality critiqued in [9]. The details of our technique are outlined in [8,13], which also introduces the Web-based GIIViz prototype mapping system (Geospatial Interactions Interpretation and Visualisation). The use of GIIViz for the visualisation of user’s mouse movements in relation to the spatial data being browsed is highlighted in [8]. The use of visualisation to study mouse movement behaviour in a non-spatial context is documented by Mueller et al. [6], and Arroyo et al. [5]. These authors
A Study of Spatial Interaction Behaviour for Improved Delivery
123
produce heatmap-style interpolated visualisations to identify hot and cold spots for mouse interactivity on Web pages. The visualisations produced by GIIViz relate to spatial information and are more specific, additionally allowing for the identification of the direction and speed of any mouse movements. Furthermore, as GIIViz visualises the movements of the user’s mouse in the context of the map navigation operations carried out, it is possible to identify those actions which were required to manipulate the map (to pan or zoom etc.) and those actions which appear to have no specific purpose, and may have been performed subconsciously. Interactions with a spatial dataset are inherently different to those with a body of text and images, which are typically presented in linear format. In the case of spatial information, it is necessary to pan and zoom to various sections of the dataset which are of interest. It is the identification of these interactions which are necessary to view and navigate through the data, versus those interactions which occur for no obvious data manipulation function, but which may be indicative of a subconscious level of interest in their surrounding area which are the focus of our implicit interest determining process. The notion of conscious and subconscious interactions are identified in a nonspatial context by Atterer et al. in [10]. The work presented in this paper places the psychological concept behind such actions in a spatial context, and also considers other properties of the actions, such as their intentional or unintentional execution, and the physical patterns produced by such activity. Physical patterns of movement in a non-spatial environment were explored by Mueller et al. [6]. A study using Web pages is documented, which reveals that users often employ their mouse pointer in a scanning motion when quickly reading through lines of text for instance, illustrating that the paths travelled by the mouse can indicate distinct areas of interest for the user. A review of the cognitive processes at work in the brain when reading maps is documented in [14]. The work presented in this paper considers both the physical and psychological aspects of mouse movements within a spatial context, rather than the deeper cognitive mechanisms reviewed by Lobben [14]. The interest determining algorithm component of the GIIViz system, outlined in [8,13], automatically computes an interest score for each of the objects in the user’s vector-based map. Each lake, building, road section etc. are represented as individual objects. The score is computed according to the level of mouse interaction in the object’s vicinity, while taking the scale of the map at the time into account. Accordingly, the regions viewed at the highest level of zoom with the greatest quantity of mouse activity are most likely to contain the user’s objects of interest. By ranking the map objects according to their computed score, it is possible to identify the objects of greatest interest to the user. Figure 1 shows a screenshot of GIIViz illustrating the areas of a map viewed by a particular user (rectangles), and the mouse movements executed by the user (circles). The smallest rectangles correspond to the areas viewed in greatest detail. The size of the mouse movement indicators is proportional to duration. Larger circles indicate slower movements / longer hesitations of the mouse. The interface provided by GIIViz to view the visualisation in Figure 1 is interactive. It can be panned
124
E. Mac Aoidh, D.C. Wilson, and M. Bertolotto
Fig. 1. Visualisation of a user’s interactions produced by GIIViz. Map detail has been entirely removed from this image for ease of viewing. The areas around A and B show the general focus of the user’s attention. Objects within these areas are much more likely to be of interest to the user than objects in the areas which were not interacted with. Dots/circles indicate areas where the mouse pointer was logged.
and zoomed for closer analysis. It is evident, based on this user’s interactions, that the areas, and objects of interest to this user are those around A and B. Much of the mouse activity is also clustered around these areas. The areas which were not interacted with or viewed in any detail are least likely to be of interest to the user. The map objects have been removed from this visualisation for ease of viewing, only the represented user interactions remain. As the interest score is based on the level of interaction in each object’s vicinity, the objects around A and B in Figure 1 receive the highest interest scores. This is a graphical interpretation of the GIIViz interest determining process. Such visualisations were employed as visual aides in the identification of the interaction characteristics which define the mouse movement hierarchy described in this paper.
3
Experiment Discussion
An experiment was conducted to determine the viability of the automatic interest determining method. The experiment and the method are described in [8,13]. The experiment gathered interaction data from 16 participants. The same ten tasks were completed by each participant. The tasks involved were short, unambiguous, purpose driven ’locate’/’compare’/’find a route’ type tasks. The tasks required the identification of specific objects within the map, which were classified as the user’s objects of interest for the task in question. The algorithm was evaluated on its ability to identify the named objects in the user’s task as the objects of greatest interest to the user. Two participants were over 60
A Study of Spatial Interaction Behaviour for Improved Delivery
125
years of age with no previous spatial data browsing experience. The remaining 14 participants were aged between 20 and 35. Five of them were female and the remainder, male. Half of them (computer science students) use spatial data applications such as Google Maps [1] regularly (more than once every two weeks), the other half, graduates of various third level disciplines, make occasional use of such spatial applications. Within the User Centred Design (UCD) domain, it is commonly accepted that six to eight participants are sufficient for assessing usability [15]. UCD techniques are usually based on explicit feedback. As the focus of this work is on implicit feedback (mouse movements), this guideline figure was doubled to 16 participants to ensure sufficient coverage. Participants’ behaviour during the experiment was observed and noted using a combination of think-aloud by the participants and over-the-shoulder observation. The GIIViz interface employed in the experiment is a Java-based map browsing interface designed to be similar in appearance to, and to emulate the functionality provided by popular Web-based map browsing systems. The interface includes a large interactive map panel which can be dragged and zoomed, a set of fixed increment pan and zoom buttons, search and distance measuring tools, and the option to toggle specific types of features such as roads, lakes and airports on and off. A full description of the interface is given in [8]. Each of the user’s interactions with the interface were logged continuously during the experiment, providing us with a detailed set of interactions for analysis. The information logged includes the mouse cursor position and any map navigational operations such as dragging the map, clicking on a pan or zoom button, or searching for an object. With this information it is possible to tell which areas, and more specifically which objects were being viewed and interacted with by the user, as indicated in Figure 1. It is this logged information, re-analysed in the context of our mouse movement hierarchy, described in Section 4, that provides the analytical content in this paper, improving the inferences that can be made from the users spatial interactions, and driving the improved delivery of Web-based personalised maps. The original experiment with these logged interactions, detailed in [8,13], found that the algorithm was capable of correctly inferring each user’s main objects of interest in 80% of cases by processing all of the interaction activity in the session. By subsequently analysing the performance of the algorithm and comparing it with visualisations of user interactions, such as that in Figure 1, discrepancies became apparent in the accuracy of the algorithm: some mouse activities were identified to be more indicative of interest than other activities. This highlighted a weakness in our interest determining approach. Just as users can become swamped with irrelevant map data, the interest determining algorithm can also become swamped with interaction data. The system has the potential to log the mouse cursor position every 15ms. Over the course of a user session, the quantity of interactions logged could easily become unmanageable. Intensive computation is required to calculate user interests based on all interactions during a lengthy session. Furthermore some of the interactions performed during a session may be misleading for interest
126
E. Mac Aoidh, D.C. Wilson, and M. Bertolotto
determining purposes. For instance, the mouse might hesitate for a long period of time at a specific point: Is the user actively considering the objects in the vicinity of the mouse cursor, or is he gone off to answer the phone or to get a cup of coffee? Imagine another case where the mouse moves over a mapped object, causing the information associated with the object to pop up due to the system’s recognition of a mouse-over operation. Is the user actively seeking the mouse-over information associated with the object, or did the mouse happen to pass over the object en-route to the toolbar to perform some other function? Our mouse movement hierarchy was developed to categorise and isolate the most salient mouse interactions in order to eliminate misleading interactions from interest calculations and to identify the most salient interactions, thereby reducing the computational intensity required to accurately determine user interests. The interactions logged during the experiment summarised here are analysed through the framework of the mouse movement hierarchy in Section 5.
4
Mouse Movement Hierarchy
Mouse movement activity takes place on two dimensions: The physical activity of moving the mouse, and the psychology behind its movement. The psychological and physical aspects associated with an interaction are intrinsically linked. An understanding of one aspect would in turn help give a clearer understanding of the other aspect, allowing for a clearer insight into the interactions best suited to inferring user’s interests and intentions. Such is the speed of the human brain that sequences of actions are conceived and executed within fractions of a second. The psychological and physical aspects of the actions must be broken down into their constituent parts for understanding and interpretation purposes. In order to provide such a breakdown, we propose a classification of mouse movement activity, in the form of a mouse movement hierarchy. Figure 2 shows our mouse movement hierarchy, which represents the types of mouse movements identified through the visualisation of user interactions collected during our experiment. As the user executes a task, each movement of the mouse can be represented by a section of the hierarchy. Some parts of the hierarchy are represented in a user’s mouse movements extensively, while other types of movement occur less frequently. Only the most frequently represented parts of the hierarchy are expanded in Figure 2 for ease of viewing. Two sub hierarchies are unexpanded, however they are identical to their neighboring subhierarchies as indicated. At the highest level, the psychological aspect of a mouse movement involves either a conscious or subconscious use of the mouse: the user either knowingly moves the mouse, or moves the mouse without realising it. Conscious actions are most likely to be performed intentionally. For instance, if the user wants to zoom out, the mouse is intentionally moved to the zoom out button and clicked. Conscious interactions are necessary interactions that must be performed in order to produce a result from the interface. Conscious interactions include pans, zooms, and intentional movements of the mouse to and from the map toolbars. An example of such conscious, intentional mouse movement is
A Study of Spatial Interaction Behaviour for Improved Delivery
127
Fig. 2. Combined Mouse Movement Hierarchy
shown in the upper part of Figure 3. In this case, the user intentionally moved the mouse between the toolbar on the left and two different park objects on the map. A smooth, steady mouse track is evident from the dots in the figure. Subconscious actions, on the other hand are often performed by the user without even realising it, as described by Mueller et al. [6]; in a non-spatial text based interface, the mouse pointer is often employed to scan through lines of text for example. Subconscious sequences of mouse movements assist the user in his/her assessment of the display, to determine the areas that will require further investigation. This type of mouse movement behaviour reflects the research outlined in work such as [5,6,7,11] which outline a relationship between users’ mouse cursor position, their thought process, and their eye movements. Locations where the eyes or mouse hesitate or fixate for brief periods can be associated with ”intense cognitive processing” according to Pan et al. [11]. Subconscious movements are not necessary for the user to complete a task, they can be exhibited to greater or lesser extents depending on the user. Furthermore, as each individual is unique, it would be unreasonable to assume the same level of correlation between the user’s mouse and their thought processing, therefore strength of this subconscious indicator of interest varies from user to user. An example of such subconscious activity can be seen in the lower part of Figure 3. The mouse movement in this case is uneven and sporadic in contrast to the conscious movements shown in the same figure.
128
E. Mac Aoidh, D.C. Wilson, and M. Bertolotto
Conscious Movements
Subconscious Movements
Fig. 3. Examples of conscious intentional map manipulative behaviour (above) and subconscious behaviour (below) The path of the mouse can be seen as a series of dots. Larger circles indicate longer pauses. Much of the map content has been removed for ease of viewing.
According to our analysis, consciously executed movements are likely to be either intentional or accidental. Conversely, subconscious movements are likely to be unintentional or accidental. There is a possibility that unintentional activity could be consciously performed, and that subconscious activity could be intentionally performed. These cases have not been identified in this particular study to date, as the research is based on a visual and observational investigation. However, more detailed research into the cognition involved [14], perhaps using equipment such as EEG monitors and eye tracking hardware might give a clearer insight. To this effect, these nodes in the hierarchy remain unexpanded until such research is conducted. Accidental operations were found to be performed both consciously or subconsciously. For example, while moving the mouse consciously or subconsciously, the user might click and drag the map by mistake. In the event of such a mistake occurring, the user is likely to recognise that an accidental operation was performed. If the mistake causes a problem, such as causing the map to pan to an area outside the required piece of map, he either corrects it with a conscious intentional operation (by panning back to the correct piece of the map), or the mistake is ignored and the next operation follows as normal. The physical movements of the mouse are accounted for in the hierarchy by swift movements and distinct hesitations, where the mouse is left resting in a specific location. The variable δ is used to define the boundary between these activities. sequences of logged positions with a duration less than δ between them
A Study of Spatial Interaction Behaviour for Improved Delivery
129
are classified as swift movements, while those with a duration greater than δ are classified as distinct hesitations. 200ms was employed as a default value for δ during the experiment described in this paper. The value was derived through a series of empirical tests. We expect, subject to further testing, that the value of δ should vary, depending on the user’s level of comprehension and technological competence, with a lesser value for more proficient users, and a higher value for novice users. Hesitations greater than δ (where δ = 200ms) were identified as being of little use for interest determining purposes. Such hesitations could occur as a result of a wide range of possibilities, including leaving the computer to go for a cup of coffee, answering the phone etc. As such, longer hesitations have a greater potential to skew the interest determining process. By contrast, minor pauses or fluctuations within the swift movements executed by a user were found to be much more reliable in assessing inferences of interest. Sequences of ’scanning behaviour’ where the mouse is moved along a trajectory at a steady pace and terminated by ’focus points’ where the velocity of the mouse drops suddenly and the mouse pauses briefly for less than δ were identified as more reliable behaviour than hesitations. Scanning behaviour is defined as swift movements with a duration between 0 and α. Focus behaviour is identified as pauses within swift movements lasting between α and δ in duration. 30ms was employed as a default value for α during the experiment described here. As with δ, its value may be subject to change depending on the individual user. 30ms was identified as reasonable average value for α based on the interactions of the subjects involved in the experiment. Due to the very short durations involved, focus points are often clustered together or interspersed with sequences of scanning behaviour. The short, interspersed nature of scanning and focus points may be reflective of what is known as saccadic behaviour in the human eye [11,16]. A saccade is a series of small jerky movements of the eyes which occurs when changing focus from one point to another. This type of scanning and focus activity was identified in visualisations of experiment subjects’ interactions particularly during route finding tasks, where the mouse is used to trace along or near the route. It has also been noted to occur in the initial part of a task where the user subconsciously moves the mouse back and fourth between the areas/objects of interest while mentally processing the task and deciding what the best approach to the task is. Additional visual examples of the physical activities described here, including scanning and focus behaviours are given in [8]. The bottom level of the hierarchy concerns the resulting operations of the mouse activity. The activity can produce manipulative, or non-manipulative effects: it can cause the map’s appearance to change, or the appearance can remain unaltered. For example, navigational operations such as pan and zoom which alter the user’s view of the map are map manipulative. Operations such as clicking on an object for additional information or intentionally moving the mouse to the toolbar or file menu are non-map manipulative. These types of actions could be performed as a result of conscious or subconscious mouse activities.
130
5
E. Mac Aoidh, D.C. Wilson, and M. Bertolotto
Experimental Hierarchy Application
In order to evaluate our classification scheme, we have used it to weight a new version of our interest determining algorithm and compared it to the baseline of our previous results. The performance of the interest determining algorithm using different types of activity (nodes in the hierarchy) were compared in terms of their accuracy at determining the user’s objects of interest and the quantity of interactions considered in each case. While the user typically performs a certain quantity of each type of activity in our hierarchy during the completion of a task, here we isolate individual types of activity for comparative analysis. The main findings are presented in this section. Each of the activities described by the nodes in the hierarchy have different strengths of interest indicating properties, and some of the paths in the hierarchy are traversed with a greater frequency than others. For example, the user might perform more subconscious scanning activity than conscious hesitations. The experiment found that interests determined using all mouse activity in a session could be employed to determine the user’s interests to an average accuracy of 80%. Better results were obtained by considering particular types of activities in isolation. For example, interests determined exclusively using hesitations had an average accuracy of 83%, while interests determined with swift movements in isolation attained an average accuracy of 87%. However, swift movements occur with a much higher frequency, typically outnumbering hesitations by 7:1. This means 7 times more computation is required to calculate user interests with swift movements over hesitations. Figure 4 illustrates the difference in the quantity of mouse pointer locations considered in each stage of the experiment, and the associated accuracy attained. While the average accuracy scores of the algorithm for swift movements and for hesitations were not altogether dissimilar with the experiment data, it must be noted that a greater strength, or degree of certainty could be attributed to those interests determined using swift movements, as longer hesitations could introduce a biased weight. Furthermore, swift movements can be indicative gestures of both map manipulation and thought processing. Indeed each of the nodes in the hierarchy have different strengths according to their properties. For example, the location of the mouse pointer during conscious, intentionally executed, non-map manipulative operations such as performing a mouse-over or click operation on an object to obtain information associated with the object are solid indicators of objects of interest. By comparison, the mouse pointer location associated with conscious, intentionally executed, map manipulative operations such as panning or zooming are less indicative of the user’s objects of interest. While the act of zooming in on an object is indicative of interest, the locations of the mouse pointer during the zoom operation are usually not in close proximity to the object of interest: A zoom is performed by dragging a box around the object of interest, or by pressing the zoom button in the toolbar. It is the interest indicating properties of the location of the mouse pointer rather than the resulting operation that we are concerned with. It was observed during the experiment that users often switch quickly and suddenly between
A Study of Spatial Interaction Behaviour for Improved Delivery
Experiment Category
Illustration of average quantity of mouse pointer locations employed for interest determing
131
Average Accuracy Attained
All Mouse Activity
80% (100%)
Mouse Hesitations
83% (14%)
Swift Mouse Movements
87%
(86%)
Behavioural Heuristics
85% (14.5 %)
Fig. 4. An illustration of the quantity of pointer locations considered in each scenario and the associated average accuracy attained by the interest determining algorithm. The spread of points is randomly generated using average values for illustrative purposes.
making conscious, intentional map-manipulative operations and subconscious mouse activities at key points during a task. For instance, conscious, intentional actions (i.e. panning, and zooming) are carried out in order to manipulate the map to locate a specific piece of the map. Once the user has found the appropriate piece of map, the mouse is no longer required to manipulate the map, and may be engaged in subconscious activity until the next piece of map must consciously be located. The categorisation of activities laid out by the hierarchy allowed us to exploit the stronger interest indicating activities, and eliminate the less salient activities from consideration, reducing the amount of computation required. by following a heuristic approach, we have automatically identified the scanning sequences and and focus point patterns within the swift movements category. Subsequently, by applying a point decimation heuristic a total of 75% of the activity in the swift movements category was eliminated. We named this series of heuristics “Behavioural Heuristics”, as they are employed to reduce the dataset of mouse pointer locations according to the user’s interaction behaviour. Figure 4 illustrates the significant reduction in the quantity of mouse pointer locations considered with the behavioural heuristics technique. Following the application of this technique, the algorithm achieved an average accuracy of 85%. The average set of user interactions in a session were reduced to just 25% of their most salient swift movements. This is a concrete example of how a detailed classification of mouse movements can help to improve the implicit user profiling process. Computation was reduced by eliminating noise, and focusing on the most salient activities. The elimination of noise helped to improve the accuracy of
132
E. Mac Aoidh, D.C. Wilson, and M. Bertolotto
the algorithm. By improving the accuracy of the interest determining algorithm, the accuracy of the user profiles and subsequent personalisation provided can also be improved. The intelligent filtering of content according to accurate user profiles is becoming crucial for the development of a successful Location Based Service. The sheer quantity of available information must be managed and prioritised to suit the user. Typically, Web-based mapping applications transmit maps as raster tiles for ease of download and rendering, however the rasterised images are usually generated from a vector dataset on the server. This gives scope for the generation of raster images containing a filtered set of features. An application that downloads information without filtering and prioritising over a wireless network with slow download times and intermittent coverage has the potential to cause user frustration. Furthermore, the profiling process employed to provide accurate filtering must be unobtrusive to avoid further frustrating the user. This means that a balance must be struck between the intensive use of resources and the construction of an accurate profile. By applying lessons learnt from the deconstructive analysis of user interactions as demonstrated by the mouse movement hierarchy it is possible to channel resources towards specific aspects of interaction, and away from other noisy aspects which both increase computation and introduce inaccuracies.
6
Conclusion
This paper describes our mouse movement hierarchy. The hierarchy categorises the psychological and physical aspects of mouse movement in relation to a typical spatial data browsing interface. It is a framework within which the most, and least salient aspects of a user’s mouse interactions can be identified. This helps eliminate misleading interactions from consideration when implicitly calculating the user’s interests based on their interactions. An example of such a misleading interaction is a zoom operation where the user moves the mouse to the zoom button in the top left corner of the map, and leaves it resting on the map near the button. This could be mistaken as being indicative of interest in the nearby map features. Application of the hierarchy to such an interaction would reveal that it is a weaker indicator of interest than operations such as clicking on a map object for additional information, and could be weighted as such, potentially removing the operation from consideration by applying a behavioural heuristics technique. Computation of user interests can be focused on the most salient interactions, reducing computational intensity, and also improving the accuracy of calculations by eliminating noisy interactions from consideration. The experiment results described in this paper give a concrete example of the application of the hierarchy. Identifying and simplifying the most salient subset of interactions, scanning and focus operations, it was possible to compute users’ top objects of interest to an average accuracy of 85%. This was achieved using a subset of 25% of swift movements, where swift movements account for 87.5% ( 78 ) of all interactions in a session. Before deconstructing the interactions and defining
A Study of Spatial Interaction Behaviour for Improved Delivery
133
the mouse movement hierarchy, an average accuracy of 80% was attained when employing all interactions in a session. Accuracy was improved and computation reduced by identifying and eliminating noisy interactions. An analysis of associated computation time and algorithm complexity is currently underway. By improving the implicit profiling process, the delivery of Web-based maps for a Location Based Service can be improved. This leads to a better quality, personalised experience for the end user, reducing the delay in information processing, downloading, and rendering. Ultimately, this reduces frustration and improves user satisfaction. The hierarchy presented in this paper is specific to mouse-based interaction with spatial information. To the best of our knowledge, no such hierarchy has been described within the wider non-spatial HCI community. The concept of the hierarchy could be adapted to non-spatial information, and also to other modalities within the spatial domain. Wireless applications are becoming increasingly mobile, and incorporating a wide variety of interaction modes. Interaction with a stylus, or by voice or even device tilting and shaking etc. are becoming more prevalent. A similar deconstruction of each type of interaction according to a hierarchy, as demonstrated by our mouse movement hierarchy, could be performed to improve implicit interaction interpretation for each mode of interaction in turn. Acknowledgements. We would like to thank Christophe Claramunt of the French Naval Academy Research Institute for his helpful comments which contributed to the development of the work presented in this paper. Research presented in this paper was funded by a Strategic Research Cluster grant (07/SRC/I1168) by Science Foundation Ireland under the National Development Plan. We gratefully acknowledge this support.
References 1. 2. 3. 4.
Google Maps (2009), http://www.maps.google.com Yahoo! Local Maps (2008), http://www.maps.yahoo.com MapQuest (2008), http://www.mapquest.com Claypool, M., Le, P., Waseda, M., Brown, D.: Implicit Interest Indicators. In: Proceedings of the ACM International Conference on Intelligent User Interfaces (IUI 2001), Santa Fe, New Mexico, USA, pp. 33–40 (2001) 5. Arroyo, E., Selker, T., Wei, W.: Usability Tool for Analysis of Web Designs Using Mouse Tracks. In: Proceedings of the ACM Conference on Human Factors in Computing Systems (CHI 2006) Extended Abstracts on Human Factors in Computing Systems, Qu´ebec, Canada, pp. 484–489 (2006) 6. Mueller, F., Lockerd, A.: Cheese: Tracking Mouse Movement Activity on Websites a Tool for User Modeling. In: Proceedings of the ACM Conference on Human Factors in Computing Systems (CHI 2002), Seattle, Washington, USA, pp. 279–280 (2002) 7. Chen, M.C., Anderson, J.R., Sohn, M.H.: What Can a Mouse Cursor Tell Us More? Correlation of Eye/Mouse Movements on Web Browsing. In: Proceedings of the ACM Conference on Human Factors in Computing Systems (CHI 2001), Seattle, Washington, USA, pp. 281–282 (2001)
134
E. Mac Aoidh, D.C. Wilson, and M. Bertolotto
8. Mac Aoidh, E., Bertolotto, M., Wilson, D.C.: Understanding Geospatial Interests by Visualising Map Interaction Behaviour. Information Visualization, Palgrave 7(34), 257–286 (2008) 9. Nivala, A.-M., Brewster, S., Sarjakoski, T.L.: Usability Evaluation of Web Mapping Sites. The Cartographic Journal 45(2), 129–138 (2008) 10. Atterer, R., Wnuk, M., Schmidt, A.: Knowing the Users Every Move - User Activity Tracking for Website Usability Evaluation and Implicit Interaction. In: Proceedings of the 15th International ACM Conference on World Wide Web, Edinburgh, Scotland, pp. 203–212 (2006) 11. Pan, B., Hembrooke, H., Gay, G., Granka, L., Feusner, M., Newman, J.: The Determinants of Web Page Viewing Behavior: An Eye Tracking Study. In: Proceedings of the 2004 Symposium on Eye Tracking Research and Applications (ETRA), San Antonio, Texas, USA, pp. 147–154 (2004) 12. Cox, A.L., Silva, M.M.: The Role of Mouse Movements in Interactive Search. In: Proceedings of the 28th Annual CogSci Conference, Vancouver, Canada, July 2629, pp. 1156–1162 (2006) 13. Mac Aoidh, E., Bertolotto, M., Wilson, D.C.: Analysis of Implicit Interest Indicators for Spatial Data. In: Proceedings of the 15th ACM International Symposium on Advances in Geographic Information Systems (Short paper), Seattle, Washington, USA, November 7-9, pp. 336–339 (2007) 14. Lobben, A.K.: Tasks, Strategies, and Cognitive Processes Associated With Navigational Map Reading: A Review Perspective. The Professional Geographer 56(2), 270–281 (2004) 15. Nielsen, J.: Guerilla HCI: Using Discount Usability Engineering to Penetrate the Intimidation Barrier. In: Dias, R.G., Mayhew, D.J. (eds.) Cost-Justifying Usability. Academic Press, London (1994) 16. Poole, A., Ball, L.J.: Eye Tracking in Human-Computer Interaction and Usability Research: Current Status and Future Prospects. The Mind’s Eye: Cognitive and Applied Aspects of Eye Movement Research (2003), http://www.alexpoole.info/academic/poole\&ball%20eyetracking.pdf
EgoViz – A Mobile Based Spatial Interaction System Keith Gardiner*, Junjun Yin, and James D. Carswell Digital Media Centre, Dublin Institute of Technology, Ireland {keith.gardiner,junjun.yin,jcarswell}@dit.ie
Abstract. This paper describes research carried out in the area of mobile spatial interaction and the development of a mobile (i.e. on-device) version of a simulated web-based 2D directional query processor. The TellMe application integrates location (from GPS, GSM, WiFi) and orientation (from digital compass/tilt sensors) sensing technologies into an enhanced spatial query processing module capable of exploiting a mobile device’s position and orientation for querying real-world 3D spatial datasets. This paper outlines the technique used to combine these technologies and the architecture needed to deploy them on a sensor enabled smartphone (i.e. Nokia 6210 Navigator). With all these sensor technologies now available on one device, it is possible to employ a personal query system that can work effectively in any environment using location and orientation as primary parameters for directional queries. In doing so, novel approaches for determining a user’s query space in 3 dimensions based on lineof-sight and 3D visibility (ego-visibility) are also investigated. The result is a mobile application that is location, direction and orientation aware and using these data is able to identify objects (e.g. buildings, points-of-interest, etc.) by pointing at them or when they are in a specified field-of-view. Keywords: MSI, Directional Query, Isovist, Radial Query.
of them, to trilaterate location estimates [2, 3]. The orientation component or DirectMe module and a primary focus of this paper uses data from a number of sensors including a GPS sensor, a magnetometer sensor (digital compass) and an accelerometer sensor (tilt sensor). Using these data, the TellMe application formulates directional queries that are performed by a spatial database to determine if any spatial interaction exists between the query “window” and any of the buildings in our 3D university campus model. The shape of this query window takes on a variety of 2D and 3D forms from a simple ray to a polygon to a volume (see Figure 14). The results of this interaction are subsequently presented to the user in the form of a building address/details plus web-links to more information (e.g. classroom timetables, lab opening hours, etc.). Primarily, the TellMe application provides a framework for processing 2dimensional queries in an open, non-directional query-space (e.g. range query). This subsequently enables us to investigate some of the major issues in the area of mobile spatial interaction within 3D environments (e.g. 3D Visibility). In relation to sensor data quality (e.g. noise), tests to compare the data gathered from a higher quality (i.e. more stable) external sensor packs with that of the integrated sensors on our mobile device are ongoing and will help to determine the suitability of current mobile devices for exploitation in the area of mobile spatial interaction. Following this, our attention shifts to exploring 3-dimensional visibility with the development of an “Ego-Visibility” query processor that further confines the query-space to simulate a user’s view frustum. Results from this will enable us to investigate the possibility of providing full “Hidden Query Removal” functionality where only what the user can actually physically see gets returned by the directional query processor. The remainder of the paper is organised as follows: Section 2 describes some related work in the area of directional querying. Section 3 outlines some methods used by current GIS to perform these queries using visibility analysis. Section 4 describes the design of the TellMe system which includes the LocateMe and DirectMe modules, the TellMe Server and hardware considerations. Section 5 details our approach to egocentric visibility and describes five different types of queries and Section 6 concludes with a summary and future work.
2 Related Work There have been a number of different applications proposed recently that utilise compasses and tilt sensors in current state-of-the-art mobile devices. Some early work by [4, 5] introduced the idea of an XWand which is a custom built pointing device that controls electronic devices in an intelligent environment using a variety of sensors in combination to support pointing and gesture recognition tasks. A museum guide is described in [6] to help visitors quickly locate exhibits using orientation aware handheld devices. [7] reports on an interaction method where a user points a mobile device at a building to view a virtual representation of it at different times in the past. In [8], their approach is to use the sensor data to turn the device into a musical instrument using shaking and sweeping gestures. The Shoogle project [9] aims to use inertial sensing to provide an eyes-free vibrotactile display that mimics objects rattling around
EgoViz – A Mobile Based Spatial Interaction System
137
inside the device. Such active use of sensors provides for a rich multimodal interaction that can be used without any visual attention. The majority of current research focuses on providing enhanced navigation capabilities in the area of mobile spatial interaction. The Point-to-Discover GeoWand [1, 10] is a system and application framework for orientation-aware location-based mobile services. This application demonstrates how their custom-built device can be used as a pointing tool to display web-based information about bars and restaurants on the device. A very similar approach is taken by [11] with the iPointer application which is based on an augmented reality engine and a thin client api that provides a local mobile search based on GPS and an eCompass and delivers content, such as pictures, menus, and audio overviews, which are streamed back to the user's phone. A rather different approach is taken by [12]. Using their Point-to-GeoBlog application, users are able to select landmarks by using the point and tilt functionality of a custom built device. No content is provided up front but later when the user logs onto a computer with more visually adequate display capabilities. A detailed usability study by [12] reported that the most intuitive approach was to provide a simple point and tilt interface over a visual map and also a preference for remote tagging that allows users to select landmarks beyond their line-of-sight. These results confirmed a comparative outdoor study by [13] that tested conceptual designs for 4 interaction areas considered important for spatial information applications (SIA). In addition, the information Pull technique, where the user decides what information to view and has control over it [14-16] was the most intuitive and preferred data interaction approach by users. Whereas the Push technique, where all information is automatically presented to the user, is not easily managed on constrained mobile devices. An alternative approach that combats the types of problems related to too much data being presented to the user is to restrict the search space based on certain criteria. In [17], the approach is to restrict the search space to a users field-of-view using the concept of an observer’s 2-dimensional query frustum to determine what the user can actually see from their position in the environment. The use of the Point-to-Discover block model and visibility computation algorithm in [1] uses a rather different approach to determine what buildings the observer can actually see from a single vantage point. In [18], this idea is extended with the development of a local visibility model that introduces the concept of “billboards” as a mechanism to identify what buildings the user can see. This type of egocentric visibility will be a primary focus of this paper. We consider only possibilities in relation to what a user can physically see from their current position by using visibility analysis to do so. Utilizing this type of visibility shape on a mobile platform is a new concept and will be a key aspect of the research.
3 Visibility Analysis There are a number of methods used in the current research described above for performing line-of-sight queries. This section gives an overview of the technologies and methods used by a number of them and describes some emerging possibilities.
138
K. Gardiner, J. Yin, and J.D. Carswell
3.1 Ray Tracing The ray tracing process is a fundamental process in this area and works by simulating the light that travels within a space. In many applications, this process is performed backwards to determine the visibility from one or multiple points (e.g. user positions), which instead of perceiving the light emitted from the objects (e.g. buildings or other infrastructure), the ray is transmitted from a given point outwards in all directions. By retrieving all the intersections from a generated ray and the objects, a polygon in a 2D environment and a volume in a 3D environment can be constructed that represents the visibility from the point in 2D and 3D respectively (Figure 1). This approach is termed Line-of-Sight (LOS) in Geographic Information Systems (GIS).
Fig. 1. Ray Tracing Process
3.2 Line-of-Sight The Line-of-Sight function has been integrated in most commercial GIS software. It determines the visibility from the observer’s position to a target point considering DTM fluctuations. The path from the viewpoint to the target point is one of the rays emitted from the viewpoint. In effect, some of the projected rays are truncated where they intersect with obstacles (e.g. building blocks, etc) while on route to the target. This collection of interaction points forms a convex polygon or volume representing the visibility area of a specific viewpoint. In a 2D environment, where the elevation of the objects is not taken into account, the line-of-sight function to establish the visibility of a particular point in space can be simplified by recording all the intersection points between the rays and obstacles surrounding it in the horizontal plane (Figure 2). However, in a 3 dimensional space, the ray is not just projected in the horizontal plane but also in the vertical plane using the tilt angle from the horizontal plane that ranges from 0 to 360 degrees. This particular object is referred to as an Isovist in a 3D environment [19]. An isovist is described as a visibility polygon comprised of the set of all points visible from a given vantage point in space with respect to an environment. Isovists were first introduced as a method for analysing space in space syntax research by [19]. In [20] this concept has been extended recently with the automatic generation of an axial map which can be used to generate an isovist. Fortunately, a similar function available in GIS is known as a Viewshed for geodetic survey applications.
EgoViz – A Mobile Based Spatial Interaction System
139
Fig. 2. Line-of-Sight Process
3.3 Viewshed Analysis Viewshed analysis adopts the same approach as line-of-sight analysis. However, instead of examining the single path from the viewpoint to the target point, a beam of rays is generated from the viewpoint in the horizontal plane. Concurrently, a beam of rays is generated vertically along each ray path in the horizontal plane and the azimuth is taken into consideration within a certain range. For instance, in the example in Figure 3 the viewshed is based on the viewpoint and the range (based on the azimuth) combined with the horizontal angle or tilt, where the actual visible space is a volume excluding the section of the 3D object that intersects with the surface or building. To estimate the visibility in all directions from the viewpoint using viewshed analysis, the azimuth in the horizontal plane and the view angle in the vertical plane can be extended to a full range, which generates the rays from a point between 0 and 360 degrees, forming a frustum shaped viewshed. One existing application of this idea is the Threat Dome, which is used for estimating all possible locations in space that are visible from a given point and is utilised mainly in military defence situations [21, 22].
Fig. 3. Viewshed analysis in geodetic surveying
140
K. Gardiner, J. Yin, and J.D. Carswell
Adopting such an approach to measure the visible space within a certain radius can significantly reduce the number of calculations required to determine the possible intersection points. This strategy is similar to radar scanning the environment with a limited signal distance (strength). Essentially, it is an extreme case of the radial lineof-sight, which is derived by calculating the basic line-of-sight. 3.4 Threat Dome Analysis In contrast to viewshed analysis, capturing the visible volume from a point taking into account all existing obstacles (objects) in the environment can be very computationally intensive. The threat dome approach specifies a radius defining the view distance from a point and hence the rays emitted from the viewpoint form a sphere. By determining the intersections of the rays with obstacles in different levels of elevation within the sphere, the actual visible space within this specified radius can be determined (Figure 4).
Fig. 4. Example of Threat Dome
4 TellMe The TellMe application is the mobile application that requests information from two modules, namely the LocateMe and DirectMe modules and uses this information to perform directional queries using the TellMe Server against various spatial data sets. Two main approaches are investigated in relation to implementing the TellMe Server. One approach is to host the spatial dataset on the mobile device itself. In this case the data retrieved from LocateMe and DirectMe is used to perform spatial queries locally on the mobile device. The results of the queries are displayed to the user by the TellMe Mobile application. Because of limited memory and the processing speeds required to perform spatial queries on large geographic data sets, this type of architecture was deemed unsuitable. The alternative approach (i.e. thin client/server approach) is to request the required data from LocateMe and DirectMe and perform spatial queries on a dataset hosted on an external server. The parameters are passed to a web application, which in turn carries out the queries on a spatial database to determine any spatial interaction
EgoViz – A Mobile Based Spatial Interaction System
141
between the users location and orientation and the dataset. The results are subsequently displayed to the user in the web browser on the mobile device. This architecture is illustrated in Figure 5. This approach of using a web browser to display the returned data follows current trends in this area. The argument for using web browsers for as many on-device functions as possible is because of the increasing complexity associated with developing applications to run on many different devices [23]. To avoid developing a different application for each operating system, a more attractive solution is to develop applications using a web scripting language capable of producing highly interactive applications that perform the same functions as desktop applications without the concern about underlying operating system functions and restrictions. Taking this approach a step further, it is now becoming more acceptable to let the browser access more different types of data as well. Using a set of JavaScript APIs, it is even possible to access some of the hardware on a mobile device directly from the browser. For example, this is one of the newest features of Google Gears [24] where the Geolocation API allows an application access to the GPS hardware on the device. Using this approach, if all sensor data was made available to a web application, it would be possible to eliminate the need for an on-device application altogether, essentially making the TellMe application entirely web-based.
Fig. 5. Overall TellMe Architecture
142
K. Gardiner, J. Yin, and J.D. Carswell
4.1 LocateMe The LocateMe module is used in conjunction with the DirectMe module to gather information about a user’s current position and orientation. This data is then used to execute directional queries against various data sets both internal and external to the phone. The LocateMe module is based on a hybrid positioning system that utilises GSM, Wi-Fi and Bluetooth radio signals in addition to GPS to determine location. As this is not the focus of this paper, a more comprehensive description of this technology can be found in [25]. 4.2 DirectMe The DirectMe module is one of two modules that provide data to the TellMe application. Its function is to determine the direction that the mobile device is currently pointing by using a digital compass and tilt sensors. This data is then collected on request from the TellMe application and synchronised with the location data coming from the LocateMe module. The architecture of the DirectMe module is similar to the LocateMe module where each technology (i.e. - compass and tilt sensors) has a native hardware spotter that relays data to a higher-level component synchronises data from each spotter. This architecture is illustrated in Figure 6.
TellMe Mobile Application
PyS60
Synchronization Module
Sensor Server
Acc Listener
Mag Listener
Accelerometer
Magnetometer
Fig. 6. DirectMe Design
Symbian S60
Tilt Spotter
Hardware
Compass Spotter
EgoViz – A Mobile Based Spatial Interaction System
143
This type of architecture is required in order to overcome some restrictions imposed in relation to access to various mobile device hardware components (e.g. compass and tilt sensors) from particular APIs. 4.3 Mobile Device Hardware/Sensors There are three main mobile platforms that currently provide mobile devices that contain the required sensor hardware for the TellMe system. The Nokia 6210 Navigator [26] has a digital compass and tilt sensors and the API providing access to these sensors has just (winter ’09) been back ported from Symbian S60 5th Edition to Symbian S60 3rd Edition FP2. Using this version of the API, the DirectMe module can ascertain the current heading and orientation of the device. However, there have been recent reports that the performance of these integrated sensors is of poor quality [8]. Another hardware option is the HTC Dream (a.k.a. Google phone), with integrated digital compass and accelerometers. This device runs the Android operating system from Google and is currently not available in Ireland [27]. The most recent possibility in terms of hardware suitability is the Apple iPhone 3GS (Apple, 2009). In addition to providing WiFi and accelerometer access there is now a compass available making it suitable also. With this limited number of options available in relation to acquiring orientation data from mobile devices integrated or “on-board” sensors, another option is to use external sensors packs such as the SHAKE. The SHAKE SK6 is an external sensor pack with digital compass, accelerometer, and tactile feedback. The even smaller SK7 has been released in Q1 2009. This sensor pack communicates with a mobile device via Bluetooth. Unlike current cellphone sensors, the SHAKE device has better quality sensors and a number of filters on board to reduce any noise introduced by the mobile phones antenna. The SHAKE SK7 is shown in Figure 7. However, following a review of these options, it was decided that the DirectMe module would collect data from a number of MEMS (Micro Electro-Mechanical Systems) sensors on a Nokia 6210 Navigator (Figure 7) in favor of the SHAKE as these sensors are integrated. In particular, the sensors that we use are magnetometers, which are capable of sensing the magnetic field surrounding the device. Using the magnetic field to determine magnetic north, it is possible to calculate compass bearing [8]. Furthermore, accelerometers are used to measure acceleration or the rate of change of velocity with respect to time.
5
Fig. 7. Nokia 6210 Navigator and SHAKE SK7 sensor pack
144
K. Gardiner, J. Yin, and J.D. Carswell
4.4 TellMe Server The TellMe server is essentially a spatial application server that is used to perform all the complex spatial queries in the system. It is responsible for communicating with the TellMe Mobile application, which collects data (including location, direction and orientation) from the LocateMe and DirectMe modules on the mobile device. This data is communicated wirelessly to the spatial application server and used to perform spatial queries against the Oracle Spatial 3D Database. The TellMe Server is based on ESRIs ArcGIS Server platform and is used to perform the complex queries that are required to determine the mobile spatial interaction between the users line-of-sight and the 3d database. This platform provides server extensions for many of the traditional spatial query functions found in most GIS. Using these extensions, it is now possible to perform these types of spatial queries in a mobile context that were previously only possible in a desktop setting. Another important reason for this choice of server is based around the issue of scalability; using this software ensures that the system is scalable and can manage a large number of users efficiently. 4.5 3D Database To perform directional queries, a 2D spatial database can be used and is the minimum requirement in order to do so. These databases support extensive 2D feature types and indexing techniques for performing spatial queries. 2D spatial queries can be performed effectively with a standard spatial database in an efficient manner and can identify objects (i.e. buildings) that intersect with a direction vector for example. Some types of queries possible are illustrated in Figures 10, 11 & 12. However, to perform 3D spatial queries this process becomes somewhat more complex. In comparison, a 3D spatial query should be able, for example, to not only identify what building a direction vector is intersecting with but also the floor of the building it is directed at in a 3D Euclidean space. This means a true 3D database should support three-dimensional data types such as point, line, surface and volume in its geometric data model, be capable of indexing the data and must also offer functions and operations embedded in its spatial query language that can operate on these data types [28]. In fact, these requirements significantly reduce the number of options that are available to us in terms of being able to perform these types of queries. There are two main options here. 4.5.1 Oracle 11g Beginning with the 11g version of Oracle it is now possible to store, index and query 3D objects using the sdo_geometry data type. Using this type, it is now possible to store point, line, polygon, polygon with hole and collection data types in 2D and 3D [28]. An example of the types of data that can be stored is illustrated in Figure 8 (b). 4.5.2 ESRI ArcGIS To support this rising trend in 3D data storage, ESRI developed a native volumetric geometry feature type called the Multipatch feature supported by its geo-database
EgoViz – A Mobile Based Spatial Interaction System
145
models that is treated like any other geometry type in the database. The Multipatch is constructed of triangle strips and fans and defines objects boundaries using triangular faces. This is shown in Figure 8(a)
In our case, we use Oracle 11g and its exclusively 3D forms: simple solid, composite surface, composite solid and collection to represent our data in the database. The data is comprised of 3d data that is based on Ordinance Survey Ireland’s [29] 2d vector data that has extruded using height values from airborne LiDAR scans. The result is a block model of the NUI campus and Dublin city centre. There are also some detailed building furniture models of the NUI campus buildings, illustrated in Figure 8b. This data is spatially indexed and is queried using a 3D query window generated by the data collected from the mobile device sensors. All attribute data (building name, class timetables, etc.) presented to the user is also stored in the Oracle database. To perform 3D queries an extended set of operators can be used as support for 3D data is restricted to the SDO_Filter, SDO_Anyinteract, SDO_Within_Distance, and SDO_NN (nearest neighbour) operators using the Geographic-3D coordinate system. An example of a typical 3D query or frustum query is shown in Figure 9.
Fig. 9. Spatial query using view frustum
146
K. Gardiner, J. Yin, and J.D. Carswell
5 Ego-Visibility In this section, the different types of queries that are performed by the TellMe application are discussed, all of which are based on the users egocentric point-of-view. Egocentric visibility refers to the portion of a search space that is visible to a user at a particular time based on their location, direction and orientation. For example, in the case of the TellMe application, the user’s visible query space acts as a secondary filter on data that is returned by the query processor by restricting it to contain only objects that are in the users field-of-view (FOV). The FOV therefore excludes the portions of the dataset obscured by buildings. This method is primarily used to identify points-ofinterest (POI) other than buildings that are in the users FOV within a predefined distance from the user and is illustrated in Figure 11. This method can be used to identify objects in the distance that may be too small to point at directly but are still in the users FOV nonetheless, like a monument or statue. The algorithm to determine the searchable space in this instance builds on previous work outlined in [17] where a 2D directional query processor was developed and used in a virtual environment for similar purposes. The following sections outline the queries that can be performed using the TellMe system. 5.1 Directional Query Directional querying in terms of Mobile Spatial Interaction (MSI) is a method by which a device’s position and orientation along 2-axis (horizontal and vertical) can be determined with the use of GPS and compass/tilt sensors. This data is then used to build a “query space” in the database to identify what object(s) the device is pointing at - in our case a campus building and any relevant information about it (Figure 10). When the user points a device at a building (creating a query vector), the device is able to identify what it is pointing at by determining the interaction between the query vector and the building data model stored in the database. The returned information can be communicated back to the user using visual, auditory or tactile interfaces.
Fig. 10. Identifying buildings using directional query vectors
EgoViz – A Mobile Based Spatial Interaction System
147
This initial scenario is the simplest possible example of what we aimed to achieve in our research and it assumes that a users LOS is a straight-line vector. As this is not the case, we investigated other possibilities in terms of what the user can actually see. We look at ways of representing 2D and 3D query frustums that interact with objects based on LOS (i.e. direction) and visible open space (i.e. everything a user can see from a single point in all directions) or Egocentric Visibility (EgoViz). 5.2 Field-of-View Query The field-of-view query maintains the use of the direction vector in the query process by producing a query frustum that closely represents a users actual field-of view. In contrast to the directional query, performing this type of query returns a list of results that identify not only what the user is pointing at but also everything in the users field-of-view (Figure 11). It is also possible to pre-select the layers that the user requires information about. For example, a user may only want information about the buildings that are in their field-of-view or alternatively they may want information about points-of-interest only. This categorisation of the required data helps to speed up the query process by eliminating sets of data not required. Taking this concept a step further, the idea of the visibility polygon or Isovist is introduced. A visibility polygon is the portion of open space a user can see in all directions. Being able to determine a users visibility polygon in real-time as they navigate through the campus or city streets enables the TellMe system to deliver a much richer experience in terms of the relevance of the data to the user and the speed at which the data is delivered. As the visibility polygon identifies the area the user can actually see, this generates a much smaller search space reducing the size and complexity of the spatial queries to be performed.
Fig. 11. Identifying POI objects in a user’s field-of-view
5.3 Isovist Query Recent work in this area by [18] describes the local visibility model (Lvis) that uses the concept of billboards to determine what buildings are in the users FOV in a 2.5D
148
K. Gardiner, J. Yin, and J.D. Carswell
environment. To achieve this in our case, a different approach is taken based on work carried out by [20] into the concept of Isovists and medial axes. Using Isovists, we attempt to automatically generate a panoramic (360º) line-of-sight search space for performing spatial queries on. This method should prove to be very effective as the fundamental aim is firstly to determine precisely what area in geographical terms is visible to the user in all directions and secondly to determine what objects inside this area is of interest to the user. This idea is illustrated in Figure 12.
Fig. 12. Egocentric Visibility using IsovistExplorer 360°
5.4 Frustum Query Extending the 2D Isovist to work in 3D space is a primary objective of this work. In the case of the frustum query, information about relevant artifacts that lie within the sections of open space (i.e. the space between the buildings in all directions up/down/ sideways) that has been identified as visible to the user are queried and returned.
Fig. 13. 3D Visibility Query
EgoViz – A Mobile Based Spatial Interaction System
(a)
(b)
(c)
(d)
(e) Fig. 14. Examples of 3D Visibility Queries
149
150
K. Gardiner, J. Yin, and J.D. Carswell
In this case, the second generation of the 3D Isovist is represented by a 3D object that represents a users actual LOS omitting the geometry of any other objects that interact with it (illustrated in Figure 13). Building this type of 3D Isovist query requires an extensive 3D database of buildings and building furniture at floor and room detail in order to utilise the query frustum geometry. Examples of the types of geometries used are illustrated in Figure 14. 5.5 Ego-Dome The Ego-Dome is essentially an extension of the 2D Isovist described above except the visible open space of the user from a particular location is reproduced in 3D. This concept of querying all open space in this manner has been explored previously but usually only using standalone desktop applications to do so, offline. In the case of the TellMe system, the Ego-Dome is created using the current location of the user (Figure 15). Any objects that interact with the dome are detected and the required results are presented to the user using visual, auditory or tactile interfaces. This type of spatial interaction is very useful for a host of applications in terms of real-time feedback that gives the user the ability to interact and have knowledge about an environment in real-time.
Fig. 15. Ego-dome being used to determine the visible space within a certain radius
6 Conclusions This mobile based spatial interaction system (Egoviz), which is based on the TellMe application is designed to allow users to interact in a context sensitive way with information using current mobile phone technology. Current devices are beginning to offer hardware such as digital compasses and tilt sensors. With the use of these technologies, a wide spectrum of applications is emerging. The development of the DirectMe module and its synchronisation with the LocateMe module is ongoing. Concurrently, the development of the complete TellMe mobile application is taking place and provides a framework for testing our directional query techniques.
EgoViz – A Mobile Based Spatial Interaction System
151
In relation to performing directional queries, there are a number of fundamental issues that have been highlighted in terms of the quality of the data that is retrieved from the sensors. In [10] the quality of the sensor data from a custom built sensor pack is analysed and compared to actual data showing good results for performing directional queries. With the quality of the sensor data being a pivotal aspect of directional queries in terms of the relevance of the data returned and the move towards making this type of interaction available on mainstream mobile devices, we intend to carry out two important sets of tests. Firstly we investigated the quality of selected external sensors (i.e. SHAKE and Nokia LD-4W GPS) and secondly we will compare these results with the quality of the data from the integrated sensors in the “off-theshelf” phone (i.e. Nokia 6210 Navigator). Regarding egocentric visibility, research will be carried out to help us better identify what comprises exactly a user’s FOV. Initially we have studied work by [20] in the area of spatial planning and in particular Isovists by developing a 2D EgoVis filter to determine a users true LoS in 360°. In due course, the technical feasibility of extending to a true 3D directional query processor will be investigated using the idea of the threat dome as an example.
Acknowledgments Research presented in this paper was funded by a Strategic Research Cluster grant (07/SRC/I1168) by Science Foundation Ireland under the National Development Plan. The authors gratefully acknowledge this support.
References 1. Simon, R., Frohlich, P.: The Point to Discover GeoWand. In: 9th International Conference on Ubiquitous Computing (UbiComp 2007), Innsbruck, Austria (2007) 2. Kilfeather, E., Carswell, J., Gardiner, K., Rooney, S.: Urban Location Based Services using Mobile Clients: The ICiNG Approach. GISRUK, Maynooth (2007) 3. Rooney, S., Gardiner, K., Carswell, J.: An Open Source Approach To Wireless Positioning Techniques. In: The 5th International Symposium on Mobile Mapping Technology (MMT 2007), Padua, Italy. (2007) 4. Wilson, A., Pham, H.: Pointing in Intelligent Environments with the WorldCursor. Interact (2003) 5. Wilson, A., Shafer, S.: XWand: UI for Intelligent Spaces. CHI, Fort Lauderdale, Florida (2003) 6. Chan, L.-W., Hsu, Y.-Y., Hung, Y.-P., Hsu, J.Y.-J.: Orientation-Aware Handhelds for Panorama-Based Museum Guiding System. In: UbiComp 2005 Workshop: Smart Environments and thier Applications to Cultural Heritage (2005) 7. Baillie, L., Kunczier, H., Anegg, H.: Rolling, Rotating and Imagining in a Virtual Mobile World. In: 7th international conference on Human computer interaction with mobile devices & services, Salzburg, Austria (2005) 8. Essl, G., Rohs, M.: ShaMus - A Sensor-Based Integrated Mobile Phone Instrument. In: Proceedings of the International Computer Music Conference (ICMC), Copenhagen (2007)
152
K. Gardiner, J. Yin, and J.D. Carswell
9. Williamson, J., Murray-Smith, R., Hughes, S.: Shoogle: Excitatory Multimodal Interaction on Mobile Devices. In: Proceedings of ACM SIG CHI Conference, San Jose (2007) 10. Simon, R., Frohlich, P.: GeoPointing: Evaluating the Performance of Orientation-Aware Location-Based Interaction Under Real-World Conditions. In: 4th International Conference on LBS and TeleCartography. Taylor & Francis, Hong Kong (2008) 11. Intelligent Spatial Technologies (2009), http://www.i-spatialtech.com/ipointer.html 12. Robinson, S., Eslambolchilar, P., Jones, M.: Point-to-GeoBlog: Gestures and Sensors to Support User Generated Content Creation. In: 10th international conference on Human computer interaction with mobile devices and services, Amsterdam, The Netherlands (2008) 13. Frohlich, P., Simon, R., Baillie, L., Anegg, H.: Comparing Conceptual Designs for Mobile Access to Geo-Spatial Information. In: 8th International Conference on Human-Computer Interaction with Mobile Devices and Services (MobileHCI 2006), Helsinki, Finland (2006) 14. Persson, P., Espinoza, F., Fagerberg, P., Sandin, A., Cöster, R.: GeoNotes: A Locationbased Information System for Public Spaces. Readings in Social Navigation of Information Space, 151–173 (2002) 15. Strachan, S., Williamson, J., Murray-Smith, R.: Show me the way to monte carlo: densitybased trajectory navigation. In: Proceedings of the FP 205 SIGCHI conference on Human factors in computing systems, New York, USA, pp. 1245–1248 (2007) 16. Strachan, S., Murray-Smith, R.: Bearing-based selection in mobile spatial interaction. Personal and Ubiquitous Computing 13 (2009) 17. Gardiner, K., Carswell, J.: Viewer-Based Directional Querying for Mobile Applications. In: Third International Workshop on Web and Wireless Geographical Information Systems W2GIS, Rome, Italy (2003) 18. Simon, R., Frohlich, P.: A Mobile Application Framework for the Geospatial Web. In: Proceedings of the 16th international conference on World Wide Web, Banff, Alberta, Canada (2007) 19. Benedikt, M.L.: To take hold of space: isovists and isovist fields. Environment and Planning B 6, 47–65 (1979) 20. Jiang, B., Liu, X.: AxialGen: a research prototype for automatically generating the axial map. In:11th International Conference on Computers in Urban Planning and Urban Management, Hong Kong (submitted, 2009) 21. ESRI Globe (2009), http://www.esri.com/products/index.html#4 22. Skyline Terraserver (2009), http://www.skylinesoft.com/skylineglobe/corporate/Products/ TerraExplorer.aspx 23. Mobile Web Application Best Practices (2008), http://www.w3.org/TR/2008/WD-mwabp-20080729/ 24. Google Gears API (2008), http://code.google.com/apis/gears/api_geolocation.html 25. Rooney, S., Gardiner, K., Carswell, J.: Wireless Positioning Techniques – A Developers Update. In: Ware, J.M., Taylor, G.E. (eds.) W2GIS 2007. LNCS, vol. 4857, pp. 162–174. Springer, Heidelberg (2007) 26. Sense Your Location (2008), http://www.nokia.com/A41229034 27. HTC - Dream (2008), http://htcdream.com/ 28. Schön, B., Laefer, D.F., Morrish, S.W., Bertolotto, M.: Three-Dimensional Spatial Information Systems: State of the Art Review. Recent Patents on Computer Science 2, 21–31 (2009) 29. Osi Website (2009), http://www.osi.ie
Wired Fisheye Lens: A Motion-Based Improved Fisheye Interface for Mobile Web Map Services Daisuke Yamamoto, Shotaro Ozeki, and Naohisa Takahashi Nagoya Institute of Technology, Gokiso-cho, Showa-ku, Nagoya city, Aichi pref., Japan [email protected], [email protected], [email protected]
Abstract. We propose a mobile Web map interface that is based on a metaphor of the Wired Fisheye Lens. The interface was developed by using improved fisheye views (Focus+Glue+Context map). When a user wants to obtain information on both the details of the target area and the geographical relation between the present location and the target area by using existing Web map services, he/she has to scroll maps and change the scale of the maps many times. These operations result in a large cognitive cost for users. The Wired Fisheye Lens enables users to easily search an area surrounding the present location since it has the following features: 1) The Focus+Glue+Context map enables users to view both a large-scale map (Focus) and a small-scale map (Context) without changing the scales of the maps; 2) The posture sensor enables users to search for details of the surrounding area by tilting, shaking, and looking through the fisheye lens; 3) The Focus is moved by considering it to be a fisheye lens connected with the present location by a rubber wire on the map. Even if the lens approaches the edge of the screen, it can be kept within the screen by scaling down the Context as if the lens were pulled in by its rubber wire and as if the map were a rubber sheet and pulled in by the lens. As a result, the user can easily navigate through the area surrounding the present location while keeping the Focus within the map. These features enable users to find the target quickly. We developed the Web-based mobile map system that uses commercial maps that are utilized by Yahoo Japan. We confirmed the advantages of the proposed system by evaluation experiments. The new system will be able to contribute to the novel mobile Web map services with fisheye views for mobile terminals such as cellular phones. Keywords: Fisheye views, Focus+Glue+Context, Web map service, Mobile maps.
geomagnetic sensors, and acceleration sensors enable novel mobile services such as pedestrian navigations [3, 4]. On the other hand, when users want both information on multiple areas and their relation to areas such as the present location and target area, they have to switch between multiple maps with different scales and mentally form geographical relations between these maps. These operations lead to a large cognitive cost for users. For instance, let us assume a situation in which a user searches for a landmark on a street by using a mobile map. Although it is easy to search for a target displayed within the screen, the user has to scroll and zoom the map if the target is not displayed within the screen. The user may lose sight of the present location if it becomes necessary to repeatedly scroll and zoom the map. Moreover, a small target landmark cannot necessarily be displayed when a user views a wide-area map to comprehend geographical relations. This problem becomes particularly serious in devices having small display screens. In our previous study [5], we proposed a Focus+Glue+Context (we call it F+G+C in this paper) map based on a metaphor of a fisheye lens, as shown in Fig. 1. This map enables users to simultaneously view Focus (detail and large-scale map) and Context (small-scale map). In contrast to a Focus+Context map such as fisheye views [6, 7], since Glue absorbs all distortion properly, an F+G+C map has no distortion in Focus and Context. Moreover, the existing fisheye views method generate the whole region by using the displacement function, in the F+G+C map, only Glue is generated by using the displacement function. Since the generation cost of the F+G+C map is low, therefore, a high-quality F+G+C map can be generated in real-time. In fact, we have begun the F+G+C map Web service that was developed in collaboration with Yahoo Japan. Although Web map services are supposed to be controlled mainly by the mouse, mobile terminals on which a mouse cannot be mounted are not necessarily considered. Although touch display is a good device for controlling web maps, a user cannot
Fig. 1. Focus+Glue+Context map
Wired Fisheye Lens
155
control it on mobile terminals by using one hand. Then, we proposed a motion-based mobile map interface by using posture and acceleration sensors, which are popular in recent mobile terminals. It has been proposed that a posture sensor is useful in mobile map systems. Rekimoto [8] proposed a mobile map interface in which a bird's-eye-view map can be controlled by using a posture sensor and a button. Nadia [9] proposed a mobile map interface in which a 3D map can be controlled by applying VR technologies. Since these systems cannot enable users to simultaneously view multi-scale maps, users have to grasp geographical relations from a small-scale map (Context) before they view a large-scale map (Focus). In this regard, these systems have the same problem as existing Web maps. The purpose of this study is to propose a mobile Web map service that enables pedestrians to search areas surrounding their present location easily. We need to comply with the following requirements in order to realize this goal: ─ requirement 1 The focus area should be shown in detail in order to search the target area easily. ─ requirement 2 The direction of the focus area from the present location and the distance between them should be shown correctly so that the user can understand the geographical relation between these areas easily. ─ requirement 3 The present location should be shown constantly so that it is never out of sight. ─ requirement 4 The entire target area should be shown in order to survey the target area. We propose the motion-based mobile Web map interface based on a metaphor of the Wired Fisheye Lens in order to search the surrounding area. The proposed system has the following three features, as shown in Fig. 2. ─ feature 1 The F+G+C map enables users to view both Focus (large-scale map) and Context (small-scale map) on mobile terminals in real-time. Since in the F+G+C map, the direction of a center of any focus from any point of context as well as the distance between these two points are correct, we expect that the cognitive cost of grasping geographical relations is low (correspond to requirements 1 and 2). ─ feature 2 Scrolling the position of Focus by changing the posture of a mobile terminal enables users to search a target area. By automatically controlling the scale of Context to display both the present location and the focus area constantly, users can comprehend geographical relations between these regions (correspond to requirement 3). ─ feature 3 By shaking a mobile terminal, users can control the size and scale of Focus as required. By doing so, users can efficiently search target areas where sizes are different, e.g., a large park and a small park (correspond to requirement 4). In this study, we develop and evaluate a proposed system that operates on a mobile terminal.
156
D. Yamamoto, S. Ozeki, and N. Takahashi
Fig. 2. Conceptual diagram of a metaphor of Wired Fisheye Lens
2 Related Work The Focus+Context method [11, 12], which was proposed by Furnas, is an effective method for visualizing large-scale information. This method enables a user to view the area of interest (Focus) and overall structure (Context) together by reducing data of the Context based on the degree of the interest (DOI). Applying the Focus+Context method to network maps, the fisheye-views method [6, 7] allows the user to zoom in to a part of a network map, like a fisheye lens. This method generates the map by using a displacement function. In addition, Skopik [13] validated the memorability of fisheye views-type network maps. Since the main purpose of these studies is to propose the visualization methods of the fisheye views, in contrast to our method, these studies have not considered the mobile interfaces using posture sensors. In some studies, a map was controlled in the mobile terminals by using posture sensors. Rekimoto [8] proposed a mechanism that enables users to select and zoom target area in a bird's-eye-view map by tilting a mobile terminal by pushing a button. In contrast to the fisheye map, in this mechanism, a user has to search for a target area using a wide-area map before he/she views a detailed map of the target many times. Although Harrie [14] proposed the variable-scale method of presenting geodata for personal navigation using small-display mobile devices by applying a variable-scale mapping function, they did not propose a control method of mobile maps. Gutwin [15, 16, 17] proposed a method that enables users to view Web pages by using the Focus+Context method in a mobile terminal. Because Web pages for PC are not developed for small displays, in this method, the zoom in the focus area increases and that in the context area decreases. Although users can view Web maps by using this mechanism, this interface does not consider the characteristics of maps. In addition, there have been many studies on variable scale maps. Fairbairn [18] proposed a variable scale map that progressively zooms up from the periphery to the center of the map. Guerra [19] proposed digital tourist city-maps that transform variable scale maps according to grids. Takahashi [5] proposed the Pull method that pulls
Wired Fisheye Lens
157
in faraway map objects by transforming a map according to grids. Although these methods can transform a map freely, they do not consider the problem of the degree of interest (DOI), unlike the Focus+Context method. Moreover, there are some navigation methods based on camera metaphors. Guiard [20] and Fukatsu [21] proposed a method for navigating objects by using camera operations such as pan, zoom, and tilt. These mechanisms cannot be used to control multi-scale maps.
3 Proposed System In this section, we describe the Focus+Glue+Context map, a metaphor of the Wired Fisheye Lens, and system architectures. 3.1 Focus+Glue+Context Map An F+G+C map has Focus, Glue, and Context, which are based on cognitive maps [10]. As shown in Fig. 1, Focus is a large-scale map area that enables users to comprehend the details of the focus area, Context is a small-scale map area that enables users to comprehend global relations, and Glue shows the roads that connect Focus with Context. In contrast to the existing fisheye views [6], in the F+G+C map, there is no distortion in Focus and Context because Glue absorbs all distortion. Then, Glue is compressed by a large amount in the direction from Focus to Context. Major roads, rails, and the roads that run down from the area in Focus into the area in Context are drawn selectively in order to reduce the density of the roads in Glue. In addition, in the F+G+C map, both the direction of center of any focus from any point of context and the distance between these points are correct. If the target object is centered in Focus, we can comprehend geographical relations between Focus and Context directly. Moreover, in the F+G+C map, the calculation cost is lower than that in the existing fisheye views for maps, where the whole region has to be transformed. Although Glue must be dynamically generated according to its shape, the advantage of Focus and Context is that there is no need for dynamic generation. Therefore, by generating the Focus and Context maps in advance, we can generate the F+G+C map in real-time. 3.2 A Metaphor of Wired Fisheye Lens Although the F+G+C map has several advantages, it has a problem in that ordinary people may not easily control the map. For instance, the F+G+C map has six degrees of freedom: users have to independently control the position (x- and y-axes), size, and scale of the Focus; the width of the Glue; and the scale of the Context. Therefore, users need to perform complex operations to control the F+G+C map in a suitable manner. This problem becomes particularly serious when users have to view multiple
158
D. Yamamoto, S. Ozeki, and N. Takahashi
maps by changing scales. In order to easily control the F+G+C map, we need to develop intuitive operation methods that do not require the control of these parameters. Therefore we propose the Wired Fisheye Lens metaphor to control the F+G+C map easily. This metaphor enables users to control the F+G+C map easily as if the Focus were the lens put on the paper map. The proposed system can be used to realize a metaphor of the Wired Fisheye Lens, as shown in Fig. 2 and Fig. 3, in order to search maps visually. This metaphor is based on a fisheye lens that connected with present location by a rubber wire is on the map. This metaphor enables the three following operations that correspond to features stated in section 1: ─ Look: Users can view a large-scale map through the fisheye lens (correspond to feature 1). ─ Roll: Users can scroll the position of the Focus by tilting the mobile terminal. In other words, the Focus is moved by considering it to be a fisheye lens connected with the present location by a rubber wire on the map. Even if the lens approaches the edge of the screen, it can be kept within the screen by scaling down the Context as if the lens were pulled in by its rubber wire and as if the map were a rubber sheet and pulled in by the lens. As a result, the user can easily navigate through the area surrounding the present location while keeping the Focus within the map (correspond to feature 2). ─ Shake: By shaking the fisheye lens on the map, the size or scale of the fisheye lens can be changed (correspond to feature 3). By tilting the mobile terminal, the fisheye lens (Focus) is moved. As shown in Fig. 3-a, the fisheye lens can move freely unless the wire is stretched. However, when the wire is stretched, the lens protrudes from the display, and the wire generates an elastic force, as shown in Fig. 3-c. When the elastic force exceeds a threshold and the Focus is moved out of the screen, the Focus (fisheye lens) is shown within the display by scaling down Context (map), as shown in Fig. 4. In other words, Focus is moved as though there is a force F that rolls the fisheye lens and an elastic force K that pulls in the map according to gravity g, as shown in Fig. 5.
Fig. 3. A metaphor of Wired Fisheye Lens. a) Wire is slack. b) Wire is just stretched. c) Wire is stretched further.
Wired Fisheye Lens
159
Fig. 4. A mechanism for displaying both the present location and Focus. a) Focus protrudes from the display. b) By scaling down Context, Focus and present location are displayed.
Fig. 5. Forces in Wired Fisheye Lens
3.3 System Architecture The proposed system that we developed is based on a client-server model. This system consists of a Web map server and a mobile terminal that has a geomagnetic sensor and an acceleration sensor built-in, as shown in Fig. 6. We consider a mobile terminal consisting of a small tablet PC, PDA, and cellular phone. A map is displayed in the Web browser using Adobe Flash technologies. The Web map server was developed by using Java Servlet. We utilized a six-axis geomagnetic/acceleration sensor1 , which is a popular sensor for cellular phones. This sensor outputs three-axis posture data (roll, pitch, yaw) and three-axis dynamic acceleration data (ax, ay, az), as shown in Fig. 7. In other words, we can recognize the direction, tilts, and accelerations of the terminal mounted with this sensor. These data are mutually independent. In addition, to prevent unexpected behavior, this sensor does not output data until a user pushes the sensor button, as shown in the lower left corner of Fig. 8. A prototype system is shown in Fig. 8.
1
Aichi Micro Intelligence AMI603SD.
160
D. Yamamoto, S. Ozeki, and N. Takahashi
Fig. 6. System architecture
Fig. 7. Outputs of geomagnetic/acceleration sensor. Three-axis posture data (roll, pitch, yaw) and three-axis dynamic acceleration data (ax, ay, az) are output.
Fig. 8. A snapshot of the prototype system
4 Proposed Method In this section, we propose the algorithms of the Roll and the Shake operations.
Wired Fisheye Lens
161
4.1 Roll Algorithm in Wired Fisheye Lens In the Roll operation, a user can scroll a map simply by tilting a mobile terminal toward the direction in which the user wants to move. In addition, the scale of Context is automatically changed according to the distance between the present location and focus area in order to display both the present location and the focus area on the screen. However, if a large-scale map is displayed on a small screen, we cannot comprehend the details of map in general. Then, we put a Focus (a fisheye lens) on the map to show detail of focus area. In specific, we developed the system as follows. As shown in Fig. 9, let us define a tilt angle in the direction of dip of a terminal as t, where t>0. And let us define a unit direction vector in the horizontal direction of dip of a terminal as R. th1, th2, and k are constants, where k>0, 0
Fig. 9. Tilt and direction of a mobile terminal
162
D. Yamamoto, S. Ozeki, and N. Takahashi
Fig. 10. Procedure for displaying the far area; in a), b), and c), Context is a 1:70000 scale map, and Focus is a 1:25000 scale map; in d) and e), Context is a 1:110000 scale map, and Focus is a 1:25000 scale map
Algorithm 1. Roll Algorithm Loop If isDisplay(F,P) then If th1
Wired Fisheye Lens
163
This method has the advantage that users can view a detail of a far area at the same time as confirming geographical relations from the present location. Since the direction of a center of Focus from any location of Context and the distance between these points is correct in the F+G+C map, we expect that users can comprehend geographical relations easily. 4.2 Shake Algorithm in Wired Fisheye Lens If Focus is too small to view the whole target region, users can view it easily by increasing the radius of Focus. Further, if the scale of Focus is too small to view details of the target, users can view it by increasing the scale of Focus. We propose a mechanism by which users can control the size and scale of Focus by shaking a mobile terminal. In specific, we developed the system as follows: ─ When a sensor outputs acceleration to the right, the radius of Focus increases stepwise. When a sensor outputs acceleration to the left, the radius of Focus is reduced stepwise. ─ When a sensor outputs forward acceleration, the scale of Focus increases stepwise. When a sensor outputs backward acceleration, the scale of Focus reduces stepwise. Here, the scale of Focus is larger than that of Context. Although users can control the radius of Focus, how to define the radius of Focus is a problem. Although the radius of Focus should ideally be defined according to the shape for the target area, it is difficult. Therefore, we simply utilized the switching method in which the radius of Focus is alternated between 100 pixels and 240 pixels.
5 Experimental Result We performed experiments on the proposed system. The subjects were 20 university students. The window size of the system is 800 x 400 pixels, which is a popular size in Japanese mobile phones. We use the surrounding areas A (Kyoto station), B (Nara station), and C (Hakata station). Each area is a famous Japanese sightseeing area. The subjects search the locations of the three facilities in each area. In this experiment, to simplify the problem, we only utilize the 1:25000 and 1:70000 scale maps. Although the 1:70000 scale map provides users with only outlines of parks, a 1:25000 scale map provides users with the names and locations of structures within the park. In specific, users have to perform the following series of tasks many times: search candidate parks using the 1:70000 scale map and confirm the details of each park using the 1:25000 scale map. The area of the search range is within 1600 x 960 pixels in the 1:70000 scale map. The default radius of Focus is 100 pixels. We used the GIGABYTE M704 mobile terminal. The frame rate of this system is 4 fps, which is also confirmed to work appropriately in the communications zone of the cellular phone network in Japan (FOMA, 3Mbps).
164
D. Yamamoto, S. Ozeki, and N. Takahashi
In this experiment, we compared method 1, method 2, and method 3. ─ method 1 Users can scroll normal maps by tilting the mobile terminal. As shown in the lower right corner of Fig. 11, a small 1:500000 scale map enables users to comprehend geographical relations. Users can also switch between the 1:25000 and 1:70000 scale maps by shaking the terminal. (existing method) ─ method 2 Users can control the F+G+C map by Look and Roll operations, as shown in Fig. 12. (sub-system of the proposed method) ─ method 3 Users can control the F+G+C map by Look, Roll, and Shake operations as shown in Fig. 12 and 13. Users can alternate the radius of focus between 100 pixels and 240 pixels. (proposed method) We measured the time required by users to search for three structures within the parks around the present location for each method. The number of parks in each area is listed in Table 1. We categorize parks as follows. A park that lies within a Focus of radius 100 pixels is known as a small park, a park that lies 70% or more within its Focus is known as a medium park, and parks bigger than medium parks are known as big parks.
Fig. 11. An interface in method 1; the map in the lower right corner has a scale of 1:500000; the scale of context is 1:70000
Fig. 12. Interfaces in method 2 and method 3; the radius of Focus is 100 pixels; the scale of Context is 1:70000, and the scale of Focus is 1:25000
Wired Fisheye Lens
165
Fig. 13. A interface of method 3. Size of radius of Focus is 240 pixel. Context is 1:70000, Focus is 1:25000.
In addition, the subjects record the direction of the found structures from the present location as well as the distance between the two points. We consider the counterbalance to avoid the influence of the experimental order of area and method. We performed a total of 60 experiments, for three regions per subject. Subjects practice using the system for 5 minutes before each experiment. Let us compare methods 1, 2, and 3. First, Fig. 14 shows the average times required to find the target object in each method. In the case of All area, the time required in method 2 was 23% less than the time required in method 1, which was statistically significant at the 5% level. Table 1. Number of parks for each area
area A area B area C
small parks 19 13 6
medium parks 2 0 2
large parks 0 3 3
Total 21 16 11
Table 2. Number of Shake operations for each method
method 1 method 2 method 3
area A 22.4 0 16
area B 19.2 0 7.4
area C 19.8 0 5.8
All area 20.5 0 9.7
However, the difference between methods 2 and 3 is not statistically significant at the 5% level. We think that this result is influenced by area A for the following reason. Although area A has many small parks but no big park, areas B and C have some big parks, as listed in Table 1. In addition, although method 3 is effective for an area that has big parks, such as areas B and C, method 3 is not effective for an area that has
166
D. Yamamoto, S. Ozeki, and N. Takahashi
many small parks, such as area A, as shown in Fig. 14-A, B and C. Since a large Focus compresses Glue area significantly and parks in Glue can not be displayed, users could not locate some small parks within Glue in area A when method 3 was used. In fact, in the case of areas B and C, method 3 involved 39% less time than method 2, which was statistically significant at the 1% level. In total, method 3 is approximately 2 times faster than method 1 in areas B and C. Therefore, these results suggest that method 3 is more effective than method 1 and method 2 for areas B and C.
Fig. 14. Average time required for each method
Fig. 15. Rate of finding targets until specified times for each method
Next, Fig. 15 shows the rate at which users found the target objects until specified times (1 min, 2 min ... 8 min) for areas B and C. Only approximately 70% of target objects could be found within 3 minutes when methods 1 and 2 were used; in contrast, over 90% percent of target objects could be found when method 3 is used. This result suggests that method 3 is superior to methods 1 and 2. In addition, Table 2 lists the number of the Shake operations in each method. In contrast to method 2, method 1 and method 3 have to do the Shake operation by shaking the mobile terminal. Since the Shake operation is the operation that users change the size of Focus or the scale of the map consciously, the cognitive cost of that is
Wired Fisheye Lens
167
bigger than the Roll or the Look operations. Therefore, the number of the Shake operation is one of the barometers of the cognitive cost for users. Then, we compare the number of the Shake operations for each method. Method 1 involves 20.5 shake operations which change the scale of the maps on average. In contrast to method 1, method 2 involves no operations and method 3 involves only 9.7 shake operations related to the size of Focus. These results suggest that proposed methods enable users to find the target with lower cognitive cost than method 1.
Fig. 16. Average errors in recognition of the direction and distance. a) Direction errors (degrees), b) Distance errors (percent).
Moreover, Fig. 16 shows human errors in recognition of the direction of the target areas from the present location and the distance between the two points. There is no difference between methods 1, 2, and 3, which are statistically significant at the 5% level. Although methods 2 and method 3 have distortion in Glue, these results suggest that the cognitive cost of the F+G+C map is approximately the same as that in the normal map. Furthermore, we have to comprehend geographical relations in order to find the target quickly; Fig. 14 suggests that it is easy to grasp geographical relations by using method 3. We will confirm the detailed cognitive cost associated with geographical relations in a future study. Finally, we used the following questionnaire with a 5-stage Likert scale for areas B and C. Note that 5 denotes strongly agree, and 1 denotes strongly disagree. ─ Usability: Is this method easy to use? ─ Viewability: Is this map easy to view? ─ Connectivity: Do you feel that there is no gap between the 1:25000 scale map and 1:70000 scale map? ─ Stress: Do you feel stress when using the system? Fig. 17 shows the result of questionnaires. In terms of Usability and Viewability, method 3 is superior to method 2, which is statistically significant at the 5% level. This result suggests that the Shake operation is effective. Although methods 2 and 3 involve Glue, which is too compressed, method 2 (4.0) and method 3 (3.8) are superior to method 1 (3.3) in terms of Connectivity. The Stress in each method is not so good (2.7 or less on an average) because the experimental task is too complex, i.e., finding the target object within a short time. Therefore, all numbers are 4 or less on an average.
168
D. Yamamoto, S. Ozeki, and N. Takahashi
Fig. 17. Results of questionnaire for each method. Note that 5 denotes strongly agree, and 1 denotes strongly disagree.
Although the F+G+C map is utilized in method 3, which is not common, in short, this method has a significant advantage compared to methods 1 and 2 in terms of both the elapsed time and the questionnaire results. In particular, the Shake operation is effective for areas B and C. However, we found that there is a problem in the Shake operation leads to small structures being ignored. To solve this problem, we have to control the sizes of Focus and Glue such that they are as small as possible. We will have to propose a method to do so in our future study.
6 Conclusion We proposed a novel mobile map interface based on a metaphor of the Wired Fisheye Lens. This method enables users to effectively search for areas surrounding a location by using Look, Roll, and Shake operations. In particular, we obtained the following results by developing and testing the proposed system. First, it was found that the proposed method enables users to find targets approximately 2 times faster and by half operations than the existing method. Second, it was found that the Shake operation–––the operation by which users can interactively switch the size of Focus–––is particularly effective for searching areas that include both large parks and small parks. It follows from these results that the proposed method is superior to the existing method. However, the following tasks must be completed in the future. We have to propose the method for controlling the size of Focus according to the shape of the target object properly. Since Glue is significantly compressed, we have to propose a method for displaying objects in Glue properly. Moreover, in order to enable more complex operations, we will discuss combination with touch displays. Furthermore, from the viewpoint of the cognitive map, we will examine the usability of the proposed system. In addition, we will develop a system that can operate in a common cellular phone such as an iPhone [3], and we will make this system available. Our system will be able to contribute to the novel mobile Web map services with fisheye views for mobile terminals.
Wired Fisheye Lens
169
Acknowledgment In the development of the prototype system, we were supported by Yahoo Japan Corporation. We would like to thank Yahoo Japan Corporation. This work was also supported by JSPS KAKENHI 20509003.
References [1] [2] [3] [4] [5]
[6]
[7]
[8]
[9]
[10] [11] [12]
[13] [14] [15] [16]
Google Maps (2009), http://map.google.com/ Yahoo! Maps (2009), http://map.yahoo.com/ Apple: iPhone (2009), http://www.apple.com/iphone/ Arikawa, M., Konomi, S., Onishi, K.: Navitime: Supporting pedestrian navigation in the real world. IEEE Pervasive Computing 6(3), 21–29 (2007) Takahashi, N.: An elastic map system with cognitive map-based operations. In: Peterson, M.P. (ed.) International Perspectives on Maps and the Internet. Lecture Notes in Geoinformation and Cartography, pp. 73–87. Springer, Heidelberg (2008) Sarkar, M., Brown, M.H.: Graphical fisheye views of graphs. In: Proceedings of the SIGCHI conference on Human factors in computing systems, pp. 83–91. ACM Press, New York (1992) Sarkar, M., Snibbe, S.S., Tversky, O.J., Reiss, S.P.: Stretching the rubber sheet: a metaphor for viewing large layouts on small screens. In: Proceedings of the 6th annual ACM symposium on User interface software and technology, pp. 81–91. ACM Press, New York (1993) Rekimoto, J.: Tilting operations for small screen interfaces. In: Proceedings of the 9th Annual ACM Symposium on User Interface Software and Technology, pp. 167–168 (1996) Magnenat-Thalmann, N., Peternier, A., Righetti, X., Lim, M., Papagiannakis, G., Fragopoulos, T., Lambropoulou, K., Barsocchi, P., Thalmann, D.: A virtual 3d mobile guide in the intermedia project. The Visual Computer 24(7-9), 827–836 (2008) Tolman, E.C.: Cognitive maps in rats and men. The Psychological Review 55(4), 189– 208 (1948) Furnas, G.W.: Generalized fisheye views. In: Proceedings of the {SIGCHI} conference on Human factors in computing systems, pp. 16–23. ACM Press, New York (1986) Lamping, J., Rao, R., Pirolli, P.: A focus+context technique based on hyperbolic geometry for visualizing large hierarchies. In: Proceedings of the Conference on Human Factors in Computing Systems, pp. 401–408 (1995) Skopik, A., Gutwin, C.: Finding things in fisheyes: memorability in distorted spaces. In: Proceedings of the Graphics Interface 2003, pp. 67–75 (2003) Harrie, L., Sarjakoski, L.T., Lehto, L.: A mapping function for variable-scale maps in small-display cartography. Journal of Geospatial Engineering 4(2), 111–123 (2002) Gutwin, C., Fedak, C.: A comparison of fisheye lenses for interactive layout tasks. In: Proceedings of the Graphics Interface 2004, pp. 213–220 (2004) Gutwin, C., Skopik, A.: Fisheye views are good for large steering tasks. In: Proceedings of the SIGCHI 2003 conference on Human factors in computing systems, pp. 5–10 (2003)
170
D. Yamamoto, S. Ozeki, and N. Takahashi
[17] Gutwin, C., Fedak, C.: Interacting with big interfaces on small screens: a comparison of fisheye, zoom, and panning techniques. In: Proceedings of Graphics Interface 2004, pp. 145–152. ACM, New York (2004) [18] Fairvairn, D., Taylor, G.: Developing a variable-scale map projection for urban areas. Computers and Geosciences 21(9), 1053–1064 (1995) [19] Guerra, F., Boutoura, C.: An electronic lens on digital tourist city-maps. In: Proceedings of the 20th International Cartographic Conference, pp. 1151–1157 (2001) [20] Guiard, Y., Chapuis, O., Du, Y., Beaudouin-Lafon, M.: Allowing camera tilts for document navigation in the standard GUI: a discussion and an experiment. In: Proceedings of the working conference on Advanced visual interfaces, pp. 241–244 (2006) [21] Fukatsu, S., Kitamura, Y., Kishino, F.: Manipulation of viewpoints in 3d environment using interlocked motion of coordinate pairs. In: Proceedings of the INTERACT 2003, pp. 327–334 (2003)
Automatic Generation of an Adaptive WebGIS Sergio Di Martino1, Filomena Ferrucci2, Gavin McArdle3, and Giacomo Petillo1 1
University of Naples “Federico II”, via Cinthia – I-80126 Napoli, Italy [email protected] 2 Dipartimento di Matematica e Informatica (DMI), University of Salerno, Via Ponte Don Melillo, I-84084 Fisciano (SA), Italy [email protected] 3 School of Computer Science and Informatics, University College Dublin, Belfield, Dublin 4, Ireland [email protected]
Abstract. With the continuous increase in the availability of WebGIS platforms and on-line spatial data, information overload within the spatial domain is becoming a critical issue. In order to address this and assist users to understand and manage the large amount of geospatial data available to them, techniques to personalise this content are being introduced in WebGIS applications. By monitoring users as they interact with map data, inferences can be made regarding their preferences and interests by generating a user profile that can be exploited to adapt content. However, to date, the development of adaptive WebGIS solutions is a difficult task as the practitioner has to deal with many different and non-trivial technologies. To address this issue, in this paper we propose an approach to automatically generate WebGIS applications able to recommend content to users and adapt the interface of a web-based spatial map. The approach is based on an extension of WebML (Web Modeling Language) and the supporting tool WebRatio. Keywords: Information Overload, GIS, Personalization, Adaptive Interfaces, Location Based Services.
devices have limitations which traditional desktop mapping systems do not encounter. For example, portable devices such as mobile phones and digital assistants have a reduced screen size which caps the amount of space available for displaying information. Furthermore, such devices have a reduced processing power and are not capable of dealing with the same amount of data as a desktop computer can. Additionally, they rely on wireless data transfer which can be slow and costly. As a result of these limitations it is essential that the quantity of irrelevant information delivered to these devices, and presented to the user is reduced, while the quality of that which is displayed is improved. Such information personalization and adaptation has the capacity to significantly improve the user interaction experience with GIS and mapping software. The challenge faced by researchers in this area involves understanding the user and their specific requirements in order to produce relevant map content and recommendations. This can be achieved using two distinct techniques. A basic approach is to explicitly ask the user to confirm their level of interest in elements on a map or to rate the relevance of geospatial content. With this information a profile of that user can be generated and used to personalize content in the future. While this technique produces an accurate profile, it requires input from the user. Studies indicate implicit interest indication techniques are favored as they are unobtrusive and do not disrupt the user from completing the task at hand [3]. Implicit interaction monitoring builds a user profile based on previous interactions with content. Over time this can produce an accurate representation of the user interests which can be utilized to personalize content. Despite the perceived simplicity of these approaches, creating an adaptive GIS is extremely difficult. Various technologies are required to work together to produce the adaptive GIS. Technologies including the user interface, the recommender system, the profile generator and the interaction recorder all need to operate in unison. Combining such technologies poses a major hurdle for developers. This is particularly true when dealing with WebGIS, i.e.: geographical systems that can be accessed via a standard browser, since the developer also has to deal with the typical challenges of the web domain, such as the inability of the server to “push” contents and updates to the client. As a consequence, to date, these techniques have to be developed and incorporated manually, since no supporting tools exist. To address this issue, in this paper we describe a new technique for generating adaptive GIS applications by harnessing the power of WebML (Web Modeling Language) [2] and WebRatio[1]. The former is a high-level, visual language specifically conceived to design “data-intensive” Web applications, which rely mainly on data management and movements [1]. The latter is a CASE tool, which effectively supports WebML, allowing for the automatic generation of Web applications starting from a visual specification. The power of this approach has been successfully applied in many different contexts, from industry to academia, with positive results. The research presented in this paper extends existing applications which utilise WebML and WebRatio for generating WebGIS applications based on GM [8][5][6][4][7]. This paper reports on how we have improved the visual formalism of WebML and the capabilities of WebRatio for automatically generating adaptive
Automatic Generation of an Adaptive WebGIS
173
WebGISs using implicit interaction monitoring of user actions to build user profiles. The ability to automatically generate a WebGIS is extremely beneficial. With this functionality it is possible to begin addressing the prevalent problem of information overload by adding adaptivity to map-based applications by monitoring user interactions and offering recommendations of suitable content. The remainder of the paper is organized as follows, Section 2 recalls the main concepts of WebML, WebRatio, and GM, and describes how they have been integrated in previous work for the automatic generation of WebGIS applications. In Section 3 we describe the proposed modeling concepts and notations which extend WebML for adaptive WebGIS applications. In Section 4 we present an example of the proposed approach in action through the description of a case study. Finally some concluding remarks are presented in Section 5.
2 Background The solution proposed in this paper exploits both the expressiveness of WebML in describing a Web application with complex behaviors and the exhaustiveness of the GM API [10] in realizing WebGIS applications. By integrating such capability within a homogeneous framework, we have extended WebRatio, obtaining a CASE tool that provides developers with a visual environment to design and then automatically generate adaptive WebGIS applications. In the following sections we introduce some basic concepts of WebML and the Google Maps APIs which are useful to describe the extensions proposed in this paper. 2.1 WebML: A Modeling Language for Web Applications WebML is a modeling language suited to support users in designing data-intensive Web applications [2] by providing them with a set of visual notations to model the content, structure, data, and navigational aspects of a Web application. The design of a Web application is based on two orthogonal perspectives: data and navigation. The former is usually described by means of the well-known Entity- Relationship (E-R) data model, which characterizes all the relevant entities and relationships. The latter is specified by the WebML Hypertext Model, which describes how the previously defined contents should be arranged and provided within the Web application. In particular, the application structure and appearance are defined through four main levels of abstraction: site views, areas, pages and units [2]. Moreover, the semantics and syntax of notations allow WebML creators to propose a specifically conceived commercial CASE tool, WebRatio, an Eclipse Plug-in, which automatically generates a J2EE-based Web application. WebRatio allows applications to be developed from simple design models and effectively reduces the time and costs of software development. In [5]-[7] an extension of both WebML and WebRatio is proposed which supports designers in generating Web-based GIS applications. In particular the extension deals with complex, expensive, and voluminous geographic data. In particular, recognizing WebGIS applications as special cases of data-intensive Web systems, suggested employing the WebML approach for defining a development process which took into
174
S. Di Martino et al.
account the complex nature of geographic data. Moreover, new ad hoc units were introduced to enable the modeling of the functionality which characterizes those applications. 2.2 An Extension of WebML for Generating a Google Map-Based WebGIS In [8], the approach described above was specialized by exploiting the GM API for the automatic generation of WebGIS applications. This choice was motivated by the observation that GM is becoming the de-facto standard for web applications dealing with spatial information. Indeed, GM is a freely available web mapping application provided by Google that allows the search and visualization of geographic information. It provides both geo-referenced satellite images and vector maps, enhanced by a number of additional informative layers, related to points of interest such as restaurants, monuments and hotels. Moreover, the integration of many state-of-the-art web technologies, like Ajax, within the GM interface has led to a smart and easy interaction, which involves the drag&drop mechanism for moving and selecting items, the scroll wheel to zoom in and out, the drag of a map for panning, and the double click to zoom and center the viewpoint. It is worth noting that all these metaphors are widely accepted for traditional desktop applications, but their diffusion in Web applications is still limited, due to well-known technical difficulties. Finally, in order to improve the reactivity of the application to the user panning, GM splits a map into a set of tiles that are smartly pre-cached and represent the basic unit of a map. To merge GM with WebML/WebRatio, in [8], the following set of WebML Units were defined: – The MultiMap Unit, meant to represent the GM viewer, i.e. a graphic component within a Web browser capable of rendering geographical data arranged in layers. – The Marker Unit, meant to highlight, on the map, one or more locations, starting from their real address. – The GetDirection Unit, allowing users to calculate a route between two addresses, and to highlight it on the map. In Figure 1 there is an example of a WebML model, designed within WebRatio, exploiting the units described above. In particular, it depicts a specific section of a website, made up of two pages (the bigger boxes in the right panel): – a Home page that contains a single unit (the icon within the page), an instance of an Index unit, to specify that the Home Page has external links to other pages of the web site, and among them, one is pointing to the Map Page. – a Map Page containing the three new units, namely TheMap instance of the MultiMap Unit, the Marker instance of the Marker Unit and, finally, GetDirection1, an instance of the GetDirection Unit. Given this WebML model, designed within WebRatio, pressing “Generate” in the Eclipse button panel leads to the automatic generation of a Web application. The result of this operation is depicted in Figure 2.
Automatic Generation of an Adaptive WebGIS
175
Fig. 1. The WebML-based schema for the proposed example
Fig. 2. The User Interface of the generated WebGIS application
Indeed, this WebGIS provides the panning and zooming functionality, the map control buttons, the button for choosing the map type (satellite, raster, etc...) and the overview of the map. Moreover, users may insert and fix markers on the map through the Insert Marker link. Finally, through the GetDirections button, it is possible to calculate the route path connecting any arbitrary address (the city of Salerno in our example) and the campus of the University of Salerno or vice versa. Building on this
176
S. Di Martino et al.
work, we have defined new units in order to support the automatic generation of an adaptive WebGIS. In the following sections these new units are described in detail.
3 Extending WebML/WebRatio for Generating Adaptive Web GIS Applications The aim of the research presented in this paper is to provide a web designer with a tool capable of automatically generating an adaptive WebGIS. After implicitly monitoring user actions over a period of time, the system can infer user interests and provide the user with an adapted and personalised interface. The adaptation makes use of geographical layers so that only those layers which are deemed informative and relevant to the user are displayed. Furthermore, the user will receive a predefined map center and zoom level based on their previous map interactions. To achieve the automatic generation of this type of application using WebRatio, it was necessary to initially extend the underlying WebML visual language by defining some new units. In particular, we defined the following set of additional units: 1. 2. 3.
The DataSource Unit, useful to add any informative layer onto the map, given a database containing spatial information; An extension of the Multimap defined in [8], to implicitly monitor user actions, and, once a user profile is generated, to show an adapted interface; An Analysis Unit, to elaborate the collected information on user actions and generate the adaptive user profile.
Fig. 3. The DataSource Unit, in WebRatio
Automatic Generation of an Adaptive WebGIS
177
Table 1. Properties of the DataSource Unit Property Name Entity
KML Compliant
Lat
Description the name chosen by the WebGIS designer for the DataSource SubUnit instance the name of the table in the connected database containing the information of interest a parameter the developer can set to specify if the selected table is KML Compliant. A database table is KML-Compliant if it has the following attributes: Lat (latitude of the spatial element), Lng (longitude of the spatial element), HTML (HTML fragment used to render the spatial element), Title (caption text to describe the spatial element). If the table specified in the property “Entity” is KMLCompliant, the DataSource Unit automatically extract these information, and the following 4 fields are disabled. Otherwise, the designer has to specify them the name of the table attribute containing the information about the latitude of the spatial element
Type STRING
Mandatory YES
STRING, chosen from a drop-down list Boolean
YES
STRING
NO; Default OFF
=
YES if the datasource is not KML Compliant YES if the datasource is not KML Compliant NO
Lng
the name of the table attribute containing the information about the longitude of the spatial element
STRING
HTML
the name of the table attribute containing the HTML fragment to show associated with the spatial element the name of the table attribute containing the information about the caption to associate to the spatial element
STRING
the name of the table attribute that contains the information about the type (e.g: restaurant, hotel, etc…) of the spatial elements a list of types of spatial elements to show. The type must be specified in the attribute specified in the previous property “Feature Attribute”. The Designer can define a list of types, such as “Museums, Post Offices”, etc… a parameter the designer can use to limit the number of tuples extracted from the database. This is useful if the designer is experiencing performance limits. In this case, they can specify a threshold (e.g.: 5000) of spatial elements to extract
STRING
YES if the datasource is not KML Compliant NO
STRING
NO
INT. 0 means no limits
NO
Title
Feature Attribute ToVisualize
Limit
STRING
178
S. Di Martino et al.
3.1 The DataSource Unit This new unit provides additional informative layers on the map in the Web page when given any remote spatial data source that can be accessed via TCP/IP – JDBC. Indeed, the designer can add any (even 3rd party) georeferenced layer within their project to enhance the amount of information provided to the user. To this aim, the designer has to accomplish the following tasks: 1. 2. 3. 4.
Connect to a JDBC-compliant database to add it in the WebRatio DataModel; Add a DataSource Unit in the HyperMedia Model, and link it to the Map Unit where the layer will be displayed (as shown in Figure 3); Specify the Entity of the connected database containing the spatial information of interest; Optionally, specify filters and additional properties to customize the information to extract.
Figure 3 illustrates the icon of a DataSource Unit linked with a MultiMap, within WebRatio. As we can see, this unit is characterized by the properties described in Table 1. The code obtained from the generation of a DataSource Unit leads to the web page shown in Figure 4.
Fig. 4. Rendering of a new informative layer onto the map
As seen in Figure 4, the output is a set of checkboxes on the left of the map, with the informative layers the designer has chosen (in this example: Museums and Post Offices). The user can choose to selectively enable or disable these themes. Pinpoints are shown on the map for each tuple in the database matching the filtering criteria (in our case, all the Museums in the area of Naples, Italy).
Automatic Generation of an Adaptive WebGIS
179
3.2 Extending the MultiMap to Monitor User Actions The next unit we developed is an extension of the MultiMap defined in [8], able to implicitly monitor user actions, with the aim of inferring information about user interests. In particular, the logged actions include the zoom, the pan of the map and the clicks on the Points of Interest (markers). These three operations were chosen because they are by far the most used actions when interacting with GIS applications while also forming the set of the most basic map controls [16]. To avoid confusion for the designer, we choose to model the logging task as a SubUnit of the MultiMap rather than as a stand-alone unit. This helps to enforce the concept that the Map is responsible for monitoring the operations. Please note that in WebML, a SubUnit is a unit contained within another unit, in a hierarchical fashion. After the user profile is generated by the unit described in the next section, the MultiMap can be fed a new layer, containing only the markers which the algorithm has identified as potential interests of the user. A representation of the SubUnit within WebRatio, with the associated properties, is depicted in Figure 5 and characterized by the properties described in Table 2.
Fig. 5. The Monitor SubUnit within the MultiMap
This unit of course does not introduce any modification in the visual representation of the Map in the web browser, but simply adds a script to catch all the JavaScript events and sends them back to the web server for storing in the appropriate database tables. To optimize the connection bandwidth and the latency times, some optimizations have been introduced. In particular, to avoid an overwhelming amount of information being sent back to the server, due to a continuous zoom operation through the scroll wheel of the mouse for example, a new event is recorded only if there were passed 500ms from the previous one. At the same time, a click on a marker is caught only if the selected item is different from the previous (to avoid logging erroneous double clicks).
180
S. Di Martino et al. Table 2. Properties of the Monitor SubUnit
Property Name Zoom Entity Click Entity User Suggests
Description the name chosen by the WebGIS designer for the Monitor SubUnit instance the name of the database table chosen by the WebGIS designer to store the information about the zoom and pan actions the name of the database table chosen by the WebGIS designer to store the information about the Click actions the name of the database table containing the user profiles, generated by other components
Type STRING
Mandatory YES
STRING
YES
STRING
YES
STRING
YES
In the client code, some JavaScript callbacks are added, in order to handle the events. This is easily accomplished, thanks to the flexibility of the GM API, as follows: google.maps.Event.addListener(map, "zoomend", function(zoom, newZoom) { catchZoom( zoom, newZoom, map ); } );
In this example, catchZoom is the function returning the information to the server. It is invoked every time the user performs a zoom operation on the map. 3.3 The Analysis Unit This is the unit responsible for inferring the user profile, and thus the adaptive contents and interfaces, given the implicitly monitored information about user interactions. This unit is intended to be linked in the backend of the web application. Its output is a set of Points of Interest that have been inferred by the user’s actions. The generation of the user profiles can be launched explicitly (by clicking on a specific link of a backend web page), or implicitly (for instance, any time a logged user leaves the Map page). In Figure 6 the first case is depicted while its properties are described in Table 3. Since the definition of an ad-hoc algorithm for inferring the user profile was outside the scope of this work, we have mainly focused on the flexibility of the proposal. To this aim, we have decoupled the algorithm that generates the user profile from the rest of the system through the use of an interface, according to the Strategy Design Pattern [9]. The class diagram of part of the components within the Analysis Unit is shown in Figure 7. In our case, the algorithm named “Keyword Analysis” simply computes a weight for each click of the user on a marker. Utilising the power of this approach, sophisticated algorithms can be easily added to this module by implementing the specified interface, for trade-offs between computational time and profiling accuracy. For example, algorithms for creating user profiles which incorporate more user activity
Automatic Generation of an Adaptive WebGIS
181
Fig. 6. The Analysis Unit, in WebRatio Table 3. Properties of the DataSource Unit Property Name User Suggests Algorithm Zoom Entity Click Entity
Description the name chosen by the WebGIS designer for the Analysis Unit instance the name of the database table where the profiling algorithm will store the user profiles the name of the algorithm to use to infer the user profile the name of the database table chosen by the WebGIS designer to store the information about the zoom and pan actions the name of the database table chosen by the WebGIS designer to store the information about the Click actions
Type STRING
Mandatory YES
STRING
YES
STRING
YES
STRING
YES
STRING
YES
Fig. 7. The Class Diagram embedding the Strategy Pattern
182
S. Di Martino et al.
and are based on further interest indicators such as mouse pointer proximity, time or user navigational behavior [12] [13] [15] can easily be incorporated in to the WebGIS using the approach described here.
4 Case Study In order to demonstrate the effectiveness of the proposed solution, in this section we describe the design accomplished to realize a web application for a Tourist Agency. 4.1 The DataSet Firstly we introduce the dataset used by the application to extract the features presented in the application. The features contained in the datasets are Archaeological Sites, Gas Stations, Historic Landmarks, Museums, Picnic Areas, Pizzerias, Post Offices, Sightseeing Sites, Theatres, Touristic Harbours and Tourist Information Points. The dataset was built with the PostgreSQL DBMS and in particular with its spatial extension PostGIS. In this data model, the most important tables for the case study are the ZoomCatched, ClickCatched and Suggests, which store the information monitored during the working sessions. The E-R model is depicted in Figure 8.
Fig. 8. The Data Model underlying the case study
4.2 The HypertextModel The first Site View, which defines the frontend section of the web application, is made up of two pages, as shown in Figure 9. The first is a list of touristic activities,
Automatic Generation of an Adaptive WebGIS
183
while the second is the WebGIS page containing a Map, a DataSource, a Route Calculation feature, and an AddMarker unit. In the MultiMap, it is defined as an instance of the adaptive subunit, UserProfiling (this nested unit is not visible in Figure 9).
Fig. 9. The Hypertext Model for the public side of the web site
Another site view is used for the analysis section, in the backend of the web site which contains the units for the user preference analysis (Figure 10).
Fig. 10. The Hypertext Model for the back office
4.3 The Generated Application Once generated, the WebGIS application provides two distinct sections, one with a map, in which we can visualize the profiled features, and one acting as backend, where the
184
S. Di Martino et al.
site administrator can execute the analysis algorithm for a specific user. Before inferring any user profile, the interface contains all the information in the dataset. As one can see, in Figure 11, with a low level zoom, there is an overwhelming amount of spatial information represented on the map. Therefore a filtering is necessary. After some time, an administrator of the site can access the automatically generated back-end of the site where they can click on the link “Start Analysis”, associated with a particular user. The next time the logged user accesses the map page, he will find a new layer, selected by default, showing only the markers which the algorithm has identified as relevant to their interests, as shown in Figure 12.
Fig. 11. The generated adaptive WebGIS application
Fig. 12. The adaptive layer computed by the algorithm, now active onto the map
Automatic Generation of an Adaptive WebGIS
185
5 Conclusions In this paper, an approach for designing and automatically generating adaptive WebGIS applications that synergically combine the advantages of the visual notations of WebML,WebRatio and the power of the Google Maps API has been described. There is a real need to introduce adaptivity and personalization to the WebGIS domain in order to assist with the problems of information overload, particularly in the LBS domain. Adaptivity can be achieved implicitly through the addition of modules to record user interactions with the map, analyze this data and produce recommendations. Developing such an adaptive GIS is hard due to the challenge of combining various technologies. WebML and WebRatio however, offer a viable solution to easily generate such content by combining the visual notations of WebML and the generative power of WebRatio with the new personalization units, presented in this paper. The effectiveness of the approach has been demonstrated through the modeling and generation of a sample adaptive WebGIS application which assists tourists in resolving the problems of information overload. Acknowledgements. The authors wish to thank Alessandro Amoroso for his support in the development of the solution proposed in this paper. The Research presented in this paper was partly funded by a Strategic Research Cluster grant (07/SRC/I1168) by Science Foundation Ireland under the National Development Plan. The authors gratefully acknowledge this support.
References [1] Ceri, S., Fraternali, P., Bongio, A., Brambilla, M., Comai, S., Matera, M.: Designing Data-Intensive Web Applications. Morgan-Kaufmann Publishers, San Francisco (2002) [2] Ceri, S., Fraternali, P., Bongio, A.: Web Modeling Language (WebML): a Modeling Language for Designing Web Sites. Computer Networks: The International Journal of Computer and Telecommunications Networking 33(1-6), 137–157 (2000) [3] Claypool, M., Le, P., Waseda, M., Brown, D.: Implicit Interest Indicators. In: Proc of the ACM International Conference on Intelligent User Interfaces (IUI 2001), Santa Fe, New, Mexico, USA, pp. 33–40 (2001) [4] Di Martino, S., Ferrucci, F., Ginige, A., Paolino, L., Sebillo, M., Vitiello, G., Avagliano, G.: A method to develop Web GIS. In: Proceedings of the International Workshop on Visual Languages and Computing (VLC 2007), San Francisco, California, USA (2007) [5] Di Martino, S., Ferrucci, F., Paolino, L., Sebillo, M., Tortora, G., Vitiello, G., Avagliano, G.: Towards the Automatic Generation of Web GIS. In: Proc. of the 15th International Symposium on Advances in Geographic Information Systems (ACMGIS 2007), Seattle, USA (2007) [6] Di Martino, S., Ferrucci, F., Paolino, L., Sebillo, M., Vitiello, G., Avagliano, G.: A WebML-based visual language for the development of Web GIS Applications. In: IEEE Proceedings of the 2007 Symposium on Visual Languages and Human-Centric Computing (VL/HCC 2007), Coeur d’Alene, Idaho, USA, pp. 209–212. IEEE Press, Los Alamitos (2007)
186
S. Di Martino et al.
[7] Di Martino, S., Ferrucci, F., Paolino, L., Sebillo, M., Vitiello, G., Avagliano, G.: A WebML-based Approach for the Development of Web GIS Applications. In: Benatallah, B., Casati, F., Georgakopoulos, D., Bartolini, C., Sadiq, W., Godart, C. (eds.) WISE 2007. LNCS, vol. 4831, pp. 385–397. Springer, Heidelberg (2007) [8] Avagliano, G., Di Martino, S., Ferrucci, F., Paolino, L., Sebillo, M., Tortora, G., Vitiello, G.: Embedding Google Maps APIs into WebRatio for the Automatic Generation of Web GIS Applications. In: Sebillo, M., Vitiello, G., Schaefer, G. (eds.) VISUAL 2008. LNCS, vol. 5188, pp. 259–270. Springer, Heidelberg (2008) [9] Gamma, E., Helm, R., Johnson, R., Vlissides, J.: Design Patterns: Elements of Reusable Object-Oriented Software. Addison-Wesley, Reading (1994) [10] Google Maps APIs website (2009), http://code.google.com/apis/maps/ [11] Google Maps website (2009), http://maps.google.com [12] Mac Aoidh, E., Bertolotto, M., Wilson, D.C.: User Context Analysis from Spatial Interface Interactions. In: Proceedings of the 16th European Conference on Information Systems (ECIS), Galway, Ireland (2008) [13] Mobasher, B., Dai, H., Luo, T., Nakagawa, M.: Effective Personalization Based on Association Rule Discovery From Web Usage Data. In: Proceedings of the 3rd International Workshop on Web Information and Data Management, Atlanta, Georgia, USA, pp. 9–15. ACM, New York (2001) [14] MSN Maps and Directions (2009), http://www.maps.msn.com [15] Mueller, F., Lockerd, A.: Cheese: Tracking Mouse Movement Activity on Websites a Tool for User Modeling. In: Proceedings of the ACM Conference on Human Factors in Computing Systems (CHI 2002), Seattle,Washington, USA, pp. 279–280 (2002) [16] Nivala, A.-M., Brewster, S., Sarjakoski, T.L.: Usability Evaluation of Web Mapping Sites. The Cartographic Journal 45(2), 129–138 (2008) [17] OpenStreetMap (2009), http://www.openstreetmap.org [18] Spiekermann, S.: General aspects of Location-Based Services. In: Schiller, J.H., Vodsard, A. (eds.) Location-Based Services, pp. 9–27. Morgan Kaufmann, San Francisco (2004)
Campus Guidance System for International Conferences Based on OpenStreetMap Ricky Jacob1 , Jianghua Zheng1 , Bla˙zej Ciepluch1 , Peter Mooney2 , and Adam C. Winstanley1 1
Abstract. We present a web-based, multi-lingual, campus guidance system with emphasis on pedestrian navigation aimed at providing support for delegates attending International Conferences at the National University of Ireland Maynooth (NUIM) campus. A special campus guidance system could improve the logistics of the conference and potentially attract more delegates to the conference. The Cloudmade Web Map Lite API which uses OpenStreetMap has been used for creating this interface. The system generates shortest pedestrian paths using both outdoor pavements and indoor corridors between various buildings and points of interests (POI). For visual assistance in pedestrian navigation geotagged images are used along the path at certain points in the route, such as road intersections, when the user needs to get their orientation correct. The interface is currently available in both English and Chinese language. Keywords: Campus guidance system, Pedestrian navigation, Cloudmade, Maps API, OpenStreetMap, International conference, WebGIS, LBS.
1
Introduction
Where is the conference dinner? Where are the conference sessions on? What is the best pub in town? These are all common questions asked by delegates when they are attending a conference. These type of queries are among a set of challenging navigation tasks for delegates attending conferences and meetings. So the main motivation for this research was to see how feasible it is to build a pedestrian navigation system built completely on open technologies and freely avaialable data and also to maximise re-use potential for other events in the campus. Delegates often arrive to the location of a conference with only a short time to familiarise themselves with the campus, the local transportation system, and to find what the best features in the near by locations are. This paper describes the development of a campus navigation system to assist delegates J.D. Carswell et al. (Eds.): W2GIS 2009, LNCS 5886, pp. 187–198, 2009. c Springer-Verlag Berlin Heidelberg 2009
188
R. Jacob et al.
attending conferences at the NUIM campus. While the system has been initially developed for the NUIM campus and a specific conference it has been designed to extend to other conferences within NUIM and beyond. 1.1
Existing Campus Guidance and Navigation Systems
Many visitors and new students come to the university campus at NUIM every year. A campus guidance system is always a helpful tool for those unfamiliar with the campus to orientate themselves and guide them around the campus. There are some information systems that have already been developed for this purpose. Most of these applications use commercial GIS components based on Component Object Model (COM/COM+) technology. Changbin et al[1] used MapX to provide a campus navigation service of Wuhan University in China. Baaser et al[2] and Guangchen[3] utilize ArcIMS (ESRI Map Server) to develop a campus navigation prototype. [4] used SuperMap Objects to build a campus navigation system for Henan Polytechnic University in China. GIS components of the systems are commercial and their data is generally privately owned and without regular update. With the widespread use of Google Maps and Virtual Earth mapping it is now easy to obtain free mapping data. This has allowed developers to build Location Based Services (LBS) for environments such as university campuses. [5] was one the earliest groups that focused on campus navigation and guidance system using Google Maps and since this many such applications become available. [6] developed a campus information system based on Google Maps for Xinjiang University in China. The University College London [7] has created a campus navigation system built using Google maps API. Using Google Maps or Virtual Earth Map APIs some campus information applications can provide 3D services. [8] utilize VRML (Virtual Reality Modeling Language) to build campus navigation system in 3D environment. Tsinghua University, Industrial and Commercial University of Chongqing, Nantong University have all established “virtual campuses” for visitor guidance [9]. There are also some other campus navigation systems with special functions or demonstrating specific technologies. Takraouri et al[10] discussed ubiquitous computing architecture of the eyeJOT, a new context-aware smart campus information system combining ambient wall-sized displays with location-aware, context-sensitive information sharing on mobile devices. Fengyuan et al[11] discussed Gippy, which is a campus GIS information system supported by P2P technology. Jia et al[12] show their research on a Service Orientated Architecture for campus information modelling. Applications like these could very well be used for delegates attending the international conferences held on a university campus. However participants usually have different goals from other visitors and new students on campus. Participants require more specific information from a campus guidance system. Compared to general campus navigation or information systems these type of applications are relatively‘ few. One such example is [13] who discussed a simple prototype of such an application based on MapXtreme. The last few years have seen an increased interest in the design of pedestrian navigation systems for mobile phones. Some Nokia phones now ship with Nokia
Campus Guidance System for International Conferences
189
Maps 2.0 which enhances pedestrian navigation and also provides city guides in a new improved interface. There has been considerable research carried out which evaluates various modes of pedestrian navigation feedback. The various modes used for input or output are audio or voice feedback [14], textual turn by turn instructions, graphical map interface [15][16], image based navigation [17], and haptic feedback [18]. Multi-modal techniques [19] are used when more than one mode is used for providing navigational cues. In this paper we describe how we can use freely available light weight mapping API to create a pedestrian navigation module within a campus guidance system. 1.2
Using Web-Based Maps API
Integrated web applications or mashups are increasingly being used. Mashups bring together utilities of two or more applications into one common interface. Research shows that mapping mashups are the most popular among all the mashups deployed. Various sites list and review these mashups. One such site is Programmableweb.com[20]. In the recent years there has been an increase in the use of free Maps API provided by various web map services [21] [22] [23] [24] to create ones own map based web service using their interface and data along with some of the users own data. Some of the hugely popular and successful mashups like Kayak for hotels listing in US, and Trulia for real estate listing have shown how with proper design and planning one can generate revenue using such freely available map APIs. When building such applications the availability of complete digital map data for the area of interest is an issue particularly if the extent of the area is not within the limits of a major city. But when this is extended to smaller towns and cities the availability of reasonably good digital web-based maps is poor. In this paper we look at maps of the NUIM area across various web based map service providers like Microsoft’s Bing Maps, Google Maps, Yahoo Maps and OpenStreetMap and we find for most map providers that there is a lack of high quality spatial data for our area of interest. Open Street Map(OSM)[23] is a free map of the entire world. OSM is a knowledge crowdsourcing model that provides user-generated street maps. There has been various geo-wiki applications [25][26] that utilises user-generated content for its maps. However OSM is probably the most extensive and effective project currently under development [27]. Some recent applications have even developed 3D routing functionality based on OpenStreetMap, geovis of OSM-3D.org is an recent example. Since OpenStreetMap is has a free creative commons open source license model it allows a very flexibile model of access to the underlying spatial data for developers and the ability to use the various APIs. The most important is it provides good quality user-generated street maps efficiently. OSM has a map editor that enables input of user generated content. It also provides basic GIS functions, such as zoom in/out, pan and feature information tools. CloudaMade has provided a lightweight maps API using which we can build our own custom applications. It also provides an API offering higher level GIS functions such as a routing API and browser API for mobile phones to view OSM. In our work, the
190
R. Jacob et al.
OpenStreetMap for Maynooth is a rich spatial model with many point, line and polygon features. The NUIM campus has been mapped extensively to almost its entirety and thus used as the test bed for creating this prototype to demonstrate how we can bring together the capabilities of a personalized Pedestrian Navigation System using a Maps API with the OSM providing the mapping to create a campus guidance system. In developing a system such as this there is an emphasis on creating an system which can be reused and not limited to a one time use [28]. The objective of the present work is to provide a campus guidance system for the delegates attending the China-Ireland Conference on Informations and Communication Technologies 2009 [29]. However, it should be easily extendible to other conferences in NUIM and possibly in other university locations in Ireland. The CloudMade Web Maps Lite API [30] for routing has been used in this work as it is built on the OpenStreetMap database. The remainder of this paper is organised as follows. In the next section we discuss the creation of the OSM map the NUIM Campus and Maynooth Town. Then the need for a campus guidance system is discussed and followed by our proposed system. Some lessons learnt during this work are also discussed and the scope for future work is discussed at the end of the paper.
2
Creation of Map Data
OSM data collection for areas in and around Maynooth started in December 2008. A GPS device manufactured by Globalsat(Model: DG-100) was used for this propose. This device has gained popularity with the OSM community and the drivers for the Linux OS are available. For reading stored GPS tracks and saving them on the computer the application called GPS Babel [31] was used. The first stage of data collection and mapping involved preparing the line features for roads and footpaths within the NUIM Campus. The more detailed NUIM Campus containing all the buildings and other points of interest(POI) was done using an aerial photograph of the campus. Using the GPS device the road network for the entire town of Maynooth and some of the surrounding hinterlands were mapped. A bicycle mounted with the GPS device in the front was used to travel across the small streets in and around the town of Maynooth. Travel by bicycle instead of by foot saved much time. While capturing the positions of POIs and other features of interest the attribute data for the same were also collected. For example the names of streets (both in English and Irish), the names of restaurants, shops, and also the house numbers in various localities. Various OSM users across Ireland are updating the OSM database and as of July 2009 the entire network of major roads and highways across Ireland has been mapped. It is noticeably that most of the cities in Ireland are mapped extensively in OSM. Some of the smaller towns are poorly represented with OSM at present. Figure 1 shows a large residential area in Maynooth where details like street names and house numbers were captured.
Campus Guidance System for International Conferences
191
Fig. 1. An example of high resolution mapping in OSM. This is the Castle Dawson estate in Maynooth.
3
Campus Navigation System
There are a number of requirements of a campus navigation system. These requirements are from the two sides of a conference: participants of conferences held on campus and the conference organisers themselves. From the organisers point of view there is the requirement that the campus navigation system for the conference can provide all the relevant information to the delegates in a simple easy to use interface. Many conferences are held in NUIM each year so the availability of a campus navigation system will be always useful. Rather than creating separate systems for each conference the ultimate goal is to have a standard campus navigation tool which can reused and be easily customised by the conference organisers to satisfy their own specific requirements. After a brief survey of the academic and research staff in the Department of Computer Science at NUIM the most frequently asked questions of delegates of a conference are as follows: – – – –
Where are the venues where the conference sessions will be held? How does one get to the conference venue? What are the various modes of transport available? What is the optimal route to get to from one location to another or from one POI to another POI? – Where are the accommodation services?
192
R. Jacob et al.
– Where are the nearest restaurants that serves the food ’I’ like? – Where are the scenic spots around the campus which I can visit during the conference? – Where are the locations of leisure activities on or near the conference location? The first five in the list are potentially the most important questions for any conference delegate. The additional questions are of interest to delegates who wish to explore the campus and/or the surrounding town or locality. 3.1
Our Proposed Campus Navigation System
The China-Ireland Conference has been used as a case study by using the CloudMade API to provide a Campus Navigation System with the inclusion of transportation related information and markers showing other POI within Maynooth Town. Initial we identified the various buildings and POI relevant to this conference. This list is based on the location of buildings where the sessions are held, hotels where the delegates will stay, the local pubs and restaurants, and the important transportation POI like train stations and bus stops [32]. Figure 2 shows a flowchart of the system architecture of the proposed campus navigation system. And the different elements of the flowchart has been discussed in the following paragraphs. Routing and Navigation. Routing finding is the most important feature of this system. As stated above we use the CloudMade API to provide a turn-byturn descriptions of shortest routes for car users, bicycle riders, and pedestrians.
Fig. 2. A flowchart showing the system architecture for the campus navigation system
Campus Guidance System for International Conferences
193
Fig. 3. The shortest pedestrian path between Callan and John Hume building using both internal corridors and external pavements
We have decided that we will only consider the case of pedestrian navigation around the campus. Most international delegates to the conference will not have access to an automobile or a bicycle during the duration of the conference. The data collection phase of the project has created a spatially rich OSM map of Maynooth. The richness of the OSM map of NUIM means that the CloudMade API can find routes on internal corridors and external footpaths, lanes, and streets. Figure 3 shows the shortest pedestrian path computed by the Cloudmade API between two buildings on the NUIM. The proposed route includes both indoor corridors and outdoor pedestrian pavements. The exact nature of the route is determined by the setting of the travelMode option as input to the route finding algorithm in the API. As mentioned above the choices available for travelMode are: {car, car/shortest, f oot, bicycle}. Geotagged Images and photos. Fourteen buildings and POI were identified in and around the campus that would be of relevance to the delegates attending the conference. For people new to an area the use of landmarks and geotagged images for navigation is of great assistance [33]. Figure 4 shows how the route description is supplemented with the insertion of geotagged images using markers at intersections or places where the user needs to change direction. These visual clues will help the user with orientation and assist them in their sense of direction when taking the suggested route. Figure 5 shows the deliivery of public transport travel information to the user if it is applicable to their chosen route. The system allows users to plan shortest routes between the specified set of buildings and POI. Users can also drag a marker and place it at any point on the campus from which they want to generate a shortest path from. The system dynamically
194
R. Jacob et al.
Fig. 4. Geotagged images used for assisting navigation
calculates the new shortest path and updates the map display accordingly. The geographical area from which the user can select a random points from which to plan new shortest paths between has been constrained to a 3KM grid centered on the campus. Transportation options. Sometimes when visiting a conference location understanding the local public transportation system can be a difficult task for strangers to the area. There is always a need to provide the user with travel and transport related information. Figure 5 shows the use of markers at various transit points and information and suggestion are provided to the user in the information window. This provides users with the option to view information on how to travel from the NUIM campus to the nearby city of Dublin. The information provided includes fares, timetables and links to all the important websites that he/she needs for using the public transportation in the area. This feature should save users a lot of time by providing them with targeted local information. Multi-lingual assistance. The China-Ireland conference is unique in that most of the delegates attending are either from Universities in Ireland or in China. The campus navigation system described in this paper includes pages in both English and Chinese. Figure 6 shows a screenshot of the campus navigation system interface provided in the Chinese language. We have incorporated the Chinese language to the system which includes the turn-by-turn description for routing and POI. In the future we also intend to include Irish, French, German, and Polish languages as the language options.
Campus Guidance System for International Conferences
195
Fig. 5. The Dublin Bus travel information appears in the marker infowindow
Fig. 6. A screenshot of the campus navigation system interface provided in the Chinese language
4
Conclusions and Future Work
Cloudmade Web Lite API is still in an early stage of development. One of the problems with this “early adopters” stage is that there is little documentation
196
R. Jacob et al.
Fig. 7. An example of the 3D model of the NUIM campus [34]
to help developers working with the API. We see much potential in how we can customise the use of OSM data using the CloudMade Web Lite API. The present work designs a multi-lingual campus guidance system solution for international conferences in NUIM using map data from OpenStreetMap. It aims to be a common tool for any international conferences on the same campus. In this paper we demonstrated a prototype solution for the CIICT 2009 conference on NUIM campus. Currently it uses 2D maps for the guidance system. Images provide geovisual assistance at present. Consequently we intend to build a campus navigation system which is context aware and to make this user interface reusable by changing the schema and by making it customizable by any department in the university. The use of open geospatial technologies like Cloudmade API and OSM data are demonstarted here and we see it’s use in creating such cost efficient applications. The larger research cluster we are working in has built a 3D model of the campus. In future work the extension of the campus navigation system with 3D visualisation inside and outside buildings will be investigated. Figure 7 shows a sample of the 3D model for the NUIM campus. It is hoped that this 3D extension will be provided also for mobile users in the near future.
Acknowledgements Research presented in this paper was funded by a Strategic Research Cluster grant (07/SRC/I1168) by Science Foundation Ireland under the National Development Plan. The authors gratefully acknowledge this support. Part of this work is funded by the Environmental Protection Agency STRIVE research programme under the National Development Plan. The assistance of ZhanGang Zhang (Department of Computer Science) is acknowledged for providing us with the Chinese translations of various places and features in Maynooth and Dublin.
Campus Guidance System for International Conferences
197
References 1. Yu, C., Xie, X.: Campus navigation system for new students based on.net technology. Journal of Fujian Computer 10, 121–122 (2008) 2. Baaser, U., Gnyp, M., Hennig, S., Hoffmeister, D., K¨ ohn, N., Laudien, R., Bareth, G.: Campusgis of the university of cologne: a tool for orientation, navigation, and management. In: Proceedings of Geoinformatics, Proc. of SPIE 6421 (2006) 3. Wu, G., Liu, Y.: Research into the digital campus navigation system based on arcgis engine. Journal of Liaoning Technology College 11(1), 12–13 (2009) 4. Guangchao, W., Yinsheng, D., Yongqiang, Y., Jun, X., Mingbo, Y.: Design and realization of campus navigation system for henan polytechnic university. Journal of Geomatics 33, 38–40 (2008) 5. Workman, R., Gschwender, A., Chan, J.L.: Campus google map applications. EB/OL (2005) 6. Yang, Y., Zheng, J.: Developing campus information service system based on googlemaps. In: Proceedings of the 6th Seminar on Cartography and GIS of China, Urumqi, Xinjiang China, pp. 555–562 (2008) 7. UCL: UCL Campus Route Finder (2009), http://crf.casa.ucl.ac.uk/ (last accessed, July 2009) 8. Guo, H., Liu, Y., Cui, Y.: The research and implementation of 3d digital campus navigation system. Journal of Shandong University of Technology (Natural Science Edition) 22(5), 30–32 (2008) 9. Dan, L., Wen, T.G., Shujun, L.Z.: A study for 3d virtual campus navigation system based on gis. In: Proceedings of 4th International Conference on Wireless Communications. Volume Networking and Mobile Computing (2008) 10. Takrouri, B.A., Canonico, A., Gongora, L.: Eyejot - a ubiquitous context-aware campus information system. In: Proceedings of 2nd International Conference on Pervasive Computing and Applications (2007) 11. Xu1, F., Fang1, Z.: Campus gis information system supported by p2p. In: Proceedings of the 2007 International Symposium on Applications and the Internet Workshops, SAINTW 2007 (2007) 12. Nong, J., Huang, T., Cheng, Z.: Research on soa for campus information system model. In: Proceedings of 2008 International Conference on Computer Science and Software Engineering, pp. 824–827 (2008) 13. Cai, L., Liu, Z., Zheng, J.: Query system design and implement based on webgis. In: Proceedings of the 6th Seminar on Cartography and GIS of China, Urumqi, Xinjiang, China, pp. 241–248 (2008) 14. Strachan, S.: Gpstunes: controlling navigation via audio feedback. Number 1-59593089-2, 275 – 278 (2005) 15. GoogleMM: Google Maps for Mobile (2009), http://www.google.com/mobile/products/maps.html (last accessed, July 2009) 16. Nokia: Nokia Maps (2009), http://europe.nokia.com/explore-services/maps/download (last accessed, July 2009) 17. Beeharee, A.K.: A natural wayfinding exploiting photos in pedestrian navigation systems. Number 1-59593-390-5, 81 – 88 (2006) 18. Ertan, S., Lee, C., Willets, A., Tan, H., Pentland, A.: A wearable haptic navigation guidance system. Number 0-8186-9074-7, 164–165 (2002) 19. Wasinger, R.: In: Chittaro, L. (ed.) Mobile HCI 2003. LNCS, vol. 2795, pp. 481– 485. Springer, Heidelberg (2003)
198
R. Jacob et al.
20. ProgrammableWeb: Programmable Web (2009), http://www.programmableweb.com/ (last accessed, July 2009) 21. Google: Google Maps API (2009), http://code.google.com/apis/maps/ (last accessed, July 2009) 22. Yahoo: Yahoo Maps API (2009), http://developer.yahoo.com/maps/ (last accessed, July 2009) 23. OSM: Open Street Map (2009), http://www.openstreetmap.org/ (last accessed, July 2009) 24. Microsoft: Bing Maps API (2009), http://www.microsoft.com/maps/developers/ (last accessed, July 2009) 25. GoogleMapMaker: Google MapMaker (2009), http://www.google.com/mapmaker (last accessed, July 2009) 26. Wikimapia: Wikimapia (2009), http://wikimapia.org/ (last accessed, July 2009) 27. Haklay, M.M., Weber, P.: Openstreetmap: User-generated street maps. Pervasive Computing 7(4), 12–18 (2008) 28. Bloch, J.: How to design a good api and why it matters. Technical report, Google Inc. (2006) 29. CIICT: CIICT Webpage (2009), http://www.ciict.org/ (last accessed, July 2009) 30. CloudMade: CloudMade Web Maps Lite API (2009), http://developers.cloudmade.com/projects/show/web-maps-lite (last accessed, July 2009) 31. Eckert, C.: GPS Babel (2009), http://www.gpsbabel.org (last accessed, July 2009) 32. CGEnglish: Campus Guidance - English (2009), http://ciict.cs.nuim.ie/files/location/MaynoothEn.html (last accessed, July 2009) 33. Hile, H., Vedantham, R., Cuellar, G., Liu, A.L., Gelfand, N., Grzeszczuk, R., Borriello, G.: Landmark-based pedestrian navigation from collections of geotagged photos. In: Wiberg, M., Zaslavsky, A.B. (eds.) MUM, pp. 145–152. ACM, New York (2008) 34. Zheng, J., Ciepuch, B., Mooney, P., Winstanley, A.: Location based services of university town based on openstreetmap: Nui maynooth as an example. In: To be published by Proceedings of CIICT 2009 (2009)