-
15th ACM International Symposium on Geographic Information Systems, ACM-GIS 2007, November 7-9, 2007, Seattle, Washington, USA, Proceedings; 01/2007
-
VLDB J. 01/2005; 14:330-353.
-
AI Magazine. 01/2005; 26:33-44.
-
[show abstract]
[hide abstract]
ABSTRACT: The Semantic Web promises seamless integration of heterogeneous data from distributed sources, letting agents (human users or automated programs) perform sophisticated and detailed analyses of this data. An agent would send a query, expressed in terms of its preferred ontology (schema), to a system that would then find and integrate the relevant data from multiple sources and return it using the agents ontology. Before achieving this vision, however, we must address several challenges. We need technologies to integrate data described in different ontologies, for example, as well as different types of data, such as images or structured data. In addition, a Semantic Web-based system must recognize when different objects at different sites denote the same real-world entity. Other challenges include efficiently querying distributed information and converting legacy data in traditional databases and Web sites (HTML) into more semantic representations such as RDF. Building Finder is a running application that showcases our approach to meeting these challenges. The application integrates satellite imagery, geospatial data, and structured and semistructured data from various online data sources using Semantic Web technologies. Users can query an integrated view of these sources and request Building Finder to accurately superimpose buildings and streets obtained from various sources on satellite imagery. The data sources integrated by Building Finder are heterogeneous not only in terms of the data, but also in terms of how the application accesses the sources.
07/2004;
-
[show abstract]
[hide abstract]
ABSTRACT: this paper, we describe our approach to automatically and accurately align satellite imagery with the various online maps that are currently available. The traditional approach to aligning these various geospatial products is to use a technique called conflation [7], which requires identifying a set of control point pairs on the two data sources. The identification of these control points is often performed manually, which is a tedious and time-consuming process that is made even harder by the fact that many of the online sources do not even provide the coordinates of the corner points of the maps. In previous work, we developed an approach to automatically conflating road vector data with satellite imagery [2]. In this paper we describe how we address the even more challenging problem of automatically conflating maps with satellite imagery. Since we build on our previous work, we first review our approach to automatically conflate road vector data with satellite imagery. Then we describe our approach to automatically conflating a map with the satellite imagery by first using the vector data to identify all of the intersections and then utilizing a specialized point matching algorithm to align the two datasets
03/2004;
-
IEEE Intelligent Systems. 01/2004; 19:72-79.
-
12th ACM International Workshop on Geographic Information Systems, ACM-GIS 2004, November 12-13, 2004, Washington, DC, USA, Proceedings; 01/2004
-
12th ACM International Workshop on Geographic Information Systems, ACM-GIS 2004, November 12-13, 2004, Washington, DC, USA, Proceedings; 01/2004
-
Proceedings of the Nineteenth National Conference on Artificial Intelligence, Sixteenth Conference on Innovative Applications of Artificial Intelligence, July 25-29, 2004, San Jose, California, USA; 01/2004
-
[show abstract]
[hide abstract]
ABSTRACT: Information sources on the web are controlled by di#erent organizations or people, utilize di#erent text formats, and have varying inconsistencies. Therefore, any system that integrates information from di#erent data sources must consolidate data from these sources. Data from many data sources on the web may not contain enough information to accurately consolidate the data even using state of the art object consolidation systems. We present an approach to accurately and automatically consolidate data from various data sources by utilizing a state of the art object consolidation system in conjunction with a mediator system. The mediator system is able to automatically determine which secondary sources need to be queried in cases where the object consolidation system is unable to confidently determine whether two records refer to the same entity. In turn, the object consolidation system is then able to utilize this additional information to improve the accuracy of the consolidation between datasets.
11/2003;
-
[show abstract]
[hide abstract]
ABSTRACT: Web services enable the user to integrate and manipulate data from distributed data sources without worrying about the underlying syntactical details. We describe extensions to the view integration approach to support dynamic integration of data from web services and support dynamic composition of web services from existing web services. In particular, we describe techniques to extend the "inverse rules" query reformulation algorithm to generate a universal integration plan to answer user queries. To demonstrate the effectiveness of these techniques we describe a mediator-based system that dynamically integrates various web services in response to a user query and provides a integrated web service that can handle a range of user queries.
06/2003;
-
[show abstract]
[hide abstract]
ABSTRACT: Information integration systems provide a uniform query interface to a set of sources. One of the key challenges for an information integration system is to provide maximally complete answers to user queries and to execute user queries efficiently. We describe an approach to map recursive datalog programs into a streaming, dataflow execution system. We show that our method can be used in conjunction with the Inverse Rules algorithm to create a new information integration system that can provide maximally complete answers to user queries and efficiently execute those queries. Our preliminary results show that in addition to generating maximally complete answers, we obtain performance improvements ranging from 8% to 24.3% over datalog execution.
06/2003;
-
[show abstract]
[hide abstract]
ABSTRACT: Recent growth of the geo-spatial information on the web has made it possible to easily access a wide variety of spatial data. By integrating these spatial datasets, one can support a rich set of queries that could not have been answered given any of these sets in isolation. However, accurately integrating geo-spatial data from different data sources is a challenging task. This is because spatial data obtained from various data sources may have different projections, different accuracy levels and different formats (e.g. raster or vector format). In this paper, we describe an information integration approach, which utilizes various geo-spatial and textual data available on the Intemet to automatically annotate and contlate satellite imagery with vector datasets. We describe two techniques to automatically generate control point pairs from the satellite imagery and vector data to perform the conflation. The first technique generates the control point pairs by integrating information from different online sources. The second technique exploits the information from the vector data to perform localized image-processing on the satellite imagery. Using these techniques, we can automatically integrate vector data with satellite imagery or align multiple satellite images of the same area. Our automatic conflation techniques can automatically identify the roads in satellite imagery with an average error of 8.61 meters compared to the original error of 26.19 meters for the city of E1 Segundo and 7.48 meters compared to 15.27 meters for the city of Adams Morgan in Washington, DC.
06/2003;
-
Shahram Gh,
Craig A. Knoblock,
Christos Papadopoulos,
Cyrus Shahabi,
Jos Luis Ambite,
Min Cai,
Ching-chien Chen,
Parikshit Pol,
Rolfe Schmidt, Snehal Thakkar,
Runfang Zhou
[show abstract]
[hide abstract]
ABSTRACT: Many organizations envision web services as an enabling component of Internet-scale computing. A final vision of web services is to realize a dynamic environment that identifies, composes and executes web services in response to a query. This vision shapes the design and implementation of Proteus. In addition to describing Proteus' novel components, this paper outlines its initial system design.
06/2003;
-
Shahram Gh,
Craig A. Knoblock,
Christos Papadopoulos,
Cyrus Shahabi,
Esam Alwagait,
Jos Luis Ambite,
Min Cai,
Ching-chien Chen,
Parikshit Pol,
Rolfe Schmidt,
Saihong Song, Snehal Thakkar,
Runfang Zhou
[show abstract]
[hide abstract]
ABSTRACT: Many organizations envision web services as an enabling component of Internet-scale computing. A final vision of web services is to realize a dynamic environment that identifies, composes and executes web services in response to a query. This vision shapes the design and implementation of Proteus. In addition to describing Proteus' novel components, this paper outlines its initial system design.
05/2003;
-
Shahram Ghandeharizadeh,
Craig A. Knoblock,
Christos Papadopoulos,
Cyrus Shahabi,
Esam Alwagait,
José Luis Ambite,
Min Cai,
Ching-Chien Chen,
Parikshit Pol,
Rolfe R. Schmidt,
Saihong Song, Snehal Thakkar,
Runfang Zhou
Proceedings of the International Conference on Web Services, ICWS '03, June 23 - 26, 2003, Las Vegas, Nevada, USA; 01/2003
-
[show abstract]
[hide abstract]
ABSTRACT: The power of web services can only be realized when web services are utilized as building blocks to dynamically compose new web services. The Building Finder application is an example application that integrates information from several web services by modeling the web services as information sources in a mediator-based architecture. The paper also describes a mediator framework to dynamically compose new web services, similar to the Building Finder, by generating an integration plan to integrate information from the existing web services.
08/2002;
-
[show abstract]
[hide abstract]
ABSTRACT: Due to the recent growth of the World Wide Web, numerous spatio-temporal applications can obtain their required information from publicly available web sources. We consider those sources maintaining moving objects with predefined paths and schedules, and investigate di#erent plans to perform queries on the integration of these data sources e#- ciently. Examples of such data sources are networks of railroad paths and schedules for trains running between cities connected through these networks. A typical query on such data sources is to find all trains that pass through a given point on the network within a given time interval. We show that traditional filter semi-join plans would not result in efficient query response times on distributed spatio-temporal sources. Hence, we propose a novel spatio-temporal filter, called deviation filter, that exploits both the spatial and temporal characteristics of the sources in order to improve the selectivity. We also report on our experiments in comparing the performances of the alternative query plans and conclude that the plan with spatio-temporal filter is the most viable and superior plan.
12/2001;
-
VLDB 2001, Proceedings of 27th International Conference on Very Large Data Bases, September 11-14, 2001, Roma, Italy; 01/2001
-
ACM-GIS 2001, Proceedings of the Ninth ACM International Symposium on Advances in Geographic Information Systems, Atlanta, GA, USA, November 9-10, 2001; 01/2001