Ontology based data integration

Ontology based data integration

Ontology based Data Integration involves the use of ontology(s) to effectively combine data and/or information from multiple heterogeneous sources cite conference | author= H. Wache, T. Vögele, U. Visser, H. Stuckenschmidt, G. Schuster, H. Neumann, S. Hübner | title=Ontology-Based Integration of Information A Survey of Existing Approaches | year=2001 | http://www.cs.vu.nl/~heiner/public/ois-2001.pdf] . It is one of the multiple data integration approaches and may be classified as Local-As-View (LAV)cite conference | author=Maurizio Lenzerini | title=Data Integration: A Theoretical Perspective | year=2002 | pages=243-246 | http://www.dis.uniroma1.it/~lenzerin/homepagine/talks/TutorialPODS02.pdf ] . The effectiveness of ontology based data integration is closely tied to the consistency and expressivity of the ontology used in the integration process.

Background

Data from multiple sources are characterized by multiple types of heterogeneity. The following hierarchy is often used cite book | author=A.P. Sheth | title = Changing Focus on Interoperability in Information Systems: From System, Syntax, Structure to Semantics | booktitle=Interoperating Geographic Information Systems. M. F. Goodchild, M. J. Egenhofer, R. Fegeas, and C. A. Kottman (eds.), Kluwer Academic Publishers | year=1999 | pages=5-30 | http://lsdis.cs.uga.edu/library/download/S98-changing.pdf] [ [http://daks.ucdavis.edu/~ludaesch/Paper/AHM02/tutorial5.html AHM02 Tutorial 5: Data Integration and Mediation; Contributors: B. Ludaescher, I. Altintas, A. Gupta, M. Martone, R. Marciano, X. Qian] ] :
* Syntactic Heterogeneity: is a result of differences in representation format of data
* Schematic or Structural Heterogeneity: the native model or structure to store data differ in data sources leading to structural heterogeneity. Schematic heterogeneity that particularly appears in structured databases is also an aspect of structural heterogeneity cite book | author=A.P. Sheth | title = Changing Focus on Interoperability in Information Systems: From System, Syntax, Structure to Semantics | booktitle=Interoperating Geographic Information Systems. M. F. Goodchild, M. J. Egenhofer, R. Fegeas, and C. A. Kottman (eds.), Kluwer Academic Publishers | year=1999 | pages=5-30 | http://lsdis.cs.uga.edu/library/download/S98-changing.pdf] .
* Semantic Heterogeneity: differences in interpretation of the 'meaning' of data are source of semantic heterogeneity
* System Heterogeneity: use of different operating system, hardware platforms lead to system heterogeneity

Ontologies, as formal models of representation with explicitly defined concepts and named relationships linking them, are used to address the issue of semantic heterogeneity in data sources. In domains like bioinformatics and biomedicine, the rapid development, adoption and public availability of ontologies [http://www.bioontology.org/repositories.html#obo] has made it possible for the data integration community to leverage them for semantic integration of data and information.

The Role of Ontologies

Ontologies enable the unambiguous identification of entities in heterogeneous information systems and assertion of applicable named relationships that connect these entities together. Specifically, ontologies play the following roles:

* Content Explication cite conference | author= H. Wache, T. Vögele, U. Visser, H. Stuckenschmidt, G. Schuster, H. Neumann, S. Hübner | title=Ontology-Based Integration of Information A Survey of Existing Approaches | year=2001 | http://www.cs.vu.nl/~heiner/public/ois-2001.pdf] The ontology enables accurate interpretation of data from multiple sources through the explicit definition of terms and relationships in the ontology.

* Query Model cite conference | author= H. Wache, T. Vögele, U. Visser, H. Stuckenschmidt, G. Schuster, H. Neumann, S. Hübner | title=Ontology-Based Integration of Information A Survey of Existing Approaches | year=2001 | http://www.cs.vu.nl/~heiner/public/ois-2001.pdf] In some systems like SIMS cite conference | author= Y. Arens, C. Hsu, C.A. Knoblock | title=Query Processing in sims information mediator | year=1996 | http://isi.edu/sims/papers/96arpibook.ps] , the query is formulated using the ontology as a global query schema.

* Verification cite conference | author= H. Wache, T. Vögele, U. Visser, H. Stuckenschmidt, G. Schuster, H. Neumann, S. Hübner | title=Ontology-Based Integration of Information A Survey of Existing Approaches | year=2001 | http://www.cs.vu.nl/~heiner/public/ois-2001.pdf] The ontology verifies the mappings used to integrate data from multiple sources. These mappings may either be user specified or generated by a system.

Approaches using ontologies for Data Integration

There are three main architectures that are implemented in ontology based data integration applications cite conference | author= H. Wache, T. Vögele, U. Visser, H. Stuckenschmidt, G. Schuster, H. Neumann, S. Hübner | title=Ontology-Based Integration of Information A Survey of Existing Approaches | year=2001 | http://www.cs.vu.nl/~heiner/public/ois-2001.pdf] namely, ;Single Ontology approach: A single ontology is used as a global reference model in the system. This is the simplest approach as it can be simulated by other approaches cite conference | author= H. Wache, T. Vögele, U. Visser, H. Stuckenschmidt, G. Schuster, H. Neumann, S. Hübner | title=Ontology-Based Integration of Information A Survey of Existing Approaches | year=2001 | http://www.cs.vu.nl/~heiner/public/ois-2001.pdf] . SIMS cite conference | author= Y. Arens, C. Hsu, C.A. Knoblock | title=Query Processing in sims information mediator | year=1996 | http://isi.edu/sims/papers/96arpibook.ps] is a prominent example of this approach.

;Multiple Ontologies: Multiple ontologies, each modeling an individual data source, are used in combination for integration. Though, this approach is more flexible than the single ontology approach, it requires creation of mappings between the multiple ontologies. Ontology mapping is a challenging issue and is focus of large number of research efforts in computer science [http://www.ontologymatching.org/] . The OBSERVER system cite conference | author= E. Mena, V. Kashyap, A. Sheth, A. Illarramendi | title=OBSERVER: An Approach for Query Processing in Global Information Systems based on Interoperation across Pre-existing Ontologies | year=1996 | http://dit.unitn.it/~p2p/RelatedWork/Matching/MKSI96.pdf] is an example of this apporach.

;Hybrid approaches: The hybrid approach involves the use of multiple ontologies that subscribe to a common, top-level vocabulary cite conference | author=Cheng Hian Goh | title=Representing and Reasoning about Semantic Conflicts in Heterogeneous Information Systems | year=1997 | http://context2.mit.edu/coin/publications/goh-thesis/goh-thesis.pdf] . The top level vocabulary defines the basic terms of the domain. Thus, the hybrid approach makes it easier to use multiple ontologies for integration in presence of the common vocabulary.

ee also

* Schema matching
* Data Integration
* Enterprise application integration
* Enterprise Information Integration
* Data mapping
* Ontology mapping
* Semantic Integration

References


Wikimedia Foundation. 2010.

Игры ⚽ Поможем сделать НИР

Look at other dictionaries:

  • Ontology-based data integration — involves the use of ontology(s) to effectively combine data and/or information from multiple heterogeneous sources [1]. It is one of the multiple data integration approaches and may be classified as Local As View (LAV)[2]. The effectiveness of… …   Wikipedia

  • Data integration — involves combining data residing in different sources and providing users with a unified view of these data.[1] This process becomes significant in a variety of situations, which include both commercial (when two similar companies need to merge… …   Wikipedia

  • Semantic integration — is the process of interrelating information from diverse sources, for example calendars and to do lists; email archives; physical, psychological, and social presence information; documents of all sorts; contacts (including social graphs); search… …   Wikipedia

  • Ontology editor — Ontology editors are applications designed to assist in the creation or manipulation of ontologies. They often express ontologies in one of many ontology languages. Some provide export to other ontology languages however. Decision criteria for… …   Wikipedia

  • Data mapping — Data transformation/Source transformation Concepts metadata · data mapping data transformation · model transf …   Wikipedia

  • Ontology engineering — Example of a constructed MBED Top Level Ontology based on the Nominal set of views.[1] Ontology engineering in computer science and information science is a new field, which studies the methods and methodologies for building ontologies: formal… …   Wikipedia

  • Data model — Overview of data modeling context: A data model provides the details of information to be stored, and is of primary use when the final product is the generation of computer software code for an application or the preparation of a functional… …   Wikipedia

  • Data modeling — The data modeling process. The figure illustrates the way data models are developed and used today. A conceptual data model is developed based on the data requirements for the application that is being developed, perhaps in the context of an… …   Wikipedia

  • Ontology alignment — Ontology alignment, or ontology matching, is the process of determining correspondences between concepts. A set of correspondences is also called an alignment. The phrase takes on a slightly different meaning, in computer science, cognitive… …   Wikipedia

  • Multimedia Web Ontology Language — (MOWL) has been designed to facilitate semantic interactions with multimedia contents. It supports perceptual modeling of concepts using expected media properties. While the reasoning in traditional ontology languages, e.g. Web Ontology Language… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”