- Data virtualization
-
Data virtualization describes the process of abstracting disparate data sources (databases, applications, file repositories, websites, data services vendors, etc.) through a single data access layer (which may be any of several data access mechanisms).
This abstraction enables data access clients to target a single data access layer, serialization, format, structure, etc., rather than making each client tool handle multiples of any or all of these.
This concept and software is commonly used within data integration, business intelligence, service-oriented architecture data services, cloud computing, enterprise search, master data management and virtual master data management.
Contents
Examples
- Linked Data can use a single hyperlink-based Data Source Name (DSN) to provide a connection to a virtual database layer that is internally connected to a variety of back-end data sources using ODBC, JDBC, OLE DB, ADO.NET, SOA-style services, and/or REST patterns.
- Database virtualization may use a single ODBC-based DSN to provide a connection to a similar virtual database layer.
Functionality
Data Virtualization software is an enabling technology which provides some or all of the following capabilities:
- Abstraction – Abstract data the technical aspects of stored data, such as location, storage structure, API, access language, and storage technology.
- Virtualized Data Access – Connect to different data sources and make them accessible from one logical place
- Transformation / Integration – Transform, improve quality, and integrate data based on need across multiple sources
- Data Federation – Combine results sets from across multiple source systems.
- Flexible Data Delivery – Publish result sets as views and/or data services executed by consuming application or users when requested
Data virtualization software may includes functions for development, operation, and/or management.
History
Enterprise Information Integration (EII) and data federation have been used by some vendors to describe a core element of data virtualization: the capability to create relational JOINs in a federated VIEW. Some forms of legacy data virtualization build on knowledge and concepts developed within EII and Data Federation.
Newer types of data virtualization do not always require movement of the data to construct the view. They may allow you to see the results of the relational joins before any data is moved anywhere. This additional capability is a very significant differentiation point between legacy data virtualization vendors (older EII technology) and newer technologies based upon persistent metadata servers.
Data Virtualization Vendors
- Composite Software
- Denodo Technologies
- IBM
- Informatica
- JBOSS - TEIID Data Virtualization
- OpenLink Software
- Queplix
- Radiant Logic
- SAS
- VirtualWorks Group
See also
- Data integration
- Enterprise Information Integration (EII)
- Master data management
- Database virtualization
References
Categories:- Database stubs
- Data management
Wikimedia Foundation. 2010.