Metadata repository

Metadata repository

A Metadata repository is a database created to gather, store, and distribute contextual information about business data, when documented it is known as metadata. This contextual information of business data include meaning and content, policies that govern, technical attributes, specifications that transform, and programs that manipulate[1].

Contents

Definition

The metadata repository is responsible for physically storing and cataloging metadata. The metadata that is stored should be generic, integrated, current, and historical. Generic for a metadata repository means that the meta model should store the metadata by generic terms instead of storing it by an applications-specific defined way, so that if your data base standard changes from one product to another the physical meta model of the metadata repository would not need to change. Integration of the metadata repository allows all entities of the enterprise business to view all metadata subject areas. The metadata repository should also be designed so that current and historical metadata both can be accessed[2]. Metadata repositories use to be referred to as a data dictionary[3].

Repository vs. Registry

A metadata repository is similar to a metadata registry in that they only store metadata. The metadata repository is different from a metadata registry in that a repository provides response times suitable for browsing and reporting, while registries provides response times suitable for service virtualization[4].

Reason for use

Each database management system (DBMS) and database tools have their own language for the metadata components within. Database applications already have their own repositories or registries that are expected to provide all of the necessary functionality to access the data stored within. Vendors do not want other companies to be capable of easily migrating data away from their products and into competitors products, so they are proprietary with the way the handle metadata . CASE tools, DBMS dictionaries, ETL tools, data-cleansing tools, OLAP tools, and data mining tools all handle and store metadata differently. Only a metadata repository can be designed to store the metadata components from all of these tools[5].

Design

Metadata repositories should store metadata in four classifications: ownership, descriptive characteristics, rules and policies, and physical characteristics. Ownership, showing the data owner and the application owner. The descriptive characteristics, define the names, types and lengths, and definitions describing business data or business processes. Rules and policies, will define security, data cleanliness, timelines for data, and relationships. Physical characteristics define the origin or source, and physical location.[6]. Like building a logical data model for creating a database, a logical meta model can help identify the metadata requirements for business data[7]. The metadata repository will be centralized, decentralized, or distributed.

Centralized/Decentralized/Distributed

  • A centralized metadata repository is the easiest to implement because there is only one database[8] A centralized design means that there is one database for the metadata repository that stores metadata for all applications business wide. A centralized metadata repository has the same advantages and disadvantages of a centralized database. Easier to manage because all the data is in one database, but the disadvantage is that bottlenecks may occur.
  • A decentralized metadata repository stores metadata in multiple databases, either separated by location and or departments of the business. This makes management of the repository more involved than a centralized metadata repository, but the advantage is that the metadata can be broken down into individual departments.
  • A distributed metadata repository uses a decentralized method, but unlike a decentralized metadata repository the metadata remains in its original application. An XML gateway is created[9] that acts as a directory for accessing the metadata within each different application. The advantages and disadvantages for a distributed metadata repository mirror that of a distributed database.

Entity-Relationship/Object-Oriented

Metadata repositories can be designed as either a Entity-relationship model, or an Object-oriented design.

Metadata Repository Solutions

If you choose not to build your own Metadata repository here are some vendors who can.

*Troux Technologies

See also

References

  1. ^ Page 171 Moss, L. T., & Atre, S. (2003). Business Intelligence Roadmap: The Complete Project Lifecycle for Decision-Support Applications. Addison-Wesley Professional.
  2. ^ Chapter 2, Marco, D., & Jennings, M. (2004). Universal Metadata Models. Wiley
  3. ^ Page 239 Moss, L. T., & Atre, S. (2003). Business Intelligence Roadmap: The Complete Project Lifecycle for Decision-Support Applications. Addison-Wesley Professional.
  4. ^ page 5 - http://www.gartner.com/it/content/754400/754413/qa_what_is_a_registry.pdf Jess Thompson 9 November, 2007 Q&A: What Is a Registry/Repository, and Who Should Consider One?
  5. ^ Marco, D. (2000). Building and Managing the Metadata Repository: A Full Lifecycle Guide. Wiley.
  6. ^ Page 176 Moss, L. T., & Atre, S. (2003). Business Intelligence Roadmap: The Complete Project Lifecycle for Decision-Support Applications. Addison-Wesley Professional.
  7. ^ Page 185 Moss, L. T., & Atre, S. (2003). Business Intelligence Roadmap: The Complete Project Lifecycle for Decision-Support Applications. Addison-Wesley Professional.
  8. ^ Page 242 Moss, L. T., & Atre, S. (2003). Business Intelligence Roadmap: The Complete Project Lifecycle for Decision-Support Applications. Addison-Wesley Professional.
  9. ^ P246 Moss, L. T., & Atre, S. (2003). Business Intelligence Roadmap: The Complete Project Lifecycle for Decision-Support Applications. Addison-Wesley Professional

Wikimedia Foundation. 2010.

Игры ⚽ Нужно решить контрольную?

Look at other dictionaries:

  • metadata repository — metaduomenų saugykla statusas T sritis dirbtinis intelektas apibrėžtis Specializuota saugykla – svarbiausias verslo, informacinių ir programų sistemų inžinerijos informacinės infrastruktūros komponentas, skirtas inžinerinei informacijai (apie… …   Lithuanian dictionary (lietuvių žodynas)

  • Metadata registry — A metadata registry is a central location in an organization where metadata definitions are stored and maintained in a controlled method. Contents 1 Use of Metadata Registries 2 Common characteristics of a metadata registry 3 Clear separatio …   Wikipedia

  • Metadata — For the page on metadata about Wikipedia, see Wikipedia:Metadata. The term metadata is an ambiguous term which is used for two fundamentally different concepts (types). Although the expression data about data is often used, it does not apply to… …   Wikipedia

  • Metadata — Métadonnée  Pour l’article homonyme, voir Métadonnée (audio).  Une métadonnée (mot composé du préfixe grec meta, indiquant l auto référence ; le mot signifie donc proprement « donnée de/à propos de donnée ») est une… …   Wikipédia en Français

  • Métadata — Métadonnée  Pour l’article homonyme, voir Métadonnée (audio).  Une métadonnée (mot composé du préfixe grec meta, indiquant l auto référence ; le mot signifie donc proprement « donnée de/à propos de donnée ») est une… …   Wikipédia en Français

  • Repository Open Service Interface Definition — ummaryThe Repository Open Service Interface Definition (OSID) is an O.K.I. specification which defines the storing and retrieving of digital content, referred to as Assets . OSIDs are programmatic interfaces which comprise a Service Oriented… …   Wikipedia

  • Learning object metadata — is a data model, usually encoded in XML, used to describe a learning object and similar digital resources used to support learning. The purpose of learning object metadata is to support the reusability of learning objects, to aid discoverability …   Wikipedia

  • Dryad (repository) — Dryad is a disciplinary repository for data underlying peer reviewed articles in the basic and applied biosciences, including biomedicine. Dryad aims to allow scientists to validate published findings, explore new analysis methodologies, re… …   Wikipedia

  • Content repository API for Java — (JCR) is a specification for a Java platform application programming interface (API) to access content repositories in a uniform manner.[1][dead link][2][not in citation given] …   Wikipedia

  • Open Archives Initiative Protocol for Metadata Harvesting — OAI PMH (Open Archives Initiative Protocol for Metadata Harvesting) is a protocol developed by the Open Archives Initiative. It is used to harvest (or collect) the metadata descriptions of the records in an archive so that services can be built… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”