Open Archives Initiative Protocol for Metadata Harvesting

Open Archives Initiative Protocol for Metadata Harvesting

OAI-PMH (Open Archives Initiative Protocol for Metadata Harvesting) is a protocol developed by the Open Archives Initiative. It is used to harvest (or collect) the metadata descriptions of the records in an archive so that services can be built using metadata from many archives. An implementation of OAI-PMH must support representing metadata in Dublin Core, but may also support additional representations.

The protocol is usually just referred to as the OAI Protocol.

OAI-PMH uses XML over HTTP. The current version is 2.0, updated in 2008.

Contents

History

In the late 1990s, Herbert Van de Sompel (Ghent University) was working with researchers and librarians at Los Alamos National Laboratory (US) and called a meeting to address difficulties related to interoperability issues of e-print servers and digital repositories. The meeting was held in Santa Fe, New Mexico, in October 1999. A key development from the meeting was the definition of an interface that permitted e-print servers to expose metadata for the papers it held in a structured fashion so other repositories could identify and copy papers of interest with each other. This interface/protocol was named the "Santa Fe Convention".

Several workshops were held in 2000 at the ACM Digital Libraries conference and elsewhere to share the ideas from the Santa Fe Convention. It was discovered at the workshops that the problems faced by the e-print community were also shared by libraries, museums, journal publishers, and others who needed to share distributed resources. To address these needs, the Coalition for Networked Information and the Digital Library Federation provided funding to establish an Open Archives Initiative (OAI) secretariat managed by Herbert Van de Sompel and Carl Lagoze. The OAI held a meeting at Cornell University (Ithaca, New York) in September 2000 to improve the interface developed at the Santa Fe Convention. The specifications were refined over e-mail.

OAI-PMH version 1.0 was introduced to the public in January 2001 at a workshop in Washington D.C., and another in February in Berlin, Germany. Subsequent modifications to the XML standard by the W3C required making minor modifications to OAI-PMH resulting in version 1.1. The current version, 2.0, was released in June 2002. It contained several technical changes and enhancements and is not backward compatible.

OAI registries

The OAI Protocol has become widely adopted by many digital libraries, institutional repositories, and digital archives. Although registration is not mandatory, it is encouraged.

There are several large registries of OAI-compliant repositories:

  1. The Open Archives list of registered OAI repositories
  2. The OAI registry at University of Illinois at Urbana-Champaign
  3. The Celestial OAI registry
  4. Eprint’s Institutional Archives Registry
  5. Openarchives.eu The European Guide to OAI-PMH compliant repositories in the world
  6. ScientificCommons.org A worldwide service and registry

Uses

Commercial search engines have started using OAI-PMH to acquire more resources. Google is using OAI-PMH to harvest information from the National Library of Australia Digital Object Repository. In 2004, Yahoo! acquired content from OAIster (University of Michigan) that was obtained through metadata harvesting with OAI-PMH. Google did accept OAI-PMH as part of their Sitemap Protocol, though decided to stop doing so in 2008.[1] Wikimedia uses an OAI-PMH repository to provide feeds of Wikipedia and related site updates for search engines and other bulk analysis/republishing endeavors.[2] Especially when dealing with thousands of files being harvested every day, OAI-PMH can help in reducing the network traffic and other resource usage by doing an incremental harvesting. NASA's Mercury: Metadata Search System uses OAI-PMH to index thousands of metadata records from Global Change Master Directory (GCMD) every day.[3]

The mod_oai project is using OAI-PMH to expose content to web crawlers that is accessible from Apache Web servers.

Software

OAI-PMH is based on a client–server architecture, in which "harvesters" request information on updated records from "repositories". Requests for data can be based on a datestamp range, and can be restricted to named sets defined by the provider. Data providers are required to provide XML metadata in Dublin Core format, and may also provide it in other XML formats.

A number of software systems support the OAI-PMH, including Fedora, GNU EPrints from the University of Southampton, Open Journal Systems from the Public Knowledge Project, Desire2Learn, DSpace from MIT, HyperJournal from the University of Pisa, Primo, DigiTool, Rosetta and MetaLib from Ex Libris, DOOR from the eLab in Lugano, Switzerland, panFMP from the PANGAEA (data library), SimpleDL from Roaring Development, and jOAI.

Archives

A number of large archives support the protocol including arXiv and the CERN Document Server.

Workshops

Since 2001 there has been a yearly workshop at CERN in Geneva.

See also

Notes

  1. ^ Google Webmaster blog
  2. ^ . Wikimedia Meta-Wiki. http://meta.wikimedia.org/wiki/Wikimedia_update_feed_service 
  3. ^ R. Devarakonda, G. Palanisamy, J. Green and B. Wilson (2010). "Data sharing and retrieval uses OAI-PMH". Earth Science Informatics (Springer Berlin / Heidelberg) 4 (1): 1–5. doi:10.1007/s12145-010-0073-0 

References

External links


Wikimedia Foundation. 2010.

Игры ⚽ Поможем написать курсовую

Look at other dictionaries:

  • Open archives initiative protocol for metadata harvesting — (OAI PMH) est un protocole informatique fondé par l Open Archives Initiative pour échanger des métadonnées. Il permet de constituer et de mettre à jour automatiquement des entrepôts centralisés où les métadonnées de sources diverses peuvent être… …   Wikipédia en Français

  • Open Archives Initiative Protocol for Metadata Harvesting — (OAI PMH) est un protocole informatique fondé par l Open Archives Initiative pour échanger des métadonnées. Il permet de constituer et de mettre à jour automatiquement des entrepôts centralisés où les métadonnées de sources diverses peuvent être… …   Wikipédia en Français

  • OAI Protocol for Metadata Harvesting — Die Open Archives Initiative (OAI) ist eine Initiative von Betreibern von Preprint Servern und anderen Dokumentenservern, um die auf diesen Servern abgelegten elektronischen Publikationen im Internet besser auffindbar und nutzbar zu machen. Dazu… …   Deutsch Wikipedia

  • Open Archives Initiative — Die Open Archives Initiative (OAI) ist eine Initiative von Betreibern von Preprint Servern und anderen Dokumentenservern, um die auf diesen Servern abgelegten elektronischen Publikationen im Internet besser auffindbar und nutzbar zu machen. Dazu… …   Deutsch Wikipedia

  • Open Archives Initiative — The Open Archives Initiative (OAI) is an attempt to build a low barrier interoperability framework for archives (institutional repositories) containing digital content (digital libraries). It allows people (Service Providers) to harvest metadata… …   Wikipedia

  • Open Archives Initiative — L Open Archives Initiative (initiative pour des archives ouvertes), généralement abrégée en OAI est un projet qui vise à faciliter l échange et la valorisation d archives numériques. Elle permet à des fournisseurs de services de moissonner des… …   Wikipédia en Français

  • Open archive — An open archive is an institutional repository or some other web accessible digital database that is compliant with the Open Archives Initiative Protocol for Metadata Harvesting (OAI PMH). This World Wide Web related article is a stub. You can… …   Wikipedia

  • Open access — This article is about open access to research literature. For other uses, see Open access (disambiguation). Open Access logo, originally designed by Public Library of Science Open access (OA) refers to unrestricted access via the Internet to… …   Wikipedia

  • Open Access movement — The Open Access movement is a social movement in academia, dedicated to the principle of open access to information sharing for the common good.The movement traces its history at least back to the 1960s, but became much more prominent in the… …   Wikipedia

  • OAI-PMH — Open Archives Initiative Protocol for Metadata Harvesting Open Archives Initiative Protocol for Metadata Harvesting (OAI PMH) est un protocole informatique fondé par l Open Archives Initiative pour échanger des métadonnées. Il permet de… …   Wikipédia en Français

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”