- LOCKSS
The LOCKSS (Lots of Copies Keep Stuff Safe) project, under the auspices of
Stanford University , develops and supports anopen source system allowing libraries to collect, preserve and provide their readers with access to material published on the Web. The system attempts to replicate the way libraries do this for material published on paper. It was originally designed for scholarly journals, [cite paper
author = David S. H. Rosenthal
authorlink = David S. H. Rosenthal
coauthors = Vicky Reich
title = Permanent Web Publishing
publisher = 2000USENIX Annual Technical Conference
date = June 18, 2000
url = http://www.usenix.org/events/usenix2000/freenix/full_papers/rosenthal/rosenthal.pdf
accessdate = 2008-01-19
format=PDF] but is now also used for a range of other materials. Examples include theSOLINET project to preserve theses and dissertations at eight universities, [cite press release
url = http://www.solinet.net/resources/resources_templ.cfm?doc_id=3680
title = ASERL and LOCKSS to Preserve e-Theses & Dissertations
accessdate = 2008-01-19
date = July 11, 2005
publisher =SOLINET ] and theMetaArchive project preserving at-risk digital content about the culture and history of the American South. [cite web
url = http://www.metaarchive.org/
title = The MetaArchive Cooperative
accessdate = 2008-01-19
work = Home page]Traditionally, academic libraries will retain issues of scholarly journals, either individually or collaboratively, providing their readers access to the content received, even after the publisher has ceased or the subscription has been canceled. In the digital age, libraries often subscribe to journals that are only available digitally over the
Internet . Although convenient at the time, this presents a problem for the preservation of data. If either the publisher ceases to publish, or the library cancels the subscription, the content that was previously paid for is no longer available.The LOCKSS system allows a library, with permission from the publisher, to collect, preserve and disseminate to its own readers its own copy of material to which it has subscribed, and open access material (perhaps published under a
Creative Commons license). Each library's system collects its copy using a specializedweb crawler that verifies that the publisher has granted suitable permission. The system is format-agnostic, collecting whatever formats the publisher delivers viaHTTP . Libraries which have collected the same material cooperate in apeer-to-peer network to ensure its preservation. Peers in the network vote oncryptographic hash functions of preserved content and a nonce; a peer that is outvoted regards its copy as damaged and repairs it from the publisher or other peers. [cite paper
author = Petros Maniatas
coauthors = Mema Roussopoulos, TJ Giuli, David S. H. Rosenthal, Mary Baker, Yanto Muliadi
title = Preserving Peer Replicas By Rate-Limited Sampled Voting
publisher = ACM Symposium on Operating Systems Principles
date = October 19, 2003
url = http://www.eecs.harvard.edu/%7Emema/publications/SOSP2003.pdf
accessdate = 2008-01-19
format=PDF] [cite paper
author = T.J. Giuli,
coauthors = Petros Maniatis, Mary Baker, David S. H. Rosenthal, Mema Roussopoulos
title = Attrition Defenses for a Peer-to-Peer Digital Preservation System
publisher =
date = November 27, 2004
url = http://arxiv.org/abs/cs.CR/0405111
accessdate = 2008-01-19]The LOCKSS license used by most publishers allows a library's readers access to its own copy, but does not allow similar access to other libraries or unaffiliated readers; the system does not support
file sharing . On request, a library may supply another library with content to effect a repair, but only if the requesting library proved in the past that it had a good copy by voting with the majority. If the reader's browser no longer supports the format in which the copy was collected, a "format migration process" can convert it to a current format. [cite journal
author = David S. H. Rosenthal
coauthors = Thomas Lipkis, Thomas S. Robertson, Seth Morabito
authorlink = David S. H. Rosenthal
month = January | year = 2005
title = Transparent Format Migration of Preserved Web Content
journal = D-Lib Magazine
volume = 11
issue = 1
pages =
publisher =Corporation for National Research Initiatives
location =
issn = 1082-9873
url = http://www.dlib.org/dlib/january05/rosenthal/01rosenthal.html
accessdate = 2008-01-19
laysummary =
laysource =
doi = 10.1045/january2005-rosenthal ] These limits on the use that may be made of preserved copies of copyright material have been effective in persuading copyright owners to grant the necessary permission. [cite web
url = http://www.lockss.org/lockss/Publishers_and_Titles
title = Publishers and Titles
accessdate = 2008-01-19
publisher = LOCKSS]The LOCKSS approach of selective collection with permission from the publisher, distributed storage, and restricted dissemination contrasts with, for example, the
Internet Archive 's approach of omnivorous collection without permission from the publisher, centralized storage, and unrestricted dissemination. The LOCKSS system is far smaller, but it can preserve subscription materials to which the Internet Archive has no access.The fact that each library administers its own LOCKSS peer and maintains its own copy of preserved material, and the fact that there are libraries doing so worldwide (see the list of participating libraries below), provides a much higher degree of replication than is usual in a
fault-tolerant system . The voting process makes use of this high degree of replication to eliminate the need forbackup s to off-line media, and to provide robust defenses against attacks aimed at corrupting preserved content.In addition to their role in preserving access, libraries have traditionally made it difficult to rewrite or suppress printed material. The existence of an indeterminate but large number of identical copies on a somewhat tamper-resistant medium under many independent administrations meant that attempts to alter or remove all copies would likely both fail and be detected. Web publishing, based on a single copy under a single administration, provides none of these safeguards against subversion. Web publishing is, therefore, a suitable tool for
Winston Smith 's job of rewriting history. By preserving many copies under diverse administration, by automatically auditing the copies at intervals against each other (and, in the future, against the publisher's copy), and by alerting libraries when changes are detected, the LOCKSS system attempts to restore many of these safeguards.The source code for the entire LOCKSS system carries BSD-style
open-source license s and is available from [http://sourceforge.net/projects/lockss SourceForge] . LOCKSS is a trademark of Stanford University.ee also
*
Digital library
*Digital preservation
*National Digital Information Infrastructure and Preservation Program References
External links
* [http://www.lockss.org/ LOCKSS site]
* [http://www.lockss.org/lockss/Libraries Participating libraries]
* [http://www.youtube.com/watch?v=0wdcnXrQkaI Configuring a LOCKSS Box (YouTube video)]
* [http://www.youtube.com/watch?v=TOE_Jw23cVg LOCKSS (YouTube video)]
Wikimedia Foundation. 2010.