Calgary Corpus

The Calgary Corpus is a collection of text and binary data files commonly used for comparing data compression algorithms. It was created by Ian Witten and Tim Bell in the 1980s and was widely used as a benchmark in the 1990s. In 1997 it was replaced by the Canterbury Corpus, but the Calgary Corpus remains available and is still useful for its original purpose of comparing compression algorithms.
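
As a rough illustration of how such a corpus can be used for comparison, the following Python sketch compresses every file in a directory with three standard-library compressors and reports compressed bits per input byte, one common way of expressing compression results. The directory name, the choice of compressors and their settings, and the function names are assumptions made for this example; they are not part of the corpus or of any official benchmark procedure.

```python
import bz2
import lzma
import zlib
from pathlib import Path

# Standard-library compressors to compare (an arbitrary choice for illustration).
COMPRESSORS = {
    "zlib": lambda data: zlib.compress(data, 9),
    "bz2": lambda data: bz2.compress(data, 9),
    "lzma": lzma.compress,
}


def bits_per_byte(data: bytes, compress) -> float:
    """Return compressed bits per input byte (lower is better)."""
    if not data:
        raise ValueError("cannot measure an empty input")
    return 8 * len(compress(data)) / len(data)


def compare_corpus(corpus_dir: str) -> dict:
    """Map each non-empty file in corpus_dir to its score per compressor."""
    results = {}
    for path in sorted(Path(corpus_dir).iterdir()):
        if not path.is_file():
            continue
        data = path.read_bytes()
        if data:
            results[path.name] = {
                name: bits_per_byte(data, fn) for name, fn in COMPRESSORS.items()
            }
    return results


if __name__ == "__main__":
    # "calgary" is a hypothetical local directory holding the unpacked corpus files.
    for filename, scores in compare_corpus("calgary").items():
        line = "  ".join(f"{name}={value:.3f}" for name, value in scores.items())
        print(f"{filename:10s} {line}")
```

Running every algorithm over the same fixed set of files is what makes the resulting figures comparable across algorithms.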

See also

* Comparison of file archivers

External links

* [http://links.uwaterloo.ca/calgary.corpus.html Original home of the Calgary Corpus]
* [http://corpus.canterbury.ac.nz/descriptions/#calgary New home]
* [http://pharos.cpsc.ucalgary.ca/Dienst/UI/2.0/Describe/ncstrl.ucalgary_cs/1988-327-39 Bell, Witten, and Cleary, 1988]


Wikimedia Foundation. 2010.
