Universal data compression

Universal data compression

Universal data compression is impossible, though some claim to have found a way to do it. Any data compression algorithm necessarily leaves some input data uncompressed. A simple counting argument illustrates this.

Suppose there is an algorithm A which compresses all data sequences of length b (in bits) to shorter length data sequences. Let c be the length of the longest compressed sequence. Since A compresses all data sequences it must be the case that c < b. Now, there are 2^b sequences of length b, but only 2^c sequences of length c. Since 2^c < 2^b, it must be the case that at least two sequences, say U_1 and U_2 compress to the same value, say C. As a result, when uncompressing C it is not possible to determine whether U_1 or U_2 should be produced. Therefore, there is no way to undo algorithm A, and A is not correct.


Wikimedia Foundation. 2010.

Игры ⚽ Поможем решить контрольную работу

Look at other dictionaries:

  • Data compression — Source coding redirects here. For the term in computer programming, see Source code. In computer science and information theory, data compression, source coding or bit rate reduction is the process of encoding information using fewer bits than… …   Wikipedia

  • Universal code (data compression) — In data compression, a universal code for integers is a prefix code that maps the positive integers onto binary codewords, with the additional property that whatever the true probability distribution on integers, as long as the distribution is… …   Wikipedia

  • Lossless data compression — is a class of data compression algorithms that allows the exact original data to be reconstructed from the compressed data. The term lossless is in contrast to lossy data compression, which only allows an approximation of the original data to be… …   Wikipedia

  • Universal coding — can refer to one of two concepts in data compression: * Universal code (data compression), a fixed prefix code that, for any probability mass function, has a data compression ratio within a constant of the optimal prefix code * Universal source… …   Wikipedia

  • Universal code — can refer to: * Universal code (data compression) * Universal Code (biology) * An alternate term for a Universal law * Universal code (ethics) * Universal Product Code * Universal code (typography) * Universal code (cartography) * Universal Code… …   Wikipedia

  • Compression artifact — Original image, with good color grade Loss of edge clarity and tone fuzziness in heavy JPEG compression A compression ar …   Wikipedia

  • Data URI scheme — The data URI scheme is a URI scheme (Uniform Resource Identifier scheme) that provides a way to include data in line in web pages as if they were external resources. It tends to be simpler than other inclusion methods, such as MIME with cid or… …   Wikipedia

  • Video compression picture types — In the field of video compression a video frame is compressed using different algorithms with different advantages and disadvantages, centered mainly around amount of data compression. These different algorithms for video frames are called… …   Wikipedia

  • Dynamic range compression — This article is about a process that intentionally reduces the dynamic range of audio signals. For similar reductions caused by circuit imperfections, see Gain compression. For processes that reduce the size of digital audio files, see Audio… …   Wikipedia

  • Dynamic Markov compression — (DMC) is a lossless data compression algorithm developed by Gordon Cormack and Nigel Horspool [1]. It uses predictive arithmetic coding similar to prediction by partial matching (PPM), except that the input is predicted one bit at a time (rather… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”