- Lossy compression
A lossy compression method is one where compressing data and then decompressing it retrieves data that may well be different from the original, but is close enough to be useful in some way. Lossy compression is most commonly used to compress
multimedia data (audio,video , still images), especially in applications such asstreaming media and internet telephony. By contrast,lossless compression is required for text and data files, such as bank records, text articles, etc. Lossy compression formats suffer fromgeneration loss : repeatedly compressing and decompressing the file will cause it to progressively lose quality.This is in contrast withlossless data compression .Information-theoretical foundations for lossy data compression are provided by
rate-distortion theory . Much like the use ofprobability in optimal coding theory, rate-distortion theory heavily draws onBayesian estimation anddecision theory in order to model perceptual distortion and evenaesthetic judgment.Types
There are two basic lossy compression schemes:
* In "lossy transform
codec s", samples of picture or sound are taken, chopped into small segments, transformed into a new basis space, and quantized. The resulting quantized values are then entropy coded.* In "lossy predictive codecs", previous and/or subsequent decoded data is used to predict the current sound sample or image frame. The error between the predicted data and the real data, together with any extra information needed to reproduce the prediction, is then quantized and coded.
In some systems the two techniques are combined, with transform codecs being used to compress the error signals generated by the predictive stage.
Lossy versus lossless
The advantage of lossy methods over lossless methods is that in some cases a lossy method can produce a much smaller compressed file than any known lossless method, while still meeting the requirements of the application.
Lossy methods are most often used for compressing sound, images or videos. This is because these types of data are intended for human interpretation where the mind can easily "fill in the blanks" or see past very minor errors or inconsistencies – ideally lossy compression is transparent (imperceptible), which can be verified via an
ABX test .Transparency
When a user acquires a lossily compressed file, (for example, to reduce download time) the retrieved file can be quite different from the original at the
bit level while being indistinguishable to the human ear or eye for most practical purposes. Many compression methods focus on the idiosyncrasies ofhuman physiology , taking into account, for instance, that the human eye can see only certain wavelengths of light. Thepsychoacoustic model describes how sound can be highly compressed without degrading perceived quality. Flaws caused by lossy compression that are noticeable to the human eye or ear are known ascompression artifact s.Compression ratio
The compression ratio (that is, the size of the compressed file compared to that of the uncompressed file) of lossy video codecs is nearly always far superior to that of the audio and still-image equivalents.
* Video can be compressed immensely (e.g. 300:1) with little visible quality loss;Fact|date=September 2008
* Audio can often be compressed at 10:1 with imperceptible loss of quality;Fact|date=September 2008
* Still images are often lossily compressed at 10:1, as with audio, but the quality loss is more noticeable, especially on closer inspection.Fact|date=September 2008Transcoding and Editing
An important caveat about lossy compression is that converting (formally,
transcoding ) or editing lossily compressed files causesdigital generation loss from the re-encoding. This can be avoided by only producing lossy files from (lossless) originals, and only editing (copies of) original files, such as images inraw image format instead ofJPEG .Lossless editing
Some lossless editing of lossily compressed files is possible, which consists of modifying the compressed data directly, without decoding and re-encoding.
JPEG
The primary programs for lossless editing of JPEGs are
, and the derivedjpegtran exiftran
(which also preserves EXIF information), and [http://sylvana.net/jpegcrop/ Jpegcrop] (which provides a Windows interface).These allow one to
* crop,
* rotate, flip, and flop, or
* convert tograyscale (by dropping thechrominance channel).[http://sylvana.net/jpegcrop/jpegjoin/ JPEGjoin] allows one to join different JPEG images (which have the same encoding), without re-encoding. (See also: [http://sylvana.net/jpegcrop/jpegtran/ New jpegtran features] .)
One can also make some changes to the compression without re-encoding:
* optimize the compression (so it takes less space),
* convert between progressive and non-progressive encoding,There is also the freeware Windows-only
IrfanView , which has some lossless JPEG operations in itsJPG_TRANSFORM
plugin .MP3
;Splitting and joining:
Mp3splt andMp3wrap (orAlbumWrap ) allow one to split an MP3 file into pieces or join them loselessly. These are analogous tosplit
andcat
. [Though the wrap programs do more, encoding the divisions between the original files.] ;Gain: VariousReplay Gain programs such asMP3gain allow one to modify thegain (overall volume) of MP3 files losslessly.Metadata
One can generally modify or remove metadata, such as
ID3 tag s,Vorbis comment s, orEXIF information, without modifying the underlying media.Downsampling / compressed representation scalability
One may wish to
downsample or otherwise decrease the resolution of the represented source signal and the quantity of data used for its compressed representation without re-encoding, as inbitrate peeling , but this functionality is not supported in all designs, as not all codecs encode data in a form that allows less important detail to simply be dropped.Some well known designs that have this capability include
JPEG 2000 for still images andH.264/MPEG-4 AVC basedScalable Video Coding for video. Actually such schemes have also been standardized for older designs as well, such asJPEG images with progressive encoding, andMPEG-2 andMPEG-4 Part 2 video, although those prior schemes had limited success in terms of adoption into real-world common usage.Without this capacity, which is often the case in practice, to produce a representation with lower resolution or lower fidelity than a given one, one needs to start with the original source signal and encode, or start with a compressed representation and then decompress and re-encode it (
transcoding ), thought this latter tends to causedigital generation loss .On a related point, some audio formats feature a combination of a lossy format and a lossless correction; this allows stripping the correction to easily obtain a lossy file, though whether the lossy portion itself can be further stripped is a separate question. Such formats include
MPEG-4 SLS (Scalable to Lossless),WavPack , andOptimFROG DualStream .Methods
Graphics
Image
*
Cartesian Perceptual Compression : Also known as CPC
*DjVu
*Fractal compression
* HAM, hardware compression of color information used inAmiga computers
*ICER , used by the Mars Rovers: related toJPEG 2000 in its use of wavelets
*JPEG
*JPEG 2000 , JPEG's successor format that uses wavelets.
*JBIG2
* PGF, Progressive Graphics File (lossless or lossy compression)
*Wavelet compression
*S3TC texture compression for 3D computer graphics hardwareVideo
*
H.261
*H.263
*H.264/MPEG-4 AVC
*MNG (supports JPEG sprites)
*Motion JPEG
*MPEG-1 Part 2
*MPEG-2 Part 2
*MPEG-4 Part 2
*Ogg Theora (noted for its lack of patent restrictions)
*Sorenson video codec
*VC-1 Audio
Music
*AAC
*ADPCM
*ATRAC
*Dolby AC-3
*MP2
*MP3
*Musepack
*Ogg Vorbis (noted for its lack of patent restrictions)
*WMASpeech
*
CELP
*G.711
*G.726
*Harmonic and Individual Lines and Noise (HILN)
* AMR (used by GSM cell carriers, such as T-Mobile)
*Speex (noted for its lack of patent restrictions)Other data
Researchers have (semi-seriously) performed lossy compression on text by either using a thesaurus to substitute short words for long ones, or generative text techniques [cite web|title=Semantic and Generative Models for Lossy Text Compression |url=http://compression.ru/download/articles/text/witten_1994cj_lossy_text_compression.pdf |author=I. H. WITTEN, et al. |publisher=The Computer Journal |accessdate=2007-10-13 |format=PDF] , although these sometimes fall into the related category of
lossy data conversion .ee also
*
Data compression
*Lossless data compression *
Compression artifact
*Rate–distortion theory *
List of codecs *
Lenna Notes
External links
* [http://www.bobulous.org.uk/misc/lossy_audio_2006.html Lossy audio formats] , comparing the speed and compression strength of five lossy audio formats.
* [http://membled.com/work/apps/lossy_png/ lossy PNG image compression (research)]
* [http://www.websiteoptimization.com/speed/tweak/lossy/ using lossy GIF/PNG compression for the web (article)]
Wikimedia Foundation. 2010.