Masking threshold

Masking threshold: The masking threshold is the sound pressure level of a sound needed to make the sound perceptible in the presence of another noice, called a "masker". This threshold depends upon the frequency, the kind of masker, and the kind of sound being masked. The effect is strongest between two sounds close in frequency.

In the context of audio transmission, there are some advantages to being unable to perceive a sound. In audio encoding, for example, better compression can be achieved by omitting the imperceptible tones, thus requiring fewer bits to encode the sound and reducing the size of the final file.

Applications in audio compression

It is uncommon to work with only one tone; most sounds are composed of multiple tones. There may be many possible maskers at the same frequency. In this situation, it is necessary to compute the global masking threshold using a high resolution Fast Fourier transform via 512 or 1024 points to determine the frequencies that comprise the sound. Because there are bands that humans are not able to hear, it is necessary to know the signal level, masker type, and the frequency band before computing the individual thresholds. To avoid having the masking threshold under the threshold in quiet, one adds the last one to the computation of partial thresholds.^{[clarification needed]} This allows computation of the signal-to-mask ratio (SMR).

The spectrum of a 1 kHz tone. A sound will not be heard if it is under the threshold in quiet. This limit changes around the masker frequency, making it more difficult to hear a nearby tone. The slope of the masking threshold is steeper toward lower frequencies than toward higher frequencies, which means it is easier to mask with higher frequency tones.

The psychoacoustic model

The MPEG audio encoding process leverages the masking threshold. In this process, there is a block called "Psychoacoustic model". This is communicated with the band filter and the quantify block. The psychoacoustic model analyzes the samples sent to it by the filter band, computing the masking threshold in each frequency band using a Fast Fourier transform. The number of points used depends upon the MPEG layer. Using these thresholds, the signal-to-mask ratio is determined and sent to the quantifier. The quantifier assigns more or less bits in each block based upon the SMR. The block with the highest SMR will encode with the maximum number of bits.

Categories:
Hearing
MPEG

Игры ⚽ Нужен реферат?

Look at other dictionaries:

Masking (in art) — Contents 1 In painting 1.1 Solid masks 1.2 Liquid masks 2 … Wikipedia
Auditory masking — occurs when the perception of one sound is affected by the presence of another sound (Gelfand 2004). The term masking is not confined to auditory perception as it can also be used in visual perception tasks.Masking can be simultaneous or non… … Wikipedia
Unsharp masking — is an image manipulation technique now familiar to many users of digital image processing software, but it seems to have been first used in Germany in the 1930s as a way of increasing the acutance, or apparent sharpness, of photographic images.… … Wikipedia
Temporal masking — or non simultaneous masking occurs when a sudden stimulus sound makes inaudible other sounds which are present immediately preceding or following the stimulus. Masking that obscures a sound immediately preceding the masker is called backward… … Wikipedia
Absolute threshold of hearing — The absolute threshold of hearing (ATH) is the minimum sound level of a pure tone that an average ear with normal hearing can hear with no other sound present. The absolute threshold relates to the sound that can just be heard by the… … Wikipedia
Unsharp masking — Необработанное изображение (вверху); изображение, обработанное с помощью нерезкого маскирования (в центре); изображение с явно завышенными параметрами (внизу) на профессиональном жаргоне «перешарпленное» Нерезкое маскирование (англ. Unsharp… … Википедия
MPEG-1 — Moving Picture Experts Group Phase 1 (MPEG 1) Filename extension .mpg, .mpeg, .mp1, .mp2, .mp3, .m1v, .m1a, .m2a, .mpa, .mpv Internet media type audio/mpeg, video/mpeg Developed by ISO, IEC Type of format audio, vid … Wikipedia
Data compression — Source coding redirects here. For the term in computer programming, see Source code. In computer science and information theory, data compression, source coding or bit rate reduction is the process of encoding information using fewer bits than… … Wikipedia
Audio compression (data) — For processes which reduce the amount of time it takes to listen to and understand a recording, see time compressed speech. Audio compression is a form of data compression designed to reduce the size of audio files. Audio compression algorithms… … Wikipedia
Sub-band coding — (SBC) is any form of transform coding that breaks a signal into a number of different frequency bands and encodes each one independently. This decomposition is often the first step in data compression for audio and video signals.Basic… … Wikipedia

Academic Dictionaries and Encyclopedias

Masking threshold

Applications in audio compression

The psychoacoustic model

Look at other dictionaries:

Share the article and excerpts

Academic Dictionaries and Encyclopedias

Wikipedia

Masking threshold

Applications in audio compression

The psychoacoustic model

Look at other dictionaries:

Share the article and excerpts

Direct link