PSQM

PSQM

PSQM ("Perceptual Speech Quality Measure") is a computational and modeling algorithm defined in ITU Recommendation ITU-T P.861 that objectively evaluates and quantifies voice quality of voice-band (300 - 3400 Hz) .It may be used to rank the performance of these with differing speech input levels, talkers, bit rates and transcodings. The ITU-T has Withdrawn P.861 and replaced it with P.862 (PESQ) which contains an improved speech assessment algorithm.

Why It's Used

Using the PSQM standard allows automated, simulation-based test methodologies to objectively rate both speech clarity and transmitted voice quality. Various software and/or hardware products have been developed to facilitate this testing. This results in considerable savings in cost and time over the traditional practice of using large groups of people to subjectively evaluate voice signals and assess voice quality. Moreover, it yields objective results that are reliable and reproducible. This is very important to telephony providers who are mandated to maintain high Quality of Service standards.

The Algorithm

PSQM uses a psychoacoustical mathematical modeling (both perceptual and cognitive) algorithm to analyze the pre and post transmitted voice signals, yielding a PSQM value which is a measure of signal quality degradation and ranges from 0 (no degradation) to 6.5 (highest degradation). In turn, this result may be translated into a Mean Opinion Score (MOS), which is an accepted measure of the perceived quality of received media on a numeric scale ranging from 1 to 5. A value of 1 indicates unacceptable, poor quality voice while a value of 5 indicates high voice quality with no perceptible issues.

The PSQM algorithm converts the physical-domain signal(s) into the perceptually meaningful psychoacoustic domain through a series of nonlinear processes such as time-frequency mapping, frequency warping and intensity warping,

The quality of the coded speech is judged on the differences in the internal representation. The difference is used for the calculation of the noise disturbance as a function of time and frequency. Besides perceptual modeling, the PSQM algorithm uses cognitive modeling such as loudness scaling and asymmetric masking in order to get high correlations between subjective and objective measurements.

Its Limitations

PSQM as originally conceived was not developed to account for network Quality of Service perturbations common in Voice over IP applications, items such as packet loss, delay variance (jitter) or non-sequential packets. These conditions usually give inappropriate results under heavy network load simulations, failing to account for a very real perceived loss of voice quality. Attempts to duplicate network fault conditions by introducing significant packet loss result in PSQM values that correspond to falsely inflated MOS values.

In order to overcome this limitation, PSQM+ was developed by modifying the original algorithm. PSQM+ generates results that seem to more accurately reflect the adverse performance of under realistic network load conditions.

Other Considerations

Other issues involve the lack of standardization in test signals used to evaluate various . PSQM provides more reliable and consistent MOS scores if used in accordance with ITU recommended methods for objective and subjective assessment of quality (ITU-T p.800/p.830/p.861). These recommendations include using both male and female gender voice reference signals at an average level of -20dB. The type, gender, duration, gain of the voice or signal can all have a minor impact on the PSQM value or MOS score as does the threshold levels, number of calls made and other configuration settings of the environment. When comparing voice quality measurements the signal, environment and configurations should all be taken into account.

Many exist and are used in a wide variety of applications. Careful selection of appropriate speech codec(s) is necessary to match system requirements. A list of common and their associated PSQM/PSQM+ derived MOS values obtained under various network load conditions is available.

ee also

*Perceptual Evalution of Speech Quality (PESQ), the successor technology for PSQM
*Mean Opinion Score
*
*Voice over IP

External links

* [http://www.itu.int/rec/T-REC-P.861/e ITU-T P.861]


Wikimedia Foundation. 2010.

Игры ⚽ Нужна курсовая?

Look at other dictionaries:

  • PSQM — Perceptual Speech Quality Measure (Computing » Telecom) **** Perceptual Speech Quality Measurement (Medical » Physiology) …   Abbreviations dictionary

  • ILBC — (internet Low Bitrate Codec) это свободный от лицензионных отчислений кодек для голосовой связи через интернет. Кодек предназначен для узкополосных интернет каналов, со скоростью передачи аудио сигнала (человеческой речи) 13.33 кбит/с при длине… …   Википедия

  • G.723.1 — is an audio codec for voice that compresses voice audio in 30 ms frames. An algorithmic look ahead of 7.5 ms duration means that total algorithmic delay is 37.5 ms.Note that this is a completely different codec from G.723.There are two bit rates… …   Wikipedia

  • G.711 — is an ITU T standard for audio companding. It is primarily used in telephony. The standard was released for usage in 1972.G.711 represents logarithmic pulse code modulation (PCM) samples for signals of voice frequencies, sampled at the rate of… …   Wikipedia

  • Adaptive multi-rate compression — Infobox file format name = Adaptive Multi Rate Narrow Band (AMR NB) icon = logo = caption = extension = .amr mime = audio/amr type code = uniform type = magic = owner = genre = Audio container for = contained by = extended from = extended to =… …   Wikipedia

  • G.726 — is an ITU T ADPCM speech codec standard covering the transmission of voice at rates of 16, 24, 32, and 40 kbit/s. It was introduced to supersede both G.721, which covered ADPCM at 32 kbit/s, and G.723, which described ADPCM for 24 and 40 kbit/s.… …   Wikipedia

  • G.729a — is an audio data compression algorithm for voice that compresses voice audio in chunks of 10 milliseconds. G.729a is compatible with G.729, but requires less computation. This lower complexity is not free since speech quality is marginally… …   Wikipedia

  • Mean opinion score — The Mean Opinion Score (MOS) test has been used for decades in telephony networks to obtain the human user s view of the quality of the network. In multimedia (audio, voice telephony, or video) especially when codecs are used to compress the… …   Wikipedia

  • Internet Low Bit Rate Codec — (iLBC) is a royalty free [ [http://ilbcfreeware.org/documentation/gips iLBClicense.pdf Global IP Solutions iLBC Freeware Public License] ( [http://google.com/search?q=cache:ilbcfreeware.org/documentation/gips iLBClicense.pdf HTML] ) ] narrowband… …   Wikipedia

  • PESQ — PESQ, Perceptual Evaluation of Speech Quality, is a family of standards comprising a test methodology for automated assessment of the speech quality as experienced by a user of a telephony system. It is standardised as ITU T recommendation P.862… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”