Code-excited linear prediction

Code-excited linear prediction (CELP) is a speech coding algorithm originally proposed by M.R. Schroeder and B.S. Atal in 1985. At the time, it provided significantly better quality than existing low bit-rate algorithms, such as residual-excited linear prediction and linear predictive coding vocoders (e.g., FS-1015). Along with its variants, such as algebraic CELP, relaxed CELP, low-delay CELP and vector sum excited linear prediction, it is currently the most widely used speech coding algorithm. It is also used in MPEG-4 Audio speech coding. CELP is commonly used as a generic term for a class of algorithms and not for a particular codec.

1 Introduction
2 CELP decoder
3 CELP encoder
- 3.1 Noise weighting
4 See also
5 External links
6 References

Introduction

The CELP algorithm is based on four main ideas:

Using the source-filter model of speech production through linear prediction (LP)(see the textbook "speech coding algorithm");
Using an adaptive and a fixed codebook as the input (excitation) of the LP model;
Performing a search in closed-loop in a “perceptually weighted domain”.
Applying vector quantization (VQ)

The original algorithm as simulated in 1983 by Schroeder and Atal required 150 seconds to encode 1 second of speech when run on a Cray-1 supercomputer. Since then, more efficient ways of implementing the codebooks and improvements in computing capabilities have made it possible to run the algorithm in embedded devices, such as mobile phones.

CELP decoder

Figure 1: CELP decoder

Before exploring the complex encoding process of CELP we introduce the decoder here. Figure 1 describes a generic CELP decoder. The excitation is produced by summing the contributions from an adaptive (aka pitch) codebook and a stochastic (aka innovation or fixed) codebook:

$e[n]=e_a[n]+e_f[n]\,$

where $e a [n]$ is the adaptive (pitch) codebook contribution and $e f [n]$ is the stochastic (innovation or fixed) codebook contribution. The fixed codebook is a vector quantization dictionary that is (implicitly or explicitly) hard-coded into the codec. This codebook can be algebraic (ACELP) or be stored explicitly (e.g. Speex). The entries in the adaptive codebook consist of delayed versions of the excitation. This makes it possible to efficiently code periodic signals, such as voiced sounds.

The filter that shapes the excitation has an all-pole model of the form $1 / A (z)$ , where $A (z)$ is called the prediction filter and is obtained using linear prediction (Levinson–Durbin algorithm). An all-pole filter is used because it is a good representation of the human vocal tract and because it is easy to compute.

CELP encoder

The main principle behind CELP is called Analysis-by-Synthesis (AbS) and means that the encoding (analysis) is performed by perceptually optimizing the decoded (synthesis) signal in a closed loop. In theory, the best CELP stream would be produced by trying all possible bit combinations and selecting the one that produces the best-sounding decoded signal. This is obviously not possible in practice for two reasons: the required complexity is beyond any currently available hardware and the “best sounding” selection criterion implies a human listener.

In order to achieve real-time encoding using limited computing resources, the CELP search is broken down into smaller, more manageable, sequential searches using a simple perceptual weighting function. Typically, the encoding is performed in the following order:

Linear Prediction Coefficients (LPC) are computed and quantized, usually as LSPs
The adaptive (pitch) codebook is searched and its contribution removed
The fixed (innovation) codebook is searched

Noise weighting

Most (if not all) modern audio codecs attempt to shape the coding noise so that it appears mostly in the frequency regions where the ear cannot detect it. For example, the ear is more tolerant to noise in parts of the spectrum that are louder and vice versa. That's why instead of minimizing the simple quadratic error, CELP minimizes the error for the perceptually weighted domain. The weighting filter W(z) is typically derived from the LPC filter by the use of bandwidth expansion:

$W(z) = \frac{A(z/\gamma_1)}{A(z/\gamma_2)}$

where $γ 1 > γ 2$ .

External links

This is based on a paper presented at Linux.Conf.Au
Some parts based on the Speex codec manual
reference implementations of CELP 1016A (CELP 3.2a) and LPC 10e.
Linear Predictive Coding (LPC)

References

B.S. Atal, "The History of Linear Prediction," IEEE Signal Processing Magazine, vol. 23, no. 2, March 2006, pp. 154–161.
M. R. Schroeder and B. S. Atal, "Code-excited linear prediction (CELP): high-quality speech at very low bit rates," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 10, pp. 937–940, 1985.

Data compression methods

Information theory

Entropy · Complexity · Redundancy · Lossy · Timeline of information theory

Lossless

Entropy encoding	Shannon–Fano · Shannon–Fano–Elias · Huffman · Adaptive Huffman · Arithmetic · Range · Golomb · Universal (Gamma · Exp-Golomb · Fibonacci · Levenshtein)

Dictionary	RLE · Byte pair encoding · DEFLATE · Lempel–Ziv (LZ77/78 · LZSS · LZW · LZWL · LZO · LZMA · LZX · LZRW · LZJB · LZS · LZT · ROLZ) · Statistical Lempel Ziv

Others	CTW · BWT · PPM · DMC · Delta

Audio

Theory	Companding · Convolution · Dynamic range · Latency · Sampling · Nyquist–Shannon theorem · Sound quality

Audio codec parts	LPC (LAR · LSP) · WLPC · CELP · ACELP · A-law · μ-law · ADPCM · DPCM · MDCT · Fourier transform · Psychoacoustic model

Others	Bit rate (CBR · ABR · VBR) · Speech compression · Sub-band coding

Image

Terms	Color space · Pixel · Chroma subsampling · Compression artifact · Image resolution

Methods	RLE · Fractal · Wavelet · EZW · SPIHT · LP · DCT · Chain code · KLT

Others	Test images · PSNR quality measure · Quantization

Video

Terms	Video characteristics · Frame · Frame rate · Interlace · Frame types · Video quality · Video resolution

Video codec parts	Motion compensation · DCT · Quantization

Others	Video codecs · Rate distortion theory · Bit rate (CBR · ABR · VBR)

See Compression formats for formats and Compression software implementations for codecs

Multimedia compression and container formats

Video

ISO/IEC	MJPEG · Motion JPEG 2000 · MPEG-1 · MPEG-2 (Part 2) · MPEG-4 (Part 2/ASP · Part 10/AVC) · HEVC

ITU-T	H.120 · H.261 · H.262 · H.263 · H.264 · HEVC

Others	AVS · Bink · CineForm · Cinepak · Dirac · DV · Indeo · Microsoft Video 1 · OMS Video · Pixlet · Prores · RealVideo · RTVideo · SheerVideo · Smacker · Sorenson Video, Spark · Theora · VC-1 · VC-2 · VC-3 · VP3 · VP6 · VP7 · VP8 · WMV

Audio

ISO/IEC	MPEG-1 Layer III (MP3) · MPEG-1 Layer II (Multichannel) · MPEG-1 Layer I · AAC · HE-AAC · MPEG Surround · MPEG-4 ALS · MPEG-4 SLS · MPEG-4 DST · MPEG-4 HVXC · MPEG-4 CELP · USAC

ITU-T	G.711 · G.718 · G.719 · G.722 · G.722.1 · G.722.2 · G.723 · G.723.1 · G.726 · G.728 · G.729 · G.729.1

Others	AC-3 · AMR · AMR-WB · AMR-WB+ · Apple Lossless · Asao · ATRAC · CELT · DRA · DTS · EVRC · EVRC-B · FLAC · GSM-HR · GSM-FR · GSM-EFR · iLBC · iSAC · Monkey's Audio · TTA (True Audio) · MT9 · A-law · μ-law · Musepack · OptimFROG · Opus · OSQ · QCELP · RealAudio · RTAudio · SD2 · SHN · SILK · Siren · SMV · Speex · SVOPC · TwinVQ · VMR-WB · Vorbis · WavPack · WMA

Image

ISO/IEC/ITU-T	JPEG · JPEG 2000 · JPEG XR · Lossless JPEG · JBIG · JBIG2 · PNG · TIFF/EP · TIFF/IT

Others	APNG · BMP · DjVu · EXR · GIF · ICER · ILBM · MNG · PCX · PGF · TGA · QTVR · TIFF · WBMP · WebP

Containers

ISO/IEC	MPEG-PS · MPEG-TS · ISO base media file format · MPEG-4 Part 14 · Motion JPEG 2000 · MPEG-21 Part 9

ITU-T	H.222.0 · T.802

Others	3GP and 3G2 · AMV · ASF · AIFF · AVI · AU · Bink · DivX Media Format · DPX · EVO · Flash Video · GXF · M2TS · Matroska · MXF · Ogg · QuickTime File Format · RealMedia · REDCODE RAW · RIFF · Smacker · MOD and TOD · VOB · WAV · WebM

See Compression methods for methods and Compression software implementations for codecs

Categories:

Speech codecs

Wikimedia Foundation. 2010.

Игры ⚽ Нужно сделать НИР?

Look at other dictionaries:

Code excited linear prediction — (CELP) is a speech coding algorithm originally proposed by M.R. Schroeder and B.S. Atal in 1985. At the time, it provided significantly better quality than existing low bit rate algorithms, such as RELP and LPC vocoders (e.g. FS 1015). Along with … Wikipedia
Code-Excited Linear Prediction — Prinzipaufbau des CELP Decoders Code( book) Excited Linear Prediction (CELP) ist ein hybrides Verfahren zur Audiodatenkompression, das die Vorteile der Signalformkodierung mittels Vektorquantisierung und der parametrischen Verfahren vereint. Das… … Deutsch Wikipedia
Code Excited Linear Prediction — Code( book) Excited Linear Prediction (CELP) ist ein hybrides Audiokompressionsverfahren, das die Vorteile der Signalformkodierung mittels Vektorquantisierung und der parametrischen Verfahren vereint. Das Ergebnis ist eine gute Sprachqualität,… … Deutsch Wikipedia
Algebraic Code Excited Linear Prediction — Der Algebraic Code Excited Linear Prediction, abgekürzt ACELP, stellt einen patentierten Vocoder der Firma VoiceAge Corporation dar, welcher bei der verlustbehafteten Kompression von Sprachsignalen im Telekommunikationsbereich Anwendung findet.… … Deutsch Wikipedia
Algebraic code excited linear prediction — (ACELP) is a speech encoding algorithm where a limited set of pulses is distributed as excitation to linear prediction filter.The ACELP method is widely employed in current speech coding standards such as AMR, EFR, AMR WB and ITU T G series… … Wikipedia
Relaxed code excited linear prediction — (RCELP) is a method used in some advanced speech codecs. The RCELP algorithm does not attempt to match the original signal exactly. Instead, it matches a time warped version of this original signal that conforms to a simplified pitch contour … Wikipedia
Code-book Excited Linear Prediction — Code( book) Excited Linear Prediction (CELP) ist ein hybrides Audiokompressionsverfahren, das die Vorteile der Signalformkodierung mittels Vektorquantisierung und der parametrischen Verfahren vereint. Das Ergebnis ist eine gute Sprachqualität,… … Deutsch Wikipedia
Residual-excited linear prediction — (RELP) is an obsolete speech coding algorithm. It was originally proposed in the 1970s and can be seen as an ancestor of Code Excited Linear Prediction (CELP). Unlike CELP however, RELP directly transmits the residual signal. To achieve lower… … Wikipedia
Codebook Excited Linear Prediction — Code( book) Excited Linear Prediction (CELP) ist ein hybrides Audiokompressionsverfahren, das die Vorteile der Signalformkodierung mittels Vektorquantisierung und der parametrischen Verfahren vereint. Das Ergebnis ist eine gute Sprachqualität,… … Deutsch Wikipedia
CELP — Code Excited Linear Prediction (Computing » General) Code Excited Linear Prediction (Computing » Networking) * Cellular Products, Inc. (Business » NASDAQ Symbols) * Code Excited Linear Projection (Academic & Science » Chemistry) … Abbreviations dictionary

Academic Dictionaries and Encyclopedias

Code-excited linear prediction

Contents

Introduction

CELP decoder

CELP encoder

Noise weighting

See also

External links

References

Look at other dictionaries:

Share the article and excerpts

Academic Dictionaries and Encyclopedias

Wikipedia

Code-excited linear prediction

Contents

Introduction

CELP decoder

CELP encoder

Noise weighting

See also

External links

References

Look at other dictionaries:

Share the article and excerpts

Direct link