Source-filter model of speech production

The source-filter model of speech production represents speech as the combination of a sound source, such as the vocal cords, and a filter, the vocal tract (together with its radiation characteristic).

Although it is only an approximation, the model is widely used because of its relative simplicity. To varying degrees, different phonemes can be distinguished by the properties of their source(s) and by their spectral shape. Voiced sounds (e.g., vowels) have at least one source, the mostly periodic glottal excitation, which can be approximated by an impulse train in the time domain and by harmonics in the frequency domain, and a filter that depends on, e.g., tongue position and lip protrusion. Fricatives, on the other hand, have at least one source due to turbulent noise produced at a constriction in the oral cavity (e.g., the sounds represented orthographically by "s" and "f"). So-called "voiced fricatives" (such as "z" and "v") have two sources: one at the glottis and one at the supraglottal constriction.
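
The voiced case described above can be sketched in a few lines of Python. This is a minimal illustration, not a reference synthesizer: the source is an idealized impulse train, the vocal tract is approximated by a cascade of two-pole resonators (one per formant), and the radiation characteristic is ignored. The function names and the formant frequencies and bandwidths are assumptions chosen for illustration (roughly an open vowel), not values taken from the article.

```python
import math


def impulse_train(f0_hz, duration_s, fs_hz):
    """Idealized glottal source: one unit impulse per pitch period."""
    n_samples = int(duration_s * fs_hz)
    period = int(round(fs_hz / f0_hz))
    return [1.0 if i % period == 0 else 0.0 for i in range(n_samples)]


def resonator(signal, freq_hz, bandwidth_hz, fs_hz):
    """Two-pole resonator standing in for one formant of the vocal tract."""
    radius = math.exp(-math.pi * bandwidth_hz / fs_hz)  # pole radius < 1, so stable
    theta = 2.0 * math.pi * freq_hz / fs_hz             # pole angle
    a1 = -2.0 * radius * math.cos(theta)
    a2 = radius * radius
    out = []
    y1 = y2 = 0.0
    for x in signal:
        # Difference equation: y[n] = x[n] - a1*y[n-1] - a2*y[n-2]
        y = x - a1 * y1 - a2 * y2
        out.append(y)
        y2, y1 = y1, y
    return out


def synthesize_vowel(f0_hz=100.0,
                     formants=((700.0, 80.0), (1200.0, 90.0), (2600.0, 120.0)),
                     duration_s=0.5,
                     fs_hz=16000):
    """Source (impulse train) passed through the filter (formant cascade).

    `formants` holds illustrative (frequency, bandwidth) pairs in Hz.
    """
    signal = impulse_train(f0_hz, duration_s, fs_hz)
    for freq_hz, bandwidth_hz in formants:
        signal = resonator(signal, freq_hz, bandwidth_hz, fs_hz)
    return signal


samples = synthesize_vowel()
```

Changing `f0_hz` alters only the source (the pitch), while changing `formants` alters only the filter (the vowel quality), which is the separation the model relies on. A fricative could be sketched the same way by replacing the impulse train with random noise.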

The source-filter model is used in both speech synthesis and speech analysis, and is related to linear prediction. The development of the model is due, in large part, to the early work of Gunnar Fant, although others, notably Ken Stevens, have also contributed substantially to the models underlying acoustic analysis of speech and speech synthesis.
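
The relation to linear prediction can be illustrated with a short sketch. If the filter is assumed to be all-pole, its coefficients can be estimated from the signal itself by predicting each sample from the preceding ones. The example below uses the autocorrelation method with the Levinson-Durbin recursion. The function names are hypothetical, and no windowing or pre-emphasis is applied, so it should be read as a bare outline rather than a production analysis routine.

```python
def autocorrelation(x, max_lag):
    """Autocorrelation of x for lags 0..max_lag."""
    n = len(x)
    return [sum(x[i] * x[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the linear-prediction normal equations.

    Returns (a, err) where a[0] == 1 and the predictor is
    x[n] ~ -(a[1]*x[n-1] + ... + a[order]*x[n-order]);
    err is the remaining prediction-error energy.
    """
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err                      # reflection coefficient
        new_a = a[:]
        for j in range(1, i):
            new_a[j] = a[j] + k * a[i - j]
        new_a[i] = k
        a = new_a
        err *= (1.0 - k * k)
    return a, err


def lpc(x, order):
    """Estimate all-pole filter coefficients of x (autocorrelation method)."""
    r = autocorrelation(x, order)
    return levinson_durbin(r, order)
```

In source-filter terms, the returned coefficients describe the filter, and passing the signal through the corresponding inverse filter leaves a residual that approximates the source.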


Wikimedia Foundation. 2010.
