Audio-visual speech recognition

Audio-visual speech recognition

Audio visual speech recognition (AVSR) is a technique that uses image processing capabilities in lip reading to aid speech recognition systems in recognizing undeterministic phones or giving preponderance among near probability decisions.

Each system lip reading and speech recognition works separately then their results are mixed at the stage of feature fusion.

External links

* [http://www.research.ibm.com/AVSTG IBM Research - Audio Visual Speech Technologies]
* [http://www.intel.com/technology/computing/applications/avcsr.htm Intel Applications Research]


Wikimedia Foundation. 2010.

Игры ⚽ Нужен реферат?

Look at other dictionaries:

  • Speech recognition — For the human linguistic concept, see Speech perception. The display of the Speech Recognition screensaver on a PC, in which the character responds to questions, e.g. Where are you? or statements, e.g. Hello. Speech recognition (also known as… …   Wikipedia

  • Recognition (disambiguation) — Recognition is identification of something already known or acknowledgement of something as valid. The term may have the following specialized meanings.*Recognition (sociology), an acknowledgement of merits. *Recognition (diplomacy), acceptance… …   Wikipedia

  • Speech Application Programming Interface — The Speech Application Programming Interface or SAPI is an API developed by Microsoft to allow the use of speech recognition and speech synthesis within Windows applications. To date a number of versions of the API have been released, which have… …   Wikipedia

  • Speech synthesis — Stephen Hawking is one of the most famous people using speech synthesis to communicate Speech synthesis is the artificial production of human speech. A computer system used for this purpose is called a speech synthesizer, and can be implemented… …   Wikipedia

  • Audio engineering — An audio engineer at an audio console. An audio engineer, also called audio technician, audio technologist or sound technician, is a specialist in a skilled trade that deals with the use of machinery and equipment for the recording, mixing and… …   Wikipedia

  • Speech repetition — Children copy with their own mouths the words spoken by the mouths of those around them. This enables them to learn the pronunciation of words not already in their vocabulary. Speech repetition is the saying by one individual of the spoken… …   Wikipedia

  • Microsoft Speech API — This article is about the Speech API. For other uses, see SAPI (disambiguation). The Speech Application Programming Interface or SAPI is an API developed by Microsoft to allow the use of speech recognition and speech synthesis within Windows… …   Wikipedia

  • Reverse speech — This article is about the theory of reversed messages in normal speech. For hidden messages in recordings, see backmasking. For the act of speaking backwards, see phonetic reversal. Reverse speech is a pseudoscience[1][2][3] first advocated by… …   Wikipedia

  • VBScript — Visual Basic Scripting Edition (обычно просто VBScript) скриптовый язык программирования, интерпретируемый компонентом Windows Script Host. Он широко используется при создании скриптов в операционных системах семейства Microsoft Windows. VBScript …   Википедия

  • Technical features new to Windows Vista — This article is part of a series on Windows Vista New features Overview Technical and core system Security and safety Networking technologies I/O technologies Management and administration Removed features …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”