XHTML+Voice

XHTML+Voice

XHTML+Voice (commonly X+V) is an XML language for describing multimodal user interfaces. The two essential modalities are visual and auditory. Visual interaction is defined like most current web pages via XHTML. Auditory components are defined by a subset of Voice XML. Interfacing the voice and visual components of X+V documents is accomplished through a combination of ECMAScript, JavaScript, and XML Events.

Voice input

Voice input or speech recognition is based on grammars that define the set of possible input text. In contrast to a probabilistic approach employed by popular software packages such as Dragon Naturally Speaking, the grammar based approach provides the recognizer with important contextual information that significantly boosts recognition accuracy. The specific formats for grammars include JSGF.

Voice output

Voice output or speech synthesis can read any string at virtually any time. Pitch, volume, and other charactaristics can be customized using CSS and Speech Synthesis Markup Language (SSML) however the Opera web browser doesn't currently support all these features.

MIME types

The previously recommended MIME type for any X+V document is application/xhtml+voice+xml which is what the Opera browser uses. Opera will also interpret X+V documents served as text/xml. The current recommended MIME type for any X+V document is application/xv+xml. Since most web servers associate the .xml extension with text/xml, an xml extension is a fairly safe way of making your static X+V document files browsable.

X+V-enabled browsers

The most commonly used X+V browser is the Opera browser. Users of the Opera browser can enable X+V support through steps described at [http://www.opera.com/voice/ http://www.opera.com/voice/] . Voice is not yet supported in Opera Mini or on platforms other than Windows.

Detecting support for X+V is best done from the server by checking the HTTP header "Accept" for the MIME type application/xhtml+voice+xml. Here is some PHP code that returns "true" if and only if the requesting browser supports XHTML+Voice: 0)) { echo "true"; } } else echo "false"; ?>

Related Technology

Speech Application Language Tags(SALT) is a very similar format developed by Microsoft in 2001 to compete with VoiceXML and XHTML+Voice. SALT also provides users with multimodal support including grammar based recognition and speech synthesized output. The main differences are in the providers of support. Many different companies support VoiceXML and XHTML+Voice by providing various development tools and in particular IBM and Opera Software. SALT is supported almost exclusively from Microsoft by products such as the Microsoft Speech Application SDK and Microsoft Speech Server.

External links

* [http://www.voicexml.org/specs/multimodal/x+v/12/ XHTML+Voice v1.2]
* [http://dev.opera.com/articles/voice/ Voice - Opera Developer Community]
* [ftp://ftp.software.ibm.com/software/pervasive/info/multimodal/XHTML_voice_programmers_guide.pdf XHTML+Voice Programmer's Guide]
* [http://www.opera.com/download/ Download Opera Web Browser]
* [http://davinci.newcs.uwindsor.ca/~speechweb/movie.mov Video demonstration using XHTML+Voice]
* [http://cs.uwindsor.ca/~speechweb/ The SpeechWeb Project]
* [http://www.apps.ietf.org/rfc/rfc4374.txt RFC 4374 on MIME type]


Wikimedia Foundation. 2010.

Игры ⚽ Поможем написать реферат

Look at other dictionaries:

  • Comparison of layout engines (XHTML) — HTML HTML and HTML5 Dynamic HTML XHTML XHTML Mobile Profile and C HTML Canvas element Character encodings Document Object Model Font family HTML editor HTML element HTML Frames HTML5 video HTML scrip …   Wikipedia

  • VXML — VoiceXML (Voice Extensible Markup Language) ist eine XML Applikation, die der Beschreibung von Dialog Abläufen in einem Sprachdialogsystem dient. Sie ist besonders für Telefonanwendungen entwickelt worden. Seit März 2004 ist VoiceXML eine… …   Deutsch Wikipedia

  • VoiceXML — (Voice Extensible Markup Language) ist eine XML Applikation, die der Beschreibung von Dialog Abläufen in einem Sprachdialogsystem dient. Sie ist besonders für Telefonanwendungen entwickelt worden. Seit März 2004 ist VoiceXML eine Empfehlung des… …   Deutsch Wikipedia

  • Multimodal interaction — Not to be confused with Mode (computer interface). Multimodal interaction provides the user with multiple modes of interfacing with a system. A multimodal interface provides several distinct tools for input and output of data. Contents 1… …   Wikipedia

  • Opera (web browser) — Opera Opera 11.51 displaying Wikipedia on Windows 7. Developer(s …   Wikipedia

  • Опера (браузер) — Opera Скриншот браузера Opera 9.6 Тип Браузер Разработчик Opera Software ASA Написана на …   Википедия

  • Opera — Иное название этого понятия  «Опера»; см. также другие значения. Opera …   Википедия

  • T. V. Raman — is a blind computer scientist born and raised from Pune, India.cite web|url=http://hci.stanford.edu/cs547/abstracts/94 95/941028 raman.html|title=PCD Seminar 10/28/94 T.V. Raman|accessdate=2007 11 07|publisher=Stanford University] His… …   Wikipedia

  • История браузера Opera — Основная статья: Opera История браузера Opera началась в 1994 году, прототипом будущего браузера стала разработка компании Televerket  крупнейшей телекоммуникационной компании Норвегии. В 1995 году проект выделился в разработку специально… …   Википедия

  • T. V. Raman — (* 1965 in Pune, Maharashtra, Indien) ist Autor und Informatiker. Er ist bekannt für seine Arbeit im Bereich des barrierefreien Internet.[1][2] Inhaltsverzeichnis 1 Biographie 2 …   Deutsch Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”