Speech analytics

Speech analytics

Speech Analytics is a term used to describe automatic methods of analyzing speech to extract useful information about the speech content or the speakers. Although it often includes elements of automatic speech recognition, where the identities of spoken words or phrases are determined, it may also include analysis of one or more of the following:
* the topic(s) being discussed
* the identities of the speaker(s)
* the genders of the speakers
* the emotional character of the speech
* the amount and locations of speech versus non-speech (e.g. background noise or silence)

One use of speech analytics applications is to spot spoken keywords or phrases, either as real-time alerts on live audio or as a post-processing step on recorded speech. This technique is also known as audio mining. Other uses include categorization of speech, for example in the contact center environment, to identify calls from unsatisfied customers. Speech analytics technology may combine results from different techniques to achieve its aims. For example knowledge about where certain keywords were spoken in a customer telephone conversation could be combined with knowledge about which speaker (customer or contact center agent) spoke the words and perhaps knowledge of how often the two speakers were talking at the same time as each other.

Speech Analytics in contact centers can be used to extract critical business intelligence that would otherwise be lost. By analyzing and categorizing recorded phone conversations between companies and their customers, useful information can be discovered relating to strategy, product, process, and operational issues. This information gives decision-makers insight into what customers really think about their company so that they can quickly react.

Technology

There are two main approaches "under the hood", phonetic approach and LVCSR. There are some variations on top of these technologies, such as "direct" analysis which is built on top of LVCSR. Some Speech Analytics vendors use the "Engine" of a 3rd party. The biggest names creating core engines today are IBM and Nuance/Scansoft.weasel-inline|date=August 2008 There are however smaller players such as SER and Philips, and there are some Speech Analytics vendors that have developed there own proprietary engine (such as Nexidia).

Phonetic

This is the fastest approach, mostly because the size of the grammar is very small. The basic recognition unit is a phoneme. There are only few tens unique phoneme in most languages, and the output of this recognition is a stream (text) of phonemes. The speed may be an advantage, and also the fact that since it is at its core dealing with phonemes, not words, it may correctly handle any "new" word that is not part of the language (think names, product names etc').

LVCSR

Much slower, since the basic unit is a word, it needs to have hundred of thousands of words to match the audio against. The output however is a stream of words, making it easier to work with.

Quality

The best LVCSR engines may reach about 50% WER (word error rate). This is considered top of the line performance in today's standards.weasel-inline|date=August 2008 Notice however that this means that every second word you read in the output is wrong! Still, it may provide more then enough accuracy for statistical analytics. The quality of the phonetic engines is considerably lower. While hard to compare apples to apples, it is in the range of what would be 20% WER.

ee also

* Customer intelligence


Wikimedia Foundation. 2010.

Игры ⚽ Нужна курсовая?

Look at other dictionaries:

  • Speech recognition — For the human linguistic concept, see Speech perception. The display of the Speech Recognition screensaver on a PC, in which the character responds to questions, e.g. Where are you? or statements, e.g. Hello. Speech recognition (also known as… …   Wikipedia

  • Noisy text analytics — is a process of information extraction whose goal is to automatically extract structured or semistructured information from noisy unstructured text data. While Text analytics is a growing and mature field that has great value because of the huge… …   Wikipedia

  • Predictive analytics — encompasses a variety of techniques from statistics and data mining that analyze current and historical data to make predictions about future events. Such predictions rarely take the form of absolute statements, and are more likely to be… …   Wikipedia

  • Customer intelligence — Contents 1 What it is 2 CI and CRM 3 Process 4 Example sources of data for CI 5 Benefits …   Wikipedia

  • Customer dynamics — is an emerging theory on customer business relationships that describes the ongoing interchange of information and transactions between customers and organizations. These exchanges occur over a wide range of communication channels, such as phone …   Wikipedia

  • Audio mining — is a technique by which the content of an audio signal can be automatically analysed and searched. It is most commonly used in the field of automatic speech recognition, where the analysis tries to identify any speech within the audio. The audio… …   Wikipedia

  • Noble Systems Corporation — Contents 1 Noble Systems Corporation 2 Corporation History and Acquisitions 3 Customers and Markets 4 Partners …   Wikipedia

  • List of acquisitions by Google — Google s logo Google is a computer software and a web search engine company. Each acquisition listed is for the respective company in its entirety, unless otherwise specified. The acquisition date listed is the date of the agreement between… …   Wikipedia

  • Aristotle — /ar euh stot l/, n. 384 322 B.C., Greek philosopher: pupil of Plato; tutor of Alexander the Great. * * * born 384, Stagira died 322 BC, Chalcis Greek philosopher and scientist whose thought determined the course of Western intellectual history… …   Universalium

  • Alcibiades — Infobox Military Person name= Alcibiades Ἀλκιβιάδης Alkibiádēs caption= Alcibiades allegiance= Athens (415–412 BC Sparta) rank= general (strategos) commands= nickname= lived= 450–404 BC placeofbirth= Athens placeofdeath=… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”