Factored language model

The factored language model (FLM) is an extension of a conventional language model. In an FLM, each word is viewed as a vector of k factors: w_i = {f_i^1, ..., f_i^k}. An FLM provides the probabilistic model P(f | f_1, ..., f_N), where the prediction of a factor f is based on N parents {f_1, ..., f_N}. For example, if w represents a word token and t represents a part-of-speech tag for English, the expression P(w_i | w_{i-2}, w_{i-1}, t_{i-1}) gives a model that predicts the current word token from the two preceding words, as in a traditional trigram model, together with the part-of-speech tag of the previous word.
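To make the example concrete, the following is a minimal sketch, not a reference implementation. It represents each token as a bundle of two factors and estimates P(w_i | w_{i-2}, w_{i-1}, t_{i-1}) by unsmoothed relative frequency. The toy corpus, its tags, and the names `parents` and `prob` are assumptions made for illustration only.

```python
from collections import Counter

# Each token is a bundle of factors: word form "w" and part-of-speech tag "t".
# The corpus and its tags are hypothetical toy data.
corpus = [
    {"w": "the", "t": "DET"},
    {"w": "dog", "t": "NOUN"},
    {"w": "barks", "t": "VERB"},
    {"w": "the", "t": "DET"},
    {"w": "dog", "t": "NOUN"},
    {"w": "sleeps", "t": "VERB"},
]

def parents(tokens, i):
    """Return the parent factors (w_{i-2}, w_{i-1}, t_{i-1}) for position i."""
    return (tokens[i - 2]["w"], tokens[i - 1]["w"], tokens[i - 1]["t"])

context_counts = Counter()
joint_counts = Counter()
for i in range(2, len(corpus)):
    ctx = parents(corpus, i)
    context_counts[ctx] += 1
    joint_counts[(corpus[i]["w"], ctx)] += 1

def prob(word, ctx):
    """Unsmoothed relative-frequency estimate of P(word | ctx)."""
    if context_counts[ctx] == 0:
        return 0.0  # unseen context: no estimate without smoothing
    return joint_counts[(word, ctx)] / context_counts[ctx]

print(prob("barks", ("the", "dog", "NOUN")))
```

Because the estimate is unsmoothed, any unseen combination of parents receives probability zero, which is why smoothing is needed in practice.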

A major advantage of factored language models is that they allow users to specify linguistic knowledge, such as the relationship between word tokens and part-of-speech tags in English, or morphological information (stems, roots, etc.) in Arabic.

As with n-gram models, smoothing techniques are necessary in parameter estimation. In particular, generalized back-off is used in training an FLM.
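One way to picture why back-off must be generalized: an n-gram model has a single back-off path (drop the most distant word first), whereas the parents of an FLM have no inherent order, so any parent may be dropped at each step. The sketch below only enumerates the resulting graph of parent subsets. It performs no discounting or probability estimation, and the function name `backoff_graph` and the parent labels are hypothetical, chosen for illustration.

```python
from itertools import combinations

def backoff_graph(parent_names):
    """Map each subset of parents to the subsets reached by dropping one parent."""
    graph = {}
    for size in range(len(parent_names), -1, -1):
        for subset in combinations(parent_names, size):
            graph[subset] = [
                tuple(p for p in subset if p != dropped) for dropped in subset
            ]
    return graph

# Hypothetical labels for the parents w_{i-2}, w_{i-1}, t_{i-1}.
graph = backoff_graph(("w_prev2", "w_prev1", "t_prev1"))

for node, children in graph.items():
    print(node, "->", children)
```

With three parents the graph has eight nodes, from the full context down to the empty context. A training procedure would then have to choose which of these paths to follow, or how to combine them.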

References

J. Bilmes and K. Kirchhoff (2003). "Factored Language Models and Generalized Parallel Backoff". Human Language Technology Conference. http://ssli.ee.washington.edu/people/bilmes/mypapers/hlt03.pdf


Wikimedia Foundation. 2010.
