Fisher kernel

Fisher kernel

In mathematics, the Fisher kernel, named in honour of Sir Ronald Fisher, is a kernel. It was introduced in 1998 by Tommi Jaakkola ["Exploiting Generative Models in Discriminative Classifiers" (1998) [http://people.csail.mit.edu/tommi/papers/gendisc.ps PS] , [http://citeseer.ist.psu.edu/jaakkola98exploiting.html Citeseer] ] .

The Fisher kernel combines the advantages of generative statistical models (like the Hidden Markov model) and those of discriminative methods (like Support vector machine):
* generative model can process data of variable length (adding or removing data is well-supported)
* discriminative methods can have flexible criteria and yield better results.

Derivation

Fisher score

The Fisher kernel makes use of the Fisher score, defined as

U_X = abla_{ heta} log P(X| heta)

with heta being a set (vector) of parameters. log P(X| heta) is the log-likelihood of the probabilistic model.

Fisher kernel

The Fisher kernel is defined as

K(X_i, X_j) = U_{X_{i^T I^{-1} U_{X_{j

with "I" the Fisher information matrix

Applications

Information retrieval

The Fisher kernel is the kernel for a generative probabilistic model. As such, it constitutes a bridge between generative and probabilistic models of documents ["Generative vs Discriminative Approaches to Entity Recognition from Label-Deficient Data" (2003) [http://www.xrce.xerox.com/Publications/Attachments/2003-079/2003_079.pdf PDF] , [http://citeseer.ist.psu.edu/goutte03generative.html Citeseer] ] . Fisher kernels exist for numerous models, notably tf–idf ["Deriving TF-IDF as a fisher kernel" (2005) [http://www-cse.ucsd.edu/users/elkan/papers/spire05.pdf PDF] [http://cat.inist.fr/?aModele=afficheN&cpsidt=17416010] ] , Naive Bayes and PLSI.

See also

* Fisher information metric

Notes and references


Wikimedia Foundation. 2010.

Игры ⚽ Поможем решить контрольную работу

Look at other dictionaries:

  • Kernel methods — (KMs) are a class of algorithms for pattern analysis, whose best known elementis the Support Vector Machine (SVM). The general task of pattern analysis is to find and study general types of relations (for example clusters, rankings, principal… …   Wikipedia

  • Kernel trick — In machine learning, the kernel trick is a method for using a linear classifier algorithm to solve a non linear problem by mapping the original non linear observations into a higher dimensional space, where the linear classifier is subsequently… …   Wikipedia

  • Ronald Fisher — R. A. Fisher Born 17 February 1890(1890 02 17) East Finchley, London …   Wikipedia

  • List of mathematics articles (F) — NOTOC F F₄ F algebra F coalgebra F distribution F divergence Fσ set F space F test F theory F. and M. Riesz theorem F1 Score Faà di Bruno s formula Face (geometry) Face configuration Face diagonal Facet (mathematics) Facetting… …   Wikipedia

  • Probabilistic latent semantic analysis — (PLSA), also known as probabilistic latent semantic indexing (PLSI, especially in information retrieval circles) is a statistical technique for the analysis of two mode and co occurrence data. PLSA evolved from Latent semantic analysis, adding a… …   Wikipedia

  • List of statistics topics — Please add any Wikipedia articles related to statistics that are not already on this list.The Related changes link in the margin of this page (below search) leads to a list of the most recent changes to the articles listed below. To see the most… …   Wikipedia

  • Windows Vista — Part of the Microsoft Windows family …   Wikipedia

  • List of Splinter Cell characters — The following is a list of characters found throughout the Splinter Cell series of video games and novels. Contents 1 Major characters 1.1 Lieutenant Commander Samuel Sam Fisher, USN (Ret.) 1.2 Colonel Irving Lambert, USA (KIA) …   Wikipedia

  • Dirac delta function — Schematic representation of the Dirac delta function by a line surmounted by an arrow. The height of the arrow is usually used to specify the value of any multiplicative constant, which will give the area under the function. The other convention… …   Wikipedia

  • Fedora- und Red-Hat-Versionsnamen — Die Linux Distribution Red Hat Linux benannte, wie andere Distributionen auch, die jeweiligen Versionen der Software neben der Versionsnummer auch mit einzelnen Codenamen für jedes größere Release. Diese Tradition wurde auch in das Fedora Core… …   Deutsch Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”