Pointwise mutual information

Pointwise mutual information

Pointwise mutual information (PMI) (or specific mutual information) is a measure of association used in information theory and statistics.

The PMI of a pair of outcomes "x" and "y" belonging to discrete random variables quantifies the discrepancy between the probability of their coincidence given their joint distribution versus the probability of their coincidence given only their individual distributions and assuming independence. Mathematically,

: SI(x,y) = logfrac{p(x,y)}{p(x)p(y)}.

The mutual information of "X" and "Y" is the expected value of the Specific Mutual Information of all possible outcomes.

The measure is symmetric (SI(x,y)=SI(y,x).) It is zero if "X" and "Y" are independent, and equal to -log("p"("x")) if "X" and "Y" are perfectly associated. Finally, SI(x,y) will increase if "p"("x"|"y") is fixed, but "p"("x") decreases.

External links

* [http://cwl-projects.cogsci.rpi.edu/msr/ Demo at Rensselaer MSR Server] (PMI values normalized to be between 0 and 1)


Wikimedia Foundation. 2010.

Игры ⚽ Поможем решить контрольную работу

Look at other dictionaries:

  • Mutual information — Individual (H(X),H(Y)), joint (H(X,Y)), and conditional entropies for a pair of correlated subsystems X,Y with mutual information I(X; Y). In probability theory and information theory, the mutual information (sometimes known by the archaic term… …   Wikipedia

  • Information theory — Not to be confused with Information science. Information theory is a branch of applied mathematics and electrical engineering involving the quantification of information. Information theory was developed by Claude E. Shannon to find fundamental… …   Wikipedia

  • Semantic similarity — or semantic relatedness is a concept whereby a set of documents or terms within term lists are assigned a metric based on the likeness of their meaning / semantic content. Concretely, this can be achieved for instance by defining a topological… …   Wikipedia

  • Semantic relatedness — Computational Measures of Semantic Relatedness are [http://cwl projects.cogsci.rpi.edu/msr/ publically available] means for approximating the relative meaning of words/documents. These have been used for essay grading by the Educational Testing… …   Wikipedia

  • Sentiment analysis — or opinion mining refers to the application of natural language processing, computational linguistics, and text analytics to identify and extract subjective information in source materials. Generally speaking, sentiment analysis aims to determine …   Wikipedia

  • PMI — may stand for:* Mathematical induction, a principle of mathematics * Private Medical Insurance, a type of insurance * Peoples Ministry Inc., A Christian ministry in Toronto, Ontario, Canada, that operates Peoples Christian Academy * People s… …   Wikipedia

  • Function (mathematics) — f(x) redirects here. For the band, see f(x) (band). Graph of example function, In mathematics, a function associates one quantity, the a …   Wikipedia

  • Erlangen program — An influential research program and manifesto was published in 1872 by Felix Klein, under the title Vergleichende Betrachtungen über neuere geometrische Forschungen . This Erlangen Program ( Erlanger Programm ) mdash; Klein was then at Erlangen… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”