Zipf–Mandelbrot law

Probability distribution
name = Zipf–Mandelbrot
type = mass
parameters = N \in \{1,2,3,\ldots\} (integer); q \in [0,\infty) (real); s > 0 (real)
support = k \in \{1,2,\ldots,N\}
pdf = \frac{1/(k+q)^s}{H_{N,q,s}}
cdf = \frac{H_{k,q,s}}{H_{N,q,s}}
mean = \frac{H_{N,q,s-1}}{H_{N,q,s}} - q
mode = 1

In probability theory and statistics, the Zipf–Mandelbrot law is a discrete probability distribution. Also known as the Pareto-Zipf law, it is a power-law distribution on ranked data. It is named after the linguist George Kingsley Zipf, who suggested a simpler distribution called Zipf's law, and the mathematician Benoît Mandelbrot, who subsequently generalized it.

The probability mass function is given by:

:f(k;N,q,s)=\frac{1/(k+q)^s}{H_{N,q,s}}

where H_{N,q,s} is given by:

:H_{N,q,s}=\sum_{i=1}^N \frac{1}{(i+q)^s}

which may be thought of as a generalization of a harmonic number. In the limit as N approaches infinity (with s > 1, so that the sum converges), this becomes the Hurwitz zeta function \zeta(s,q+1), using the convention \zeta(s,a)=\sum_{n=0}^{\infty}(n+a)^{-s}. For finite N and q=0 the Zipf–Mandelbrot law becomes Zipf's law. For infinite N and q=0 it becomes a zeta distribution.
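
The definitions above translate directly into code. The following is a minimal Python sketch, not a reference implementation: the function names are illustrative, and the parameter values N = 10, q = 2.7, s = 1.1 are arbitrary choices made only for the demonstration. It evaluates the probability mass function, the cumulative distribution function and the mean by direct summation.

```python
import math


def generalized_harmonic(n, q, s):
    """H_{n,q,s} = sum_{i=1}^{n} 1 / (i + q)^s, computed by direct summation."""
    return math.fsum((i + q) ** -s for i in range(1, n + 1))


def zipf_mandelbrot_pmf(k, N, q, s):
    """f(k; N, q, s) for an integer rank k; zero outside the support 1..N."""
    if not 1 <= k <= N:
        return 0.0
    return (k + q) ** -s / generalized_harmonic(N, q, s)


def zipf_mandelbrot_cdf(k, N, q, s):
    """P(K <= k) = H_{k,q,s} / H_{N,q,s}, with k clamped to the range 0..N."""
    k = max(0, min(k, N))
    return generalized_harmonic(k, q, s) / generalized_harmonic(N, q, s)


def zipf_mandelbrot_mean(N, q, s):
    """Mean rank, H_{N,q,s-1} / H_{N,q,s} - q."""
    return generalized_harmonic(N, q, s - 1) / generalized_harmonic(N, q, s) - q


# Arbitrary illustrative parameters (assumed, not taken from any data set).
N, q, s = 10, 2.7, 1.1
probs = [zipf_mandelbrot_pmf(k, N, q, s) for k in range(1, N + 1)]
direct_mean = math.fsum(k * p for k, p in enumerate(probs, start=1))

print("sum of probabilities:", math.fsum(probs))
print("mean (closed form):  ", zipf_mandelbrot_mean(N, q, s))
print("mean (direct sum):   ", direct_mean)
```

The closed form for the mean follows from writing k = (k+q) - q inside the sum over k of k/(k+q)^s, which is why the two printed means agree up to rounding. Setting q = 0 in these functions gives Zipf's law for finite N.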

Applications

The distribution of words ranked by their frequency in a random text corpus generally follows a power-law distribution, known as Zipf's law.

If one plots the frequency rank of the words in a large corpus of text against their number of occurrences (or their relative frequencies), one obtains a power-law distribution with an exponent close to one (but see Gelbukh and Sidorov 2001).
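
As an illustration of the rank-frequency idea, the sketch below ranks the words of a toy sentence by count and scores candidate parameters (q, s) by log-likelihood. It rests on several assumptions that do not come from the article: the function names are hypothetical, the text is split naively on whitespace, N is taken to be the number of distinct words, and each word occurrence is treated as an independent draw of its rank, which is a simplification.

```python
import math
from collections import Counter


def rank_frequencies(text):
    """Return word counts sorted from most to least frequent (rank 1 first)."""
    counts = Counter(text.lower().split())
    return sorted(counts.values(), reverse=True)


def zm_log_likelihood(counts, q, s):
    """Log-likelihood of ranked counts under a Zipf-Mandelbrot law.

    Assumes N = len(counts) and that the word at rank k was observed
    counts[k - 1] times (a simplifying assumption for illustration).
    """
    N = len(counts)
    log_h = math.log(math.fsum((i + q) ** -s for i in range(1, N + 1)))
    return math.fsum(
        c * (-s * math.log(k + q) - log_h)
        for k, c in enumerate(counts, start=1)
    )


# Toy corpus, far too small for a meaningful fit; used only to show the steps.
toy_text = "the cat and the dog and the bird"
counts = rank_frequencies(toy_text)
print("ranked counts:", counts)

# Compare a few candidate exponents with q fixed at 0 (plain Zipf's law).
for s in (0.5, 1.0, 1.5):
    print("s =", s, "log-likelihood =", zm_log_likelihood(counts, 0.0, s))
```

On a real corpus one would tokenize more carefully and search over q and s with a numerical optimizer; the loop over three fixed exponents only shows how candidates can be compared.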

References and links

* B. Mandelbrot (1965). "Information Theory and Psycholinguistics". In B.B. Wolman and E. Nagel (eds.), Scientific psychology. Basic Books. Reprinted as
** B. Mandelbrot (1968) [1965]. "Information Theory and Psycholinguistics". In R.C. Oldfield and J.C. Marshall (eds.), Language. Penguin Books.

* [http://arxiv.org/abs/physics/9901035 Z. K. Silagadze: Citations and the Zipf-Mandelbrot's law]
* [http://www.nist.gov/dads/HTML/zipfslaw.html NIST: Zipf's law]
* [http://www.nslij-genetics.org/wli/zipf/index.html W. Li's References on Zipf's law]
* [http://www.gelbukh.com/CV/Publications/2001/CICLing-2001-Zipf.htm Gelbukh and Sidorov 2001: Zipf and Heaps Laws’ Coefficients Depend on Language]


Wikimedia Foundation. 2010.
