Ewens's sampling formula

In population genetics, Ewens's sampling formula, introduced by Warren Ewens, states that under certain conditions (specified below), if a random sample of n gametes is taken from a population and classified according to the gene at a particular locus, then the probability that there are a_1 alleles represented once in the sample, a_2 alleles represented twice, and so on, is

:\operatorname{Pr}(a_1,\dots,a_n)={n! \over \theta(\theta+1)\cdots(\theta+n-1)}\prod_{j=1}^n{\theta^{a_j} \over j^{a_j} a_j!},

for some positive number θ, whenever a_1, ..., a_n is a sequence of nonnegative integers such that

:a_1+2a_2+3a_3+\cdots+na_n=n.
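
The formula can be evaluated directly. The following Python sketch is illustrative only: the function names `ewens_probability` and `allele_count_vectors` are hypothetical, and exact rational arithmetic is chosen here merely so that the probabilities can be checked to sum to 1.

```python
from fractions import Fraction
from math import factorial


def ewens_probability(a, theta):
    """Probability of the count vector a = (a_1, ..., a_n) for a given theta > 0.

    a[j - 1] is the number of alleles represented exactly j times, so the
    vector must satisfy a_1 + 2*a_2 + ... + n*a_n = n with n = len(a).
    """
    n = sum(j * a_j for j, a_j in enumerate(a, start=1))
    if len(a) != n:
        raise ValueError("a must have length n = a_1 + 2*a_2 + ... + n*a_n")
    theta = Fraction(theta)
    # Denominator theta * (theta + 1) * ... * (theta + n - 1).
    rising = Fraction(1)
    for i in range(n):
        rising *= theta + i
    prob = Fraction(factorial(n)) / rising
    # Product over j of theta^{a_j} / (j^{a_j} * a_j!).
    for j, a_j in enumerate(a, start=1):
        prob *= theta ** a_j / (Fraction(j) ** a_j * factorial(a_j))
    return prob


def allele_count_vectors(n):
    """Yield every (a_1, ..., a_n) with a_1 + 2*a_2 + ... + n*a_n = n."""
    def parts(remaining, largest):
        # Enumerate partitions of `remaining` into parts of size <= largest.
        if remaining == 0:
            yield []
            return
        for p in range(min(remaining, largest), 0, -1):
            for rest in parts(remaining - p, p):
                yield [p] + rest

    for partition in parts(n, n):
        yield tuple(partition.count(j) for j in range(1, n + 1))
```

For example, `ewens_probability((1, 1, 0), 1)` evaluates to `Fraction(1, 2)`: with n = 3 and θ = 1, one allele seen once and one seen twice has probability 1/2.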

The phrase "under certain conditions" used above must be made precise. The assumptions are: (1) the sample size n is small compared with the size of the whole population; (2) the population is in statistical equilibrium under mutation and genetic drift, and the role of selection at the locus in question is negligible; and (3) every mutant allele is novel. (See also idealised population.)

This is a probability distribution on the set of all partitions of the integer n. Among probabilists and statisticians it is often called the Ewens distribution.

In the limit θ → 0, the probability that all n genes are the same approaches 1. When θ = 1, the distribution is precisely that of the integer partition induced by the cycle lengths of a uniformly distributed random permutation. As θ → ∞, the probability that no two of the n genes are the same approaches 1.
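
As a small worked check of these special cases, take n = 3. The three admissible count vectors are (3,0,0), (1,1,0) and (0,0,1), and substituting them into the sampling formula gives the following (a hand calculation added for illustration, not part of the original sources):

```latex
\begin{align*}
\Pr(3,0,0) &= \frac{3!}{\theta(\theta+1)(\theta+2)}\cdot\frac{\theta^{3}}{1^{3}\,3!}
            = \frac{\theta^{2}}{(\theta+1)(\theta+2)},\\
\Pr(1,1,0) &= \frac{3!}{\theta(\theta+1)(\theta+2)}\cdot\frac{\theta}{1}\cdot\frac{\theta}{2}
            = \frac{3\theta}{(\theta+1)(\theta+2)},\\
\Pr(0,0,1) &= \frac{3!}{\theta(\theta+1)(\theta+2)}\cdot\frac{\theta}{3}
            = \frac{2}{(\theta+1)(\theta+2)}.
\end{align*}
```

The three values sum to (θ² + 3θ + 2)/((θ+1)(θ+2)) = 1. Letting θ tend to 0 puts all the mass on (0,0,1), where all three genes are the same; θ = 1 gives 1/6, 1/2 and 1/3, matching the 1 identity, 3 transpositions and 2 three-cycles among the 6 permutations of three objects; and letting θ tend to infinity puts all the mass on (3,0,0), where no two genes are the same.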

This family of probability distributions has the property that if, after the sample of n is taken, m of the n gametes are chosen without replacement, then the resulting probability distribution on the set of all partitions of the smaller integer m is exactly what the formula above would give with m in place of n.
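
This consistency property can be checked numerically for small samples. The sketch below is illustrative and all function names in it are hypothetical. It handles the case m = n − 1 by deleting one uniformly chosen gamete; repeating that step reaches any smaller m, since deleting gametes one at a time uniformly at random is the same as keeping a uniformly chosen subset.

```python
from fractions import Fraction
from math import factorial


def ewens_probability(a, theta):
    """Sampling-formula probability of the count vector a = (a_1, ..., a_n)."""
    n = len(a)
    theta = Fraction(theta)
    prob = Fraction(factorial(n))
    for i in range(n):
        prob /= theta + i
    for j, a_j in enumerate(a, start=1):
        prob *= theta ** a_j / (Fraction(j) ** a_j * factorial(a_j))
    return prob


def allele_count_vectors(n):
    """Yield every (a_1, ..., a_n) with a_1 + 2*a_2 + ... + n*a_n = n."""
    def parts(remaining, largest):
        if remaining == 0:
            yield []
            return
        for p in range(min(remaining, largest), 0, -1):
            for rest in parts(remaining - p, p):
                yield [p] + rest

    for partition in parts(n, n):
        yield tuple(partition.count(j) for j in range(1, n + 1))


def remove_one_gamete(a):
    """Distribution of the count vector after deleting one uniformly chosen gamete."""
    n = len(a)
    out = {}
    for j, a_j in enumerate(a, start=1):
        if a_j == 0:
            continue
        b = list(a)
        b[j - 1] -= 1        # an allele seen j times loses one copy ...
        if j > 1:
            b[j - 2] += 1    # ... and is now seen j - 1 times
        b = tuple(b[: n - 1])  # the reduced sample has n - 1 gametes
        # The deleted gamete carries such an allele with probability j * a_j / n.
        out[b] = out.get(b, 0) + Fraction(j * a_j, n)
    return out


def subsampled_distribution(n, theta):
    """Distribution on count vectors of size n - 1 obtained by subsampling."""
    dist = {}
    for a in allele_count_vectors(n):
        p_a = ewens_probability(a, theta)
        for b, q in remove_one_gamete(a).items():
            dist[b] = dist.get(b, 0) + p_a * q
    return dist
```

Comparing `subsampled_distribution(n, theta)` with `ewens_probability` evaluated on every count vector of size n − 1 should give identical exact fractions.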

The Ewens distribution arises naturally from the Chinese restaurant process.
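
A minimal simulation sketch of this connection follows, assuming the standard one-parameter form of the process: when i customers are already seated, the next customer starts a new table with probability θ/(θ + i) and otherwise joins an existing table with probability proportional to its current size. Tables play the role of alleles and customers the role of gametes. The function name `chinese_restaurant_sample` and its parameters are hypothetical.

```python
import random
from collections import Counter


def chinese_restaurant_sample(n, theta, rng=None):
    """Seat n customers and return (a_1, ..., a_n), where a_j counts tables of size j."""
    rng = rng or random.Random()
    table_sizes = []
    for i in range(n):  # i customers are already seated
        if rng.random() * (theta + i) < theta:
            # New table, chosen with probability theta / (theta + i).
            table_sizes.append(1)
        else:
            # Existing table, chosen with probability proportional to its size.
            k = rng.choices(range(len(table_sizes)), weights=table_sizes)[0]
            table_sizes[k] += 1
    counts = Counter(table_sizes)
    return tuple(counts.get(j, 0) for j in range(1, n + 1))
```

Calling `chinese_restaurant_sample(10, 1.5, random.Random(0))` draws one count vector for a sample of 10; tabulating many such draws gives empirical frequencies that can be compared with the sampling formula.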

References

* Warren Ewens, "The sampling theory of selectively neutral alleles", "Theoretical Population Biology", volume 3, pages 87–112, 1972.
* J.F.C. Kingman, "Random partitions in population genetics", "Proceedings of the Royal Society of London, Series A, Mathematical and Physical Sciences", volume 361, number 1704, 1978.
* S. Tavaré and W. J. Ewens, "The Ewens sampling formula" (http://www.cs.cmu.edu/~epxing/CBML/coalescent/esfrep.ps). In "Multivariate discrete distributions" by N.L. Johnson, S. Kotz, and N. Balakrishnan (eds), 1997, Wiley.

See also

* Coalescent theory
* Unified neutral theory of biodiversity


Wikimedia Foundation. 2010.
