Moran process

A Moran process, named after Patrick Moran, is a stochastic process used in biology to describe finite populations. It can be used to model variety-increasing processes such as mutation as well as variety-reducing effects such as genetic drift and natural selection. The process can describe the probabilistic dynamics in a finite population of constant size N in which two alleles A and B are competing for dominance. The two alleles are considered to be true replicators (i.e. entities that make copies of themselves). In each time step a random individual (which is of either type A or B) is chosen for reproduction and a random individual is chosen for death; thus ensuring that the population size remains constant. To model selection, one type has to have a higher fitness and is thus more likely to be chosen for reproduction. The same individual can be chosen for death and for reproduction in the same step.

1 Neutral drift
2 Selection
3 Rate of evolution
4 Literature
5 External links

Neutral drift

Neutral drift is the idea that a neutral mutation can spread throughout a population, so that eventually the original allele is lost. A neutral mutation does not bring any fitness advantage or disadvantage to its bearer. The simple case of the Moran process can describe this phenomenon.

If the number of A individuals is given by i then the Moran process is defined on the state space i = 0, ..., N. Since the number of A individuals can change at most by one at each time step, a transition exists only between state i and state i − 1, i and i + 1. Thus the transition matrix of the stochastic process is tri-diagonal in shape and the transition probabilities are

$\begin{align} P_{0,0}&=1\\ P_{i,i-1} &= \frac{N-i}{N} \frac{i}{N}\\ P_{i,i} &= 1- P_{i,i-1} - P_{i,i+1}\\ P_{i,i+1} &= \frac{i}{N} \frac{N-i}{N}\\ P_{N,N}&=1. \end{align}$

The entry $P i, j$ denotes the probability to go from state i to state j. To understand the formulas for the transition probabilities one has to look at the definition of the process which states that always one individual will be chosen for reproduction and one is chosen for death. Once the A individuals have died out, they will never be reintroduced into the population since the process does not model mutations (A cannot be reintroduced into the population once it has died out and vice versa) and thus $P 0,0 = 1$ . For the same reason the population of A individuals will always stay N once they have reached that number and taken over the population and thus $P N, N = 1$ . The states 0 and N are called absorbing while the states 1, ..., N − 1 are called transient. The intermediate transition probabilities can be explained by considering the first term to be the probability to choose the individual whose abundance will increase by one and the second term the probability to choose the other type for death. Obviously, if the same type is chosen for reproduction and for death, then the abundance of one type does not change.

Eventually the population will reach one of the absorbing states and then stay there forever. In the transient states, random fluctuations will occur but eventually the population of A will either go extinct or reach fixation. This is one of the most important differences to deterministic processes which cannot model random events. The expected value and the variance of the number of A individuals X(t) at timepoint t can be computed when an initial state X(0) = i is given:

$\begin{align} E[X(t)|X(0) = i] &= i \\ Var(X(t)|X(0) = i) &= 2i/N(1-i/N) \frac{1-(1-2/N^2)^t}{2/N^2} \end{align}$

For a mathematical derivation of the equation above, click on "show" to reveal

For the expected value the calculation runs as follows. Writing $p = i / N,$

$\begin{align} E [X(t) | X(t-1) = i] &= (i-1)P_{i,i-1} + iP_{i,i} + (i+1)P_{i,i+1}\\ &= 2ip(1-p) + p(p^2 + (1-p)^2) \\ &= i. \end{align}$

Writing $Y = X (t)$ and $Z = X (t - 1)$ , and applying the law of total expectation, $E [Y] = E [E [Y | Z]] = E [Z].$ Applying the argument repeatedly gives $E [X (t)] = E [X (0)],$ or $E [X (t) | X (0) = i] = i .$

For the variance the calculation runs as follows. Writing $V t = V a r (X (t) | X (0) = i),$ we have

$\begin{align} V_1 &= E[X(1)^2|X(0) = i] - E[X(1)|X(0)=i]^2 \\ &= (i-1)^2p(1-p) + i^2(p^2+(1-p)^2) + (i+1)^2p(1-p) - i^2 \\ &= 2p(1-p). \end{align}$

For all t, $(X (t) | X (t - 1) = i)$ and $(X (1) | X (0) = i)$ are identically distributed, so their variances are equal. Writing as before $Y = X (t)$ and $Z = X (t - 1)$ , and applying the law of total variance,

$\begin{align} Var(Y) &= E[Var(Y|Z)] + Var(E[Y|Z]) \\ &= E[(2Z/N)(1-Z/N)] + Var(Z)\\ &= (2E[Z]/N)(1-E[Z]/N) + (1-2N^2)Var(Z). \end{align}$

If $X (0) = i$ , we obtain $V t = V 1 + (1 - 2 / N 2) V t - 1$ . Rewriting this equation as

$V_t - \frac{V_1}{2/N^2} = (1-2/N^2)\left(V_{t-1}-\frac{V_1}{2/N^2}\right) = (1-2/N^2)^{t-1}\left(V_1-\frac{V_1}{2/N^2}\right)$

yields

$V_t = V_1 \frac{1-(1-2/N^2)^t}{2/N^2}$

as desired.

The probability of A to reach fixation is called fixation probability. For the simple Moran process this probability is

$\begin{align} x_i = \frac{i}{N}. \end{align}$

Since all individuals have the same fitness, they also have the same chance of becoming the ancestor of the whole population; this probability is 1 / N and thus the sum of all i probabilities (for all A individuals) is just i / N. The mean time to absorption starting in state i is given by

$\begin{align} k_i = N \left[ \sum\limits_{j=1}^{i} \frac{N-i}{N-j} + \sum\limits_{j=i+1}^{N-1} \frac{i}{j} \right] \end{align}$

For a mathematical derivation of the equation above, click on "show" to reveal

The mean time spent in state j when starting in state i which is given by

$\begin{align} k_i^j = \delta_{ij}+P_{i,i-1}k_{i-1}^j + P_{i,i}k_{i}^j + P_{i,i+1}k_{i+1}^j \end{align}$

Here $δ i j$ denotes the Kroenecker delta. This recursive equation can be solved using a new variable $q i$ so that $P i, i - 1 = P i, i + 1 = q i$ and thus $P i, i = 1 - 2 q i$ and rewritten

$\begin{align} k_{i+1}^j &= 2 k_{i}^j- k_{i-1}^j -\frac{\delta_{ij}}{q_i} \end{align}$

The variable $y_i^{j} = k_{i}^j- k_{i-1}^j$ is used and the equation becomes

$\begin{align} y_{i+1}^{j} &= y_i^{j} -\frac{\delta_{ij}}{q_i} \\ \sum\limits_{i=1}^m y_i^{j} &= (k_{1}^j- k_{0}^j) + (k_{2}^j- k_{1}^j) + \cdots + (k_{m-1}^j- k_{m-2}^j) + (k_{m}^j- k_{m-1}^j) \\ &= k_{m}^j - k_{0}^j \\ \sum\limits_{i=1}^m y_i^{j} &= k_{m}^j \\ \\ y_1^{j} &= (k_{1}^j- k_{0}^j) = k_{1}^j \\ y_2^{j} &= y_1^{j} -\frac{\delta_{1j}}{q_1} = k_1^{j} -\frac{\delta_{1j}}{ q_1 } \\ y_3^{j} &= k_1^{j} -\frac{\delta_{1j}}{q_1} -\frac{\delta_{2j}}{ q_2 } \\ & \; \vdots \\ y_i^{j} &= k_1^{j} -\sum\limits_{r=1}^{i-1} \frac{\delta_{rj}}{ q_r} \quad = \quad \left\{ \begin{array}{lcr} k_1^j & \text{for} & j \geq i\\ k_1^j - \frac{1}{q_j} & \text{for} & j \leq i \end{array} \right. \\ k_i^j &= \quad \quad \; \sum\limits_{m=1}^i y_m^{j} \quad = \quad \left\{ \begin{array}{lcr} i \cdot k_1^j & \text{for} & j \geq i\\ i \cdot k_1^j - \frac{i-j}{q_j} & \text{for} & j \leq i \end{array} \right. \end{align}$

Now $k_1^j$ can be calculated, knowing that $k_N^j = 0$ and $q_j = P_{j,j+1}=\frac{j}{N} \frac{N-j}{N}$

$\begin{align} k_N^j = \sum\limits_{i=1}^m y_i^{j} = N \cdot k_1^j &- \frac{N-j}{q_j} = 0 \\ k_1^j &= \frac{N}{j} \end{align}$

Therefore

$\begin{align} k_i^j \quad = \quad \left\{ \begin{array}{lcr} \frac{i}{j} \cdot k_j^j & \text{for} & j \geq i\\ \frac{N - i}{N-j} \cdot k_j^j & \text{for} & j \leq i \end{array} \right. \end{align}$

with $k_j^j = N$ . Now $k i$ , the total time until fixation starting from state i, can be calculated

$\begin{align} k_i = \sum\limits_{j=1}^{N-1}k_i^j &= \sum\limits_{j=1}^{i}k_i^j + \sum\limits_{j=i+1}^{N-1}k_i^j \\ &= \sum\limits_{j=1}^{i}N \frac{N-i}{N-j} + \sum\limits_{j=i+1}^{N-1}N \frac{i}{j} \end{align}$

For large N the approximation

$\begin{align} \lim \limits_{N\rightarrow \infty} k_i \approx -N^2 \left[ (1-p) \ln(1-p) + p \ln(p) \right] \end{align}$

holds.

Selection

If one allele has a fitness advantage over the other allele, it will be more likely to be chosen for reproduction. This can be incorporated into the model if individuals with allele A have fitness $f i$ and individuals with allele B have fitness $g i$ where i is the number of individuals of type A; thus describing a general birth-death process. The transition matrix of the stochastic process is tri-diagonal in shape and the transition probabilities are

$\begin{align} P_{0,0}&=1\\ P_{i,i-1} &= \frac{g_i (N-i) }{f_i \cdot i + g_i (N-i)} \cdot \frac{i}{N}\\ P_{i,i} &= 1- P_{i,i-1} - P_{i,i+1}\\ P_{i,i+1} &= \frac{f_i \cdot i}{f_i \cdot i + g_i (N-i)} \cdot \frac{N-i}{N}\\ P_{N,N}&=1. \end{align}$

The entry $P i, j$ denotes the probability to go from state i to state j. To understand the formulas for the transition probabilities one has to look again at the definition of the process and see that the fitness enters only the first term in the equations which is concerned with reproduction. Thus the probability that individual A is chosen for reproduction is not i / N any more but dependent on the fitness of A and thus $f_i \cdot i / (f_i \cdot i + g_i (N-i) )$ . Also in this case, fixation probabilities when starting in state i is defined by recurrence

$\begin{align} x_0 &= 0\\ x_i &= \beta_i x_{i-1}+(1-\alpha_i-\beta_i)x_i+\alpha_ix_{i+1}\quad i=1,\dots,N-1\\ x_N &= 1 \end{align}$

And the closed form is given by

$\begin{align} x_i = \frac{ {\displaystyle 1 + \sum\limits_{j=1}^{i-1}\prod\limits_{k=1}^{j}\gamma_k } } { {\displaystyle 1 + \sum\limits_{j=1}^{N-1}\prod\limits_{k=1}^{j}\gamma_k } } \qquad \text{(1)} \end{align}$

where $γ i = P i, i - 1 / P i, i + 1$ per definition and will just be $g i / f i$ for the general case.

For a mathematical derivation of the equation above, click on "show" to reveal

Also in this case, fixation probabilities can be computed, but the transition probabilities are not symmetric. The notation $P i, i + 1 = α i$ , $P i, i - 1 = β i$ , $P i, i = 1 - α i - β i$ and $γ i = β i / α i$ is used. The fixation probability can be defined recursively and a new variable $y i = x i - x i - 1$ is introduced.

$\begin{align} x_i &= \beta_i x_{i-1} + (1-\alpha_i - \beta_i)x_i + \alpha_i x_{i+1} \\ \beta_i (x_i - x_{i-1} ) &= \alpha_i (x_{i+1} - x_i ) \\ \gamma_i \cdot y_i &= y_{i+1} \end{align}$

Now two properties from the definition of the variable $y i$ can be used to find a closed form solution for the fixation probabilities:

$\begin{align} \sum\limits_{i=1}^{m} y_i &= x_m &1\\ y_k &= x_1 \cdot \prod\limits_{l=1}^{k-1}\gamma_l &2\\ \Rightarrow \sum\limits_{m=1}^{i}y_m &= x_1 + x_1 \sum\limits_{j=1}^{i-1}\prod\limits_{k=1}^{j}\gamma_k = x_i &3 \end{align}$

From (3) and the knowlegde $x N = 1$ follows

$\begin{align} x_N = x_1 \left( 1 + \sum\limits_{j=1}^{N-1}\prod\limits_{k=1}^{j}\gamma_k \right) &= 1 \quad \Rightarrow \quad x_1 = \frac{1}{ 1 + \sum\limits_{j=1}^{N-1}\prod\limits_{k=1}^{j}\gamma_k } \\ x_i &= \frac{ {\displaystyle 1 + \sum\limits_{j=1}^{i-1}\prod\limits_{k=1}^{j}\gamma_k } } { {\displaystyle 1 + \sum\limits_{j=1}^{N-1}\prod\limits_{k=1}^{j}\gamma_k } } \end{align}$

This general case where the fitness of A and B depends on the abundance of each type is studied in evolutionary game theory.

Less complex results are obtained if a constant fitness difference r is assumed. Individuals of type A reproduce with a constant rate r and individuals with allele B reproduce with rate 1. Thus if A has a fitness advantage over B, r will be larger than one, otherwise it will be smaller than one. Thus the transition matrix of the stochastic process is tri-diagonal in shape and the transition probabilities are

$\begin{align} P_{0,0}&=1\\ P_{i,i-1} &= \frac{N-i}{r \cdot i + N-i} \cdot \frac{i}{N}\\ P_{i,i} &= 1- P_{i,i-1} - P_{i,i+1}\\ P_{i,i+1} &= \frac{r \cdot i}{r \cdot i + N-i} \cdot \frac{N-i}{N}\\ P_{N,N}&=1. \end{align}$

In this case $γ i = 1 / r$ is a constant factor for each composition of the population and thus the fixation probability from equation (1) simplifies to

$\begin{align} x_i = \frac{1-r^{-i}} { 1-r^{-N} } \quad \Rightarrow \quad x_1 = \rho = \frac{1-r^{-1}} { 1-r^{-N} } \qquad \text{(2)} \end{align}$

where the fixation probability of a single mutant A in a population of otherwise all B is often of interest and is denoted by $ρ$ .

Also in the case of selection, the expected value and the variance of the number of A individuals may be computed

$\begin{align} E [ X(t) | X(t-1) = i ] &= p s \dfrac{1-p}{p s + 1} + i \\ Var( X(t+1) | X(t)=i) &=p(1-p)\dfrac{ (s+1) + (p s + 1)^2 }{(p s +1)^2} \end{align}$

where p = i/N and r = 1 + s.

For a mathematical derivation of the equation above, click on "show" to reveal

For the expected value the calculation runs as follows

$\begin{align} E [ \Delta(1) | X(0) = i ] &= (i-1-i) \cdot P_{i,i-1} + (i-i) \cdot P_{i,i} + (i+1-i) \cdot P_{i,i+1} \\ &= - \frac{N-i}{r i + N -i} \frac{i}{N} + \frac{ri}{r i + N -i} \frac{N-i}{N} \\ &= - \frac{(N-i)i}{(r i + N -i)N} + \frac{i(N-i)}{(r i + N -i)N} + \frac{si(N-i)}{(r i + N -i)N} \\ &= p s \dfrac{1-p}{p s + 1}\\ E [ X(t) | X(t-1) = i ] &= p s \dfrac{1-p}{p s + 1}+i \end{align}$

For the variance the calculation runs as follows, using the variance of a single step

$\begin{align} Var( X(t+1) | X(t)=i) &= Var(X(t)) + Var(\Delta(t+1)| X(t)=i)\\ &= 0 + E[\Delta(t+1)^2| X(t)=i] - E[\Delta(t+1)| X(t)=i]^2\\ &= (i-1-i)^2 \cdot P_{i,i-1} + (i-i)^2 \cdot P_{i,i} + (i+1-i)^2 \cdot P_{i,i+1} - E[\Delta(t+1)| X(t)=i]^2\\ &= P_{i,i-1} + P_{i,i+1} - E[\Delta(t+1)| X(t)=i]^2\\ &= \frac{(N-i)i}{(r i + N -i)N} + \frac{(N-i)i(1+s)}{(r i + N -i)N} - E[\Delta(t+1)| X(t)=i]^2\\ &= i (N-i)\frac{2+s }{(r i + N -i)N} - E[\Delta(t+1)| X(t)=i]^2\\ &= i (N-i)\frac{2+s }{(r i + N -i)N} - (p s \dfrac{1-p}{p s + 1})^2\\ &= p(1-p)\frac{2+s (p s + 1)}{(p s + 1)^2} - p(1-p) \frac{p s^2(1-p)}{(p s + 1)^2}\\ &= p(1-p)\dfrac{2+2 p s + s + p^2 s^2 }{(p s +1)^2} \end{align}$

Rate of evolution

In a population of all B individuals, a single mutant A will take over the whole population with the probability

$\rho = \frac{1-r^{-1}} { 1-r^{-N} }. \qquad \text{(2)}$

If the mutation rate (to go from the B to the A allele) in the population is u then the rate with which one member of the population will mutate to A is given by N x u and the rate with which the whole population goes from all B to all A is the rate that a single mutant A arises times the probability that it will take over the population (fixation probability):

$\begin{align} R = N \cdot u \cdot \rho = u \quad \text{if} \quad \rho = \frac{1}{N}. \end{align}$

Thus if the mutation is neutral (i.e. the fixation probability is just 1/N) then the rate with which an allele arises and takes over a population is independent of the population size and is equal to the mutation rate. This important result is the basis of the neutral theory of evolution and suggests that the number of observed point mutations in the genomes of two different species would simply be given by the mutation rate multiplied by two times the time since divergence. Thus the neutral theory of evolution provides a molecular clock, given that the assumptions are fulfilled which may not be the case in reality.

Literature

Nowak, Martin A: Evolutionary Dynamics: Exploring the Equations of Life. Belknap Press (2006) ISBN 978-0674023383
Moran, Patrick Alfred Pierce: The Statistical Processes of Evolutionary Theory. Oxford, Clarendon Press (1962).

External links

Evolutionary dynamics on graphs

Categories:

Evolutionary dynamics
Stochastic processes
Population genetics

Wikimedia Foundation. 2010.

Игры ⚽ Поможем написать реферат

Look at other dictionaries:

Pat Moran (statistician) — Pat Moran Patrick Alfred Pierce Moran (1917 1988) Born 14 July 1917( … Wikipedia
Godinez v. Moran — SCOTUSCase Litigants=Godinez v. Moran ArgueDate=April 21 ArgueYear=1993 DecideDate=June 24 DecideYear=1993 FullName=Salvador Godinez, Warden, Petitioner v. Richard Allan Moran USVol=509 USPage=389 Citation= Prior= Subsequent= Holding=The… … Wikipedia
James Moran (writer) — James Moran is a British screenwriter for television and film, who first came to public attention with the horror comedy Severance . He was born on March 5 in York, England. [cite web|url=http://www.pfd.co.uk/clients/moranj/f ftw.html|title=James … Wikipedia
Judy Moran — is the matriarch of the infamous Moran family of Melbourne, Victoria, Australia. Judy Moran was first married to Leslie Johnny Cole, who was shot dead in Sydney in 1982. Cole was the natural father of her son, Mark Moran, who was murdered in 2000 … Wikipedia
Rolando Morán — Comandante Rolando Morán (December 29, 1929, Quetzaltenango ndash; September 11, 1998, Guatemala City) is the nom de guerre of Ricardo Arnoldo Ramírez de León, a former leader of Guatemalan National Revolutionary Unity (URNG), an armed Guatemalan … Wikipedia
Tony Moran — Anthony Tony Moran is a remixer/producer and DJ known for remixing popular songs. In 2007, he hit number one on the U.S. Billboard Hot Dance Club Play chart with Walk Away featuring Kristine W. He is also rumored as the original Michael Myers in… … Wikipedia
Design Rationale — In the survey on design rationale (DR) for software engineering [Jarczyk, Loffler Shipman, Design Rationale for Software Engineering: A Survey] the authors give a very clear definition to design rationale, it is “the explicit listing of decisions … Wikipedia
Design rationale — A Decision Based Design Structure, which spans the areas of Engineering Design, Design Rationale and Decision Analysis. A Design Rationale is an explicit documentation of the reasons behind decisions made when designing a system or artifact. As… … Wikipedia
HEBREW LANGUAGE — This entry is arranged according to the following scheme: pre biblical biblical the dead sea scrolls mishnaic medieval modern period A detailed table of contents precedes each section. PRE BIBLICAL nature of the evidence the sources phonology… … Encyclopedia of Judaism
History of Christianity — Church history redirects here. For the journal, see American Society of Church History#Church History. For the magazine, see Christianity Today#Christian History. Church historian redirects here. For LDS official church historian, see Church… … Wikipedia

Academic Dictionaries and Encyclopedias

Moran process

Contents

Neutral drift

Selection

Rate of evolution

Literature

External links

Look at other dictionaries:

Share the article and excerpts

Academic Dictionaries and Encyclopedias

Wikipedia

Moran process

Contents

Neutral drift

Selection

Rate of evolution

Literature

External links

Look at other dictionaries:

Share the article and excerpts

Direct link