Conway–Maxwell–Poisson distribution

Conway–Maxwell–Poisson
parameters:	$\lambda > 0, \nu \geq 0$
support:	$x \in \{0,1,2,\dots\}$
pmf:	$\frac{\lambda^x}{(x!)^\nu}\frac{1}{Z(\lambda,\nu)}$
cdf:	$\sum_{i=0}^x \mathbb{P}(X = i)$
mean:	$\sum_{j=0}^\infty \frac{j\lambda^j}{(j!)^\nu Z(\lambda, \nu)}$
median:	No closed form
mode:	Not listed
variance:	$\sum_{j=0}^\infty \frac{j^2\lambda^j}{(j!)^\nu Z(\lambda, \nu)} - \mu^2$
skewness:	Not listed
ex.kurtosis:	Not listed
entropy:	Not listed
mgf:	Not listed
cf:	Not listed

In probability theory and statistics, the Conway–Maxwell–Poisson (CMP or COM-Poisson) distribution is a discrete probability distribution named after Richard W. Conway, William L. Maxwell, and Siméon Denis Poisson that generalizes the Poisson distribution by adding a parameter to model overdispersion and underdispersion. It is a member of the exponential family, has the Poisson distribution and geometric distribution as special cases and the Bernoulli distribution as a limiting case.

1 Conway–Maxwell–Poisson distribution
2 Parameter estimation
- 2.1 Quick and crude method: weighted least squares
- 2.2 Accurate and intensive method: maximum likelihood
3 Generalized linear model
4 References
5 External links

Conway–Maxwell–Poisson distribution

The COM-Poisson distribution was originally proposed by Conway and Maxwell in 1962 ^[1] as a solution to handling queueing systems with state-dependent service rates. The probabilistic and statistical properties of the distribution were published by Shmueli et al. (2005).^[2]

The COM-Poisson is defined to be the distribution with probability mass function

$\Pr(X = x) = f(x; \lambda, \nu) = \frac{\lambda^x}{(x!)^\nu}\frac{1}{Z(\lambda,\nu)},$

for x = 0,1,2,... , $λ > 0$ and $ν$ ≥ 0, where

$Z(\lambda,\nu) = \sum_{j=0}^\infty \frac{\lambda^j}{(j!)^\nu}.$

The function $Z (λ,ν)$ serves as a normalization constant so the probability mass function sums to one. Note that $Z (λ,ν)$ does not have a closed form.

The additional parameter $ν$ which does not appear in the Poisson distribution allows for adjustment of the rate of decay. This rate of decay is a non-linear decrease in ratios of successive probabilities, specifically

$\frac{\Pr(X = x-1)}{\Pr(X = x)} = \frac{x^\nu}{\lambda}.$

When $ν = 1$ , the COM-Poisson distribution becomes the standard Poisson distribution and as $\nu \to \infty$ , the distribution approaches a Bernoulli distribution with parameter $λ / (1 + λ)$ . When $ν = 0$ the CoM-Poisson distribution reduces to a geometric distribution with probability of success $1 - λ$ provided $λ < 1$ .

For the COM-Poisson distribution, moments can be found through the recursive formula

$\operatorname{E}[X^{r+1}] = \begin{cases} \lambda \, \operatorname{E}[X+1]^{1-\nu} & \text{ if } r = 0 \\ \lambda \, \frac{d}{d\lambda}\operatorname{E}[X^r] + \operatorname{E}[X]\operatorname{E}[X^r] & \text{ if } r > 0. \\ \end{cases}$

Parameter estimation

There are a few methods of estimating the parameters of the CMP distribution from the data. Two methods will be discussed, the "quick and crude method" and the "accurate and intensive method".

Quick and crude method: weighted least squares

The "quick and crude method" provides a simple, efficient method to derive rough estimates of the parameters of the CMP distribution and determine if the distribution would be an appropriate model. Following the use of this method, an alternative method should be employed to compute more accurate estimates of the parameters if the model is deemed appropriate.

This method uses the relationship of successive probabilities as discussed above. By taking logarithms of both sides of this equation, the following linear relationship arises

$\log \frac{p_{x-1}}{p_x} = - \log \lambda + \nu \log x$

where $p x$ denotes $\mathbb{P}(X = x)$ . When estimating the parameters, the probabilities can be replaced by the relative frequencies of $x$ and $x - 1$ . To determine if the CMP distribution is an appropriate model, these values should be plotted against $log x$ for all ratios without zero counts. If the data appear to be linear, then the model is likely to be a good fit.

Once the appropriateness of the model is determined, the parameters can be estimated by fitting a regression of $\log (\hat p_{x-1} / \hat p_x)$ on $log x$ . However, the basic assumption of homoscedasticity is violated, so a weighted least squares regression must be used. The inverse weight matrix will have the variances of each ratio on the diagonal with the one-step covariances on the first off-diagonal, both given below.

$\mathbb{V}\left[\log \frac{\hat p_{x-1}}{\hat p_x}\right] \approx \frac{1}{np_x} + \frac{1}{np_{x-1}}$

$\text{cov}\left(\log \frac{\hat p_{x-1}}{\hat p_x}, \log \frac{\hat p_x}{\hat p_{x+1}} \right) \approx - \frac{1}{np_x}$

Accurate and intensive method: maximum likelihood

The COM-Poisson likelihood function is

$\mathcal{L}(\lambda,\nu|x_1,\dots,x_n) = \lambda^{S_1} \exp(-\nu S_2) Z^{-n}(\lambda, \nu)$

where $S_1 = \sum_{i=1}^n x_i$ and $S_2 = \sum_{i=1}^n \log x_i!$ . Maximizing the likelihood yields the following two equations

$\mathbb{E}[X] = \bar X$

$\mathbb{E}[\log X!] = \overline{\log X!}$

which do not have an analytic solution.

Instead, the maximum likelihood estimates are approximated numerically by the Newton–Raphson method. In each iteration, the expectations, variances, and covariance of $X$ and $log X!$ are approximated by using the estimates for $λ$ and $ν$ from the previous iteration in the expression

$\mathbb{E}[f(x)] = \sum_{j=0}^\infty f(j) \frac{\lambda^j}{(j!)^\nu Z(\lambda, \nu)}.$

This is continued until convergence of $\hat\lambda$ and $\hat\nu$ .

Generalized linear model

The basic COM-Poisson distribution discussed above has also been used as the basis for a generalized linear model (GLM) using a Bayesian formulation. A dual-link GLM based on the CMP distribution has been developed,^[3] and this model has been used to evaluate traffic accident data.^[4]^[5] The CMP GLM developed by Guikema and Coffelt (2008) is based on a reformulation of the CMP distribution above, replacing $λ$ with $μ = λ 1 / ν$ . The integral part of $μ$ is then the mode of the distribution. A full Bayesian estimation approach has been used with MCMC sampling implemented in WinBugs with non-informative priors for the regression parameters.^[3]^[4] This approach is computationally expensive, but it yields the full posterior distributions for the regression parameters and allows expert knowledge to be incorporated through the use of informative priors.

A classical GLM formulation for a COM-Poisson regression has been developed which generalizes Poisson regression and logistic regression.^[6] This takes advantage of the exponential family properties of the COM-Poisson distribution to obtain elegant model estimation (via maximum likelihood), inference, diagnostics, and interpretation. This approach requires substantially less computational time than the Bayesian approach, at the cost of not allowing expert knowledge to be incorporated into the model.^[6] In addition it yields standard errors for the regression parameters (via the Fisher Information matrix) compared to the full posterior distributions obtainable via the Bayesian formulation. It also provides a statistical test for the level of dispersion compared to a Poisson model. Code for fitting a COM-Poisson regression, testing for dispersion, and evaluating fit is available.^[7]

The two GLM frameworks developed for the COM-Poisson distribution significantly extend the usefulness of this distribution for data analysis problems.

References

^ Conway, R. W.; Maxwell, W. L. (1962), "A queuing model with state dependent service rates", Journal of Industrial Engineering 12: 132–136
^ Shmueli G., Minka T., Kadane J.B., Borle S., and Boatwright, P.B. "A useful distribution for fitting discrete data: revival of the Conway–Maxwell–Poisson distribution." Journal of the Royal Statistical Society: Series C (Applied Statistics) 54.1 (2005): 127–142.[1]
^ ^a ^b Guikema, S.D. and J.P. Coffelt (2008) "A Flexible Count Data Regression Model for Risk Analysis", Risk Analysis, 28 (1), 213–223. doi:10.1111/j.1539-6924.2008.01014.x
^ ^a ^b Lord, D., S.D. Guikema, and S.R. Geedipally (2008) "Application of the Conway–Maxwell–Poisson Generalized Linear Model for Analyzing Motor Vehicle Crashes," Accident Analysis & Prevention, 40 (3), 1123–1134. doi:10.1016/j.aap.2007.12.003
^ Lord, D., S.R. Geedipally, and S.D. Guikema (2010) "Extension of the Application of Conway-Maxwell-Poisson Models: Analyzing Traffic Crash Data Exhibiting Under-Dispersion," Risk Analysis, 30 (8), 1268-1276. doi:10.1111/j.1539-6924.2010.01417.x
^ ^a ^b Sellers, K. S. and Shmueli, G. (2010), "A Flexible Regression Model for Count Data", Annals of Applied Statistics, 4 (2), 943-961
^ Code for COM_Poisson modelling, Georgetown Univ.

External links

Probability distributions

Discrete univariate with finite support

Benford · Bernoulli · Beta-binomial · binomial · categorical · hypergeometric · Poisson binomial · Rademacher · discrete uniform · Zipf · Zipf-Mandelbrot

Discrete univariate with infinite support

beta negative binomial · Boltzmann · Conway–Maxwell–Poisson · discrete phase-type · extended negative binomial · Gauss–Kuzmin · geometric · logarithmic · negative binomial · parabolic fractal · Poisson · Skellam · Yule–Simon · zeta

Continuous univariate supported on a bounded interval, e.g. [0,1]

Arcsine · ARGUS · Balding-Nichols · Bates · Beta · Noncentral beta · Irwin–Hall · Kumaraswamy · logit-normal · raised cosine · triangular · U-quadratic · uniform · Wigner semicircle

Continuous univariate supported on a semi-infinite interval, usually [0,∞)

Benini · Benktander 1st kind · Benktander 2nd kind · Beta prime · Bose–Einstein · Burr · chi-squared · chi · Coxian · Dagum · Davis · Erlang · exponential · F · Fermi–Dirac · folded normal · Fréchet · Gamma · generalized inverse Gaussian · half-logistic · half-normal · Hotelling's T-squared · hyper-exponential · hypoexponential · inverse chi-squared (scaled-inverse-chi-squared) · inverse Gaussian · inverse gamma · Kolmogorov · Lévy · log-Cauchy · log-Laplace · log-logistic · log-normal · Maxwell–Boltzmann · Maxwell speed · Mittag–Leffler · Nakagami · noncentral chi-squared · Pareto · phase-type · Rayleigh · relativistic Breit–Wigner · Rice · Rosin–Rammler · shifted Gompertz · truncated normal · type-2 Gumbel · Weibull · Wilks' lambda

Continuous univariate supported on the whole real line (−∞, ∞)

Cauchy · exponential power · Fisher's z · generalized normal · generalized hyperbolic · geometric stable · Gumbel · Holtsmark · hyperbolic secant · Landau · Laplace · Linnik · logistic · noncentral t · normal (Gaussian) · normal-inverse Gaussian · skew normal · slash · stable · Student's t · type-1 Gumbel · variance-gamma · Voigt

Continuous univariate with support whose type varies

generalized extreme value · generalized Pareto · Tukey lambda · q-Gaussian · q-exponential · shifted log-logistic

Mixed continuous-discrete univariate distributions

rectified Gaussian

Multivariate (joint)

Discrete: Ewens · multinomial · multivariate Pólya · negative multinomial Continuous: Dirichlet · Generalized Dirichlet · multivariate normal · Multivariate stable · multivariate Student · normal-scaled inverse gamma · normal-gamma Matrix-valued: inverse-Wishart · matrix normal · Wishart

Directional

Univariate (circular) directional: Circular uniform · univariate von Mises · wrapped normal · wrapped Cauchy · wrapped exponential · wrapped Lévy Bivariate (spherical): Kent Bivariate (toroidal): bivariate von Mises Multivariate: von Mises–Fisher · Bingham

Degenerate and singular

Degenerate: discrete degenerate · Dirac delta function Singular: Cantor

Families

Circular · compound Poisson · elliptical · exponential · natural exponential · location-scale · maximum entropy · mixture · Pearson · Tweedie · wrapped

Categories:

Discrete distributions
Poisson processes

Wikimedia Foundation. 2010.

Игры ⚽ Поможем решить контрольную работу

Look at other dictionaries:

Conway-Maxwell-Poisson distribution — Probability distribution name =Conway Maxwell Poisson pdf cdf type =density parameters =lambda > 0, u geq 0 support =x in {0,1,2,dots} pdf =frac{lambda^x}{(x!)^ u}frac{1}{Z(lambda, u)} cdf =sum {i=0}^x mathbb{P}(X = i) mean =sum {j=0}^infty… … Wikipedia
Maxwell–Boltzmann distribution — Maxwell–Boltzmann Probability density function Cumulative distribution function parameters … Wikipedia
Maxwell speed distribution — Classically, an ideal gas molecules bounce around with somewhat arbitrary velocities, never interacting with each other. In reality, however, an ideal gas is subjected to intermolecular forces. It is to be noted that the aforementioned classical… … Wikipedia
Compound Poisson distribution — In probability theory, a compound Poisson distribution is the probability distribution of the sum of a Poisson distributed number of independent identically distributed random variables. In the simplest cases, the result can be either a… … Wikipedia
Maxwell'sche Geschwindigkeitsverteilung — Maxwell Boltzmann Verteilung Parameter Definitionsbereich Wahrscheinlichkeitsdichte … Deutsch Wikipedia
Maxwell-Boltzmann-Geschwindigkeitsverteilung — Maxwell Boltzmann Verteilung Parameter Definitionsbereich Wahrscheinlichkeitsdichte … Deutsch Wikipedia
Maxwell-Verteilung — Maxwell Boltzmann Verteilung Parameter Definitionsbereich Wahrscheinlichkeitsdichte … Deutsch Wikipedia
Maxwell-Boltzmann-Verteilung — Parameter Definitionsbereich Wahrscheinlichkeitsdichte … Deutsch Wikipedia
Siméon Denis Poisson — Infobox Scientist box width = 300px name = Siméon Poisson image size = 200px caption = Siméon Denis Poisson (1781 1840) birth date = 21 June 1781 birth place = Pithiviers, France death date = 25 April 1840 death place = Sceaux, France residence … Wikipedia
Normal distribution — This article is about the univariate normal distribution. For normally distributed vectors, see Multivariate normal distribution. Probability density function The red line is the standard normal distribution Cumulative distribution function … Wikipedia

Academic Dictionaries and Encyclopedias

Conway–Maxwell–Poisson distribution

Contents

Conway–Maxwell–Poisson distribution

Parameter estimation

Quick and crude method: weighted least squares

Accurate and intensive method: maximum likelihood

Generalized linear model

References

External links

Look at other dictionaries:

Share the article and excerpts

Academic Dictionaries and Encyclopedias

Wikipedia

Conway–Maxwell–Poisson distribution

Contents

Conway–Maxwell–Poisson distribution

Parameter estimation

Quick and crude method: weighted least squares

Accurate and intensive method: maximum likelihood

Generalized linear model

References

External links

Look at other dictionaries:

Share the article and excerpts

Direct link