Variance inflation factor

Variance inflation factor

In statistics, the variance inflation factor (VIF) is a method of detecting the severity of multicollinearity. More precisely, the VIF is an index which measures how much the variance of a coefficient (square of the standard deviation) is increased because of collinearity. Considering the following regression equation with k independent variables

: Y = β0 + β1 "X"1 + β2 "X" 2 + ... + β"k" "X""k" + ε

VIF can be calculated in three steps:

Step one

One can calculate "k" different VIFs, one for each "X""i" by first running an ordinary least square regression that has "X""i" as a function of all the other explanatory variables in the first equation.
If "i" = 1, for example, the equation would be"X"1 = α2"X"2 + α3 "X"3 + ... + α"k" "X""k" +"c"0 + "e"

where "c"0 is a constant and "e" is the error term.

Step two

Then one can calculate the VIF factor for hateta_i with the following formula: mathrm{VIF}(hat{eta_i})= frac{1}{1-R^2_i}where R²iis the coefficient of determination of the regression equation in step one.

Step three

Analyse the magnitude of multicollinearity by considering the size of the VIF(hat eta_i). A common rule of thumb is that if VIF(hat eta_i) > 5 then multicollinearity is high. Also 10 has been proposed (see KNN book referenced below) as a cut off value.

Some software calculates the tolerance which is just the reciprocal of the VIF. The choice of which formula to use is mostly a personal preference of the researcher.

Interpretation

The square root of the variance inflation factor tells you how much larger the standard error is, compared with what it would be if that variable were uncorrelated with the other independent variables in the equation.
Example
If the variance inflation factor of an independent variable were 5.27 (sqrt{5.27} = 2.3) this means that the standard error for the coefficient of that independent variable is 2.3 times as large as it would be if that independent variable were uncorrelated with the other independent variables.

References

Longnecker, M.T & Ott, R.L :"A First Course in Statistical Methods", page 615. Thomson Brooks/Cole, 2004.
Studenmund, A.H: "Using Econometrics: Apractical guide",5th Edition, page 258-259. Pearson International Edition, 2006.
Hair JF, Anderson R, Tatham RL, Black WC: "Multivariate Data Analysis". Prentice Hall: Upper Saddle River, N.J. 2006.
Marquardt, D.W. 1970 "Generalized Inverses, Ridge Regression, Biased Linear Estimation, and Nonlinear Eestimation" Technometrics 12(3), 591, 605-07
Allison, P.D. "Multiple Regression: a primer", page 142. Pine Forge Press: Thousand Oaks, C.A. 1999.
Kutner, Nachtsheim, Neter, "Applied Linear Regression Models", 4th edition, McGraw-Hill Irwin, 2004.


Wikimedia Foundation. 2010.

Игры ⚽ Нужно сделать НИР?

Look at other dictionaries:

  • Variance Inflation Factor — A measure of the amount of multicollinearity in a set of multiple regression variables. The presence of multicollinearity within the set of independent variables can cause a number of problems in the understanding the significance of individual… …   Investment dictionary

  • Inflation (cosmology) — Inflation model and Inflation theory redirect here. For a general rise in the price level, see Inflation. For other uses, see Inflation (disambiguation). Physical cosmology …   Wikipedia

  • Cosmic inflation — In physical cosmology, cosmic inflation is the idea that the universe passed through a phase of exponential expansion that was driven by a negative pressure vacuum energy density. [Liddle and Lyth (2000) and Mukhanov (2005) are recent cosmology… …   Wikipedia

  • VIF — Variance Inflation Factor (Governmental » US Government) Variance Inflation Factor (Academic & Science » Mathematics) ** Visiting International Faculty (Business » Positions) * Video InterFace (Academic & Science » Electronics) * Khoros… …   Abbreviations dictionary

  • VIF — variance inflation factor; virus induced interferon …   Medical dictionary

  • VIF — • variance inflation factor; • virus induced interferon …   Dictionary of medical acronyms & abbreviations

  • List of statistics topics — Please add any Wikipedia articles related to statistics that are not already on this list.The Related changes link in the margin of this page (below search) leads to a list of the most recent changes to the articles listed below. To see the most… …   Wikipedia

  • Multicollinearity — is a statistical phenomenon in which two or more predictor variables in a multiple regression model are highly correlated. In this situation the coefficient estimates may change erratically in response to small changes in the model or the data.… …   Wikipedia

  • List of mathematics articles (V) — NOTOC Vac Vacuous truth Vague topology Valence of average numbers Valentin Vornicu Validity (statistics) Valuation (algebra) Valuation (logic) Valuation (mathematics) Valuation (measure theory) Valuation of options Valuation ring Valuative… …   Wikipedia

  • Design effect — In statistics, the design effect is an adjustment used in some kinds of studies, such as cluster randomised trials, to allow for the design structure. The adjustment inflates the variance of parameter estimates, and therefore their standard… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”