Absolute deviation

In statistics, the absolute deviation of an element of a data set is the absolute difference between that element and a given point. Typically the point from which the deviation is measured is a measure of central tendency, most often the median or sometimes the mean of the data set.

D i = | x i - m (X) |

where

D_i is the absolute deviation,

x_i is the data element

and m(X) is the chosen measure of central tendency of the data set—sometimes the mean ( $\overline{x}$ ), but most often the median.

1 Measures of dispersion
2 Minimization
3 Estimation
4 See also
5 External links

Measures of dispersion

Several measures of statistical dispersion are defined in terms of the absolute deviation.

Average absolute deviation

The average absolute deviation, or simply average deviation of a data set is the average of the absolute deviations and is a summary statistic of statistical dispersion or variability. It is also called the mean absolute deviation, but this is easily confused with the median absolute deviation.

The average absolute deviation of a set {x₁, x₂, ..., x_n} is

$\frac{1}{n}\sum_{i=1}^n |x_i-m(X)|.$

The choice of measure of central tendency, $m (X)$ , has a marked effect on the value of the average deviation. For example, for the data set {2, 2, 3, 4, 14}:

Measure of central tendency $m (X)$	Average absolute deviation
Mean = 5	$\frac{\|2 - 5\| + \|2 - 5\| + \|3 - 5\| + \|4 - 5\| + \|14 - 5\|}{5} = 3.6$
Median = 3	$\frac{\|2 - 3\| + \|2 - 3\| + \|3 - 3\| + \|4 - 3\| + \|14 - 3\|}{5} = 2.8$
Mode = 2	$\frac{\|2 - 2\| + \|2 - 2\| + \|3 - 2\| + \|4 - 2\| + \|14 - 2\|}{5} = 3.0$

The average absolute deviation from the median is less than or equal to the average absolute deviation from the mean. In fact, the average absolute deviation from the median is always less than or equal to the average absolute deviation from any other fixed number.

The average absolute deviation from the mean is less than or equal to the standard deviation; one way of proving this relies on Jensen's inequality.

For the normal or "Gaussian" distribution, the ratio of mean absolute deviation to standard deviation is $\scriptstyle \sqrt{2/\pi} = 0.79788456\dots$ . Thus if X is a normally distributed random variable with expected value 0 then

$\frac{ E|X| }{ \sqrt{E(X^2)} } = \sqrt{\frac{2}{\pi}}.$

In other words, for a Gaussian, mean absolute deviation is about 0.8 times the standard deviation.

Mean absolute deviation

The mean absolute deviation (MAD), also referred to as the mean deviation, is the mean of the absolute deviations of a set of data about the data’s mean. In other words, it is the average distance of the data set from its mean during certain number of time periods.

The equation for MAD is as follows:

MAD = 1/n ∑(|e_i|) , where e_i = F_i - D_i

This method forecast accuracy is very closely related to the mean squared error (MSE) method which is just the average squared error of the forecasts. Although these methods are very closely related MAD is more commonly used because it does not require squaring.

The equation for MSE is as follows:

MSE = 1/n Σ(e_i²) , where e_i = F_i - D_i

Median absolute deviation (MAD)

Main article: Median absolute deviation

The median absolute deviation is the median of the absolute deviation from the median. It is a robust estimator of dispersion.

For the example {2, 2, 3, 4, 14}: 3 is the median, so the absolute deviations from the median are {1, 1, 0, 1, 11} (reordered as {0, 1, 1, 1, 11}) with a median of 1, in this case unaffected by the value of the outlier 14, so the median absolute deviation (also called MAD) is 1.

Maximum absolute deviation

The maximum absolute deviation about a point is the maximum of the absolute deviations of a sample from that point. It is realized by the sample maximum or sample minimum and cannot be less than half the range.

Minimization

The measures of statistical dispersion derived from absolute deviation characterize various measures of central tendency as minimizing dispersion: The median is the measure of central tendency most associated with the absolute deviation, in that

L² norm statistics: just as the mean minimizes the standard deviation,
L¹ norm statistics: the median minimizes average absolute deviation,
L^∞ norm statistics: the mid-range minimizes the maximum absolute deviation, and
trimmed L^∞ norm statistics: for example, the midhinge (average of first and third quartiles) which minimizes the median absolute deviation of the whole distribution, also minimizes the maximum absolute deviation of the distribution after the top and bottom 25% have been trimmed off.

Estimation

The mean absolute deviation of a sample is a biased estimator of the mean absolute deviation of the population. In order for the absolute deviation to be an unbiased estimator, the expected value (average) of all the sample absolute deviations must equal the population absolute deviation. However, it does not. For the population 1,2,3 the population absolute deviation is 2/3. The average of all the sample standard deviations of size 3 that can be drawn from the population is 40/81. Therefore the absolute deviation is a biased estimator.

External links

Advantages of the mean absolute deviation

Statistics

Descriptive statistics

Continuous data

Location	Mean (Arithmetic, Geometric, Harmonic) · Median · Mode

Dispersion	Range · Standard deviation · Coefficient of variation · Percentile · Interquartile range

Shape	Variance · Skewness · Kurtosis · Moments · L-moments

Count data

Index of dispersion

Summary tables

Grouped data · Frequency distribution · Contingency table

Dependence

Pearson product-moment correlation · Rank correlation (Spearman's rho, Kendall's tau) · Partial correlation · Scatter plot

Statistical graphics

Bar chart · Biplot · Box plot · Control chart · Correlogram · Forest plot · Histogram · Q-Q plot · Run chart · Scatter plot · Stemplot · Radar chart

Data collection

Designing studies	Effect size · Standard error · Statistical power · Sample size determination

Survey methodology	Sampling · Stratified sampling · Opinion poll · Questionnaire

Controlled experiment	Design of experiments · Factorial experiment · Randomized experiment · Random assignment · Replication · Blocking · Optimal design

Uncontrolled studies	Natural experiment · Quasi-experiment · Observational study

Statistical inference

Statistical theory	Sampling distribution · Sufficient statistic · Meta-analysis

Bayesian inference	Bayesian probability · Prior · Posterior · Credible interval · Bayes factor · Bayesian estimator · Maximum posterior estimator

Frequentist inference	Confidence interval · Hypothesis testing · Likelihood-ratio

Specific tests	Z-test (normal) · Student's t-test · F-test · Pearson's chi-squared test · Wald test · Mann–Whitney U · Shapiro–Wilk · Signed-rank · Kolmogorov–Smirnov test

General estimation	Mean-unbiased · Median-unbiased · Maximum likelihood · Method of moments · Minimum distance · Density estimation

Correlation and regression analysis

Correlation	Pearson product-moment correlation · Partial correlation · Confounding variable · Coefficient of determination

Regression analysis	Errors and residuals · Regression model validation · Mixed effects models · Simultaneous equations models

Linear regression	Simple linear regression · Ordinary least squares · General linear model · Bayesian regression

Non-standard predictors	Nonlinear regression · Nonparametric · Semiparametric · Isotonic · Robust

Generalized linear model	Exponential families · Logistic (Bernoulli) · Binomial · Poisson

Partition of variance	Analysis of variance (ANOVA) · Analysis of covariance · Multivariate ANOVA · Degrees of freedom

Categorical, multivariate, time-series, or survival analysis

Categorical data	Cohen's kappa · Contingency table · Graphical model · Log-linear model · McNemar's test

Multivariate statistics	Multivariate regression · Principal components · Factor analysis · Cluster analysis · Copulas

Time series analysis	Decomposition (Trend · Stationary process) · ARMA model · ARIMA model · Vector autoregression · Spectral density estimation

Survival analysis	Survival function · Kaplan–Meier · Logrank test · Failure rate · Proportional hazards models · Accelerated failure time model

Applications

Biostatistics	Bioinformatics · Biometrics · Clinical trials & studies · Epidemiology · Medical statistics · Pharmaceutical statistics

Engineering statistics	Methods engineering · Probabilistic design · Process & Quality control · Reliability · System identification

Social statistics	Actuarial science · Census · Crime statistics · Demography · Econometrics · National accounts · Official statistics · Population · Psychometrics

Spatial statistics	Cartography · Environmental statistics · Geographic information system · Geostatistics · Kriging

Category · Portal · Outline · Index

Categories:

Statistical deviation and dispersion

Wikimedia Foundation. 2010.

Игры ⚽ Поможем написать курсовую

Look at other dictionaries:

absolute deviation — absoliutusis nuokrypis statusas T sritis fizika atitikmenys: angl. absolute deviation vok. absolute Abweichung, f rus. абсолютное отклонение, n pranc. écart absolu, m … Fizikos terminų žodynas
absolute deviation — noun a) The difference (without regard to sign) between a given value and a variate value. b) The shortest distance between the center of the target and the point where the projectile hit or burst … Wiktionary
Median absolute deviation — In statistics, the median absolute deviation (MAD) is a robust measure of the variability of a univariate sample of quantitative data. It can also refer to the population parameter that is estimated by the MAD calculated from a sample. For a… … Wikipedia
Deviation — may refer to: Deviation (statistics), the difference between the value of an observation and the mean of the population in mathematics and statistics Standard deviation, which is based on the square of the difference Absolute deviation, where the … Wikipedia
mean absolute deviation — MAD A measure of forecast error, for example when carrying out adaptive exponential smoothing of time series data. MAD is the average forecast error, either positive or negative, calculated as the sum of the absolute value of forecast error for… … Big dictionary of business and management
Deviation (statistics) — In mathematics and statistics, deviation is a measure of difference for interval and ratio variables between the observed value and the mean. The sign of deviation (positive or negative), reports the direction of that difference (it is larger… … Wikipedia
absolute Abweichung — absoliutusis nuokrypis statusas T sritis fizika atitikmenys: angl. absolute deviation vok. absolute Abweichung, f rus. абсолютное отклонение, n pranc. écart absolu, m … Fizikos terminų žodynas
Deviation analysis — may mean; in statistics; measurement of the absolute difference between any one number in a set and the mean of the set. in social psychology; monitoring of the behavior of people or objects within systems to measure compliance with expected or… … Wikipedia
absolute frequency deviation — absoliutusis dažnio nuokrypis statusas T sritis Standartizacija ir metrologija apibrėžtis Didžiausias skirtumas tarp moduliuotojo dažnio bangos akimirkinio dažnio ir nešlio bangos vidutinio dažnio. atitikmenys: angl. absolute frequency deviation… … Penkiakalbis aiškinamasis metrologijos terminų žodynas
Absolute curvature — Curvature Cur va*ture (k?r v? t?r; 135), n. [L. curvatura. See {Curvate}.] 1. The act of curving, or the state of being bent or curved; a curving or bending, normal or abnormal, as of a line or surface from a rectilinear direction; a bend; a… … The Collaborative International Dictionary of English

Academic Dictionaries and Encyclopedias

Absolute deviation

Contents