Pooled variance

In statistics, data are often collected for a dependent variable, y, over a range of values of the independent variable, x. For example, fuel consumption might be studied as a function of engine speed while the engine load is held constant. If achieving a small variance in y requires numerous repeated tests at each value of x, the expense of testing may become prohibitive. Using the principle of pooled variance, reasonable estimates of variance can be obtained after repeating each test at a particular x only a few times.

Consider the following set of data for y obtained at various levels of the independent variable, x.

The number of trials, mean, variance and standard deviation are presented in the next table.

These statistics represent the variance and standard deviation for each subset of data at the various levels of x. If we can assume that the same phenomena generate the random error at every level of x, the above data can be “pooled” to express a single estimate of variance and standard deviation. In a sense, this amounts to finding a mean variance or standard deviation among the five results above. This mean variance is a weighted average in which each subset's variance is weighted by its degrees of freedom, that is, the size of the subset minus one. Thus, the pooled variance is defined by:

S_{P}^{2}=\frac{(n_{1}-1)S_{1}^{2}+(n_{2}-1)S_{2}^{2}+\cdots+(n_{k}-1)S_{k}^{2}}{(n_{1}-1)+(n_{2}-1)+\cdots+(n_{k}-1)}

where n_{1}, n_{2}, ..., n_{k} are the sizes of the data subsets at each level of the variable x, and S_{1}^{2}, S_{2}^{2}, ..., S_{k}^{2} are their respective variances.
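The definition above can be sketched in Python as follows. This is only an illustrative sketch: the function name `pooled_variance` and the sample values in `samples` are hypothetical and are not the data discussed in this article. It assumes each subset has at least two observations, so that its sample variance is defined.

```python
from statistics import variance  # sample variance (divides by n - 1)


def pooled_variance(groups):
    """Pool the sample variances of several groups.

    Each group's variance is weighted by its degrees of freedom (n_i - 1),
    matching the definition of S_P^2 given above.
    """
    weighted_sum = 0.0
    total_dof = 0
    for group in groups:
        n = len(group)
        if n < 2:
            raise ValueError("each group needs at least two observations")
        weighted_sum += (n - 1) * variance(group)
        total_dof += n - 1
    return weighted_sum / total_dof


# Hypothetical measurements of y at three levels of x (not the article's data).
samples = [
    [31.0, 30.0, 29.0],
    [42.0, 41.0, 40.0, 39.0],
    [31.0, 28.0],
]

print(pooled_variance(samples))
```

When all subsets have the same size, the weights are equal and the result reduces to the plain arithmetic mean of the subset variances.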

The pooled variance of the data shown above is therefore:

S_{P}^{2}=2.765
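Since the text speaks of a single estimate of both variance and standard deviation, the corresponding pooled standard deviation can be obtained as the square root of the pooled variance. The worked step below uses only the value stated above; the rounding to three decimal places is a choice made here for illustration.

```latex
S_{P}=\sqrt{S_{P}^{2}}=\sqrt{2.765}\approx 1.663
```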

See also

*Pooled standard deviation

Wikimedia Foundation. 2010.
