- Prediction interval
In
statistics , a prediction interval bears the same relationship to a future observation that aconfidence interval bears to an unobservable population parameter. Prediction intervals predict the distribution of individual points, whereas confidence intervals estimate the true population mean or other quantity of interest that cannot be observed.In other words, an interval estimate of a
parameter , such as a populationmean is usually called aconfidence interval . An interval estimate of a variable is sometimes called aprediction interval .A common example given in statistics classes is the prediction interval for a response variable when finding the
least squares regression line. If the entire population is given in the data, this is not needed. However, if the data is asample , then the true regression line may not be known. The predicted value of the response variable "y", found using the equation of the regression line from the sample data, will have a margin of error. The predicted "y" value is astatistic , not aparameter . For this "y" value, a prediction interval can be found. We use thestandard deviation (standard error) of thedistribution of theslope to do this. The "y" value is a point estimate and we are looking for a prediction interval for that estimate.Example
Suppose one has drawn a sample from a normally distributed population. The
mean andstandard deviation of the population are unknown except insofar as they can be estimated based on the sample. It is desired to predict the next observation. Let "n" be the sample size; let μ and σ be respectively the true (unobservable) mean and standard deviation of the population. Let "X"1, ..., "X""n", be the sample; let "X""n"+1 be the future observation to be predicted. Let the sample mean be:
and the sample variance be
:
Then it is fairly routine to show that
:
has a
Student's t-distribution with "n" − 1 degrees of freedom. Consequently we have:
where "Ta" is the 100((1 + "p")/2)th
percentile ofStudent's t-distribution with "n" − 1 degrees of freedom. Therefore the numbers:
are the endpoints of a 100"p"% prediction interval for "X""n" + 1.
ee also
*
Confidence interval
*Extrapolation
*Prediction
*Regression analysis
*Seymour Geisser
*Trend estimation References
*Chatfield, C. (1993) "Calculating Interval Forecasts," "Journal of Business and Economic Statistics," 11 121–135.
*Meade, N. and T. Islam (1995) "Prediction Intervals for Growth Curve Forecasts," "Journal of Forecasting," 14 413–430.
*Lawless, J.F. and Fredette, M. (2005) "Frequentist prediction intervals and predictive distributions". "Biometrika", 92 (3) 529–542.
Wikimedia Foundation. 2010.