Hypothesis of linear regression

Hypothesis of linear regression

In statistics, the linear regression problem can be formalized precisely, although one seldom uses this formalization in most practical cases.

Given the mathematical formalization of the statistical regression problem, let ThetasubseteqGamma be a set of coefficients. The hypothesis of the linear regression is:

exists (eta^0,cdots,eta^p)in heta^{p+1}:mathbb{E}(Y|X_1,cdots,X_p)=eta^0 + sum_{j=1}^p eta^j X_j

and the metric used is:

forall f,gin F, d(f,g) = mathbb{E} [(f-g)^2]

We therefore want to minimize mathbb{E} [(Y-f(X_1,cdots,X_p))^2] , which means that

f(X_1,cdots,X_p)=mathbb{E}(Y|X_1,cdots,X_p) = eta^0 + sum_{j=1}^p eta^j X_j

Hence, we only need to find eta^0,cdots,eta^p.


Wikimedia Foundation. 2010.

Игры ⚽ Нужен реферат?

Look at other dictionaries:

  • Linear regression — Example of simple linear regression, which has one independent variable In statistics, linear regression is an approach to modeling the relationship between a scalar variable y and one or more explanatory variables denoted X. The case of one… …   Wikipedia

  • Least-squares estimation of linear regression coefficients — In parametric statistics, the least squares estimator is often used to estimate the coefficients of a linear regression. The least squares estimator optimizes a certain criterion (namely it minimizes the sum of the square of the residuals). In… …   Wikipedia

  • Regression analysis — In statistics, regression analysis is a collective name for techniques for the modeling and analysis of numerical data consisting of values of a dependent variable (response variable) and of one or more independent variables (explanatory… …   Wikipedia

  • Regression toward the mean — In statistics, regression toward the mean (also known as regression to the mean) is the phenomenon that if a variable is extreme on its first measurement, it will tend to be closer to the average on a second measurement, and a fact that may… …   Wikipedia

  • Regression discontinuity design — In statistics, econometrics, epidemiology and related disciplines, a regression discontinuity design (RDD) is a design that elicits the causal effects of interventions by exploiting a given exogenous threshold determining assignment to treatment …   Wikipedia

  • Statistical hypothesis testing — This article is about frequentist hypothesis testing which is taught in introductory statistics. For Bayesian hypothesis testing, see Bayesian inference. A statistical hypothesis test is a method of making decisions using data, whether from a… …   Wikipedia

  • Null hypothesis — For the periodical, see Null Hypothesis: The Journal of Unlikely Science. The practice of science involves formulating and testing hypotheses, assertions that are capable of being proven false using a test of observed data. The null hypothesis… …   Wikipedia

  • Alternative hypothesis — Main article: Statistical hypothesis testing In statistical hypothesis testing, the alternative hypothesis (or maintained hypothesis or research hypothesis) and the null hypothesis are the two rival hypotheses which are compared by a statistical… …   Wikipedia

  • Robust regression — In robust statistics, robust regression is a form of regression analysis designed to circumvent some limitations of traditional parametric and non parametric methods. Regression analysis seeks to find the effect of one or more independent… …   Wikipedia

  • General linear model — Not to be confused with generalized linear model. The general linear model (GLM) is a statistical linear model. It may be written as[1] where Y is a matrix with series of multivariate measurements, X is a matrix that might be a design matrix, B… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”