Semiparametric model

Semiparametric model

In statistics a semiparametric model is a model that has parametric and nonparametric components.

A model is a collection of distributions: {P_ heta: heta in Theta} indexed by a parameter heta.

* A parametric model is one in which the indexing parameter is a finite-dimensional vector (in k-dimensional Euclidean space for some integer k); i.e. the set of possible values for heta is a subset of mathbb{R}^k, or Theta subset mathbb{R}^k. In this case we say that heta is finite-dimensional.
* In nonparametric models, the set of possible values of the parameter heta is a subset of some space, not necessarily finite dimensional. For example, we might consider the set of all distributions with mean 0. Such spaces are vector spaces with topological structure, but may not be finite dimensional as vector spaces. Thus, Theta subset mathbb{F} for some possibly infinite dimensional space mathbb{F}.
* In semiparametric models, the parameter has both a finite dimensional component and an infinite dimensional component (often a real-valued function defined on the real line). Thus the parameter space Theta in a semiparametric model satisfies Theta subset mathbb{R}^k imes mathbb{F}, where mathbb{F} is an infinite dimensional space.

It may appear at first that semiparametric models include nonparametric models, since they have an infinite dimensional as well as a finite dimensional component. However, a semiparametric model is considered to be "smaller" than a completely nonparametric model because we are often interested only in the finite-dimensional component of heta. That is, we are not interested in estimating the infinite-dimensional component. In nonparametric models, by contrast, the primary interest is in estimating the infinite dimensional parameter. Thus the estimation task is statistically harder in nonparametric models.

These models often use smoothing or kernels.

Example

A well-known example of a semiparametric model is the Cox proportional hazards model. If we are interested in studying the time T to an event such as death due to cancer or failure of a light bulb, the Cox model specifies the following distribution function for T::F(t) = 1 - expleft(-int_0^t lambda_0(u) e^{eta'x(u)} du ight),where x(u) is a known function of time (the covariate vector at time u), and eta and lambda_0(u) are unknown parameters. heta = (eta, lambda_0(u)). Here eta is finite dimensional and is of interest; lambda_0(u) is an unknown non-negative function of time (known as the baseline hazard function) and is often a nuisance parameter. The collection of possible candidates for lambda_0(u) is infinite dimensional.

External links

* [http://www.quantlet.com/mdstat/scripts/csa/html/node181.html Semiparametric Models] , in: [http://www.quantlet.com/mdstat/scripts/spm/html/spmhtml.html Nonparametric and Semiparametric models: an introduction] (Springer, 2004)


Wikimedia Foundation. 2010.

Игры ⚽ Нужно сделать НИР?

Look at other dictionaries:

  • Model selection — is the task of selecting a statistical model from a set of candidate models, given data. In the simplest cases, a pre existing set of data is considered. However, the task can also involve the design of experiments such that the data collected is …   Wikipedia

  • Semiparametric regression — refers to regression models in which the predictor contains both parametric and nonparametric components …   Wikipedia

  • General linear model — Not to be confused with generalized linear model. The general linear model (GLM) is a statistical linear model. It may be written as[1] where Y is a matrix with series of multivariate measurements, X is a matrix that might be a design matrix, B… …   Wikipedia

  • First-hitting-time model — In statistics, first hitting time models are a sub class of survival models. The first hitting time, also called first passage time, of a set A with respect to an instance of a stochastic process is the time until the stochastic process first… …   Wikipedia

  • List of statistics topics — Please add any Wikipedia articles related to statistics that are not already on this list.The Related changes link in the margin of this page (below search) leads to a list of the most recent changes to the articles listed below. To see the most… …   Wikipedia

  • List of mathematics articles (S) — NOTOC S S duality S matrix S plane S transform S unit S.O.S. Mathematics SA subgroup Saccheri quadrilateral Sacks spiral Sacred geometry Saddle node bifurcation Saddle point Saddle surface Sadleirian Professor of Pure Mathematics Safe prime Safe… …   Wikipedia

  • Proportional hazards models — are a class of survival models in statistics. Survival models relate the time that passes before some event occurs to one or more covariates that may be associated with that quantity. In a proportional hazards model, the unique effect of a unit… …   Wikipedia

  • Richard D. Gill (mathematician) — Richard David Gill (born 11 September 1951) was born in the United Kingdom. He studied mathematics at the University of Cambridge (1970 1973), and subsequently followed the Diploma of Statistics course there (1973 1974). Marrying a Dutch woman,… …   Wikipedia

  • Linear regression — Example of simple linear regression, which has one independent variable In statistics, linear regression is an approach to modeling the relationship between a scalar variable y and one or more explanatory variables denoted X. The case of one… …   Wikipedia

  • Least squares — The method of least squares is a standard approach to the approximate solution of overdetermined systems, i.e., sets of equations in which there are more equations than unknowns. Least squares means that the overall solution minimizes the sum of… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”