Glivenko-Cantelli theorem

In the theory of probability, the Glivenko-Cantelli theorem determines the asymptotic behaviour of the empirical distribution function as the number of iid observations grows. This uniform convergence of more general empirical measures becomes an important property of the Glivenko-Cantelli classes of functions or sets. The Glivenko-Cantelli classes arise in VC theory, with applications to machine learning.

Glivenko-Cantelli theorem

Assume that $X_1,X_2,dots$ are iid random variables in $mathbb{R}$ with common cumulative distribution function "F(x)". The "empirical distribution function" for $X_1,dots,X_n$ is defined by

: $F_n(x)=frac{1}{n}sum_{i=1}^n I_{(-infty,x] }(X_i),$

where $I_C$ is the indicator function. For fixed "x", $F_n(x)$ is a random variable which converges to "F"("x") almost surely by the strong law of large numbers, that is, $F_n(x)$ converges to "F"("x") pointwise. Glivenko and Cantelli strengthened this result by proving uniform convergence of $F_n$ to "F".

Theorem (Glivenko, Cantelli, 1933):

: $|F_n - F|_infty = sup_{xin R} |F_n(x) - F(x)| {longrightarrow} 0$ almost surely.

Empirical measures

One can generalize the "empirical distribution function" by replacing the set $(-infty,x]$ by an arbitrary set "c" from a class of sets $mathcal{C}$ to obtain an empirical measure indexed by "c"

: $P_n(c)=frac{1}{n}sum_{i=1}^n I_c(X_i), cinmathcal{C},$

Further generalization is the map induced by $P_n$ on measurable real-valued functions "f", which is given by

: $fmapsto P_nf=int_SfdP_n=frac{1}{n}sum_{i=1}^n f(X_i), finmathcal{F},$

Then it becomes an important property of these classes that the strong law of large numbers holds uniformly on $mathcal{F}$ or $mathcal{C}$ .

Glivenko-Cantelli class

Consider a set $mathcal{S}$ with a sigma algebra of Borel subsets "A" and a probability measure "P". For a class of subsets,

: ${mathcal C}subset{c:c mbox{ is measurable subset of }mathcal{S}}$

and a class of functions

: $mathcal{F}subset{f:mathcal{S} o mathbb{R}, f mbox{ is measurable},}$

define random variables

: $|P_n-P|_{mathcal C}=sup_{cin {mathcal C |P_n(c)-P(c)|$

: $|P_n-P|_{mathcal F}=sup_{fin {mathcal F |P_nf-mathbb{E}f|$

where $P_n(c)$ is the empirical measure, $P_n f$ is the corresponding map, and

: $mathbb{E}f=int_mathcal{S} fdP$ , assuming that it exists.

Definitions
* A class $mathcal C$ is called a "Glivenko-Cantelli class" (or "GC class") with respect to a probability measure "P" if any of the following equivalent statements is true.::1. $|P_n-P|_mathcal{C} o 0$ almost surely as $n oinfty$ .::2. $|P_n-P|_mathcal{C} o 0$ in probability as $n oinfty$ .::3. $mathbb{E}|P_n-P|_mathcal{C} o 0$ , as $n oinfty$ (convergence in mean).:The Glivenko-Cantelli classes of functions are defined similarly.
*A class is called a "universal Glivenko-Cantelli class" if it is a GC class with respect to any probability measure "P" on ("S","A").
*A class is called "uniformly Glivenko-Cantelli" if the convergence occurs uniformly over all probability measures "P" on ("S","A")::: $sup_{Pin mathcal{P}(S,A)} mathbb E |P_n-P|_mathcal{C} o 0;$ :: $sup_{Pin mathcal{P}(S,A)} mathbb E |F_n-F|_mathcal{F} o 0.$

Theorem (Vapnik and Chervonenkis, 1968): "A class of sets $mathcal{C}$ is uniformly GC if and only if it is a Vapnik-Chervonenkis class. "

Examples

* Let $S=mathbb R$ and ${mathcal C}={(-infty,t] :tin {mathbb R}}$ . The classical Glivenko-Cantelli theorem implies that this class is a universal GC class. Furthermore, by Kolmogorov's theorem,: $sup_{Pin mathcal{P}(S,A)}|P_n-P|_{mathcal C} sim n^{-1/2}$ , that is $mathcal{C}$ is uniformly Glivenko-Cantelli class.

* Let "P" be a nonatomic probability measure on "S" and $mathcal{C}$ be a class of all finite subsets in "S". Because $A_n={X_1,ldots,X_n}in mathcal{C}$ , $P(A_n)=0$ , $P_n(A_n)=1$ , we have that $|P_n-P|_{mathcal C}=1$ and so $mathcal{C}$ is "not" a GC class with respect to "P".

ee also

* Donsker's theorem
* Dvoretzky–Kiefer–Wolfowitz inequality - strengthens Glivenko-Cantelli theorem by quantifying the rate of convergence.

Wikimedia Foundation. 2010.

Игры ⚽ Поможем написать курсовую

Look at other dictionaries:

Glivenko-Cantelli-Theorem — Der Gliwenko Cantelli Satz, auch Hauptsatz der Statistik oder Fundamentalsatz der Statistik genannt (nach Waleri Iwanowitsch Gliwenko und Francesco Cantelli, 1933), ist ein mathematischer Satz, der besagt, dass die empirische Verteilungsfunktion… … Deutsch Wikipedia
Francesco Paolo Cantelli — (1875 1966) was an Italian mathematician. He was the founder of the Istituto Italiano degli Attuari for the applications of mathematics and probability to economics. His early papers were on problems in astronomy and celestial mechanics.The later … Wikipedia
Donsker's theorem — In probability theory, Donsker s theorem, named after M. D. Donsker, identifies a certain stochastic process as a limit of empirical processes. It is sometimes called the functional central limit theorem. A centered and scaled version of… … Wikipedia
Gliwenko-Cantelli-Satz — Der Gliwenko Cantelli Satz, auch Hauptsatz der Statistik oder Fundamentalsatz der Statistik genannt (nach Waleri Iwanowitsch Gliwenko und Francesco Cantelli, 1933), ist ein mathematischer Satz, der besagt, dass die empirische Verteilungsfunktion… … Deutsch Wikipedia
Empirical process — The study of empirical processes is a branch of mathematical statistics and a sub area of probability theory. It is a generalization of the central limit theorem for empirical measures. DefinitionIt is known that under certain conditions… … Wikipedia
List of mathematics articles (G) — NOTOC G G₂ G delta space G networks Gδ set G structure G test G127 G2 manifold G2 structure Gabor atom Gabor filter Gabor transform Gabor Wigner transform Gabow s algorithm Gabriel graph Gabriel s Horn Gain graph Gain group Galerkin method… … Wikipedia
Kolmogorov-Smirnov test — In statistics, the Kolmogorov ndash;Smirnov test (also called the K S test for brevity) is a form of minimum distance estimation used as a nonparametric test of equality of one dimensional probability distributions used to compare a sample with a … Wikipedia
Empirical distribution function — In statistics, an empirical distribution function is a cumulative probability distribution function that concentrates probability 1/ n at each of the n numbers in a sample.Let X 1,ldots,X n be iid random variables in mathbb{R} with the cdf F ( x… … Wikipedia
Dvoretzky–Kiefer–Wolfowitz inequality — In the theory of probability and statistics, the Dvoretzky–Kiefer–Wolfowitz inequality predicts how close an empirically determined distribution function will be to the distribution function from which the empirical samples are drawn. It is named … Wikipedia
Empirical measure — In probability theory, an empirical measure is a random measure arising from a particular realization of a (usually finite) sequence of random variables. The precise definition is found below. Empirical measures are relevant to mathematical… … Wikipedia

Academic Dictionaries and Encyclopedias

Glivenko-Cantelli theorem

Look at other dictionaries:

Share the article and excerpts

Academic Dictionaries and Encyclopedias

Wikipedia

Glivenko-Cantelli theorem

Look at other dictionaries:

Share the article and excerpts

Direct link