- Glivenko-Cantelli theorem
In the
theory of probability , the Glivenko-Cantelli theorem determines the asymptotic behaviour of theempirical distribution function as the number ofiid observations grows. This uniform convergence of more generalempirical measure s becomes an important property of the Glivenko-Cantelli classes of functions or sets. The Glivenko-Cantelli classes arise inVC theory , with applications tomachine learning .Glivenko-Cantelli theorem
Assume that are iid
random variable s in with commoncumulative distribution function "F(x)". The "empirical distribution function" for is defined by:
where is the
indicator function . For fixed "x", is a random variable which converges to "F"("x")almost surely by the stronglaw of large numbers , that is, converges to "F"("x") pointwise. Glivenko and Cantelli strengthened this result by provinguniform convergence of to "F".Theorem (
Glivenko , Cantelli, 1933)::
almost surely .Empirical measures
One can generalize the "empirical distribution function" by replacing the set by an arbitrary set "c" from a class of sets to obtain an
empirical measure indexed by "c":
Further generalization is the map induced by on measurable real-valued functions "f", which is given by
:
Then it becomes an important property of these classes that the strong
law of large numbers holds uniformly on or .Glivenko-Cantelli class
Consider a set with a sigma algebra of Borel subsets "A" and a
probability measure "P". For a class of subsets,:
and a class of functions
:
define random variables
:
:
where is the empirical measure, is the corresponding map, and
:, assuming that it exists.
Definitions
* A class is called a "Glivenko-Cantelli class" (or "GC class") with respect to a probability measure "P" if any of the following equivalent statements is true.::1. almost surely as .::2. in probability as .::3. , as (convergence in mean).:The Glivenko-Cantelli classes of functions are defined similarly.
*A class is called a "universal Glivenko-Cantelli class" if it is a GC class with respect to any probability measure "P" on ("S","A").
*A class is called "uniformly Glivenko-Cantelli" if the convergence occurs uniformly over all probability measures "P" on ("S","A")::: ::Theorem (
Vapnik andChervonenkis , 1968): "A class of sets is uniformly GC if and only if it is aVapnik-Chervonenkis class . "Examples
* Let and . The classical Glivenko-Cantelli theorem implies that this class is a universal GC class. Furthermore, by Kolmogorov's theorem,:, that is is uniformly Glivenko-Cantelli class.
* Let "P" be a nonatomic probability measure on "S" and be a class of all finite subsets in "S". Because , , , we have that and so is "not" a GC class with respect to "P".
ee also
*
Donsker's theorem
*Dvoretzky–Kiefer–Wolfowitz inequality - strengthens Glivenko-Cantelli theorem by quantifying the rate of convergence.
Wikimedia Foundation. 2010.