- Rand index
In
statistics , the Rand index or Rand measure is a measure of the similarity between two data clusters.Definition
Given a set of elements and two partitions of to compare, and , we define the following:
* , the number of pairs of elements in that are in the same set in and in the same set in
* , the number of pairs of elements in that are in different sets in and in different sets in
* , the number of pairs of elements in that are in the same set in and in different sets in
* , the number of pairs of elements in that are in different sets in and in the same set in The Rand index, , is::Intuitively, one can think of as the number of agreements between and and as the number of disagreements between and .The Rand index has a value between 0 and 1, with 0 indicating that the two data clusters do not agree on any pair of points and 1 indicating that the data clusters are exactly the same.
References
* Cite journal
author = W. M. Rand
title = Objective criteria for the evaluation of clustering methods
journal =Journal of the American Statistical Association
volume = 66
pages = 846–850
year = 1971
doi = 10.2307/2284239
* Cite journal
author = K. Y. Yeung & W. L. Ruzzo
title = Principal component analysis for clustering gene expression data
journal = Bioinformatics
volume = 17
issue = 9
year = 2001
pages = 763–774
url = http://bioinformatics.oxfordjournals.org/cgi/content/abstract/17/9/763
doi = 10.1093/bioinformatics/17.9.763
Wikimedia Foundation. 2010.