Multiple correspondence analysis

Multiple correspondence analysis

In statistics, multiple correspondence analysis (MCA) is a data analysis technique for nominal categorical data, used to detect and represent underlying structures in a data set. It does this by representing data as points in a low-dimensional Euclidean space. The procedure thus appears to be the counterpart of principal component analysis for categorical data [1][2]. MCA is an extension of simple correspondence analysis (CA) in that it is applicable to a large set of categorical variables.

MCA is performed by applying the CA algorithm to either an indicator matrix or a Burt table formed from these variables.[3] An indicator matrix is an individuals × variables matrix, where the rows represent individuals and the columns are dummy variables representing categories of the variables.[4]. Analyzing the indicator matrix allows the direct representation of individuals as points in geometric space. The Burt table is the symmetric matrix of all two-way crosstabulations between the categorical variables, and has an analogy to the covariance matrix of continuous variables. Analyzing the Burt table is a more natural generalization of simple correspondence analysis, and individuals or the means of groups of individuals can be added as supplementary points to the graphical display.

Associations between variables are uncovered by calculating the chi-square distance between different categories of the variables and between the individuals (or respondents). These associations are then represented graphically as "maps", which eases the interpretation of the structures in the data. Oppositions between rows and columns are then maximized, in order to uncover the underlying dimensions best able to describe the central oppositions in the data. As in factor analysis or principal component analysis, the first axis is the most important dimension, the second axis the second most important, and so on, in terms of the amount of variance accounted for. The number of axes to be retained for analysis, is determined by calculating modified eigenvalues.

In the social sciences, MCA is arguably best known[citation needed] for its application by Pierre Bourdieu, notably in his books La Distinction, Homo Academicus and The State Nobility. Bourdieu argued that there was an internal link between his vision of the social as spatial and relational --– captured by the notion of field, and the geometric properties of MCA.[5]. Sociologists following Bourdieu's work most often opt for the analysis of the indicator matrix, rather than the Burt table, largely because of the central importance accorded to the analysis of the 'cloud of individuals'.[6]

References

  1. ^ Le Roux, B. and H. Rouanet (2004), Geometric Data Analysis, From Correspondence Analysis to Structured Data Analysis, Dordrecht. Kluwer: p.180
  2. ^ Greenacre, Michael and Blasius, Jörg (editors) (2006). Multiple Correspondence Analysis and Related Methods. London: Chapman & Hall/CRC. 
  3. ^ Greenacre, Michael (2007). Correspondence Analysis in Practice, Second Edition. London: Chapman & Hall/CRC. 
  4. ^ Le Roux, B. and H. Rouanet (2004), Geometric Data Analysis, From Correspondence Analysis to Structured Data Analysis, Dordrecht. Kluwer: p.179
  5. ^ Rouanet, Henry (2000) "The Geometric Analysis of Questionnaires. The Lesson of Bourdieu's La Distinction", in Bulletin de Méthodologie Sociologique 65, pp. 4–18
  6. ^ Lebaron, Frédéric (2009) "How Bourdieu “Quantified” Bourdieu: The Geometric Modelling of Data", in Robson and Sanders (eds.) Quantifying Bourdieu. Springer, pp. 11-30.

External links

  • Le Roux, B. and H. Rouanet (2004), Geometric Data Analysis, From Correspondence Analysis to Structured Data Analysis at Google Books: [1]
  • Greenacre, Michael (2008), La Práctica del Análisis de Correspondencias, BBVA Foundation, Madrid, available for free download at the foundation's web site [2]

Wikimedia Foundation. 2010.

Игры ⚽ Нужен реферат?

Look at other dictionaries:

  • Correspondence analysis — (CA) is a multivariate statistical technique proposed[1] by Hirschfeld[2] and later developed by Jean Paul Benzécri.[3] It is conceptually similar to principal component analysis, but applies to categorical rather than continuous data. In a… …   Wikipedia

  • Detrended correspondence analysis — (DCA) is a multivariate statistical technique widely used by ecologists to find the main factors or gradients in large, species rich but usually sparse data matrices that typify ecological community data. For example, Hill and Gauch (1980, p. 55) …   Wikipedia

  • Detrended Correspondence Analysis — (DCA) is a multivariate statistical technique widely used by ecologists to find the main factors or gradients in large, species rich but usually sparse data matrices that typify ecological community data. For example, Hill and Gauch (1980, p. 55) …   Wikipedia

  • Analysis of categorical data — These are statistical procedures which can be used for the analysis of categorical data:* regression * analysis of variance * linear modeling * log linear modeling * logistic regression * repeated measures analysis * simple correspondence… …   Wikipedia

  • Principal component analysis — PCA of a multivariate Gaussian distribution centered at (1,3) with a standard deviation of 3 in roughly the (0.878, 0.478) direction and of 1 in the orthogonal direction. The vectors shown are the eigenvectors of the covariance matrix scaled by… …   Wikipedia

  • Geometric data analysis — can refer to geometric aspects of image analysis, pattern analysis and shape analysis or the approach of multivariate statistics that treats arbitrary data sets as clouds of points in n dimensional space. This includes topological data analysis,… …   Wikipedia

  • Correspondence principle — This article discusses quantum theory and relativity. For other uses, see Correspondence principle (disambiguation). In physics, the correspondence principle states that the behavior of systems described by the theory of quantum mechanics (or by… …   Wikipedia

  • analysis — /euh nal euh sis/, n., pl. analyses / seez /. 1. the separating of any material or abstract entity into its constituent elements (opposed to synthesis). 2. this process as a method of studying the nature of something or of determining its… …   Universalium

  • Linear discriminant analysis — (LDA) and the related Fisher s linear discriminant are methods used in statistics, pattern recognition and machine learning to find a linear combination of features which characterize or separate two or more classes of objects or events. The… …   Wikipedia

  • Behavior analysis of child development — Child development in behavior analytic theory has origins in John B. Watson s behaviorism.[1] Watson wrote extensively on child development and conducted research (see Little Albert experiment). Watson was instrumental in the modification of… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”