Detrended Correspondence Analysis

Detrended Correspondence Analysis

Detrended Correspondence Analysis (DCA) is a multivariate statistical technique widely used by ecologists to find the main factors or gradients in large, species-rich but usually sparse data matrices that typify ecological community data. For example, Hill and Gauch (1980, p. 55) analyse the data of a vegetation survey of southeast England including 876 species in 3270 relevés. After eliminating outliers, DCA is able to identify two main axes: The first axis goes from dry to wet conditions, and the second axis from woodland to weed communities.

History of DCA

It was created in 1979 by Mark Hill of the United Kingdom's Institute for Terrestrial Ecology (now merged into Centre for Ecology and Hydrology) and implemented in an obsolete FORTRAN code package called DECORANA (Detrended Correspondence Analysis), a Correspondence analysis method. DECORANA is sometimes erroneously used for DCA; however, DCA is the underlying algorithm, while DECORANA is a tool for implementing it.

The problems solved by DCA

According to Hill and Gauch (1980), DCA is used to suppress two artefacts inherent in most other multivariate analyses when applied to gradient data. An example is a time-series of plant species colonising a new habitat; early successional species are replaced by mid-successional species, then by late successional ones (see example below). When such data are analysed by a standard ordination such as a correspondence analysis

* the ordination scores of the samples will exhibit the edge effect, i.e. the variance of the scores at the beginning and the end of a regular succession of species will be considerably smaller than that in the middle,

* when presented as a graph the points will be seen to follow a horseshoe shaped curve rather than a straight line (arch effect), even though the process under analysis is a steady and continuous change that human intuition would prefer to see as a linear trend.

Outside ecology, the same artefacts occur when gradient data are analysed (eg soil properties along a transect running between 2 different geologies, or behavioural data over the lifespan of an individual) because the curved projection is an accurate representation of the shape of the data in multivariate space.

Ter Braak and Prentice (1987, p. 121) cite a simulation study analysing two-dimensional species packing models resulting in a better performance of DCA compared to CA.

How DCA solves the problems

DCA is a messy and iterative algorithm, but in practice has shown itself to be a highly reliable and useful tool for data exploration and summary that deserves to be more widely known outside of community ecology (Shaw 2003). It starts by running a standard ordination (CA or reciprocal averaging) on the data, to produce the initial horse-shoe curve in which the 1st ordination axis distorts into the 2nd axis. It then divides the first axis into segnments (default = 26), and rescales each segment to have mean value of zero on the 2nd axis - this effectively squashes the curve flat. It also rescales the axis so that the ends are no longer compressed relative to the middle, so that 1 DCA unit approximates to the same rate of turnover all the way through the data: the rule of thumb is that 4 DCA units mean that there has been a total turnover in the community.Ter Braak and Prentice (1987, p.122) warn against the non-linear rescaling of the axes due to robustness issues and recommend to use detrending-by-polynomials only.

The drawbacks of DCA

No significance tests are available with DCA, although there is a constrained (canonical) version called DCCA in which the axes are forced by Multiple linear regression to correlate optimally with a linear combination of other (usually environmental) variables; this allows testing of a null model by Monte-Carlo permutation analysis.

Example

The example shows an ideal data set: The species data is in rows, samples in columns. For each sample along the gradient a new species is introduced but another species is no longer present. The result is a sparse matrix, ones indicate the presence of a species in a sample. Except at the edges each sample contains five species. The plot of the first two axes of the correspondence analysis result on the right hand side clearly shows the disadvantages of this procedure: the edge effect, i.e. the points are clustered at the edges of the first axis and the arch effect.

ee also

* Eigenanalysis

* Ordination (statistics)

* Seriation (archaeology) - including additional examples for the arch effect

References

* Hill, M.O. (1979). "DECORANA - A FORTRAN program for Detrended Correspondence Analysis and Reciprocal Averaging". Section of Ecology and Systematics, Cornell University, Ithaca, New York, 52pp.

* Hill, M.O. and Gauch, H.G. (1980). Detrended Correspondence Analysis: An Improved Ordination Technique. "Vegetatio" 42, 47-58.

* Oksanen J and Minchin PR (1997). Instability of ordination results under changes in input data order: explanation and remedies. "Journal of vegetation science" 8, 447-454

* Shaw PJA (2003). "Multivariate Statistics for the Environmental Sciences". London: Hodder Arnold

* Ter Braak, C.J.F. and Prentice, I.C. (1988). A Theory of Gradient Analysis. "Advances in Ecological Research" 18, 271-371. ISBN 0-12-013918-9. Reprinted in: Ter Braak, C.J.F. (1987). "Unimodal models to relate species to environment". Wageningen: PhD thesis Agricultural Mathematics Group, 101-146.

External Links

* [http://folk.uio.no/ohammer/past/ PAST (PAlaeontological STatistics)] - free software including DCA with modifications according to Oxanen and Minchin (1997)

* [http://www.uni-koeln.de/~al001/ WINBASP] - free software including DCA with detrending-by-polynomials according to Ter Braak and Prentice (1988)


Wikimedia Foundation. 2010.

Игры ⚽ Нужен реферат?

Look at other dictionaries:

  • Detrended correspondence analysis — (DCA) is a multivariate statistical technique widely used by ecologists to find the main factors or gradients in large, species rich but usually sparse data matrices that typify ecological community data. For example, Hill and Gauch (1980, p. 55) …   Wikipedia

  • Correspondence analysis — (CA) is a multivariate statistical technique proposed[1] by Hirschfeld[2] and later developed by Jean Paul Benzécri.[3] It is conceptually similar to principal component analysis, but applies to categorical rather than continuous data. In a… …   Wikipedia

  • Principal component analysis — PCA of a multivariate Gaussian distribution centered at (1,3) with a standard deviation of 3 in roughly the (0.878, 0.478) direction and of 1 in the orthogonal direction. The vectors shown are the eigenvectors of the covariance matrix scaled by… …   Wikipedia

  • Seriation (archaeology) — In archaeology, seriation is a relative dating method in which assemblages or artifacts from numerous sites, in the same culture, are placed in chronological order. Where absolute dating methods, such as carbon dating, cannot be applied,… …   Wikipedia

  • Ordination (statistics) — In multivariate analysis, ordination is a method complementary to data clustering, and used mainly in exploratory data analysis (rather than in hypothesis testing). Ordination orders objects that are characterized by values on multiple variables… …   Wikipedia

  • Landscape ecology — is the science and art of studying and improving the relationship between spatial pattern and ecological processes on a multitude of scales and organizational levels (Wu 2006, 2008; Wu and Hobbs 2007). In a broad sense, landscape ecology… …   Wikipedia

  • Análisis de correspondencias — Saltar a navegación, búsqueda En estadística multivariante, el análisis de correspondencias es una técnica descriptiva desarrollada por Jean Paul Benzécri.[1] Suele aplicarse al estudio de tablas de contingencia y es conceptualmente similar al… …   Wikipedia Español

  • Ordinationsverfahren — Unter dem Begriff Ordinationen werden viele mathematische Verfahrensweisen zusammengefasst, die darauf zielen, bestimmte Daten grafisch in einem Koordinatensystem zu veranschaulichen. Diese werden z. B. in der Ökologie bzw. Vegetationskunde… …   Deutsch Wikipedia

  • DCA — may refer to: Contents 1 Organizations 2 Science and technology 3 Computers …   Wikipedia

  • DCA — Die Abkürzung DCA steht für: die Fluglinie DC Aviation, vormals DaimlerChrysler Aviation Democratic Congress Alliance, eine ehemalige Partei in Gambia (1960–1965) Department for Constitutional Affairs, eine britische Justizbehörde (2003–2007),… …   Deutsch Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”