Singular Spectrum Analysis

Singular Spectrum Analysis (SSA) is a powerful technique of time series analysis that incorporates elements of classical time series analysis, multivariate statistics, multivariate geometry, dynamical systems and signal processing.

The birth of SSA is usually associated with the publication of papers by Broomhead (e.g. Broomhead and King, 1986), while the ideas of SSA were independently developed in Russia (St. Petersburg, Moscow) and by several groups in the UK and USA. At present, the papers dealing with the methodological aspects and the applications of SSA number several hundred (see, for example, Vautard et al., 1992; Ghil and Taricco, 1997; Allen and Smith, 1986; Danilov, 1997; Yiou et al., 2000 and references therein). A thorough description of the theoretical and practical foundations of the SSA technique (with several examples) can be found in Danilov and Zhigljavsky (1997) and Golyandina et al. (2001). An elementary introduction to the subject can be found in Elsner and Tsonis (1996).

The SSA technique is a nonparametric spectral estimation method based on embedding a time series $\{X(t): t = 1, \dots, N\}$ in a vector space of dimension $M$. SSA proceeds by diagonalizing the $M \times M$ lag-covariance matrix $\mathbf{C}_X$ of $X(t)$ to obtain spectral information on the time series, assumed to be stationary in the weak sense. The matrix $\mathbf{C}_X$ can be estimated directly from the data as a Toeplitz matrix with constant diagonals (Vautard and Ghil, 1989), i.e., its entries $c_{ij}$ depend only on the lag $|i-j|$:
$$c_{ij} = \frac{1}{N-|i-j|} \sum_{t=1}^{N-|i-j|} X(t) X(t+|i-j|).$$
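
The Toeplitz estimate above can be sketched in a few lines of NumPy. This is a minimal illustration rather than a reference implementation: the function name `toeplitz_lag_covariance` is hypothetical, and centering the series before computing lagged products is an assumption (the formula reads most naturally for a zero-mean series).

```python
import numpy as np

def toeplitz_lag_covariance(x, M):
    """Toeplitz lag-covariance estimate: entry (i, j) depends only on |i - j|."""
    x = np.asarray(x, dtype=float)
    x = x - x.mean()  # assumption: work with the centered series
    N = x.size
    # c[k] = 1/(N-k) * sum_{t=1}^{N-k} X(t) X(t+k), for lags k = 0..M-1
    c = np.array([x[:N - k] @ x[k:] / (N - k) for k in range(M)])
    # Matrix of lags |i - j|, used to place c[k] on the k-th diagonals
    lags = np.abs(np.subtract.outer(np.arange(M), np.arange(M)))
    return c[lags]

# Usage on synthetic data (illustrative values only)
rng = np.random.default_rng(0)
x = rng.standard_normal(200)
C = toeplitz_lag_covariance(x, 5)
```

Each diagonal of `C` is constant by construction, and the main diagonal holds the lag-0 variance of the centered series.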


An alternative way to compute $\mathbf{C}_X$ is by using the $N' \times M$ "trajectory matrix" $\mathbf{D}$ that is formed by $M$ lag-shifted copies of $X(t)$, each of length $N' = N - M + 1$; then

$$\mathbf{C}_X = \frac{1}{N'} \mathbf{D}^{\mathrm{t}} \mathbf{D}.$$
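
A short sketch of the trajectory-matrix route, under the same assumption of a centered series. The helper names `trajectory_matrix` and `lag_covariance_from_trajectory` are hypothetical.

```python
import numpy as np

def trajectory_matrix(x, M):
    """N' x M matrix whose column j is the series shifted by j samples."""
    x = np.asarray(x, dtype=float)
    N_prime = x.size - M + 1
    # Row t holds X(t), X(t+1), ..., X(t+M-1)
    return np.column_stack([x[j:j + N_prime] for j in range(M)])

def lag_covariance_from_trajectory(x, M):
    """C_X = D^t D / N', with D built from the centered series (assumption)."""
    x = np.asarray(x, dtype=float)
    D = trajectory_matrix(x - x.mean(), M)
    return D.T @ D / D.shape[0]

# Usage on a toy series
x = np.arange(10.0)
D = trajectory_matrix(x, 3)
C = lag_covariance_from_trajectory(x, 3)
```

Unlike the Toeplitz estimate, this matrix is symmetric and positive semi-definite by construction, but its diagonals are in general not exactly constant.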


The $M$ eigenvectors $\mathbf{E}_k$ of the lag-covariance matrix $\mathbf{C}_X$ are called temporal empirical orthogonal functions (EOFs). The eigenvalues $\lambda_k$ of $\mathbf{C}_X$ account for the partial variance in the direction $\mathbf{E}_k$, and the sum of the eigenvalues, i.e., the trace of $\mathbf{C}_X$, gives the total variance of the original time series $X(t)$. The name of the method derives from the singular values $\lambda_k^{1/2}$, the square roots of the eigenvalues of $\mathbf{C}_X$; these are the singular values of the scaled trajectory matrix $\mathbf{D}/\sqrt{N'}$ when $\mathbf{C}_X$ is computed from $\mathbf{D}$.
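
As an illustration, the EOFs and eigenvalues can be obtained with a symmetric eigensolver. The sketch below assumes the trajectory-matrix estimate of the covariance and a synthetic test series; `ssa_eigen` is a hypothetical helper name.

```python
import numpy as np

def ssa_eigen(C):
    """Eigenvalues (descending) and EOFs (columns) of a symmetric matrix C."""
    lam, E = np.linalg.eigh(C)        # eigh returns ascending eigenvalues
    order = np.argsort(lam)[::-1]     # reorder so the largest variance comes first
    return lam[order], E[:, order]

# Synthetic example: a noisy sinusoid (illustrative values only)
rng = np.random.default_rng(0)
t = np.arange(300)
x = np.sin(2 * np.pi * t / 20) + 0.3 * rng.standard_normal(t.size)
x = x - x.mean()
M = 40
D = np.column_stack([x[j:j + x.size - M + 1] for j in range(M)])
C = D.T @ D / D.shape[0]

lam, E = ssa_eigen(C)
singular_values = np.sqrt(np.clip(lam, 0.0, None))  # lambda_k^{1/2}
```

The eigenvalues sum to the trace of `C`; with this embedding the trace adds up the variance carried by all `M` lagged copies of the series.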

Decomposition and reconstruction

Projecting the time series onto each EOF yields the corresponding temporal principal components (PCs) $\mathbf{A}_k$:
$$A_k(t) = \sum_{j=1}^{M} X(t+j-1) E_k(j).$$
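
The projection formula is a matrix product of the trajectory matrix with the EOFs, as the following sketch shows. The function name `principal_components` is hypothetical, and the EOFs here are assumed to come from the trajectory-matrix covariance.

```python
import numpy as np

def principal_components(x, E):
    """A[t, k] = sum_j X(t+j-1) E_k(j); E has one EOF per column."""
    x = np.asarray(x, dtype=float)
    M = E.shape[0]
    # Trajectory matrix: row t is the window X(t), ..., X(t+M-1)
    D = np.column_stack([x[j:j + x.size - M + 1] for j in range(M)])
    return D @ E

# Usage on a centered random series (illustrative values only)
rng = np.random.default_rng(1)
x = rng.standard_normal(120)
x = x - x.mean()
M = 8
D = np.column_stack([x[j:j + x.size - M + 1] for j in range(M)])
lam, E = np.linalg.eigh(D.T @ D / D.shape[0])
A = principal_components(x, E)
```

Each PC has length $N' = N - M + 1$, and with EOFs taken from this covariance estimate the PCs are mutually uncorrelated at lag zero, with variances equal to the eigenvalues.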


An oscillatory mode is characterized by a pair of nearly equal SSA eigenvalues and associated PCs that are in approximate phase quadrature (Ghil et al., 2002). Such a pair can efficiently represent a nonlinear, anharmonic oscillation, because a single pair of data-adaptive SSA eigenmodes will often capture the basic periodicity of an oscillatory mode better than methods with fixed basis functions, such as the sines and cosines used in the Fourier transform.
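
A small numerical experiment can make the pairing visible. The sketch below uses a synthetic sinusoid of period 20 in white noise (an assumed test signal, not data from the references) and checks two things: the ratio of the two leading eigenvalues, and the lag at which the cross-correlation of the two leading PCs peaks, which for phase quadrature should lie near a quarter period.

```python
import numpy as np

# Assumed test signal: sinusoid of period 20 plus white noise
rng = np.random.default_rng(0)
t = np.arange(300)
x = np.sin(2 * np.pi * t / 20) + 0.3 * rng.standard_normal(t.size)
x = x - x.mean()

M = 40
N_prime = x.size - M + 1
D = np.column_stack([x[j:j + N_prime] for j in range(M)])
lam, E = np.linalg.eigh(D.T @ D / N_prime)
order = np.argsort(lam)[::-1]
lam, E = lam[order], E[:, order]
A = D @ E  # principal components, one per column

# Nearly equal leading eigenvalues indicate a candidate oscillatory pair
ratio = lam[1] / lam[0]

# Lag of maximum cross-correlation between the two leading PCs
xc = np.correlate(A[:, 0], A[:, 1], mode="full")
lags = np.arange(-(N_prime - 1), N_prime)
best_lag = lags[np.argmax(xc)]
```

Near-equal eigenvalues alone do not establish an oscillation; the quadrature check and a significance test are still needed.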

The window width $M$ determines the longest periodicity captured by SSA. Signal-to-noise separation can be obtained by merely inspecting the slope break in a "scree diagram" of eigenvalues $\lambda_k$ or singular values $\lambda_k^{1/2}$ vs. $k$. The point $k^* = S$ at which this break occurs should not be confused with a "dimension" $D$ of the underlying deterministic dynamics (Vautard and Ghil, 1989).

A Monte-Carlo test (Allen and Robertson, 1996) can be applied to ascertain the statistical significance of the oscillatory pairs detected by SSA. The entire time series or parts of it that correspond to trends, oscillatory modes or noise can be reconstructed by using linear combinations of the PCs and EOFs, which provide the reconstructed components (RCs) $\mathbf{R}_K$:
$$R_K(t) = \frac{1}{M_t} \sum_{k \in K} \sum_{j=L_t}^{U_t} A_k(t-j+1) E_k(j);$$
here $K$ is the set of EOFs on which the reconstruction is based. The values of the normalization factor $M_t$, as well as of the lower and upper bound of summation $L_t$ and $U_t$, differ between the central part of the time series and the vicinity of its endpoints (Ghil et al., 2002).
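
The reconstruction can be sketched as an average over all window positions that cover a given time $t$. In the code below, the normalization $M_t$ is taken to be the number of valid $(t-j+1, j)$ pairs, which handles the endpoints implicitly; that this matches the exact bounds given in Ghil et al. (2002) is an assumption. The function name `reconstruct` and the 0-based mode indices are illustrative choices.

```python
import numpy as np

def reconstruct(A, E, K):
    """Reconstructed component for the modes listed in K (0-based indices).

    A: (N', M) principal components, E: (M, M) EOFs as columns.
    """
    N_prime, M = A.shape
    N = N_prime + M - 1
    # Entry (s, j) of D_K is sum_{k in K} A_k(s) E_k(j), an estimate of X(s + j)
    D_K = A[:, K] @ E[:, K].T
    rc = np.zeros(N)
    counts = np.zeros(N)  # plays the role of M_t
    for j in range(M):
        rc[j:j + N_prime] += D_K[:, j]
        counts[j:j + N_prime] += 1
    return rc / counts

# Usage: decompose a centered series and rebuild it (illustrative values only)
rng = np.random.default_rng(2)
x = rng.standard_normal(100)
x = x - x.mean()
M = 10
D = np.column_stack([x[j:j + x.size - M + 1] for j in range(M)])
lam, E = np.linalg.eigh(D.T @ D / D.shape[0])
A = D @ E
full = reconstruct(A, E, list(range(M)))
```

Because the EOFs are orthonormal, using all $M$ modes returns the original (centered) series, and the individual RCs add up to it.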

Multivariate extension

Multi-channel SSA (or M-SSA) is a natural extension of SSA to an $L$-channel time series of vectors or maps with $N$ data points $\{X_l(t): l = 1, \dots, L;\ t = 1, \dots, N\}$. In the meteorological literature, extended EOF (EEOF) analysis is often assumed to be synonymous with M-SSA. The two methods are both extensions of classical principal component analysis (PCA) but they differ in emphasis: EEOF analysis typically utilizes a number $L$ of spatial channels much greater than the number $M$ of temporal lags, thus limiting the temporal and spectral information. In M-SSA, on the other hand, one usually chooses $L \leq M$. Often M-SSA is applied to a few leading PCs of the spatial data, with $M$ chosen large enough to extract detailed temporal and spectral information from the multivariate time series (Ghil et al., 2002).
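
One common way to set up M-SSA is to place the single-channel trajectory matrices side by side, giving an $N' \times LM$ matrix whose covariance contains all auto- and cross-covariances up to lag $M-1$. The text does not specify the matrix layout, so the column ordering below (channel-major) and the name `mssa_trajectory_matrix` are assumptions.

```python
import numpy as np

def mssa_trajectory_matrix(X, M):
    """Multichannel trajectory matrix.

    X: (N, L) array with N time points and L channels.
    Returns an (N', L*M) array; column l*M + j is channel l shifted by j.
    """
    X = np.asarray(X, dtype=float)
    N, L = X.shape
    N_prime = N - M + 1
    blocks = [
        np.column_stack([X[j:j + N_prime, l] for j in range(M)])
        for l in range(L)
    ]
    return np.hstack(blocks)

# Usage on a toy two-channel series
X = np.arange(20.0).reshape(10, 2)
D = mssa_trajectory_matrix(X, 3)
C = D.T @ D / D.shape[0]          # (L*M) x (L*M) covariance
lam = np.linalg.eigvalsh(C)[::-1]  # eigenvalues, largest first
```

In practice each channel would be centered (and often normalized) before this step.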

Spatio-temporal gap filling

The gap-filling version of SSA can be used to analyze data sets that are unevenly sampled or contain missing data (Kondrashov and Ghil, 2006). For a univariate time series, the SSA gap-filling procedure utilizes temporal correlations to fill in the missing points. For a multivariate data set, gap filling by M-SSA takes advantage of both spatial and temporal correlations. In either case: (i) estimates of missing data points are produced iteratively, and are then used to compute a self-consistent lag-covariance matrix $\mathbf{C}_X$ and its EOFs $\mathbf{E}_k$; and (ii) cross-validation is used to optimize the window width $M$ and the number of leading SSA modes to fill the gaps with the iteratively estimated "signal," while the noise is discarded.
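
Step (i) can be illustrated with a simplified univariate loop. This sketch is not the published algorithm: it keeps the window width `M` and the number of modes `n_modes` fixed instead of selecting them by cross-validation, it uses all requested modes at once, and it marks missing points with NaN. The function name `ssa_gap_fill` and its parameters are hypothetical.

```python
import numpy as np

def ssa_gap_fill(x, M, n_modes, n_iter=100, tol=1e-6):
    """Iteratively replace NaNs in x by a reconstruction from leading SSA modes."""
    x = np.asarray(x, dtype=float)
    missing = np.isnan(x)
    mean = x[~missing].mean()
    y = np.where(missing, 0.0, x - mean)  # initial guess: mean of observed data
    N = y.size
    N_prime = N - M + 1
    for _ in range(n_iter):
        D = np.column_stack([y[j:j + N_prime] for j in range(M)])
        lam, E = np.linalg.eigh(D.T @ D / N_prime)
        E = E[:, np.argsort(lam)[::-1][:n_modes]]  # leading EOFs
        D_K = (D @ E) @ E.T                        # low-rank trajectory matrix
        rc = np.zeros(N)
        counts = np.zeros(N)
        for j in range(M):                         # average over window positions
            rc[j:j + N_prime] += D_K[:, j]
            counts[j:j + N_prime] += 1
        rc /= counts
        change = np.max(np.abs(rc[missing] - y[missing])) if missing.any() else 0.0
        y[missing] = rc[missing]                   # update only the gaps
        if change < tol:
            break
    return y + mean

# Usage: a sinusoid with a short gap (illustrative values only)
t = np.arange(200)
truth = np.sin(2 * np.pi * t / 20)
x = truth.copy()
x[78:83] = np.nan
filled = ssa_gap_fill(x, M=40, n_modes=2)
```

Observed values are left untouched; only the missing points are updated, and the covariance matrix is recomputed from the current estimate at every pass.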

References


* Allen, M. R., and A. W. Robertson: Distinguishing modulated oscillations from coloured noise in multivariate datasets, Clim. Dyn., 12, 775--784, 1996.
* Ghil, M., M. R. Allen, M. D. Dettinger, K. Ide, D. Kondrashov, et al.: Advanced spectral methods for climatic time series, Rev. Geophys., 40(1), 3.1--3.41, doi: 10.1029/2000RG000092, 2002.
* Kondrashov, D., and M. Ghil: [http://www.nonlin-processes-geophys.net/13/151/2006/npg-13-151-2006.html Spatio-temporal filling of missing points in geophysical data sets], Nonlin. Processes Geophys., 13, 151--159, 2006.
* Vautard, R., and M. Ghil: Singular spectrum analysis in nonlinear dynamics, with applications to paleoclimatic time series, Physica D, 35, 395--424, 1989.
* Danilov, D., and A. Zhigljavsky (Eds.): [http://www.gistatgroup.com/cat/books.html Principal Components of Time Series: the Caterpillar method], University of St. Petersburg Press, 1997 (in Russian).
* Golyandina, N., V. Nekrutkin, and A. Zhigljavsky: [http://www.amazon.com/Analysis-Time-Structure-Related-Techniques/dp/1584881941/ref=tag_tdp_sv_edpp_i Analysis of Time Series Structure: SSA and related techniques], Chapman and Hall/CRC, 2001.
* Elsner, J. B., and A. A. Tsonis: Singular Spectrum Analysis: A New Tool in Time Series Analysis, Plenum Press, 1996.

See also

*Multitaper method
*Short-time Fourier transform
*Spectral density estimation

External links


* [http://www.atmos.ucla.edu/tcd/ssa Singular Spectrum Analysis--Multi-taper Method (SSA-MTM) Toolkit] freeware from UCLA.
*kSpectra Toolkit for Mac OS X from [http://www.spectraworks.com SpectraWorks.]
* [http://www.math.uni-bremen.de/~theodore/ssawiki/ Yet another SSAwiki page.]


Wikimedia Foundation. 2010.
