- Principal axis theorem
In the mathematical fields of
geometry andlinear algebra , a principal axis is a certain line in aEuclidean space associated to anellipsoid orhyperboloid , generalizing the major and minor axes of an ellipse. The "principal axis theorem" states that the principal axes are perpendicular, and gives a constructive procedure for finding them.Mathematically, the principal axis theorem is a generalization of the method of
completing the square fromelementary algebra . Inlinear algebra andfunctional analysis , the principal axis theorem is a geometrical counterpart of thespectral theorem . It has applications to thestatistics ofprincipal components analysis and thesingular value decomposition . Inphysics , the theorem is fundamental to the study ofangular momentum .Motivation
The equations in the
Cartesian plane R2:::define, respectively, an ellipse and a hyperbola. In each case, the "x" and the "y" axes are the principal axes. This is easily seen, given that there are no "cross-terms" involving products "xy" in either expression. However, the situation is more complicated with equations like:Here some method is required to determine whether this is an ellipse or a hyperbola. The basic observation is that if, by completing the square, this can be reduced to a sum of two squares then the equation is that of an ellipse, whereas if it reduces to a difference of two squares then it is the equation of a hyperbola:::Thus the problem is how to absorb the coefficient of the cross-term 8"xy" into the functions "u" and "v". Formally, this problem is similar to a problem ofmatrix diagonalization where one tries to find a suitable coordinate system in which the matrix of a linear transformation is diagonal. One first finds a matrix to which the technique of diagonalization can be applied.The trick is to write the equation in the following form::where the cross-term has been split into two equal parts. The matrix "A" in the above decomposition is a
symmetric matrix . In particular, by thespectral theorem , it has realeigenvalues and isdiagonalizable by anorthogonal matrix ("orthogonally diagonalizable").To orthogonally diagonalize "A", one must first find its eigenvalues, and then find an
orthonormal eigenbasis . Calculation reveals that the eigenvalues of "A" are:with corresponding eigenvectors:Dividing these by their respective lengths yields an orthonormal eigenbasis::Now the matrix "S" = [u1 u2] is an orthogonal matrix, since it has orthonormal columns, and "A" is diagonalized by::
This applies to the present problem of "diagonalizing" the equation through the observation that:Thus the equation is that of an ellipse, since it is the sum of two squares.
It is tempting to simplify this expression by pulling out factors of 2, however it is important "not" to do this. The quantities:have a geometrical meaning. They determine an "orthonormal coordinate system" on R2. In other words, they are obtained from the original coordinates by the application of a rotation (and possibly a reflection). As a consequence, one may use the "c"1, "c"2 coordinates to make statements about "length and angles" (particularly length), which would otherwise be more difficult in a different choice of coordinates (by rescaling them, for instance). For example, the maximum distance from the origin on the ellipse "c"12 + 9"c"22 = 1 occurs when "c"2=0, so at the points "c"1=±1. Similarly, the minimum distance is where "c"2=±1/3.
It is possible now to read off the major and minor axes of this ellipse. These are precisely the individual eigenspaces of the matrix "A", since these are where "c"2 = 0 or "c"1=0. Symbolically, the principal axes are:Collecting all of this information:
* The equation is for an ellipse, since both eigenvalues are positive. (Otherwise, if one were positive and the other negative, it would be a hyperbola.)
* The principal axes are the lines spanned by the eigenvectors.
* The minimum and maximum distances to the origin can be read off the equation in diagonal form.Using this information, it is possible to attain a clear geometrical picture of the ellipse: to graph it, for instance.Formal statement
The principal axis theorem applies to
quadratic forms in Rn; that is polynomials "Q"(x) which are homogeneous of degree 2. Any quadratic form can be put in the form:where "A" is a symmetric matrix.The first part of the theorem is contained in the following statements guaranteed by the spectral theorem:
* The eigenvalues of "A" are real.
* "A" is diagonalizable, and the eigenspaces of "A" are mutually orthogonal.In particular, "A" is "orthogonally diagonalizable", since one may take a basis of each eigenspace and apply theGram-Schmidt process separately within the eigenspace to obtain an orthonormal eigenbasis.For the second part, suppose that the eigenvalues of "A" are λ1, ..., λn (possibly repeated according to their algebraic multiplicities) and the corresponding orthonormal eigenbasis is u1,...,un. Then
* where the "c"i are the coordinates with respect to the given eigenbasis. Furthermore,
* The "i"-th principal axis is the line determined by the "n"-1 equations "c"j = 0, "j" ≠ "i". This axis is the span of the vector ui.References
*
Wikimedia Foundation. 2010.