 Perturbation theory (quantum mechanics)

In quantum mechanics, perturbation theory is a set of approximation schemes directly related to mathematical perturbation for describing a complicated quantum system in terms of a simpler one. The idea is to start with a simple system for which a mathematical solution is known, and add an additional "perturbing" Hamiltonian representing a weak disturbance to the system. If the disturbance is not too large, the various physical quantities associated with the perturbed system (e.g. its energy levels and eigenstates) can, from considerations of continuity, be expressed as 'corrections' to those of the simple system. These corrections, being 'small' compared to the size of the quantities themselves, can be calculated using approximate methods such as asymptotic series. The complicated system can therefore be studied based on knowledge of the simpler one.
Contents
 1 Applications of perturbation theory
 2 Timeindependent perturbation theory
 3 Timedependent perturbation theory
 4 Strong perturbation theory
 5 See also
 6 References
Applications of perturbation theory
Perturbation theory is an important tool for describing real quantum systems, as it turns out to be very difficult to find exact solutions to the Schrödinger equation for Hamiltonians of even moderate complexity. The Hamiltonians to which we know exact solutions, such as the hydrogen atom, the quantum harmonic oscillator and the particle in a box, are too idealized to adequately describe most systems. Using perturbation theory, we can use the known solutions of these simple Hamiltonians to generate solutions for a range of more complicated systems. For example, by adding a perturbative electric potential to the quantum mechanical model of the hydrogen atom, we can calculate the tiny shifts in the spectral lines of hydrogen caused by the presence of an electric field (the Stark effect). This is only approximate because the sum of a Coulomb potential with a linear potential is unstable although the tunneling time (decay rate) is very long. This shows up as a broadening of the energy spectrum lines, something which perturbation theory fails to reproduce entirely.
The expressions produced by perturbation theory are not exact, but they can lead to accurate results as long as the expansion parameter, say α, is very small. Typically, the results are expressed in terms of finite power series in α that seem to converge to the exact values when summed to higher order. After a certain order n∼1 / α, however, the results become increasingly worse since the series are usually divergent (being asymptotic series). There exist ways to convert them into convergent series, which can be evaluated for largeexpansion parameters, most efficiently by Variational method.
In the theory of quantum electrodynamics (QED), in which the electronphoton interaction is treated perturbatively, the calculation of the electron's magnetic moment has been found to agree with experiment to eleven decimal places. In QED and other quantum field theories, special calculation techniques known as Feynman diagrams are used to systematically sum the power series terms.
Under some circumstances, perturbation theory is an invalid approach to take. This happens when the system we wish to describe cannot be described by a small perturbation imposed on some simple system. In quantum chromodynamics, for instance, the interaction of quarks with the gluon field cannot be treated perturbatively at low energies because the coupling constant (the expansion parameter) becomes too large. Perturbation theory also fails to describe states that are not generated adiabatically from the "free model", including bound states and various collective phenomena such as solitons. Imagine, for example, that we have a system of free (i.e. noninteracting) particles, to which an attractive interaction is introduced. Depending on the form of the interaction, this may create an entirely new set of eigenstates corresponding to groups of particles bound to one another. An example of this phenomenon may be found in conventional superconductivity, in which the phononmediated attraction between conduction electrons leads to the formation of correlated electron pairs known as Cooper pairs. When faced with such systems, one usually turns to other approximation schemes, such as the variational method and the WKB approximation. This is because there is no analogue of a bound particle in the unperturbed model and the energy of a soliton typically goes as the inverse of the expansion parameter. However, if we "integrate" over the solitonic phenomena, the nonperturbative corrections in this case will be tiny; of the order of e ^{− 1 / g} or in the perturbation parameter g. Perturbation theory can only detect solutions "close" to the unperturbed solution, even if there are other solutions (which typically blow up as the expansion parameter goes to zero).
The problem of nonperturbative systems has been somewhat alleviated by the advent of modern computers. It has become practical to obtain numerical nonperturbative solutions for certain problems, using methods such as density functional theory. These advances have been of particular benefit to the field of quantum chemistry. Computers have also been used to carry out perturbation theory calculations to extraordinarily high levels of precision, which has proven important in particle physics for generating theoretical results that can be compared with experiment.
Timeindependent perturbation theory
Timeindependent perturbation theory is one of two categories of perturbation theory, the other being timedependent perturbation (see next section). In timeindependent perturbation theory the perturbation Hamiltonian is static (i.e., possesses no time dependence). Timeindependent perturbation theory was presented by Erwin Schrödinger in a 1926 paper,^{[1]} shortly after he produced his theories in wave mechanics. In this paper Schrödinger referred to earlier work of Lord Rayleigh,^{[2]} who investigated harmonic vibrations of a string perturbed by small inhomogeneities. This is why this perturbation theory is often referred to as RayleighSchrödinger perturbation theory.
First order corrections
We begin with an unperturbed Hamiltonian H_{0}, which is also assumed to have no time dependence. It has known energy levels and eigenstates, arising from the timeindependent Schrödinger equation:
For simplicity, we have assumed that the energies are discrete. The (0) superscripts denote that these quantities are associated with the unperturbed system. Note the use of Braket notation.
We now introduce a perturbation to the Hamiltonian. Let V be a Hamiltonian representing a weak physical disturbance, such as a potential energy produced by an external field. (Thus, V is formally a Hermitian operator.) Let λ be a dimensionless parameter that can take on values ranging continuously from 0 (no perturbation) to 1 (the full perturbation). The perturbed Hamiltonian is
 H = H_{0} + λV
The energy levels and eigenstates of the perturbed Hamiltonian are again given by the Schrödinger equation:
Our goal is to express E_{n} and n> in terms of the energy levels and eigenstates of the old Hamiltonian. If the perturbation is sufficiently weak, we can write them as power series in λ:
where
and
When λ = 0, these reduce to the unperturbed values, which are the first term in each series. Since the perturbation is weak, the energy levels and eigenstates should not deviate too much from their unperturbed values, and the terms should rapidly become smaller as we go to higher order.
Plugging the power series into the Schrödinger equation, we obtain
Expanding this equation and comparing coefficients of each power of λ results in an infinite series of simultaneous equations. The zerothorder equation is simply the Schrödinger equation for the unperturbed system. The firstorder equation is
Operating through by <n^{(0)}. The first term on the lefthand side cancels with the first term on the righthand side. (Recall, the unperturbed Hamiltonian is hermitian). This leads to the firstorder energy shift:
This is simply the expectation value of the perturbation Hamiltonian while the system is in the unperturbed state. This result can be interpreted in the following way: suppose the perturbation is applied, but we keep the system in the quantum state n^{(0)}>, which is a valid quantum state though no longer an energy eigenstate. The perturbation causes the average energy of this state to increase by <n^{(0)}Vn^{(0)}>. However, the true energy shift is slightly different, because the perturbed eigenstate is not exactly the same as n^{(0)}>. These further shifts are given by the second and higher order corrections to the energy.
Before we compute the corrections to the energy eigenstate, we need to address the issue of normalization. We may suppose <n^{(0)}n^{(0)}>=1, but perturbation theory assumes we also have <nn>=1. It follows that at first order in λ, we must have <n^{(0)}n^{(1)}>+<n^{(1)}n^{(0)}>=0. Since the overall phase is not determined in quantum mechanics, without loss of generality, we may assume <n^{(0)}n> is purely real. Therefore, <n^{(0)}n^{(1)}>=<n^{(1)}n^{(0)}>, and we deduce
To obtain the firstorder correction to the energy eigenstate, we insert our expression for the firstorder energy correction back into the result shown above of equating the firstorder coefficients of λ. We then make use of the resolution of the identity,
where is in the orthogonal complement of . The result is
For the moment, suppose that the zerothorder energy level is not degenerate, i.e. there is no eigenstate of H_{0} in the orthogonal complement of with the energy . We multiply through by <k^{(0)}, which gives
and hence the component of the firstorder correction along k^{(0)}> since by assumption . In total we get
The firstorder change in the nth energy eigenket has a contribution from each of the energy eigenstates k ≠ n. Each term is proportional to the matrix element <k^{(0)}Vn^{(0)}>, which is a measure of how much the perturbation mixes eigenstate n with eigenstate k; it is also inversely proportional to the energy difference between eigenstates k and n, which means that the perturbation deforms the eigenstate to a greater extent if there are more eigenstates at nearby energies. We see also that the expression is singular if any of these states have the same energy as state n, which is why we assumed that there is no degeneracy.
Secondorder and higher corrections
We can find the higherorder deviations by a similar procedure, though the calculations become quite tedious with our current formulation. Our normalization prescription gives that 2<n^{(0)}n^{(2)}>+<n^{(1)}n^{(1)}>=0. Up to second order, the expressions for the energies and (normalized) eigenstates are:
Extending the process further, the thirdorder energy correction can be shown to be ^{[3]}
Corrections to fifth order (energies) and fourth order (states) in compact notationIf we introduce the notation,
 ,
 ,
then the energy corrections to fifth order can be written
and the states to fourth order can be written
All terms involved k_{j} should be summed over k_{j} such that the denominator does not vanish.
Effects of degeneracy
Suppose that two or more energy eigenstates are degenerate. The firstorder energy shift is not well defined, since there is no unique way to choose a basis of eigenstates for the unperturbed system. The calculation of the change in the eigenstate is problematic as well, because the operator
does not have a welldefined inverse.
Let D denote the subspace spanned by these degenerate eigenstates. No matter how small the perturbation is, in the degenerate subspace D the energy differences between the eigenstates H_{0} are zero, so complete mixing of at least some of these states is assured. Thus the perturbation can not be considered small in the D subspace and in that subspace the new Hamiltonian must be diagonalized first. These correct perturbed eigenstates in D are now the basis for the perturbation expansion:
where only eigenstates outside of the D subspace are considered to be small. For the firstorder perturbation we need to solve the perturbed Hamiltonian restricted to the degenerate subspace D
simultaneously for all the degenerate eigenstates, where are firstorder corrections to the degenerate energy levels. This is equivalent to diagonalizing the matrix
This procedure is approximate, since we neglected states outside the D subspace. The splitting of degenerate energies is generally observed. Although the splitting may be small compared to the range of energies found in the system, it is crucial in understanding certain details, such as spectral lines in Electron Spin Resonance experiments.
Higherorder corrections due to other eigenstates can be found in the same way as for the nondegenerate case
The operator on the left hand side is not singular when applied to eigenstates outside D, so we can write
but the effect on the degenerate states is minuscule, proportional to the square of the firstorder correction .
Neardegenerate states should also be treated in the above manner, since the original Hamiltonian won't be larger than the perturbation in the neardegenerate subspace. An application is found in the nearlyfree electron model, where neardegeneracy treated properly gives rise to an energy gap even for small perturbations. Other eigenstates will only shift the absolute energy of all neardegenerate states simultaneously.
Generalization to Multiparameter Case
The generalization of the timeindependent perturbation theory to the multiparameter case can be formulated more systematically using the language of differential geometry, which basically defines the derivatives of the quantum states and calculate the perturbative corrections by taking derivatives iteratively at the unperturbed point.
Hamiltonian and Force Operator
From the differential geometric point of view, a parameterized Hamiltonian is considered as a function defined on the parameter manifold that maps each particular set of parameters to an Hermitian operator H(x^{μ}) that acts on the Hilbert space. The parameters here can be external field, interaction strength, or driving parameters in the quantum phase transition. Let E_{n}(x^{μ}) and be the n^{th} eigenenergy and eigenstate of H(x^{μ}) respectively. In the language of deferential geometry, the states form a vector bundle over the parameter manifold, on which derivatives of these states can be defined. The perturbation theory is to answer the following question: given and at a unperturbed reference point , how to estimate the E_{n}(x^{μ}) and at x^{μ} close to that reference point.
With out loss of generality, the coordinate system can be shifted, such that the reference point is set to be the origin. The following linearly parameterized Hamiltonian is frequently used
 H(x^{μ}) = H(0) + x^{μ}F_{μ}.
If the parameters x^{μ} are considered as generalized coordinates, then F_{μ} should be identified as the generalized force operators related to those coordinates. Different indices μ's label the different forces along different directions in the parameter manifold. For example, if x^{μ} denotes the external magnetic field in the μdirection, then F_{μ} should be the magnetization in the same direction.
Perturbation Theory as Power Series Expansion
The validity of the perturbation theory lies on the adiabatic assumption, which assumes the eigenenergies and eigenstates of the Hamiltonian are smooth functions of parameters such that their values in the vicinity region can be calculated in power series (like Taylor expansion) of the parameters:
Here denotes the derivative with respect to x^{μ}. When applying to the state , it should be understood as the Lie derivative if the vector bundle is equipped with nonvanishing connection. All the terms on the righthandside of the series are evaluated at x^{μ} = 0, e.g. and . This convention will be adopted though out the this subsection, that all functions without the parameter dependence explicitly stated are assumed to be evaluated at the origin. The power series may converge slowly or even not converging when the energy levels are close to each other. The adiabatic assumption breaks down when there is energy level degeneracy, and hence the perturbation theory is not applicable in that case.
HellmannFeynman Theorems
The above power series expansion can be readily evaluated if there is a systematic approach to calculate the derivates to any order. Using the chain rule, the derivatives can be broken down to the single derivative on either the energy or the state. The HellmannFeynman theorems are used to calculated these single derivatives. The first HellmannFeynman theorem gives the derivative of the energy,
 .
The second HellmannFeynman theorem gives the derivative of the state (resolved by the complete basis with m ≠ n),
 ,
 .
For the linearly parameterized Hamiltonian, simply stands for the generalized force operator F_{μ}.
The theorems can be simply derived by applying the differential operator to both sides of the Schrödinger equation , which reads
 .
Then overlap with the state from left and make use of the Schrödinger equation again ,
 .
Given that the eigenstates of the Hamiltonian always from a set of orthonormal basis , both the cases of m = n and m ≠ n can be discussed separately. The first case will lead to the first theorem and the second case to the second theorem, which can be shown immediately by rearranging the terms. With the differential rules given by the HellmannFeynman theorems, the perturbative correction to the energies and states can be calculated systematically.
Correction of Energy and State
To the second order, the energy correction reads
 .
The first order derivative is given by the first HellmannFeynman theorem directly. To obtain the second order derivative , simply applying the differential operator to the result of the first order derivative , which reads
 .
Note that for linearly parameterized Hamiltonian, there is no second derivative on the operator level. Resolve the derivative of state by inserting the complete set of basis,
 ,
then all parts can be calculated using the HellmannFeynman theorems. In terms of Lie derivatives, according to the definition of the connection for the vector bundle. Therefore the case m = n can be excluded from the summation, which avoids the singularity of the energy denominator. The same procedure can be carried on for higher order derivatives, from which higher order corrections are obtained.
The same computational scheme is applicable for the correction of states. The result to the second order is as follows
Both energy derivatives and state derivatives will be involved in deduction. Whenever a state derivative is encountered, resolve it by inserting the complete set of basis, then the HellmannFeynman theorem is applicable. Because differentiation can be calculated systematically, the series expansion approach to the perturbative corrections can be coded on computers with symbolic processing softwares like Mathematica.
Effective Hamiltonian
Let H(0) be the Hamiltonian completely restricted either in the lowenergy subspace or in the highenergy subspace , such that there is no matrix element in H(0) connecting the low and the highenergy subspaces, i.e. if . Let be the coupling terms connecting the subspaces. Then when the high energy degrees of freedoms are integrated out, the effective Hamiltonian in the low energy subspace reads
 .
Here are restricted in the low energy subspace. The above result can be derived by power series expansion of .
Timedependent perturbation theory
Method of variation of constants
Timedependent perturbation theory, developed by Paul Dirac, studies the effect of a timedependent perturbation V(t) applied to a timeindependent Hamiltonian H_{0}. Since the perturbed Hamiltonian is timedependent, so are its energy levels and eigenstates. Therefore, the goals of timedependent perturbation theory are slightly different from timeindependent perturbation theory. We are interested in the following quantities:
 The timedependent expectation value of some observable A, for a given initial state.
 The timedependent amplitudes of those quantum states that are energy eigenkets (eigenvectors) in the unperturbed system.
The first quantity is important because it gives rise to the classical result of an A measurement performed on a macroscopic number of copies of the perturbed system. For example, we could take A to be the displacement in the xdirection of the electron in a hydrogen atom, in which case the expected value, when multiplied by an appropriate coefficient, gives the timedependent dielectric polarization of a hydrogen gas. With an appropriate choice of perturbation (i.e. an oscillating electric potential), this allows us to calculate the AC permittivity of the gas.
The second quantity looks at the timedependent probability of occupation for each eigenstate. This is particularly useful in laser physics, where one is interested in the populations of different atomic states in a gas when a timedependent electric field is applied. These probabilities are also useful for calculating the "quantum broadening" of spectral lines (see line broadening).
We will briefly examine the ideas behind Dirac's formulation of timedependent perturbation theory. Choose an energy basis for the unperturbed system. (We will drop the (0) superscripts for the eigenstates, because it is not meaningful to speak of energy levels and eigenstates for the perturbed system.)
If the unperturbed system is in eigenstate at time , its state at subsequent times varies only by a phase (we are following the Schrödinger picture, where state vectors evolve in time and operators are constant):
We now introduce a timedependent perturbing Hamiltonian . The Hamiltonian of the perturbed system is
Let denote the quantum state of the perturbed system at time t. It obeys the timedependent Schrödinger equation,
The quantum state at each instant can be expressed as a linear combination of the eigenbasis . We can write the linear combination as
where the s are undetermined complex functions of t which we will refer to as amplitudes (strictly speaking, they are the amplitudes in the Dirac picture). We have explicitly extracted the exponential phase factors exp(iE_{n}t/
h) on the right hand side. This is only a matter of convention, and may be done without loss of generality. The reason we go to this trouble is that when the system starts in the state j> and no perturbation is present, the amplitudes have the convenient property that, for all t, c_{j}(t) = 1 and if .The absolute square of the amplitude c_{n}(t) is the probability that the system is in state n at time t, since
Plugging into the Schrödinger equation and using the fact that ∂/∂t acts by a chain rule, we obtain
By resolving the identity in front of V, this can be reduced to a set of partial differential equations for the amplitudes:
The matrix elements of V play a similar role as in timeindependent perturbation theory, being proportional to the rate at which amplitudes are shifted between states. Note, however, that the direction of the shift is modified by the exponential phase factor. Over times much longer than the energy difference E_{k}E_{n}, the phase winds many times. If the timedependence of V is sufficiently slow, this may cause the state amplitudes to oscillate. Such oscillations are useful for managing radiative transitions in a laser.
Up to this point, we have made no approximations, so this set of differential equations is exact. By supplying appropriate initial values c_{n}(0), we could in principle find an exact (i.e. nonperturbative) solution. This is easily done when there are only two energy levels (n = 1, 2), and the solution is useful for modelling systems like the ammonia molecule. However, exact solutions are difficult to find when there are many energy levels, and one instead looks for perturbative solutions, which may be obtained by putting the equations in an integral form:
By repeatedly substituting this expression for c_{n} back into right hand side, we get an iterative solution
where, for example, the firstorder term is
Many further results may be obtained, such as Fermi's golden rule, which relates the rate of transitions between quantum states to the density of states at particular energies, and the Dyson series, obtained by applying the iterative method to the time evolution operator, which is one of the starting points for the method of Feynman diagrams.
Method of Dyson series
Time dependent perturbations can be treated with the technique of Dyson series. Taking Schrödinger equation
this has the formal solution
being the time ordering operator such that
if t_{1} > t_{2} and
if so that the exponential will represent the following Dyson series
 .
Now, let us take the following perturbation problem
assuming that the parameter λ is small and that we are able to solve the problem . We do the following unitary transformation going to interaction picture or Dirac picture
and so the Schrödinger equation becomes
that can be solved through the above Dyson series as
being this a perturbation series with small λ. Using the solution of the unperturbed problem and (for the sake of simplicity we assume a pure discrete spectrum), we will have till first order
 .
So, the system, initially in the unperturbed state , due to the perturbation can go into the state . The corresponding probability amplitude will be
and the corresponding transition probability will be given by Fermi's golden rule.
Time independent perturbation theory can be derived from the time dependent perurbation theory. For this purpose, let us write the unitary evolution operator, obtained from the above Dyson series, as
and we take the perturbation V time independent. Using the identity
with for a pure discrete spectrum, we can write
We see that, at second order, we have to sum on all the intermediate states. We assume t_{0} = 0 and the asymptotic limit of larger times. This means that, at each contribution of the perturbation series, we have to add a multiplicative factor in the integrands so that, the limit will give back the final state of the system by eliminating all oscillating terms but keeping the secular ones. must be postulated as being arbitrarily small. In this way we can compute the integrals and, separating the diagonal terms from the others, we have
where the time secular series yields the eigenvalues of the perturbed problem and the remaining part gives the corrections to the eigenfunctions.^{[citation needed]} The unitary evolution operator is applied to whatever eigenstate of the unperturbed problem and, in this case, we will get a secular series that holds at small times.
Strong perturbation theory
In a similar way as for small perturbations, it is possible to develop a strong perturbation theory. Let us consider as usual the Schrödinger equation
and we consider the question if a dual Dyson series exists that applies in the limit of a perturbation increasingly large. This question can be answered in an affirmative way ^{[4]} and the series is the wellknown adiabatic series.^{[5]} This approach is quite general and can be shown in the following way. Let us consider the perturbation problem
being . Our aim is to find a solution in the form
but a direct substitution into the above equation fails to produce useful results. This situation can be adjusted making a rescaling of the time variable as τ = λt producing the following meaningful equations
that can be solved once we know the solution of the leading order equation. But we know that in this case we can use the adiabatic approximation. When V(t) does not depend on time one gets the WignerKirkwood series that is often used in statistical mechanics. Indeed, in this case we introduce the unitary transformation
that defines a free picture as we are trying to eliminate the interaction term. Now, in dual way with respect to the small perturbations, we have to solve the Schrödinger equation
and we see that the expansion parameter λ appears only into the exponential and so, the corresponding Dyson series, a dual Dyson series, is meaningful at large λs and is
After the rescaling in time τ = λt we can see that this is indeed a series in 1 / λ justifying in this way the name of dual Dyson series. The reason is that we have obtained this series simply interchanging H_{0} and V and we can go from one to another applying this exchange. This is called duality principle in perturbation theory. The choice H_{0} = p^{2} / 2m yields, as already said, a WignerKirkwood series that is a gradient expansion. The WignerKirkwood series is a semiclassical series with eigenvalues given exactly as for WKB approximation.^{[6]}
See also
References
 ^ Schrödinger, E. (1926). "Quantisierung als Eigenwertproblem [Quantification of the eigen value problem]" (in German). Annalen der Physik 80 (13): 437–490. Bibcode 1926AnP...385..437S. doi:10.1002/andp.19263851302.
 ^ Rayleigh, J. W. S. (1894). Theory of Sound. I (2nd ed.). London: Macmillan. pp. 115–118. ISBN 1152060236.
 ^ Landau, L. D.; Lifschitz, E. M.. Quantum Mechanics: Nonrelativistic Theory (3rd ed.). ISBN 008019012X.
 ^ Frasca, M. (1998). "Duality in Perturbation Theory and the Quantum Adiabatic Approximation". Phys. Rev. A 58 (5): 3439. arXiv:hepth/9801069. Bibcode 1998PhRvA..58.3439F. doi:10.1103/PhysRevA.58.3439.
 ^ Mostafazadeh, A. (1997). "Quantum adiabatic approximation and the geometric phase,". Phys. Rev. A 55 (3): 1653. arXiv:hepth/9606053. Bibcode 1997PhRvA..55.1653M. doi:10.1103/PhysRevA.55.1653.
 ^ Frasca, Marco (2007). "A strongly perturbed quantum system is a semiclassical system". Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences 463 (2085): 2195. arXiv:hepth/0603182. Bibcode 2007RSPSA.463.2195F. doi:10.1098/rspa.2007.1879.
Categories:
Wikimedia Foundation. 2010.