This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
/2) acts on a two-component spinor X · Under rotations we change x as follows: - ia•n 0, we must have In) --> ln (0l ) and D." --> 0. Nevertheless, even for A -=f. 0, we can always add to In) a solution to the homogeneous equation (5.1.18), namely, cn ln<0l), so a In) == f<+l) -+ (x'I ; t). The I<J>; t) ket makes no contribution; as for its effect on the second term, [ in ( 8j8t ) - H0] acting on G+ just gives o(t - t'), by means of which the integration is immediately evaluated to be V I "' ( + >; t). So far we have not even required I "'< + >; t) to be an energy eigenket. If it is, we can separate the time dependence as usual: lcJ>; t) = l cJ> ) e - ,Et!h ( 7 . 1 1 .9) I "' ; t) l "' ) e - d't!h . Here there the implicit assumption that E does not change if V is +
(
)
On the other hand, the ak's themselves are to remain unchanged under rotations. So strictly speaking, despite its appearance, a is not to be regarded as a vector; rather, it is xtax which obeys the transformation property of a vector: (3.2.47) An explicit proof of this may be given using (3.2.48) and so on, which is the 2 X 2 matrix analogue of (3.2.6).
3 .2. Spin
;
167
Systems and Finite Rotations
In discussing a 27T rotation using the ket formalism, we have seen that a spin 1 ket Ia) goes into - Ia). The 2 X 2 analogue of this statement is
( - io·itcp ) l
(3.2.49) = - 1 , for any it, 2 q, � 2 " which is evident from (3.2.44). As an instructive application of rotation matrix (3 . 2.45), let us see how we can construct an eigenspinor of o · il with eigenvalue + 1 , where il is a unit vector m some specified direction. Our object is to construct x satisfying (3.2.50) In other words, we look for the two-component column matrix representa tion of IS · il; + ) defined by exp
S · it iS · n ; + )
= ( � ) IS · n ; + ) .
(3.2.51)
Actually this can be solved as a straightforward eigenvalue problem (see Problem 9 in Chapter 1 ), but here we present an alternative method based on rotation matrix (3.2.45). Let the polar and the azimuthal angles that characterize il be {3 and a, respectively. We start with (�) , the two-component spinor that represents the spin-up state. Given this, we first rotate about the y-axis by angle {3; we subsequently rotate by angle a about the z-axis. We see that the desired I
I
�
... .... -- -
'
.... ..
/ I I I
Second rotation
/
I I I I
I
FIGURE 3.3.
First rotation
�, : a
' '
'i
Construction of a · it eigenspinor.
168
Theory of Angular Momentum
spin state is then obtained; see Figure 3.3. In the Pauli spinor language this sequence of operations is equivalent to applying exp( - i a2/3/2) to ( �) followed by an application of exp( - i a3a j2). The net result is
[ ( � ) - ia3 sin( � ) ](cos(�) - ia2 sin( �) ] ( � ) cos (�) cos( � ) - i sin ( � ) 0 cos( � ) + i sin( � ) sin ( � ) 0 cos ( �) e -'a12 sin ( � ) e w/2
x = cos
(3.2.52)
in complete agreement with Problem 9 of Chapter 1 if we realize that a phase common to both the upper and lower components is devoid of physical significance. 3.3.
S0(3), SU(2), AND EULER ROTATIONS Orthogonal Group
We will now study a little more systematically the group properties of the operations with which we have been concerned in the previous two sections. The most elementary approach to rotations is based on specifying the axis of rotation and the angle of rotation. It is clear that we need three real numbers to characterize a general rotation: the polar and the azimuthal angles of the unit vector it taken in the direction of the rotation axis and the rotation angle cJ> itself. Equivalently, the same rotation can be specified by the three Cartesian components of the vector ncf>. However, these ways of characterizing rotation are not so convenient from the point of view of studying the group properties of rotations. For one thing, unless cJ> is infinitesimal or it is always in the same direction, we cannot add vectors of the form ncf> to characterize a succession of rotations. It is much easier to work with a 3 X 3 orthogonal matrix R because the effect of successive rotations can be obtained just by multiplying the appropriate orthogonal matrices. How many independent parameters are there in a 3 X 3 orthogonal matrix? A real 3 X 3 matrix has 9 entries, but we have the orthogonality constraint (3.3.1)
3.3 S0(3), SU(Z), and Euler Rotations
169
RRT,
which corresponds to 6 independent equations because the product being the same as is a symmetrical matrix with 6 independent entries. As a result, there are 3 (that is, 9 - 6) independent numbers in the same number previously obtained by a more elementary method. The set of all multiplication operations with orthogonal matrices forms a group. By this we mean that the following four requirements are satisfied: 1 . The product of any two orthogonal matrices is another orthogonal matrix, which is satisfied because
R rR,
R,
(R 1 R 2 )(R1R 2 )T = R1R 2 R�R'[ = 1 . 2. The associative law holds: R1(R 2 R3) = (R1R 2 )R3.
( 3 .3 .2) (3.3.3)
3. The identity matrix !-physically corresponding to no rotation -defined by
Rl =lR = R
(3.3.4)
is a member of the class of all orthogonal matrices. 4. The inverse matrix - 1 -physically corresponding to rotation in the opposite sense-defined by
R
RR-1 = R- 1R =l
(3 .3. 5)
is also a member. This group has the name S0(3) , where S stands for special, 0 stands for orthogonal, 3 for three dimensions. Note only rotational operations are considered here, hence we have S0(3) rather than 0(3) (which can include the inversion operation of Chapter 4 later).
Unitary unimodular group In the previous section we learned yet another way to characterize an arbitrary rotation-that is, to look at the 2 X 2 matrix (3.2.45) that acts on the two-component spinor X · Clearly, (3.2.45) is unitary. As a result, for the and defined in (3.2.28), (3.3.6) is left invariant. Furthermore, matrix (3.2.45) is unimodular; that is, its determinant is 1 , as will be shown explicitly below. We can write the most general unitary unimodular matrix as
c+
c
_,
l c + l 2 + l c l 2 =1
U( a , b ) =
)
( - ab*
b a* '
(3.3.7)
where a and b are complex numbers satisfying the unimodular condition a i + 1W = . (3.3.8)
l 2
1
Theory of Angular Momentum
170
We can easily establish the unitary property of (3.3.7) as follows:
(
U( a , b)t u( a , b ) = a * b*
-b a
)(
a - b*
)
b =1 a*
( 3 .3 .9) '
where we have used (3.3.8). Notice that the number of independent real parameters in (3.3.7) is again three. We can readily see that the 2 X 2 matrix (3.2.45) that characterizes a rotation of a spin 1 system can be written as U( a , b ). Comparing (3.2.45) with (3.3. 7), we identify Re( a ) = cos
( i)
Re( b ) = - n Y sin
,
( i),
Im( a ) = - nzsin
(i) (i)
Im( b ) = - nxsin
.
(3 .3.10) .
from which the unimodular property of (3.3.8) is immediate. Conversely, it is clear that the most general unitary unimodular matrix of form (3.3.7) can be interpreted as representing a rotation. The two complex numbers a and b are known as Cayley-Klein parameters. Historically the connection between a unitary unimodular ma trix and a rotation was known long before the birth of quantum mechanics. In fact, the Cayley-Klein parameters were used to characterize complicated motions of gyroscopes in rigid-body kinematics. Without appealing to the interpretations of unitary unimodular matrices in terms of rotations, we can directly check the group properties of multiplication operations with unitary unimodular matrices. Note in par ticular that (3.3 .1 1 ) U( a1 , b1 ) U( a 2 , b2 ) = U( a1 a 2 - b 1 bi , a1 b2 + a ib 1 ) , where the unimodular condition for the product matrix is la1a 2 - blb� l 2 + la1 b2 + a �b 1 l 2 = 1 .
(3.3 .12)
For the inverse of U we have
u-1( a , b ) = U( a * , - b ) .
(3.3.13) This group is known as SU(2), where S stands for special, U for unitary, and 2 for dimensionality 2. In contrast, the group defined by multiplication operations with general 2 X 2 unitary matrices (not necessarily constrained to be unimodular) is known as U(2). The most general unitary matrix in two dimensions has four independent parameters and can be written as e'Y (with y real) times a unitary unimodular matrix: U = e'Y .
(
a - b*
SU(2) is called a subgroup of U(2).
y * = y.
(3.3 .14)
3.3 S0(3), SU(2),
171
and Euler Rotations
Because we can characterize rotations using both the S0(3) language and the SU(2) language, we may be tempted to conclude that the groups S0(3) and SU(2) are isomorphic-that is, that there is a one-to-one cor respondence between an element of S0(3) and an element of SU(2). This inference is not correct. Consider a rotation by 21T and another one by 41T. In the S0(3) language, the matrices representing a 21T rotation and a 41T rotation are both 3 x 3 identity matrices; however, in the SU(2) language the corresponding matrices are 1 times the 2 x 2 identity matrix and the identity matrix itself respectively. More generally, U(a,b) and U( -a, - b) both correspond to a single 3 x 3 matrix in the S0(3) language. The cor respondence therefore is two-to-one ; for a given R , the corresponding U is double-valued. One can say, however, that the two groups are locally isomorphic . -
,
Euler Rotations From classical mechanics the reader may be familiar with the fact that an arbitrary rotation of a rigid body can be accomplished in three steps, known as Euler rotations. The Euler rotation language, specified by three Euler angles, provides yet another way to characterize the most general rotation in three dimensions. The three steps of Euler rotations are as follows. First, rotate the rigid body counterclockwise (as seen from the positive z-side) about the z-axis by angle a. Imagine now that there is a body y-axis embedded, so to speak, in the rigid body such that before the z-axis rotation is carried out, the body y-axis coincides with the usual y-axis, referred to as the space-fixed y-axis. Obviously, after the rotation about the z-axis, the body y-axis no longer coincides with the space-fixed y-axis; let us call the former the y '-axis. To see how all this may appear for a thin disk, refer to Figure 3.4a. We now perform a second rotation, this time about the y '-axis by angle {3. As a result, the body z-axis no longer points in the space-fixed z-axis direction. We call the body-fixed z-axis after the second rotation the z '-axis; see Figure 3.4b. The third and final rotation is about the z '-axis by angle y. The body y-axis now becomes the y "-axis of Figure 3.4c. In terms of 3 X 3 orthogonal matrices the product of the three operations can be written as (3.3.15) A cautionary remark is in order here. Most textbooks in classical mechanics prefer to perform the second rotation (the middle rotation) about the body x-axis rather than about the body y-axis (see, for example, Goldstein 1980). This convention is to be avoided in quantum mechanics for a reason that will become apparent in a moment. In (3.3.15) there appear R and R z'• which are matrices for rotations about body axes. This approach to Euler rotations is rather inconvenient in v
'
172
Theory of Angular Momentum
z
y
X
z
'
(a)
z
y'
z'
y" y' (b)
y FIGURE 3.4.
Euler rotations.
(c)
y
y'
quantum mechanics because we earlier obtained simple expressions for the space-fixed (unprimed) axis components of the S operator, but not for the body-axis components. It is therefore desirable to express the body-axis rotations we considered in terms of space-fixed axis rotations. Fortunately there is a very simple relation, namely, (3 .3.16) The meaning of the right-hand side is as follows. First, bring the body y-axis of Figure 3.4a (that is, the y'-axis) back to the original fixed-space y-direction by rotating clockwise (as seen from the positive z-side) about the z-axis by angle a ; then rotate about the y-axis by angle /3. Finally, return the body y-axis to the direction of the y '-axis by rotating about the fixed-space z-axis (not about the z '-axis!) by angle a. Equ!ltion (3.3.16) tells us that the net effect of these rotations is a single rotation about the y '-axis by angle {3.
173
3.3 S0(3), SU(2), and Euler Rotations
To prove this assertion, let us look more closely at the effect of both sides of (3.3.16) on the circular disc of Figure 3.4a. Clearly, the orientation of the body y-axis is unchanged in both cases, namely, in the y '-direction. Furthermore, the orientation of the final body z-axis is the same whether we apply R v.( /3 ) or In both cases the final body z-axis makes a polar angle with the fixed z-axis (the same as the initial z-axis), and its azimuthal angle, as measured in the fixed-coordinate system, is just In other words, the final body z-axis is the same as the z '-axis of Figure 3.4b. Similarly, we can prove
Rz< a)R y( f3 )R; 1 ( a). f3
a.
(3.3.17) Using (3.3.16) and (3.3.17), we can now rewrite (3.3.15). We obtain
R z-{ y) R v.( f3 ) R z (a) = R v.( f3) Rz ( y) R ;.t( f3) R v.( f3) R z( a) R z(a) R v(f3) R ; 1 (a) R 2(y) R 2(a) (3.3.18) = R z( a) R v(f3) R z( Y ) , where in the final step we used the fact that Rz( Y ) and R2( a ) commute. To =
summanze,
(3 .3 . 19)
where all three matrices on the right-hand side refer to fixed-axis rotations. Now let us apply this set of operations to spin t systems in quantum mechanics. Corresponding to the product of orthogonal matrices in (3.3.19) there exists a product of rotation operators in the ket space of the spin t system under consideration:
2 2
The X matrix representation of this product is - a - a exp exp exp
(
� 3a ) ( i;2 f3 ) ( � 3y ) 2 ) )( e-' r/2 0 ) ( e -0ia/2 ew/0 2 )( cossin(( f3/3j2/2)) - sincos( (f3/ f3j2 ) 0 e 'Y/2 = ( ee' ,( a-+ y/J/2cos(f3/2 ) - e I(a-/YJ/2 sin( f3/2) ) ' (3.3.21) ( a r l 2 sin( /3/2) e ( a + r l 2cos( /3/2) where (3.2.44) was used. This matrix is clearly of the unitary unimodular form. Conversely, the most general 2 X 2 unitary unimodular matrix can be =
-
-
'
written in this Euler angle form. Notice that the matrix elements of the second (middle) rotation exp( - iav.P/2) are purely real. This would not have been the case had we chosen to rotate about the x-axis rather than the y-axis, as done in most
1 74
Theory of Angular Momentum
textbooks in classical mechanics. In quantum mechanics it pays to stick to our convention because we prefer the matrix elements of the second rotation, which is the only rotation matrix containing off-diagonal elements, to be purely real. * The 2 X 2 matrix in (3.3.21) is called the j = t irreducible representa tion of the rotation operator � ( a , {3, ) Its matrix elements are denoted by ��.��,2 l ( a , {3, y ). In terms of the angular-momentum operators we have
y
�m(1/2) 'm ( a, {3 , )
_ -
j\ J -
x
_
exp
y.
.!.
'I ( - iJza )
m exp 2'
li
( - iJY/3 ) ( - iJ.y ) I;� ft
exp
h
•
1
= 2, m
).
(3.3 .22)
In Section 3.5 we will extensively study higher )-analogues of (3.3.21).
3.4. DENSITY OPERATORS AND PURE VERSUS MIXED ENSEMBLES Polarized Versus Unpolarized Beams The formalism of quantum mechanics developed so far makes statis tical predictions on an ensemble, that is, a collection, of identically prepared physical systems. More precisely, in such an ensemble all members are supposed to be characterized by the same state ket Ia). A good example of this is a beam of silver atoms corning out of an SG filtering apparatus. Every atom in the beam has its spin pointing in the same direction, namely, the direction determined by the inhomogeneity of the magnetic field of the filtering apparatus. We have not yet discussed how to describe quantum mechanically an ensemble of physical systems for which some, say 60%, are characterized by j ot.), and the remaining 40% are characterized by some other ket 1 /3 ) . To illustrate vividly the incompleteness of the formalism developed so far, let us consider silver atoms coming directly out of a hot oven, yet to be subjected to a filtering apparatus of the Stern-Gerlach type. On symme try grounds we expect that such atoms have random spin orientations; in other words, there should be no preferred direction associated with such an ensemble of atoms. According to the formalism developed so far, the most general state ket of a spin t system is given by (3.4.1) *This, of course, depends on our convention that the matrix elements o f S�. (or, more generally, J,. ) are taken to be purely imaginary.
3.4. Density Operators and Pure Versus Mixed Ensembles
175
Is this equation capable of describing a collection of atoms with random spin orientations? The answer is clearly no; (3.4.1) characterizes a state ket whose spin is pointing in some definite direction, namely, in the direction of il, whose polar and azimuthal angles, {3 and a, respectively, are obtained by solving c+ cos( /3/2) (3.4.2) ; e 'asin( f3j2) see (3.2.52). To cope with a situation of this kind we introduce the concept of fractional population, or probability weight. An ensemble of silver atoms with completely random spin orientation can be viewed as a collection of silver atoms in which 50% of the members of the ensemble are characterized by I + ) and the remaining 50% by 1 - ) . We specify such an ensemble by ass1 gnmg (3.4.3 ) w + 0.5, W _ 0.5,
-;:- =
=
=
where w + and w are the fractional population for spin-up and -down, respectively. Because there is no preferred direction for such a beam, it is reasonable to expect that this same ensemble can be regarded equally well as a 50-50 mixture of ISx; + ) and ISx ; - ). The mathematical formalism needed to accomplish this will appear shortly. It is very important to note that we are simply introducing here two real numbers w + and w There is no information on the relative phase between the spin-up and the spin-down ket. Quite often we refer to such a situation as an incoherent mixture of spin-up and spin-down states. What we are doing here is to be clearly distinguished from what we did with a coherent linear superposition, for example,
_
_ .
(3 .4.4) where the phase relation between 1 + ) and 1 - ) contains vital information on the spin orientation in the xy-plane, in this case in the positive x-direc tion. In general, we should not confuse w + and with lc+ l 2 and l c - 1 2 . The probability concept associated with w + and w is much closer to that encountered in classical probability theory. The situation encountered in dealing with silver atoms directly from the hot oven may be compared with that of a graduating class in which 50% of the graduating seniors are male, the remaining 50% female. When we pick a student at random, the probabil ity that the particular student is male (or female) is 0.5. Whoever heard of a student referred to as a coherent linear superposition of male and female with a particular phase relation? The beam of silver atoms coming directly out of the oven is an example of a completely random ensemble; the beam is said to be un-
w
_
_
176
Theory of Angular Momentum
polarized because there is no preferred direction for spin orientation. In contrast, the beam that has gone through a selective Stern-Gerlach-type measurement is an example of a pure ensemble; the beam is said to be polarized because all members of the ensemble are characterized by a single common ket that describes a state with spin pointing in some definite direction. To appreciate the difference between a completely random ensem ble and a pure ensemble, let us consider a rotatable SG apparatus where we can vary the direction of the inhomogeneous B just by rotating the appara tus. When a completely unpolarized beam directly out of the oven is subjected to such an apparatus, we always obtain two emerging beams of equal intensity no matter what the orientation of the apparatus may be. In contrast, if a polarized beam is subjected to such an apparatus, the relative intensities of the two emerging beams vary as the apparatus is rotated. For some particular orientation the ratio of the intensities actually becomes one to zero. In fact, the formalism we developed in Chapter 1 tells us that the relative intensities are simply cos2(/J/2) and sin2( ,8/2), where .B is the angle between the spin direction of the atoms and the direction of the inhomoge neous magnetic field in the SG apparatus. A complete random ensemble and a pure ensemble can be regarded as the extremes of what is known as a mixed ensemble. In a mixed ensemble a certain fraction-for example, 70%-of the members are characterized by a state ket I a) , the remaining 30% by I .B ) . In such a case the beam is said to be partially polarized. Here Ia) and need not even be orthogonal; we can, for example, have 70% with spin in the positive x-direction and 30% with spin in the negative z-direction. *
1.8)
Ensemble Averages and Density Operator We now present the density operator formalism, pioneered by J. von Neumann in 1927, that quantitatively describes physical situations with mixed as well as pure ensembles. Our general discussion here is not restricted to spin 1 systems, but for illustrative purposes we return re peatedly to spin ! systems. A pure ensemble by definition is a collection of physical systems such that every member is characterized by the same ket I a). In contrast, in a mixed ensemble, a fraction of the members with relative population w1 are characterized by la<1l ) , some other fraction with relative population w2 , by la(2)), and so on. Roughly speaking, a mixed ensemble can be viewed as a mixture of pure ensembles, just as the name suggests. The fractional *In the literature what we call pure and mixed ensembles are often referred to as pure and mixed states. In this book, however, we use state to mean a physical system described by a definite state ket 1 <> ) .
3.4. Density Operators and Pure Versus Mixed Ensembles
177
populations are constrained to satisfy the normalization condition
[w, = l . la(ll) and la(2J )
(3.4.5)
As we mentioned previously, need not be orthogonal. Furthermore, the number of terms in the i sum of (3.4.5) need not coincide with the dimensionality N of the ket space; it can easily exceed N. For example, for spin ! systems with N = 2, we may consider 40% with spin in the positive z-direction, 30% with spin in the positive x-direction, and the remaining 30% with spin in the negative y-direction. Suppose we make a measurement on a mixed ensemble of some observable A . We may ask what is the average measured value of A when a large number of measurements are carried out. The answer is given by the ensemble average of A , which is defined by [ A ] = [ w/a<'JIA ia(i l )
I a ')
= L L w, l( a 'la< ' J ) I 2a ', eigenket of A . Recall that (a<' J 1 A 1 a< ) '
a
( 3 .4.6)
'
where is an l) is the usual quantum mechanical expectation value of A taken with respect to state la< ' J). Equation (3.4.6) tells us that these expectation values must further be weighted by the corresponding fractional populations w,. Notice how prob abilistic concepts enter twice; first in I( a 'laUJ) 1 2 for the quantum-mechani cal probability for state laUJ) to be found in an A eigenstate Ia') ; second, in the probability factor w; for finding in the ensemble a quantum-mechani cal state characterized by laU l ) . We can now rewrite ensemble average (3.4.6) using a more general basis, { I b ' )}: *
[ A ] = [ w; [ [(a
= L L ( L w, ( b"laUl)( a<01b ') )< b'IAib"). h' b"
1
(3 .4.7)
i
The number of terms in the sum of the b' ( b") is just the dimensionality of the ket space, while the number of terms in the sum of the depends on how the mixed ensemble is viewed as a mixture of pure ensembles. Notice that in this form the basic property of the ensemble which does not depend on the particular observable A is factored out. This motivates us to define the density operator p as follows: p = L w,laUl)( a
*
Quite often in the literature the ensemble average is also called the expectation value. However, in this book, the term expectation value is reserved for the average measured value when measurements are carried on a pure ensemble.
17R
Theory of Angular Momentum
The elements of the corresponding density matrix have the following form: (3.4.9) The density operator contains all the physically significant information we can possibly obtain about the ensemble in question. Returning to (3.4.7), we see that the ensemble average can be written as [ A 1 = L L ( b ''lpl b ')( b 'I A i b" )
h' h" (3 .4.10) = tr( pA ) . Because the trace is independent of representations, tr(pA) can be evaluated using any convenient basis. As a result, (3.4.10) is an extremely powerful relation. There are two properties of the density operator worth recording. First, the density operator is Hermitian, as is evident from (3.4.8). Second, the density operator satisfies the normalization condition tr( p ) = L :[ w, ( b 'la(il) (a< • ll b') i h' = 1. (3.4.11) Because of the Hermiticity and the normalization condition, for spin 1 systems with dimensionality 2 the density operator, or the corresponding density matrix, is characterized by three independent real parameters. Four real numbers characterize a 2 X 2 Hermitian matrix. However, only three are independent because of the normalization condition. The three numbers needed are [ Sx], [S.J, and [Sz]; the reader may verify that knowledge of these three ensemble averages is sufficient to reconstruct the density oper ator. The manner in which a mixed ensemble is formed can be rather involved. We may mix pure ensembles characterized by all kinds of l a < l ) s with appropriate w, ' s; yet for spin 1 systems three real numbers completely characterize the ensemble in question. This strongly suggests that a mixed ensemble can be decomposed into pure ensembles in many different ways. A problem to illustrate this point appears at the end of this chapter. A pure ensemble is specified by w; = 1 for some la< l ) ) - with i = n , for instance-and w, = 0 for all other conceivable state kets, so the corre sponding density operator is written as •
'
(3.4.12) with no summation. Clearly, the density operator for a pure ensemble is idempotent, that is, (3.4.13)
3.4. Density Operators and Pure Versus Mixed Ensembles
179
or, equivalently,
(3 .4.14) p ( p - 1 ) = 0. Thus, for a pure ensemble only, we have ( 3 .4.15) tr( p2 ) = 1 . in addition to (3.4.1 1). The eigenvalues of the density operator for a pure ensemble are zero or one, as can be seen by inserting a complete set of base kets that diagonalize the Hermitian operator p between p and (p - 1) of (3.4.1 4). When diagonalized, the density matrix for a pure ensemble must therefore look like 0
0
0 0
p "
1
0
(diagonal form) 0
0
0
0
(3.4.16) It can be shown that tr(p2 ) is maximal when the ensemble is pure; for a mixed ensemble tr(p2) is a positive number less than one. Given a density operator, let us see how we can construct the corresponding density matrix in some specified basis. To this end we first recall that (3.4 .17) Ia)( al = L L i b')( b 'la)( alb")( b''l b' b"
this shows that we can form the square matrix corresponding to la('l ) (a(ill by combining, in the sense of outer product, the column matrix formed by ( b'laUl ) with the row matrix formed by (aUllb"), which, of course, is equal to ( b"la(il )*. The final step is to sum such square matrices with weighting factors wi , as indicated in (3.4.8). The final form agrees with (3.4.9), as expected. I t is instructive to study several examples, all referring to spin -l: systems.
Example 1. A completely polarized beam with Sz + . P = I + )( + I =
=
(� �)·
( � )(l,o)
(3.4.18)
l RO
Theory of Angular Momentum
Sx ± . p = I Sx ; ± )( Sx ; ± l= ( � ) (1+) ± 1 - )) ( � ) ((+ 1 ± (- 1) 1 1 + (3.4.19) - 21 - -21
Example 2. A completely polarized beam with
+ -2 2 The ensembles of Examples 1 and 2 are both pure.
Example 3. An unpolarized beam. This can be regarded as an incoherent mixture of a spin-up ensemble and a spin-down ensemble with equal weights (50% each):
p= 0:)1+)(+ 1+0)1 - )( - 1 ( �l 0! ) ,
(3 .4.20)
=
which is just the identity matrix divided by 2. As we remarked earlier, the same ensemble can also be regarded as an incoherent mixture of an ensemble and an - ensemble with equal weights. It is gratifying that our formalism automatically satisfies the expectation
Sx +
Sx ( ! 0 ) = 1( ! ! ) + l( ! 1 ) ( 3 .4.21) 0 where we see from Example 2 that the two terms on the right-hand side are the density matrices for pure ensemble with Sx + and Sx - . Because p in this case is just the identity operator divided by 2 (the dimensionality), we have (3.4.22) tr( pSJ = tr( p s ) = tr( p SJ = 0, where we used the fact that Sk is traceless. Thus for the ensemble average of S we have l 2
2
l 2
l 2
2
-
_ l 2
1
,
2
.
(3 .4.23)
[S] = 0.
This is reasonable because there should be no preferred spin direction in a completely random ensemble of spin ! systems.
Example 4. As an example of a partially polarized beam, let us consider a and the other with 75-25 mixture of two pure ensembles, one with
Sx + :
w(S, +) = 0.75, w(Sx + )
Sz +
=
0.25 .
(3.4.24)
1 81
3.4. Density Operators and Pure Versus Mixed Ensembles
The corresponding p can be represented by p
.
J
( 1 0)+
=· 0
0
= ( i ;)
from which follows
l 4
(
t
l 2
(3 .4.25)
.
(3.4.26)
We leave as an exercise for the reader the task of showing that this ensemble can be decomposed in ways other than (3.4.24).
Time Evolution of Ensembles How does the density operator p change as a function of time? Let us suppose that at some time t 0 the density operator is given by (3.4.27) p ( t 0 ) = L w,la
op
a1
= " w ( Hia<'l , t o ·, t ) (a
I
(3 .4.29) This looks like the Heisenberg equation of motion except that the sign is wrong! This is not disturbing because p is not a dynamic observable in the Heisenberg picture. On the contrary, p is built up of Schrodinger-picture state kets and state bras which evolve in time according to the Schrodinger equation. It is amusing that (3.4.29) can be regarded as the quantum-mechani cal analogue of Liouville's theorem in classical statistical mechanics, = - [p, H].
(} Pciass;caJ
at
=
-
[ Pc1assical ' H ] classical'
(3.4.30)
where Pciasskal stands for the density of representative points in phase space.* Thus the name density operator for the p appearing in (3.4.29) is * Remember, a pure classical state is one represented by a single moving point in phase space
( q1 , . . . , q1, p 1 , . . . , p1 ) at each instant of time. A classical statistical state, on the other hand, is described by our nonnegative density function Pc�a,ical ( q1 , . . . , q1, p 1 , . . . , p1, t) such that the probability that a system is found in the interval dq1 , . . . , dp1 at time t is Pd»>ical dq1 , . . . , dp1.
Theory of Angular Momentum
1 82
indeed appropriate. The classical analogue of (3.4.10) for the ensemble average of some observable A is given by Pc!assical A ( q , P ) d fq , p f Aave,age = ------!Pc1assical d f
(3.4.31)
q. P
where d rq . p stands for a volume element in phase space.
Continuum Generalizations So far we have considered density operators in ket space where the base kets are labeled by the discrete-eigenvalues of some observable. The concept of density matrix can be generalized to cases where the base kets used are labeled by continuous eigenvalues. In particular, let us consider the ket space spanned by the position eigenkets lx '). The analogue of (3.4.10) is given by (3.4.32) The density matrix here is actually a function of x ' and x ", namely,
{
)
(x " l p lx ') = (x " L w, la(' 1 ) ( aUll lx') '
(3.4.33)
laUl).
where 1/J i is the wave function corresponding to the state ket Notice that the diagonal element (that is, x' = x") of this is just the weighted sum of the probability densities. Once again, the term density matrix is indeed appropriate. In continuum cases, too, it is important to keep in mind that the same mixed ensemble can be decomposed in different ways into pure ensembles. For instance, it is possible to regard a " realistic" beam of particles either as a mixture of plane-wave states (monoenergetic free-par ticle states) or as a mixture of wave-packet states.
Quantum Statistical Mechanics We conclude this section with a brief discussion on the connection between the density operator formalism and statistical mechanics. Let us first record some properties of completely random and of pure ensembles.
1X3
3.4. Density Operators and Pure Versus Mixed Ensembles
The density matrix of a completely random ensemble looks like 1 1 p =. N
1
0
1
(3.4.34)
1
0
1
1
in any representation [compare Example 3 with (3.4.20)]. This follows from the fact that all states corresponding to the base kets with respect to which the density matrix is written are equally populated. In contrast, in the basis where p is diagonalized, we have (3.4.16) for the matrix representation of the density operator for a pure ensemble. The two diagonal matrices (3.4.34) and (3.4.16), both satisfying the normalization requirement (3.4.11), cannot look more different. It would be desirable if we could somehow construct a quantity that characterizes this dramatic difference. Thus we define a quantity called a by a=
- tr( p In p ) .
(3 .4.35)
The logarithm of the operator p may appear rather formidable, but the meaning of (3.4.35) is quite unambiguous if we use the basis in which p is diagonal: Jn p(diag) " P(diagl a = _ L., (3 .4.36) kk kk • k
Because each element p�d:_agJ is a real number between 0 and 1 , a is necessarily positive semidefinite. For a completely random ensemble (3.4.34), we have N 1 1 (3.4.37) a = - L - In - = In N. k �I N
( ) N
In contrast, for a pure ensemble (3.4.16) we have where we have used
P(dia kk g) 0 =
a=O or
(3.4.38)
In P(kk diag) 0 =
(3.4.39)
for each term in (3.4.36). We now argue that physically a can be regarded as a quantitative measure of disorder. A pure ensemble is an ensemble with a maximum amount of order because all members are characterized by the same quantum-mechanical state ket; it may be likened to marching soldiers in a
Theory of Angular Momentum
184
well-regimented army. According to (3.4.38), 11 vanishes for such an ensem ble. On the other extreme, a completely random ensemble, in which all quantum-mechanical states are equally likely, may be likened to drunken soldiers wandering around in random directions. According to (3.4.37), 11 is large; indeed, we will show later that In N is the maximum possible value for subject to the normalization condition
o
L Pkk = l . k
(3.4.40)
o
In thermodynamics we learn that a quantity called entropy measures disorder. It turns out that our is related to the entropy per constituent member, denoted by S, of the ensemble via
S = k o,
(3.4.41 )
where k is a universal constant identifiable with the Boltzmann constant. In fact, (3.4.41) may be taken as the definition of entropy in quantum statistical mechanics. We now show how the density operator p can be obtained for an ensemble in thermal equilibrium. The basic assumption we make is that nature tends to maximize subject to the constraint that the ensemble average of the Hamiltonian has a certain prescribed value. To justify this assumption would involve us in a delicate discussion of how equilibrium is established as a result of interactions with the environment, which is beyond the scope of this book. In any case, once thermal equilibrium is established, we expect
o
ap
at
= O·
(3.4.42)
Because of (3.4.29), this means that p and H can be simultaneously diagonalized. So the kets used in writing (3.4.36) may be taken to be energy eigenkets. With this choice p kk stands for the fractional population for an energy eigenstate with energy eigenvalue Ek . Let us maximize 11 by requiring that
611 = 0.
(3 .4.43)
However, we must take into account the constraint that the ensemble average of H has a certain prescribed value. In the language of statistical mechanics, [H] is identified with the internal energy per constituent denoted by U:
[H] = tr( pH )
(3 .4.44) In addition, we should not forget the normalization constraint (3.4.40). So our basic task is to require (3.4.43) subject to the constraints =
U.
8 [ H ] = L6Pk J! Ek = 0 k
(3.4.45a)
3.4. Density Operators and Pure Versus Mixed Ensembles
185
and
8 (tr p ) = [ 8p kk = . k
(3 .4.45b)
O
We can most readily accomplish this by using Lagrange multipliers. We obtain (3 .4.46) L llPkk [(in Pkk + 1 ) + f3Ek + = 0, k which for an arbitrary variation is possibly only if (3.4.47) Pkk = exp( - /3Ek - y - 1). The constant y can be eliminated using the normalization condition (3.4.40), and our final result is exp( - f3Ek ) (3 .4.48) Pkk = N L exp( - /3E1)
y]
which directly gives the fractional population for an energy eigenstate with eigenvalue Ek . It is to be understood throughout that the sum is over distinct energy eigenstates; if there is degeneracy we must sum over states with the same energy eigenvalue. The density matrix element (3.4.48) is appropriate for what is known in statistical mechanics as a canonical ensemble. Had we attempted to maximize o without the internal-energy constraint (3.4.45a), we would have obtained instead 1 (3 .4.49) Pkk = N , (independent of k ) which is the density matrix element appropriate for a completely random ensemble. Comparing (3.4.48) with (3.4.49), we infer that a completely random ensemble can be regarded as the f3 0 limit (physically the high-temperature limit) of a canonical ensemble. We recognize the denominator of (3.4.48) as the partition function ,
->
N
Z = [ exp( - /3Ek ) k in statistical mechanics. It can also be written as
(3.4.50)
(3.4.51 ) Knowing Pkk given in the energy basis, we can write the density operator as (3 .4.52) This is the most basic equation from which everything follows. We can
Theory of Angular Momentum
1 86
immediately evaluate the ensemble average of any observable A : [ A ) tr( e - il HA ) = z
[*
( A ) kexp( - PEk )
]
(3.4.53)
In particular, for the internal energy per constituent we obtain U
[ � =
Ekexp( - PEk )
L:; exp( - PEJ N
k
=-
a p (ln Z ) a
]
,
(3.4.54)
a formula well known to every student of statistical mechanics. The parameter P is related to the temperature T as follows: 1 P= ( 3 .4.55)
kT
where k is the Boltzmann constant. It is instructive to convince ourselves of this identification by comparing the ensemble average [ H] of simple harmonic oscillators with the kT expected for the internal energy in the classical limit, which is left as an exercise. We have already commented that in the high-temperature limit, a canonical ensemble becomes a completely random ensemble in which all energy eigenstates are equally populated. In the opposite low-temperature limit ( /$ -+ oo ), (3.4.48) tells us that a canoni cal ensemble becomes a pure ensemble where only the ground state is populated. As a simple illustrative example, consider a canonical ensemble made up of spin � systems, each with a magnetic moment en j2m ec subjected to a uniform magnetic field in the z-direction. The Hamiltonian relevant to this problem has already been given [see (3.2.1 6)]. Because H and S= commute, the density matrix for this canonical ensemble is diagonal in the Sz basis. Thus
( e- ilhw/2
) e ilhw/2
(3 .4.56)
where the partition function is just z = e -il h w /2 + e ilh w/2 .
(3 .4.57)
p =.
0
0
z
187
3.5. Eigenvalues and Eigenstates of Angular Momentum
From this we compute [Sx]
=
[Sy]
= 0,
[Sz] =
-
(�) ( � ) tanh
13 w .
(3.4.58)
The ensemble average of the magnetic moment component is just elmec times [Szl · The paramagnetic susceptibility x may be computed from
( )
...!.__ [Sz]
= xB . mec In this way we arrive at Brillouin's formula for x : l e l li tanh J31i.w . x = 2mecB 2
( ) ( )
(3.4.59)
(3.4.60)
3.5. EIGENVALUES AND EIGENSTATES OF
ANGULAR MOMENTUM Up to now our discussion of angular momentum has been confined exclu sively to spin ! systems with dimensionality N = 2. In this and subsequent sections we study more-general angular momentum states. To this end we first work out the eigenvalues and eigenkets of J2 and Jz and derive the expressions for matrix elements of angular momentum operators, first obtained in a 1 926 paper by M. Born, W. Heisenberg, and P. Jordan.
Commutation Relations and the Ladder Operators Everything we will do follows from the angular-momentum commu tation relations (3.1.20), where we may recall that l; is defined as the generator of infinitesimal rotation. The first important property we derive from the basic commutation relations is the existence of a new operator J2, defined by J Z = JX JX + Jyjy + JZ JZ , (3.5.1) that commutes with every one of Jk: [J2, Jk] = 0, ( k = 1 , 2 , 3) .
(3.5.2)
To prove this let us look at the k = 3 case:
[ JJx + JVJV + JJz' Jz] = Jx( Jx ' Jz] + [ Jx' Jz] Jx + Jy [ JV' Jz] + [ Jy ' Jz ] Jy = Jx( - inJy} + ( - inJ ) Jx + Jy (iliJJ + (inJJ Jv Y
= 0. (3.5 .3) The proofs for the cases where k = 1 and 2 follow by cyclic permutation (1 -+ 2 -+ 3 -+ 1 ) of the indices. Because Jx, JY , and Jz do not commute with
188
Theory of Angular Momentum
each other, we can choose only one of them to be the observable to be diagonalized simultaneously with 12• By convention we choose J= for this purpose. We now look for the simultaneous eigenkets of J 2 and J=- We denote the eigenvalues of J 2 and J= by a and b , respectively:
J2 la, b) = ala, b)
(3.5 .4a) (3.5 .4b)
lzla, b) = bla, b ) . To determine the allowed values for a and b, it is convenient to work with the non-Hermitian operators
( 3 .5 .5) called the ladder operators, rather than with Jx and Jv. They satisfy the commutation relations (3.5 .6a) and (3.5 .6b) [ Jz , J ± 1 = ± nJ ± , which can easily be obtained from (3.1.20). Note also that
[J2, 1 ± ] = 0,
(3.5 .7)
which is an obvious consequence of (3.5.2). What is the physical meaning of J ±? To answer this we examine how Jz acts on J ± Ia , b):
lz( J ± Ia, b)) = ( [ lz , 1 ± 1 + J ± Jz ) I a, b) = ( b ± n )( J ± Ia , b) )
(3.5 .8) where we have used (3.5.6b). In other words, if we apply J + ( J ) to a Jz eigenket, the resulting ket is still a Jz eigenket except that its eigenvalue is now increased (decreased) by one unit of n. So now we see why J ± ' which step one step up (down) on the " ladder" of J= eigenvalues, are known as the ladder operators. We now digress to recall that the commutation relations in (3.5.6b) are reminiscent of some commutation relations we encountered in the earlier chapters. In discussing the translation operator s-(1) we had (3.5 .9) [ x,, s-(1) l t,s-(1) ; _
=
also, in discussing the simple harmonic oscillator we had
[ N, a 1 = - a .
(3 .5.10) We see that both (3.5.9) and (3.5.10) have a structure similar to (3.5.6b). The physical interpretation of the translation operator is that it changes the eigenvalue of the position operator x by I in much the same way as the
189
3 . 5 . Eigenvalues and Eigenstates of Angular Momentum
ladder operator J + changes the eigenvalue of Jz by one unit of h. Likewise, the oscillator creation operator a t increases the eigenvalue of the number operator N by unity. Even though J ± changes the eigenvalue of Jz by one unit of h, it does not change the eigenvalue of J 2 :
J2 ( J
±
I a , b ) ) = J ± J 2 I a , b)
(3.5.1 1 ) where we have used (3.5.7). To summarize, J + I a, b ) are simultaneous eigenkets of and Jz with eigenvalues a and b f h. We may write J ± l a , b) = c ± l a , b ± h ), (3.5. 12) where the proportionality constant c ± will be determined later from the normalization requirement of the angular-momentum eigenkets.
J2
Eigenvalues of
J
2
and -'z
We now have the machinery needed to construct angular-momentum eigenkets and to study their eigenvalue spectrum. Suppose we apply J + successively, say n times, to a simultaneous eigenket of and ' We then obtain another eigenket of J 2 and Jz with the Jz eigenvalue increased by nli, while its eigenvalue is unchanged. However, this process cannot go on indefinitely. It turns out that there exists an upper limit to b (the Jz eigenvalue) for a given a (the eigenvalue): a � b2. (3.5.13) To prove this assertion we first note that J} = H ' + ' - + ' - ' + )
J2
J2
=-
J2
J2 -
= H '+ 1t + 1t '+ ) .
(3.5.14)
Now J + J t and Jt J + must have nonnegative expectation values because DC
J� l a ,b) - (a,biJ + ,
DC
J+ la,b) - (a,b iJ� ;
(3.5. 15)
thus (3.5.16) which, in turn, implies (3.5.13). It therefore follows that there must be a bmax such that (3.5 .17) Stated another way, the eigenvalue of b cannot be increased beyond bmax · Now (3.5.17) also implies (3.5 .18) j _ J+ I a , bmax ) = O.
1 90
Theory of Angular Momentum
But 1 _ 1 + = 1} + 1} - i{ JJx - 1JJ = J 2 - 1} - n1z .
So
(3.5 .19)
(3.5 .20) Because J a, bmax ) itself is not a null ket, this relationship is possible only if (3 .5 .21) a - b�ax - bmax n = 0 or (3.5 .22) In a similar manner we argue from (3.5.13) that there must also exist a bmin such that ( 3 .5 .23) By writing 1 J as (3.5 .24) J + J - = J 2 - J/ + nJZ in analogy with (3.5.19), we conclude that (3.5.25) a = bmin ( bmin - n ) . By comparing (3.5.22) with (3.5.25) we infer that (3.5 .26) bmax = - brnin> with bmax positive, and that the allowed values of b lie within ( 3 .5 .27) - bmax � b � bmax · Clearly, we must be able to reach Ja, bmax) by applying J successively to J a , bmin ) a finite number of times. We must therefore have (3.5 .28) bmax = bmin + nn , where n is some integer. As a result, we get nn (3.5 .29) bmax = T · +
_
+
It is more conventional to work with j, defined to be bmaxl n, instead of with bmax so that n . (3.5 .30) =
; 2.
The maximum value of the Jz eigenvalue is jn, where j is either an integer or a half-integer. Equation (3.5.22) implies that the eigenvalue of J 2 is given by (3.5 .31)
3.5. Eigenvalues and Eigenstates of Angular Momentum
191
Let us also define m so that (3.5 .32) If j is an integer, all m values are integers; if j is a half-integer, all m values are half-integers. The allowed m-values for a given j are (3.5 .33) m = - j, - j + 1 , . . . , j - 1 , j .
b = mi L
2j + 1 states Instead of I a , b) it is more convenient to denote a simultaneous eigenket of J 2 and Jz by IJ, m). The basic eigenvalue equations now read (3.5.34a) and (3.5.34b) Jz i J, m ) = mh iJ, m ) , with j either an integer or a half-integer and m given by (3.5.33). It is very important to recall here that we have used only the commutation relations (3.1.20) to obtain these results. The quantization of angular momentum, manifested in (3.5.34), is a direct consequence of the angular-momentum commutation relations, which, in turn, follow from the properties of rota tions together with the definition of Jk as the generator of rotation.
Matrix Elements of Angular-Momentum Operators Let us work out the matrix elements of the various angular-momen tum operators. Assuming IJ, m ) to be normalized, we obviously have from (3.5.34) (3 .5 .35a) and ( J ', m 'I Jz i J, m ) = mh81 •}m·m ·
(3.5 .35b)
To obtain the matrix elements of J ± ' we first consider
( J , m i J ! J I J , m ) = (J, m i (J 2 - J} - hJz } I J, m )
2
+
(3.5 .36) = n [ J(J + 1 ) - m 2 - m ] . Now J + IJ, m ) must be the same as IJ, m + 1 ) (normalized) up to a multiplicative constant [see (3.5.12)]. Thus (3.5 .37) J+ IJ, m ) = c;';., IJ, m + 1 ) . Comparison with (3.5.36) leads to
lc;+;,l 2 = n 2 [ j( J + 1 ) - m ( m + 1)] h2 ( j - m ) ( j + m + 1 ) .
=
(3.5 .38)
Theory of Angular Momentum
192
So we have determined c;",., up to an arbitrary phase factor. It is customary to choose c/,., to be real and positive by convention. So J + IJ, m ) = ,j( J - m ) ( j + m + 1 ) n i J, m + 1 ) .
( 3 .5 .39)
Similarly, we can derive J IJ, m ) = ,j( J + m ) ( J - m + 1) n i J, m - 1 ) .
(3.5.40)
<J', m 'I J ± IJ , m ) = ,j( J -:f. m )( J ± m + 1 ) n 8J'J8,., .. ,., ± 1 ·
(3.5.41)
�
Finally, we determine the matrix elements of J ± to be
Representations of the Rotation Operator
Having obtained the matrix elements of Jz and J ± ' we are now in a position to study the matrix elements of the rotation operator {')( R ). If a rotation R is specified by il and >, we can define its matrix elements by a.,., (j) = · ,.,
( R ) - ( j. , m 'Iexp
( - iJn· ) 1 j,· m ) . il>
(3.5 .42)
These matrix elements are sometimes called Wigner functions after E. P. Wigner, who made pioneering contributions to the group-theoretical proper ties of rotations in quantum mechanics. Notice here that the same )-value appears in the ket and bra of (3.5.42); we need not consider matrix elements of � ( R ) between states with different )-values because they all vanish trivially. This is because {')( R )IJ, m ) is still an eigenket of J 2 with the same eigenvalue )( ) + 1)112: J 2� ( R ) Jj , m ) = � ( R )J 2 1), m )
= J{ J + l) n 2 [ � ( R ) I J, m ) ] ,
(3 .5 .43)
which follows directly from the fact that J 2 commutes with Jk (hence with any function of Jk ). Simply stated, rotations cannot change the )-value, which is an eminently sensible result. Often in the literature the (2) + 1) X (2) + 1 ) matrix formed by �:.;,�( R ) is referred to as the (2 ) + I)-dimensional irreducible representation of the rotation operator �(R). This means that the matrix which corre sponds to an arbitrary rotation operator in ket space not necessarily characterized by a single }-value can, with a suitable choice of basis, be
3.5. Eigenvalues and Eigenstates of Angular Momentum
193
brought to block-diagonal form:
0
0 0 0
0 0
0 0 '
0 0 0
0 '
(3 .5.44)
'
where each shaded square is a (2 } + 1) X (2} + 1) square matrix formed by !!}�� with some definite value of j. Furthermore, each square matrix itself cannot be broken into smaller blocks k
t
..,_ 2j+ 1 --����
2}+1
(3.5.45) with any choice of basis. The rotation matrices characterized by definite j form a group. First, the identity is a member because the rotation matrix corresponding to no rotation ( q, = 0) is the (2} + 1) X (2} + 1) identity matrix. Second, the inverse is also a member; we simply reverse the rotation angle ( q, -> q,) without changing the rotation axis it Third, the product of any two -
Theory of Angular Momentum
1 94
members is also a member; explicitly we have L ��Pm•( R 1 ) ��(� ( R 2 ) = ��!)m ( R1R 2 ) ,
(3.5 .46)
m'
where the product R 1 R 2 represents a single rotation. We also note that the rotation matrix is unitary because the corresponding rotation operator is unitary; explicitly we have ( 3 .5 .47) �m'm ( R - 1 ) = �:.m• ( R ) . To appreciate the physical significance of the rotation matrix let us start with a state represented by IJ, m ) . We now rotate it: I J , m ) --> � ( R ) IJ , m ) . (3.5 .48) Even though this rotation operation does not change j, we generally obtain states with m-values other than the original m. To find the amplitude for being found in I J, m'), we simply expand the rotated state as follows: � ( R ) I J , m ) = L ! J, m ' ) ( j, m 'I � ( R ) I J , m ) m'
= L I J, m ') ��� ( R ) , m'
(3.5 .49)
where, in using the completeness relation, we took advantage of the fact that � ( R ) connects only states with the same j. So the matrix element ���-�.( R ) is simply the amplitude for the rotated state to be found in I J, m') when the original unrotated state is given by I J, m ) . In Section 3.3 we saw how Euler angles may be used to characterize the most general rotation. We now consider the matrix realization of (3.3.20) for an arbitrary j (not necessarily 1): - if,y . - ilJ3 - il2a _ . , exp exp IJ, m) n �m(J'ml ( a , f3 , y ) - ( J , m lexp n n
) ( ) ( ( )
(
- il,./3 . - m m . e l( a + y) m'l e xp (J, = IJ, m ) . n '
)
(3.5 .50)
Notice that the only nontrivial part is the middle rotation about the y-axis, which mixes different m-values. It is convenient to define a new matrix as - ilv /3 d:.f.�. ( /3 ) = < J , m'lexp (3.5.51) I J, m ) . n
(
d Ul( f3 )
)
Finally, let us turn to some examples. The j = 1 case has already been worked out in Section 3.3. See the middle matrix of (3.3.21),
d (l/2)
cos =
(�) (�)
sin
- sin cos
(�) (�)
(3.5 .52)
195
3.6. Orbital Angular Momentum
The next simplest case is j = 1, which we consider in some detail. Clearly, we must first obtain the 3 X 3 matrix representation of 1,. . Because 1,.
= ( J + 2-i 1 _ )
(3.5 .53)
from the defining equation (3.5.5) for 1 ± ' we can use (3.5.41) to obtain m=1 m=O m = -1 m' = 1 - fi i (3.5 .54)
( !202
0 0 - Iii 0 Iii
i
0
)
m ' = 0. m' = - 1
Our next task is to work out the Taylor expansion of exp( - iJJ3/ h ). Unlike the case j = J , [ J,<;� l l ] 2 is independent of 1 and 1? � 1 ) . However, it is easy to work out
( :T : 1,< � 1
=
1,< � 1 )
Consequently, for j = 1 only, it is legitimate to replace 2 J,. . J. . exp --> 1 - h (1 - cos,B ) - 1 h, sm ,B, h as the reader may verify in detail. Explicitly we have
-( iJ..f3 ) ( )
d O l ( ,B ) =
( � )(1 + cos ,B ) - ( � ) sin ,B
(�)
sin f3
( � ) (1 - cos,B )
cos,B
(�)
sin ,B
( ) (�) -(�)
(3 .5 .55)
(3.5 .56)
(1 - cos ,B ) sin ,B
( � ) ( 1 + cos ,B )
( 3.5 .57) Clearly, this method becomes time-consuming for large j. In Section 3.8 we will learn a much easier method for obtaining d�{�,-( f3 ) for any j. 3.6. ORBITAL ANGULAR MOMENTUM
We introduced the concept of angular momentum by defining it to be the generator of an infinitesimal rotation. There is another way to approach the subject of angular momentum when spin-angular momentum is zero or can be ignored. The angular momentum J for a single particle is then the same as orbital angular momentum, which is defined as L = x X p. (3.6.1) In this section we explore the connection between the two approaches.
196
Theory of Angular Momentum
Orbital Angular Momentum as Rotation Generator We first note that the orbital angular-momentum operator defined as (3.6.1) satisfies the angular-momentum commutation relations (3 .6 .2) [ by virtue of the commutation relations among the components of x and p. This can easily be proved as follows:
L; , L1] = ie,1k hLk
[ Lx , L v]
= [ yp, - zpv , zpx - xpzl
= [ yp, , zpxJ+ [zpv , XPzl = ypxfpz , z]+pvx [z , Pz ] = ih (xpv - ypJ = ihL z
(3.6.3)
Next we let
8"' ) Lz = 1 - I· ( fl 8"' ) ( xpv - ypJ (3.6.4) 1 - I· ( fl act on an arbitrary position eigenket lx', y', z ') to examine whether it can be interpreted as the infinitesimal rotation operator about the z-axis by angle 8 cp . Using the fact that momentum is the generator of translation, we obtain [see (1.6.32)] [ 1 - i ( 8: ) Lz] lx ', y ', z ') = [1 - i ( �) ( 8cpx ') + i ( �x ) ( 8cpy ')] l x ', y ', z ') (3.6 .5) = lx' - y '8 cp, y' + x'8 cp, z'). This is precisely what we expect if Lz generates an infinitesimal rotation
about the z-axis. So we have demonstrated that if p generates translation, then L generates rotation. Suppose the wave function for an arbitrary physical state of a spinless particle is given by After an infinitesimal rotation about the z-axis is performed, the wave function for the rotated state is
(x', y', z'i a).
(x', y ', z '1[1 - i( 8: ) Lz ] ia) = (x'+ y'8cp , y '- x'8 cp, z'la).
(3 .6.6)
It is actually more transparent to change the coordinate basis
(x', y', z 'ia) --> (r , O, cf>ia).
(3.6.7)
For the rotated state we have, according to (3.6.6),
(r, 0 , >1[1 - i( 8: ) Lz ] Ia) = (r, O, cp - 8cf> i a ) = (r, O , cf>ia) - 8> aacp (r, O,cf>i a ).
( 3 .6 .8)
197
3.6. Orbital Angular Momentum
Because r, 0,
( >I is an arbitrary position eigenket, we can identify (x'I Lz l a ) = - in aa> (x'l a) ,
(3.6.9)
which is a well-known result from wave mechanics. Even though this relation can also be obtained just as easily using the position representation of the momentum operator, the derivation given here emphasizes the role of as the generator of rotation. We next consider a rotation about the x-axis by angle 8
L=
' [ ( : ) Lx] I a) =
( X ', Y , Z 'I 1 - i {j x
By expressing x ', y ', and
Likewise,
z
'
( X ' , Y ' + Z '{j
'{j
·
(3.6 .10)
in spherical coordinates, we can show that
(x'I Lx l a) = - in ( - sin > :o - cot O cos
(3.6.11)
(3.6.12)
Using (3.6.11) and (3.6.12), for the ladder operator L ± defined as in (3.5.5), we have (3.6.13) Finally, it is possible to write
(x'IL2 1 a) using
L2 = L; + ( cJ: ) ( L + L + L L + ) ,
(3 .6.14)
(3.6.9), and (3.6.13), as follows:
a a (x'IL2 1 a ) n 2 [+ £ + _l_ o ( sin O o ) ] <x'l a ) . (3.6.15) a>2 O a a Apart from ljr2, we recognize the differential operator that appears here to be just the angular part of the Laplacian in spherical coordinates. = -
Sill 0
Sill
It is instructive to establish this connection between the L2 operator and the angular part of the Laplacian in another way by looking directly at the kinetic-energy operator. We first record an important operator identity,
where
x 2 is understood to be the operator x x, •
(3.6.16) just as p2 stands for the
198
Theory of Angular Momentum
operator p•p. The proof of this is straightforward:
L2 = ijlmk L Et}kXtP/01mkX/Pm = L
ij/m
= L =
ijlm
( 8,,8J m - {jim8jl)x,pJ X IPm [ {jil{jjmx ( x,pj - ih8;1 ) Pm - 8,m8;,x, P; ( PmXI + ih 8,m ) l i
x2p2 - ihx · p -
2: 8,m8J, [ x,pm (x,pj - ih8},) + ih8,mx,pJ
ij/m
(3.6.17) (x'l and Ia), first note that (x'lx ·pla) = x'· ( - ihV' '(x'l a ) ) . a = - !hr (x'la). (3.6.18) ar
Before taking the preceding expression between
Likewise,
(3.6.19) Thus
(x'IL21a) r2(x'lp21a) + h 2 ( r 2 :r22 (x'la) + 2r : (x'l a ) ) . (3.6 .20) In terms of the kinetic energy p2j2m, we have 2� (x'lp21a) ( ;: ) '2(x'l a ) ( .!!:_2m _ ) ( �ar 2 (x'l a) + �r �ar (x'la) - -1h 2r-2 (x'IL2Ia) ) . (3.6.21) The first two terms in the last line are just the radial part of the Laplacian acting on (x'la). The last term must then be the angular part of the Laplacian acting on (x'l a ), in complete agreement with (3.6.15). ,
=
= -
=
V'
-
Spherical Harmonics Consider a spinless particle subjected to a spherical symmetrical potential. The wave equation is known to be separable in spherical coordi-
199
3.6. Orbital Angular Momentum
nates and the energy eigenfunctions can be written as
(3.6.22) (x'ln, /, m ) = R n1 ( r )Yt( 8, 1/l ) , where the position vector x' is specified by the spherical coordinates r, 8, and <jl , and n stands for some quantum number other than I and m, for
example, the radial quantum number for bound-state problems or the energy for a free-particle spherical wave. As will be made clearer in Section 3.10, this form can be regarded as a direct consequence of the rotational invariance of the problem. When the Hamiltonian is spherically symmetri cal, H commutes with and L2 , and the energy eigenkets are expected to be eigenkets of L2 and also. Because L k with k = 2, 3 satisfy the angular-momentum commutation relations, the eigenvalues of Li and = and are expected to be Because the angular dependence is common to all problems with spherical symmetry, we can isolate it and consider (3.6 .23) where we have defined a direction eigenket I n) . From this point of view, Y/ (e, ) is the amplitude for a state characterized by to be found in the direction ii specified by e and . Suppose we have relations involving orbital angular-momentum ei genkets. We can immediately write the corresponding relations involving the spherical harmonics. For example, take the eigenvalue equation (3 .6.24) =
Lz
Lz /( / + 1 )n2,
1, Lz mn [- In,( - I + 1)n, ... , (l - 1)n, In].
/, m
Lzll , m) mnll, m).
Multiplying (ill on the left and using (3.6.9), we obtain
- in :
(3.6 .25)
- in a: Yim ( 8, 1/l) = mn Yt( 8, 1/l ) ,
(3 .6.26)
We recognize this equation to be
which implies that the !-dependence Yim ( 8, <jl) must behave like e• m
[
we have [see (3.6.15)] 1 sin O
a ( sm. O a ) + 1 a2z + / ( / + l) Yt = 0, ao ao sin28 a
(3.6.28)
which is simply the partial differential equation satisfied by Y1m itself. The orthogonality relation (3.6.29)
Theory of Angular Momentum
200
12"d4J! d(cos 8 ) Y/!' ( 8 , 4J ) Yt( 8 , 4J) = 8w8mm'•
leads to
,.
1 -1
0
(3 .6.30)
where we have used the completeness relation for the direction eigenkets, (3.6.31) To obtain the have
�m themselves, we may start with the m = I case. We L+ /I , I) = 0 ,
(3 .6.32)
which, because of (3.6.13), leads to
..i.. ]
Remembering that the q>-dependence must behave like show that this partial differential equation is satisfied by where
(3 .6.33)
·
e;1"', we can easily
( n/1, /) = Y/( 8, .p) = c1 e 1"' sin18 ,
(3.6.34)
;
c1 is the normalization constant determined from (3.6.30) to be* [ ( -1) I [(21 +41)(2/)!] (3 .6.35) cl = 21/ ! ] '1T Starting with (3.6.34) we can use
(n/l, m - 1 ) =
( ft/L_ /I, m)
/(l + m)(l - m + 1) h 1 ( - -a8a + i cot 8-a.pa ) (fill' m) = e-• /(l + m)(l - m + 1) "'
(3 .6.36)
successively to obtain all �m with I fixed. Because this is done in many textbooks on elementary quantum mechanics, we will not work out the details here. The result for � 0 is
y m ( 8 , "" ) -_ ( - l ) l I
-;-
21/ !
m (21+ 1) {l + m) ! e im
.
I
( 3.6.37)
*Normalization condition (3.6.30), of course, does not determine the phase of c1• The factor ( - 1) 1 is inserted so that when we use the L_ operator successively to reach the state m = 0, we obtain Y1° with the same sign as the Legendre polynomial P1(cos 9 ) whose phase is fixed by P1 (1) 1 [see (3.6.39)]. =
201
3.6. Orbital Angular Momentum
and we define y;- "' by
y;- '" ( O , q,) = ( - l) "' [ Yt"' ( B ,
(3.6 .38)
Regardless of whether m is positive or negative, the 8-dependent part of Yt"'( 8, q,) is [sin O ] I "' I times a polynomial in cos O with a highest power of / - l m l. For m = 0, we obtain Y,o ( 8 , q,) =
{f[+l V --;r;- P1 (cos 8 ) .
(3 .6.39)
From the point of view of the angular-momentum commutation relations alone, it might not appear obvious why I cannot be a half-integer. It turns out that several arguments can be advanced against half-integer /-values. First, for half-integer /, and hence for half-integer m, the wave function would acquire a minus sign, i (lw) e m
=-
1
'
( 3 .6 .40)
under a 2"1T rotation. As a result, the wave function would not be single valued; we pointed out in Section 2.4 that the wave function must be single-valued because of the requirement that the expansion of a state ket in terms of position eigenkets be unique. We can prove that if L, defined to be x X p, is to be identified as the generator of rotation, then the wave function must acquire a plus sign under a 2"1T rotation. This follows from the fact that the wave function for a 2'1T-rotated state is the original wave function itself with no sign change: (X , I exp
( - iLhz21T) I �) = �
( x'cos21T
+ y'sin21T, y'cos21r - x'sin21T,
= (x'l a ) ,
z
'
I a)
(3 .6.41)
where we have used the finite-angle version of (3.6.6). Next, let us suppose Y,m( 8,
Y1lj{ ( 8 , q, ) = c112ei•I2Jsin 8 .
- 1 /2 8 Y1 / ( , f/> ) 2
L_
-
_
e
-,-�>(
)(
(3 .6.42)
[see (3.6.36)] we would then obtain a - 8 a
+ . I
COt
8
a aq,
= - c112e - i•l2cot 8/sin 8 .
CI / e 2
i.j>/2 � 8
ySin u
)
(3 .6.43)
This expression is not permissible because it is singular at 8 = 0, 7T . What is worse, from the partial differential equation
202
Theory of Angular Momentum
we directly obtain
y- c 1' e - i
(3.6.45)
in sharp contradiction with (3.6.43). Finally, we know from the Sturm Liouville theory of differential equations that the solutions of (3.6.28) with I integer form a complete set. An arbitrary function of (} and q, can be expanded in terms of r;m with integer I and m only. For all these reasons it is futile to contemplate orbital angular momentum with half-integer /-val ues.
Spherical Harmonics as Rotation Matrices
We conclude this section on orbital angular momentum by discuss ing the spherical harmonics from the point of view of the rotation matrices introduced in the last section. We can readily establish the desired connec tion between the two approaches by constructing the most general direction eigenket lit) by applying appropriate rotation operators to li), the direction eigenket in the positive z-direction. We wish to find IfJ( R ) such that lit) = IiJ ( R ) Ii) . ( 3.6.46 ) We can rely on the technique used in constructing the eigenspinor of o · it in Section 3.2. We first rotate about the y-axis by angle 8, then around the z-axis by angle .p; see Figure 3.3 with {3 ---> (}, a ---> q,. In the notation of Euler angles we have ( 3 .6.47) IiJ ( R ) = IiJ ( a = .p, {3 = 8 , y = O) . Writing (3.6.46) as lit) = L L iiJ ( R ) Il, m ) (/, mli ) , (3.6.48) I
m
we see that lit), when expanded in terms of II, m ) , contains all possible /-values. However, when this equation is multiplied by ( /, m'l on the left, only one term in the /-sum contributes, namely,
(3.6.49) ( /, m 'lit ) = l:IiJ,;.',�( a = q, , {3 = 8 , y = 0) ( /, m li ) . m Now ( /, m li ) is just a number; in fact, it is precisely Yr*< 8 , q,) evaluated at 8 = 0 with q, undetermined. At (} = 0, r;m is known to vanish for m * 0, which can also be seen directly from the fact that li) is an eigenket of Lz (which equals xpy - YPx ) with eigenvalue zero. So we can write ( /, mli) = r;m*( (} = 0, q, undetermined) llm o = =
fi¥}1
P1 (cos 0 )
[(2i+Tf . V � umo·
cos 8 = 1
llm o ( 3 .6 . 50)
3.7. Addition of Angular Momenta
203
Returning to (3.6.49), we have }';
m'*( 0 , > ) = v{(2i'+lf � �m( /'O) ( a =
(3.6.51)
or
Notice the
m
0
= case, which is of particular importance: (3.6.53)
3.7.
ADDITION OF ANGULAR MOMENTA
Angular-momentum addition has important applications in all areas of modern physics-from atomic spectroscopy to nuclear and particle colli sions. Furthermore, a study of angular-momentum addition provides an excellent opportunity to illustrate the concept of change of basis, which we discussed extensively in Chapter 1.
Simple Examples of Angular-Momentum Addition Before studying a formal theory of angular-momentum addition, it is worth looking at two simple examples with which the reader may be familiar: (1) how to add orbital angular momentum and spin-angular momentum and (2) how to add the spin-angular momenta of two spin ! particles. Previously we studied both spin 1 systems with all quantum-mecha nical degrees gf freedom other than spin- such as position and momentum - ignored and quantum-mechanical particles with the space degrees of freedom (such as position or momentum) taken into account but the internal degrees of freedom (such as spin) ignored. A realistic description of a particle with spin must of course take into account both the space degree of freedom and the internal degrees of freedom. The base ket for a spin ! particle may be visualized to be in the direct-product space of the infinite dimensional ket space spanned by the position eigenkets { lx ') } and the two-dimensional spin space spanned by I + ) and 1 - ) . Explicitly, we have for the base ket
lx ', ± ) = lx ') ® l ± ) ,
(3 .7. 1)
where any operator in the space spanned by { lx ') } commutes with any operator in the two-dimensional space spanned by 1 ± ) .
Theory of Angular Momentum
204
The rotation operator still takes the form exp( - iJ · ilcf>/ li ) but J, the generator of rotations, is now made up of two parts, namely, (3 .7.2) J = L + S. It is actually more obvious to write (3.7.2) as J = L ® 1 + 1 ® S, (3.7 .3) where the 1 in L ® 1 stands for the identity operator in the spin space, while the 1 in 1 ® S stands for the identity operator in the infinite-dimensional ket space spanned by the position eigenkets. Because L and S commute, we can write !Z ( R } = !Z(orb ) ( R } ®!Z {spin) ( R }
(
)
(
- iS · il
li
)
·
(3.7.4)
The wave function for a particle with spin is written as The two components lj; ± follows:
( x ', ± !a) = 1j; ± ( x ' ) .
(3 .7.5 ) are often arranged m column matrix form as
(�:i::n .
(3.7 6 ) .
where I.P ± ( x ') ! 2 stands for the probability density for the particle to be found at x ' with spin up and down, respectively. Instead of !x') as the base kets for the space part, we may use In, !, m ) , which are eigenkets of L2 and Lz with eigenvalues li 2/( / + 1) and m Jz, respectively. For the spin part, 1 ± ) are eigenkets of S2 and Sz with eigenvalues 31i 2j4 and ± li j2, respectively. However, as we will show later, we can also use base kets which are eigenkets of J 2 , Jz , L2, and S2• In other words, we can expand a state ket of a particle with spin in terms of simultaneous eigenkets of L2, S2, Lz, and Sz or in terms of simultaneous eigenkets of J 2, Jz , L2, and S2• We will study in detail how the two descriptions are related. As a second example, we study two spin � particles-say two electrons-with the orbital degree of freedom suppressed. The total spin operator is usually written as S = S1 + S , (3 .7.7) 2 but again it is to be understood as S1® 1 + l ® S , (3.7.8) 2 where the 1 in the first (second) term stands for the identity operator in the spin space of electron 2 (1). We, of course, have (3.7.9) [ stx• s2y] = o and so forth. Within the space of electron 1 (2) we have the usual
3.7. Addition of Angular Momenta
205
commutation relations [ Slx, Sl v ] = iiJ Slz , [ S2x, S2 v ] = i!JS2z , . . . . As a direct consequence of (3.7.9) and (3.7.10), we have
(3.7 .10)
(3.7.1 1 ) [ Sx , Sv] = iiJSz and so on for the total spin operator. The eigenvalues of the various spin operators are denoted as follows: S 2 = (S1 + S2 ) 2 : s(s + 1) /J 2 sz = slz + s2 z : mh sl z
s2z
(3.7.12)
: mth : m2n
Again, we can expand the ket corresponding to an arbitrary spin state of two electrons in terms of either the eigenkets of S 2 and Sz or the eigenkets of S1 z and S2 r The two possibilities are as follows: 1 . The { m1, m 2 } representation based on the eigenkets of S1 z and Sz z : (3.7.13) I + + ) , I + - ) , 1 - + ) , and 1 - - ) , where 1 + ) stands for m1 = � , m 2 = ·L and so forth. 2. The { s, m } representation (or the triplet-singlet representation) based on the eigenkets of S 2 and Sz : (3 .7 .14) \s = 1 , m = ± 1 , 0) , \s = O, m = O ) , where s = 1 (s = 0) is referred to as spin triplet (spin singlet). -
-
Notice that in each set there are four base kets. The relationship between the two sets of base kets is as follows: (3.7.15a) ls = 1 , m = 1 ) = 1 + + ) ,
is = 1 , m = 0) =
(�)
( I + - ) + 1 - + )) ,
is = 1 , m = - 1 ) = 1 - - ) ,
l s = O, m = O> =
(� )
( 1 + - ) - 1 + )) . -
(3.7.15b) ( 3.7. 1 5c)
(3 .7 .15d)
The right-hand side of (3.7.15a) tells us that we have both electrons with spin up; this situation can correspond only to s = 1, m = 1 . We can obtain (3.7.15b) from (3.7.15a) by applying the ladder operator S_ ;; S1 _ + S2 _ (3 .7 .16) = ( S1x - iS1 y ) + ( S2x - iS2 v )
Theory of Angular Momentum
206
to both sides of (3.7.15a). In doing so we must remember that an electron 1 operator like S1 _ affects just the first entry of I + + ), and so on. We can write 3.7.17 S _ ls = 1 , m = 1 ) = ( S1 _ + S2 _ ) I + + ) as
(
/(1 + 1}(1 - l + l } l s = l , m = O) = /( ! + t}{t - 1 + 1 ) X I - + ) + /( 1 + 1 )( 1 - 1 + 1 ) I + - ) ,
}
( 3 .7.18}
which immediately leads to (3.7.15b). Likewise, we can obtain Is = 1, m = - 1 ) by applying (3.7.16) once again to (3.7.15b). Finally, we can obtain (3.7.15d) by requiring it to be orthogonal to the other three kets, in particular to (3.7.15b). The coefficients that appear on the right-hand side of (3.7.15) are the simplest example of Clebsch-Gordan coefficients to be discussed further at a later time. They are simply the elements of the transformation matrix that connects the { m1 , m2 } basis to the { s, m } basis. It is instructive to derive these coefficients in another way. Suppose we write the 4 X 4 matrix corre sponding to
2 S = Si + Si + 2 S 1 • S2 ( 3.7.19} = Si + Si + 2 SizSz z + SI + Sz _ + St _ Sz + using the (m1, m 2 ) basis. The square matrix is obviously not diagonal because an operator like Sl + connects 1 - + ) with 1 + + ) . The unitary matrix that diagonalizes this matrix carries the l m 1 , m 2 ) base kets into the Is, m ) base kets. The elements of this unitary matrix are precisely the Clebsch-Gordan coefficients for this problem. The reader is encouraged to work out all this in detail.
Fonnal Theory of Angular-Momentum Addition Having gained some physical insight by considering simple examples, we are now in a position to study more systematically the formal theory of angular-momentum addition. Consider two angular-momentum operators J1 and J2 in different subs paces. The components of J1 (J2) satisfy the usual angular-momentum commutation relations:
(3.7.20a} and
( 3 .7 .20b )
3.7. Addition of Angular Momenta
207
However, we have (3.7 .21) between any pair of operators from different subspaces. The infinitesimal rotation operator that affects both subspace 1 and subspace 2 is written as
( h ) ( h )= _ iJz · fio 1 ilt · fio ,0, '6' 1
1
h
_ i(J1®1 + 1®J2) · fio<J>
We define the total angular momentum by J = JI ® l + 1 ® Jz , which is more commonly written as J = JI + Jz . The finite-angle version of (3.7.22) is - iJ� · it
(
) (
.
(3.7 .22) (3.7.23) (3.7 .24)
)
(3 .7.25)
Notice the appearance of the same axis of rotation and the same angle of rotation. It is very important to note that the total J satisfies the angular momentum commutation relations (3.7 .26) as a direct consequence of (3.7.20) and (3.7.21). In other words, J is an angular momentum in the sense of Section 3.1 . Physically this is reasonable because J is the generator for the entire system. Everything we learned in Section 3.5- for example, the eigenvalue spectrum of J 2 and Jz and the matrix elements of the ladder operators-also holds for the total J. As for the choice of base kets we have two options. and denoted Option A : Simultaneous eigenkets of J? , J{, by 11112; m 1 m 2 ) . Obviously the four operators commute with each other. The defining equations are
J1z , 122 ,
Jt2 liif2 ; mt m z ) = itUt + 1 ) 1l 2 ifth; m t m z ), Jtzl iih; mt m z ) = mtnl ftiz; mt m z ), Jfiiifz; m t mz) = iz ( Jz +1) 1l 2 iitf2 ; m t m z ), lzz l ii Jz ; m l mz ) = mznlft fz ; mtmz ). 2
(3.7 .27a) (3.7.27b)
(3.7 .27c) (3.7.27d) Option B: Simultaneous eigenkets of J 2, J?, J2 , and Jz . First, note that this set of operators mutually commute. In particular, we have (3.7 .28)
Theory of Angular Momentum
208
which can readily be seen by writing J 2 as
J l = J12 + J22 + 2 Jl z ]l + Jl + Jl - + Jl - Jl + · z
(3.7.29)
We use \)1 , iz ; jm ) to denote the base kets of option B:
2 J1 \i1iz ; Jm) = J1 ( Jl + 1 ) 1i 2 \ )1 Jz ; Jm ) , 2 Jz2 li1 Jz ; Jm) = iz Uz + 1 ) 1i \ )l j ; Jm ) , 2 2 2 ) )2 1 J \ 1 ; jm) = j ( J + ) 1i \i1iz : jm ) , 1,\ itiz ; Jm) = mli l itiz ; jm ) .
(3.7 .30a) (3.7.30b) (3.7 .30c) (3.7 .30d)
Quite often )1, )2 are understood, and the base kets are written simply as \ ), m ) .
It is very important to note that even though (3.7 .31)
we have (3.7 .32) as the reader may easily verify using (3.7.29). This means that we cannot 2 add J to the set of operators of option A. Likewise, we cannot add J1, andjor J2 z to the set of operators of option B. We have two possible sets of base kets corresponding to the two maximal sets of mutually compatible observables we have constructed. Let us consider the unitary transformation in the sense of Section 1 .5 that connects the two bases: \ i1 h ; Jm ) = L E \i1 h ; mlm 2 ) ( jliz ; m l m 2 \ i1 h ; Jm ) , (3.7 .33)
where we have used (3 .7.34) and where the right-hand side is the identity operator in the ket space of given )1 and )2• The elements of this transformation matrix ( j1 iz ; m 1 m 2 \ )1 j2 ; jm) are Clebsch-Gordan coefficients. There are many important properties of Clebsch-Gordan coefficients that we are now ready to study. First, the coefficients vanish unless (3 .7.35) To prove this, first note that (3.7.36) ( Jz - Jlz - lzJ litiz ; Jm ) = 0. Multiplying (j1j2 ; m1m 2 \ on the left, we obtain (3 .7.37) ( m - m l - m z ) ( jl jz ; m 1 m z \i1 iz ; Jm ) = 0,
209
3. 7. Addition of Angular Momenta
which proves our assertion. Admire the power of the Dirac notation! It really pays to write the Clebsch-Gordan coefficients in Dirac's bracket form, as we have done. Second, the coefficients vanish unless l i1 - i2 l � i � i1 + i2 ·
(3.7 .38)
This property may appear obvious from the vector model of angular momentum addition, where we visualize J to be the vectorial sum of J1 and J2 . However, it is worth checking this point by showing that if (3.7.38) holds, then the dimensionality of the space spanned by { I }1 }2; m 1 m 2 ) } is the same as that of the space spanned by { I J1 j2; jm) } . For the (m1, m 2 ) way of counting we obtain N = (2j1 + 1 )(2f2 + 1) ( 3 .7.39) because for given }1 there are 2 }1 + 1 possible values of m1; a similar statement is true for the other angular momentum f2. As for the ( j, m ) way of counting, we note that for each j, there are 2 } + 1 states, and according to (3 .7.38), j itself runs from }1 - }2 to }1 + }2, where we have assumed, without loss of generality, that }1 � iz. We therefore obtain N=
}, + h
L
J-h-h
(2} + 1)
= t [ { 2 ( jl - }2 ) + 1 } + { 2 ( }1 + }2 ) + 1 } ] ( 2 }2 + 1 ) =
( 2 }1 + 1)( 2 }2 + 1 ) .
(3.7 .40)
Because both ways of counting give the same N-value, we see that (3.7.38) is quite consistent.* The Clebsch-Gordan coefficients form a unitary matrix. Further more, the matrix elements are taken to be real by convention. An immediate consequence of this is that the inverse coefficient ( }1}2; Jm l J1j2; m 1 m 2 ) is the same as ( }1}2; m1m2I J1 j2; jm ) itself. A real unitary matrix is orthogo nal, so we have the orthogonality condition L L ( }l iz ; m ! m zlitJ2 ; Jm ) ( jt iz ; m \ m 2 1 itiz ; Jm ) = 8m , m;8m , m2' j
m
(3.7.41)
which is obvious from the orthonormality of { 1 }1 }2 ; m 1 m 2 ) } together with the reality of the Clebsch-Gordan coefficients. Likewise, we also have L L < Jti2 ; m t m 2 litJ2 ; }m) ( itJ2 ; m 1 m 2 l Jti2 ; J 'm ') = {jJJ'{jmm'·
(3.7.42)
• A complete proof of (3. 7.38) is given in Gottfried 1 966, 215, and also in Appendix
book.
B of this
Theory of Angular Momentum
210
As a special case of this we may set } ' = j, obtain
m '= m = m 1 + m 2 . We then (3.7.43)
which is just the normalization condition for I J1 f2 ; jm ). Some authors use somewhat different notations for the Clebsch Gordan coefficients. Instead of (j1f2 ; m1m 2 i}1f2 ; jm ) we sometimes see ( i1m1}2m2 li1}2jm) , C(j1}2j; m1m2m), SJ,( Jm ; m1m2 ), and so on. They can also be written in terms of Wigner's 3-j symbol, which is occasionally found in the literature:
. .
. ( m1m 2 _ m )
( h. h. ·. m l m2 I h. h ·. Jm ) - ( - 1) ),- h + my2 J + 1
hhJ
.
(3.7 .44)
Recursion Relations for the Clebsch-Gordan Coefficients* With }1 , }2, and j fixed, the coefficients with different m 1 and m 2 are related to each other by recursion relations. We start with 1 litiz; Jm > = U1 ± + 12 ± n::: L l il i2 ; m1m 2><J�J2 ; m1m2lit i2; Jm > . ±
(3 .7.45) Using (3.5.39) and (3.5.40) we obtain (with ;
m1 --> m;, m2 --> m ;)
/(J + m)(j ± m + 1 ) 1 }1 }2 j , m ± 1) = L :L ( J( Jl + m ;)(J1 ± m; + 1 ) 1}1}2; m; ± 1, m; ) ml m 2
+ /(J2 + m;)(h ± m; + 1 ) li1 f2 ; m;, m; ± 1) )
X ( iJiz ; m;m 21iliz; Jm) . (3.7 .46) Our next step is to multiply by ( j1iz ; m 1 m 2 1 on the left and use orthonor
mality, which means that nonvanishing contributions from the right-hand side are possible only with (3.7.47)
for the first term and (3.7.48) * More-detailed discussion of Clebsch-Gordan and Racah coefficients. recoupling, and the like is given in A. R. Edmonds 1 960, for instance.
21 1
3.7. Addition of Angular Momenta
for the second term. In this manner we obtain the desired recursiOn relations: /( J + m ) ( J ± m + 1 ) ( i1 J2 ; m1 m z l ii J2 ; } , m ± 1 )
=
/U1 + m 1 + l ) ( jl ± m l ) ( JIJz ; m 1 + 1 , rn z l i1 Jz ; Jm )
+ /( Jz + m z + 1 ) { jz ± m z ) < J1 iz ; m 1 , m z + 1 i iJ iz ; Jm ) . (3 .7 .49)
It is important to note that because the J ± operators have shifted the m-values, the nonvanishing condition (3.7.35) for the Clebsch-Gordan coefficients has now become [when applied to (3.7.49)] (3.7.50) We can appreciate the significance of the recursion relations by looking at (3.7.49) in an m1m 2-plane. The J recursion relation (upper sign) tells us that the coefficient at ( m 1 , m 2 ) is related to the coefficients at (m1 - 1 , m 2 ) and ( m 1 , m 2 - 1), as shown in Figure 3.5a. Likewise, the J recursion relation (lower sign) relates the three coefficients whose m 1 , m 2 values are given in Figure 3.5b. Recursion relations (3.7.49), together with normalization condition (3.7.43), almost uniquely determine all Clebsch-Gordan coefficients. (We say " almost uniquely" because certain sign conventions have yet to be specified.) Our strategy is as follows. We go back to the m1m 2-plane, again for fixed j1, iz, and j, and plot the boundary of the allowed region determined by +
_
(3.7.51) -j< ml + mz < j; l ml l < h , l mz l < h , see Figure 3.6a. We may start with the upper right-hand corner, denoted by A. Because we work near A at the start, a more detailed " map" is in order;
(m,-1,m2) LHS � - - - - - - - -, (m,,m2)
RHS
',
'
',
'
'
I
J
•
,
'
I I I I
I '' I
RHS
(a) J. relation
,1 •
{m,,m2 -1) (b) J_ relation
FIGURE 3.5. m1 m 2-plane showing, the Clebsch-Gordan coefficients related by the recursion relations (3.7.49).
Theory of Angular Momentum
212
see Figure 3.6b. We apply the J _ recursion relation (3.7.49) (lower sign), with (m1, m 2 + 1) corresponding to A . Observe now that the recursion relation connects A with only B because the site corresponding to (m1 + 1 , m 2 ) is forbidden by m1 s )1• As a result, we can obtain the Clebsch Gordan coefficient of B in terms of the coefficient of A . Next, we form a J triangle made up of A, B, and D. This enables us to obtain the coefficient of D once the coefficient of A is specified. We can continue in this fashion: Knowing B and D, we can get to E; knowing B and E we can get to C, and so on. With enough patience we can obtain the Clebsch-Gordan coefficient of every site in terms of the coefficient of starting site, A . For overall normalization we use (3.7.43). The final overall sign is fixed by convention. (See the following example.) As an important practical example we consider the problem of adding the orbital and spin-angular momenta of a single spin 1 particle. We have )1 = I (integer) , +
-s- z1 , }.2 -
(3.7 .52)
The allowed values of j are given by (3 .7.53 )
J=L 1=0
J = I ± L I > O;
So for each I there are two possible )-values; for example, for I = 1 ( p state) we get, in spectroscopic notation, p 312 and p1 12 , where the subscript refers to j. The m 1m 2-plane, or better the m 1 m -plane, of this problem is particu larly simple. The allowed sites form only two rows: the upper row for m , = 1 and the lower row for ms = - ! ; see Figure 3.7. Specifically, we work out s
m,=-j,
I I I I I --- - - + - - - - 1 I I I
FIGURE 3.6.
(a)
A
m, =j,
•
•
•
•
•
D E F
t', -,.� Forbidden!! , , , J. I ' I ' ' ' , I l J_ ....,.., I J_ '- , �- - - '-tB - - - �x I ,"- , J. I I '- '- I : J_ ' ,'- I -- - ---l c - -
I
(b)
Use of the recursion relations to obtain the Clebsch·Gordan coefficients.
3.7. Addition of Angular Momenta
213
X 1\. I
m.
•
•
•
•
•
•
',
X "'
,
X " I
',
I J_ ' I J_ ' � - - � - - � - -� I J_
I
'
-------4--� m , •
•
•
•
FIGURE 3.7. Recursion relations used to obtain the Clebsch-Gordan coefficients for }1 = I and 12. = s = 2I .
the case j = I + 1 - Because m cannot exceed 1, we can use the J recursion in such a way that we always stay in the upper row ( m 2 = m = 1 ), while the mrvalue changes by one unit each time we consider a new J triangle. Suppressing j1 = I, j2 1 in writing the Clebsch-Gordan coefficients, we obtain from (3.7.49) (lower sign) _
s
s
_
=
y( l + 1 + m + 1) (1 + ! - m ) (m - ! . !i l + ! , m ) = y( t + m + 1 ) ( 1 - m + ·D <m + 1 , W + 1 , m + l), (3.7.54) where we have used
m l = rn, = m - 2 ,
m 2 -- m s -- 21 ·
1
(3 .7.55)
In this way we can move horizontally by one unit:
\
1
J
l+m+! 1 1 1 m + -2 , -2 l + -2 , m + l . (3.7.56) l+m+1
We can in turn express ( m + ! , W + !, m + 1) in terms of ( m + t, W + !, m + 2), and so forth. Clearly, this procedure can be continued until m1 reaches I, the maximum possible value:
l+m+! I+ m +1 I+ m+! I+ m +!
l+ m +! 21 + 1
I+ m+! I+m+�
l+m+� l+m+�
(3 .7.57)
Theory of Angular Momentum
214
m1 ms m = m1 + ms j = 1 - lm1
Consider the angular-momentum configuration in which and are both maximal, that is, I and 1, respectively. The total is I 1 and not for ! . So = I, + t , which is possible only for must be equal to up to a phase factor. We take this phase factor to be real and positive by convention. With this choice we have
j=I+ If= / + !, m = / + 1)
ms = D
(t, W + L t + ! ) = I . Returning to
(3.7.57), we finally obtain ( m - .!_2 ' .!_2 I/ + .!_2 ' m ) = l +2/m++1 1
(3.7.58)
(3.7.59}
But this is only about one-fourth of the story. We must still de termine the value of the question marks that appear in the following:
l+m+! 21 + 1 (3.7.60} +? l mt = m + � , ms = - � ) · l j = l - � . m ) = ?l mt = m - � , ms = � ) +?l m1 = m + � , m 5 = - � ) · We note that the transformation matrix with fixed m from the (m1, mJ basis to the ( j, m) basis is, because of orthogonality, expected to have the form
(
cos a - Sin ex
sin ex cos a
)
(3.7.61}
.
Comparison with shows that cos a is readily determine sino: up to a sign ambiguity:
(3.7.60)
(3.7.59) itself; so we can (t + m + !} = -'--: (1--m-.,.::. + !) .:. sm2cx = 1 (3.7 .62} (2/ +1} (2/ + 1 ) We claim that (m1 = m + !, ms = - tl i = I + !, m) must be positive because all j = I + ! states are reachable by applying the J operator successively to l i = I + ! , m = I + ! ) , and the matrix elements of J are always positive by convention. So the 2X2 transformation matrix (3.7.61) can be only l+m+! l-m+! 2/ + 1 2/ + 1 (3.7.63) 1-m+! l+m+! 2/ + 1 2/ + 1 _
_
3.7.
215
Addition of Angular Momenta
We define spin-angular functions in two-component form as follows:
qy; - / ± 1/2, m
----7-:--=-=-
= + / ±2/m+1+ -21 Y,m - 1/2 ( 8 , > ) x l +.,.m +-:-- -21 -"'- Y,m 1 /2 ( 8 , > ) + ----,:2/ + 1 1 + Ji ± m + 1 Y,m - 1 12( 8 , > ) v2/ +1 Ji + m + 1 y;m +l/2 ( 8 , > ) =
(
+
X
+
_
)
.
(3.7 .64)
They are, by construction, simultaneous eigenfunctions of L2, S 2 , J 2, and ]2• They are also eigenfunctions of L•S but L•S, being just
L•S = (1)(J 2 - L2 - S2 ) , (3.7.65) is not independent. Indeed, its eigenvalue can easily be computed as follows:
j = I + 1, for j = 1 - 1.
for
( �2 ) [j(J+l)- l(l +l)- !] =
Clebsch-Gordan Coefficients and Rotation Matrices
(3.7.66)
Angular-momentum addition may be discussed from the point of view of rotation matrices. Consider the rotation operator p) (iil( R ) in the ket space spanned by the angular-momentum eigenkets with eigenvalue 1 . Likewise, consider !2
j
0
.... .... ....
0
...,...
_,
_ _ _ _
(3.7 .67)
Theory of Angular Momentum
216
In the notation of group theory this is written as (3 .7.68) In terms of the elements of rotation matrices, we have an important expansion known as the Clebsch-Gordan series: ��·l; ( R ) �,V,'l, ( R ) = L L L <J1 J2 ; m 1 m 2 l iJh ; Jm ) j m m'
(3.7.69) where the }-sum runs from I J1 - h i to }1 + }2 The proof of this equation follows. First, note that the left-hand side of (3.7.69) is the same as •
< JJiz ; mi m z i � ( R ) I JIJz ; m ;m;_ ) = ( JJ m d� ( R ) I JJ m ; ) ( jz m z i � ( R ) I Jz m2)
( 3 .7 .70)
But the same matrix element is also computable by inserting a complete set of states in the ( j, m ) basis. Thus ( Jij2; m l m z i � ( R ) I JI}2 ; m ; m ;_ )
= L L L L <J1 Jz; m l m ziJIJz; Jm ) ( j!Jz; Jm i � ( R ) I JJJz ; J 'm ') j
m j ' m'
X ( }1 }2 ; j 'm 'l }1 }2 ; m ; m ;_ )
= L: L: L: L: U1Jz; mlm ziJJ J2 ; Jm> �:.:l. ( R ) 81j' j
m j ' m'
(3.7.71) which is just the right-hand side of (3.7.69). As an interesting application of (3.7.69), we derive an important formula for an integral involving three spherical harmonics. First, recall the connection between !Z),:/6 and }/m* given by (3.6.52). Letting }1 ---. /1 , h ---> /2 , m ; ---. 0, m 2 --+ 0 (hence m ' --+ 0) in (3.7.69), we obtain, after complex conjugation,
We multiply both sides by }/m*( O, q,) and integrate over solid angles. The summations drop out because of the orthogonality of spherical harmonics,
217
3.8. Schwinger's Oscillator Model of Angular Momentum
and we are left with
jdQ y;m*( (}, > ) Yt' ( (} ,
(/1/2; 00 I /1/2 ; /0) (/1/2 ; m 1 m 2 l/1/2 ; /m ) .
(3.7.73)
The square root factor times the first Clebsch-Gordan coefficient is indepen dent of orientations; that is, of m1 and m 2 • The second Clebsch-Gordan coefficient is the one appropriate for adding /1 and /2 to obtain total /. Equation (3.7.73) turns out to be a special case of the Wigner-Eckart theorem to be derived in Section 3.10. This formula is extremely useful in evaluating multipole matrix elements in atomic and nuclear spectroscopy.
3.8. SCHWINGER'S OSCILLATOR MODEL OF ANGULAR MOMENTUM
Angular Momentum and Uncoupled Oscillators There exists a very interesting connection between the algebra of angular momentum and the algebra of two independent (that is, uncoupled) oscillators, which was worked out in J. Schwinger's notes [see Quantum Theory of Angular Momentum, edited by L. C. Biedenharn and H. Van Dam, Academic Press (1965), p. 229]. Let us consider two simple harmonic oscillators, which we call the 'Plus type and the minus type. We have the annihilation and creation operators, denoted by a+ and a l for the plus-type oscillator; likewise, we have a_ and a!. for the minus-type oscillators. We also define the number operators N + and N _ as follows:
N_ = a!. a _ .
(3.8.1) We assume that the usual commutation relations among a, at, and N hold for oscillators of the same type (see Section 2.3). [a + , a1 ] = 1 , (3.8.2a) [ a _ , a!. ] = 1 , ( 3 . 8.2b) [ N+ , a+ ] = - a + , [ N_ , a_ ] = - a_ , (3.8.2c) [ N_, a!. ] = a!. . [ N+, al ] = al ,
However, we assume that any pair of operators between different oscillators commute:
[ a at ] = [a at+ ] = 0 +'
-
_ ,
(3 .8.3)
and so forth. So it is in this sense that we say the two oscillators are uncoupled.
218
Theory of Angular Momentum
Because N + and N commute by virtue of (3.8.3), we can build up simultaneous eigenkets of N + and N _ with eigenvalues n and n _ , respectively. So we have the following eigenvalue equations for N ±; _
+
N - I n + , n _ ) = n _ l n + , n _ ). (3.8.4) In complete analogy with (2.3.16) and (2.3.17), the creation and annihilation operators, at± and a ±' act on In n _ ) as follows: +•
at_ I n + , n _ ) = jn _ + l i n + , n _ + 1 ) , (3.8 .5a)
a - I n + , n _ ) = fn"-=- ln + , n - 1 ) . _
(3.8 .5b)
We can obtain the most general eigenkets of N + and N _ by applying at and at_ successively to the vacuum ket defined by
a _ IO ,O) = O.
(3.8.6)
In this way we obtain (3.8 .7) Next, we define
J + = nal a _ , and
(3.8.8a)
( )
( )
Jz = n2 (at+ a + - at- a - ) = !!_2 ( N + - N - ) .
(3.8 .8b)
We can readily prove that these operators satisfy the angular-momentum commutation relations of the usual form [ Jz , J± 1 =
± nJ ± '
[ J + , L 1 = 2 nJ= .
(3.8 .9a) (3 .8.9b)
For example, we prove (3.8.9b) as follows: t t h2 [a+a,a_a+ 1
t t t t h2a+a_a_a+ - h2a_a+a+a- h2a� (at_a_ + l)a+ - h 2at_ (a�a+ + l)a_ = ft2(ata+ - at_a_) = 2ftlz. =
(3.8. 10)
Defining the total N to be
N = N + + N _ = al a + + at_ a _ ,
(3.8.11 )
3.8.
Schwinger's Oscillator Model of Angular Momentum
we can also prove
219
( � ) ( J + J- + J- J + ) = ( �2 )N( � + 1 ) ,
J 2 J/ + =
(3.8 .12)
which is left as an exercise. What are the physical interpretations of all this? We associate spin up ( m = !) with one quantum unit of the plus-type oscillator and spin down ( m = - t ) with one quantum unit of the minus-type oscillator. If you like, you may imagine one spin ! " particle" with spin up (down) with each quantum unit of the plus- (minus-) type oscillator. The eigenvalues n + and n _ are just the number of spins up and spins down, respectively. The meaning of J + is that it destroys one unit of spin down with the z-compo nent of spin-angular momentum - li /2 and creates one unit of spin up with the z-component of spin-angular momentum + li /2; the z-component of angular momentum is therefore increased by li. Likewise J _ destroys one unit of spin up and creates one unit of spin down; the z-component of angular momentum is therefore decreased by h . As for the Jz operator, it simply counts h /2 times the difference of n+ and n _ , just the z-component of the total angular momentum. With (3.8.5) at our disposal we can easily examine how J ± and Jz act on I n+ , n _ ) as follows:
J + In + , n _ ) = lia1 a_ ln+ , n _ ) = /n _ ( n+ + 1 ) liln + + 1 , n _ - 1 ), J - in +• n
_
)=
Jzln+, n _ ) =
ha� a+ in + • n
_
)=
,
(3.8.13a)
/n + ( n + 1 ) hln + - 1 , n + 1 ) _
_
(3.8.13b)
( � )(N N _ ) In + , n _ ) = ( � ) ( n + - n_ ) hln+, n _ ) . +
-
(3.8.13c) Notice that in all these operations, the sum n+ + n _, which corresponds to the total number of spin t particles remains unchanged. Observe now that (3.8.13a), (3.8.1 3b), and (3.8.13c) reduce to the familiar expressions for the J ± and Jz operators we derived in Section 3.5, provided we substitute (3 .8.14) n _ --+ j - m . The square root factors in (3.8.13a) and (3.8.13b) change to
_
Jn ( n + 1 ) --+ /(J - m)(J + m +l) , /n+ (n _ + 1) -->/(J+m)( J - m + l) , +
(3.8.15)
which are exactly the square root fadors appearing in (3.5.39) and (3.5.41).
220
Theory of Angular Momentum
Notice also that the eigenvalue of the J 2 operator defined by (3.8.12) changes as follows: (3 .8.16) All this may not be too surprising because we have already proved that J ± and J 2 operators we constructed out of the oscillator operators satisfy the usual angular-momentum commutation relations. But it is in structive to see in an explicit manner the connection between the oscillator matrix elements and angular-momentum matrix elements. In any case, it is now natural to use .
} ""'
(n+
+
n_ )
(3.8.17)
2
in place of n and n _ to characterize simultaneous eigenkets of J 2 and J=. According to (3.8.13a) the action of J + changes n + into n + + 1, n _ into n _ - 1 , which means that j is unchanged and m goes into m + 1 . Likewise, we see that the J _ operator that changes n + into n + - 1 ; n _ into n _ + 1 lowers m by one unit without changing j. We can now write as (3.8.7) for the most general N + ' N _ eigenket +
( a� V + m ( at_ V - m . m IJ, ) = 10 ) , /( J + m ) ! ( j - m ) !
(3.8.18)
where we have used 10 ) for the vacuum ket, earlier denoted by 10,0 ) . A special case of (3.8.18) is of interest. Let us set m = j, which physically means that the eigenvalue of Jz is as large as possible for a given j. We have
( a t+ )2j
I · )= 10 ) . ) , J /(2 j ) !
( 3 .8 .19)
We can imagine this state to be built up of 2j spin � particles with their spins all pointing in the positive z-direction. In general, we note that a complicated object of high j can be visualized as being made up of primitive spin 1 particles, j + m of them with spin up and the remaining j - m of them with spin down. This picture is extremely convenient even though we obviously cannot always regard an object of angular momentum j literally as a composite system of spin 1 particles. All we are saying is that, as far as the transformation properties under rotations are concerned, we can visualize any object of angular momentum j as a composite system of 2 j spin 1 particles formed in the manner indicated by (3.8.18).
3.8.
Schwinger's Oscillator Model of Angular Momentum
221
From the point of view of angular-momentum addition developed in the previous section, we can add the spins of 2 j spin 1 particles to obtain states with angular momentum j, j - l , j - 2, . . . . As a simple example, we can add the spin-angular momenta of two spin ! particles to obtain a total angular momentum of zero as well as one. In Schwinger's oscillator scheme, however, we obtain only states with angular momentum j when we start with 2 } spin 1 particles. In the language of permutation symmetry to be developed in Chapter 6, only totally symmetrical states are constructed by this method. The primitive spin � particles appearing here are actually bosons ! This method is quite adequate if our purpose is to examine the properties under rotations of states characterized by j and m without asking how such states are built up initially. The reader who is familiar with isospin in nuclear and particle physics may note that what we are doing here provides a new insight into the isospin (or isotopic spin) formalism. The operator J that destroys one unit of the minus type and creates one unit of the plus type is completely analogous to the isospin ladder operator T + (sometimes denoted by I + ) that annihilates a neutron (isospin down) and creates a proton (isospin up), thus raising the z-component of isospin by one unit. In contrast, Jz is analogous to Tz, which simply counts the difference between the number of protons and neutrons in nuclei. +
Explicit Formula for Rotation Matrices Schwinger's scheme can be used to derive, in a very simple way, a closed formula for rotation matrices, first obtained by E. P. Wigner using a similar (but not identical) method. We apply the rotation operator ffJ( R ) to I J, m ) , written as (3.8.18). In the Euler angle notation the only nontrivial rotation is the second one about the y-axis, so we direct our attention to ffJ ( R ) = ffJ ( a , ,B , y ) l a � y �o = exp
We have ffJ ( R ) I J , m ) =
II
( - ily/3 )
·
[ §J ( R ) a � !»- l ( R ) ] J + m [ §J ( R ) a t_ f)- l ( R ) ] 1 - m J( J + m ) ! ( J - m ) !
(3.8.20)
ffJ ( R ) IO) .
(3.8.21) Now, ffJ( R ) acting on 10) just reproduces 10) because, by virtue of (3.8.6), only the leading term, 1, in the expansion of exponential (3.8.20) contrib utes. So (3.8 .22)
222
Theory of Angular Momentum
Thus we may use formula
(2.3.47). Letting y , ,G -> -
-J
(3.8.23)
(2.3.47), we realize that we must look at various commutators, namely, [ -,'Y , a� ] = ( ;i ) [ a!_a + , a� ] = ( L ) at_ , (3.8.24) -,'Y , [ / , a� ]] = [ -,'Y ' �t ] = (�) a� [ and so forth. Clearly, we always obtain either at or a!_ . Collecting terms, we get (3.8.25 ) � ( R)a� ��1 (R) = a� cos(�) + a!_ sin(�) .
in
t
Likewise,
�(
R )a!_ �-1 ( R) = at_ cos(�) - a� sin(�) .
(3.8.26)
Actually this result is not surprising. After all, the basic spin-up state is supposed to transform as
(3.8.27) a� JO) -> cos(�) at JO) + sin (�) at_ JO) under a rotation about the y-axis. Substituting (3.8.25) and (3.8.26) into (3.8.21) and recalling the binomial theorem (3.8.28) we obtain
J -_m )! � ( a = O, jJ, y = O J ;., m) = L Lt ( l. + m(J-+km) '.)!( k '. ( J. m - / ) .If .! k ( /J/2) ] k [ at cos( /J/2)] J + m - k [a!_ sin��� x ���v�(=;. == + m�)�!(=J=-=m�)=, X [ - at sin( J3/2)Y - m - t[a!_ cos(J3/2)]' I O) . (3.8.29) We may compare (3.8.29) with � ( a = o, p , y = o) J J , m) = f.J J , m ')d�� ( P ) ( a ) J + m' ( a ) j - m' = f.d�� ( IJ ) ++ ')!(J_- ')! 10) . j(J m m (3.8.30) m' m'
t
t
3.9. Spin Correlation Measurements and Bell's Inequality
223
We can obtain an explicit form for d:J,�( /1 ) by equating the coefficients of powers of a� in (3.8.29) and (3.8.30). Specifically, we want to compare a� raised to j + m' in (3.8.30) with a� raised to 2 } - k - I, so we identify
l = j - k - m'. (3.8 .31) We are seeking d m'm (/3 ) with m' fixed. The k-sum and the /-sum in (3.8.29) are not independent of each other; we eliminate I in favor of k by taking
advantage of (3.8.31). As for the powers of a� , we note that a� raised to j - m ' in (3.8.30) automatically matches with at_ raised to k + I in (3.8.29) when (3.8.31) is imposed. The last step is to identify the exponents of cos(/3/2), sin(/3/2), and ( - 1), which are, respectively,
j + m - k + I = 2 } - 2k + m - m ', (3.8.32a) k + j - m - I = 2k - m + m ', (3.8 .32b) j - m - l = k - m + m', (3.8 .32c) where we have used (3.8.31) to eliminate I. In this way we obtain Wigner's
formula for d ;,;,l,( /3): . d(J) m 'm ( /} ) =
P
"
'7 ( - 1)
m( J +) !( J - m) ! (J + m ')! ( J- m')!k - m + m, ---'v-
( )
f
( )
f1 2j - 2k + m - m' . f1 2k - m + m '
X cos 2
......,---
.....,.--
( J + m - k )! k ! ( j - k - m ' ) ! ( k - m + m ' )! sm 2
,
(3.8.33)
where we take the sum over k whenever none of the arguments of factorials in the denominator are negative.
3.9. SPIN CORRELATION MEASUREMENTS AND
BELL'S INEQUALITY Correlations in Spin-Singlet States The simplest example of angular-momentum addition we encoun tered in Section 3.7 was concerned with a composite system made up of spin 1 particles. In this section we use such a system to illustrate one of the most astonishing consequences of quantum mechanics. Consider a two-electron system in a spin-singlet state, that is, with a total spin of zero. We have already seen that the state ket can be written as [see (3.7.15d)] fspin singlet) =
( � ) ( IZ +
;
z - ) - fz -
;
Z + )) ,
(3.9.1)
where we have explicitly indicated the quantization direction. Recall that
224
Theory of Angular Momentum
lz + ; z - )
means that electron 1 is in the spin-up state and electron 2 is in the spin-down state. The same is true for lz - ; z + ) . Suppose we make a measurement on the spin component of one of the electrons. Clearly, there is a 50-50 chance of getting either up or down because the composite system may be in lz + ; z - ) or lz - ; z + ) with equal probabilities. But if one of the components is shown to be in the spin-up state, the other is necessarily in the spin-down state, and vice versa. When the spin component of electron 1 is shown to be up, the measurement apparatus has selected the first term, lz + ; z - ) of (3.9.1); a subsequent measurement of the spin component of electron 2 must ascertain that the state ket of the composite system is given by lz + ; z - ) . It is remarkable that this kind of correlation can persist even if the two particles are well separated and have ceased to interact provided that as they fly apart, there is no change in their spin states. This is certainly the case for a J = 0 system disintegrating spontaneously into two spin 1 particles with no relative orbital angular momentum, because angular momentum conservation must hold in the disintegration process. An exam ple of this would be a rare decay of the 1J meson (mass 549 MeVjc 2 ) into a muon pa1r (3.9.2)
which, unfortunately, has a branching ratio of only approximately 6 X 10- 6. More realistically, in proton-proton scattering at low kinetic energies, the Pauli principle to be discussed in Chapter 6 forces the interacting protons to be in 1S0 (orbital angular momentum 0, spin-singlet state), and the spin states of the scattered protons must be correlated in the manner indicated by (3.9.1) even after they get separated by a macroscopic distance. To be more pictorial we consider a system of two spin 1 particles moving in opposite directions, as in Figure 3.8. Observer A &pecializes in measuring S, of particle 1 (flying to the right), while observer B specializes in measuring Sz of particle 2 (flying to the left). To be specific, let us assume that observer A finds Sz to be positive for particle 1 . Then he or she can predict, even before B performs any measurement, the outcome of B's measurement with certainty: B must find Sz to be negative for particle 2. On the other hand, if A makes no measurement, B has a 50-50 chance of getting S, + or Sz - . This by itself might not be so peculiar. One may say, " It is just like an urn known to contain one black ball and one white ball. When we blindly pick one of them, there is a 50-50 chance of getting black or white. But if the first ball we pick is black, then we can predict with certainty that the second ball will be white." It turns out that this analogy is too simple. The actual quantum mechanical situation is far more sophisticated than that! This is because observers may choose to measure Sx in place of S,. The same pair of
3.9. Spin Correlation Measurements and Bell's Inequality
I I· B
Pa rticle 2
!
FIGURE 3.8.
0
225
i
·I
Pa rticle 1
A
Spin correlation in a spin-singlet state.
"quantum-mechanical balls" can be analyzed either in terms of black and white or in terms of blue and red! Recall now that for a single spin ! system the Sx eigenkets and Sz eigenkets are related as follows:
lx ±) = ( � )(li + )±li - )), IZ±>= ( � )(lx +)±lx - )).
(3.9.3 )
Returning now to our composite system, we can rewrite spin-singlet ket (3.9.1 ) by choosing the x-direction as the axis of quantization: Ispin singlet) =
( � ) ( lx - ; x + ) -1x + ; x - ) )
.
(3.9.4)
Apart from the overall sign, which in any case is a matter of convention, we could have guessed this form directly from (3.9.1) because spin-singlet states have no preferred direction in space. Let us now suppose that observer A can choose to measure s. or Sx of particle 1 by changing the orientation of his or her spin analyzer, while observer B always specializes in measuring Sx of particle 2. If A determines s. of particle 1 to be positive, B clearly has a 50-50 chance for getting Sx or Sx ; even though Sz of particle 2 is known to be negative with certainty, its Sx is completely undetermined. On the other hand, let us suppose that A also chooses to measure Sx; if observer A determines Sx of particle 1 to be positive, then without fail, observer B will measure Sx of particle 2 to be negative. Finally, if A chooses to make no measurement, B, of course, will have a 50-50 chance of getting Sx or Sx . To sum up:
+
-
-
+
1 . If A measures S, and B measures Sx, there is a completely random correlation between the two measurements. 2. If A measures Sx and B measures Sx, there is a 100% (opposite sign) correlation between the two measurements. 3. If A makes no measurement, B's measurements show random results.
Table 3.1 shows all possible results of such measurements when B and A are allowed to choose to measure Sx or Sz.
226
Theory of Angular Momentum
TABLE 3.1. Spin-correlation Measurements Spin component measured by A
A's result
z
+
z
z
B's result
X
+
z
+
z
X X
Spin component measured by B
z
+
X
z
+
X
+
z
+
X X
+
z z
X
X
+ +
X X X z z
+
+
These considerations show that the outcome of B's measurement appears to depend on what kind of measurement A decides to perform: an Sx measurement, an Sz measurement, or no measurement. Notice again that A and B can be miles apart with no possibility of communications or mutual interactions. Observer A can decide how to orient his or her spin-analyzer apparatus long after the two particles have separated. It is as though particle 2 " knows" which spin component of particle 1 is being measured. The orthodox quantum-mechanical interpretation of this situation is as follows. A measurement is a selection (or filtration) process. When Sz of particle 1 is measured to be positive, then component IZ + ; z - ) is selected. A subsequent measurement of the other particle's Sz merely ascertains that the system is still in IZ + ; z - ). We must accept that a measurement on what appears to be a part of the system is to be regarded as a measurement on the whole system. Einstein's Locality Principle and Bell's Inequality
Many physicists have felt uncomfortable with the preceding ortho dox interpretation of spin-correlation measurements. Their feelings are typified in the following frequently quoted remarks by A. Einstein, which we call Einstein's locality principle: "But on one supposition we should, in my opinion, absolutely hold fast: The real factual situation of the system S2 is independent of what is done with the system S1, which is spatially separated from the former." Because this problem was first discussed in a
3. 9. Spin Correlation Measurements and Bell's Inequality
227
1935 paper of A. Einstein, B. Podolsky, and N. Rosen, it is sometimes known as the Einstein-Podolsky-Rosen paradox. * Some have argued that the difficulties encountered here are inherent in the probabilistic interpretations of quantum mechanics and that the dynamic behavior at the microscopic level appears probabilistic only be cause some yet unknown parameters-so-called hidden variables-have not been specified. It is not our purpose here to discuss various alternatives to quantum mechanics based on hidden-variable or other considerations. Rather, let us ask, Do such theories make predictions different from those of quantum mechanics? Until 1964, it could be thought that the alternative theories could be concocted in such a way that they would give no predictions, other than the usual quantum-mechanical predictions, that could be verified experimentally. The whole debate would have belonged to the realm of metaphysics rather than physics. It was then pointed out by J. S. Bell that the alternative theories based on Einstein's locality principle actually predict a testable inequality relation among the observables of spin-correlation experiments that disagrees with the predictions of quantum mechanics. We derive Bell's inequality within the framework of a simple model, conceived by E. P. Wigner, that incorporates the essential features of the various alternative theories. Proponents of this model agree that it is impossible to determine Sx and Sz simultaneously. However, when we have a large number of spin 1 particles, we assign a certain fraction of them to have the following property: If Sz is measured, we obtain a plus sign with certainty. If Sx is measured, we obtain a minus sign with certainty. A particle satisfying this property is said to belong to type (z + , x - ). Notice that we are not asserting that we can simultaneously measure Sz and Sx to be + and - , respectively. When we measure Sz, we do not measure Sx, and vice versa. We are assigning definite values of spin components in more than one direction with the understanding that only one or the other of the components can actually be measured. Even though this approach is funda mentally different from that of quantum mechanics, the quantum-mechani cal predictions for Sz and Sx measurements performed on the spin-up ( Sz + ) state are reproduced provided there are as many particles belonging to type (z + , x + ) as to type (z + , x - ) Let us now examine how this model can account for the results of spin-correlation measurements made on composite spin-singlet systems. .
* To be historically accurate, the original Einstein-Podolsky-Rosen paper dealt with measurements of x and p. The use of composite spin 1 systems to illustrate the Einstein Podolsky-Rosen paradox started with D. Bohm.
228
Theory of Angular Momentum
Clearly, for a particular pair, there must be a perfect matching between particle 1 and particle 2 to ensure zero total angular momentum: If particle 1 is of type (z + , x - ), then particle 2 must belong to type (z - , x + ), and so forth. The results of correlation measurements, such as in Table 3.1, can be reproduced if particle 1 and particle 2 are matched as follows: particle 1 particle 2 ( 3.9.5a) (Z + , x - ) ( z - , x + ) , ( 3.9.5b) (Z + , x + ) (z - , x - ) , .....
.....
(z - , x + ) (Z + , x - ) , (z- , x - ) (Z + , x + ) .....
.....
( 3.9.5c)
( 3.9.5d)
with equal populations, that is, 25% each. A very important assumption is implied here. Suppose a particular pair belongs to type (3.9.5a) and observer A decides to measure Sz of particle 1; then he or she necessarily obtains a plus sign regardless of whether B decides to measure S1 or Sx. It is in this sense that Einstein's locality principle is incorporated in this model: A's result is predetermined independently of B's choice as to what to measure. In the examples considered so far, this model has been successful in reproducing the predictions of quantum mechanics. We now consider more-complicated situations where the model leads to predictions different from the usual quantum-mechanical predictions. This time we start with three unit vectors a, b, and c, which are, in general, not mutually orthogonal. We imagine that one of the particles belongs to some definite type, say (a - , b + , c + ), which means that if S · a is measured, we obtain a minus sign with certainty; if S•b is measured, we obtain a plus sign with certainty; if S ·c is measured, we obtain a plus with certainty. Again there must be a perfect matching in the sense that the other particle necessarily belongs to type (a + , b - , c - ) to ensure zero total angular momentum. In any given event, the particle pair in question must be a member of one of the eight types shown in Table 3.2. These eight possibilities are mutually exclusive and disjoint. The population of each type is indicated in the first column. Let us suppose that observer A finds S1 · a to be plus and observer B finds S2·b to be plus also. It is clear from Table 3.2 that the pair belong to either type 3 or type 4, so the number of particle pairs for which this situation is realized is N3 + N4• Because Ni is positive semidefinite, we must have inequality relations like ( 3.9.6 ) Let P(a + ; b + ) be the probability that, in a random selection, observer A measures 8 1 • a to be + and observer B measures 82 • b to be + , and so on.
3. 9. Spin Correlation Measurements and Bell's Inequality
229
TABLE 3.2. Spin-component Matching in the Alternative Theories Population
Particle 1
Particle 2
Nt N2 N3 N•
(1t+ ,b + ,C + ) (a + ,h+ , c � ) (1t+ ,h� , C + ) (11+ ,h� , c ) (a � ,h + , C + J (a � ,h+ ,c � ) (a � , h � , e + J (a � , b � , c � J
(lt � ,h� , c � ) (a� ,h� ,C + ) (a� ,h+ , c � ) (a � ,h+ , C + ) (a+ ,h� , c � ) (1t+ ,b � ,C + ) (1t+ ,h+ ,c � )
�
Ns N6 N7 Ns
(11+ ,b + ,C + )
Clearly, we have (3 .9.7)
In a similar manner, we obtain P (a + ,· c + ) =
( N2 + N ) 'isN I I
•
and
The positivity condition (3.9.6) now becomes
P (H ; b + ) s P(H ; C + ) + P(C+ ; b + ) .
(3 .9.9)
This is Bell's inequality, which follows from Einstein's locality principle. Quantum Mechanics and Bell's Inequality
We now return to the world of quantum mechanics. In quantum mechanics we do not talk about a certain fraction of particle pairs, say N3/'L�N;, belonging to type 3. Instead, we characterize all spin-singlet systems by the same ket (3.9.1); in the language of Section 3.4 we are concerned here with a pure ensemble. Using this ket and the rules of quantum mechanics we have developed, we can unambiguously calculate each of the three terms in inequality (3.9.9). We first evaluate P(a + ; b + ). Suppose observer A finds S 1 · a to be positive; because of the (opposite sign) correlation we discussed earlier, B's measurement of S2 · a will yield a minus sign with certainty. But to calculate P(a + ; b + ) we must consider a new quantization axis b that makes an angle ()ab with a; see Figure 3.9. According to the formalism of Section 3.2, the probability that the S2 ·b measurement yields + when particle 2 is known to be in an eigenket of S2· a with negative eigenvalue is
100%
230
Theory of Angular Momentum
a direction
t I I
6 direction
e••
FIGURE 3.9. Evaluation of P(a + ; b + ) .
given by COS
As a result, we obtain
[
] ( 2 ). ( ) sm ( )
2 ( '1T - 8ab ) - . 2 8ah - Sill
2
1 P ( a' + ; 'b + ) = 2
(3.9.10)
eab 2 ,
(3.9.11) where the factor � arises from the probability of initially obtaining S 1 • a with + . Using (3.9.11) and its generalization to the other two terms of (3.9.9), we can write Bell's inequality as .
2
(3.9.12) We now show that inequality (3.9.12) is not always possible from a geometric point of view. For simplicity let us choose a, b, and c to lie in a plane, and let c bisect the two directions defined by a and b: (3.9.13) Inequality (3.9.12) is then violated for '1T 0<8< . (3.9.14) 2 For example, take 8 = 'IT/4; we then obtain 0.500 .::; 0.292 ?? (3.9.15) So the quantum-mechanical predictions are not compatible with Bell's inequality. There is a real observable�in the sense of being experimentally verifiable� difference between quantum mechanics and the alternative theo ries satisfying Einstein's locality principle.
3. 9. Spin Correlation Measurements and Bell's Inequality
231
Several experiments have been performed to test Bell's inequality. In one of the experiments spin correlations between the final protons in low-energy proton-proton scattering were measured. In all other experi ments photon-polarization correlations between a pair of photons in a cascade transition of an excited atom (Ca, Hg, . . . ), (3.9.16) or in the decay of a positronium (an e + e - bound state in 1S0) were measured; studying photon-polarization correlations should be just as good in view of the analogy developed in Section 1.1: * sz + ---- £ in x-direction, s - ---- £ in y-direction, (3.9.17) sx + -> £ in 45 ° diagonal direction, s - -> £ in 135° diagonal direction. The results of all recent precision experiments have conclusively established that Bell's inequality was violated, in one case by more than nine standard deviations. Furthermore, in all these experiments the inequality relation was violated in such a way that the quantum-mechanical predict ions were fulfilled within error limits. In this controversy, quantum mecha nics has triumphed with flying colors. The fact that the quantum-mechanical predictions have been verified does not mean that the whole subject is now a triviality. Despite the experimental verdict we may still feel psychologically uncomfortable about many aspects of measurements of this kind. Consider in particular the following point: Right after observer A performs a measurement on particle 1, how does particle 2-which may, in principle, be many light years away from particle 1 -get to " know" how to orient its spin so that the remark able correlations apparent in Table 3.1 are realized? In one of the experi ments to test Bell's inequality (performed by A. Aspect and collaborators) the analyzer settings were changed so rapidly that A's decision as to what to measure could not be made until it was too late for any kind of influence, traveling slower than light, to reach B. We conclude this section by showing that despite these peculiarities we cannot use spin-correlation measurements to transmit any useful infor mation between two macroscopically separated points. In particular, super luminal (faster than light) communications are impossible. Suppose A and B both agree in advance to measure Sz; then, without asking A, B knows precisely what A is getting. But this does not mean that z
X
• It should be kept in mind here that by working with photons we are going outside the realm of nonrelativistic quantum mechanics, which is the subject of this book.
232
Theory of Angular Momentum
A and B are communicating; B just observes a random sequence of positive and negative signs. There is obviously no useful information contained in it. B verifies the remarkable correlations predicted by quantum mechanics only after he or she gets together with A and compares the notes (or computer sheets). It might be thought that A and B can communicate if one of them suddenly changes the orientation of his or her analyzing apparatus. Let us suppose that A agrees initially to measure Sz, and B, Sx. The results of A's measurements are completely uncorrelated with the results of B's measure ments, so there is no information transferred. But then, suppose A suddenly breaks his or her promise and without telling B starts measuring Sx. There are now complete correlations between A's results and B's results. However, B has no way of inferring that A has changed the orientation of his or her analyzer. B continues to see just a random sequence of + 's and - 's by looking at his or her own notebook only. So again there is no information transferred. 3.10. TENSOR OPERATORS Vector Operator
We have been using notations such as x, p, S, and L, but as yet we have not systematically discussed their rotational properties. They are vector operators, but what are their properties under rotations? In this section we give a precise quantum-mechanical definition of vector operators based on their commutation relations with the angular-momentum operator. We then generalize to tensor operators with more-complicated transformation prop erties and derive an important theorem on the matrix elements of vector and tensor operators. We all know that a vector in classical physics is a guantity with three components that transforms by definition like V;--+ L-jR;jVj under a rota tion. It is reasonable to demand that the expectation value of a vector operator V in quantum mechanics be transformed like a classical vector under rotation. Specifically, as the state ket is changed under rotation according to (3.10.1) Ia) ---> �(R)Ia) , the expectation value of V is assumed to change as follows: (aiV,Ia) ---> (al�t(R ) V,�(R)Ia) = [ R iJ (aiJjla). (3.10.2) j This must be true for an arbitrary ket Ia) . Therefore, (3.10.3) �t ( R ) V, �(R) = L R ;}'j j
233
3.10. Tensor Operators
must hold as an operator equation, where R ,1 is the 3 X 3 matrix that corresponds to rotation R. Let us now consider a specific case, an infinitesimal rotation. When the rotation is infinitesimal. we have
�
· it � ( R ) = 1 - ie .
(3.10.4)
We can now write (3.10.3) as V, + .Eli [ V,, J · it] = L;R;1 (it; e)v; . I
. J
In particular, for it along the z-axis, we have R (z ; e) =
so
U
-e
1 0
�),
(3.10.6)
i = 1 : Vx + Jj [ Vx , Jz ] = Vx - eVv E
I
.
i = 2 : v, + i� [ vv , Jz] = eVx + Vv i = 3 : V, + i/i [ V, , Jz ] = V, .
(3.10.5)
E
(3.10.7a) (3.10.7b) (3.10.7c)
This means that V must satisfy the commutation relations
(3.10.8) [ v,, � ] = ieukli Vk . Clearly, the behavior of V under a finite rotation is completely
determined by the preceding commutation relations; we just apply the by-now familiar formula (2.3.47) to exp
(��) v, exp ( -��).
(3.10.9)
We simply need to calculate
(3.10.10) Multiple commutators keep on giving back to us V, or Vk (k * i, j) as in spin case (3.2.7). We can use (3.10.8) as the defining property of a vector operator. Notice that the angular-momentum commutation relations are a special case of (3.10.8) in which we let V, .f;, Vk ...... Jk. Other special cases are [y , Lz l = ilix, [x, Lz] = - iliy, [ px, Lz] = - ilipy, [ py, Lz ] = ilipx; these can be proved explicitly. ......
234
Theory of Angular Momentum
Cartesian Tensors Versus Irreducible Tensors
In classical physics it is customary to define a tensor T;1k ... by generalizing V; --+ 'L1R,1 � as follows: T;jk .. .
--+
L L L . . . R ; , ' R jj' ... T;'j'k ' ...
i' j' k' under a rotation specified by the 3 X 3 orthogonal matrix
(3.10.11)
R. The number of
indices is called the rank of a tensor. Such a tensor is known as a Cartesian tensor.
The simplest example of a Cartesian tensor of rank 2 is a dyadic formed out of two vectors U and V. One simply takes a Cartesian compo nent of U and a Cartesian component of V and puts them together: (3 .10.12)
Notice that we have nine components altogether. They obviously transform like (3.10.1 1) under rotation. The trouble with a Cartesian tensor like (3.10.12) is that it is reducible-that is, it can be decomposed into objects that transform differ ently under rotations. Specifically, for the dyadic in (3.10.12) we have
- �V;) + ( U;� + �V; _ U · V ) . U·V (U;� = U;� 3 8ij + 2 2 3 8ij
(3.10.13)
The first term on the right-hand side, U V, is a scalar product invariant under rotation. The second is an antisymmetric tensor which can be written as vector product eiJk (V X Vh . There are altogether 3 independent compo nents. The last is a 3 X 3 symmetric traceless tensor with 5 ( 6 - 1, where 1 comes from the traceless condition) independent components. The number of independent components checks: •
=
3 X 3 = 1 +3+5.
(3.10.14) We note that the numbers appearing on the right-hand side of (3.10.14) are precisely the multiplicities of objects with angular momentum I = 0, I = 1, and I = 2, respectively. This suggests that the dyadic has been decomposed into tensors that can transform like spherical harmonics with I = 0, 1, and 2. In fact, (3.10.13) is the simplest nontrivial example to illustrate the reduc
tion of a Cartesian tensor into irreducible spherical tensors. Before presenting the precise definition of a spherical tensor, we first give an example of a spherical tensor of rank k. Suppose we take a spherical harmonic lT( 8, 4> ) We have already seen that it can be written as Yim (il), where the orientation of n is characterized by 0 and q,. We now replace n by some vector V. The result is that we have a spherical tensor of rank k (in place of / ) with "magnetic quantum number q (in place of m ), .
3.10. Tensor Operators
235
namely,
( 3 .10.15 ) spherical harmonics with l = 1 and
q T
Specifically, in the case k = 1 we take replace ( jr ) = (it). by Y., and so on.
z
Y0 I =
{f
cos O =
•
V z --+ T.<1> = 4 '17 r 0 4 '17 {f {f -
y ± I = + /3 X ± iy --+ T (l) = /3 1 V 4; fir ± I V 4;
(
-
+
z >
vx ± i� ) .
fi
( 3 .10.16)
Obviously this can be generalized for higher k, for example,
( x ± iy ) 2 --+ <2> / y±2= V v ·v ) 2 ( 3 .10.17 ) T± 2 = ( 2 '17 32 32 '17 x ± l y • r2 T?> are irreducible, just as Yim are. For this reason, working with spherical
15
15
tensors is more satisfactory than working with Cartesian tensors. To see the transformation of spherical tensors constructed in this manner, let us first review how Yim transform under rotations. First, we have for the direction eigenket;
(3 .10.18) I n ) --+ !i! ( R ) I it) = In') , which defines the rotated eigenket In') . We wish to examine how l[m(it') = (it 'II, m) would look in terms of l[m( it ). We can easily see this by starting with
(3.10.19)
m'
and contracting with (itl on the left, using (3.10.18):
lfm (n') = L l[m'( n )!iJ�,� ( R - 1 ) . m
'
( 3 .10.20)
If there is an operator that acts like l[m(V), it is then reasonable to expect m
( 3.10.21 )
'
where we have used the unitarity of the rotation operator to rewrite !iJ�,�( R - I). All this work is just to motivate the definition of a spherical tensor. We now consider spherical tensors in quantum mechanics. Motivated by (3.10.21) we define a spherical tensor operator of rank k with (2k + 1) components as
!iJt( R ) Tq( k )!iJ( R ) =
k
L
q' - -
k
!i!J;t( R ) Tq\k )
( 3 .10 .22a)
236
Theory of Angular Momentum
or, equivalently,
k
� ( R ) Tq( k )�t( R ) = L �J·kq)( R ) Tq(_k ) . ( 3 .10.22b) q' � - k This definition holds regardless of whether Tt > can be written as }/::; q(V) ; for example, ( Ux + i Uv )( Vx + i V,) is the q = + 2 component of a spherical tensor of rank 2 even though, unlike ( Vx + iVv)2, it cannot be written as ykq(V ) .
A more convenient definition of a spherical tensor is obtained by considering the infinitesimal form of (3.10.22b), namely,
( 1 + iJ�n e ) Tql k l ( 1 - iJte ) = ,t q
or
�
-
k
)
(
Tt l( kq 'i l + iJ�ne i kq )
(3 .10.23)
[J · n , T�k)l = 2: T��l(kq ' IJ · n l kq) . q'
(3 . 10.24)
By taking n in the z- and in the (x ± iy) directions and using the nonvanish ing matrix elements of Jz and J [see (3.5.35b) and (3.5.41)] , we obtain (3.10.25a) [ Jz , Tq( k ) ] = liqTq( k ) and ±
(3.10.25b)
These commutation relations can be considered as a definition of spherical tensors in place of (3.10.22). Product of Tensors
We have seen how to form a scalar, vector (or antisymmetric tensor), and a traceless symmetric tensor out of two vectors using the Cartesian tensor language. Of course, spherical tensor language can also be used (Baym 1969, Chapter 17), for example, T,(O) = - U · V = ( U + 1V_ 1 + U_ 1 V + 1 - U0V0 ) 0
(IJ _
Tq -
3
3
(U x V) q i.fi '
T(l) ±2- u ± 1 v± 1 ' lo -
7' (2)
(3 .10.26)
U 1 V _ 1 + 2 U0 V0 + U _ 1 V + 1 +
'
16
3. 10. Tensor Operators
237
where Uq( Vq ) is the qth component of a spherical tensor of rank 1, corresponding to vector U(V). The preceding transformation properties can be checked by comparing with �m and remembering that U ( ux + i UY )I 12' U_ 1 = ( ux - iU, )/ 12' Uo uz ' A similar check can be made for V For instance, -
+I
=
+ 1 0•
3z2 - r 2
_ yo -
2
where 3z2 - r 2 can be written as 2z2 + 2
[
_
rz (
X
+ iy)
fi T0(2)
=
,
]
( - iy ) . X
fi
' for U = V = r.
hence, Y2° is just a special case of A more systematic way of forming tensor products goes as follows. We start by stating a theorem: Let XJ,' Il and zJ;'J be irreducible spherical tensors of rank k 1 and k 2 , respectively. Then Tq< k l L [ ( k1k 2; q 1q 2 lk 1 k 2 ; kq ) X�,k , lzg ,J ( 3.10.27 ) Theorem.
=
q, q,
is a spherical (irreducible) tensor of rank k. Proof We must show that under rotation according to (3.10.22)
T? l
must transform
f!Ct( R ) Tq( k lf!C ( R ) = L [ (k1k2; q1q2i k 1 k 2 ; kq ) q, q, X !0t( R ) XJ,k , l!0( R ) !0t( R ) z�:,l!0 ( R ) =
=
E E E E ( k 1 k 2 ; q1q2 i k 1 k 2 ; kq) q] q2 q} q2
L: E E E E E E < k l k 2 ; q1q2 lk1k 2 ; kq) k " ql q2 q} q2 q" q'
X ( k 1 k 2 ; q1q2 i k 1 k 2 ; k"q") E!CJ.k;;) ( R ) XJt' Z��, J , -I
l
where we have used the Clebsch-Gordan series formula (3.7.69). The preced ing expression becomes =
L L L L L likk',l)qq"(ktk 2 ; q{q�lk1k 2 ; k "q') !0J!';:,> ( R - 1 ) XJt'lZ�f'l, k " q{ q?_ qn q '
238
Theory of Angular Momentum
where we have used the orthogonality of Clebsch-Gordan coefficients (3.7.42). Finally, this expression reduces to
(
)
= L L L
q{ q2
-
= L Tq�k)��!'j ( R 1 ) = L��;t(R ) Tq�k) q'
D
q'
The foregoing shows how we can construct tensor operators of higher or lower ranks by multiplying two tensor operators. Furthermore, the manner in which we construct tensor products out of two tensors is completely analogous to the manner in which we construct an angular momentum eigenstate by adding two angular momentums; exactly the same Clebsch-Gordan coefficients appear if we let k 1• 2 ..... }1• 2 , q1 , 2 ..... m 1 • 2 • Matrix Elements of Tensor Operators; the Wigner-Eckart Theorem
In considering the interactions of an electromagnetic field with atoms and nuclei, it is often necessary to evaluate matrix elements of tensor operators with respect to angular-momentum eigenstates. Examples of this will be given in Chapter 5. In general, it is a formidable dynamic task to calculate such matrix elements. However, there are certain properties of these matrix elements that follow purely from kinematic or geometric considerations, which we now discuss. First, there is a very simple m-selection rule: m-selection Rule
(a', j 'm'I Tq(klla, jm ) = 0, unless m'= q + m .
( 3.10.28 )
Using (3.10.25a), we have (a', j'm'l( [ Jz , T,?> ] - nqTq< k J) Ia , jm) = [ (m' - m ) n - llq] X ( j 'm'I Tq(k) ia, jm ) = 0; hence, 0 (a', j'm'ITq< k > la , jm) = 0 unless m ' = q + m. Proof
' o: ,
Another way to see this is to note that transformation property of Tq< k > la , jm ) under rotation, namely, ( 3.10.29) If we now let � stand for a rotation operator around the z-axis, we get [see
239
3 . 1 0. Tensor Operators
(3.10.22b) and (3.1.16)] (3.10.30)
which is orthogonal to I a' ,j' m ') unless q + m = m ' . W e are going to prove one of the most important theorems in quan tum mechanics, the Wigner-Eckart theorem.
The Wigner-Eckart Theorem . The matrix elements of tensor operators with respect to angular-momentum eigenstates satisfy
< a J 'IIT ( k ) ll aj) , ..j < a ' , j 'm 'ITq( k ) la , jm) = ( jk ; mqi Jk ; j 'm')
2) + 1
(3.10.31)
where the double-bar matrix element is independent of m and m ', and q .
Before we present a proof of this theorem, let us look at its significance. First, we see that the matrix element is written as the product of two factors. The first factor is a Clebsch-Gordan coefficient for adding j and k to get j '. It depends only on the geometry, that is, the way the system is oriented with respect to the z-axis. There is no reference whatsoever to the particular nature of the tensor operator. The second factor does depend on the dynamics, for instance, a may stand for the radial quantum number and its evaluation may involve, for example, evaluation of radial integrals. On the other hand, it is completely independent of the magnetic quantum numbers m , m ', and q, which specify the orientation of the physical system. To evaluate (a', j 'm 'I T?ll a, jm ) with various combina tions of m , m ' and q ' it is sufficient to know just one of them; all others can be related geometrically because they are proportional to Clebsch Gordan coefficients, which are known. The common proportionality factor is ( a 'i 'II T ( k lll a )), which makes no reference whatsoever to the geometric features. The selection rules for the tensor operator matrix element can be immediately read off from the selection rules for adding angular momen tum. Indeed, from the requirement that the Clebsch-Gordan coefficient be nonvanishing, we immediately obtain the m-selection rule (3.10.28) derived before and also the triangular relation (3.10.32) l j - k l < j' < j + k. Now we prove the theorem. ,
Proof
Using (3.10.25b) we have
( a' , j 'm 'l[ J ± ' Tq(k ) ] Ia , jm ) = nJ( k + q ) ( k ± q + 1) < a', j 'm'ITq(�\la , Jm) ,
(3.10.33)
240
Theory of Angular Momentum
or using (3.5.39) and (3.5.40) we have
j( J' ± m ') ( J' + m ' + 1 ) (a' , }' , m ' + li T? l la , jm) = j( J � m )(j ± m + 1) (a' , j 'm 'IT?>Ia , j, m ± 1) + j( k + q ) ( k ± q + 1 ) (a' , j 'm 'I Tq< �l1 1a , jm) .
(3 . 10 . 34)
Compare this with the recursion relation for the Clebsch-Gordan coefficient (3.7.49). Note the striking similarity if we substitute j' ---+ j, m ' ---+ m, j ---+ j1, m ---+ m1, k ---+ )2, and q ---+ m2• Both recursion relations are of the form 'L1a11x1 = 0, that is, first-order linear homogeneous equations with the same coefficients a;J ' Whenever we have
L:a;1x1 = 0, j
we cannot solve for the x1 (or ratios; so
(3. 10.35)
Y) individually but we can solve for the (3.10.36)
where c is a universal proportionality factor. Noting that (j1 f2; m 1 , m 2 ± 1111}2; jm) in the Clebsch-Gordan recursion relation (3.7.49) corresponds to ( a', j 'm 'ITq1l a, jm ), we see that
( a ', j 'm 'ITq11 a , jm) = (universal proportionality constant independent of
(3.10.37) m , q , and m ')(Jk; m q ± 1 1Jk; j 'm ') , D which proves the theorem. Let us now look at two simple examples of the Wigner-Eckart theorem.
Example 1. Tensor of rank 0, that is, scalar T0<0l = S. The matrix element of a scalar operator satisfies
a 'II S IIaJ) (a ' , j 'm 'IS ! a , jm ) = 1311 .13mm' ( } J2J + l
(3.10.38)
because S acting on Ia, jm) is like adding an angular momentum of zero. Thus the scalar operator cannot change j, m values.
Example 2. Vector operator which in the spherical tensor language is a rank 1 tensor. The spherical component of V can be written as have the selection rule !1m = m ' - m = ± 1 , 0
A · LJ.1
= 1· t - 1· =
{ ±0.1
Vq_
± l.O•
so we
(3.10.39)
241
3.10. Tensor Operators
In addition, the 0 0 transttlon is forbidden. This selection rule is of fundamental importance in the theory of radiation; it is the dipole selection rule obtained in the long-wavelength limit of emitted photons.* ---+
For j = j ' the Wigner-Eckart theorem-when applied to the vector operator-takes a particularly simple form, often known as the projection theorem for obvious reasons. The Projection Theorem
' . ' lq l Jm < a ,, Jm . 'lVq I a , Jm . - ( a , jm i.J • V I a , jm ) ( Jm )I . ), 2
n J( J + 1 ) where analogous to our discussion after (3.10.26) we choose - 1 ( Jx ± iJ., ) = +- 1 J + , J + l = + !"> FO -
v2
·
v2 -
Noting (3.10.26) we have ( a ', JmiJ • V ia, jm ) = ( a ', Jm i(J0V0 - 1 +IV Proof
-I -
= mn ( a ', Jm ! V0 1 a, jm ) +
J
-
(3.10.40)
(3.10.41)
F+l) Ia, jm )
fi J( j + m ) ( J - m + l ) n
X ( a ', j m - l ! V- 1 I a, jm )
- � J( i - m )(J
+ m + 1)
( a', j m + 1!V+ 1Ia, jm ) (3.10.42)
= C1m(a}IIV IIaJ)
by the Wigner-Eckart theorem (3.10.31), where c1 m is independent of a, a', and V, and the matrix elements of V0 + are all proportional to the double-bar matrix element (sometimes also called the reduced matrix ele ment). Furthermore, c1m is independent of m because J • V is a scalar operator, so we may as well write it as ci ' Because c1 does not depend on V, (3.10.42) holds even if we let V ---+ J and a' --+- a, that is, 1
(3.10.43)
Returning to the Wigner-Eckart theorem applied to Vq and Jq , we have ( a', Jm '! Vq l a, Jm ) - ( a'i iiY IIaJ) (3 .10.44) ( a, jm 'l lq l a, jm ) - ( aJIIJ I I aJ) . *Additional parity selection rules are discussed in Chapter 4, Section 2. They lead to these
£1 dipole selection rules.
Theory of Angular Momentum
242
But the right-hand side of (3.10.44) is the same as (a', jm iJ • V Ia, jm ) 2 j(a, jm iJ 1a, jm ) by (3.10.42) and (3.10.43). Moreover, the left-hand side of (3.10.43) is just j(j + 1)11 2. So < ' . 'lV I ( m a, m . . I lq l 1m ) , a , }m q a, Jm ) = a:', J 2 i.J · V I J ) ( Jm '
n J(J + 1)
.
(3 . 10 45)
which proves the projection theorem.
.
0
We will give applications of the theorem in subsequent sections. PROBLEMS
1. Find the eigenvalues and eigenvectors of = ( 0i - 0I ) . Suppose an electron is in the spin state ( ; ) . If s v is measured, what is the probability of the result lt/2? 2. Consider the 2 X 2 matrix defined by a y
u=
a0 + io •a aa - io ·a
-:-"'---
where a0 is a real number and a is a three-dimensional vector with real components. a. Prove that U is unitary and unimodular. b. In general, a 2 X 2 unitary unimodular matrix represents a rotation in three dimensions. Find the axis and angle of rotation appropriate for U in terms of a0, a 1 , a 2 , and a3• 3. The spin-dependent Hamiltonian of an electron-positron system in the presence of a uniform magnetic field in the z-direction can be written as H = AS (e-J.s(e+l + (:: ) ( szv> - sy l ) . Suppose the spin function of the system is given by x(r >x� + l. a. Is this an eigenfunction of H in the limit A --+ 0, eBjmc * 0? If it is, what is the energy eigenvalue? If it is not, what is the expectation value of H? b. Same problem when eB1 me --+ 0, A * 0. 4. Consider a spin 1 particle. Evaluate the matrix elements of Sz ( Sz + n )( Sz - n ) and Sx ( Sx + lt ) ( Sx - lt ). 5. Let the Hamiltonian of a rigid body be K12 K1 K'f H = -z1 f; + f'; + J; , where K IS the angular momentum in the body frame. From this
(
)
243
Problems
expression obtain the Heisenberg equation of motion for K and then find Euler's equation of motion in the correspondence limit. 6. Let U = e;c,ae ;c,f3e'c'Y, where ( a, /3, y) are the Eulerian angles. In order that U represent a rotation (a, /3 , y ), what are the commutation rules satisfied by the G k? Relate G to the angular momentum operators. 7. What is the meaning of the following equation: u- 1A P = 'L, R k,A, ,
where the three components of A are matrices? From this equation show that matrix elements ( m i A k l n ) transform like vectors. 8. Consider a sequence of Euler rotations represented by � < 112l ( a , ,B, y ) = exp
ia y ) ( - i/a a ) exp ( - i/a f3 ) exp ( - /
e - i(a+y)/2 cos
.B 2
e'
- e - ,( a - y)/2 sin
e'(a + y)/2 cos f3
f32
2
Because of the group properties of rotations, we expect that this sequence of operations is equivalent to a single rotation about some axis by an angle (). Find (). 9. a. Consider a pure ensemble of identically prepared spin ! systems. Suppose the expectation values ( Sx) and ( Sz ) and the sign of ( S,.) are known. Show how we may determine the state vector. Why is it unnecessary to know the magnitude of (Sv)? b. Consider a mixed ensemble of spin ! systems. Suppose the ensemble averages [ Sx], [Sv], and [S=] are all known. Show how we may construct the 2 X 2 density matrix that characterizes the ensemble. 10. a. Prove that the time evolution of the density operator p (in the Schrodinger picture) is given by p ( t ) =
b. Suppose we have a pure ensemble at t = 0. Prove that it cannot evolve into a mixed ensemble as long as the time evolution is governed by the Schrodinger equation. 11. Consider an ensemble of spin 1 systems. The density matrix is now a 3 X 3 matrix. How many independent (real) parameters are needed to characterize the density matrix? What must we know in addition to [ SxJ, [ Sv ], and [ S=] to characterize the ensemble completely? 12. angular-momentum eigenstate IJ, m = m max = j) is rotated by an infinitesimal angle e about the y-axis. Without using the explicit form of the d!,{,� function, obtain an expression for the probability for the new2 rotated state to be found in the original state up to terms of order e .
An
Theory of Angular Momentum
244
13. Show that the 3 x 3 matrices G; ( i = 1, 2, 3) whose elements are given by ( G, ) Jk = - ih e,1k ,
where j and k are the row and column indices, satisfy the angular momentum commutation relations. What is the physical (or geometric) significance of the transformation matrix that connects G; to the more usual 3 X 3 representations of the angular-momentum operator J, with 13 taken to be diagonal? Relate your result to V V + Mcp X V under infinitesimal rotations. (Note: This problem may be helpful in understanding the photon spin.) 14. a. Let J be angular momentum. It may stand for orbital L, spin S, or J,o,.,.) Using the fact that Jx, Jv, Jz ( J Jx ± U,.) satisfy the usual angular-momentum commutation relations, prove J 2 = Jz2 + J J - - hJz . b. Using (a) (or otherwise), derive the "famous" expressiOn for the coefficient that appears in --+
± =
+
c_
J - >J!,,
= c_ >J!,. m - 1 '
15. The wave function of a particle subjected to a spherically symmetrical potential V( r ) is given by >j� ( x) = ( + y + 3z )f( r ) . a. Is >jJ an eigenfunction of L2 ? If so, what is the /-value? If not, what are the possible values of I we may obtain when L2 is measured? b. What are the probabilities for the particle to be found in various m 1 states? c. Suppose it is known somehow that >jl (x) is an energy eigenfunction with eigenvalue E. Indicate how we may find V(r). 16. A particle in a spherically symmetrical potential is known to be in an eigenstate of L2 and Lz with eigenvalues 11 2/(1 + 1) and mn , respec tively. Prove that the expectation values between l im ) states satisfy [/(/ + 1)11 2 - m 2 n2] )= )=O x
( Lx
(Lv
,
( L� ) = (L� ) =
2
·
Interpret this result semiclassically. 17. Suppose a half-integer /-value, say ! , were allowed for orbital angular momentum. From we may deduce, as usual, Y1 12 , 112 ( 8, cJ> ) ex: e""12../sin8 .
245
Problems
Now try to construct Y112• 1 ( 8 .p); by (a) applying L_ to Y112• 112 ( fJ, .P ) ; and (b) using L _ Y112• _ 112 ( fJ, cp) 0. Show that the two procedures lead to contradictory results. (This gives an argument against half-integer /-values for orbital angular momentum.) 18. Consider an orbital angular-momentum eigenstate 1/ = 2, m = 0). Sup pose this state is rotated by an angle f3 about the y-axis. Find the probability for the new state to be found in m = 0, ± 1, and ± 2. (The spherical harmonics for I= 0, 1 , and 2 given in Appendix A may be useful.) 19. What is the physical significance of the operators K = at a� and K a+a_ in Schwinger's scheme for angular momentum? Give the nonvanishing matrix elements of K 20. We are to add angular momenta fi = 1 and h = 1 to form j = 2, 1 , and states. Using either the ladder operator method or the recursion 0relation, express all (nine) { j, m } eigenkets in terms of 1 )1)2 ; m 1 m 2 ) . Write your answer as _
1 2
+
,
=
_
=
±·
I J =1, m =1 ) = �1+ ,0) - �10, + ) , .. . , where + and 0 stand for m u =1, 0, respectively. 21. a. Evaluate L l d�1· (f3) 1 1 m j
m = -
j
for any j (integer or half-integer); then check your answer for j =!. b. Prove, for any j,
t m 2 ld�.� (f3)1 2 = �j(j + 1) sin2{3 + m '2 � (3cos2{3 -1).
m = -· 1
This can be proved in many ways. You may, for instance, examine the rotational properties of 1} using the spherical (irreduci ble) tensor language.] 22. a. Consider a system with j = 1. Explicitly write (j =1, m 'l fvl i =1, m) in 3 X 3 matrix form. b. Show that for j = 1 only, it is legitimate to replace e- '1r!ll� by 1 - i ( hJy ) sin {3 - hJy 2( 1 - cos {3 ) . [ Hint:
( )
Theory of Angular Momentum
246
c. Using (b), prove d(j� l ) ( /3 ) =
( � )( 1 + cos/3)
( � ) sin /3
23.
( � ) sin /3
( � ) ( 1 - cos /3 )
( � ) sin /3
( � ) (1 + cos /3 )
-
-
cos/3
( � ) (1 - cos/3 )
( � ) sin /3
Express the matrix element (a2/32y2 l lfl a1/31y1 ) in terms of a series in E&�n ( af3 y ) = (af3 y 1 Jmn ) .
24. Consider a system made up of two spin -l- particles. Observer A specializes in measuring the spin components of one of the particles (s12, s1x and so on), while observer B measures the spin components of the other particle. Suppose the system is known to be in a spin-singlet state, that is, stotal = 0. a. What is the probability for observer A to obtain s1z h /2 when observer B makes no measurement? Same problem for s1 x = h /2. b. Observer B determines the spin of particle 2 to be in the s22 h /2 state with certainty. What can we then conclude about the outcome of observer A's measurement if (i) A measures s 1 z and (ii) A measures s1) Justify your answer. 25. Consider a spherical tensor of rank 1 (that is, a vector) =
=
v<±l I) =
+
VX ± iVv
fi
.
v; ( l) = v
'
0
z·
Using the expression for J U � I ) given in Problem 22, evaluate
I,q d ��- ( f3 ) vq<,l) '
and show that your results are just what you expect from the transfor mation properties of Vx. under rotations about the y-axis. 26. a. Construct a spherical tensor of rank 1 out of two different vectors U = ( Ux, Uv , Uz) and V = ( Vx, V.v , V:, ). Explicitly write T�1l_ 0 in terms of ux, y , z and vx, v . z• b. Construct a spherical tensor of rank 2 out of two different vectors U and V. Write down explicitly T��- 1 , 0 in terms of Ux , v . and Vx, 27. Consider a spinless particle bound to a fixed center by a central force potential. .v.
z
±
z
v.z·
Problems
247
a. Relate, as much as possible, the matrix elements 1 ( n ' I m 'I + fi ( x ± iy ) In , I, m) and ( n ' , I ', m 'I In , I, m) ,
z
',
using only the Wigner-Eckart theorem. Make sure to state under what conditions the matrix elements are nonvanishing. b. Do the same problem using wave functions 1/-(x) = R ,1(r)Y/"( IJ, cp ). 28. a. Write xy, xz, and (x2 - y2) as components of a spherical (irreduci ble) tensor of rank 2. b. The expectation value Q = e (o: , j, m = j i (3z 2 - r 2 ) l o: , j, m = j) is known as the quadrupole moment. Evaluate e ( o:, j, m'l( x 2 - y2 ) 1o:, j, m j) , (where m' j, j j 2, . . . ) in terms of Q and appropriate =
-
1,
=
-
Clebsch-Gordan coefficients. spin � nucleus situated at the origin is subjected to an external inhomogeneous electric field. The basic electric quadrupole interaction may by taken to be
29. A
Hint =
2s(se�1 ) n2 [( ::� ts; + ( ::� Ls; + ( ::� ts/ l
where cp is the electrostatic potential satisfying Laplace's equation and the coordinate axes are so chosen that
) a z
energy eigenkets (in terms of lm), where m = ± �. ± �) and the corre sponding energy eigenvalues. Is there any degeneracy? A
CHAPTER 4
Symmetry in Quantum Mechanics
Having studied the theory of rotation in detail, we are in a positiOn to discuss, in more general terms, the connection between symmetries, degener acies, and conservation laws. We have deliberately postponed this very important topic until now so that we can discuss it using the rotation symmetry of Chapter 3 as an example. 4.1. SYMMETRIES, CONSERVATION LAWS, AND DEGENERACIES Symmetries in Classical Physics
We begin with an elementary review of the concepts of symmetry and conservation law in classical physics. In the Lagrangian formulation of quantum mechanics, we start with the Lagrangian L, which is a function of a generalized coordinate q1 and the corresponding generalized velocity q,. If L is unchanged under displacement, (4.1 .1) q, -> q, + 8 q, , then we must have aL = O . (4.1 .2) aq, 248
4.1 . Symmetries, Conservation Laws, and Degeneracies
It then follows, by virtue of the Lagrange equation, JLj Jq; = 0, that
249
djdt ( JLjJq, )
dp , = 0, dt
(4.1 .3)
JL . p, = a q· i
( 4.1 .4)
dp . =0 dt
(4.1.5)
JH = O. J qi
( 4.1 .6)
where the canonical momentum is defined as So if L is unchanged under displacement (4.1.1), then we have a conserved quantity, the canonical momentum conjugate to q,. Likewise, in the Hamiltonian formulation based on H regarded as a function of q; and P; , we have '
whenever
So if the Hamiltonian does not explicitly depend on q, , which is another way of saying H has a symmetry under q, --> q; + 8q;, we have a conserved quantity. Symmetry in Quantum Mechanics
In quantum mechanics we have learned to associate a unitary oper ator, say Y', with an operation like translation or rotation. It has become customary to call Y' a symmetry operator regardless of whether the physical system itself possesses the symmetry corresponding to Y'. Further, we have learned that for symmetry operations that differ infinitesimally from the identity transformation, we can write ( 4.1 .7) where G is the Hermitian generator of the symmetry operator in question. Let us now suppose that H is invariant under .'1'. We then have ( 4.1 .8) But this is equivalent to [ G , H ] = O. (4.1 .9) By virtue of the Heisenberg equation of motion, we have dG = · O' ( 4.1 .10) dt
Symmetry in Quantum Mechanics
250
hence, G is a constant of the motion. For instance, if H is invariant under translation, then momentum is a constant of the motion; if H is invariant under rotation, then angular momentum is a constant of the motion. It is instructive to look at the connection between (4.1.9) and conservation of G from the point of view of an eigenket of G when G commutes with H. Suppose at t0 the system is in an eigenstate of G. Then the ket at a later time obtained by applying the time-evolution operator (4.1 .11) is also an eigenket of G with the same eigenvalue g . In other words, once a ket is a G eigenket, it is always a G eigenket with the same eigenvalue. The proof of this is extremely simple once we realize that (4.1.9) and (4.1.10) also imply that G commutes with the time-evolution operator, namely, G [ U( t , t0 ) l g ')] = U( t , t0 ) G i g ') = g ' [ U( t , t 0 ) l g ')] . (4.1 .12) '
Degeneracies
Let us now turn to the concept of degeneracies. Even though degeneracies may be discussed at the level of classical mechanics-for instance in discussing closed (nonprecessing) orbits in the Kepler problem (Goldstein 1980)-this concept plays a far more important role in quantum mechanics. Let us suppose that [ H, sP ] = O
(4.1 .13)
for some symmetry operator and In) is an energy eigenket with eigenvalue En . Then sP i n ) is also an energy eigenket with the same energy, because (4.1 .14) H(Y'In)) = sPH i n ) = En ( sP i n )) . Suppose I n ) and sP i n ) represent different states. Then these are two states with the same energy, that is, they are degenerate. Quite often sP is characterized by continuous parameters, say 7\, in which case all states of the form sP( l\ ) l n ) have the same energy. We now consider rotation specifically. Suppose the Hamiltonian is rotationally invariant, so [ §J ( R ) , H ] = 0, ( 4.1 .15 ) which necessarily implies that [J , H ] = O, ( 4.1 .16) We can then form simultaneous eigenkets of H, J 2, and '=' denoted by In; j, m ) . The argument just given implies that all states of the form §J ( R ) In ; j, m ) (4.1.17) have the same energy. We saw in Chapter 3 that under rotation different
4.2.
Discrete Symmetries, Parity, or Space Inversion
251
m-values get mixed up. In general, �(R)In; j, m) is a linear combination of 2 j + 1 independent states. Explicitly, (4. 1. 18) qn(R) I n ; j,m) = L i n ; j,m')qn �)m(R) , m'
and by changing the continuous parameter that characterizes the rotation operator £iJ( R ), we can get different linear combinations of In; j, m '). If all states of form � ( R )In; j, m) with arbitrary �( R ) are to have the same energy, it is then essential that each of In; j, m) with different m must have the same energy. So the degeneracy here is (2} + 1)-fold, just equal to the number of possible m-values. This point is also evident from the fact that all states obtained by successively applying J which commutes with H, to In; jm) have the same energy. As an application, consider an atomic electron whose potential is written as V( r ) + VL5(r)L • S. Because r and L • S are both rotationally invariant, we expect a (2) + 1)-fold degeneracy for each atomic leveL On the other hand, suppose there is an external electric or magnetic field, say in the z-direction. The rotational symmetry is now manifestly broken; as a result, the (2} + 1)-fold degeneracy is no longer expected and states characterized by different m-values no longer have the same energy. We will examine how this splitting arises in Chapter 5. ±'
4.2. DISCRETE SYMMETRIES, PARITY, OR SPACE INVERSION
So far we have considered continuous symmetry operators-that is, oper ations that can be obtained by applying successively infinitesimal symmetry operations. All symmetry operations useful in quantum mechanics are not necessarily of this form. In this chapter we consider three symmetry operations that can be considered to be discrete, as opposed to continuous -parity, lattice translation, and time reversaL The first operation we consider is parity, or space inversion. The parity operation, as applied to transformation on the coordinate system, changes a right-handed (RH) system into a left-handed (LH) system, as shown in Figure 4.1. However, in this book we consider a transformation on state kets rather than on the coordinate system. Given Ia), we consider a space-inverted state, assumed to be obtained by applying a unitary operator 7T known as the parity operator, as follows: (4.2.1) Ia ) --> 7Ti a ) . We require the expectation value of taken with respect to the space inverted state to be opposite in sign. ( 4.2.2) x
Symmetry in Quantum Mechanics
252
z
RH
LH
y X FIGURE
new
/
/
/
/
/
/ new x
/ y - - - - - - -¥-----/
new z 4.1. Right-handed (RH) and left-handed (LH) systems.
a very reasonable requirement. This is accomplished if (4.2.3)
or X '7T =
-
'7TX,
(4.2.4)
where we have used the fact that '7T is unitary. In other words, x and must anticommute. How does an eigenket of the position operator transform under parity? We claim that '7T
(4.2.5)
where e ;6 is a phase factor ( lJ real). To prove this assertion let us note that X'7Tix') = - '7TXIx ') = ( x' ) '7Tix') . ( 4.2.6) This equation says that '7T ix ') is an eigenket of x with eigenvalue x ', so it must be the same as a position eigenket 1 - x ') up to a phase factor. It is customary to take e ;6 = 1 by convention. Substituting this in (4.2.5), we have '7T 2 1x ') = lx'); hence, '7T 2 = 1-that is, we come back to the same state by applying twice. We easily see from (4.2.5) that '7T is now not only unitary but also Hermitian: -
-
'7T
(4.2.7)
Its eigenvalue can be only + 1 or - 1 . What about the momentum operator? The momentum p is like m d xjdt so it is natural to expect it to be odd under parity, like x. more satisfactory argument considers the momentum operator as the generator of translation. Since translation followed by parity is equivalent to parity followed by translation in the opposite direction, as can be seen from Figure ,
A
4.2. Discrete Symmetries, Parity, or Space Inversion
253
4.2, then 7TY( dx') Y{ - dx') 7T
( 4.2.8)
( 1 - ip·dx') 7Tt = 1 + ip · dx' n n , { 7T,p } 0 or 7T t p7T - p.
( 4.2.9)
=
7T
from which follows
( 4.2.10) We can now discuss the behavior of J under parity. First, for orbital angular momentum we clearly have [7r,L] = O (4.2.11 ) because L = xxp, {4.2.12) and both x and p are odd under parity. However, to show that this property also holds for spin, it is best to use the fact that J 1s the generator of rotation. For 3 X 3 orthogonal matrices, we have ( 4.2.13) where explicitly =
=
( 4.2.14) that is, the parity and rotation operations commute. In quantum mechanics, it is natural to postulate the corresponding relation for the unitary oper ators, so (4.2.15) 7T!lJ( R ) = !1J ( R ) 7T ,
dx '
-dx ' FIGURE
4.2.
Translation followed by parity, and vice versa.
Symmetry in Quantum Mechanics
254
where � ( R ) = 1 - iJ · n j li . From (4.2.15) it follows that ( 4.2.16) [ 'IT,J] = 0 or 'ITtJ'IT = J. This together with (4.2.11) means that the spin operator (given by J = L + S) also transforms in the same way as L. Under rotations, x and J transform in the same way, so they are both vectors, or spherical tensors, of rank 1 . However, x (or p) is odd under parity [see (4.2.3) and (4.2.10)], while J is even under parity [see (4.2.16)]. Vectors that are odd under parity are called polar vectors, while vectors that are even under parity are called axial vectors, or pseudovectors. Let us now consider operators like S · x. Under rotations they trans form like ordinary scalars, such as S • L or x • p. Yet under space inversion we have (4.2.17) '1T - 1S · x'IT = - S · x while for ordinary scalars we have (4.2 .18) '1T - 1L · S'1T = L · S and so on. The operator S · x is an example of a pseudoscalar. e
'
Wave Functions Under Parity
Let us now look at the parity property of wave functions. First, let >/; be the wave function of a spinless particle whose state ket is I a ) : (4.2.19) 1/; (x ') = (x 'la) . The wave function of the space-inverted state, represented by the state ket 'IT i a ), is (x'I 'IT i a) = ( - x'la) = >/; ( - x ' ) . ( 4.2.20) Suppose I a) is an eigenket of parity. We have already seen that the eigenvalue of parity must be ± 1, so (4.2.21) 'IT ia) = ± Ia). Let us look at its corresponding wave function, (4.2.22) (x'I'IT ia) = ± (x 'la) . But we also have ( 4.2.23) (x 'I 'IT i a ) = ( - x 'l a ) , so the state I a ) is even or odd under parity depending on whether the corresponding wave function satisfies even parity, (4.2.24) >/; ( - x ) ± 1/; (x') odd parity.
,
=
{
4.2. Discrete Symmetries, Parity, or Space Inversion
255
Not all wave functions of physical interest have definite parities in the sense of (4.2.24). Consider, for instance, the momentum eigenket. The momentum operator anticommutes with the parity operator, so the momentum eigenket is not expected to be a parity eigenket. Indeed, it is easy to see that the plane wave, which is the wave function for a momentum eigenket, does not satisfy (4.2.24). An eigenket of orbital angular momentum is expected to be a parity eigenket because L and w commute [see (4.2.11)]. To see how an eigenket of 2 L and Lz behaves under parity, let us examine the properties of its wave function under space inversion,
( 4.2.25) The transformation
x
'
--> - x
'
r -> r 8 --> w - 8
> --> > + TT Using the explicit form of
is accomplished by letting (cos 8 --> - cos 8 ) ( e trn
( 4.2.26)
(2/+ 1)(/- m)! P m ( cos e) eim$ (4.2.27) 41r(l+ m)! 1 for positive m, with (3.6.38), where 1 -l m l 21 . ) d ! j m j ) ( ) i (/+ . m + l pm1 l (cos 8) = zit! (l- jm j ) ! "" ' 8 d(co e) sin 6, (4.2.28) ym 1 = ( - l )m
'
Sill -
we can readily show that
(
s
( 4.2.29) as 8 and <J> are changed, as in (4.2.26). Therefore, we can conclude that ' ( 4.2.30) wla, lm ) = ( - l ) la, lm ) . It is actually not necessary to look at Yt; an easier way to obtain the same result is to work with m = 0 and note that L'± II, m = 0) ( r = 0, 1, . . . , / )
must have the same parity because w and ( L ± )' commute. Let us now look at the parity properties of energy eigenstates. We
begin by stating a very important theorem. Theorem.
Suppose [H, w ] = O
and I n ) is a non degenerate eigenket of H with eigenvalue En:
(4.2.31) (4.2.32)
Symmetry in Quantum Mechanics
256
then I n ) is also a parity eigenket. Proof
We prove this theorem by first noting that
(4.2.33) H l ± 'IT ) In ) is a parity eigenket with eigenvalues ± 1 (just use 'IT 2 = 1 ). But this is also an
energy eigenket with eigenvalue En. Furthermore, I n ) and (4.2.33) must represent the same state, otherwise there would be two states with the same energy-a contradiction of our nondegenerate assumption. It therefore follows that I n ) , which is the same as (4.2.33) up to a multiplicative constant, must be a parity eigenket with parity ± 1 . 0
As an example, let us look at the simple harmonic oscillator (SHO). The ground state 10) has even parity because its wave function, being Gaussian, is even under x' ---> - x'. The first excited state, (4.2.34)
must have an odd parity because at is linear in and p, which are both odd [see (2.3.2)]. In general, the parity of the nth excited state of the simple harmonic operator is given by ( - 1 r. It is important to note that the nondegenerate assumption is essential here. For instance, consider the hydrogen atom in nonrelativistic quantum mechanics. As is well known, the energy eigenvalues depend only on the principal quantum number n (for example, 2 p and 2s states are degenerate) -the Coulomb potential is obviously invariant under parity-yet an energy eigenket x
(4.2.35)
is obviously not a parity eigenket. As another example, consider a momentum eigenket. Momentum anticommutes with parity, so-even though free-particle Hamiltonian H is invariant under parity-the momentum eigenket (though obviously an energy eigenket) is not a parity eigenket. Our theorem remains intact because we have here a degeneracy between IP') and 1 - p'), which have the same energy. In fact, we can easily construct linear combinations (1/ vl )( lp') + l - p')), which are parity eigenkets with eigenvalues ± 1. In terms of wave-function language, e iJi · x'/h does not have a definite parity, but cosp'•x'/ h and sinp'•x'/ h do. Symmetrical Double-Well Potential
As an elementary but instructive example, we consider a symmetrical double-well potential; see Figure 4.3. The Hamiltonian is obviously in variant under parity. In fact, the two lowest lying states are as shown in
4.2. Discrete Symmetries, Parity, or Space Inversion
Anti-symmetric lA>
Symmetric IS> FIGURE 4.3.
257
The symmetrical double well with the two lowest lying states IS) (sym
metrical) and l A ) (antisymmetrical) shown.
Figure 4.3, we can see by working out the explicit solutions involving sine and cosine in classically allowed regions and sinh and cosh in the classically forbidden region. The solutions are matched where the potential is discon tinuous; we call them the symmetrical state and the antisymmetrical state Of course, they are simultaneous eigenkets of H and Calcula tion also shows that EA > E5, (4.2.36 ) which we can infer from Figure 4.3 by noting that the wave function of the antisymmetrical state has a greater curvature. The energy difference is very tiny if the middle barrier is high, a point which we will discuss later. We can form 4.2.37a)
IS)
l A ).
and
I L ) = �(IS ) - IA ) ) .
'TT .
(
(4.2.37b )
The wave functions of (4.2.37a) and (4.2.37b) are largely concentrated in the right-hand side and the left-hand side, respectively. They are obviously not parity eigenstates; in fact, under parity and are interchanged. Note that they are not energy eigenstates either. Indeed, they are classical examples of nonstationary states. To be precise, let us assume that the system is represented by at t = At a later time, we have 1 fi ( e � • E,t/ h iS ) + e � ' EAtl h iA ) )
I R ) 0. I R , to = 0; t) =
IR) IL)
(4.2.38
)
Symmetry in Quantum Mechanics
258
At time t T/2 2'1Tiij2(EA - £5), the system is found in pure IL). At t T, we are back to pure IR), and so forth. Thus, in general, we have an oscillation between IR) and IL) with angular frequency =
=
=
( EA - Es ) li
( 4.2.39) This oscillatory behavior can also be considered from the viewpoint tunneling in quantum mechanics. A particle initially confined to the of right-hand side can tunnel through the classically forbidden region (the middle barrier) into the left-hand side, then back to the right-hand side, and so on. But now let the middle barrier become infinitely high; see Figure 4.4. The IS) and lA) states are now degenerate, so (4.2.37a) and (4.2.37b) are also energy eigenkets even though they are not parity eigenkets. Once the system is found in IR), it remains so forever (oscillation time between IS) and lA) is now oo ). Because the middle barrier is infinitely high, there is no possibility for tunneling. Thus when there is degeneracy, the physically realizable energy eigenkets need not be parity eigenkets. We have a ground state which is asymmetrical despite the fact that the Hamiltonian itself is symmetrical under space inversion, so with degeneracy the symmetry of H is not necessarily obeyed by energy eigenstates IS) and lA). This is a very simple example of broken symmetry and degeneracy. Nature is full of situations analogous to this. Consider a ferromagnet. The basic Hamiltonian for iron atoms is rotationally invariant, but the ferromag net clearly has a definite direction in space; hence, the (infinite) number of ground states is not rotationally invariant, since the spins are all aligned along some definite (but arbitrary) direction. A textbook example of a system that illustrates the actual importance of the symmetrical double well is an ammonia molecule, NH3; see Figure 4.5. We imagine that the three H atoms form the three corners of an equilateral triangle. The N atom can be up or down, where the directions up and down are defined because the molecule is rotating around the axis as w=
00
FIGURE
00
4.4. The symmetrical double well with an infinitely high middle barrier.
259
4.2. Discrete Symmetries, Parity, or Space Inversion
H N H
H H
H
N
(a)
(b)
An ammonia molecule, N H 3 , where the three H atoms form the three corners of an equilateral triangle.
FIGURE 4.5.
shown in Figure 4.5. The up and down positiOns for the N atom are analogous to R and L of the double-well potential. The parity and energy eigenstates are superpositions of Figure 4.5a and Figure 4.5b in the sense of (4.2.37a) and (4.2.37b), respectively, and the energy difference between the simultaneous eigenstates of energy and parity correspond to an oscillation frequency of 24,000 MHz-a wavelength of about 1 em, which is in the microwave region. In fact, NH 3 is of fundamental importance in maser physics. There are naturally occurring organic molecules, such as sugar or amino acids, which are of the R-type (or L-type) only. Such molecules which have definite handedness are called optical isomers. In many cases the oscillation time is practically infinite-on the order of 104-106 years-so R-type molecules remain right-handed for all practical purposes. It is amusing that if we attempt to synthesize such organic molecules in the laboratory, we find equal mixtures of R and L. Why we have a preponder ance of one type is nature's deepest mystery. Is it due to a genetic accident, like the spiral shell of a snail or the fact that our hearts are on the left-hand side? Parity-Selection Rule
Suppose Ia) and
1/3)
are parity eigenstates: 7T i a) = £al a)
( 4.2.40a)
Symmetry in Quantum Mechanics
260
and
7T [{3 ) = Ep [ /3 ) ,
( 4.2.40b)
({3[x [a) = 0
(4.2.41)
where E.,, Ep are the parity eigenvalues ( ± 1). We can show that unless E., = - Ep. In other words, the parity-odd operator x connects states of opposite parity. The proof of this follows: (4.2.42)
which is impossible for a finite nonzero ( j3[x[a) unless E., and Ep are opposite in sign. Perhaps the reader is familiar with this argument from ( 4.2.43)
if >/;13 and >/;" have the same parity. This selection rule, due to Wigner, is of importance in discussing radiative transitions between atomic states. As we will discuss in greater detail later, radiative transitions take place between states of opposite parity as a consequence of multipole expansion for malism. This rule was known phenomenologically from analysis of spectral lines, before the birth of quantum mechanics, as Laporte's rule. It was Wigner who showed that Laporte's rule is a consequence of the parity-selec tion rule. If the basic Hamiltonian H is invariant under parity, nondegenerate energy eigenstates (as a corollary of (4.2.43)) cannot possess a permanent electric dipole moment: ( 4.2.44) ( n [x [n) = 0. This follows trivially from (4.2.43), because with the nondegenerate assump tion, energy eigenstates are also parity eigenstates [see (4.2.32) and (4.2.33)).
For a degenerate state, it is perfectly all right to have an electric dipole moment. We will see an example of this when we discuss the linear Stark effect in Chapter 5. Our considerations can be generalized: Operators that are odd under parity, like p or S • x, have nonvanishing matrix elements only between states of opposite parity. In contrast, operators that are even under parity connect states of the same parity. Parity Nonconservation
The basic Hamiltonian responsible for the so-called weak interaction of elementary particles is not invariant under parity. In decay processes we can have final states which are superpositions of opposite parity states. Observable quantities like the angular distribution of decay products can
4.3. Lattice Translation as a Discrete Symmetry
261
depend on pseudoscalars such as (S) p. It is remarkable that parity conservation was believed to be a sacred principle until 1956, when Lee and Yang speculated that parity is not conserved in weak interactions and proposed crucial experiments to test the validity of parity conservation. Subsequent experiments indeed showed that observable effects do depend on pseudoscalar quantities such as correlation between (S) and p. Because parity is not conserved in weak interactions, previously thought " pure" nuclear and atomic states are, in fact, parity mixtures. These subtle effects have also been found experimentally. •
4.3. LATIICE TRANSLATION AS A DISCRETE SYMMETRY
We now consider another kind of discrete symmetry operation, namely, lattice translation. This subject has extremely important applications in solid-state physics. Consider a periodic potential in one dimension, where V( x ± a) = V( x ), as depicted in Figure 4.6. Realistically, we may consider the motion
(a)
I I I I I I 1.- a
I
I I I I I a __,_
I I I I I I - 1-- a
I
I
I I I I I I I I I I - t- a -t
FIGURE 4.6. (a) Periodic potential in one dimension with periodicity a. (b) The (b)
periodic potential when the barrier height between two adjacent lattice sites becomes infinite.
Symmetry in Quantum Mechanics
262
of an electron in a chain of regularly spaced positive ions. In general, the Hamiltonian is not invariant under a translation represented by T(/) with I arbitrary, where r(/) has the property (see Section 1.6) rt(l )xr(l) = x + I, r(/ ) lx ') = lx' + / ) . (4 . 3 .1 ) However, when I coincides with the lattice spacing a , we do have r t (a) V( x ) r(a ) = V( x + a ) = V( x ) .
(4.3.2)
Because the kinetic-energy part of the Hamiltonian H is invariant under the translation with any displacement, the entire Hamiltonian satisfies rt( a ) Hr ( a ) = H.
(4.3.3)
Because r(a) is unitary, we have [from (4.3.3)] [ H, r ( a ) ] = O,
(4.3 .4)
r(a ) ln) = In + 1 ) .
(4.3.5)
so the Hamiltonian and r(a) can be simultaneously diagonalized. Although r( a ) is unitary, it is not Hermitian, so we expect the eigenvalue to be a complex number of modulus 1. Before we determine the eigenkets and eigenvalues of r( a) and examine their physical significance, it is instructive to look at a special case of periodic potential when the barrier height between two adjacent lattice sites is made to go to infinity, as in Figure 4.6b. What is the ground state for the potential of Figure 4.6b? Clearly, a state in which the particle is completely localized in one of the lattice sites can be a candidate for the ground state. To be specific let us assume that the particle is localized at the nth site and denote the corresponding ket by In). This is an energy eigenket with energy eigenvalue E0, namely, H ln) = E01n). Its wave fu11ction (x 'ln) is finite only in the n th site. However, we note that a similar state localized at some other site also has the same energy E0 , so actually there are denumerably infinite ground states n, where n runs from - oo to + oo . Now In) is obviously not an eigenket of the lattice-translation operator, because when the lattice-translation operator is applied to it, we obtain In + 1): So despite the fact that r(a) commutes with H, ln)-which is an eigenket of H -is not an eigenket of r( a). This is quite consistent with our earlier theorem on symmetry because we have an infinitefold degeneracy. When there is such degeneracy, the symmetry of the world need not be the symmetry of energy eigenkets. Our task is to find a simultaneous eigenket of H and r(a). Here we may recall how we handled a somewhat similar situation with the symmetrical double-well potential of the previous section. We noted that even though neither 1 R) nor 1 L ) is an eigenket of 7T, we could
263
4.3. Lattice Translation as a Discrete Symmetry
easily form a symmetrical and an antisymmetrical combination of I R ) and I L ) that are parity eigenkets. The case is analogous here. Let us specifically form a linear combination 10) =
00
0 L e'" l n ) ,
n = - oo
(4.3 .6)
where 0 is a real parameter with 'TT s 0 s 'TT. We assert that 10) is a simultaneous eigenket of H and T(a). That it is an H eigenket is obvious because I n ) is an energy eigenket with eigenvalue E0 , independent of n. To show that it is also an eigenket of the lattice translation operator we apply 'T( a ) as follows: -
00
n = - oo
00
n = - oo
(4.3.7)
Note that this simultaneous eigenket of H and 'T( a) is parameterized by a continuous parameter 0. Furthermore, the energy eigenvalue E0 is indepen dent of 0. Let us now return to the more realistic situation of Figure 4.6a, where the barrier between two adjacent lattice sites is not infinitely high. We can construct a localized ket 1 n ) just as before with the property T( a ) 1 n) = [n + 1 ) . However, this time we expect that there is some leakage possible into neighboring lattice sites due to quantum-mechanical tunneling. In other words, the wave function < x l n ) has a tail extending to sites other than the n th site. The diagonal elements of H in the { I n ) } basis are all equal because of translation invariance, that is, '
(4.3 .8) independent of n, as before. However we suspect that H is not completely diagonal in the { [ n ) } basis due to leakage. Now, suppose the barriers between adjacent sites are high (but not infinite). We then expect matrix elements of H between distant sites to be completely negligible. Let us assume that the only nondiagonal elements of importance connect im mediate neighbors. That is, < n 'I H i n ) *- 0 only if n ' = n
or n ' = n ± l .
(4.3 .9)
In solid-state physics this assumption is known as the tight-binding ap proximation. Let us define (4.3. 1 0 )
264
Symmetry in Quantum Mechanics
we obtain
Hi n ) = E0 ln) -!lin + 1) - A In 1 ) -
I n)
Note that is no longer an energy eigenket. As we have done with the potential of Figure linear combination
18 ) =
"9l n ) L e ' n = - oo 00
·
(4. 3 .11)
.
4.6b, let us form a (4.3.12)
Clearly, I (J ) is an eigenket of translation operator r( a ) because the steps in (4.3.7) still hold. A natural question is, is I B ) an energy eigenket? To answer this question, we apply
H: HL:e' "9 l n ) E0Le '"9ln ) - AL:e'"9 l n + 1 ) - llLe' "9 l n -1 ) = Eo Le' " ol n )- LlL(e i n 0-, 0 + e i n O+iO ) I n ) (4. 3 . 13) =
The big difference between this and the previous situation is that the energy eigenvalue now depends on the continuous real parameter B. The degener acy is lifted as becomes finite, and we have a continuous distribution of energy eigenvalues between and See Figure where we visualize how the energy levels start forming a continuous energy band as is increased from zero. To see the physical meaning of the parameter (J let us study the wave function ( x '10 . For the wave function of the lattice-translated state r( a ) I B ),
A
E0 -28 E0 + 2ll.
4.7,
)
0 FIGURE 4.7.
zero.
�
Energy levels forming a continuous energy band as ll is increased from
A
4.3 . Lattice Translation as a Discrete Symmetry
we obtain
265
(x 'l'r( a ) [O) = (x ' - a[O)
(4.3 .14) by letting T( a) act on ( x '[. But we can also let ,-(a) operate on [0) and use (4.3.7). Thus so
( x 'H a ) [0) = e - iB( x '[0 ) ,
(4.3.15)
(x ' - a[O) = (x '[O ) e - '8.
(4.3.16)
We solve this equation by setting
(4.3.17) (x'[8) = ei k x 'uk ( x ') , with 8 = ka , where u k ( x ') is a periodic function with period a, as we can easily verify by explicit substitutions, namely, (4.3.18) Thus we get the important condition known as Bloch's theorem: The wave function of [ 8 ), which is an eigenket of T(a), can be written as a plane wave e ik x ' times a periodic function with periodicity a. Notice that the only fact we used was that [0) is an eigenket of T( a) with eigenvalue e - ;B [see (4.3.7)]. In particular, the theorem holds even if the tight-binding approxi mation (4.3.9) breaks down. We are now in a position to interpret our earlier result (4.3.13) for [8) given by (4.3.12). We know that the wave function is a plane wave characterized by the propagation wave vector k modulated by a periodic function uk( x ') [see (4.3.17)]. As 8 varies from - 1r to 7T, the wave vector k varies from - 1r1a to 1rIa. The energy eigenvalue E now depends on k as follows: E ( k ) E0 - 2/::,. cos ka. (4.3 .19) =
Notice that this energy eigenvalue equation is independent of the detailed shape of the potential as long as the tight-binding approximation is valid. Note also that there is a cutoff in the wave vector k of the Bloch wave function (4.3.17) given by [k[ = 1r1a. Equation (4.3.19) defines a dispersion curve, as shown in Figure 4.8. As a result of tunneling, the denumerably infinitefold degeneracy is now completely lifted, and the allowed energy values form a continuous band between E0 - 21::.. and E0 + 21::.. , known as the Brillouin zone. So far we have considered only one particle moving in a periodic potential. In a more realistic situation we must look at many electrons moving in such a potential. Actually the electrons satisfy the Pauli exclusion principle, as we will discuss more systematically in Chapter 6, and they start filling the band. In this way, the main qualitative features of metals,
Symmetry in Quantum Mechanics
266
E(k) Ea+26 Eo Ea-26
FIGURE 4.8.
0
+ 11"
/a
k
Dispersion curve for E ( k ) versus k in the Brillouin zone l k l ,.; wja.
semiconductors, and the like can be understood as a consequence of translation invariance supplemented by the exclusion principle. The reader may have noted the similarity between the symmetrical double-well problem of Section 4.2 and the periodic potential of this section. Comparing Figures 4.3 and 4.6, we note that they can be regarded as opposite extremes (two versus infinite) of potentials with a finite number of troughs. 4.4. THE TIME-REVERSAL DISCRETE SYMMETRY
In this section we study another discrete symmetry operator, called time reversal. This is a difficult topic for the novice, partly because the term time reversal is a misnomer; it reminds us of science fiction. Actually what we do in this section can be more appropriately characterized by the term reversal of motion. Indeed, that is the terminology used by E. Wigner, who for mulated time reversal in a very fundamental paper written in 1932. For orientation purposes let us look at classical mechanics. Suppose there is a trajectory of a particle subject to a certain force field; see Figure 4.9. At t = 0, let the particle stop and reverse its motion: P l 1 - o --> - P l1 _ 0 . The particle traverses backward along the same trajectory. If you run the motion picture of trajectory (a) backward as in (b), you may have a hard time telling whether this is the correct sequence. More formally, if x( t ) is a solution to mx = - v v(x) ,
(4.4.1)
then x( - t) is also a possible solution in the same force field derivable from V. It is, of course, important to note that we do not have a dissipative force
267
4.4. The Time-Reversal Discrete Symmetry
At I = 0 Reverse
(a)
FIGURE 4.9.
(b)
(a) Classical trajectory which stops at t � 0 and (b) reverses its motion
P l r-o .... - pl r-o·
here. A block sliding on a table decelerates (due to friction) and eventually stops. But have you ever seen a block on a table spontaneously start to move and accelerate? With a magnetic field you may be able to tell the difference. Imagine that you are taking the motion picture of a spiraling electron trajectory in a magnetic field. You may be able to tell whether the motion picture is run forward or backward by comparing the sense of rotation with the magnetic pole labeling N and S. However, from a microscopic point of view, B is produced by moving charges via an electric current; if you could reverse the current that causes B, then the situation would be quite symmetrical. In terms of the picture shown in Figure 4.10, you may have figured out that N and S are mislabeled! Another more formal way of saying all this is that the Maxwell equations, for example, V · E = 47rp '
c
V x B - ! iJ
and the Lorentz force equation t -+ - t provided we also let B -+ - B, E -+ E,
E
at
=
4'1T j' V x E = c
F = e[E + (ljc)(v X B)] p -+ p,
j ..... - j ,
-
1
iJB
-
c
at '
(4.4.2)
are invariant under v -+ - v .
(4.4.3)
Let us now look at wave mechanics, where the basic equation of the Schrodinger wave equation is
ill B>fat = ( - � v 2 v ) "'. 2m +
(4. 4.4 )
Suppose 1/; (x, t ) is a solution. We can easily verify that 1/; (x, - t) is not a solution, because of the appearance of the first-order time derivative. However, 1/;*(x, - t ) is a solution, as you may verify by complex conjuga tion of (4.4.4). It is instructive to convince ourselves of this point for an
268
Symmetry in Quantum Mechanics
s B
Electron trajectory
N FIGURE 4.10. Electron trajectory between the north and south poles of a magnet.
energy eigenstate, that is, by substituting into the Schrodinger equation (4.4.4). Thus we conjecture that time reversal must have something to do with complex conjugation. If at t = 0 the wave function is given by 1/; = (x l a ) ,
( 4.4.6)
then the wave function for the corresponding time-reversed state is given by (x l a)*. We will later show that this is indeed the case for the wave function of a spinless system. As an example, you may easily check this point for the wave function of a plane wave; see Problem 8 of this chapter.
Digression on Symmetry Operations Before we begin a systematic treatment of the time-reversal operator, some general remarks on symmetry operations are in order. Consider a symmetry operation
( 4.4.7 ) Ia) -- I ii ) , 1 /J ) -- 1 /3 ) . One may argue that it is natural to require the inner product ( /Jia) to be preserved, that is, ( 4.4.8) ( /3 1ii) = (/J i a) . Indeed, for symmetry operations such as rotations, translations, and even parity, this is indeed the case. If I a) is rotated and 1 /J ) is also rotated in the
4.4.
269
The Time-Reversal Discrete Symmetry
same manner, ( P ia) is unchanged. Formally this arises from the fact that, for the symmetry operations considered in the previous sections, the corre sponding symmetry operator is unitary, so
( 4.4.9) However, in discussing time reversal, we see that requirement (4.4.8)
turns out to be too restrictive. Instead, we merely impose the weaker requirement that ( 4.4.10) I( ,B [ ii ) l = i( ,B [ a) [ . Requirement (4.4.8) obviously satisfies (4.4.10). But this is not the only way;
( 4.4.11) ( ,B [a) = ( ,Bi a) * = (a [,B ) works equally well. We pursue the latter possibility in this section because from our earlier discussion based on the Schrodinger equation we inferred that time reversal has something to do with complex conjugation. Definition.
The transformation I a) -> I a) = Bi a) ,
i ,B ) -> [ ,8 ) = Bi ,B )
is said to be antiunitary if
(,8[&. ) = ( ,B ia) * , O ( c1[a) + c2 [ ,B ) ) = c;"Oia) + ciOi ,B ) .
( 4.4.12) ( 4.4.13a)
( 4.4.1 3b)
In such a case the operator (} is an antiunitary operator. Relation (4.4.1 3b) alone defines an antilinear operator. We now claim that an antiunitary operator can be written as (4.4.14) O = UK , where U is a unitary operator and K is the complex-conjugate operator that forms the complex conjugate of any coefficient that multiplies a ket (and stands on the right of K ). Before checking (4.4.1 3) let us examine the property of the K operator. Suppose we have a ket multiplied by a complex number c. We then have Kc[a) = c *K i a) . ( 4.4.15) One may further ask, What happens if [a) is expanded in terms of base kets { [a')}? Under the action K we have K
Ia) = L ! a ') ( a '[a) -> [&.) = [ (a 'la)*K [ a ') a
'
= I; (a '[a)* [ a ') . a
a
'
( 4.4.16)
'
Notice that K acting on the base ket does not change the base ket. The
270
Symmetry in Quantum Mechanics
explicit representation of I a ') is 0 0 Ia') = 0 , 1
( 4.4.17)
0
0 and there is nothing to be changed by K. The reader may wonder, for instance, whether the SY eigenkets for a spin -l: system change under K. The answer is that if the Sz eigenkets are used as base kets, we must change the Sv eigenkets because the SY eigenkets (1.1.14) undergo under K
(
)
1 i 1 _ i K - 1 + ) ± - 1 - ) --> - 1 + ) + - 1 - ) .
fi
fi
fi
fi
(4.4.18)
On the other hand, if the SY eigenkets themselves are used as the base kets, we do not change the SY eigenkets under the action of K. Thus the effect of K changes with the basis. As a result, the form of U in (4.4.14) also depends on the particular representation (that is, the choice of base kets) used. Gottfried puts it aptly: " If the basis is changed, the work of U and K has to be reapportioned." Returning to 8 = UK and (4.4.13), let us first check property (4.4.13b). We have 8 ( c d a) + c 2 1 /J ) ) = UK ( c 1 la) + c 2 1 /J ) )
= c tUK ia) + c j UK I /J )
= c�B ia) + c�B I � ) ,
(4.4.19)
so (4.4.1 3b) indeed holds. Before checking (4.4.13a), we assert that it is always safer to work with the action of 8 on kets only. We can figure out how the bras change just by looking at the corresponding kets. In particular, it is not necessary to consider 8 acting on bras from the right, nor is it necessary to define ot. We have 8
Ia) -> Iii) = L ( a 'la)* UK ia ') a
'
= L ( a 'la)*U i a ') a
'
= L (ala ') U ia'). a
'
(4.4.20)
4.4.
271
The Time-Reversal Discrete Symmetry
As for I.B ) . we have I P ) = L (a 'I.B) * Uia') � ( PI = L (a 'I .B ) ( a 'IUt a
a
'
"
a
a
'
'
= L ( a l a')( a 'I.B)
= ( ai .B )
a'
= ( .Bia)* ,
(4.4.21)
so this checks. In order for (4.4.10) to be satisfied, it is of physical interest to consider just two types of transformation-unitary and antiunitary. Other possibilities are related to either of the preceding via trivial phase changes. The proof of this assertion is actually very difficult and will not be discussed further here (Gottfried 1966, 226�28).
Time-Reversal Operator We are finally in a position to present a formal theory of time reversal. Let us denote the time-reversal operator by E>, to be distinguished from 8, a general antiunitary operator. Consider (4.4.22) Ia) -> E> la) , where E>la ) is the time-reversed state. More appropriately, E> la) should be called the motion-reversed state. If Ia) is a momentum eigenstate IP') , we expect E>la) to be 1 - p') up to a possible phase. Likewise, J is to be reversed under time reversal. We now deduce the fundamental property of the time-reversal oper ator by looking at the time evolution of the time-reversed state. Consider a physical system represented by a ket Ia), say at t = 0. Then at a slightly later time t 8t, the system is found in
=
( i: 8t ) Ia) ,
Ia, t 0 = 0 ; t = 8t) = 1 -
(4.4.23)
where H is the Hamiltonian that characterizes the time evolution. Instead of the preceding equation, suppose we first apply E> , say at t = 0, and then let the system evolve under the influence of the Hamiltonian H. We then have at 8t (4.4.24a) If motion obeys symmetry under time reversal, we expect the preceding state ket to be the same as (4.4.24b) E>l a , t 0 = O ,· t = - 8t)
272
Symmetry in Quantum Mechanics
that is, first consider a state ket at earlier time t = and J; see Figure 4.1 1 . Mathematically,
- lJt,
and then reverse p
( 4.4.25) If the preceding relation is to be true for any ket, we must have -
( 4.4.26)
iHBI) = BiH I) ,
where the blank ket I ) emphasizes that (4.4.26) is to be true for any ket. We now argue that B cannot be unitary if the motion of time reversal is to make sense. Suppose B were unitary. It would then be legitimate to cancel the i 's in (4.4.26), and we would have the operator equation
( 4.4.27)
- HB = BH.
Consider an energy eigenket I n ) with energy eigenvalue E The corre sponding time-reversed state would be B i n ), and we would have, because of •.
(4.4.27),
HB i n )
=
-
BHin )
=
(
-
(4.4.28)
E. ) B in ) .
This equation says that B i n ) is an eigenket of the Hamiltonian with energy eigenvalues E But this is nonsensical even in the very elementary case of a free particle. We know that the energy spectrum of the free particle is positive semidefinite-from 0 to + oo . There is no state lower than a -
•.
Momentum before reversal
�
Momentum before reversal
Momentum after reversal
at t =
at t=+ot
-
8t
Momentum
(a)
after reversal
(b)
FIGURE 4.ll. Momentum before and after time reversal at time 1
=
0 and 1
=
± 81.
4.4. The Time-Reversal Discrete Symmetry
273
particle at rest (momentum eigenstate with momentum eigenvalue zero); the energy spectrum ranging from oo to 0 would be completely unacceptable. We can also see this by looking at the structure of the free-particle Hamiltonian. We expect p to change sign but not p2 ; yet (4.4.27) would imply that -
2 p2 e - I Le = - ·
2m
2m
(4.4.29)
All these arguments strongly suggest that if time reversal is to be a useful symmetry at all, we are not allowed to cancel the i 's in (4.4.26); hence, El had better be antiunitary. In this case the right-hand side of (4 4 26) becomes .
.
EliH [ ) = - iElH [ )
( 4.4.30)
by antilinear property (4.4.1 3b). Now at last we can cancel the i 's in (4.4.26) leading, finally, via (4.4.30) to ElH = HEl .
(4.4.31)
Equation (4.4.31) expresses the fundamental property of the Hamiltonian under time reversal. With this equation the difficulties mentioned earlier [see (4.4.27) to (4.4.29)] are absent, and we obtain physically sensible results. From now on we will always take El to be antiunitary. We mentioned earlier that it is best to avoid an antiunitary operator acting on bras from the right. Nevertheless, we may use ( 4 . 4. 32) ( � [ El [ a) , which is always to be understood as ( ( � [) · ( El [a))
( 4.4.33)
( ( � [ El ) · [ a) .
( 4.4.34)
and never as In fact, we do not even attempt to define ( � [El. This is one place where the Dirac bra-ket notation is a little confusing. After all, that notation was invented to handle linear operators, not antilinear operators. With this cautionary remark, we are in a position to discuss the behavior of operators under time reversal. We continue to take the point of view that the El operator is to act on kets [a) = El [a) ,
t.B> = El [ P ) ,
(4.4.35)
yet it is often convenient to talk about operators-in particular, observables -which are odd or even under time reversal. We start with an important identity, namely, ( 4.4.36)
where ® is a linear operator. This identity follows solely from the anti-
274
Symmetry in Quantum Mechanics
unitary nature of 8 . To prove this let us define I Y ) = ® t i ,B ) . By dual correspondence we have DC
Hence,
( 4.4.37 )
I Y ) .... ( ,8 1 ® = (YI ·
( 4.4.38)
( .B I ® Ia) = (r ia) = (iWi') = (al 8 ®ti.B> = (al 8 ® te - 1 8 I.B> = (al 8 ®t8 - 1 1.8 ) ,
( 4.4.39)
which proves the identity. In particular, for Hermitian observables A we get
(4.4.40) We say that observables are even or odd under time reversal according to whether we have the upper or lower sign in
(4.4.41) Note that this equation, together with (4.4.40), gives a phase restriction on the matrix elements of A taken with respect to time reversed states as follows:
( 4.4.42) (,B iA i a) = ± ( .8 1Ai a )* . If I .B ) is identical to Ia) , so that we are talking about expectation values, we have (4.4.43) (aiAia) = ± (iiiAiii) , where (iiiAiii) is the expectation value taken with respect to the time reversed state. As an example, let us look at the expectation value of p. It is reasonable to expect that the expectation value of p taken with respect to the time-reversed state be of opposite sign. Thus ( a lp Ia) = - ( ii lp lii) , so we take p to be an odd operator, namely, 8p8 - l - p . =
This implies that
p8 1p') = - 8p8 - 18 lp') ( - p' ) 8 1p') .
=
(4.4.44) ( 4.4.45)
( 4.4.46)
Equation (4.4.46) agrees with our earlier assertion that 8 ip') is a momentum eigenket with eigenvalue - p'. It can be identified with 1 - p') itself with a
4.4. The Time-Reversal Discrete Symmetry
275
suitable choice of phase. Likewise, we obtain 8x8 - 1 = x 8 jx') = jx ')
(up to a phase) from the (eminently reasonable) requirement (alx la) = (iilxlii ) .
(4.4.47)
( 4.4.48)
We can now check the invariance of the fundamental commutation relation (4.4.49) where the blank ket 1 ) stands for any ket. Applying 8 to both sides of (4.4.49), we have 8 [ x , , p1 ] 8 - 18 1 ) = 8ill c'l,) ) ,
( 4.4.50)
which leads to, after passing 8 through ill,
[X ; , ( - P)] 8i > = - illc'J,J 8 j ) .
( 4.4.51 )
Note that the fundamental commutation relation [x,, p) = inc'J,1 is preserved by virtue of the fact that 8 is antiunitary. This can be given as yet another reason for taking 8 to be antiunitary; otherwise, we will be forced to abandon either (4.4.45) or (4.4.47)! Similarly, to preserve (4.4.52) [ 1; , � ] = in e,1kJk , the angular-momentum operator must be odd under time reversal, that is,
(4.4.53) 8J8 - 1 = - J . This is consistent for spinless system where J is just x X p. Alternatively, we could have deduced this relation by noting that the rotational operator and the time-reversal operator commute (note the extra i !).
Wave Function Suppose at some given time, say at t = 0, a spinless single-particle system is found in a state represented by ja). Its wave function (x 'ja) appears as the expansion coefficient in the position representation
J =J J
ja) =
d 3x ' jx')(x'ja) .
( 4.4.54)
Applying the time-reversal operator 8ja)
=
d 3x ' 8 1x ') (x 'la)* d 3x 'lx ') (x 'j a)* ,
(4.4.55)
where we have chosen the phase convention so that 8 jx ') is jx ') itself. We
Symr,Jetry in Quantum Mechanics
276
then recover the rule � (x') --+ �*(x')
( 4.4.56)
inferred earlier by looking at the Schrodinger wave equation [see (4.4.5)). The angular part of the wave function is given by a spherical harmonic Y,m_ With the usual phase convention we have
( 4.4.57) Now Yt( 8 , >) is the wave function for 11, m) [see (3.6.23)]; therefore, from (4.4.56) we deduce 81/, m ) = ( - l n /, - m ) .
( 4.4.58)
If we study the probability current density (2.4.16) for a wave function of type (3.6.22) going like R ( r )Yt , we shall conclude that for m > 0 the current flows in the counterclockwise direction, as seen from the positive z-axis. The wave function for the corresponding time-reversed state has its probability current flowing in the opposite direction because the sign of m is reversed. All this is very reasonable. As a nontrivial consequence of time-reversal invariance, we state an important theorem on the reality of the energy eigenfunction of a spinless particle. Theorem. Suppose the Hamiltonian is invariant under time reversal
and the energy eigenket I n ) is nondegenerate; then the corresponding energy eigenfunction is real (or, more generally, a real function times a phase factor independent of x). Proof To prove this, first note that
H81 n ) = 8Hin) = En8 1 n ) ,
( 4.4.59)
(x 'ln ) = (x'ln ) *
(4.4.60)
so I n ) and 8 1 n ) have the same energy. The nondegeneracy assumption prompts us to conclude that I n ) and 8 1 n ) must represent the same state; otherwise there would be two different states with the same energy En, an obvious contradiction! Let us recall that the wave functions for I n ) and 8 1 n ) are (x'ln) and (x 'ln )*, respectively. They must be the same-that is, for all practical purposes-or, more precisely, they can differ at most by a D phase factor independent of x. Thus if we have, for instance, a nondegenerate bound state, its wave function is always reaL On the other hand, in the hydrogen atom with I * 0, m * 0, the energy eigenfunction characterized by definite ( n , I, m ) quantum numbers is complex because �m is complex; this does not contradict the
277
4.4. The Time-Reversal Discrete Symmetry
theorem because In , I, m ) and In , I, - m ) are degenerate. Similarly, the wave function of a plane wave e i p · xfh is complex, but it is degenerate with e - , p • xfh _ We see that for a spinless system, the wave function for the time reversed state, say at t = 0, is simply obtained by complex conjugation. In terms of ket Ia) written as in (4.4.16) or in (4.4.54), the e operator is the complex conjugate operator K itself because K and E> have the same effect when acting on the base ket Ia') (or lx ')). We may note, however, that the situation is quite different when the ket Ia) is expanded in terms of the momentum eigenket because e must change IP') into 1 - p') as follows: E> l o:>
=I
d3p' l - p')(p' l o:>*
=I
d3p' 'p')( - p' l o:>*.
(4.4.61)
It is apparent that the momentum-space wave function of the time-reversed state is not just the complex conjugate of the original momentum-space wave function; rather, we must identify <[>*( - p') as the momentum-space wave function for the time-reversed state. This situation once again il lustrates the basic point that the particular form of e depends on the particular representation used.
Time Reversal for a Spin ! System The situation is even more interesting for a particle with spin-spin ! , in particular. We recall from Section 3.2 that the eigenket of S · it with eigenvalue h /2 can be written as (4.4.62) lit; + ) = e - ,S,afhe - i SJl/h l + ) , where it is characterized by the polar and azimuthal angles f3 and a, respectively. Noting (4.4.53) we have E> lil; + ) = e- 'S,afhe-, S ,fllh E> I + ) = 11 lil ; - ) . On the other hand, we can easily verify that lit ; _ ) = e - 'aS,/he - i( .- + fl)S, /h l + ) .
( 4.4.63) ( 4.4.64)
In general, we saw earlier that the product UK is an antiunitary operator. Comparing (4.4.63) and (4.4.64) with E> set equal to UK, and noting that K acting on the base ket I + ) gives just I + ), we see that (4.4.65) where TJ stands for an arbitrary phase (a complex number of modulus unity). Another way to be convinced of (4.4.65) is to verify that if x(it; + ) is the two-component eigenspinor corresponding to In; + ) (in the sense that
278
Symmetry in Quantum Mechanics
cr · itx(it; + ) = x(it; + )], then - icryx *(it ; + )
(4.4.66) (note the complex conjugation!) is the eigenspinor corresponding to In; - ), again up to an arbitrary phase, see Problem 7 of this chapter. The ap pearance of Sy or cry can be traced to the fact that we are using the rep resentation in which Sz is diagonal and the nonvanishing matrix elements of Sy are purely imaginary. Let us now note
( 4.4.67)
Using (4.4.67), we are itl a position to work out the effect of 0, written as (4.4.65), on the most general spin � ket: 0 ( c 1 + > + c - 1 - > ) = + 11 c! 1 - > - .,c>l< l + > . Let us apply 0 once again: 2 0 2 ( c + I + ) + c 1 - )) = - l 11 12c + I + > - l 11 1 c 1 - > = - ( c + l + ) + c l - )) or +
(4.4.68)
(4.4.69) ( 4.4.70)
(where 1 is to be understood as - 1 times the identity operator) for any spin orientation. This is an extraordinary result. It is crucial to note here that our conclusion is completely independent of the choice of phase; (4.4.70) holds no matter what phase convention we may use for 'II · In contrast, we may note that two successive applications of 0 to a spinless state give -
(4.4.71) as is evident from, say, (4.4.58). More generally, we now prove 02 1 } half-integer) = - I} half-integer)
(4.4.72a) (4.4.72b) 0 21 } integer) = + IJ integer). Thus the eigenvalue of 02 is given by ( - 1 ) 21. We first note that (4.4.65) generalizes for arbitrary j to
(4.4.73) For a ket Ia) expanded in terms of I J m) base eigenkets, we have 0 ( 0 L ! Jm ) (Jmlo:)) = 0 ( 1J �::e - i.,Jy /h iJm ) ( Jmlo:)* ) ,
( 4.4.74)
4.4. The Time-Reversal Discrete Symmetry
279
But ( 4.4.75) as is evident from the properties of angular-momentum eigenstates under rotation by 2?T. In (4.4.72b), li integer) may stand for the spin state (4.4.76) ¥'21 ( 1 + - ) ± 1 - + ) ) of a two-electron system or the orbital state 1 1, m ) of a spinless particle. It is important only that j is an integer. Likewise, li half-integer) may stand, for example, for a three-electron system in any configuration. Actually, for a system made up exclusively of electrons, any system with an odd (even) number of electrons-regardless of their spatial orientation (for example, relative orbital angular momentum)-is odd (even) under 8 2 ; they need not even be J 2 eigenstates! We make a parenthetical remark on the phase convention. In our earlier discussion based on the position representation, we saw that with the usual convention for spherical harmonics it is natural to choose the arbi trary phase for 1/, m ) under time reversal so that (4.4.77) Blt, m ) = ( - 1nt, - m ) . Some authors find it attractive to generalize this to obtain BIJ, m ) = ( - lnJ, - m ) ( J an integer) (4.4.78) regardless of whether j refers to I or s (for an integer spin system). We may naturally ask, is this compatible with (4.4.72a) for a spin 1 system when we visualize IJ, ) as being built up of " primitive" spin ! objects according to Wigner and Schwinger. It is easy to see that (4.4.72a) is indeed consistent provided we choose 11 in (4.4.73) to be + i. In fact, in general, we can take (4.4.79) BIJ, m ) = i 2miJ, - m ) for any j-either a half-integer j or an integer j; see Problem 10 of this chapter. The reader should be warned, however, that this is not the only convention found in the literature. (See, for instance, Frauenfelder and Henley 1974.) For some physical applications, it is more convenient to use other choices; for instance, the phase convention that makes the J op erator matrix elements simple is not the phase convention that makes the time-reversal operator properties simple. We emphasize once again that (4.4.70) is completely independent of phase convention. Having worked out the behavior of angular-momentum eigenstates under time reversal, we are in a position to study once again the expectation values of a Hermitian operator. Recalling (4.4.43), we obtain under time reversal (canceling the i 2 m factors) (4.4.80) (a , j , m iAia, j, m) = ± ( a , j , - m iAia, j, - m ) . m
±
280
Symmetry in Quantum Mechanics
Now suppose A is a component of a spherical tensor Tq
(4.4.81)
er
A = T0(k) becomes
< a , ]. , m I 'T'( + < a , J. , - m I 'T'( 1 0 k ) l a , J. , m > - 1 0 k ) l a , J. , - m > . (4 .4 . 82 ) Due to (3.6.46)-(3.6.49), we expect Ia, j, ...: m ) = £& (0, '71' , O) l a, j, m ) up to a phase. We next use (3.10.22) for T0
E& t ( 0 , '7T , 0) T0
(4.4.83)
where we have used E&Jt'l(O, '71',0) = Pk(cos '1T ) = ( 1 ) \ and the q * 0 com ponents -give vanishing contributions when sandwiched between (a, j, m I and Ia , j, m ) . The net result is -
( a , j, miT0
=
± ( - l ) k (a, j, m i T0< k l l a , j, m ) .
(4.4.84)
As an example, taking k = 1, the expectation value (x) taken with respect to eigenstates of j, m vanishes. We may argue that we already know (x) = 0 from parity inversion if the expectation value is taken with respect to parity eigenstates [see (4.2.41)). But note that here Ia, j, m ) need not be parity eigenkets! For example, the I J, m ) for spin � particles could be cs l s112 ) + cp i P I;z ) .
Interactions with Electric and Magnetic Fields; Kramers Degeneracy Consider charged particles in an external electric or magnetic field. If we have only a static electric field interacting with the electric charge, the interaction part of the Hamiltonian is just
V(x) = ecp(x),
( 4.4.85)
where cp(x) is the electrostatic potential. Because cp(x) is a real function of the time-reversal even operator x, we have [ 8 , H] = O. ( 4.4.86) Unlike the parity case, (4.4.86) does not lead to an interesting conservation law. The reason is that (4.4.87)
even if (4.4.86) holds, so our discussion following (4.1.9) of Section 4.1 breaks down. As a result, there is no such thing as the "conservation of
4.4. The Time-Reversal Discrete Symmetry
281
time-reversal quantum number." As we already mentioned, requirement (4.4.86) does, however, lead to a nontrivial phase restriction-the reality of a nondegenerate wave function for a spinless system [see (4.4.59) and (4.4.60)]. Another far-reaching consequence of time-reversal invariance is the Kramers degeneracy. Suppose H and E> commute, and let I n ) and El l n ) be the energy eigenket and its time-reversed state, respectively. It is evident from (4.4.86) that I n ) and Elln) belong to the same energy eigenvalue En ( HEl i n ) = E>H i n ) = EnElln)). The question is, Does I n ) represent the same state as El l n ) ? If it does, I n ) and El l n ) can differ at most by a phase factor. Hence,
( 4.4.88)
Applying e again to (4.4.88), we have El 2 1 n ) e - iBe + iB i n ) ; hence,
=
EJ e i81 n ) = e- '8Elln)
=
( 4.4.89) But this relation is impossible for half-integer j systems, for which E> 2 is always - 1, so we are led to conclude that I n ) and E> I n ) , which have the same energy, must correspond to distinct states-that is, there must be a degeneracy. This means, for instance, that for a system composed of an odd number of electrons in an external electric field E, each energy level must be at least twofold degenerate no matter how complicated E may be. Consider ations along this line have interesting applications to electrons in crystals where odd-electron and even-electron systems exhibit very different behav iors. Historically, Kramers inferred degeneracy of this kind by looking at explicit solutions of the Schrodinger equation; subsequently, Wigner pointed out that Kramers degeneracy is a consequence of time-reversal invariance. Let us now turn to interactions with an external magnetic field. The Hamiltonian H may then contain terms like (4.4.90) (8 = v x A), S·8 p·A + A · p , where the magnetic field is to be regarded as external. The operators S and p are odd under time reversal; these interaction terms therefore do lead to (4.4.91) As a trivial example, for a spin ! system the spin-up state 1 + ) and its time-reversed state 1 - ) no longer have the same energy in the presence of an external magnetic field. In general, Kramers degeneracy in a system containing an odd number of electrons can be lifted by applying an external magnetic field. Notice that when we treat 8 as external, we do not change 8 under time reversal; this is because the atomic electron is viewed as a closed quantum-mechanical system to which we apply the time-reversal operator. '
282
Symmetry in Quantum Mechanics
This should not be confused with our earlier remarks concerning the invariance of the Maxwell equations (4.4.2) and the Lorentz force equation under t -> - t and (4.4.3). There we were to apply time reversal to the whole world, for example, even to the currents in the wire that produces the B field!
PROBLEMS 1 . Calculate the three lowest energy levels, together with their degeneracies, for the following systems (assume equal mass distinguishable particles): a. Three noninteracting spin 1 particles in a box of length L. b. Four noninteracting spin ! particles in a box of length L. 2. Let Y.; denote the translation operator (displacement vector d); �(il, > ), the rotation operator (it and > are the axis and angle of rotation, respectively); and w the parity operator. Which, if any, of the following pairs commute? Why? a. §.; and §.;. (d and d' in different directions). b. �(it, .p) and �(it', >' ) (it and it' in different directions). c. §.; and w. d. � (il, .p) and w. 3. A quantum-mechanical state 'lt is known to be a simultaneous eigen state of two Hermitian operators A and B which anticommute, AB + BA = O .
What can you say about the eigenvalues of A and B for state 'lt? Illustrate your point using the parity operator (which can be chosen to satisfy w = w - 1 = wt) and the momentum operator. 4. A spin 1 particle is bound to a fixed center by a spherically symmetrical potential. a. Write down the spin angular function '!If;_�0112• m � 112, b. Express ( a • x) '!Yj_�0112· m � I;z in terms of some other '!11/' m . c. Show that your result in (b) is understandable in view of the transformation properties of the operator S·x under rotations and under space inversion (parity). 5. Because of weak (neutral-current) interactions there is a parity-violating potential between the atomic electron and the nucleus as follows:
V = 71. [ 8(3l (x) S· p + S ·p8(3l (x) j ,
where S and p are the spin and momentum operators of the electron, and the nucleus is assumed to be situated at the origin. As a result, the ground state of an alkali atom, usually characterized by In, I, j, m) actually contains very tiny contributions from other eigenstates as
283
Problems
follows:
I n , 1, j, m> � In,
1,
j, m>
+ n'/'j'2:m' cn·trm· I n' , l' , r , m '> .
On the basis of symmetry considerations alone, what can you say about (n' , l' , j' , m ' ) , which give rise to nonvanishing contributions? Suppose the radial wave functions and the energy levels are all known. Indicate how you may calculate Cn ·trm·· Do we get further restrictions on (n ' , l' , j' , m ? 6. Consider a symmetric rectangular double-well potential: '
)
for lxl > a + b ; for a < lxl < a + b ; v = 0 V0 > 0 for lxl < a.
100
7.
8.
9. 10.
Assuming that V0 is very high compared to the quantized energies of low-lying states, obtain an approximate expression for the energy split ting between the two lowest-lying states. a. Let lfi(x , t) be the wave function of a spinless particle corresponding to a plane wave in three dimensions. Show that lfi* x , - t) is the wave function for the plane wave with the momentum direction reversed. b. Let x(n) be the two-component eigenspinor of u · n with eigenvalue + 1 . Using the explicit form of x(n) (in terms of the polar and azimuthal angles p and 'Y that characterize n) verify that - iu2x*(n) is the two-component eigenspinor with the spin direction reversed. a. Assuming that the Hamiltonian is invariant under time reversal, prove that the wave function for a spinless nondegenerate system at any given instant of time can always be chosen to be real. b. The wave function for a plane-wave state at t = 0 is given by a complex function e;p.xf�. Why does this not violate time-reversal invariance? Let be the momentum-space wave function for state lo:}, that is, ( lo:) . Is the momentum-space wave function for the time reversed state elo:) given by Jus or tify your answer. a. What is the time-reversed state corresponding to 2/l(R) I j, m} ? b. Using the properties of time reversal and rotations, prove
(
tj>(p') tj>(p') = p'
tj>(p'), tj>( -p'), tj>* (p'), tj>* ( -p')?
2/l�i,:;(R) = ( - l)m - m'21J<J;,.., - m (R). c. Prove 8 l j, m } = i2m lj, - m}.
1 1 . Suppose a spinless particle is bound to a fixed center by a potential V(x) so asymmetrical that no energy level is degenerate. Using time-
Symmetry in Quantum Mechanics
284
reversal invariance prove (L)
= 0
for any energy eigenstate. (This is known as quenching of orbital an gular momentum.) If the wave function of such a nondegenerate ei genstate is expanded as L L Ftm(r) Y7'(8 , ) , I
m
what kind of phase restrictions do we obtain on F1m(r)? 12. The Hamiltonian for a spin 1 system is given by H = AS,2 + B ( s; - sn .
Solve this problem exactly to find the normalized energy eigenstates and eigenvalues. (A spin-dependent Hamiltonian of this kind actually ap pears in crystal physics.) Is this Hamiltonian invariant under time reversal? How do the normalized eigenstates you obtained transform under time reversal?
CHAPTER 5
Approximation Methods
Few problems in quantum mechanics-with either time-independent or time-dependent Hamiltonians-can be solved exactly. Inevitably we are forced to resort to some form of approximation methods. One may argue that with the advent of high-speed computers it is always possible to obtain the desired solution numerically to the requisite degree of accuracy; never theless, it remains important to understand the hasic physics of the approximate solutions even before we embark on ambitious computer calculations. This chapter is devoted to a fairly systematic discussion of approximate solutions to bound-state problems. 5.1.
TIME-INDEPENDENT PERTURBATION THEORY: NONDEGENERATE CASE Statement of the Problem
The approximation method we consider here is time-independent perturbation theory-sometimes known as the Rayleigh-Schrbdinger per turbation theory. We consider a time-independent Hamiltonian H such that it can be split into two parts, namely, H = H0 +
V,
(5.1.1) 2R5
286
Approximation Methods
where the V = 0 problem is assumed to have been solved in the sense that both the exact energy eigenkets ln<0l ) and the exact energy eigenvalues £�0l are known: (5.1 .2) We are required to find approximate eigenkets and eigenvalues for the full Hamiltonian problem (5.1.3) where V is known as the perturbation; it is not, in general, the full-potential operator. For example, suppose we consider the hydrogen atom in an external electric or magnetic field. The unperturbed Hamiltonian H0 is taken to be the kinetic energy p2j2m and the Coulomb potential due to the presence of the proton nucleus e 2jr. Only that part of the potential due to the interaction with the external E or B field is represented by the perturbation V. Instead of (5.1.3) it is customary to solve -
(5.1.4) where ,\ is a continuous real parameter. This parameter is introduced to keep track of the number of times the perturbation enters. At the end of the calculation we may set ,\ --> 1 to get back to the full-strength case. In other words, we assume that the strength of the perturbation can be controlled. The parameter ,\ can be visualized to vary continuously from 0 to 1 , the A = 0 case corresponding to the unperturbed problem and A = 1 correspond ing to the full-strength problem of (5.1.3). In physical situations where this approximation method is applicable, we expect to see a smooth transition of 0 ln°) into I n ) and £� 1 into En as ,\ is "dialed" from 0 to 1 . The method rests on the expansion of the energy eigenvalues and energy eigenkets in power of A. This means that we implicitly assume the analyticity of the energy eigenvalues and eigenkets in a complex ,\-plane around ,\ = 0. Of course, if our method is to be of practical interest, good approximations can better be obtained by taking only one or two terms in the expansion.
The Two-State Problem Before we embark on a systematic presentation of the basic method, let us see how the expansion in ,\ might indeed be valid in the exactly soluble two-state problem we have encountered many times already. Sup-
5.1.
2R7
Time-Independent Perturbation Theory: Nondegenerate Case
pose we have a Hamiltonian that can be written as
H = E{oll 1 (Ol )(1 (Ol l + Eioli2<0l ) (2<0J I + ;\ Vui1 <01 ) (2<01 1 + AV21 12 <01)( 1 <011,
(5.1 .5)
where 11
(
£ (0) H= I ;\ V2l
(5.1 .6)
where we have used the basis formed by the unperturbed energy eigenkets. The V matrix must, of course, be Hermitian; let us solve the case when V1 2 and V21 are real:
(5.1 .7)
hence, by Hermiticity
( 5 . 1 .8) This can always be done by adjusting the phase of 12<01) relative to that of 11<01). The problem of obtaining the energy eigenvalues here is completely analogous to that of solving the spin-orientation problem, where the ana logue of (5.1.6) is
)
a1 a0 - a3 '
(5.1 .9)
where we assume a = (a1,0, a3) is small and a 0 , a1, a3 are all real. The eigenvalues for this problem are known to be just
(5.1 .10) By analogy the corresponding eigenvalues for (5.1 .6) are
( 5 . 1 .11) Let us suppose A I Vd is small compared with the relevant energy scale, the difference of the energy eigenvalues of the unperturbed problem: We can then use
A I Vd « IE{01 - Ei011. �= 1+
1 -c2 -< + ... 2 8
(5.1 .12) ( 5 . 1 .13)
to obtain the expansion of the energy eigenvalues m the presence of
Approximation Methods
288
perturbation A J Vd, namely,
(5.1 .14)
These are expressions that we can readily obtain using the general for malism to be developed shortly. It is also possible to write down the energy eigenkets in analogy with the spin-orientation problem. The reader might be led to believe that a perturbation expansion always exists for a sufficiently weak perturbation. Unfortunately this is not necessarily the case. As an elementary example, consider a one-dimensional problem involving a particle of mass m in a very weak square-well potential of depth fi'0 ( V = - V0 for - a < x < a , V = 0 for lxl > a ) . This problem admits one bound state of energy, (5 . 1 .15 )
We might regard the square well as a very weak perturbation to be added to the free-particle Hamiltonian and interpret result (5.1 . 1 5) as the energy shift in the ground state from zero to IAVI 2. Specifically, because (5.1 .15) is quadratic in V, we might be tempted to associate this as the energy shift of the ground state computed according to second-order perturbation theory. However, this view is false because if this were the case, the system would also admit an E < 0 state for a repulsive potential case with A negative, which would be sheer nonsense. Let us now examine the radius of convergence of series expansion (5.1.14). If we go back to the exact expression of (5.1 . 1 1 ) and regard it as a function of a complex variable A , we see that as I A I is increased from zero, branch points are encountered at
+ i( E 1\ J Vd = \
-
< Ol 1
2
- E2(Ol )
.
( 5 . 1 . 1 6)
The condition for the convergence of the series expansion for the A = 1 full-strength case is (5 .1 .17 )
If this condition is not met, perturbation expansion (5.1 .14) is meaningless.* *See the discussion on convergence following ( 5 . 1 .44), under general remark>.
5.1.
Time-Independent Perturbation Theory: Nondegenerate Case
289
Formal Development of Perturbation Expansion We now state in more precise terms the basic problem we wish to solve. Suppose we know completely and exactly the energy eigenkets and energy eigenvalues of (5.1 .18)
The set { ln<0l ) } is complete in the sense that the closure relation 1 = I: n ln<0l ) ( n <0ll holds. Furthermore, we assume here that the energy spectrum is nondegenerate; in the next section we will relax this assumption. We are interested in obtaining the energy eigenvalues and eigenkets for the problem defined by (5.1 .4). To be consistent with (5.1 .18) we should write (5.1 .4) as
( H0 + .\ V ) I n );,
(5.1 .19) E�"lln)" to denote the fact that the energy eigenvalues £� A ) and energy eigenkets I n ) A are functions of the continuous parameter ,\; however, we will usually =
dispense with this correct but more cumbersome notation. As the continuous parameter ,\ is increased from zero, we expect the energy eigenvalue En for the nth eigenket to depart from its unperturbed value £�0l, so we define the energy shift for the n th level as follows: • = En - £n
(5.1 .20)
The basic Schrodinger equation to be solved (approximately) is
(5.1 .21 ) We may be tempted to invert the operator £�0l - H ; however, in general, 0 the inverse operator lj( £�0) - H ) is ill defined because it may act on 0 ln<0l ) . Fortunately in our case ( ,\V - � n)ln) has no component along l n <0l), as can easily be seen by multiplying both sides of (5.1.21 ) by (n<0ll on the left:
(5.1 .22) Suppose we define the complementary projection operator
1 1 "' " l k(Ol )( k (O) I . En� - H � k� * n En� - Ek� 0 Also from (5.1 .22) and (5.1.23), it is evident that =
( .\V - � n ) ln) = >n (,\V - � n ) l n ) .
(5.1 .24)
(5.1 .25)
Approximation Methods
290
We may therefore be tempted to rewrite (5.1.21 ) as ?
1 ( 5 .1 .26)
suitable final form is
( 5 . 1 .27) where
(5.1 .28) Note that
(5. 1 . 29 ) For reasons we will see later, it is convenient to depart from the usual normalization convention
(5.1 .30)
(nln) = l . Rather, we set
(5.1.31)
even for A -=f. 0 . W e can always do this if we are not worried about the overall normalization because the only effect of setting c * 1 is to introduce a common multiplicative factor. Thus, if desired, we can always normalize the ket at the very end of the calculation. It is also customary to write n
(5.1 .32) and similarly
1 1 1 = n = n n En(Ol - H0
(5.1 .33)
so we have
(5.1 .34) We also note from (5.1 .22) and (5.1.31) that
D. n = A( n
(5.1 .35) Everything depends on the two equations in (5.1 .34) and (5.1.35). Our basic strategy is to expand In) and D. n in the powers of A and then
5.1.
291
Time-Independent Perturbation Theory: Nondegenerate Case
match the appropriate coefficients. This is justified because (5.1.34) and (5.1.35) are identities which hold for all values of between 0 and 1 . We begin by writing
A
n
n
A Un
� A �(l ) + ' 2 A(2) + . . . =
(5.1 .36) ·
Substituting (5.1 .36) into (5.1 .35) and equating the coefficient of various powers of we obtain
A,
AI): �<� l ( n
=
=
( 5 .1 .37) 0(
AN ) �<:;! ) = ( n
so to evaluate the energy shift up to order it is sufficient to know only up to order We now look at (5.1.34); when it is expanded using (5.1 .36), we get
In)
AN
AN-I_ ln-, ( AV - A t�
-, V l n<0l). (5.1 .40) 0 Knowing �?;l , we can work out the A2-term in ket equation (5.1 .38) also =
"
H
"
· · ·
n
H
using (5.1.39) as follows:
En - H0
n
H
-, V ln<0l ) . 0
( 5 . 1 .41)
Approximation Method>
292
Clearly, we can continue in this fashion as long as we wish. Our operator method is very compact; it is not necessary to write down the indices each time. Of course, to do practical calculations we must use at the end the explicit form of cl>n as given by (5.1.23). To see how all this works, we write down the explicit expansion for the energy shift � n = En - £n<0> (5.1 .42) where (5.1 .43) that is, the matrix elements are taken with respect to unperturbed kets. Notice that when we apply the expansion to the two-state problem we recover the earlier expression (5.1.14). The expansion for the perturbed ket goes as follows:
+ . . . .
(5.1 .44)
Equation (5.1 .44) says that the n th level is no longer proportional to the unperturbed ket Jn<0> ) but acquires components along other unperturbed energy kets; stated another way, the perturbation V mixes various unper turbed energy eigenkets. A few general remarks are in order. First, to obtain the first-order energy shift it is sufficient to evaluate the expectation value of V with respect to the unperturbed kets. Second, it is evident from the expression of the second-order energy shift (5.1.42) that two energy levels, say the i th level and the j th level, when connected by f";1 tend to repel each other; the lower one, say the i th level, tends to get depressed as a result of mixing with the higher jth level by JV:) 2/( E/0> - E/0> ), while the energy of the jth level goes up by the same amount. This is a special case of the no-level crossing theorem, which states that a pair of energy levels connected by perturbation do not cross as the strength of the perturbation is varied. Suppose there is more than one pair of levels with appreciable matrix elements but the ket Jn ) , whose energy we are concerned with, refers to the ground state; then each term in (5.1 .42) for the second-order energy shift is negative. This means that the second-order energy shift is always negative
5 . 1 . Time-Independent Perturbation Theory: Nondegenerate Case
293
for the ground state; the lowest state tends to get even lower as a result of miXIng. It is clear that perturbation expansions (5.1 .42) and (5.1 .44) will converge if JV,,/( E/0l - E}0l )J is sufficiently "small." A more specific crite rion can be given for the case in which H0 is simply the kinetic-energy operator (then this Rayleigh-Schrooinger perturbation expansion is just the Born series): At an energy £0 < 0, the Born series converges if and only if neither H0 + V nor H0 - V has bound states of energy E � £0 (R. G. Newton 1982, p. 233).
Wave-function Renormalization We are in a position to look at the normalization of the perturbed ket. Recalling the normalization convention we use, (5.1.31), we see that the perturbed ket Jn) is not normalized in the usual manner. We can renormal ize the perturbed ket by defining (5 .1 .45) ln) N = Z�12 Jn), where Zn is simply a constant with N
z; l = (nJn) = ((n
(5.1 .48a)
J Vkn 1 2 Z =< 1 - A2 L H n ( E� - E2}2
(5 .1 .48b)
II
so up to order A.2 , we get for the probability of the perturbed state to be found in the corresponding unperturbed state n
The second term in (5.1.48b) is to be understood as the probability for
294
Approximation Methods
" leakage" to states other than J n (O) ) . Notice that zn is less than 1 , as expected on the basis of the probability interpretation for Z. It is also amusing to note from (5. 1.42) that to order l\.2 , Z is related to the derivative of En with respect to E�0) as follows: z
n
8En . aEn(O)
( 5 . 1 .49)
= --
We understand, of course, that in taking the partial derivative of En with respect to E�0l, we must regard the matrix elements of V as fixed quantities. Result (5.1 .49) is actually quite general and not restricted to second-order perturbation theory.
Elementary Examples To illustrate the perturbation method we have developed, let us look at two examples. The first one concerns a simple harmonic oscillator whose unperturbed Hamiltonian is the usual one: (5 . 1 .50 )
Suppose the spring constant k = m w2 is changed slightly. We may represent the modification by adding an extra potential ( 5 . 1 .5 1 ) V = �emw2x 2 , where e is a dimensionless parameter such that e « 1. From a certain point of view this is the silliest problem in the world to which to apply perturba tion theory; the exact solution is immediately obtained just by changing w as follows: ( 5 .1 . 52 ) yet this is an instructive example because it affords a comparison between the perturbation approximation and the exact approach. We are concerned here with the new ground-state ket JO) in the presence of V and the ground-state energy shift � 0 : JO) = J0 <0J ) + L J k <0J )
and
� 0 = v,
[)()
+
k * 0
"
L...
k * ()
vk O EJO) - EkO)
£ (0) - £ (0)
J Vkol 2
0
k
+ .
..
+
· ·
·
.
( 5 . 1 .53a) ( 5 . 1 . 53b)
The relevant matrix elements are (see Problem 5 in this chapter)
( ) ( )
2 Voo = £m2w
( Q (O ) J x 2 J0 (0) ) = £ n w
e w2 v2o = �
( 2 (0) J x 2 J0 (0) > = e n w .
4
2 fi
( 5 . 1 . 54)
5 . 1 . Time-Independent Perturbation Theory: Nondegenerate Case
295
All other matrix elements of form Vk o vanish. Noting that the nonvanishing energy denominators in (5. 1 . 5 3a) and (5. 1 . 5 3b) are -2nw, we can combine everything to obtain (5.1.55a) and �0 = E0 - EJ0>= nw[: - ;� + 0(<3)]. (5.1.55b) Notice that as a result of perturbation, the ground-state ket, when expanded (O ) incomponent terms ofalorigi n al unperturbed energy eigenkets { 1 n ) } , acquires a o ng the second excited state. The absence of a component along the first excited state is not surprising because our total H is invariant under parity; hence, an energy eigenstate is expected to be a parity eigenstate. energy Ashiftcomparison as follows:with the exact method can easily be made for the h w ( h2w )� = ( h2w ) [ 1 + ; - �2 + . . . ] , (5.1.56) 2 inthecomplete agreement with (5. 1 . 5 5b). As for the perturbed ket, we look at change in the wave function. In the absence of V the ground-state wave function is (5.1.57) where (5.1.58) Substitution (5.1.52) leads to Xo (5.1. 5 9) X o ---> + < 1 4 ' / (1 ) hence, ___.
•
1 1
7T l /4 Fo
""' --
--
e
- x 2 j2 x o2
+
E
7T l/4 Fo
e
- x 2 j2 x 2 o
[8 1
1
]J
X2
4x
- - - -
(5.1.60)
Approximation Method>
296
( ) -x'12x'[
where we have used 0 1 0 (x l zl l ) = -- (x i 0( ))H2 � 2� Xo 1 1 1 (5 . 1 .61) o -2 + = 2, ;:--2 174 , ,c- e 'IT v Xo v and H2(xlx0) is a Hermite polynomial of order 2. As another illustration of nondegenerate perturbation theory, we discuss the quadratic Stark effect. A one-electron atom-the hydrogen atom or a hydrogenlike atom with one valence electron outside the closed (spherically symmetrical) shell-is subjected to a uniform electric field in the positive z-direction. The Hamiltonian H is split into two parts, 2 p Ho = - + V0(r) and V = - e i E i z (e 0 for the electron). 2m (5. 1 .62) [Editor's Note: Since the perturbation V � - oo as z � - oo , particles bound by H0 can, of course, escape now, and all formerly bound states acquire a finite lifetime. However, we can still formally use perturbation theory to calculate the shift in the energy. (The imaginary part of this shift, which we shall ignore here, would give us the lifetime of, the state or the width of the corresponding resonance.)]
<
It is assumed that the energy eigenkets and the energy spectrum for the unperturbed problem (H0 only) are completely known. The electron spin turns out to be irrelevant in this problem, and we assume that with spin degrees of freedom ignored, no energy level is degenerate. This assumption does not hold for n � 1 levels of the hydrogen atoms, where V0 is the pure Coulomb potential; we will treat such cases later. The energy shift is given by l zk; l z • = - e l E I zkk + ezi E i z " (5. 1 .63) ak + L (O) j" k Ek - Ej(O) where we have used k rather than n to avoid confusion with the principal 0 quantum number n . With no degeneracy, I k< l) is espected to be a parity eigenstate; hence, zkk = 0, (5. 1 .64)
··· '
as we saw in Section 4.2. Physically speaking, there can be no linear Stark effect, that is, there is no term in the energy shift proportional to I E I because the atom possesses a vanishing permanent electric dipole, so the energy shift is quadratic in I E I if terms of order e3 l E 1 3 or higher are ignored. Let us now look at zkj • which appears in (5. 1 .63) , where k (or j) is the collective index that stands for (n , I, m ) and (n ' , I' , m ' ) . First, we recall the selection rule [see (3.10.39)] = l 1 (n ' , l' m ' l z l n , lm) = 0 unless (5 . 1 .65) ' = ;;;
{['
m
+
297
5. 1 . Time-Independent Perturbation Theory: Nondegenerate Case
that follows from angular momentum (the Wigner-Eckart theorem with T�120) and parity considerations.
There is another way to look at the m-selection rule. In the presence of V, the full spherical symmetry of the Hamiltonian is destroyed by the external electric field that selects the positive z-direction, but V (hence the total H ) is still invariant under rotation around the z-axis; in other words, we still have a cylindrical symmetry. Formally this is reflected by the fact that
(5.1 .66) This means that L= is still a good quantum number even in the presence of V. As a result, the perturbation can be written as a superposition of eigenkets of L= with the same m - m = 0 in our case. This statement is true for all orders, in particular, for the first-order ket. Also, because the second-order energy shift is obtained from the first-order ket [see (5.1.40)), we can understand why only the m = 0 terms contribute to the sum. The polarizability a of an atom is defined in terms of the energy shift of the atomic state as follows:
(5.1 .67) Let us consider the special case of the ground state of the hydrogen atom. Even though the spectrum of the hydrogen atom is degenerate for excited states, the ground state (with spin ignored) is nondegenerate, so the for malism of nondegenerate perturbation theory can be applied. The ground state 10 (0)) is denoted in the (n, /, m ) notation by (1,0, 0), so
l ( k (0l l z l 1 , 0, 0)1 2 a = _2e2 � 0 L., 0 [ E0(O) - £ ( ) ] k "'
k
'
( 5.1 .68 )
where the sum over k includes not only all bound states In, /, m ) (for n > 1) but also the positive-energy continuum states of hydrogen. There are many ways to estimate approximately or evaluate exactly the sum in (5.1 .68) with various degrees of sophistication. We present here the simplest of all the approaches. Suppose the denominator in (5.1 .68) were constant. Then we could obtain the sum by considering L l ( k (0llzll ,O ,O) I2 = L l ( k (0llzl1 , 0 , 0) 1 2 all k
(5.1 .69)
where we have used the completeness relation in the last step. But we can easily evaluate ( z 2 ) for the ground state as follows:
( z 2 ) = ( x2 ) = ( y 2 ) = t ( r 2 ),
(5.1 .70)
and using the explicit form for the wave function we obtain where
a0
( z2 ) = a6, stands for the Bohr radius. Unfortunately the expressiOn for
Approximation Methods
298
polarizability a involves the energy denominator that depends on E}0>, but we know that the inequality
]
- £(o0) + £k(0) > - £( 0) + £ ( 0) = _i!_ 1 - ! (5 1 71) o 1 2a 0 4 holds for every energy denominator in (5.1 .68). As a result, we can obtain
[
·
·
an upper limit for the polarizability of the ground state of the hydrogen atom, namely,
16ab 3 3- "" 5 .3a . a< 0
( 5 . 1 .72)
9a a = 2b = 4.5a 03 .
(5.1 .73)
It turns out that we can evaluate exactly the sum in (5.1 .68) using a method due to A. Dalgarno and J. T. Lewis (Merzbacher 1970, 424, for example), which also agrees with the experimentally measured value. This gives
We obtain the same result (without using perturbation theory) by solving the Schrodinger equation exactly using parabolic coordinates.
5.2. TIME-INDEPENDENT PERTURBATION THEORY: THE DEGENERATE CASE The perturbation method we developed in the previous section fails when the unperturbed energy eigenkets are degenerate. The method of the previ ous section assumes that there is a unique and well-defined unperturbed ket of energy £,�0l which the perturbed ket approaches as A --> 0. With degener acy present, however, any linear combination of unperturbed kets has the same unperturbed energy; in such a case it is not a priori obvious to what linear combination of the unperturbed kets the perturbed ket is reduced in the limit A --> 0. Here specifying just the energy eigenvalue is not enough; some other observable is needed to complete the picture. To be more specific, with degeneracy we can take as our base kets simultaneous eigen kets of H0 and some other observable A , and we can continue labeling the unperturbed energy eigenket by l k (0l ), where k now symbolizes a collective index that stands for both the energy eigenvalue and the A eigenvalue. • When the perturbation operator V does not commute with A, the zeroth order eigenkets for H (including the perturbation) are in fact not A eigenkets. From a more practical point of view, a blind application of formulas like (5.1.42) and (5.1.44) obviously runs into difficulty because
(5.2.1) becomes singular if Vnk is nonvanishing and £,�0l and Ef0l are equal. We must modify the method of the previous section to accommodate such a situation.
5.2.
Time-Independent Perturbation Theory: The Degenerate Case
299
Whenever there is degeneracy we are free to choose our base set of unperturbed kets. We should, by all means, exploit this freedom. Intuitively we suspect that the catastrophe of vanishing denominators may be avoided by choosing our base kets in such a way that V has no off-diagonal matrix elements (such as Vn k = 0 in (5.2.1)). In other words, we should use the linear combinations of the degenerate unperturbed kets that diagonalize H in the subspace spanned by the degenerate unperturbed kets. This is indeed the correct procedure to use. Suppose there is a g-fold degeneracy before the perturbation V is switched on. This means that there are g different eigenkets all with the same unperturbed energy Eb0>. Let us denote these kets by { J m ) ' mED
where the sum is over the energy eigenkets in the degenerate subspace. Before expanding in A, there is a rearrangement of the Schrodinger equation that will make it much easier to carry out the expansion. Let P0 be a projection operator onto the space defined by {lm<0>)}. We define P1 = 1 - P0 to be the projection onto the remaining states. We shall then write the Schrodinger equation for the states I f) as 0 = (E - H0 - AV) j l)
= (E - e<JJ> - AV)P0Il) + (E - H0 - AV)P1 ! l). (5 .2.2) We next separate (5.2.2) into two equations by projecting from the left on (5.2.2) with P0 and PI > (5.2.3) (E - E'JJ> AP0V)Po l l) - AP0VP, Il) = 0 -
(5.2.4)
We can solve (5.2.4) in the P1 subspace because P1(£ - H0 - AP1 VP1) is not singular in this subspace since E is close to £'Jjl and the eigenvalues of P1H0P1 are all different from E'Jj>. Hence we can write
�
I P, l l) = P, E (5.2.5) Ho X.P, VP, P1 VP0 l) or written out explicitly to order A when I f) is expanded as If) Ajf'l) + . . . . 0l < )Vkt k nl) "'\;"' j Pr I l' ) = L. (O) (5.2.6) ( _
k
ED
-
EkO)·
Approximation Method'
300
(
To calculate P0ll), we substitute (5.2.5) into (5.2.3) to obtain
E - E!jjl - AP0VP0 - A2P0VP1
E_
H�
_
AV
)
pt VPo Po l l) = 0.
(5.2.7)
Although there is a term of order A2 in (5 . 2.7) that results from the sub stitution, we shall find that it produces a term of order A in the state Pol l). To order A we obtain the equation for the energies to order A and eigen functions to order zero, (5.2.8)
This is an equation in the g dimensional degenerate subspace and clearly means that the eigenvectors are just the eigenvectors of the g x g matrix P0VP0 and the eigenvalues £<1 l are just the roots of the secular equation det (V - (E - Eljjl )] = 0 (5.2.9) where V = matrix of P0VP0 with matrix elements (m(O) I Vlm ' (0)). Explicitly in matrix form we have (0) l / (0 ) ) . . . (2(11) 1 / (11)) .. . .
( J (II) l f ( O))
. ")((1 ) - ( ) .
(I)
- 6.,
.
(5.2. 10)
The roots determine the eigenvalues t:..) l l-there are g altogether-and by substituting them into (5.2. 10), we can solve for (m(O) I /(0)) for each I up to an overall normalization constant. Thus by solving the eigenvalue problem, we obtain in one stroke both the first-order energy shifts and the correct zeroth-order eigenkets. Notice that the zeroth-order kets we obtain as A � 0 are just the linear combinations of the various I m <0l )'s that diagonalize the perturbation V, the diagonal elements immediately giving the first order shift (5.2. 1 1)
Note also that if the degenerate subspace were the whole space, we would have solved the problem exactly in this manner. The presence of unper turbed "distant" eigenkets not belonging to the degenerate subspace will show up only in higher orders-first order and higher for the energy ei genkets and second order and higher for the energy eigenvalues. Expression (5.2 . 1 1) looks just like the first-order energy shift [see (5 . 1 . 37)] in the nondegenerate case except that here we have to make sure that the base kets used are such that V does not have nonvanishing off diagonal matrix elements in the subspace spanned by the degenerate un perturbed eigenkets. If the V operator is already diagonal in the base ket representation we are using, we can immediately write down the first-order
30l
5.2. Time-Independent Perturbation Theory: The Degenerate Case
shift by taking the expectation value of V, just as in the nondegenerate case. Let us now look at (5.2.7). To be safe we keep all terms in the g X g effective Hamiltonian that appears in (5 .2.7) to order x.Z although we want P0l l) only to order A. We find
(
E - E<jjl - AP0VP0 - A2P0VP1
EiJ]l
� H/1VP0)P0Il) = 0.
(5.2. 12)
Let us call the eigenvalues of the g x g matrix P0 VP0v; and the eigenvectors 0 P0 l i} l). The eigen energies to first order are EP) = EiJ]l + Av;. We assume that the degeneracy is completely resolved so that EPl E?) = A(v; vi) are all non-zero. We can now apply non-degenerate perturbation theory (5. 1 .39) to the g x g dimensional Hamiltonian that appears in (5.2. 12). The resulting correction to the eigenvectors P0l /�0)) is 1 Pu l l)Ol) (O p0 I IP l) = " p1 Vl t(Ol) A (tJ ) I VPI £(0) L. H i i" Vi V; D - 0 or more explicitly
-
I
Po fW l)
-
I
> (0°) 1 Vl k) (O) L = i"Li A vPio W) Ev - V; k•D
1 _
0 (k I V I I;( ) ). ) o E�
(5 .2. 14)
Thus, although the third term in the effective Hamiltonian that appears in (5.2. 12) is of order A2 , it is divided by energy denominators of order A in forming the correction to the eigenvector, which then gives terms of order A in the vector. If we add together (5.2.6) and (5.2. 14), we get the eigen vector accurate to order A. As in the nondegenerate case, it is convenient to adopt the nor malization convention (l(o) l l) = 1. We then have, from (5 .2.3) and (5.2.4), A(f0l f vll) = il1 = At:..} I l + A2t:..F l + · · · . The A-term just reproduces (5.2. 1 1 ) . As for the A2 -term, we obtain t:..F l = (f0l l v 1 Pl) = (f0l l vi P1fll) + (f0l i VI PJPl). Since the vectors Po l 0°)) are eigenvectors of V, the cor rection to the vector, (5.2. 14), gives no contribution to the second-order energy shift, so we find using (5.2.6) A (2) U.t =
" L..,
1 Vktl 2 (5.2.15) ( ) E(O) kED EDO k Our procedure works provided that there is no degeneracy in the roots of secular equation (5.2.9). Otherwise we still have an ambiguity as to which linear contribution of the degenerate unperturbed kets the per turbed kets are reduced in the limit A ---? 0. Put in another way, if our method is to work, the degeneracy should be removed completely in first order. A challenge for the experts: How must we proceed if the degeneracy is not removed in first order, that is, if some of the roots of the secular equation are equal? (See Problem 12 of this chapter.) _
Approximation Methods
302
Let us now summarize the basic procedure of degenerate perturba tion theory: 1 . Identify degenerate unperturbed eigenkets and construct the per turbation matrix V, a g X g matrix if the degeneracy is g-fold. 2. Diagonalize the perturbation matrix by solving, as usual, the appropriate secular equation. 3. Identify the roots of the secular equation with the first-order energy shifts; the base kets that diagonalize the V matrix are the correct zeroth-order kets to which the perturbed kets approach in the limit A. -> 0. 4. For higher orders use the formulas of the corresponding nonde generate perturbation theory except in the summations, where we exclude all contributions from the unperturbed kets in the degen erate subspace D.
Linear Stark Effect As an example of degenerate perturbation theory, let us study the effect of a uniform electric field on excited states of the hydrogen atom. As is well known, in the Schrodinger theory with a pure Coulomb potential with no spin dependence, the bound state energy of the hydrogen atom depends only on the principal quantum number n . This leads to degeneracy for all but the ground state because the allowed values of I for a given n satisfy" (5.2. 16) 0< n. To be specific, for the n = 2 level, there is an I = 0 state called 2s and three I = 1 (m = ± 1 , 0) states called 2p, all with the same energy, - e 2j8a 0 . As we apply a uniform electric field in the z-direction, the appropriate per turbation operator is given by (5.2. 17) V = - ez i E I , which we must now diagonalize. Before we evaluate the matrix elements in detail using the usual (nlm) basis, let us note that the perturbation (5 .2. 1 7 ) has nonvanishing matrix elements only between states of opposite parity, that is, between l = 1 and l = 0 in our case. Furthermore, in order for the matrix element to be nonvanishing, the m-values must be the same because z behaves like a spherical tensor of rank one with spherical com ponent (magnetic quantum number) zero. So the only nonvanishing matrix elements are between 2s (m = 0 necessarily) and 2p with m = 0. Thus 2 p m = 1 2p m = - 1 2p m = 0 2s 0 <2sWI2 p , m = 0) 0 0 o o o V .= <2p , m = O I V I2 s) 0 0 0 0 0 0 0 0 (5.2. 18)
l<
303
5.2. Time-Independent Perturbation Theory: The Degenerate Case
Explicitly,
(2s!VI2 p , rn = 0) = (2p , rn = OIVI2s)
= 3 ea 01E I .
(5.2. 19)
It is sufficient to concentrate our attention on the upper left-hand corner of the square matrix. It then looks very much like the ox matrix, and we can immediately write down the answer-for the energy shifts we get
(5.2.20)
where the subscripts
±
refer to the zeroth-order kets that diagonalize V:
I ±)
=
V2
1
C l 2 s,m = 0)
+
l 2p,m
=
0) ).
(5.2.21)
Schematically the energy levels are as shown in Figure 5 . 1 . Notice that the shift is linear in the applied electric field strength, hence the term the linear Stark effect. One way we can visualize the existence of this effect is to note that the energy eigenkets (5.2.21) are not parity eigenstates and are therefore allowed to have nonvanishing electric permanent dipole moments, as we can easily see by explicitly evaluating (z). Quite generally, for an energy state that we can write as a superposition of opposite parity states, it is permissible to have a nonvanishing permanent electric dipole moment, which gives rise to the linear Stark effect. An interesting question can now be asked. If we look at the " real" hydrogen atom, the 2s level and 2p level are not really degenerate. Due to the spin orbit force, 2p312 is separated from 2p 1 12, as we will show in the next section, and even the degeneracy between the 2s 112 and 2 p 112 levels that persists in the single particle Dirac theory is removed by quantum electrodynamics effects (the Lamb shift). We might therefore ask, Is it realistic to apply degenerate perturbation theory to this problem? A com parison with the exact result shows that if the perturbation matrix elements (12s, m=0>-12p,m=O>) 31eiiE lao
V2
No change for 12p,m=± 1 >
(12s, m=O>+I2p,m=O>)
V2
FIGURE 5.1. Schematic energy-level diagram for the linear Stark effect as an example of degenerate perturbation theory.
Approximation Methods
304
are much larger when compared to the Lamb shift splitting, then the energy shift is linear in IE 1 for all practical purposes and the formalism of degenerate perturbation theory is applicable. On the opposite extreme, if the perturbation matrix elements are small compared to the Lamb shift split ting, then the energy shift is quadratic and we can apply nondegenerate perturbation theory; see Problem 1 3 of this chapter. This incidentally shows that the formalism of degenerate perturbation theory is still useful when the energy levels are almost degenerate compared to the energy scale defined by the perturbation matrix element. In intermediate cases we must work harder; it is safer to attempt to diagonalize the Hamiltonian exactly in the space spanned by all the nearby levels.
5.3. HYDROGENLIKE ATOMS: FINE STRUCTURE AND THE ZEEMAN EFFECT Spin-Orbit Interaction and Fine Structure In this section we study the atomic levels of hydrogenlike atoms, that atoms with one valence electron outside the closed shell. Alkali atoms such as sodium (Na) and potassium (K) belong to this category. The central (spin-independent) potential Vc( r ) appropriate for the valence electron is no longer of the pure Coulomb form. This is because the electrostatic potential >( r ) that appears in V, ( r ) = e
E= -
( ! ) v v, ( r ) .
(5.3.2)
But whenever a moving charge is subjected to an electric field, it " feels" an effective magnetic field given by
Berr = -
( � ) x E.
( 5.3 .3)
5.3. Hydrog,enlike Atoms: Fine Structure and the Zeeman Effect
305
Because the electron has a magnetic moment f.L given by eS (5.3.4) = 11 m ec ' we suspect a spin-orbit potential VLs contribution to H as follows:
= ·(� X E) (= ) • [ ec X ( --; ) ?
HL S ='=
11
f.L
•
Beff
eS
m ,c
p
m
x
1 dV, ( - e ) dr
l (5 .3.5)
When this expression is compared with the observed spin-orbit interaction, it is seen to have the correct sign, but the magnitude turns out to be too large by a factor of two. There is a classical explanation for this due to spin precession (Thomas precession after L. H . Thomas), but we shall not bother with that (Jackson 1975, for example) . We simply treat the spin orbit interaction phenomenologically and take VLs to be one-half of (5. 3.5). The correct quantum-mechanical explanation for this discrepancy must await the Dirac (relativistic) theory of the electron (Sakurai 1967, for instance) . We are now in a position to apply perturbation theory to hydrogenic atoms using VL s as the perturbation ( V of Sections 5.1 and 5.2). The unperturbed Hamiltonian H0 is taken to be p2
H0 = m + V, ( r ) , (5 .3 .6) 2 where the central potential )/, is no longer of the pure Coulomb form for alkali atoms. With just H0 we have freedom in choosing the base kets:
Set 1 : The eigenkets of L2 , L=, S 2 , S=. Set 2:
(5.3.7)
Without VLs (or HLs) either set is satisfqctory in the sense that the base kets are also energy eigenkets. With H1.s added it is far superior to use set 2 of (5.3.7) because L • S does not commute with L= and S=, while it does commute with J 2 and Jz . Remember the cardinal rule: Choose unperturbed kets that diagonalize the perturbation. You have to be either a fool or a masochist to use the Lz , Sz eigenkets [set 1 of (5.3.7)] as the base kets for this problem; if we proceeded to apply blindly the method of degenerate perturbation theory starting with set 1 as our base kets, we would be forced to diagonalize the VLs( HLs) matrix written in the L2 , S= representation. The
Approximation Methods
306
results of this, after a lot of hard algebra, give us just the J 2 , J, eigenkets as the zeroth-order unperturbed kets to be used! In degenerate perturbation theory, if the perturbation is already diagonal in the representation we are using, all we need to do for the first-order energy shift is to take the expectation value. The wave function in the two-component form is explicitly written as '¥nlm = •1,
1 12.m Rnl (r) cyJ�l± I
(5 .3.8)
where 01 { � 1 ± 112 ·m is the spin-angular function of Section 3 . 7 (see (3.7. 64)] . For the first-order shift, we obtain 1 j=l+ 2 1 . j=l-
( .!_ )
d� .!_ d � R r 2 dr = R nl r dr n 1 r dr nl 0 where we have used the m-independent identity [see (3.7.65)]
100
2
(5 . 3 . 9)
1 j=l+ 2 1 . )=I2
(5 .3 .10) Equation (5.3.9) is known as Lande's interval rule. To be specific, consider a sodium atom. From standard atomic spectroscopy notation, the ground-state configuration is The inner 10 electrons can be visualized to form a spherically symmetrical electron cloud. We are interested in the excitation of the eleventh electron from 3s to a possible higher state. The nearest possibility is excitation to 3p. Because the central potential is no longer of the pure Coulomb form, 3s and 3p are now split. The fine structure brought about by VL s refers to even a finer split within 3p, between 3p 112 and 3p312 , where the subscript refers to j. Experimentally, we observe two closely separated yellow lines-known as the sodium D lines one at 5896 A, the other at 5890 A; see Figure 5.2. Notice that 3p312 lies higher because the radial integral in (5.3.9) is positive. To appreciate the order of magnitude of the fine-structure splitting, let us note that for Z = 1
-
(5 .3 .12)
5.3. Hydrogenlike Atoms: Fine Structure and the Zeeman Effect
Doublet or "Fine" structure
307
{ ?.=5896A0 3S1;2
FIGURE 5.2. Schematic diagram of 3s and 3p lines. The 3s and 3p degeneracy is lifted because V, ( r) is now the screened Coulomb potential due to core electrons rather than pure Coulombic: V1.s then removes the 3p 112 and 3p3 12 degeneracy.
just on the basis of dimensional considerations. So the fine-structure split ting is of order ( e 2/a6)( h/mec) 2 , which is to be compared with Balmer splittings of order e 2ja0• It is useful to recall here that the classical radius of the electron, the Compton wavelength of the electron, and the Bohr radius are related in the following way: h e2 2 --2 : - : a 0 : : 1 : 1 37 : (137) , m c mec
(5.3.13)
e
where we have used 1 e2 (5.3.14) he 1 37 · Typically, fine-structure splittings are then related to typical Balmer split tings via -
=
--
( :; :: ) ( ::) ; 2 :
::
( 1 �7 ( 1 •
(5.3 .15)
which explains the origin of the term fine structure. There are other effects of similar orders of magnitude. Specifically, the relativistic mass correction arising from the expansion (5 .3.16)
is of the same order.
The Zeeman Effect We now discuss hydrogen or hydrogenlike (one-electron) atoms in a uniform magnetic field-the Zeeman effect, sometimes called the anomalous Zeeman effect with the electron spin taken into account. Recall that a
Approximation Methods
308
uniform magnetic field B is derivable from a vector potential
A = i(B x r) .
( 5 .3 .17)
For B i n the positive z-direction (B = Bz),
A = - t ( By x - Bxy) (5.3.18) stands for IB I. Apart from the spin term, the interaction
suffices where B Hamiltonian is generated by the substitution p --> p -
A. c e
( 5 .3 .19)
We therefore have ( 5 .3 .20)
Because
(x'lp•A(x) l) = - iliV '·[A(x')(x 'l )] = (x 'IA ( x )·pl ) + (x 'l ) [ - iliV '·A(x') ] , it is legitimate to replace p · A by A · p whenever v · A(x ) = 0,
( 5 .3 .21) ( 5 . 3 .22)
which is the case for the vector potential of (5.3.18). Noting
A·p = IB I( - hPx + ! xpy ) = ! IB I L,
( 5 .3 .23)
and ( 5 .3 .24)
we obtain for (5.3.20) p2
H _- 2 m-, + Vc ( r ) - IB I L, + 2m ,c e
2 e 2 2 2 181 ( X + y ) . ( 5 .3 .25) 2 8m c e
To this we may add the spin magnetic-moment interaction -e -e - IL · B = B S,. m ,c S · B = m ,c I I
( 5 .3 .26)
The quadratic IBI 2 (x 2 + y 2 ) is unimportant for a one-electron atom; the analogous term is important for the ground state of the helium atom where L�tot) and syo•> both vanish. The reader may come back to this problem when he or she computes diamagnetic susceptibilities in Problems 1 8 and 19 of this chapter.
309
5.3. Hydrogenlike Atoms: Fine Structure and the Zeeman Effect
To summarize, omitting the quadratic term, the total Hamiltonian is made up of the following three terms: p2
= me 1 HL S 2 m H0
+ ( ) 2 - Vc r
=
2 2 c
e
(5.3 .27a)
1 d Vc( r ) L· S r dr
( 5.3.27b )
-
(5.3.27c) Notice the factor 2 in front of Sz; this reflects the fact that the g-factor of the electron is 2. Suppose H8 is treated as a small perturbation. We can study the effect of H8 using the eigenkets of H0 + HLS - the J 2 , J, eigenkets-as our base kets. Noting (5.3 .28) Lz + 2 Sz Jz S, , the first-order shift can be written as
= +
(5 .3 .29) The expectation value of Jz immediately gives recall . 1
mh . As for
(SJ, we first
J=l± 2 ,m) = ± I ± m +1 l + m +11 l m 1= m + � , m s = - � )· + 21 + 1
21
(5 .3.30)
The expectation value of Sz can then easily be computed:
(SJJ-i± l/2. m
= h ( I2 l I2) = ; � [(I± m + � )-(I + m + �)] = ± (2�: 1) . 2 lc +
( 2/
-
e-
1)
(5 .3.31)
In this manner we obtain Lande's formula for the energy shift (due to B field), 1 (5.3.32) c (2 / 1 )
!lEn = ;emhBm [l ± + ] ,
·
m.
We see that the energy shift of (5.3.32) is proportional to To understand the physical origin for this, we present another method for
Approximation Methods
310
(5.3.31).
deriving We recall that the expectation value of Sz can also be obtained using the projection theorem of Section We get [see
3.10. ( Sz)j � i� ll2.m = [ <S · J)i� i · ti2j 2 �: hj 1)
{3.10.45)]
2 = m (J2 + S2 - L );� i � tn 2hj{j
+ 1) [{l±�)(/±�+1)+�-/(/+1)1 = mh 2{/±�){l±�+ 1) J mh (5 .3.33) = + -:---:---:{2/+1) ' which is in complete agreement with (5.3.31 ) . In the foregoing discussion the magnetic field is treated as a small
perturbation. We now consider the opposite extreme-the Paschen-Back limit-with a magnetic field so intense that the effect of H8 is far more important than that of HLs• which we later add as a small perturbation. With H0 H8 only, the good quantum numbers are L= and S=' Even J 2 is no good because spherical symmetry is completely destroyed by the strong 8 field that selects a particular direction in space, the z-direction.We are left with cylindrical symmetry only-that is, invariance under rotation around the z-axis. So the Lz, Sz eigenkets J l, s = 1 , m1, ms) are to be used as our base kets. The effect of the main term H8 can easily be computed:
+
( .3.3
5 4)
The 2(2 / + 1 ) degeneracy in m1 and ms we originally had with H0 [see (5.3.27a)] is now reduced by H8 to states with the same ( m 1 ) + (2m , ), namely, ( m 1 ) + (1) and (m1 + 2) + ( - 1). Clearly we must evaluate the expec tation value of L · S with respect to Jm1, m s ) :
(5.3.35) where we have used (5 .3.36) Hence, ( . . 37 )
53
5.3. Hydrogenlike Atoms: Fine Structure and the Zeeman Effect
311
*The exception is the stretched configuration, for example, p312 with m = ±
L, and
S, are both good ; this is because magnetic quantum number
can be satisfied in only one way.
�.
J, , m = m1
+
Here m,
In many elementary books there are pictorial interpretations of the weak-field result (5.3.32) and the strong-field result (5.3.34), but we do not bother with them here. We simply summarize our results in Table 5-1 , where weak and strong 8 fields are "calibrated" by comparing their magnitudes enB/2m,c with (l/137) 2e 2ja0• In this table almost good simply means good to the extent that the less dominant interaction could be ignored. Specifically, let us look at the level scheme of a p electron I = 1 ( p312 , p112 ) . In the weak 8 case the energy shifts are linear in 8, with slopes determined by
As we now increase 8, mixing becomes possible between states with the same m-value-for example, p312 with m = ± � and p 1 12 with m = ± t in this connection note the operator Lz + 2Sz that appears in H8 [(5 . 3 . 27c)] is a rank 1 tensor operator T�k�o1 ) with spherical component q = 0. In the intermediate 8 region simple formulas like (5. 3 . 32) and (5.3 . 34) for the expectation values are not possible; it is really necessary to diagonalize the appropriate 2 x 2 matrix (Gottfried 1966, 371-73). In the strong 8 limit the energy shifts are again proportional to 1 8 1 ; as we see in (5 .3.34), the slopes are determined by m1 + 2m5 •
Van der Waals' Interaction An important, nice application of the Rayleigh-Schrooinger per turbation theory is to calculate the long-range interaction, or van der Waals' force, between two hydrogen atoms in their ground states. It is easy to show that the energy between the two atoms for large separation r is attractive and varies as r - 6• Consider the two protons of the hydrogen atoms to be fixed at a distance r (along the z-axis) with r1 the vector from the first proton to its electron and r2 the vector from the second proton to its electron; see Figure
Approximation Methods
312
r
2
z-axis
}-------.-.. + }----r
FIGURE 5.3. Two hydrogen atoms with their protons ( + ) separated by a fixed distance r and their electrons ( - ) at displacements r, from them.
5.3. Then the Hamiltonian H can be written as H= H +V 0 h2 e2 e2 - -;:(5.3 .38) -;Ho = ) V'i + Y'{ ( 2m 1 2 e2 e2 e2 e2 V = - + ---r lr + r2 - r11 lr + r21 jr - rd The lowest-energy solution of H0 is simply the product of the ground-state
wave functions of the noninteracting hydrogen atoms
(5.3.39) U0<0l = Ul�(rl ) U1� (r2 ) Now for large r ( » the Bohr radius a0) expand the perturbation V in powers of r; /r to obtain 1 e2 (5.3 .40) V = -3 ( x 1 x 2 + y1 y2 - 2z 1 z 2 ) + 0 -4 + · · · r r The lowest-order ,- 3-term in (5.3.40) corresponds to the interaction of two electric dipoles e r1 and e r2 separated by r. The higher-order terms represent higher-order multipole interactions, and thus every term in V involves spherical harmonics lfm with I; > 0 for each hydrogen atom. Hence, for each term in (5.3.40) the first-order perturbation energy matrix element V00 ::::: 0, since the ground state U0<0l wave function (5.3.39) has I; = 0 (and = 0 for I and m * 0). The second-order perturbation
( )
f dr!. Yt(f!.)
l ( k
( 5 . 3 .41 )
will be nonvanishing. We immediately see that this interaction varies as 1jr6; since £h0l > EJ0l, it is negative. This 1jr6 long-range attractive van der Waals' potential is a general property of the interaction between two atoms in their ground state.* *See the treatment in Schiff ( 1968), pages 261-263, which gives a lower and upper bound on the magnitude of the van der Waals' potential from (5.3.41) and from a variational calculation. Also note the first footnote on page 263 of Schiff concerning retardation effects.
313
5.4. Variational Methods
5.4. VARIATIONAL METHODS The perturbation theory developed in the previous section is, of course, of no help unless we already know exact solutions to a problem whose Hamiltonian is sufficiently similar. The variational method we now discuss is very useful for estimating the ground state energy E0 when such exact solutions are not available. We attempt to guess the ground-state energy E0 by considering a " trial ket" 10 ), which tries to imitate the true ground-state ket 10 ). To this end we first obtain a theorem of great practical importance. We define H such that (5.4.1) where we have accommodated the possibility that 10 ) might not be normal ized. We can then prove the following. Theorem.
(5.4.2) This means that we can obtain an upper bound to E0 by considering various kinds of 10 ) . The proof of this is very straightforward. Proof Even though we do not know the energy eigenket of the Hamiltonian H, we can imagine that 10 ) can be expanded as
10) = L: l k ) ( k iO ) k�O where lk) is an exact energy eigenket of H: Hlk) = E, l k ) .
(5.4 .3) (5 .4.4)
The theorem (5.4.2) follows when we use Ek = Ek - E0 + E0 to evaluate H in (5.4.1). We have L l(ki0 ) 1 2Ek k ( 5 .4.5a) H= �O
L: l(kiO ) I2 k�O
oc
-
L l(k i0 ) 12( Ek - Eo )
k �J
L: l(kl6 ) 1 2 k�O
+ Eo
( 5 .4. 5 b ) ( 5 .4 . 5 c)
Approximation Methods
314
where we have used the fact that Ek - E0 in the first sum of (5.4.5b) is necessarily positive. It is also obvious from this proof that the equality sign in (5.4.2) holds only if 1 6> coincides exactly with I O), that is, if the coef ficients (k iO) all vanish for k � 0. D The theorem (5.4.2) is quite powerful because H provides an upper bound to the true ground-state energy. Furthermore, a relatively poor trial ket can give a fairly good energy estimate for the ground state because if _
( k iO ) - O( E ) for k * 0,
(5 .4.6)
then from (5.4.5) we have (5.4.7) We see an example of this in a moment. Of course, the method does not say anything about the discrepancy between H and E0; all we know is that H is larger than (or equal to) £0. Another way to state the theorem is to assert that H is stationary with respect to the variation (5 .4.8) that is, 8H = 0 when 10) coincides with 10) . By this we mean that if 10) + o i O) is used in place of 10) in (5.4.5) and we calculate H, then the error we commit in estimating the true ground-state energy involves IO) to order
( 8 10))2.
The variational method per se does not tell us what kind of trial kets are to be used to estimate the ground-state energy. Quite often we must appeal to physical intuition-for example, the asymptotic behavior of wave function at large distances. What we do in practice is to characterize trial kets by one or more parameters At, ;\ 2 , . . . , and compute H as a function of At, A 2 , . . . . We then minimize H by (1) setting the derivative with respect to the parameters all zero, namely, iJH
aA I = O,
aH = 0, . . . , aA 2
(5 .4.9)
(2) determining the optimum values of At, A 2 , . . . , and (3) substituting them back to the expression for H. If the wave function for the trial ket already has a functional form of the exact ground-state energy eigenfunction, we of course obtain the true ground-state energy function by this method. For example, suppose some body has the foresight to guess that the wave function for the ground state of the hydrogen atom must be of the form (5 .4.10) where a is regarded as a parameter to be varied. We then find, upon minimizing H with (5.4.10), the correct ground-state energy e 2j2a0 • Not -
315
5.4. Variational Methods
surprisingly, the minimum is achieved when a coincides with the Bohr radius a 0 . As a second example, we attempt to estimate the ground state of the infinite-well (one-dimensional box) problem defined by V=
for lxl < a for l x l > a .
{ 0,
00 ,
(5.4.11)
The exact solutions are, of course, well known:
( 1TX )
1 (xiO ) = {(i cos 2a ,
(5.4.12)
But suppose we did not know these. Evidently the wave function must vanish at x = ± a; furthermore, for the ground state the wave function cannot have any wiggles. The simplest analytic function that satisfies both requirements is just a parabola going through x = ± a:
(xl0 ) = a2 - x2 ,
(5.4.13)
where we have not bothered to normalize 10 ) . Here there is no variational parameter. We can compute H as follows:
H=
a ) f i ? ( 2m f -a (a2 - x2) -dxd22 (az - xz)dx .=_
J�a (a2 - x2)2 dx
�---------------
----
- (��) (;:2�)
= 1 .0132 £0•
(5.4.14)
It is remarkable that with such a simple trial function we can come within 1 .3% of the true ground-state energy. A much better result can be obtained if we use a more sophisticated trial function. We try
(5.4.15) where A. is now regarded as a variational parameter. Straightforward al gebra gives
[(A. + 1 )(2A. + 1 )] (2A.
which has a minimum at
- 1)
4ma2) ' (_!l_
( ) ' = 1 + /6 '"" 1 . 72 ,
{\
2
(5.4. 16)
(5.4.17)
Approximation Methods
316
- ( 5 + 2/6 ) Eo "" 1 .00298£0.
not far from A. = 2 (a parabola) considered earlier. This gives
Hmin =
'1r 2
(5.4.18 )
So the variational method with (5.4.15) gives the correct ground-state energy within 0.3% - a fantastic result considering the simplicity of the trial func tion used. How well does this trial function imitate the true ground-state wave function? It is amusing that we can answer this question without explicitly evaluating the overlap integral (010) . Assuming that iO) is normalized, we have [from (5.4.1)-(5.4.4)]
Hmin
00
= kL� O l(k i0) 1 2Ek 2 I (Oi0) 1 2E0 + 9E0( 1 - I(Oi0) 1 2 )
(5.4.19)
where 9£0 is the energy of the second excited state; the first excited state (k = 1) gives no contribution by parity conservation. Solving for 1 (010) 1 and using (5.4.18), we have
1(010 ) 1 2 2
9£° - Hm. 8£0 m = 0.99963 .
.
(5 .4 20)
Departure from unity characterizes a component of IO ) in a direction orthogonal to 10 ) . If we are talking about "angle" 8 defined by (5.4.21 ) (010) = cos O , then (5.4.20) corresponds to (J :$ 1 .1 ° , (5 .4.22 ) so 10 ) and 10 ) are nearly " parallel." One of the earliest applications of the variational method involved the ground-state energy of the helium atom, which we will discuss in Section 6.1. We can also use the variational method to estimate the energies of first excited states; all we need to do is work with a trial ket orthogonal to the ground-state wave function-either exact, if known, or an approximate one obtained by the variational method. 5.5.
TIME-DEPENDENT POTENTIALS: THE INTERACfiON PICTURE Statement of the Problem
So far in this book we have been concerned with Hamiltonians that do not contain time explicitly. In nature, however, there are many quantum-mechanical systems of importance with time dependence. In the
5.5.
Time-Dependent Potentials: The Interaction Picture
317
remaining part of this chapter we show how to deal with situations with time-dependent potentials. We consider a Hamiltonian H such that it can be split into two parts, (5.5.1) H = H0 + V( t ) , where H0 does not contain time explicitly. The problem V( t ) = 0 is assumed to be solved in the sense that the energy eigenkets In) and the energy eigenvalues £, defined by (5 .5 .2) are completely known.* We may be interested in situations where initially only one of the energy eigenstates of H0 - for example, li )-is populated. As time goes on, however, states other than li) are populated because with V( t ) * 0 we are no longer dealing with " stationary" problems; the time evolution operator is no longer as simple as e - , Ht / h when H itself involves time. Quite generally the time-dependent potential V( t ) can cause transi tions to states other than li). The basic question we address is, What is the probability as a function of time for the system to be found in In), with n * i? More generally, we may be interested in how an arbitrary state ket changes as time goes on, where the total Hamiltonian is the sum of H0 and V( t ). Suppose at t 0, the state ket of a physical system is given by =
(5 .5 .3) n
We wish to find c,(t) for t > 0 such that (5.5.4) n
where the ket on the left side stands for the state ket in the Schrodinger picture at t of a physical system whose state ket at t 0 was found to be =
Ia).
The astute reader may have noticed the manner in which we have separated the time dependence of the coefficient of In) in (5.5.4). The factor e- , E, t / h is present even if V is absent. This way of writing the time dependence makes it clear that the time evolution of c ( t ) is due solely to the presence of V(t); c,(t) would be identically equal to c,(O) and hence independent of t if V were zero. As we shall see in a moment, this separation is convenient because c,(t) satisfies a relatively simple differen tial equation. The probability of finding In) is found by evaluating lc,(t)l 2 . ,
*In ( 5.5.2) we no longer use the notation 1 n<0l ),
£�0l.
Approximation Methods
318
The Interaction Picture Before we discuss the differential equation for cn(t ) , we discuss the interaction picture. Suppose we have a physical system such that its state ket coincides with Ia) at t = t0, where t0 is often taken to be zero. At a later time, we denote the state ket in the Schrodinger picture by Ia, t0; t )5, where the subscript S reminds us that we are dealing with the state ket of the Schrodinger picture. We now define Ia, t0 ; t)1 = e ' Ho t/h la, t0; t )5 , (5.5.5) where I ) 1 stands for a state ket that represents the same physical situation in the interaction picture. At t 0, I ) 1 evidently coincides with I )5 . For operators (representing observables) we define observables in the interaction picture as (5.5.6) In particular, v, = e ' Hot/hve - ' Ho t /h (5.5.7 ) where V without a subscript is understood to be the time-dependent potential in the Schrodinger picture. The reader may recall here the connec tion between the Schrodinger picture and the Heisenberg picture: (5 .5.8) la)u = e +tflt/h la, t0 = 0 ; t )s (5 .5 .9) A H = e 'Ht/hA s e -, Ht /h . The basic difference between (5.5.8) and (5.5.9) on the one hand and (5.5.6) and (5.5.7) on the other is that H rather than H0 appears in the exponen tial. We now derive the fundamental differential equation that char acterizes the time evolution of a state ket in the interaction picture. Let us take the time derivative of (5.5.5) with the full H given by (5.5.1): =
iii
:t I a , t
0;
t )1
=
=
We thus see
iii
:t ( e' Hot/ h i a, t0 ; t )5 )
- Hoe 'Hot/h la, t o ; t )s + ei Hotl h ( Ho + V ) I a , t o ; t )s (5.5 .10) (5.5.11)
which is a Schrodinger-like equation with the total H replaced by V1• In other words Ia, t0; t)1 would be a ket fixed in time if V1 were absent. We can also show for an observable A (that does not contain time t explicitly in the Schrodinger picture) that dA 1 1 (5.5 .12) = [ A Ho dt ili , , ] ' which is a Heisenberg-like equation with H replaced by H0.
5.5.
Time-Dependent Potentials: The Interaction Picture
319
TABLE 5.2 Heisenberg picture
S tate ket
No change
Observable
Interaction picture
lution determined y VI
Schrooinger picture
Evo
Evolution determined
b
by H
Evolution determined
Evolution determined No
by H
by H0
change
In many respects, the interaction picture, or Dirac picture, is inter mediate between the Schrooinger picture and the Heisenberg picture; This should be evident from Table 5.2. In the interaction picture we continue using I n ) as our base kets. Thus we expand I ) 1 as follows: (5.5 .13) I a , t 0 ; t ) 1 = I:C, ( t ) l n ) . n
With 10 set equal to 0, we see that the c"( t ) appearing here are the same as the c ( t ) introduced earlier in (5.5.4), as can easily be verified by multiply ing both sides of (5.5.4) by e • Hot / h using (5.5.2). We are finally in a position to write the differential equation for c, ( t ). Multiplying both sides of (5.5.11) by ( n l from the left, we obtain ,
. a !11 at (nla, t0 ; t )1 = [ ( niV1Im)(m la, t 0; t )1 . m
(5.5 .14)
This can also be written using ( n le i H0 t/h V( t ) e -iH0t/h l m ) = V, m ( t ) e •<E" - Eml t/ h
and
[from (5.5.13)] as ili
!!_ c (t) = " � Vn me iw"m1c ( t ) dt n m
where
m
( 5 .5.15 )
•
( 5 .5 .16)
Explicitly,
1\
iii
Cz
i:3
-
vu V2 1e '"'"'
V1 2e '"''21 v22
v33
ci Cz c3
( 5 .5.17 )
This is the basic coupled differential equation that must be solved to obtain the probability of finding I n ) as a function of t .
320
Approximation Methods
Time-Dependent Two-State Problems: Nuclear Magnetic Resonance, Masers, and So Forth Exact soluble problems with time-dependent potentials are rather rare. In most cases we have to resort to perturbation expansion to solve the coupled differential equations as we will discuss in the next section. There is, however, a problem of enormous practical importance, which can be solved exactly-a two-state problem with a sinusoidal oscillating poten tial. The problem is defined by > H0
(5.5.17),
= EJ11 )(11+ £212 )(21 ( £2 E1 ) (5.5.18) 2 )(11, (t) ' 'l 1 )(21+ ye-'"''1 ye' = " V where y and "' are real and positive. In the language of (5.5.14) and (5.5.15), we have vl 2 = VzT = ye ' "' ' (5.5.19) Vu = Vzz = 0.
We thus have a time-dependent potential that connects the two energy eigenstates of H0• In other words, we can have a transition between the two states <=t An exact solution to this problem is available. If initially-at -only the lower level is populated so that [see
11 ) 12).
(5.5.3)] c1(0) = 1, c2 (0) = 0,
t=0 (5.5.20)
then the probability for being found in each of the two states is given by (Rabi's formula, after I. I. Rabi, who is the father of molecular beam techniques)
l cz (tW = where
Y 2/tz 2 sinz { [ hy: (w -4w21)2 ]1/\ } y 2jh2+(w-w21) j4 2
+
(5.5.21a) (5.5.21b)
lc1(t) IZ = 1 - l c 2 (t)I Z ,
(5.5.22) 30
as the reader may verify by working out Problem of this chapter. a little more closely . We see that the Let us now probability for finding the upper state £2 exhibits an oscillatory time de pendence with angular frequency, two times that of
look at l cz l 2
(5.5.23)
321
5.5. Time-Dependent Potentials: The Interaction Picture
The amplitude of oscillation is very large when (5.5 .24) that is, when the angular frequency of the potential-usually due to an externally applied electric or magnetic field-is nearly equal to the angular frequency characteristic of the two-state system. Equation (5.5.24) is there fore known as the resonance condition. It is instructive to look at (5.5.2la) and (5.5.2lb) a little closely exactly at resonance: (5.5 .25) We can plot lc1 ( t ) l 2 and l c 2 ( t ) l 2 as a function of t; see Figure 5.4. From t 0 to t '1Tiij2y, the two-level system absorbs energy from the time dependent potential V( t); l c1( t W decreases from unity as lc2( t ) l 2 grows. At t = '1Tii j2y, only the upper state is populated. From t = '1Tiij2y to t 'TT !i jy, the system gives up its excess energy [of the excited (upper) state] to V( t); l c2 1 2 decreases and lcd 2 increases. This absorption-emission cycle is repeated indefinitely, as is also shown in Figure 5 .4, so V( t ) can be regarded as a source or sink of energy; put in another way, V( t ) can cause a transition from 11) to 12) (absorption) or from 12) to 11) (emission). We will come back to this point of view when we discuss emission and absorption of radiation. The absorption-emission cycle takes place even away from reso nance. However, the amplitude of oscillation for 12) is now reduced; l c2 ( t ) l �ax is no longer 1 and lc 1 ( t ) l 2 does not go down all the way to 0. In Figure 5.5 we plot l c2 ( t ) l �ax as a function of w . This curve has a resonance peak centered around w w2 1 , and the full width at half maxima is given by =
=
=
=
l c2(t)l 2 1
rrh/
0 ��----��---��-�
Absorption fl
FIGURE �
5.4.
Plot of
2Y
3rrh
2Y Emission
jc1 (1)12
and
l c2 (t)12
t
Absorption against time t exactly at resonance
w � w2 1
yj n . The graph also illustrates the back-and-forth behavior between 11 ) and 12).
and
Approximation Methods
322
1
Full width at half maximum = 4yln
FIGURE S.S.
Graph of
resonant frequency.
w .,
lc2 ( t)l�ax
as a function of
w,
w
where
w w21 =
corresponds to the
li . It is worth noting that the weaker the time-dependent potential ( y small), the narrower the resonance peak.
4y1
Spin Magnetic Resonance The two-state problem defined by (5.5.18) has many physical appli cations. As a first example, consider a spin � system-say a bound electron - subjected to a !-independent uniform magnetic field in the z-direction and, in addition, a !-dependent magnetic field rotating in the xy-plane: (5.5 .26) with B0 and B 1 constant. We can treat the effect of the uniform !-indepen dent field as H0 and the effect of the rotating field as V. For e S (5.5 .27) 11 = m ,c we have
( ) ( ;::_� ) eliB0
Ho = - 2 m ,c ( 1 + ) ( + 1 - 1 - )( - 1)
V( t ) = -
[cos w t ( I + ) ( - I + 1 - ) ( + I)
+ sin wt( - i l + ) ( - I + i l - ) ( + I) ) ,
( 5 .5 .28)
where we have used the ket-bra forms of 2Sj li [see (3.2.1)]. With e < O, E + has a higher energy than E and we can identify I + ) -> 12) (upper level) (5.5 .29) (lower level) 1 1 ) 1-) _,
->
323
5.5. Time-Dependent Potentials: The Interaction Picture
to make correspondence with the notation of (5.5.18). The angular frequency characteristic of the two-state system is (5.5 .30) which is just the spin-precession frequency for the B0 * 0, B1 = 0 problem already treated in Section 2.1. Even though the expectation values of (Sx,y) change due to spin precession in the counterclockwise direction (seen from the positive z-side), l c + 1 2 and lc - 1 2 remain unchanged in the absence of the rotating field. We now add a new feature as a result of the rotating field: 1 c 1 2 and 1 c - 1 2 do change as a function of time. This can be seen by identifying - enB1 (5.5 .31} w -> w +
2m
_,
c
e
y,
to make correspondence to the notation of (5.5.18); our time-dependent interaction (5.5.28) is precisely of form (5.5.18). The fact that l c (tW and l c ( t ) 1 2 vary in the manner indicated by Figure 5.4 for w w2 1 and the correspondence (5.5.29), for example, implies that the spin ! system under goes a succession of spin-flops, 1 + ) � 1 - ) in addition to spin precession. Semiclassically, spin-flops of this kind can be interpreted as being due to the driving torque exerted by rotating B. The resonance condition is satisfied whenever the frequency of the rotating magnetic field coincides with the frequency of spin precession determined by the strength of the uniform magnetic field. We see that the probability of spin-flops is particularly large. In practice, a rotating magnetic field may be difficult to produce experimentally. Fortunately, a horizontally oscillating magnetic field-for instance, in the x-direction- is just as good. To see this, we first note that such an oscillating field can be decomposed into a counterclockwise compo nent and a clockwise component as follows: +
=
_
,
2 B 1x cos w t
=
B1 (X. cos wt + y sin wt ) + B 1 (X. cos wt - y sin wt ) . (5.5 .32)
We can obtain the effect of the counterclockwise component simply by reversing the sign of w. Suppose the resonance condition is met for the counterclockwise component (5.5.3 3} Under a typical experimental condition, B1 _ « 1, B0
(5.5.34)
324
Approximation Methods
which implies from (5.5.30) and (5.5.31) that ]_ tz
<< w 21 .,
( 5 .5 .35 )
as a result, whenever the resonance condition is met for the counterclock wise component, the effect of the clockwise component becomes completely negligible, since it amounts to w --> - w, and the amplitude becomes small in magnitude as well as very rapidly oscillating. The resonance problem we have solved is of fundamental importance in interpreting atomic molecular beam and nuclear magnetic resonance experiments. By varying the frequency of oscillating field, it is possible to make a very precise measurement of magnetic moment. We have based our discussion on the solution to differential equations (5.5.17); this problem can also be solved, perhaps more elegantly, by introducing the rotating axis representation of Rabi, Schwinger, and Van Vleck.
Maser As another application of the time-dependent two-state problem, let us consider a maser. Specifically, we consider an ammonia molecule NH 3 , which-as we may recall from Section 4.2-has two parity eigenstates IS) and l A ) lying close together such that lA) is slightly higher. Let ll el be the electric dipole operator of the molecule. From symmetry considerations we expect that 11 ,1 is proportional to x, the position operator for the N atom. The basic interaction is like - 11 .1 · E, where for a maser E is a time-depen dent electric field in a microwave cavity: E = IE imax z cos wt . ( 5 .5 .36 ) It is legitimate to ignore the spatial variation of E because the wavelength in the microwave region is far larger than molecular dimension. The frequency w is tuned to the energy difference between l A ) and I S ) : E Es ) w ::: ( A tz
( 5.5 .37)
The diagonal matrix elements of the dipole operator vanish by parity: ( A IIl.,I A ) = (SIIl.,I S ) 0, ( 5 .5 .38 ) but the off-diagonal elements are, in general, nonvanishing: ( 5 .5 .39) ( S ix i A ) = (A ix iS ) * 0 . This means that there is a time-dependent potential that connects IS ) and I A ) , and the general two-state problem we discussed earlier is now applica ble. We are now in a position to discuss how masers work. Given a molecular beam of NH 3 containing both I S ) and lA), we first eliminate the I S ) component by letting the beam go through a region of time-independent =
5.6.
Time-Dependent Perturbation Theory
325
inhomogeneous electric field . Such an electric field separates IS) from lA) in much the same way as the inhomogeneous magnetic field in the Stern Gerlach experiment separates I + ) from 1 - ) . A pure beam of lA) then enters a microwave cavity tuned to the energy difference EA - E5. The dimension of the cavity is such that the time spent by the molecule is just (-rr/2)h/-y. As a result we stay in the first emission phase of Figure 5 .4; we have l A) in and I S) out. The excess energy of lA) is given up to the time dependent potential as lA) turns into I S) and the radiation (microwave) field gains energy. In this way we obtain Microwave Amplification by Stimulated Emission of Radiation , or MASER.
There are many other applications of the general time-dependent two-state prob lem , such as the atomic clock and optical pumping. In fact. it is amusing to see that as many as four Nobel Prizes in physics have been awarded to those who exploited time-dependent two-state systems of some form. *
5.6. TIME-DEPENDENT PERTURBATION THEORY Dyson Series With the exception of a few problems like the two-level time-depen dent problem of the previous section, exact solutions to the differential equation for c.( t ) are usually not available. We must be content with approximate solutions to (5.5.17) obtained by perturbation expansion : c ( t ) = c <01 + c (l l + c < 2l + n
n
n
·
·
n
·
'
( 5 .6.1 )
where c�ll, c�2l, . . . signify amplitudes of first order, second order, and so on in the strength parameter of the time-dependent potential. The iteration method used to solve this problem is similar to what we did in time-inde pendent perturbation theory. If initially only the state i is populated, we approximate c. on the right-hand side of differential equation (5.5.17) by c�0l = 8., ( independent of t ) and relate it to the time derivative of c�,1 ), integrate the differential equation to obtain c�1l, plug c�l) into the right-hand side [of (5.5.17)] again to obtain the differential equation for c�2l, and so on. This is how Dirac developed time-dependent perturbation theory in 1927. Instead of working with c.( t ), we propose to look at the time evolution operator U,(t, t 0 ) in the interaction picture, which we will define later. We obtain a perturbation expansion for U,( t, t 0 ), and at the very end * Nobel Prize winners who took advantage of resonance in ( 1 944) on molecular beams and nuclear magnetic resonance;
the two-level systems are Rabi Bloch and Purcell ( 1 952) on B field in atomic nuclei and nuclear magnetic moments: Townes. Basov. and Prochorov ( 1 964) on masers. lasers, and quantum optics; and Kastler ( 1966) on optical pumping.
Approximation Methods
326
we relate the matrix elements of U1 to c n< t ). If we are interested only in solving simple problems in nonrelativistic quantum mechanics, all this might look superfluous; however, the operator formalism we develop is very powerful because it can immediately be applied to more-advanced prob lems, such as relativistic quantum field theory and many-body theory. The time-evolution operator in the interaction picture is defined by (5 .6 .2) Differential equation (5.5.11) for the state ket of the interaction picture is equivalent to (5 .6 .3) We must solve this operator differential equation subject to the initial condition (5 .6.4) U1(t , t0) 11�10 = 1 . First, let us note that the differential equation together with the initial condition is equivalent to the following integral equation:
U1(t, t0) = 1 - � j'v1(t')UAt', t0) dt'.
(5 .6.5)
'o
We can obtain an approximate solution to this equation by iteration:
[
U1( t , t0) = 1 - � j'v1(t') 1 - � j"v1(t")U1(t", t0) dt"1 dt' J i 1 � f' dt' vl ( t ') + ( � f' dt' r dt" vl ( t ') vl ( t ") . n + . . . + ( h ) f dt' f dt" . . . to
=
-
to
- l
X
to
t
to
/1
to
r
f'
+ . . . .
to
to
· · ·
V1(t< n ))
(5 .6.6)
This series is known as the Dyson series after Freeman J. Dyson, who applied this method to covariant quantum electrodynamics (QED). * Setting aside the difficult question of convergence, we can compute U1(t, t0) to any finite order of perturbation theory. *Note that in QED, the time-ordered product ( t ' > t " > · · · ) is introduced, and then this perturbation series can be summed into an exponential form. This exponential form im mediately gives U( t, t0) = U( t, IJ l U( 11 , t0 ) (Bjorken and Drell l965, 175-78).
327
5.6. Time-Dependent Perturbation Theory
Transition Probability Once U1(t, t0) is given, we can predict the time development of any state ket. For example, if the initial state at t = 0 is one of the energy eigenstates of H0, then to obtain the initial state ket at a later time, all we need to do is multiply by U1(t, 0): li, t0 = 0 ; t)1 = u, ( t , O ) I i ) =
L i n) (niU, ( t , O) Ii ) . n
(5 .6.7)
In fact, (niU, (t,O) I i ) is nothing more than what we called c"( t ) earlier [see (5.5.1 3)]. We will say more about this later. We earlier introduced the time-evolution operator U(t, t0) in the Schrodinger picture (see Section 2.2). Let us now explore the connection between U(t, t0) and U1(t, t0). We note from (2.2.13) and (5.5.5) that Ia ' t0 ·' t ) l = e ' Hot /h la ' t0 '· t )s =
e illot /h U( t ' t 0 ) Ia ' t 0 ' t 0 )s ·
(5 .6.8)
So we have (5 .6.9) Let us now look at the matrix element of U1(t, t0) between energy eigen states of H0: (5 .6.10) We recall from Section 2.2 that (niU(t, t0)1i) is defined to be the transition amplitude. Hence our (niU1(t, t0)1i) here is not quite the same as the transition amplitude defined earlier. However, the transition probability defined as the square of the modulus of (niU(t, t0)1i) is the same as the analogous quantity in the interaction picture (5.6.11)
Parenthetically, we may remark that if the matrix elements of V, are taken between initial and final states that are not energy eigenstates-for example, between Ia ' ) and lb ' ) (eigenkets of A and B, respectively), where [ H0, A ] * 0 andjor [ H0, B ] * O-we have, in general, l (b'IU1 ( t , t0 ) l a ') l * l(b'IU( t , t0 ) l a ') l ,
as the reader may easily verify. Fortunately, in problems where the interac tion picture is found to be useful, the initial and final states are usually taken to be H0 eigenstates. Otherwise, all that is needed is to expand Ia '), l b'), and so on in terms of the energy eigenkets of H0•
328
Approximation Methods
Coming back to (niU1(t, t0)1i), we illustrate by considering a physi cal situation where at t = t0, the system is known to be in state li). The state ket in the Schrodinger picture li, t0; t)s is then equal to li) up to a phase factor. In applying the interaction picture, it is convenient to choose the phase factor at t t 0 so that
=
t0 ; t0)s
=
e - , E, to/h l i ) ,
(5.6 . 12) which means that in the interaction picture we have the simple equation ( 5 .6 .13) At a later time we have (5 .6.14) l i , t0 ; t)1 = U1 ( t, t0 ) 1 i ) . Comparing this with the expansion li,
(t) n), l i , to; t) l = l::C n n l
we see
(5 .6.15)
(5 .6 .16) We now go back to the perturbation expansion for U1(t, t0) [see (5.6.6)]. We can also expand c " (t) as in (5.6.1), where c�ll is first order in V1(t), c�2l is second order in V1(t), and so on. Comparing the expansion of both sides of (5.6.16), we obtain [using (5.5.7)] c�0l( t ) = 8"' (independent of t ) c�1l ( t )
=
� i j'
=h - l !'e 'w,,r'v"' ( t ') dt'
(5 .6.17)
to
c�2l( t )
=
( � i rz: j'dt ' j" dt " e'w,mr'vn m ( t ') e 'w,,r"vm, ( t " ) , m to
where we have used
to
e t ( En - E, )tjli
=
e t w,,t.
l i) -> I n) with n * i is obtained by P (i ...... n) = lc�1l(t)+ c�2l ( t ) + · · · 1 2.
(5 .6.18)
The transition probability for
(5 .6.19)
Constant Perturbation As an application of (5.6.17), let us consider a constant perturbation turned on at t = 0: for t < 0 V(t) = (5 .6.20) (independent of t ) , for t ;:o: 0.
{�
5.6. Time-Dependent Perturbation Theory
329
Even though the operator V has no explicit dependence on time, it is, in general, made up of operators like x, p, and s. Now suppose at t = 0, we have only l i ) _ With t0 taken to be zero, we obtain c
-I v n
fi
1
1 1e'w"' . 1 , dt ' 0
(5 .6.21) or
(5.6.22) The probability of finding In ) depends not only on I Vn , 1 2 but also on the energy difference En - E;, so let us try to see how (5.6.22) looks as a function of En . In practice, we are interested in this way of looking at (5.6.22) when there are many states with E - En so that we can talk about a continuum of final states with nearly the same energy. To this end we define En - E, w = --"'----'-
(5 .6.23) fi and plot 4 sin2 ( wt/2)jw2 as a function of w for fixed t, the time interval during which the perturbation has been on; see Figure 5.6. We see that the height of the middle peak, centered around w = 0, is and the width is proportional to ljt. As t becomes large, lc�1l ( t ) l 2 is appreciable only for those final states that satisfy 2 '7Tfi 2'7T . t--= (5.6.24)
t2
lwl
l En - E, l
If we call !J.t the time interval during which the perturbation has been turned on, a transition with appreciable probability is possible only if (5 .6.25) where by !J.E we mean the energy change involved in a transition with appreciable probability. If !J.t is small, we have a broader peak in Figure 5 .6, and as a result we can tolerate a fair amount of energy non conservation. On the other hand, if the perturbation has been on for a very long time, we have a very narrow peak, and approximate energy conservation is required for a transition with appreciable probability. Note that this " uncertainty relation" is fundamentally different from the x - p uncertainty relation of !J. t !J. E - fi ,
Approximation Methods
330
\
-•rr I
,
-2rr
I
,
w=O
\
\
'
2rr
I
'
,
Plot of 4sin2 ( w tj2)jw2 versus w for a fixed have regarded £,1 as a continuous variable. FIGURE 5.6.
'
1,
....
.....
4rr I,
w
where in w = ( En - E, )/fz we
Section 1.6. There x and p are both observables. In contrast, time in nonrelativistic quantum mechanics is a parameter, not an observable. For those transitions with exact energy conservation En = E, we have (5.6 .26) The probability of finding In) after a time interval t is quadratic, not linear, in the time interval during which V has been on. This may appear intuitively unreasonable. There is no cause for alarm, however. In a realistic situation where our formalism is applicable, there is usually a group of final states, all with nearly the same energy as the energy of the initial state ji). In other words, a final state forms a continuous energy spectrum in the neighborhood of E, . We give two examples along this line. Consider for instance, elastic scattering by some finite range potential (see Figure 5.7), which we will consider in detail in Chapter 7. The initial state is taken to be a plane wave state with its propagation direction oriented in the positive z-direction; the final state may also be a plane wave state of the same energy but with its propagation direction, in general, in a direction other than the positive z-direction. Another example of interest is the de-excitation of an excited atomic state via the emission of an Auger electron. The simplest example is a helium atom. The initial state may be (2s ) 2 , where both the electrons are excited; the final state may be (Is ) (that is, one of the electrons still bound) of the He + ion, while the second electron escapes with a positive energy E; see Figure 5.8. In such a case we are interested in the
5.6. Time-Dependent Perturbation Theory
331
}
Nearly the same energy
+z-direction
FIGURE 5.7.
Elastic scattering of plane wave
by some finite range potential.
One of the electrons is in 1 s state and the other is in free state
(2s)2 Is
(ls) (2s) -----(ls)2
FIGURE 5.8.
Schematic diagram of two electron energy levels of helium
atom.
total probability-that is, the transition probabilities summed over final states with En = E,: (5 .6 .27) L lc�l)l2 • n , £11
=
E1
It is customary to define the density of final states as the number of states within energy interval ( E , E + dE ) as (5 .6.28) p ( E ) dE . We can then write (5.6.27) as L
n , £11 :::: £,
l c�1) 1 2
=
=
J
4
dEn p ( EJ i c�l) l 2
f·
Sill
]
1 Vm l 2 z ( En - E; ) t _ 2 p ( En ) dE., . (5 .6.29) 2 11 l En E, l
[
[
]
As t -+ oo , we can take advantage of ( En - E; ) t 1 . 'ITt . 2 s hm m = 2 fl" 8 ( En - E, ) , 2 11 t - oo i En - Ei l 2
(5 .6.30)
Approximation Methods
332
which follows from
. I liD
-1 sinax2 2x
(5 .6.31) ( ). It is now possible to take the average of !Vn,l 2 outside the integral sign and a - oo
'IT
a
=
., u
X
perform the integration with the 8-function:
ri�� j dE"p( E" ) I c�il( t W = ( 2n7T )w" ' l 2p ( E" ) t i E, � E, · (5 .6 .32) Thus the total transition probability is proportional to t for large values of t, which is quite reasonable. Notice that this linearity in t is a consequence of the fact that the total transition probability is proportional to the area under the peak of Figure 5.6, where the height varies as t2 and the width varies as 1 /t.
It is conventional to consider the transition rate--that is, the transi tion probability per unit time. Expression (5.6.32) tells us that the total transition rate, defined by (5 .6.33) is constant in t for large t . Calling (5.6.33) w, [n J • where [ n ) stands for a group of final states with energy similar to i, we obtain 27T -(5 .6.34) Vn ,l2p ( En k � E, wi - [ n ] = hi _
independent of t, provided the first-order time-dependent perturbation theory is valid. This formula is of great practical importance; it is called Fermi's golden rule even though the basic formalism of !-dependent per turbation theory is due to Dirac.* We sometimes write (5.6.34) as (5 .6.35)
where it must be understood that this expression 1s integrated with / dEn p (En)• We should also understand what is meant by !Vm1 2. If the final states In) form a quasi-continuum, the matrix elements v" ' are often similar if In) are similar. However, it may happen that all energy eigenstates with the same En do not necessarily have similar matrix elements. Consider, for example, elastic scattering. The IV"'I 2 that determines the scattering cross section may depend on the final momentum direction. In such a case the * Editorial Note. Dr. Edward Teller, in teaching a quantum mechanics class at Berkeley in 1955 (in which the editor was in attendance), mentioned that this was actually Fermi's golden rule 2. Since Teller did not subscribe to golden rule 1 , the class was not informed what rule 1 was. However, golden rule 1 is given in E. Fermi 1950, 136 and 148.
5.6. Time-Dependent Perturbation Theory
333
group of final states we should consider must have not only approximately the same energy, but they must also have approximately the same momen tum direction. This point becomes clearer when we discuss the photoelectric effect. Let us now look at the second-order term, still with the constant perturbation of (5.6.20). From (5.6.17) we have 2 ' • w t ' l ' • w t" 1 cn(2 ) = -n i " dt ' e , m i dt " e ,, i..J vn m vml
( )
m
vnmvm, i = _ "' L.. Ji m Em - E,
1 '( 0
0
i ,1( e W
'
0
_
l' l e W nm
) dt ',
(5 .6.36)
The first term on the right-hand side has the same t-dependence as c�l ) [see (5.6.21)]. If this were the only term, we could then repeat the same argument as before and conclude that as t --> oo , the only important contribution arises from En "" E;. Indeed, when Em differs from En and E;, the second contribution gives rise to a rapid oscillation, which does not give a contribu tion to the transition probability that grows with t. With c< 1 l and c<2l together, we have (5.6.37) The formula has the following physical interpretation. We visualize that the transition due to the second-order term takes place in two steps. First, l i ) makes an energy nonconserving transition to lm ) ; subsequently, l m ) makes an energy nonconserving transition to In), where between In) and l i ) there is overall energy conservation. Such energy nonconserving transitions are often called virtual transitions. Energy need not be conserved for those virtual transitions into (or from) virtual intermediate states. In contrast, the first-order term V"' is often said to represent a direct energy-conserving " real" transition. A special treatment is needed if Vn m Vm , * 0 with Em "" E,. The best way to treat this is to use the slow-turn-on method V --> e�1V, which we will discuss in Section 5 . 8 and Problem 31 of this chapter. The net result is to change the energy denominator in (5.6.37) as follows: (5.6 .38)
Harmonic Perturbation We now consider a sinusoidally varying time-dependent potential, commonly referred to as harmonic perturbation: where
'"Y
may still depend on
x,
p,
s,
(5.6.39) and so on. Actually, we have already
Approximation Methods
334
encountered a time-dependent potential of this kind in Section 5.5 in discussing t-dependent two-level problems. Again assume that only one of the eigenstates of H0 is populated initially. Perturbation (5.6.39) is assumed to be turned on at t = 0, so
1
= -
li
[
1-
e l(w + w,,)I i/
w + wn 1
"' +
1-
e i(w, - w)I
- w + wn t
j/t "'
]
(5.6 .40)
where � � actually stands for (i/t )"'. We see that this formula is similar to the constant perturbation case. The only change needed is w" '
En - E, --> wn 1 ± w . li
(5.6.41)
So as t --> oo , lc�1ll2 is appreciable only if =
w" ' + w ::: 0 or En E, - li w wn , - w ::: 0 or En == E, + li w . :::
(5.6.42a) (5.6 .42b)
Clearly, whenever the first term is important because of (5.6.42a), the second term is unimportant, and vice versa. We see that we have no energy-con servation condition satisfied by the quantum-mechanical system alone; rather the apparent lack of energy conservation is compensated by the energy given out to-or energy taken away from-the "external" potential V( t ) . Pictorially, we have Figure 5.9. In the first case (stimulated emission ), the quantum-mechanical system gives up energy liw to V; this is clearly possible only if the initial state is excited. In the second case ( absorption ), the quantum-mechanical system receives energy li w from V and ends up as an excited state. Thus a time-dependent perturbation can be regarded as an inexhaustible source or sink of energy. In complete analogy with (5.6.34), we have 2w
I
2 w, � [n] = tz 1 � , 1 p ( E" ) E
n
2£. - •w '
(5.6.43)
or, more commonly, (5.6.44) Note also that (5.6 .45)
5.7.
335
Applications to Interactions with the Classical Radiation Field
(i)
(li)
�hw
E,
En
t hw
�--------
-----
(i) Stimulated emission: Quantum-mechanical system give s up hw to V (possi ble only if initial state is excited). (ii) Absorption: Quantum mechanical system receives hw from V and ends up as an excited state. FIGURE 5.9.
which is a consequence of (5.6.46) (remember -,v t l n )
DC
<->
( n i Y). Combining (5.6.43) and (5.6.45), we have
absorption rate for n -> [ i 1 density of final states for [ i ] ' (5.6.47) where in the absorption case we let i stand for final states. Equation (5.6.47), which expresses symmetry between emission and absorption, is known as detailed balancing. To summarize, for constant perturbation, we obtain appreciable transition probability for l i ) -> I n ) only if En = E,. In contrast, for harmonic perturbation we have appreciable transition probability only if En = E, - h w (stimulated emission) or En = E, + hw (absorption). emission rate for i -> [ n ] density of final states for [ n 1
-
5.7. APPLICATIONS TO INTERACTIONS WITH THE CLASSICAL RADIATION FIELD Absorption and Stimulated Emission We apply the formalism of time-dependent perturbation theory to the interactions of atomic electron with the classical radiation field. By a classical radiation field we mean the electric or magnetic field derivable from a classical (as opposed to quantized) radiation field. The basic Hamiltonian, with IAI 2 omitted, is
which is justified if
e p2 + e<j>(x) - -A H= · p'
(5.7.1)
v·A = o ;
(5.7.2)
2me
m ec
Approximation Methods
336
specifically, we work with a monochromatic field of the plane wave for
A = 2 A 0 £ cos( ; n · x - wt )
(5 .7.3)
where £ and n are the (linear) polarization and propagation direction. Equation (5.7.3) obviously satisfies (5.7.2) because £ is perpendicular to the propagation direction n. We write (5 .7.4) and treat - ( e jm ,c)A • p as time-dependent potential, where we express in (5.7.3) as
A = Ao£[
e i<wlcJn · x - iw t + e - i( w/cJn · x + iwt
].
(5.7.5)
Comparing this result with (5.6.39), we see that the e - iw'-term in _
( m ec ) A = ( m ec ) A £ · p ( e __
·p
_
e __
o
A
e ' < w !cJn · x - i w t + e - ,< w !c)n · x + tw t
j
(5 .7 .6) is responsible for absorption, while the e + iw'-term is responsible for stimu lated emission. Let us now treat the absorption case in detail. We have (5 .7.7) and
The meaning of the o-function is clear. If I n ) forms a continuum, we simply integrate with p ( En ). But even if I n ) is discrete, because I n ) cannot be a ground state (albeit a bound-state energy level), its energy is not infinitely sharp; there may be a natural broadening due to a finite lifetime (see Section 5.8); there can also be a mechanism for broadening due to colli sions. In such cases, we regard o( w - w"' ) as 8 ( w - wn ; ) = lim r�o
( 2y7T ) [ ( w - wn \ + Y 2/4 i)
]"
(5 .7.9)
Finally, the incident electromagnetic wave itself is not perfectly monochro matic; in fact, there is always a finite frequency width. We derive an absorption cross section as (Energy/unit time) absorbed by the atom ( i --> n ) (5.7.10) Energy flux of the radiation field For the energy flux (energy per area per unit time), classical electromagnetic
5. 7. Applications to Interactions with the Classical Radiation Field
theory gives us where we have used
(
)
337
(5.7.11)
E�ax + -B�ax 'ft = -1 -(5 .7.12) 8w 2 8w for energy density (energy per unit volume) with 1 a E= -- A B = v x A. (5 .7.13) c ar ' Putting everything together, we get (remembering that nw = energy ab sorbed by atom for each absorption process) h w (2wjn)(e2/m;c 2 ) IA01 2I(nl e '< "'lcJ(h·x>£ · p ji)i 28 ( £. - E1 - nw) = (1/2'77 ) ( w2/c) I A01 2 -
0abs
( )
. J
(5 .7.14)
Equation (5.7.14) has the correct dimension [1/( M 2/ T )] ( M2L2jT2)T = L2 if we recognize that a = e2jnc == 1/137 (dimensionless) and 8 ( £. - £1 nw) = (1/ n )8( w.1 - w ), where 8( w., - w) has time dimension T.
Electric Dipole Approximation
The electric dipole approximation (E1 approximation) is based on the fact that the wavelength of the radiation field is far longer than the atomic dimension, so that the series (remember wjc = 1/A)
e i(w/c)n·x = 1 + i "'c n · x + . . .
(5 .7.15)
can be approximated by its leading term, 1. The validity of this for a light atom is as follows: First, the nw of the radiation field must be of order of atomic level spacing, so This leads to In other words,
(5.7.16)
.!:.... w
1_ 37_R-=a,to= m = A _ cnR atom ::: _ . z ze2
(5 .7.17)
1 z ( 5 .7 . 1 8 ) R atom - 1 37 « 1 A for light atoms (small Z). Because the matrix element of x is of order R atom•
Approximation Methods
338
that of x 2 , of order R �tom' and so on, we see that the approximation of replacing (5.7.15) by its leading term is an excellent one. Now we have (5 .7.19) ( n l e '(w/c)(n x) £ · p l i ) -> £ · ( n lp li) . In particular, we take £ along the x-axis (and it along the z-axis). We must calculate ( n i Px l i ) . Using ilipx [ x , H0 1 = -, m
we have
(5.7.20)
m ( n i Pxl i ) = iii (n l [ x , H0 ] 1i ) = imw"' (nl x l i ) .
(5 .7 .21 ) Because of the approximation of the dipole operator, this approximation scheme is called the electric dipole approximation. We may here recall [see (3 . 1 0.39)] the selection rule for the dipole matrix element. Since x is a spherical tensor of rank 1 with q = ± 1 , we must have m ' - m = ± 1 , I/ - j l = 0,1 (no 0 � 0 transition). If E is along the y-axis, the same selection rule applies. On the other hand, if E is in the z-dircction, q = 0; hence, m'
=
m.
With the electric dipole approximation, the absorption cross section (5.7.14) now takes a simpler form upon using (5.7.19) and (5.7.21) as 2 2 (Jabs = 4 '1T o: wn , l(nl x l i ) l 8 ( W - wn , ) .
(5.7.22) In other words, aabs treated as a function of w exhibits a sharp 8-function like peak whenever li w corresponds to the energy-level spacing at w ( En - E, )/ li . Suppose li) is the ground state, then wn , is necessarily positive; integrating (5. 7.22), we get ==
J
2 2 (aJ bs ( W ) d w = [ 4 '1T 0:Wn11( n lxli) 1 • n
(5.7.23)
In atomic physics we define oscillator strength, fn;, as
_ 2mwn,
fn i =
li
. l(n lx l ! ) l 2 ·
(5 .7.24)
It is then straightforward (consider [ x , [x, H0 ]]) to establish the Thomas Reiche-Kuhn sum rule, L fn, = 1 . (5.7.25) n
In terms of the integration over the absorption cross section, we have {5 .7 .26)
5. 7. Applications to Interactions with the Classical Radiation Field
339
Notice how n has disappeared. Indeed, this is just the oscillation sum rule already known in classical electrodynamics (Jackson 1 975, for instance). Historically, this was one of the first examples of how " new quantum mechanics" led to the correct classical result. This sum rule is quite remarkable because we did not specify in detail the form of the Hamilto man.
Photoelectric Effect
We now consider the photoelectric effect- that is, the ejection of an electron when an atom is placed in the radiation field. The basic process is considered to be the transition from an atomic (bound) state to a continuum state E > 0. Therefore, l i ) is the ket for an atomic state, while In) is the ket for a continuum state, which can be taken to be a plane-wave state lk1 ) , an approximation that is valid if the final electron is not too slow. Our earlier formula for aa hs < w ) can still be used, except that we must now integrate <'l( wni - w ) together with the density of final states p ( En ). Our basic task is to calculate the number of final states per unit energy interval. As we will see in a moment, this is an example where the matrix element depends not only on the final state energy but also on the momentum direction. We must therefore consider a group of final states with both similar momentum directions and similar energies. To count the number of states it is convenient to use the box normalization convention for plane-wave states. We consider a plane-wave state normalized if when we integrate the square modulus of its wave function for a cubic box of side L , we obtain unity. Furthermore, the state is assumed to satisfy the periodic boundary condition with periodicity of the side of the box. The wave function must then be of form (5 .7.27) where the allowed values of k must satisfy x
(5.7.28) with n a positive or negative integer. Similar restrictions hold for k v and kz. Notice that as L � oo , kx, kY, and kz become continuous variabies. The problem of counting the number of states is reduced to that of counting the number of dots in three-dimensional lattice space. We define n such that (5 .7.29) x
Approximation Methods
340
As L oo , it is a good approximation to treat n as a continuous variable; in fact it is just the magnitude of the radial vector in the lattice space. Let us consider a small-volume element such that the radial vector falls within n and n + dn and the solid angle element dfl.; clearly, it is of volume n 2 dndfl.. The energy of the final-state plane wave is related to k1 and hence to n ; we have �
2 7T ) 2 2k} -h 2 2( n h -E = 2me = 2me L2
( 5 .7 .30)
Furthermore, the direction of the radial vector in the lattice space is just the momentum direction of the final state, so the number of states in the interval between E and E + dE with direction into dfl. being k1 is (remember dE = (h 2 k1jmJ dk1) given by * 1
dn dE = ( 2L'7T ) 3 ( k12 ) dk dfl. dE n 2 dfl. dE dE
(5.7 .31) We can now put everything together to obtain an express10n for the differential cross section for the photoelectric effect:
doo - 4'7T 2ah l(k le' ( w/c)(nx) £ ·pli)l 2 m ek1 L3 . d mew / h 2( 2'1T ) 3 ••
2
( 5 .7 .32)
To be specific, let us consider the ejection of a K shell (the innermost shell) electron caused by absorption of light. The initial-state wave function is essentially the same as the ground-state hydrogen atom wave function except that the Bohr radius a0 is replaced by a0jZ. Thus
( 5 .7 .33) Integrating by parts, we can pass
'i7
to the left side. Furthermore,
£ · [ 'ii'e i( wjc)(n·x) ] = 0 because £ is perpendicular to n. On the other hand, brings down - ik1, which can be taken outside the
(5 .7.34) 'i7 acting on e-' kr · x integral. Thus to evaluate (5.7.33), all we need to do is take the Fourier transform of the atomic wave function with respect to
Q "" k1 - ( � )n. *This is equivalent to taking one state per cube d3xd3p j(21Tii )3 in phase space.
(5 .7.35)
5.8.
341
Energy Shift and Decay Width
z
y
FIGURE 5.10.
X
Polar coordinate system with
k1 � ( k1 sin ll co s <j> , k1 sin ll sin .p, k1 cos ll ) .
•
and il along
Ox
and
Oz,
respectively, and
The final answer is (see Problem 39 of this chapter for the Fourier transform of the hydrogen atom wave function) da
dO
= 32e zk
2z 1 {£-k/) s a5 [( z 2;a6 ) + q 2 ] 4 . f me w
(5.7 .36)
If we introduce the coordinate system shown in Figure 5.10, we can write the differential cross section in terms of (} and .p using
(5.7.37)
5.8. ENERGY SHIFT AND DECAY WIDTH Our considerations so far have been restricted to the question of how states other than the initial state get populated. In other words, we have been concerned with the time development of the coefficient c"(t) with n =I= i. The question naturally arises, What happens to c;(t) itself? To avoid the effect of a sudden change in the Hamiltonian, we propose to increase the perturbation very slowly. In the remote past ( t -> oo) the time-dependent potential is assumed to be zero. We then -
Approximation Methods
342
gradually turn on the perturbation to its full value; specifically,
V( t ) = e��v (5.8.1) where V is assumed to be constant and 11 is small and positive. At the end of the calculation, we let 11 --> 0 (see Figure 5.11), and the potential then becomes constant at all times. In the remote past, we take this time to be - oo , so the state ket in the interaction picture is assumed to be [i). Our basic aim is to evaluate c, ( t ) . However, before we do that, let us make sure that the old formula of the golden rule (see Section 5.6) can be reproduced using this slow-turn-on method. For cn ( t ) with n =!= i, we have [using (5.6.17)] c�0)( t ) = 0
(5.8.2)
To lowest nonvanishing order, the transition probability is therefore given by (5.8.3)
or (5.8.4)
We now let 11 --> 0. Clearly, it is all right to replace e �1 by unity, but note 11
2 = 7T8 ( wn , ) = 7T il8 ( En - EJ . lim 2 � o 11 + w" '
(5 .8.5)
�
V(t)
=--===_:::,::.;.;::;.=..=-:..:--=-t--=-=-==�- - -FIGURE 5.1 1 .
Plot o f
V(t) versus
- - - - - - V(t) = V as ,- 0
t r
in the adiabatic (slow-turn-on) picture.
5.8. Energy Shift and Decay Width
343
This leads to the golden rule,
( 21,'rr ) 1 V I 28 ( E" - E. ) .
wl � n ""
(5.8.6)
..
Encouraged by this result, let us calculate C:0l, cfll, and cf2l, again using (5.6.17). We have c
-i dI 1 l = VI I lim t. fl
t0
-
- oo
-i j'e"'' dt' = V e "' t. ., fl .,
r0
II
(5 .8.7) Thus up to second order we have . ' _I 2 _ e 2" _ _ _ _I_ c; ( t ) - 1 li ., V,; e " ' + li I V,; I 2 +
( )
.
_
'I
2 112
( ) mL* 1 211 ( IV1 m1 1E2em2"'+ I·;, 11 ) · li
_
I
.
__
.
E
_
(5.8 .8)
Now consider the time derivative of c; [dc;( t )jdt = c1 ], which we have from (5.8.8). Upon dividing by c; and letting 11 --> 0 (thus replacing e"' and e 2"' by unity), we get C; ""
( )
( ) mL* i ( E1 -WEmi m + ih 11 )
-=i. 2 1 V�i -=l. -=l. v + h + 1 h h 1 11
, -=i. v + h II
V11 1 - -i h .,
( -=i.h ) mL:* i Ei -Wmi Em + in ., ·
(5.8.9)
Expansion (5.8.9) is formally correct up to second order in V. Note here that c;( t )/c;(t) is now independent of t. Equation (5.8.9) is a differential equation that is to hold at all times. Having obtained this, it is convenient to renormalize C; so that c;(O) = 1 . We now try the ansatz
C; ( t )
=
e - ili, t/ h ,
�:�;�
=
�j � �
(5.8.10)
with � ; constant (in time) but not necessarily real. Clearly (5.8.10) is consistent with (5.8.9) because the right-hand side of (5.8.10) is constant. We can see the physical meaning of �; by noting that e - ili, r/h l i ) in the interaction picture implies e - ili,r/h - iE,r/h li) in the Schrodinger picture. In
Approximation Methods
344
other words,
E1 ---> E; + ti1
(5.8.11) as a result of perturbation. That is, we have calculated the level shift using time-dependent perturbation theory. Now expand, as usual,
( 5 .8 .1 2 ) and compare (5.8.10) with (5.8.9); we get to first order
A (l) v (5 .8.13) ut u· But this is just what we expect from t-independent perturbation theory. Before we look at ti �2), recall 1 lim . = Pr . ..!_ - i'1T8 (x ) . (5.8 .14) _
x
. � o x + te
Thus
I V ;I 2 _: E Em oF i ; m Im( ti\2) ) = - 'IT L I Vm; I 2 8 ( E; - Em ) . m '#- i Re( ti�2) ) = Pr. L
(5 .8.1 5a) (5.8.15b)
But the right-hand side of (5.8.15b) is familiar from the golden rule, so we can identify L W; - m = 2 '/T L 1 Vm 11 28 ( E; - Em ) = - Im [ ti�2l ] . (5 .8.16) 11 m -4= i m =P i Coming back to C1 ( t ), we can write (5.8.10) as If we define
C
1
( t ) = e - (i/ h)[Re(d, )1] + (1 / h)[lm(d, )1] . rl
11
then
�
=
-
� Im( ti )
11
1
'
(5 .8.17) (5 .8.18) (5 .8.19)
Therefore, f1 characterizes the rate at which state li) disappears. It is worth checking the probability conservation up to second order in V for small t:
( 5 .8 . 20 ) where (5.8.16) has been used. Thus the probabilities for finding the initial state and all other states add up to 1 . Put in another way, the depletion of state l i ) is compensated by the growth of states other than li).
Problems
345
To summarize, the real part of the energy shift is what we usually associate with the level shift. The imaginary part of the energy shift is, apart from - 2 (see the Note also (5.8. 18)],
decay width. Jj
(5.8.21)
r = ,., '
where ,., is the mean lifetime of state l i ) because l c,l2 e-t/r,. (5 .8.22) To see why f, is called width, we look at the Fourier decomposition =
(5.8 .23)
Using the Fourier inversion formula, we get (5.8.24) 1 / (£) 1 2 a: { E - [ £, Re(1 Ll , ) ] } 2 + f,2/4 Therefore, f, has the usual meaning of full width at half maximum. Notice that we get the time-energy uncertainty relation from (5.8.21) Ll t Ll E - n , (5 .8.25) where we identify the uncertainty in the energy with and the mean lifetimeEven with though we discussed the subject of energy shift and decay width using the constant perturbation V obtained as the limit of (5.8.1) when 11 0, we can easily generalize our considerations to the harmonic perturbation case discussed in Section 5.6. All we must do is to let +
r,
M.
-+
(5.8 .26)
inscription of unstableandstates we have and sodeveloped on. The here quantum-mechanical de is original l y due to Wigner and Weisskopf in 1930. (5.8.2), (5.8.8) ,
(5.8. 15),
PROBLEMS 1.
Aperturbation simple harmonic oscillator (in one dimension) subjected to a where b is a real constant. a. Calculate the energy shift of the ground state to lowest nonvanishing IS
order.
Approximation Methods
346
b. Solve this problem exactly and compare with your result obtained in (a). [You may assume without proof ( u n • lxlu n ) =
V;
n 2 w (rn+TBn' , n + l + f,!Bn', - 1 ) .
]
2. In nondegenerate time-independent perturbation theory, what is the
probability of finding in a perturbed energy eigenstate ( lk)) the corre sponding unperturbed eigenstate ( lk <0l))? Solve this up to terms of order g 2• 3. Consider a particle in a two-dimensional potential v; _ 0 for 0 s x s L , 0 s y s L , 0 oo otherwise. Write the energy eigenfunctions for the ground and first excited states. We now add a time-independent perturbation of the form -
V =
1
{
{ Axy
for O s x s L , O s y ::; L ,
0
otherwise. Obtain the zeroth-order energy eigenfunctions and the first-order energy shifts for the ground and first excited states. 4. Consider an isotropic harmonic oscillator in two dimensions. The Hamiltonian is given by
Px2 P 2 mw 2 ( x 2 y 2 ) Ho = 2 + 2 y + 2+ m m
·
a. What are the energies of the three lowest-lying states? Is there any degeneracy? b. We now apply a perturbation V = 8m w2xy ,
where 8 is a dimensionless real number much smaller than unity. Find the zeroth-order energy eigenket and the corresponding energy to first order (that is, the unperturbed energy obtained in (a) plus the first-order energy shift] for each of the three lowest-lying states. c. Solve the H0 + V problem exactly. Compare with the perturbation results obtained in (b). [You may use ( n 'l x ln ) = Jnj2 m w Nn + l 8n'. n + I + f,lan'. n - I ) .] 5 . Establish (5.1 .54) for the one-dimensional harmonic oscillator given by (5.1 .50) with an additional perturbation V = !-em w2x 2• Show that all other matrix elements Vk o vanish. 6. A slightly anisotropic three-dimensional harmonic oscillator has wz wx = wy . A charged particle moves in the field of this oscillator and is at the same time exposed to a uniform magnetic field in the x direction. Assuming that the Zeeman splitting is comparable to the splitting produced by the anisotropy, but small compared to hw, cal""'
Problems
7.
347
culate to first order the energies of the components of the first excited state. Discuss various limiting cases. (From Merzbacher, Quantum Mechanics, 2/e, © 1970. Re printe d by permission of Ellis Horwood, Ltd.) A one-electron atom whose ground state is nondegenerate is placed in a uniform electric field in the z-direction. Obtain an approximate expres
sion for the induced electric dipole moment of the ground state by
considering the expectation value of
ez
with respect to the perturbed
state vector computed to first order. Show that the same expression can also be obtained from the energy shift � = - a:IE1 2/2 of the ground state computed to second order. ( Note: a stands for the polarizability.)
Ignore spin. 8. Evaluate the matrix elements (or expectation values) given below. If any vanishes, explain why it vanishes using simple symmetry (or other) arguments. a. (n = 2 , I = 1 , m = Olxl n = 2 , I = 0, m = 0) . b. (n = 2 , I = 1 , m = O IPz l n = 2, I = 0, m = 0). [In (a) and (b), lnlm) stand for the energy eigenket of a nonrela tivistic hydrogen atom with spin ignored.] c. (Lz) for an electron in a central field with j t m t I d. (singlet, ms = OIS �e- J - S �e+ l ltriplet, ms = O) for an s-state posi tronium. e. (S(ll · s<2>) for the ground state of a hydrogen molecule. 9. A p-orbital electron characterized by I n , = 1 , m = ± 1 ,0) (ignore spin) is subjected to a potential V A ( 2 - y 2 ) ( A constant) . a. Obtain the "correct" zeroth-order energy eigenstates that diagonalize the perturbation. You need not evaluate the energy shifts in detail, but show that the original threefold degeneracy is now completely s
=
=
=
4.
/
=
x
=
removed. b. Because V is invariant under time reversal and because there is no
longer any degeneracy, we expect each of the energy eigenstates
obtained in (a) to go into itself (up to a phase factor or sign) under time reversal. Check this point explicitly. 10. Consider a spinless particle in a two-dimensional infinite square well: for 0 < x :s; a , O :s; y :s; a , otherwise.
a. b.
What are the energy eigenvalues for the three lowest states? Is there any degeneracy? We now add a potential V1 = l\xy, O < x :s; a , O :s; y :s; a .
34R
Approximation Methods
11 .
Taking this as a weak perturbation, answer the following: (i) Is the energy shift due to the perturbation linear or quadratic in ,\ for each of the three states? (ii) Obtain expressions for the energy shifts of the three lowest states accurate to order A. (You need not evaluate integrals that may appear.) (iii) Draw an energy diagram with and without the perturbation for the three energy states. Make sure to specify which unperturbed state is connected to which perturbed state. The Hamiltonian matrix for a two-state system can be written as
Clearly the energy eigenfunctions for the unperturbed problems ( ,\ = 0) are given by a. Solve this problem exactly to find the energy eigenfunctions 1/; 1 and 1/; 2 and the energy eigenvalues £1 and £2. b. Assuming that A l � l « l £1° - Efl, solve the same problem using time-independent perturbation theory up to first order in the energy eigenfunctions and up to second order in the energy eigenvalues. Compare with the exact results obtained in (a). c. Suppose the two unperturbed energies are " almost degenerate," that IS,
12.
I E? - Ef l
«
A I�I-
Show that the exact results obtained in (a) closely resemble what you would expect by applying degenerate perturbation theory to this problem with E? set exactly equal to Ef. (This is a tricky problem because the degeneracy between the first and the second state is not removed in first order. See also Gottfried 1 966, 397, Problem 1.) This problem is from Schiff 1 968, 295, Problem 4. A system that has three unperturbed states can be represented by the perturbed Hamiltonian matrix £1
0
0 £1
a b
a* b* £2 where £2 > £1 • The quantities a and b are to be regarded as perturba
tions that are of the same order and are small compared with £2 - £ 1 • Use the second-order nondegenerate perturbation theory to calculate
349
Problems
the perturbed eigenvalues. (Is this procedure correct?) Then diagonalize the matrix to find the exact eigenvalues. Finally, use the second-order degenerate perturbation theory. Compare the three results obtained. 1 3. Compute the Stark effect for the 2S1 12 and 2P1 12 levels of hydrogen for a field e sufficiently weak so that ew0 is small compared to the fine structure, but take the Lamb shift 8 ( 8 = 1057 MHz) into account (that is, ignore 2 P312 in this calculation). Show that for eea 0 « 8, the energy shifts are quadratic in e, whereas for eea 0 » 8 they are linear in e. (The radial integral you need is (2slrl2p) = 3/3 a 0 .) Briefly discuss the consequences (if any) of time reversal for this problem. This problem is from Gottfried 1966, Problem 7-3. 14. Work out the Stark effect to lowest nonvanishing order for the n 3 level of the hydrogen atom. Ignoring the spin-orbit force and relativistic correction (Lamb shift), obtain not only the energy shifts to lowest
=
nonvanishing order but also the corresponding zeroth-order e�genket.
15. Suppose the electron had a very small intrinsic electric dipole moment analogous to the spin magnetic moment (that is, J.L ,1 proportional to a ). Treating the hypothetical J.L ,1 • E interaction as a small perturbation, discuss qualitatively how the energy levels of the Na atom ( Z = 1 1 ) would be altered in the absence of any external electromagnetic field. Are the level shifts first order or second order? State explicitly which states get mixed with each other. Obtain an expression for the energy shift of the lowest level that is affected by the perturbation. Assume throughout that only the valence electron is subjected to the hypotheti cal interaction. 16. Consider a particle bound to a fixed center by a spherically symmetric potential V( r ). a. Prove
-
I>V(O)I 2
=
C:11 2 ) ( �)
for all s states, ground and excited. b. Check this relation for the ground state of a three-dimensional isotropic oscillator, the hydrogen atom, and so on. ( Note: This relation has actually been found to be useful in guessing the form of the potential between a quark and an antiquark.) 17. a. Suppose the Hamiltonian of a rigid rotator in a magnetic field perpendicular to the axis is of the form (Merzbacher 1970, Problem 1 7-1) 2 AL + BLZ + CLY if terms quadratic in the field are neglected. Assuming B » C, use perturbation theory to lowest nonvanishing order to get approximate energy eigenvalues.
Approximation Methods
350
b. Consider the matrix elements
(n 'l 'm ;m;l(3z2 - r2)inlm 1m,), ( n'l 'm ;m;l xyl nlm1m ,) of a one-electron (for example, alkali) atom. Write the selection rules for !:l/, !:lm1, and !:lms. Justify your answer. 18. Work out the quadratic Zeeman effect for the ground-state hydrogen atom [(xiO) (1/V'ITa b)e-rlaoj due to the usually neglected e2A2/ 2 m c2-term in the Hamiltonian taken to first order. Write the energy =
,
shift as
!:l
= - t x 82
diamagnetic susceptibility,
and obtain an expression for ing definite integral may be useful:
X·
(The follow
19. (Merzbacher 1970, 448, Problem 1 1 . ) For the He wave function, use ljl
(xl , x 2 ) = ( Zerc/7Tao )exp [ - Zerca( rol + rz ) ] 3
3
with Zerr = 2 1� , as obtained by the variational method. The mea sured value of the diamagnetic susceptibility is 1.88 x 10-6 cm3/mole. Using the Hamiltonian for an atomic electron in a magnetic field, determine, for a state of zero angular momentum, the energy change to order B2 if the system is in a uniform magnetic field represented by the vector potential A = � 8 x r. Defining the atomic diamagnetic susceptibility by E = calculate for a helium atom in the ground state and compare the result with the measured value. 20. Estimate the ground-state energy of a one-dimensional simple harmonic oscillator using -
- �xB2 ,
x
x
(xiO) = e-JlJxl as a trial function with /3 to be varied. (You may use n! ) f ooe-axx"dx an+ l 2 1 . Estimate the lowest eigenvalue ( A ) of the differential equation d21ji + ( A - lxl) ljl = 0, ljl 0 for l x l dx 2 ljl { c(a- l x l) , = (a to be varied) 0, for l x l > a 0
= -- .
->
-> oo
351
Problems
as a trial function. ( Caution : dlji 1dx is discontinuous at x = 0.) Numerical data that may be useful for this problem are: 3 113 = 1 .442 ,
3 213 = 2 .080,
5 1 13 = 1 .710,
'TT
213 = 2 .145 .
The exact value of the lowest eigenvalue can be shown to be 1 .019. 22 . Consider a one-dimensional simple harmonic oscillator whose classical angular frequency is w0. For t < 0 it is known to be in the ground state. For t > 0 there is also a time-dependent potential V( t ) = F0x cos wt
where F0 is constant in both space and time. Obtain an expression for the expectation value ( x) as a function of time using time-dependent perturbation theory to lowest nonvanishing order. Is this procedure valid for w :: w0? [You may use ( n 'l x l n ) = /li f2 m w0 (Vn + 1 8n', n + J + Vn 8". n - 1 ).]
23. A one-dimensional harmonic oscillator is in its ground state for t < 0. For t � 0 it is subjected to a time-dependent but spatially uniform force (not potential!) in the x-direction, F( t ) = F0e - 11' .
a. Using time-dependent perturbation theory to first order, obtain the probability of finding the oscillator in its first excited state for t > 0. Show that the t -> oo ( 7' finite) limit of your expression is indepen dent of time. Is this reasonable or surprising? b. Can we find higher excited states? [You may use ( n 'l x l n ) = ..jli j2 m w ( Vn 8n' , n - I + vn + 1 8n' , n + ) . ] 1
24. Consider a particle bound in a simple harmonic oscillator potential. Initially ( t < 0), it is in the ground state. At t = 0 a perturbation of the
form
H '( x , t ) = Ax2e- '1'
is switched on. Using time-dependent perturbation theory, calculate the probability that, after a sufficiently long time ( t » r ), the system will have made a transition to a given excited state. Consider all final states. 25 . The unperturbed Hamiltonian of a two-state system is represented by H0 =
( :? :� )·
There is, in addition, a time-dependent perturbation V( t ) -
( A. cos0 wt
A cos wt O
)
{ A. real) .
Approximation Methods
352
a. At t = 0 the system is known to be in the first state, represented by
( �).
Using time-dependent perturbation theory and assuming that £2 E� is not close to ± hw, derive an expression for the probability that the system be found in the second state represented by
(�)
as a function of t ( t > 0). b. Why is this procedure not valid when £2 - E� is close to ± hw? 26. A one-dimensional simple harmonic oscillator of angular frequency w is acted upon by a spatially uniform but time-dependent force ( not potential) F( t ) =
( F0rjw ) (rz + tz) '
- oo < t < oo .
At t = - oo , the oscillator is known to be in the ground state. Using the time-dependent perturbation theory to first order, calculate the prob ability that the oscillator is found in the first excited state at t = + oo . Challenge for experts: F( t ) is so normalized that the impulse
fF( t ) dt
imparted to the oscillator is always the same-that is, independent of r; yet for T » l jw, the probability for excitation is essentially negli gible. Is this reasonable? (Matrix element of x: ( n 'l x l n ) = ( h j2mw ) 1 12 ( Vn on', n - l + v'n+T on', n + l ). ]
27. Consider a particle in one dimension moving under the influence of some time-independent potential. The energy levels and the correspond ing eigenfunctions for this problem are assumed to be known. We now subject the particle to a traveling pulse represented by a time-dependent potential, V( t ) = A o ( x - ct ) .
a. Suppose at t = - oo the particle is known to be in the ground state whose energy eigenfunction is (xli) = u,(x). Obtain the probability for finding the system in some excited state with energy eigenfunc tion ( x l / ) = u1 ( x ) at t = + oo . b . Interpret your result in (a) physically by regarding the o-function pulse as a superposition of harmonic perturbations; recall
J
OO 1 dw e lw[(xjc) o ( x - ct ) = 2Trc _ 00
I]
353
Problems
Emphasize the role played by energy conservation, which holds even quantum mechanically as long as the perturbation has been on for a very long time. 28. A hydrogen atom in its ground state [(n, /, m) = (1,0, 0)] is placed between the plates of a capacitor. A time-dependent but spatial uniform electric field (not potential!) is applied as follows:
for t < 0, for t > 0
(E0 in the positive z-direction) .
Using first-order time-dependent perturbation theory, compute the probability for the atom to be found at t » r in each of the three 2p states: ( n, !, m) = (2, 1 , ± 1 or 0). Repeat the problem for the 2s state: ( n , !, m) = (2, 0,0) . You need not attempt to evaluate radial integrals, but perform all other integrations (with respect to angles and time). 29. Consider a composite system made up of two spin t objects. For t < 0, the Hamiltonian does not depend on spin and can be taken to be zero by suitably adjusting the energy scale. For t > 0, the Hamiltonian is given by Suppose the system is in 1 + - ) for t :5: 0. Find, as a function of time, the probability for being found in each of the following states I + + ), 1 + - ), 1 - + ), and 1 - - ) : a. By solving the problem exactly. b. By solving the problem assuming the validity of first-order time dependent perturbation theory with H as a perturbation switched on at t = 0. Under what condition does (b) give the correct results? 30. Consider a two-level system with £1 < £2 • There is a time-dependent potential that connects the two levels as follows: Vu = Vzz = 0,
V21 = ye-'"'' ( y real) .
At t = 0, it is known that only the lower level is populated-that is, c1(0) = 1, c 2 (0) = 0. a. Find lc1(t)1 2 and h(t)l 2 for t > 0 by exactly solving the coupled differential equation
2 ihi:k = .E vk. (t) e1"'kn1cn , n-1
( k = 1 , 2) .
b. Do the same problem using time-dependent perturbation theory to lowest nonvanishing order. Compare the two approaches for small values of y. Treat the following two cases separately: (i) w very different from w21 and (ii) w close to w21 .
Approximation Methods
354
Answer for (a): (Rabi's formula)
/h l cz(t) l 2 = /lhz + --( l 2 · l ct(t) i 2 = 1 l cz(t) i1 .
-
w
_
3 1 . Show that the slow-turn-on of perturbation V � Ve�' (see Baym 1969, 257) can generate contribution from the second term in (5.6.36). 32. a. Consider the positronium problem you solved in Chapter 3, Prob lem 3 . In the presence of a uniform and static magnetic field B
along the z-axis, the Hamiltonian is given by
Solve this problem to obtain the energy levels of all four states using degenerate time-independent perturbation theory (instead of diagon alizing the Hamiltonian matrix). Regard the first and the second terms in the expression for H. as H0 and V, respectively. Compare your results with the exact expressions
{
singlet m = 0 . 1 for tnp et m = 0 for triplet m = ± 1 , where triplet (singlet) m 0 stands for the state that becomes a pure triplet (singlet) with m = 0 as B � 0. b. We now attempt to cause transitions (via stimulated emission and absorption) between the two m = 0 states by introducing an oscil lating magnetic field of the "right" frequency. Should we orient the magnetic field along the z-axis or along the x- (or y-) axis? Justify your choice. (The original static field is assumed to be along the z axis throughout.) c. Calculate the eigenvectors to first order. 32' . Repeat Problem 32 above, but with the atomic hydrogen Hamiltonian H = AS1 • S2
+
(meceB ) S1 ·
B
where in the hyperfine term AS, · S2, S 1 is the electron spin, while S2 is the proton spin. [Note the problem here has less symmetry than that of the positronium case]. 33. Consider the spontaneous emission of a photon by an excited atom.
Problems
34.
355
E1
The process is known to be an transition. Suppose the magnetic quantum number of the atom decreases by one unit. What is the angular distribution of the emitted photon? Also discuss· the polar ization of the photon with attention to angular-momentum conserva tion for the whole (atom plus photon) system. Consider an atom made up of an electron and a singly charged (Z = 1) triton CH). Initially the system is in its ground state (n = 1, l = 0). Suppose the nuclear charge suddenly increases by one unit (realistically by emitting an electron and an antineutrino). This means that the triton nucleus turns into a helium ( Z = 2) nucleus of mass 3 e He). Obtain the probability for the system to be found in the ground state of the resulting helium ion. The hydrogenic wave function is given by
( ) 't' n � l . t�o
,,
x
=
1
V 'TT
-
c
( ) 3/2 z
-
ao
e - 7r!uo
·
35. The ground state of a hydrogen atom ( n = 1 , / = 0) is subjected to a time-dependent potential as follows:
V( x, t ) = V cos( k 0
z
-
wt ).
Using time-dependent perturbation theory, obtain an expression for the transition rate at which the electron is emitted with momentum p. Show, in particular, how you may compute the angular distribution of the ejected electron (in terms of 8 and cp defined with respect to the z-axis). Discuss briefly the similarities and the differences between this problem and the (more realistic) photoelectric effect. ( Note: For the initial wave function see Problem 34. If you have a normalization problem, the final wave function may be taken to be If;
f
( ) x
=
(-1- )e'p·x/h L 312
with L very large, but you should be able to show that the observable effects are independent of L.) 36. Derive an expression for the density of free particle states in two dimensions, that is, the two-dimensional analog of
p ( E ) dE dQ =
( 2� ) 3 ( �� ) dEd Q ,
(
k= �,E
=
:: ) .
Your answer should be written as a function of k (or E ) times dE dcp, where cp is the polar angle that characterizes the momentum direction in two dimensions. 37. A particle of mass m constrained to move in one dimension is confined
Approximation Methods
356
within 0 < x < L by an infinite-wall potential V = oo for x < 0, x > L , V = O for O s x s L . Obtain an expression for the density of states (that is, the number of states per unit energy interval) for high energies as a function of E. (Check your dimension!) 38. Linearly polarized light of angular frequency is incident on a one electron "atom" whose wave function can be approximated by the ground state of a three-dimensional isotropic harmonic oscillator of angular frequency Show that the differential cross section for the ejection of a photoelectron is given by do d fJ
w
w0 .
=
h [ k12 ( w )2]} { m 2ww0 mw0 m w0 [( 2�:�: ) ]
4 rxh 2k} � 7rh
x sin20 cos 2� exp
exp
_
+
_
c
cos 0
hk1
provided the ejected electron of momentum can be regarded as being in a plane-wave state. (The coordinate system used is shown in Figure 5.10.) 39. Find the probability l�(p')l 2d 3p' of the particular momentum p' for the ground-state hydrogen atom. (This is a nice exercise in three-dimen sional Fourier transforms. To perform the angular integration choose the z-axis in the direction of p.) 40. Obtain an expression for T(2 p --> ls) for the hydrogen atom. Verify that it is equal to 1 .6 X 10 - 9 s.
CHAPTER 6
Identical Particles
This short chapter is devoted to a discussion of some striking quantum mechanical effects arising from the identity of particles. We also consider some applications to atoms more complex than hydrogen or hydrogenlike atoms. 6. 1. PERMUTATION SYMMETRY
In classical physics it is possible to keep track of individual particles even though they may look alike. When we have particle 1 and particle 2 considered as a system, we can, in principle, follow the trajectory of 1 and that of 2 separately at each instant of time. For bookkeeping purposes, you may color one of them blue and the other red and then examine how the red particle moves and how the blue particle moves as time passes. In quantum mechanics, however, identical particles are truly indis tinguishable. This is because we cannot specify more than a complete set of commuting observables for each of the particles; in particular, we cannot label the particle by coloring it blue. Nor can we follow the trajectory because that would entail a position measurement at each instant of time, which necessarily disturbs the system; in particular the two situations (a) and (b) shown in Figure 6.1 cannot be distinguished-not even in principle. For simplicity consider just two particles. Suppose one of the par ticles, which we call particle 1, is characterized by l k ), where k ' ts a 357
Identical Particles
358
(a)
(b)
FIGURE 6.1. Two different paths, (a) and (b), of a two-electron system, for example, in which we cannot assert even in principle through which of the paths the electrons pass.
collective index for a complete set of observables. Likewise, we call the ket of the remaining particle l k") . The state ket for the two particles can be written in product form, (6.1 .1) l k ') l k " ) , where it is understood that the first ket refers to particle 1 and the second ket to particle 2. We can also consider (6.1 .2) lk")lk'), where particle 1 is characterized by l k ") and particle 2 by l k ') . Even though the two particles are indistinguishable, it is worth noting that mathematically (6.1.1) and (6.1 .2) are distinct kets for k ' * k ". In fact, with k ' * k ", they are orthogonal to each other. Suppose we make a measurement on the two-particle system. We may obtain k ' for one particle and k " for the other. However, we do not know a priori whether the state ket is lk') lk "), lk")lk'), or-for that matter-any linear combination of the two. Put in another way, all kets of form (6.1 .3) lead to an identical set of eigenvalues when measurement is performed. This is known as exchange degeneracy. Exchange degeneracy presents a difficulty because, unlike the single particle case, a specification of the eigenvalue of a complete set of observables does not completely determine the state ket. The way nature avoids this difficulty is quite ingenious. But before proceeding further, let us develop the mathematics of permutation symmetry. We define the permutation operator P12 by PJ2 i k ) l k ") = lk") l k ') . '
Clearly,
(6.1 .4)
(6.1 .5)
359
6.1. Permutation Symmetry
Under P12 , particle 1 having k ' becomes particle 1 having k " ; particle 2 having k " becomes particle 2 having k '. In other words, it has the effect of interchanging 1 and 2. In practice we often encounter an observable that has particle labels. For example in S1•S2 for a two-electron system, S1 (S2) stands for the spin operator of particle 1 (2). For simplicity we consider a specific case where the two-particle state ket is completely specified by the eigenvalues of a single observable A for each of the particles: A1la ') la")
and
(6 . 1 .6a)
= a 'la')la")
{6.1 .6b)
A 2 l a ') la") = a"la') la"),
where the subscripts on A denote the particle labels, and A1 and A 2 are thus the observables A for particles 1 and 2, respectively. Applying P1 2 to both sides of (6.1.6a), we have 1 P12A 1 Pi2 Pda ') la") = a'Pd a ') l a ") 1 P12 A 1 Pi2 la") l a ')
=
= a 'la" ) l a ') .
{6. 1 .7 )
This is consistent with (6.1.6b) only if
(6.1.8) It follows that P12 must change the particle labels of observables. Let us now consider the Hamiltonian of a system of two identical particles. The observables, such as momentum and position operators, must necessarily appear symmetrically in the Hamiltonian-for example, H
Pi P� = 2m + 2m + Vpair( lxl -x21) + Vext (xl ) + V.xt (x z ) .
(6 . 1 .9 )
Here we have separated the mutual interaction between the two particles from their interaction with some other external potential. Clearly, we have {6 .1 .10) for H made up of observables for two identical particles. Because P12 commutes with H, we can say that P12 is a constant of the motion. The eigenvalues of P12 allowed are + 1 and - 1 because of (6.1.5). It therefore follows that if the two-particle state ket is symmetric (antisymmetric) to start with, it remains so at all times. If we insist on eigenkets of P12 , two particular linear combinations are selected: lk'k")
+
=
� ( l k ') lk") + lk") lk') ) ,
(6.1.11 a)
Identical Particles
360
and l k 'k") _ =
+
�
( lk')lk") - lk " ) l k ') ) .
We can define the symmetrizer and antisymmetrizer as follows: S12 i(l P1 2 ) , A 12 i ( l - P12 ) . ==
==
(6.1 .llb)
(6.1 . 12)
If we apply S12 ( A 12 ) to an arbitrary linear combination of l k ') lk ") and l k ") l k ') , the resulting ket is necessarily symmetric (antisymmetric). This can easily be seen as follows:
{ �::} [ =
=
c1 l k ' ) l k " ) + c2 1k ") lk ')]
H c1 l k ') l k ") + c2 1 k ") l k ')) ± H c 1 1 k ") l k ') + c2lk ') l k ")) c 1 + c2 2 ( l k ') l k" ) ± l k" ) l k ') ) .
(6.1 .13)
Our consideration can be extended to a system made up of many identical particles. We define P;) k ') l k ") · · l k ; ) l k ; + I ) · · · l k 1 ) · · · ·
Clearly,
=
l k ') lk") . . · l k 1 ) l k ' + 1 ) . . · l k ' ) . . . .
p2 = 1 lj
( 6.1 .14) ( 6.1 .15)
just as before, and the allowed eigenvalues of P;1 are + 1 and - 1. It is important to note, however, that in general ( 6 .1 .16) It is worth explicitly working out a system of three identical particles. First, there are 3! = 6 possible kets of form l k ') lk ") l k "' )
(6.1 .17 ) where k ', k " , and k "' are all different. Thus there is sixfold exchange degeneracy. Yet if we insist that the state be totally symmetrical or totally antisymmetrical, we can form only one linear combination each. Explicitly, we have l k 'k"k "' ) ± "'
�
{ l k') l k " ) l k '" ) ± l k " ) l k ') l k '" )
+ l k " ) l k "' ) l k ') ± lk"' ) l k " ) l k ' )
+ lk"' ) l k ')lk" ) ± l k ') lk "' ) l k") } . {6.1 .18) These are both simultaneous eigenkets of P12 , P23, and P13• We remarked
361
6.2. Symmetrization Postulate
that there are altogether six independent state kets. It therefore follows that there are four independent kets that are neither totally symmetrical nor totally antisymmetrical. We could also introduce the operator by defining
P123
(6.1 .19) P123 ( lk ') l k ") lk"')) = lk") lk"' ) lk '). Note that P1 23 = P1 2 P13 because P12 P13 (1k')l k ")lk"' )) = P1 2(lk"' )l k ")lk')) lk")lk"' )lk'). =
k ', k",
(6.1 .20)
In writing (6.1.18) we assumed that and are all different. If two of the three indices coincide, it is impossible to have a totally antisymmetrical state. The totally symmetrical state is given by
k "'
lk'k'k ") = � (lk')l k')l k ") + lk')lk")lk') + lk")lk')lk') ) , +
(6.1 .21)
where the normalization factor is understood to be J2!j3! . For more-gen eral cases we have a normalization factor
v N1·'N2 '· · · · Nn •' , (6.1 .22) N! where N is the total number of particles and N, the number of times lk (il ) occurs.
6.2. SYMMETRIZATION POSTULATE So far we have not discussed whether nature takes advantage of totally symmetrical or totally antisymmetrical states. It turns out that systems containing identical particles are either totally symmetrical under the interchange of any pair, in which case the particles are said to satisfy Bose-Einstein (B-E) statistics, hence known as bosons, or totally antisym metrical, in which case the particles are said to satisfy Fermi-Dirac (F-D) statistics, hence known as fermions. Thus
N
P,1
P,) N identical bosons) = + IN identical bosons) P,) N identical fermions) = - IN identical fermions) ,
(6.2.la) (6.2.lb)
where is the permutation operator that interchanges the ith and the jth particle, with i and j arbitrary. It is an empirical fact that a mixed symmetry does not occur.
Identical Particles
362
Even more remarkable is that there is a connection between the spin of a particle and the statistics obeyed by it: (6.2.2a) Half-integer spin particles are fermions; (6.2.2b)
Integer spin particles are bosons.
Here particles can be composite; for example, a 3He nucleus is a fermion just as the e- or the proton; a 4 He nucleus is a boson just as the w ±, w 0 meson. This spin-statistics connection is, as far as we know, an exact law of nature with no known exceptions. In the framework of nonrelativistic quantum mechanics, this principle must be accepted as an empirical pos tulate. In the relativistic quantum theory, however, it can be proved that half-integer spin particles cannot be bosons and integer spin particles cannot be fermions. An immediate consequence of the electron being a fermion is that the electron must satisfy the Pauli exclusion principle, which states that no two electrons can occupy the same state. This follows because a state like 1 k ') 1 k ') is necessarily symmetrical, which is not possible for a fermion. As is well known, the Pauli exclusion principle is the cornerstone of atomic and molecular physics as well as the whole of chemistry. To illustrate the dramatic differences between fermions and bosons, let us consider two particles, each of which can occupy only two states, characterized by k ' and k". For a system of two fermions, we have no choice; there is only one possibility:
� (lk')lk") - lk")lk')).
( 6 .2.3 )
For bosons there are three states possible:
I k ') lk '),
lk'')lk " ),
� (lk')lk")
+
lk'')lk') ).
( 6 .2 .4)
In contrast, for "classical" particles satisfying Maxwell-Boltzmann (M-B) statistics with no restriction on symmetry, we have altogether four indepen dent states: lk')lk"), lk'') lk ')' lk ') lk ')' lk")lk"). ( 6.2.5 ) We see that in the fermion case it is impossible for both particles to occupy the same state. In the boson case, for two out of the three allowed kets, both particles occupy the same state. In the classical (M-B) statistics case, both particles occupy the same state for two out of the four allowed kets. In this sense fermions are the least sociable; they avoid each other to make sure that they are not in the same state; in contrast, bosons are the most sociable, they really love to be in the same state, even more so than classical particles obeying M-B statistics.
363
6.3. Two-Electron System
The difference between fermions and bosons show up most dramati cally at low temperatures; a system made up of bosons, such as liquid 4He, exhibits a tendency for all particles to get down to the same ground state at extremely low temperatures. This is known as Bose-Einstein condensation, a feature not shared by a system made up of fermions. 6.3. 1WO-ELECTRON SYSTEM
Let us now consider specifically a two-electron system. The eigenvalue of the permutation operator is necessarily - 1. Suppose the base kets we use may be specified by xl , x 2 , m sl • and m s2> where m sl and m s2 stand for the spin-magnetic quantum numbers of electron 1 and electron 2, respectively. We can express the wave function for a two-electron system as a linear combination of the state ket with eigenbras of x 1 , x 2, m 51, and m s2 as follows: (6.3.1) If the Hamiltonian commutes with St�t >
[ stoP H] = 0,
(6.3.2) then the energy eigenfunction is expected to be an eigenfunction of St�P and if If is written as (6.3 .3) 1/1 = c�> (xl , x 2 ) x , then the spin function X is expected to be one of the following: X++ 1 triplot (rym�tl) 12 ( x + - + X-+ )
}
X-1
l2 ( X+- - X - + )
singlet ( antisymmetrical) ,
(6.3.4) where X + - corresponds to x ( m 51 -l:, m 52 = - 1 ) Notice that the triplet spin functions are all symmetrical; this is reasonable because the ladder operator S1 _ + S2 _ commutes with P12 and the I + ) I + ) state is even under =
P1 2 .
.
We note (xl , msl ; x 2 , mdPJ2ia)
Fermi-Dirac statistics thus requires (xl , m sl ; x 2 , mda)
=
(x 2 , ms2 ; xl , mslla).
(6.3.5)
- (x 2 , ms2; xl, mslla) .
(6.3.6)
=
364
Identical Particles
Clearly, P12 can be written as
p1 2 p(s2pace)p1(s2pin) =
(6.3 .7)
1
where PWaceJ just interchanges the position coordinate, while pgpinJ just interchanges the spin states. It is amusing that we can express pgpinJ as
p12<spinJ .!_ =
2
(1
+
_±_S 2 1 · S2 li
which follows because
)
'
(triplet)
( 6.3.8)
( 6.3.9)
(singlet} It follows from (6.3.3) that letting Ia) --> Pula)
(6.3.10)
amounts to This together with (6.3.6) implies that if the space part of the wave function is symmetrical (antisymmetrical) the spin part must be antisymmetrical (symmetrical). As a result, the spin triplet state has to be combined with an antisymmetrical space function and the spin singlet state has to be com bined with a space symmetrical function. The space part of the wave function cj>(x 1,x2) provides the usual probabilistic interpretation. The probability for finding electron 1 in a volume element d 3x1 centered around x1 and electron 2 in a volume element d3x 2 is (6.3.12) To see the meaning of this more closely, let us consider the specific case where the mutual interaction between the two electrons [for example, Vpair( lx 1 - x 2 i), S 1 ·S2 ] can be ignored. If there is no spin dependence, the wave equation for the energy eigenfunction 1/; [see (6.1 .9)], (6.3 .13) is now separable. We have a solution of the form wA (x 1 ) w8(x2) times the spin function. With no spin dependence S1�1 necessarily (and trivially) commutes with H, so the spin part must be a triplet or a singlet, which have definite symmetry properties under N2pinJ. The space part must then be written as a symmetrical and antisymmetrical combination of wA (x 1 )w8(x 2 )
365
6.3. Two-Electron System
and
wA (x 2 )w8(x1): cp ( x l , x 2 )
=
{i
1
[ wA ( x l ) wn ( x z ) ± wA ( x 2 ) wn ( x l )]
(6.3 .14)
where the upper sign is for a spin singlet and the lower is for a spin triplet. The probability of observing electron 1 in d 3x 1 around x1 and electron 2 in d 3x 2 around x 2 is given by
H lwA x l ) i 2 1 w n ( x 2 ) 1 2 + lwA ( x 2 ) 1 2 lw 8( x 1 ) 1 2 + 2 Re [ wA x1 ) w8(x 2 ) w1 ( x 2 ) w}j ( x1 ) ] } d 3x1 d 3x 2 •
(6.3.15)
The last term in the curly bracket is known as the exchange density. We immediately see that when the electrons are in a spin-triplet state, the probability of finding the second electron at the same point in space vanishes. Put another way, the electrons tend to avoid each other when their spins are in a triplet state. In contrast, when their spins are in a singlet state, there is enhanced probability of finding them at the same point in space because of the presence of the exchange density. Clearly, the question of identity is important only when the exchange density is nonnegligible or when there is substantial overlap between function wA and function w8• To see this point clearly, let us take the extreme case where l wA (x)l 2 (where x may refer to x1 or x 2 ) is big only in region A and lw 8(x) l 2 is big only in region B such that the two regions are widely separated. Now choose d 3x 1 in region A and d 3x 2 in region B; see Figure 6.2. The only important term then is just the first term in (6.3.15), (6.3 .16) which is nothing more than the joint probability density expected for classical particles. In this connection, recall that classical particles are necessarily well localized and the question of identity simply does not arise. Thus the exchange-density term is unimportant if regions A and B do not overlap. There is no need to antisymmetrize if the electrons are far apart
FIGURE 6.2. Two widely separated regions A and B; f wA (x)f 2 is large in region A while
fw8 (x)f 2 is large in region B.
Identical Particles
366
and the overlap is negligible. This is quite gratifying. We never have to worry about the question of antisymmetriz_;1tion with 10 billion electrons, nor is it necessary to take into account the antisymmetrization requirement between an electron in Los Angeles and an electron in Beijing.
6.4. THE HELIUM ATOM A study of the helium atom is rewarding for several reasons. First of all, it is the simplest realistic problem where the question of identity-which we encountered in Section 6.3-plays an important role. Second, even though it is a simple system, the two-particle Schrodinger equation cannot be solved analytically; therefore, this is a nice place to illustrate the use of perturba tion theory and also the use of the variational method. The basic Hamiltonian is given by
�
2 2 2 � 2e e 2e H= + +2m 2 m r1 r2 r12 ' -
2
-
-
(6 .4 . 1 )
-
where r1 = lx11, r2 = lx 2 1, and r12 = lx1 - x 21; see Figure 6.3. Suppose the e j r12 term were absent. Then, with the identity question ignored, the wave function would be just the product of two hydrogen atom wave functions with Z 1 changed into Z 2. The total spin is a constant of the motion, so the spin state is either singlet or triplet. The space part of the wave function for the important case where one of the electrons is in the ground state and the other in an excited state characterized by (nlm) is 1 -
=
=
<J> ( x l , x 2 ) = fi [ • hoo (x t Hn/m ( x 2 ) ± •hoo( x 2 ) .Vn/m (x l )]
(6.4. 2 )
where the upper (lower) sign is for the spin singlet (triplet). We will come back to this general form for an excited state later. For the ground state, we need a special treatment. Here the con figuration is characterized by (1s )2, that is, both electrons in n 1, I = 0. =
Electron
1
Nucleus of charge + 21el
FIGURE 6.3. Schematic diagram of the helium atom.
367
6.4. The Helium Atom
The space function must then necessarily be symmetric and only the spin singlet function is allowed. So we have ,1,
x 't' lOO ( X I ) ·'· 't' IOO ( 2 ) X singlet _
z3
_
3
'1Ta o
e - Z( r1 + r2)/a0 X
(6.4.3)
with Z = 2. Not surprisingly, this " unperturbed" wave function gives
E=2x4( - � 2a 0 )
(6 .4.4)
for the ground-state energy, which is about 30% bigger than the experimen tal value. This is just the starting point of our investigation because in obtain ing the above form (6.4. 3), we have completely ignored the last term in (6.4.1) that describes the interaction between the two electrons. One way to approach the problem of obtaining a better energy value is to apply first-order perturbation theory using (6.4. 3) as the unperturbed wave func tion and e 2jr12 as the perturbation. We obtain
d (ls)
2
= ( 12 ) JJ -- - 2 _
e2 r
(Is) '
=
z6
'IT
2G o6
e
Z ( r1
+ r2)/a 0 _ d 3x d 3x I 2.
e2 r
12
( 6 .4.5)
To carry out the indicated integration we first note (6 .4.6)
where r ( r < ) is the larger (smaller) of r1 and r2 and y is the angle between x 1 and x 2 • The angular integration is easily performed by expressing P,(cos y ) in terms of using the addition theorem and of spherical harmonics: >
Y,m(81 ,cp1 ) Yt(82 ,cjl2 )
P,(cos y ) = 2
:: 1
I
L �m *( 81 , >l ) �m ( 82 , >2 ) .
m - -I
( 6.4.7)
The angular integration is now trivial:
j Yt (e,,
( 6 .4.8)
The radial integration is elementary (but involves tedious algebra!); it leads to
a6 -- -128 5
zs .
( 6 .4.9)
Identical Particles
368
Combining everything, we have (for Z = 2)
2) ( )( 6 ) ( �(Is)' = 8 ( � ) ( ;:J. z6e
4w(V4w )
W 2a
2
a
5 128
z5
=
( 6 .4.10)
Adding this energy shift to (6.4.4), we have
(
Ecai = - 8 +
� ) ( ;:0)
- 74.8 eV.
�
(6.4.11)
Compare this with the experimental value, Eexp
=
- 78.8 eV.
( 6 .4.12)
This is not bad, but we can do better! We propose to use the variational method with Z, which we call Zerr • as a variational parameter. The physical reason for this choice is that the effective Z seen by one of the electrons is smaller than 2 because the positive charge of 2 units at the origin (see Figure 6.3) is "screened" by the negatively charged cloud of the other electron; in other words, the other electron tends to neutralize the positive charge due to the helium nucleus at the center. For the normalized trial function we use ( 6 .4.13)
From this we obtain
( (
Pi
p�
J
(-
Ze 2
Ze 2
_
)( ) _
e2
_
H= 0 - + - 0 - 0 - + - 0 + 0 - 0 2m 2m r1 r2 r1 2 -
-
-
z;11 5 = 22 - 2ZZeff + g Zerr
)( )
e2 ao .
(6.4.14)
We easily see that the minimization of H is at
zeff = 2 - t6 = 1 .6875 .
( 6 .4.15)
This is smaller than 2, as anticipated. Using this value for Zerr we get (6.4. 16) Ecal - 77. 5 eV, =
which is already very close considering the crudeness of the trial wave function. Historically, this achievement was considered to be one of the earliest signs that Schrodinger's wave mechanics was on the right track. We
6.4. The Helium Atom
E,oo +
FIGURE
6.4.
369
J
Singlet
J
Triplet
Enlm
Schematic diagram for the energy-level splittings of (ls )( nl) for the helium
atom.
cannot get this kind of number by the purely algebraic (operator) method. The helium calculation was first done by A. Unsold in 1927. * Let us briefly consider excited states. This is more interesting from the point of view of illustrating quantum-mechanical effects due to identity. We consider just (1s )( nl ) . We write the energy of this state as
E = Etoo + Entm + 11 £.
(6.4.17)
( �r12 ) = I± J,
( 6 .4.18)
In first-order perturbation theory, /1 £ is obtained by evaluating the expec tation value of e 2j r1 2• We can write where I and J, known respectively as the direct integral and the exchange integral, are given by (6.4.19a)
J jd 3X1 jd 3X 21hoo(x 1 ) tit nlm (x 2) ..C tltroo(x 2 ) t�t:,m ( X I ) · r12 =
J
I
(6.4.19b)
The upper (lower) sign goes with the spin singlet (triplet) state. Obviously, is positive; we can also show that is positive. So the net result is such that for the same configuration, the spin singlet state lies higher, as shown in Figure 6.4. The physical interpretation for this is as follows: In the singlet case the space function is symmetric and the electrons have a tendency to come close to each other. Therefore, the effect of the electrostatic repulsion is more serious; hence, a higher energy results. In the triplet case, the space function is antisymmetric and the electrons tend to avoid each other. Helium in spin-singlet states is known as parahelium, while helium in spin-triplet states is known as orthohelium. Each configuration splits into •
A. Unsold, Ann. Phys. 82 (1927) 355.
Identical Particles
370
l !1_ s_ H2 _P_ ___
<�'-----
__
(1 s)(2s)
<
------< (1 s) 2 ' So
------
para 1 P, ortho 3P2,1,o
------- para ' So
'------
ortho 3S,
Spin singlet, necessarily "para"
FIGURE 6.5. Schematic energy level diagram for low-lying configurations of the helium atom.
the para state and the ortho state, the para state lying higher. For the ground state only parahelium is possible. See Figure 6.5 for a schematic energy level diagram of the helium atom. It is very important to recall that the original Hamiltonian is spin-independent because the potential is made up of just three Coulomb terms. There was no S1• S2-term whatsoever. Yet there is a spin-dependent effect-the electrons with parallel spins have a lower energy-that arises from Fermi-Dirac statistics. This explanation of the apparent spin dependence of the helium atom energy levels is due to Heisenberg. The physical origin of ferromag netism-alignment of the electron spins extended over microscopic dis tances-is also believed to be essentially the same, but the properties of ferromagnets are much harder to calculate quantitatively from first princi ples.
6.5. PERMUTATION SYMMETRY AND YOUNG TABLEAUX In keeping track of the requirement imposed by permutation symmetry, there is an extremely convenient bookkeeping technique known as Young tableaux, after A. Young, an English clergyman who published a fundamen tal paper on this subject in 1901. This section is meant to be an introduction to Young tableaux for the practical-minded reader. We do not necessarily present all derivations; this is one of those cases where the rules are simpler than the derivations. To illustrate the basic techniques involved, we consider once again the spin states of a two-electron system. We have three symmetric states corresponding to the three possible orientations of the spin-triplet state and an antisymmetric state corresponding to the spin singlet. The spin state of an individual electron is to be represented by a box. We use [I] for spin up
6.5. Permutation Symmetry and Young Tableaux
371
and [TI for spin down. These boxes are the basic primitive objects of SU(2). A single box represents a doublet. We define a symmetric tableau, ITJ, and an antisymmetric tableau, B When applied to the spin states of a two-electron system, rn is the Young tableau for a spin triplet, while EJ is the Young tableau for a spin singlet. Returning now to [I] and (}], we can build up the three-triplet states as follows:
.___.__.I
m1H
(6. 5 .1)
1 [T] 2 m
(6.5.2)
We do not consider [TII] because when we put boxes horizontally, symme try is understood. So we deduce an important rule: Double counting is avoided if we require that the number (label) not decrease going from the left to the right. As for the antisymmetric spin-singlet state,
I �I I; I Ii I
is the only possibility. Clearly and are impossible because of the requirement of antisymmetry; for a vertical tableau we cannot have a is discarded to avoid double counting. To symmetric state. Furthermore, eliminate the unwanted symmetry states, we therefore require the number (label) to increase as we go down. We take the following to be the general rule. In drawing Young tableau, going from left to right the number cannot decrease; going down the number must increase. We deduced this rule by considering the spin states of two electrons, but we can show this rule to be applicable to the construction of any tableau. Consider now three electrons. We can construct totally symmetric spin states by the following rule:
111 112 122 222
(6.5.3)
This method gives four states altogether. This is just the multiplicity of the j 1 state, which is obviously symmetric as seen from the m 1 case, where all three spins are aligned in the positive z-direction. What about the totally antisymmetric states? We may try vertical tableau like =
=
Identical Particles
372
But these are illegal, because the numbers must increase as we go down. This is not surprising because total antisymmetry is impossible for spin states of three electrons; quite generally, a necessary (but, of course, not sufficient) condition for total antisymmetry is that every state must be different. In fact, in SU(2) we cannot have three boxes in a vertical column. We now define a mixed symmetry tableau that looks like this: EP. Such a tableau can be visualized as either a single box attached to a symmetric tableau [as in Equation (6.5.4a)) or a single box attached to an antisymmetric tableau (as in Equation (6.5.4b)].
rn I
(6.5 .4a)
I
l - -
( 6.5 .4b)
If the spin function for three electrons is symmetric in two of the indices, neither of them can be antisymmetric with respect to a third index. For example, (6.5.5) satisfies symmetry under 1 - 2 , but it is neither symmetric nor antisym metricwith respect to 1 - 3 (or 2 - 3) . We may try to enforce antisymmetry under 1 - 3 by subtracting the same thing from (6.5.5) with 1 and 3 interchanged:
I + > 1 1 - >z l - h - I + hi - >zl - > 1 + 1 - > 1 l + >zl - h - 1 - hi + >zl - > 1 (6.5.6) =
This satisfies antisymmetry under 1 <-+ 3, but we no longer have the original symmetry under 1 <-+ 2. In any case the dimensionality of SJ is 2; that is, it represents a doublet ( j = ! ). We can see this by noting that such a tableau corresponds to the angular momentum addition of a spin doublet ( 0 ) and a spin triplet ( rn ). We have used up the totally symmetric quartet ITIJ , so the remainder must be a doublet. Alternatively, we may attach a doublet ( 0 ) to a singlet ( B ) as in (6.5.4b); the net product is obviously a doublet. So no matter how we consider it, BJ must represent a doublet. But this is precisely what the rule gives. If the number cannot decrease in the horizon tal direction and must increase in the vertical direction, the only possibilities are
[BTI
and
�[ill]·
(6.5 .7)
Because there are only two possibilities, BJ must correspond to a doublet. Notice that we do not consider ca .
6.5.
Permutation Symmetry and Young Tableaux
373
We can consider Clebsch-Gordan series or angular-momentum ad dition as follows:
f EtJ (l/2) ® .@ (1/2) = EtJ {l) EJ:)EtJ (O) \ o ® o = rn EJ:) B f EtJ (l) ®EtJ (l/2) = EtJ (3/2) EJ:) EtJ (l/2) \ [0 ® 0 = 1 I I EJ:) [p
(2 X 2 = 3 + 1) (6.5 .8)
(3 X 2 = 4 + 2)
{ a�:@�,� as,
We now extend our consideration to three primitive objects. A box can assume three possibilities: (6.5.9) The labels 1 , 2, and 3 may stand for the magnetic quantum numbers of p-orbitals in atomic physics or charge states of the pion 'TT + , '1T 0, 'TT -, or the u , d, and s quarks in the SU(3) classification of elementary particles. We assume that the rule we inferred using two primitive objects can be generalized and work out such concepts as the dimensionality; we then check to see whether everything is reasonable. rn
w : dimensionality 3 dimensionality 3
tE Bl
Antisymmetry
only one
(In SU(3) 3* is used to distinguish
B from 0.)
(Totally antisymmetrical.)
(6.5.10)
Symmetry
rn
[II] []1}] [jJJJ [Iill lliil [Til] : dimensionality 6 \ 1\1\1\ \ I \ l \ 21 \ I I l \ 31 \ 1 1 3 1 3 1 \ 2 \ 2121 \ 2 1 2 \ 3 \
Mixed symmetry
BJ
Identical Particles
374
These tableaux correspond to representations of SU(3). A1 boxes in first row ··· I I A 2 boxes in second row ::: A 3 boxes in third row (6.5 .11) For the purpose of figuring the dimensionality, it is legitimate to strike out
I I ::: 1 1 1
II
• • • •
.
• •
(6.5 .12)
.
•
a " singlet" at the left-hand side of (6.5.1 1). We can show the dimensionality to be (6.5 .13) d(A 1 ' A 2 ' A ) = ( p + 1)( q + 21) ( p + q + 2)
3 where p = A1 - A 2 and q = A 2 - A3•
In the SU(3) language, a tableau corresponds to a definite irreducible representation of SU(3). On the other hand, when each box is interpreted to mean a j = 1 object, then a tableau does not correspond to an irreducible representation of the rotation group. Let us discuss this point more specifi cally. We start with putting together two boxes, where each box represents a j = 1 state. (3 X 3 = 6 + 3) (6.5. 14)
The horizontal tableau has six states; the tableau is to be broken down into j = 2 (multiplicity 5) and j = 0 (multiplicity 1), both of which are symmet ric. In fact this is just how we construct a symmetric second-rank tensor out of two vectors. The vertical tableau 8 corresponds to an antisymmetric j = 1 state behaving like a X b. To work out the addition of three angular momenta with }1 = }2 = }3 = 1, let us first figure out
0] ® 0=1 EP B ® o� tp � § I I I
(6 X 3 = 10 + 8)
(6.5.15)
(3 x 3 = 8 + 1)
To see how ITIJ breaks down, we note that it must contain j = 3. However, this is not the only totally symmetric state; a(b • c)+b(c·a)+c(a·b) is also totally symmetric, and this has the transformation property of j = 1 . So CID contains both j = 3 (seven states) and j = 1 (three states). As for EP with eight possibilities altogether, the argument is more involved, but we note that this 8 cannot be broken into 7 + 1 because 7 is totally symmetric, while 1 is totally antisymmetric when we know that 8 is of mixed symmetry. So the only other possibility is 8 = 5 + 3-in other words j = 2 and j = 1.
6.5.
Permutation Symmetry and Young Tableaux
375
(6.5.16)
In terms of angular-momentum states, we have j = 3 (dimension 7) once (totally symmetric) j
=
2 (dimension 5) twice
(both mixed symmetry)
j = 1 (dimension 3) three times (one totally symmetric, two mixed symmetry) (totally antisymmetric) j = 0 (dimension 1) once
(6.5.17) That the j = 0 state is unique corresponds to the fact that the only product of a, b, and c invariant under rotation is a · (b X c), which is necessarily antisymmetric. We can also look at the j = 1 states using the tensor approach. We have three independent vectors constructed from a, b, and c: a(b• c), b(c• a), and c(a•b). From these we can construct only one totally symmetric combination, (6.5 .18) a(b · c) + b( c· a) + c(a · b) .
We can apply this consideration to the (2 p )3 configuration of the nitrogen atom, N ( Z = 7, shell structure (1s)2(2s )2(2p)3). There are al together 6·5·4 6! = 20 states. (6.5 .19) 3!3! = 3 . 2
The space function behaving like ITIJ , which would have I tot is actually impossible because it must be combined with a totally antisymmet ric spin function. Remember =
3,
§.
where each box stands for spin up or down, is impossible for spin states of three electrons. In contrast the totally antisymmetric space function
with / tot = 0 is perfectly allowable if combined with a totally symmetric spin function ITIJ , which has total spin angular momentum 1 ; we have 4S 1 for this configuration. The mixed-symmetry space function tfJ ( / tot = 2,31)2 must be combined with a mixed-symmetry spin function, necessarily a spin doublet EP . So we have 2D512• 312 and 2P312, 112 • The counting of states
Identical Particles
376
leads to the following: j = J. J=1 J=1
6 states 4 X 3 states 2 states 20 states
( 6 .5 .20)
This agrees with the other way of counting. Finally, we consider applications to elementary particle physics. Here we apply SU(3) considerations to the nonrelativistic quark model, where the primitive objects are u, d, s (up, down, and strange, where up and down refer to isospin up and isospin down ). A box can now stand for u, d, or s. We look at the decuplet representation ITIJ (10): 0 � +, , � +, 0, + +,
-o -
w .:.
·
ddd udd uud uuu ddsuds uus dssuss sss
l = 'f,
/ = 1,
J=l 1 = 0,
(6.5.21)
2'
where I stands for isospin. Now all ten states are known to be spin 1 objects. It is safe to assume that the space part is in a relative S-state for low-lying states of three quarks. We expect total symmetry in the spin degree of freedom. For instance, the j = 1, m = 1 state of � can be visualized to have quark spins all aligned. But the quarks are spin 1 objects; we expect total antisymmetry because of Fermi-Dirac statistics. Yet Quark label (now called flavor): symmetric Spin:
symmetric
Space:
symmetric
(6.5.22)
for the j = � decuplet. But as is evident from (6.5.22), the total symmetry in this case is even! This led to the " statistics paradox," which was especially embarrassing because other aspects of the nonrelativistic quark model were so successful. The way out of this dilemma is to postulate that there is actually an additional degree of freedom called color (red, blue, or yellow) and pos-
377
Problems
tulate that the observed hadrons (strongly interacting particles, including J 1 + states considered here) are color singlets =
�
( JRBY) - JBRY) + JBYR) - JYBR) + JYRB) - JRYB)). ( 6. 5 .23)
This is in complete analogy with the unique
of (6.5.10), a totally antisymmetric combination in color space. The statistics problem is now solved because p
.
lj
(-)
=
p (flavor) p (spin) p (space) p (color) lj
(+)
l)
(+)
(+)
lj
(-)
lj
( 6. 5 .24)
One might argue that this is a cheap way to get out of the dilemma. Fortunately, there were also other pieces of evidence in favor of color, such as the decay rate of '17 ° and the cross section for electron-positron annihila tion into hadrons. In fact, this is a very good example of how attempts to overcome difficulties lead to nontrivial prediction of color.
PROBLEMS 1. a. N identical spin ! particles are subjected to a one-dimensional simple harmonic oscillator potential. What is the ground-state energy? What is the Fermi energy? b. What are the ground state and Fermi energies if we ignore the mutual interactions and assume N to be very large? 2. I t is obvious that two nonidentical spinl particles with no orbital angular momenta (that is, s-states for both) can form j = 0, j = 1 , and j = 2. Suppose, however, that the two particles are identical. What restrictions do we get? 3. Discuss what would happen to the energy levels of a helium atom if the electron were a spinless boson. Be as quantitative as you can. 4. Three spin 0 particles are situated at the corners of an equilateral triangle. Let us define the z-axis to go through the center and in the direction normal to the plane of the triangle. The whole system is free to rotate about the z-axis. Using statistics considerations, obtain restrictions on the magnetic quantum numbers corresponding to Jz.
Identical Particles
378
Consider three weakly interacting, identical spin 1 particles. a. Suppose the space part of the state vector is known to be symmetric under interchange of any pair. Using notation I + ) 10 ) 1 + ) for particle 1 in m = + 1, particle 2 in m 0, particle 3 in m + 1 , and so on, construct the normalized spin states in the following three cases: (i) All three of them in I + ). (ii) Two of them in I + ), one in 10). (iii) All three in different spin states. What is the total spin in each case? b. Attempt to do the same problem when the space part is antisymmetric under interchange of any pair. 6. Suppose the electron were a spin � particle obeying Fermi-Dirac statis tics. Write the configuration of a hypothetical Ne ( Z = 1 0) atom made up of such "electrons" [that is, the analog of (1s) 2 (2s) 2 (2p) 6 ]. Show that the configuration is highly degenerate. What is the ground state (the lowest term) of the hypothetical Ne atom in spectroscopic notation es+ 1LJ , where S, L, and J stand for the total spin, the total orbital angular momentum, and the total angular momentum, respectively) when exchange splitting and spin-orbit splitting are taken into account? 7. Two identical spin 1 fermions move in one dimension under the in fluence of the infinite-wall potential V = oo for x < 0, x > L, and V = 0 for 0 s; x s; L. a. Write the ground-state wave function and the ground-state energy when the two particles are constrained to a triplet spin state ( ortho state). b. Repeat (a) when they are in a singlet spin state ( para state). c. Let us now suppose that the two particles interact mutually via a very short-range attractive potential that can be approximated by V = - M ( x1 - x 2 ) { X > O) . Assuming that perturbation theory is valid even with such a singular potential, discuss semiquantitatively what happens to the energy levels obtained in (a) and (b). 5.
s
s =
s =
CHAPTER 7
Scattering Theory
This last chapter of the book is devoted to the theory of scattering and, more generally, collision processes. It is impossible to overemphasize the importance of this subject.
7.1. THE LIPPMANN-SCHWINGER EQUATION We begin with a time-independent formulation of scattering processes. We assume that the Hamiltonian can be written as H = H0 + V (7.1 .1) where
H0
stands for the kinetic-energy operator
(7 .1 .2) In the absence of a scatterer, V would be zero, and an energy eigenstate would be just a free particle state lp). The presence of V causes the energy eigenstate to be different from a free-particle state. However, if the scatter ing process is to be elastic-that is, no change in energy-we are interested in obtaining a solution to the full-Hamiltonian Schrodinger equation with the same energy eigenvalue. More specifically, let I>) be the energy eigenket 379
Scattering Theory
380
(7 .1 .3) Hol
E l >f ) .
The basic Schrodinger equation we wish to solve is ( H0 + V ) I .P ) =
(7 .1 .4)
Both H0 and H0 + V exhibit continuous energy spectra. We look for a solution to (7.1 .4) such that, as V --> 0, we have 1 >1- ) --> I ), where I ) is the solution to the free-particle Schrooinger equation [(7.1 .3)] with the same energy eigenvalue. It may be argued that the desired solution is 1 (7.1 .5) 1 >1- ) = V l >f ) + l
H0
apart from complications arising from the singular nature of the operator 1/(E - H0 ). We can see this by noting that E - H0 applied to (7.1.5) immediately gives the correct equation, (7.1 .4). The presence of I>) is reasonable because 1 >1- ) must reduce to I ) as V vanishes. However, without prescriptions for dealing with a singular operator, an equation of type (7.1.5) has no meaning. The trick we used in time-independent perturbation theory-inserting the complimentary projection operator, and so on (see Section 5.1)-does not work well here because both 1<1>) and 1 >1- ) exhibit continuous eigenvalues. Instead, this time the solution is specified by making E slightly complex: 1 ± l) . (7 .1.6) l >f< l ) = I ) + E - H0 ± z.e
Vl >f <
±
This is known as the Lippmann-Schwinger equation. The physical meaning of ± is to be discussed in a moment by looking at (x j,p< ± l ) at large distances. The Lippmann-Schwinger equation is a ket equation independent of particular representations. We now confine ourselves to the position basis by multiplying (x i from the left. Thus
j
(
(x j .p < ± l ) = (x l
E-
�o ± i£ x') (x'j V I >f < ± >).
(7.1 .7)
This is an integral equation for scattering because the unknown ket 1 .p< ± l) appears under an integral sign. If I ) stands for a plane-wave state with momentum p, we can write (x i
e
ip•x/ h
( 2wll
) J/2
( 7 .1 . 8)
7. 1 .
The Lippmann-Schwinger Equation
381
[Editor's Note: In contrast to bound states, the plane-wave state (7 . 1 .8) is,
of course, not normalizable and not really a vector in Hilbert space. Dealing with such states is one of the inconveniences of time-independent scattering theory. The "normalization" in (7 . 1 .8) is such that
I
d3x(p' l x)(x lp> = o<3>(p - p').
(7. 1.9)]
If, on the other hand, the Lippman-Schwinger equation is written using the momentum basis, we obtain (7. 1 . 10) We shall come back to this equation in Section 7.2. Let us consider specifically the position basis and work with (7 . 1 . 7). To make any progress we must first evaluate the kernel of the integral equation defined by G ± (x, x')
,
=
li? (
1 - x 2m E - Ho
We claim that G ± (x x') is given by
(
where E
h2
2m
=
X
fi2J(lf2m.
1
G "' (x, x ' )
±
x' .
(7. 1 . 1 1)
--4 I 'I
1 e ± iklx - x' l 'IT
(7. 1 . 12)
X - X
To show this, we evaluate (7 . 1 . 11) as follows:
±
E - H0
=
)
IE.
iE X IP ' \ E-
1 (p' 2/2m)
±
I P" I X ' ) ' iE P ,\(
(7. 1 . 13)
where H0 acts on (p' 1 . Now use
; , __ 1___ ,\ = o(3>(p' - p") .. \P E - (___,. p ' 2/2m) ± iE P I E - ( p ' 2/2m) iE
I
(x p' ) =
ip' . e
x/t,
(21rh)3;2 '
The right-hand side of (7. 1 . 13) becomes ��2
2m
I
+
(p" lx' ) =
eip· . (x - x·)/. d3p · (21Th)3 [£ - (p' 2/2m) ±
(Z1Th)3;2
(7 . 1 . 14)
iE ]'
(7. 1 . 15)
e
- ip' · X'If,
3R2
Scattering Theory
Now write E = h 2 k 2j2m and set p' = llq. Equation (7.1.15) becomes
_1 = 8'172
l
(
+ x dqq e rqj x - x'l - e - "tl• - x 'l
i lx - x'l
j_ x '
1 e ± r k jx - x j 4'17 lx - x ' l
q 2 - k 2 + ie
) (7 .1 .16)
In the last step we used the method of residues, noting that the integrand has poles in the complex q-plane at
/ (� )
q=±k 1±
£ ""' ± k ± ie'. 2
(7.1 .17)
The reader may recognize that G ± is nothing more than Green's function for the Helmholtz equation,
(7.1 .18) Armed with the explicit form of G ± as in Equation (7.1.12), we can use (7.1.11) to write Equation (7.1.7) as
2m j 3 e +- rk jx x 'l (x'I VI.f< ± l ) . (7.1 .19) (x l .f < ± l ) = (x l cl> ) - - d x ' 4?T ix - x 'l ll 2
Notice that the wave function (xl.f< ± l ) in the presence of the scatterer is written as the sum of the wave function for the incident wave ( x l
7.1. The Lippmann-Schwinger Equation
383
As a result, we obtain ( x 'I V I I/ P l ) = d 3x"(x 'W ix" ) (x" l lf( ± > )
j
= V(x ') ( x'l.f< ± l ) .
(7.1 .21)
The integral equation (7.1.19) now simplifies as
Let us attempt to understand the physics contained in this equation. The vector x is understood to be directed towards the observation point at which the wave function is evaluated. For a finite range potential, the region that gives rise to a nonvanishing contribution is limited in space. In scattering processes we are interested in studying the effect of the scatterer (that is, the finite range potential) at a point far outside the range of the potential. This is quite relevant from a practical point of view because we cannot put a detector at short distance near the scattering center. Observation is always made by a detector placed very far away from the scatterer at r greatly larger than the range of the potential. In other words, we can safely set lx l » lx 'l ,
as depicted in Figure 7.1. Introducing
(7 .1 .23)
r = lx l
(7 .1 .24)
r ' = lx'l
and
a = L( x,x ') ,
(7 .1 .25) p
k =p,
Observation point
11'1
Finite-range scattering potential. The observation point P is where the wave function (xi>J;< ± l ) is to be evaluated, while the contribution to the integral in Equation (7.1 .22) is for lx'l less than the range of the potential, as depicted by the shaded region of the figure. FIGURE 7.1 .
384
Scattering Theory
we have for r >> r ',
=
lx - x 'l Vr 2 - 2rr 'cos a + r'2
(.
,2 2 r' __ = r 1 - _ _cos a + !:___ ,z
r
"" r - r · x
where
1/2
(7 .1 .26)
'
.
(7 .1 .27)
k ' = kf.
(7 .1 .28)
r= •
Also define
)
X
IXI
The motivation for this definition is that k' represents the propagation vector for waves reaching observation point x. We then obtain
(7.1 .29) for large r. It is legitimate to replace 1/ lx - x'l by just 1/ r . Furthermore, to get rid of the ll 's in expressions such as 1/(27TII )312 , it is convenient to use lk) rather than IP, ), where
(7 .1 .30) Because lk) is normalized as
(klk') = s (3>(k - k')
we have
(x lk) =
So, finally,
1
(2 '1T )
r k ' e ---- J d3x' e [ ik· x ' k r j (k ', k) ] . e / 2 3 ) 7T (2
1 2m (x l�<+>) r arge (xlk) - 4 '1T 11 2 =
e 1k ·3/x 2
1
+
r
·
(7.1 .31) (7 . 1 .32)
e - ik' · x'v(x ' ) ( x ' l � < + > )
r
(7 .1 .33) This form makes it very clear that we have the original plane wave in propagation direction k plus an outgoing spherical wave with amplitude f(k', k) given by
e -ik'•x'
/(k', k) = - _!_ 2 m2 (2'1T )3 j d3x' V(x ')(x'l� ( +)) 2 V 11 (27T ) '1T 4 2 m (k' I �< + > ) . _!_ 7T )3 (7 . 1.34) = (2 4 7T tz 2 I V
7.1 .
The Lippmann-Schwinger Equation
385
Similarly, we can readily show from (7.1.22) and (7.1.29) that (x l l/;< - l ) corresponds to the original plane wave in propagation direction k plus an incoming spherical wave with spatial dependence e - ,kr/r and amplitude - ( l /4 7T )(27T ) 3 (2m/ n2)( - k'I V W - 1 ) . To obtain the differential cross section daldil, we may consider a large number of identically prepared particles all characterized by the wave function (7.1.32). We can then ask, what is the number of incident particles crossing a plane perpendicular to the incident direction per unit area per unit time? This is just proportional to the probability flux due to the first
term on the right-hand side in (7 . 1 .33). Likewise we may ask, what is the number of scattered particles going into a small area du subtending a differential solid-angle element dil? Clearly, da number of particles scattered into d Q per unit time dQ = dQ number of incident particles crossing unit area per unit time l di = r lj�cau l = IJ(k' ,k) l 2dil . (7. 1 . 35)
IJincid l
Hence the differential cross section is
:� = l f(k', k) i 2 .
(7 .1 .36)
Wave-Packet Description The reader may wonder here whether our time-independent formula tion of scattering has anything to do with the motion of a particle being
®------+-
--- - -- f/1b
(a) (b) FIGURE 7.2. (a) Incident wave packet approaching scattering center initially. (b) Incident wave packet continuing to move in the original direction plus spherical outgoing wave front (after a long time duration).
Scattering Theory
386
bounced by a scattering center. The incident plane wave we have used is infinite in extent in both space and time. In a more realistic situation, we consider a wave packet (a difficult subject!) that approaches the scattering center.* After a long time we have both the original wave packet moving in the original direction plus a spherical wave front that moves outward, as in Figure 7.2. Actually the use of a plane wave is satisfactory as long as the dimension of the wave packet is much larger than the size of the scatterer (or range of V ).
7.2.
THE BORN APPROXIMATION
Equation (7.1.34) is still not directly useful in computing the differential cross section because in the expression for f(k',k) the unknown ket l>f< + J ) appears. If the effect of the scatterer is not very strong, we may infer that it is not such a bad approximation to replace (x'l>f< + l ) (which appears under the integral sign) by (x'I
=
e ik •x'
(2'1T )3 /2
.
(7.2.1 )
We then obtain an approximate expression for f(k', k). Because we treat the potential V to first order, the approximate amplitude so obtained is known as the first-order Born amplitude and is denoted by J<1l:
(7.2.2) In other words, apart from - (2mj4'1Tii 2 ), the first-order amplitude is just the three-dimensional Fourier transform of the potential V with respect to q = k - k'.
For a spherically symmetric potential, j <1l(k', k) is a function of lk - k 'l, given by (remember lk'l k by energy conservation) =
lk - k ' l = q = 2 k sin
�
;
(7. 2 .3)
see Figure 7.3. We can perform the angular integration explicitly to obtain
2 m � ioo C V( r )( e iqr - e - , qr ) dr r 2 li 2 tq 2 m 1 i oo r V( r ) sin qrdr . =--
o
li 2 q
-
0
(7 .2.4)
*For a fuller account of the wave-packet approach, see M. L. Goldberger and K. M. Watson, Collision Theory, Chapter 3 (New York: John Wiley, 1964); R. G. Newton. Scattering Theory of Waves and Particles, Chapter 6 (New York: McGraw-Hill, 1 966).
7.2.
387
The Born Approximation
k FIGURE
7.3. Scattering through angle
0,
he e q
w
r
�
k - k'.
An example 1s now m order. Consider scattering by a Yukawa potential V( r )
=
v: e - w
o
p.r
(7 .2.5)
,
where V0 is independent of, r and 1/p. corresponds, in a certain sense, to the range of the potential. Notice that V goes to zero very rapidly for r » 1/p.. For this potential we obtain [from (7.2.4)] (l (fJ) J l
where we have used Im
=_
( )
1
2m V0 , p. ll 2 q 2 + J.!2
[ )0 e - we iqr dr] = - Im( - p.1 { oo
+
(7 _2_6)
.
tq
)
(7 .2.7)
Notice also that (7.2.8)
So, in the first Born approximation, the differential cross section for scattering by a Yukawa potential is given by (7 .2.9) It is amusing to observe here that as p. --> 0, the Yukawa potential is reduced to the Coulomb potential, provided the ratio V0jp. is fixed-for example, to be ZZ 'e 2 -in the limiting process. We see that the first Born differential cross section obtained in this manner becomes (7 .2 .10)
388
Scattering Theory
Even the h disappears if hk is identified as IP I, so (7.2.11) where EKE = IPI 2/2m; this is precisely the Rutherford scattering cross section that can be obtained classically. Coming back to (7.2.4), the Born amplitude with a spherically symmetric potential, there are several general remarks we can make if f(k ', k) can be approximated by the corresponding first Born amplitude, t
1 . dojdrl , or f(f}), is a function of q only; that is, /(8 ) depends on the energy ( h 2k 2j2m ) and 8 only through the combination
2. 3.
4.
2 k 2 (1 - cos 8). f( 8) is always real. dojdrl is independent of the sign of V. For small k ( q necessarily small), 2m _!_ j v(r) d 3x J0> ( 8 ) 4 '17 h 2 involving a volume integral independent of 8. /( 8 ) is small for large q due to rapid oscillation of the integrand. =
5.
-
Let us now discuss the validity of the first-order Born approximation. It is clear from the derivation that if the Born approximation is to be applicable, (xl �< + >) should not be too different from (xl cJ> ) inside the range of potential- that is, in the region where V(x) is appreciable. Otherwise, it is not legitimate to replace W + l ) by lcJ> ). In other words, the distortion of the incident wave must be small. Going back to the exact expression (7.1.22), we note that the condition that (xl >t,< + >) be not too different from (x I ) at the center of the scattering potential x 0 is seen to be 2m 1 li? 4'TT
I d3x' r'
eikr· -
"'
. e V(x ') 'k x·
«
1.
(7.2.12)
We now consider what the special case of the Yukawa potential in (7.2.5) may imply. At low energies-that is, for small k ( k « p, )-it is legitimate to replace e ' kr' by 1. So we must have 2m 1 Vo l l « . h 2 1-"2
( 7 .2 .13 )
This requirement may be compared with the condition for the Yukawa potential to develop a bound state, which we can show to be 2m (7 .2.14) 22 1 Vol 2 2.7 h
!L
7.2. The
Born Approximation
389
with V0 negative. In other words, if the potential is strong enough to develop a bound state, the Born approximation will probably give a misleading result. In the opposite high k-limit, the condition that the second term in (7.1 .22) is small can be shown to imply 2 m I Vo l
;,2 }lk
ln
() k
Jl
«
1.
(7 .2.15)
As k becomes larger, this inequality is more easily satisfied. Quite generally, the Born approximation tends to get better at higher energies.
The Higher-Order Born Approximation Let us now consider the higher-order Born approximation. Here, it is more compact to use the symbolic approach. For the transition operator T, we define T such that (7 .2.16) Multiplying the Lippmann-Schwinger equation (7.1 .6) by V, we obtain T l
E
-
. HO + IE 1
T i
(7 .2 .1 7)
This is supposed to hold for I ) taken to be any plane-wave state; furthermore, we know that these momentum eigenkets are complete. There fore, we must have the following operator equation satisfied: T= V+ V
E-
1 HO + I. E T.
(7 .2.18)
The scattering amplitude of (7. 1 .34) can now be written as [using (7.2.16) with the I ) as momentum eigenkets] / (k', k)
=
-
}'lT 2117 (27T ) 3 (k'IT ik) .
(7 .2.19)
Thus to determine /(k', k) , it is sufficient to know the transition operator T. We can obtain an iterative solution for T as follows: T= V+ V
1 1 1 V+ V V V+ · · · · E - H0 + ie E - H0 + ie E - H0 + ie (7 .2 .20)
Correspondingly, we can expand f as follows:
/( k ', k)
L /( n l( k ', k) , 00
=
n �1
(7.2.21 )
Scattering Theory
390
FIGURE 7.4.
Physical interpretation of the higher-order Born term f(2)(k',k).
where n is the number of times the V operator enters. We have
= - 41'/T �7 (2'1T ) 3 (k'I V Ik ) 2 m (2'1T ) 3 (k 'I V 1 . Vik) (7 2 22) j < 2l ( k ', k) = - _!__ E - Ho + I f 4'1T n 2 J<1l (k', k)
.
.
If an explicit form is required for / (2), we can write it as J<2l
= - _!__ 2 m (2 '1T ) 3j d 3x ' j d 3x "(k'lx ') V(x') 4 '1T 112
(
X x'
1
. )
E - H0 + I f
x" V( x" ) (x "lk)
2 m j d 3x ' j d 3x " e - 'k'·x' v(x ') = - _!__ 4 '1T n 2 X
[ �7 G + (x',x" ) ] V(x " ) e'k- x". (7.2.23)
(7 .2.23) 7.4,
A physical interpretation of is given in Figure where the incident wave interacts at x" -which explains the appearance of V(x")-and then propagates from x" to x ' via Green's function for the Helmholtz equation (7.1.18); subsequently, a second interaction occurs at x' - thus the appearance of V(x')-and, finally, the wave is scattered into the direction k '. In other words, j < 2l corresponds to scattering viewed as a two-step process; likewise, /(3) is viewed as a three-step process, and so on. 7.3 OPTICAL THEOREM
There is a very famous relationship popularly attributed to Bohr, Peierls, and Placzek [ Editor's Note: This relationship is in fact due to Eugene Feenberg* (Phys. Rev. 40, 1932); see R. G. Newton (A m. J. Phys. 44, • As pointed out by Newton in his Review (c . f. Ref. 8 therein), Feenberg's paper is also remarkable for the fact that it was published on 1 April 1932, with a receipt date stated as 8 September 1932-thus violating causality!
7.3. Optical
391
Theorem
relates the imaginary part of the forward scattering amplitude f( () = 0) to the total cross section O',o,, as follows:
639, 1976) for the historical background. ] called the optical theorem, which Optical Theorem
lm f( 8 = 0) = katot
(7.3 . 1 )
4?T
where (7.3 .2) f( 8 = 0) "" f(k, k) , the setting of k ' k imposes scattering in the forward direction , and f d d fJ . (7.3 .3) dQ. Proof From (7.2.1 9) we have =
a
atot =
/ ( 8 = 0) = / (k , k) =
-
L �7 (2 '1T )3 (k i T ik) .
(7 .3 .4)
We next evaluate Im(k i T ik) using (7.2.16), (7.1 .6), and the hermiticity of V: lm(k i T ik ) = Im(k iV I tf;< + l ) =
Im
[(
( 1¥( + ) 1 _ N( + ) I V
E-
Now we use the well-known relation E_
�0 - iE ) v w + ) > 1 ·
(7 . 3 . 5 )
�o - iE = Pr. ( E � HJ + i?T8 ( E - H0 )
to reduce the right-hand side of (7.3.5) to the form Im( ( l/.,< + l i V I tf;< + l ) ) - Im(t¥< + l i V Pr .
E
� H0 V l tf;< + l )
(7.3.6)
The first two terms of (7.3.6) vanish because of the hermiticity of V and V Pr. [1/( E - H0 )] V; hence (7.3.6) reduces to - ?T(t/; < + l i VB ( E - H0 ) V I tf;< + l ) .
(7.3 .7)
Again, we can recast (7.3.7) using (7.2.16) and I ) = lk) as lm(k i T ik)
=
-
= -
?T(k 1Tt8 ( E - H0 ) T ik) ?T
j d 3 k ' (k i T t lk')(k'I T ik)8
= - ?T j df2'
�� l(k' I T ik) l 2 ,
( h.�� ) E-
2
(7.3 .8)
Scattering Theory
392
where we have used d3k ' = k'2 dE( dk'jdE) df!.', the 8-function constraint E = li2k '2/2m [hence dE = (li2k 'j m) dk '], and-finally- k ' = k. From (7.3.4) and (7.3.8), we have Im j (O )
(
1 3 �7 (2'1T) = 4 '/T - k otot ,
-
'IT�k j dW I ( k'ITik )l 2
)
4'1T where in the last step we have used (7.1.36), (7.2.19), and (7.3.3).
(7.3.9) 0
We can appreciate the physical significance of the optical theorem after we discuss shadow scattering.
7.4. EIKONAL APPROXIMATION This approximation covers a situation in which V(x) varies very little over a distance of order of wavelength i\ (which can be regarded as " small"). Note that V itself need not be weak as long as E » lVI; hence the domain of validity here is different from the Born approximation. Under these condi tions, the semiclassical path concept becomes applicable, and we replace the exact wave function >/;( + l by the semiclassical wave function [see (2.4.18) and (2.4.22)], namely, (7.4.1) This leads to the Hamilton-Jacobi equation for S,
/i 2k 2 E 2m + V = = 2m '
( vs ) z
(7 .4.2)
as discussed in Section 2.4. We propose to compute S from (7.4.2) by making the further approximation that the classical trajectory is a straight line path, which should be satisfactory for small deflection at high energy.* Consider the situation depicted in Figure 7.5, where the straight-line trajec tory is along the z-direction. Integrating (7.4.2) we have
] 1 12 2m [ 2 h = j_ "' k2 - 2 v( Vb2 + z ' ) dz' + constant. S
z
/i
(7 .4.3)
The additive constant is to be chosen in such a way that
-sli ...... kz
as
V
......
0
(7 .4.4)
so that the plane-wave form for (7.4.1) is reproduced in this zero-potential *Needless to say, solution of (7.4.2) to forbidding task in general.
determine
the classical trajectory would be a
7.4.
Eikonal Approximation
393
limit. We can then write Equation (7.4.3) as
! = kz + f j/k2 - � V{ Jb2 + z'2 ) - k ] dz' kz - n7k t v( Vb2 + z '2 ) dz' �
(7 .4.5)
- oc
where for E » V, we have used
1/;( + l (x) = 1/; ( + ) (b +
1
- im z zz) "" (27T1)3/2 e ikzexp [ -v( vb2 + z'2 ) dz'] . J n2k - 00
(7 .4.6) Though (7.4.6) does not have the correct asymptotic form appropri'!,tt:.for an incident plus spherical outgoing wave (that is, it is not of form ei k + f(O)(eikrlr) and indeed refers only to motion along the original direction), it can nevertheless still be used in (7 1 34) to obtain an approximate expres sion for f(k' ,k), to wit, ·
.
x
.
/(k',k) = - 417T �7 j d3x • e -,k' ·x'v( Jb2 + z'2 )e ' k·x' X exp [ - i: r· V{Jb2 + z" 2 ) dz " ] . n k
(7 .4.7)
- oo
Note that without the last factor, exp [ . . ), (7 .4.7) is just like the first-order ') integra Born amplitude in (7.2.2). We perform the three-dimensional ( tion in (7.4.7) by introducing cylindrical coordinates (see .
b
t
- - - - - - -
II �
',
z
---
/
d 3x d3x'= bdbd.phdz'
z-direction
b
- - -
'
'
,
_ __
.,
/
I
Scattering region
FIGURE 7.5. Schematic diagram of eikonal approximation scattering where the classical straight line trajectory is along the z-direction, lxl r, and b lbl is the impact parameter. �
�
Scattering Theory
394
Figure 7.5) and noting that
( k - k ' ) ·x ' = (k - k ') · ( b + z 'z )
(7 .4.8) where we have used k .l b and (k - k ') · z 0( 0 2 ) , which can be ignored for small deflection 0. Without loss of generality we choose scattering to be in the xz-plane and write k '• b = ( k sin O x + k cos8z) · ( b cos
- k '• b,
-
The expression for f(k',k) becomes X f + oo dz Vexp - oo
We next use the following identities:
[ -:h km r
]
Vdz ' .
- oo
(7 .4.1 0) (7.4.11)
and
z � + <XJ ] ] 2 k n i z z -i -i r r m m f-+oooo dz Vexp f_ Vdz ' f_ Vdz ' = --;;;- exp h2k
00
h2k
where, of course, the contribution from of (7 .4.12) vanishes in the exponent. So, finally z
where
/ ( k ', k) = - ik
� ( b) =
=
00
(7 .4.1 2) oo on the right-hand side
{"0 dbbJ0 ( kbfJ ) [ e 2 .t'(b) - 1 ] ,
2kh 2 -m
--
� + oo v(vIb 2 + z 2 ) dz . - 00
z � - oo '
(7 .4.13)
(7 .4.14)
e2'�( hl
In (7.4.14) we fix the impact parameter b and integrate along the straight-line - 1 ] in path z, shown in Figure 7.5. There is no contribution from [ (7.4.13) if b is greater than the range of V. It can be shown in a straightforward manner that the eikonal approximation satisfies the optical theorem (7.3.1). This proof plus some interesting applications-for example, when V is a Gaussian potential �(b) becomes Gaussian in b-space-are discussed in the literature (Gottfried 1966). For the case where V is a Yukawa potential, see Problem 7 in this chapter.
7.5.
Free-Particle States: Plane Waves Versus Spherical Waves
395
7.5. FREE-PARTICLE STATES: PLANE WAVES VERSUS SPHERICAL WAVES
In considering scattering by a spherically symmetric potential, we often examine how states with definite angular momenta are affected by the scatterer. Such considerations lead to the method of partial waves, to be discussed in detail in Section 7.6. However, before discussing the angular momentum decomposition of scattering states, let us first talk about free particle states, which are also eigenstates of angular momentum. For a free particle the Hamiltonian is just the kinetic-energy oper ator, which obviously commutes with the momentum operator. It is for this reason that in Section 7.1 we also took 1<1> ) , an eigenket of the free-particle Hamiltonian, to be a momentum eigenket or a plane-wave state lk), where the eigenvalue of the momentum operator is hk. We note, however, that the free-particle Hamiltonian also commutes with U and Lz- Thus it is possible to consider a simultaneous eigenket of H0, L2 , and Lz. Ignoring spin, such a state is denoted by IE, /, m ) , often called a spherical-wave state. More generally, the most general free-particle state can be regarded as a superposition of IE, /, m ) with various E, /, and m in much the same way as the most general free-particle state can be regarded as a superposi tion of lk) with different k, different in both magnitude and direction. Put in another way, a free-particle state can be analysed using either the plane-wave basis { lk) } or the spherical-wave basis { I E, /, m ) } . We now derive the transformation function (k i E , /, m ) that connects the plane-wave basis with the spherical-wave basis. We can also regard this quantity as the momentum-space wave function for the spherical wave characterized by E, /, and m. We adopt the normalization convention for the spherical-wave eigenket as follows: ( E ', / ', m 'I E , /, m) = 811,8mm'8 ( E - E ' ) . (7.5.1) In analogy with the position-space wave function, we may guess the angular dependence: m (7.5.2) (k i E , I, m ) = g1E ( k ) Y; (k) . To prove this rigorously, we proceed as follows. First, consider the momen tum eigenket lki)-that is, a plane-wave state whose propagation direction is along the positive z-axis. An important property of this state is that it has no orbital angular-momentum component in the z-direction: (7.5.3) Actually this is plausible from classical considerations: The angular momentum component must vanish in the direction of propagation because L · p = (x X p)·p = 0. Because of (7.5.3)-and since ( E ', / ', m'lkz) 0 for =
Scattering Theo ry
396
m ' * O-we
must be able to expand l ki) = L /'
lki)
as follows:
j dE 'I E ', 1 ', m' = 0 ) ( £ ', / ', m ' = O i kZ) .
(7.5.4)
Notice that there is no m' sum; m ' is always zero. We can obtain the most general momentum eigenket, with the direction of k specified by 8 and cj>, from lki) by just applying the appropriate rotation operator as follows [see Figure 3.3 and (3.6.47)]: lk) = 22 ( a = cJ> , /1 = 8 , y = O) I kZ) . (7.5.5) Multiplying (7.5.5) by ( E, /, ml on the left, we obtain ( E , /, m lk) =
f. j dE '( E, l, m l 22 ( a = cp, j1 = 8 , y = O) I E ', l ', m ' = O) /'
x ( E ', / ' , m ' = OikZ) =L /'
j dE ' 22:!,�>( a = cj> , /1 = 8 , y = 0)
X 811.8 ( £ - E ' ) ( E ', / ', m ' = O i kZ)
= 22:!,6 ( a = > , /1 = 8 , y = 0)( E , I, m = OikZ) .
(7.5.6)
0\ k-1.7 i� independent of the orit:ntation of k-that is, 1 g,�(k). So independent of 8 and
Now
\E,
l,
m
/'4:
-
we can write, using (3.6.51) ,
(k i E,/,m) =
g,E
(k) Y/" (k) .
(7.5.7)
Let us determine g1 E( k ). First, we note that
(7.5.8) But we also let H0 - E operate on a momentum eigenbra (k l as follows: ( H0 - E ) I E , /, m ) = 0 .
(k i( H0 - E ) =
Multiplying (7.5.9) with
I E , /, m )
( 11;� ) 2
- E (k l .
(7 .5. 9)
on the right, we obtain ( 7 . 5.1 0)
This means that (kiE, /, m ) can be nonvanishing only if E = n 2ej2m; so we must be able to write g1E ( k ) as g1E( k ) = N8
(
)
11 2k 2 2m - E .
(7.5 .11)
397
7.5. Free-Particle States: Plane Waves Versus Spherical Waves
To determine obtain
N
we go back to our normalization convention (7.5.1). We
( E ' , ! ', m 'I E , I , m ) =
j d 3k "( E ', ! ', m 'lk")(k" I E , I , m )
( �:·2 - )
= jk" 2 dk" j d Q k" I N I 2o li X8 =
( �:· ) 2
li
- E Y/""(k") Y,m (k")
k"2 dE " f d fl
! dE "jdk " X
k, I N I 28
�:n'*(k") Yt(k")
= INI2
E'
(
/i 2k "2 2m
)(
- E' 8
� o ( E - E ') owomm''
m '
tz 2
k 2-E n
2m
(7.5.12)
li
where we have defined E " = li 2k" 2j2m to change k"-integration into £"-integration. Comparing this with (7.5.1), we see that N = lij vmk will suffice. Therefore, we can finally write (7 .5 .13)
hence (k i E , /, m ) =
li -./mk
8
( 1i2k ) 2m
2
- E �m (k) . A
(7.5 .14)
From (7.5.14) we infer that the plane-wave state lk) can be expressed as a superposition of free spherical-wave states with all possible /-values; in particular, lk) =
L L j dE l E , I , m ) ( E , I, m lk) I
m
(7 .5.15)
Because the transverse dimension of the plane wave is infinite, we expect that the plane wave must contain all possible values of impact parameter b (semiclassically, the impact parameter b "" IIi 1p ). From this point of view it is no surprise that the momentum eigenstates lk), when analyzed in terms of spherical-wave states, contain all possible values of I. We have derived the wave function for IE, I, m ) in momentum space. Next, we consider the corresponding wave function in position space.
)
Scattering Theory
398
From wave mechanics, the reader should be familiar with the fact that the wave function for a free spherical wave is j1(kr)Y,"'(r), where it(kr) is the spherical Bessel function of order I (see Appendix A). The second solution n 1( kr ) although it satisfies the appropriate differential equation, is inadmis sible because it is singular at the origin. So we can write (x[ E ,l, m) = ctf/kr)Yt(r). (7.5.16) To determine c1, all we have to do is compare ,
e i k• x
(x[k) = (2w )312 = L Lm j dE(x [ E , I, m)( E, I, m [k) � � j dEctf/kr)Yt(r) V�k 8( E - ��;�2 )�rn*(k) (21 + 1) (7.5.17) L 4 w P1(k·r) vmk c1Jt(kr), 1 I
=
A
=
A
��
�
0
where we have used the addition theorem
Lm �rn(r)�rn*(k) = [(21 + l ) j4w ] P, (k·r) in the last step. Now (x[k) = e ik ·x;(2w ) 312 can also be written as e ik -x 2( w ) 312 (2w)1 312 [(1 2 1 + 1)i)1( kr)P1(k·r), (7.5.18) which can be proved by using the following integral representation for J,(kr ) (7 .5. 1 9) j1( kr) = 21I f-+l1 e 'k rcosOp,(cos 0 ) d (cos 0 ) . Comparing (7.5.17) with (7.5.18), we have 1i 2mk c, - ll V • (7 5 .20) To summarize, we have (7.5.21a) (k[ E , I, m) = v�k 8( £ - ��2 ) �rn(k) (x[£, /, m) = � J 2:k jl(kr)�rn(r). (7 .5 .21b) :
·I
_
11"
.
These expressions are extremely useful in developing the partial-wave expansion discussed in the next section. We conclude this section by applying (7.5.21a) to a decay process. Suppose a parent particle of spin j disintegrates into two spin-zero particles A (spin j) --> B (spin 0)+ C (spin 0). The basic Hamiltonian responsible for
399
7.6. Method of Partial Waves
such a decay process is, in general, very complicated. However, we do know that angular momentum is conserved because the basic Hamiltonian must be rotationally invariant. So the momentum-space wave function for the final state must be of the form (7.5.21a), with I identified with the spin of the parent particle. This immediately enables us to compute the angular distribution of the decay product because the momentum-space wave func tion is nothing more than the probability amplitude for finding the decay product with relative momentum direction k. As a concrete example from nuclear physics, let us consider the 0 decay of an excited nucleus, Ne2 *: {7.5.22) Both 016 and He4 are known to be spinless particles. Suppose the magnetic quantum number of the parent nucleus is ± 1, relative to some direction Then the angular distribution of the decay product is proportional to I yl ± 1( e, > ) 1 2 = (3 /8'17 )sin28, where ( e, >) are the polar angles defining the relative direction k of the decay product. On the other hand, if the magnetic quantum number is 0 for a parent nucleus with spin 1, the decay angular distribution varies as I Y1°( 8, >) 12 (3 j4'1T )cos28. For a general spin orientation we obtain z.
=
L I
m =
-1
w { m ) l �: 1 1 2 .
(7.5.23)
For an unpolarized nucleus the various w ( m ) are all equal, and we obtain an isotropic distribution; this is not surprising because there is no preferred direction if the parent particle is unpolarized. For a higher spin object, the angular distribution of the decay is more involved; the higher the spin of the parent decaying system, the greater the complexity of the angular distribution of the decay products. Quite generally, through a study of the angular distribution of the decay products, it is possible to determine the spin of the parent nucleus. 7.6. METHOD OF PARTIAL WAVES
Partial-Wave Expansion
Let us now come back to the case V * 0. We assume that the potential is spherically symmetric, that is, invariant under rotations in three dimensions. It then follows that the transition operator T, which is given by (7.2.20), commutes with L2 and L. In other words, T is a scalar operator. It is now useful to use the spherical-wave basis because the Wigner Eckart theorem [see (3.10.38)], applied to a scalar operator, immediately
400
Scattering Theory
(7.6.1) In other words, T is diagonal both in I and in m; furthermore, the (nonvanishing) diagonal element depends on E and I but not on m. This leads to an enormous simplification, as we will see shortly. Let us now look at the scattering amplitude (7.2.19): ( £ ' , I ', m 'ITIE , I , m ) = 1f ( E ) 811,{jmm' ·
1 j (k ', k) = - 4 7T
�7 (27T ) 3 (k'ITik)
� 2m = - 4 7T fj2 (27T ) 3 L L L L j dE j dE '(k'I E 'I 'm ') X
I m I' m '
( E 'I 'm 'IT i Eim ) ( Eimlk)
� 2m Yt(k') >'tm *(k) = - 4 7T fj 2 (27T ) 3 !£_ L L T1 ( E ) mk I m E � h2k2j2 rn 4 2 L L Jf ( £ ) >'tm (k') >'tm *(k) . (7 .6.2)
:
I m
E - h2k2j2 m
To obtain the angular dependence of the scattering amplitude, let us choose the coordinate system in such a way that k, as usual, is in the positive z�direction. We then have [see (3.6.50)] 2 �: l >'tm (k) (7.6.3) =
J
{jm O '
where we have used P1 (1 ) = 1 ; hence only the terms m = 0 contribute. Taking 8 to be the angle between k' and k, we can write Y10 ('k ' ) = v(2T+T 4;-- P1 ( cos 8 ) .
(7 .6.4)
It is customary here to define the partial-wave amplitude ft(k) as follows: For (7.6.2) we then have
7TT1 ( E ) !I ( k ) = -- k
j(k', k) = j( 8 ) =
.
( 7 .6.5 )
L (2 1 + l )f1 ( k ) P1( cos 8 ) ,
(7 .6.6)
00
t� 0
where /( 8) still depends on k (or the incident energy) even though k is suppressed. To appreciate the physical significance of f1( k ), let us study the large-distance behavior of the wave function (x i >Jt < + l ) given by (7.1.33). Using the expansion of a plane wave in terms of spherical waves [(7.5.18)]
401
7.6. Method of Partial Waves
and noting that (Appendix A )
ei ( kr-( 1 � 1 2 )) e-i ( kr-(/TI/ 2 )) l a rge r Mkr) (7. 6. 7 ) 2ikr and that f(8) is given by (7.6.6), we have e k ' ] e + k z ' [ ' + � ) ) (x l >f( ) f(8 r (2'77 )3/2 [ L (21 + 1) P1 (cos 8) ( e'k' - e-1( kr-lw) ) 3 2 = 2 1kr (2'77 ) / 1 e k ' + L(21 + l)f1(k )P1 (cos 8) r ' ] e r e-l ( kr-l w k ) ' 1 ] 1 )] 1) (2 f1(k k 2i [t L + 2. t+ [ 31 r r . p1k (2'77 ) 2 1 (7. 6 . 8 ) 1
1
I
=
The physics of scattering is now clear. When the scatterer is absent, we can analyze the plane wave as the sum of a spherically outgoing wave behaving like and a spherically incoming wave behaving like for each /. The presence of the scatterer changes only the coefficient of the outgoing wave, as follows:
e ' k '/ r
- e -,( kr -lw );r
1
---.
1
k
+ 2 ikM ) .
The incoming wave is completely unaffected.
(7.6 .9)
Unitarity and Phase Shifts
We now examine the consequences of probability conservation, or unitarity. In a time-independent formulation, the ftux current density j must satisfy = o. v ·j =
-
ill�l2
(7.6 .10)
Let us now consider a spherical surface of very large radius. By Gauss's theorem, we must have
fsph ricalj • dS = O. e surface
(7.6 .11)
Physically and mean that there is no source or sink of particles. The outgoing ftux must equal the incoming ftux. Furthermore, because of angular-momentum conservation, this must hold for each partial
(7.6.10)
(7.6. 1 1)
Scattering Theory
402
wave separately. In other words, the coefficient of e ' k 'j r must be the same in magnitude as the coefficient of e ' k jr. Defining S1(k) to be s, ( k ) = l + 2ikf1 ( k ) , (7 .6 .12) this means [from (7.6.9)] that (7.6.13) that is, the most that can happen is a change in the phase of the outgoing wave. Equation (7.6.13) is known as the unitarity relation for the /th partial wave. In a more advanced treatment of scattering, S1( k ) can be regarded as the /th diagonal element of the S operator, which is required to be unitary as a consequence of probability conservation. We thus see that the only change in the wave function at a large distance as a result of scattering is to change the phase of the outgoing wave. Calling this phase 281 (the factor of 2 here is conventional), we can write S = e z,s, (7.6.14) with 81 real. It is understood here that 81 is a function of k even though we do not explicitly write 81 as 81(k). Returning to /1, we can write [from (7 .6.12)] (7.6.15) or, explicitly in terms of 81, 2 81 1 /, = e '812ik- 1 = e'6' sin = (7.6.16) k k cot 81 - ik ' whichever is convenient. For the full scattering amplitude we have -
I
f( 8 ) =
1�0 (21 + 1 )
'
'
( e 2;�; 1 ) P1 (cos 8 )
= ! IL� 0 ( 2 1 + 1)e '8'sin81P1(cos 8 )
( 7 .6.17)
with 81 real. This expression for /(8) rests on the twin principles of rotational invariance and probability conservation. In many books on wave mechanics, (7.6.17) is obtained by explicitly solving the Schrodinger equa tion with a real, spherically symmetric potential; our derivation of (7.6.17) may be of interest because it can be generalized to situations when the potential described in the context of nonrelativistic quantum mechanics may fail. The differential cross section dojdQ can be obtained [see (7.1.36)] by just taking the modulus squared of (7.6.17). To obtain the total cross
403
7.6. Method of Partial Waves
section we have (Jtot = /11( 8 ) i l d Q =
1
(cos ) d j (2/ 1)(2/' + 1) � k [" <j> + d 8 f. f. + X
0
-1
I
I'
e'8'sinli1e- '8''sin li1 , P1P1 , 4: L (21 1 )sin2li1• (7.6.18) + k I We can check the optical theorem (7.3.1), which we obtained earlier using a more general argument. All we need to do is note from (7 .6.17) that (2/ + 1)lm [ e' 8'sinli1 ] � Im 1( e = o) = " P1 ( cos e ) k I " (2/ + 1 ) sm u1, (7 .6.19) -- "-k I which is the same as (7.6.18) except for 4'1'1'jk. As a function of energy, 81 changes; hence /1( k ) changes also. The unitarity relation of (7.6.13) is a restriction on the manner in which 11 can vary. This can be most conveniently seen by drawing an Argand diagram for kl1• We plot k/1 in a complex plane, as shown in Figure 7.6, which is =
.
0
2�
0
Argand diagram for kf1. OP is the magnitude of k/1 • while CO and CP are
0
FIGURE 7.6.
�
each radii of length
�
on the unitary circle; angle OCP
�
2 81.
Scattering Theory
404
self-explanatory if we note from (7.6.16) that kj, =
� + � e - ( tw/2)+ 2 161_
(7.6.20) known as the unitary circle, on
Notice that there is a circle of radius ! , which k/1 must lie. We can see many important features from Figure 7.6. Suppose 81 is small. Then f1 must stay near the bottom of the circle. It may be positive or negative, but f1 is almost purely real: e'81 sin81 ( 1 + i81)8 1 81 (7.6.21) j, =
"" k "" k . is near TT/2, k/1 is almost purely imaginary, k
and the On the other hand, if 81 magnitude of kf1 is maximal. Under such a condition the /th partial wave may be in resonance, a concept to be discussed in some detail in Section 7.8. Note that the maximum partial cross section a�lx = 4TTA2(2 / + 1 ) (7 .6.22) is achieved [see (7.6.18)] when sin281 = 1. Connection with the Eikonal Approximation
The eikonal approximation discussed in 7.4 is valid at high energies (A « range R ); hence many partial waves contribute. We may regard I as a continuous variable. As an aside we note the semiclassical argument that I = bk (because angular momentum In = bp, where b is the impact parame ter and momentum p = Ilk). We take (7 .6.23) / max = kR ; then we make the following substitutions in expression (7.6.17): large I, small 8
8, -. !l ( b l l b � l/k > where Imax = kR implies that We have
(7.6.24)
e2;8, - 1 = e 2i!J. ( b) - 1 = 0
/( 8 ) -> k j db
10 ( /0 ) = J0 ( kb 0 ) ,
for I > / max ·
��: ( e M( b ) - 1 ) 10 ( kb8 )
= - ik dbbJ0 ( kb8 ) [ e M< b) - 1 ] .
j
( 7.6. 25)
(7.6.26) The computation of 81 can be done by using the explicit form for !l( b) given by (7.4.14) (see Problem 7 in this chapter).
405
7.6. Method of Partial Waves
Detennination of Phase Shifts
Let us now consider how we may actually determine the phase shifts given a potential V. We assume that V vanishes for r > R , R being the range of the potential. Outside (that is, for r > R ) the wave function must be that of a free spherical wave. This time, however, there is no reason to exclude n 1( r ) because the origin is excluded from our consideration. The wave function is therefore a linear combination of j1 ( kr ) P/ os 0 ) and n ( kr ) P (cos O ) or equivalently h ( Ilp and h(2)P where h O I and h ( 2 1 are the spherical Hankel functions defined by I
I
'
'
I
I
I
I'
. . these have the asymptotic behavior (see Appendix A) -h '( l } - ;, + m , ,
h /( ll
r
I argc
e r( kr - ( lw/2 )) ----,
ikr
h (l2 1
h (I2l
-
r
=
;·l
I argc
I
- in I '· _
c
I
(7 .6 .27)
e - ,U r - ( /w/2 11
ikr
( 7.6 .28 )
The full-w
1 ( 2 '1T )
1
312 L i (2l + l ) A 1 ( r ) P1 (cos 0 )
( r > R ) . (7.6 .29)
For r > R we have (for the radial-wave function)
A 1 = cj l 1h � 1 1 ( kr ) + cj 21h �2 1 ( kr ) ,
( 7 . 6 .30)
where the coefficient that multiplies A1 in (7.6.29) is chosen so that, for V = O, A1( r ) coincides with j1( kr ) everywhere. Using (7.6.28), we can compare the behavior of the wave function for large r given by (7.6.29) and (7.6.30) with ( 7 .6 .31)
Clearly, we must have CI(21 = l2
(7.6 .32)
•
So the radial-wave function for r > R is now written as (7 .6.33)
Using this, we can evaluate the logarithmic derivative at r outside the range of the potential-as follows: r dA1
( (3, =
[
A 1 dr
= kR
)
=
R -that is, just
r� R
J with respect to
j/( kR )cos 81 - n ; ( kR )sin 81 j1 ( kR )cos 81 - n 1 ( kR )sin 81 '
where }/( kR ) stands for the derivative of }1
(7 .6 .34) kr evaluated
at
406
Scattering Theory
knowing the logarithmic derivative at krobtainkR.theConversely, phase shift as follows: =
R,
we can (7 .6.35)
The problem of determining the phase shift is thus reduced to that of obtaining {31• We now look at the solution to the Schrodinger equation for < R -that is, inside the range of the potential. For a spherically symmetric potential, we can solve the Schrooinger equation in three dimensions by looking at the equivalent one-dimensional equation
r
d2u1 + ( k 2_ 2m v_ l{l + l ) ) dr 2
where
r
n2
2
(7 .6.36)
u, = O'
(7 .6.37)
subject to the boundary condition (7.6.38)
We integrate this one-dimensional Schrodinger equation-if necessary, numerically-up to = R, starting at = 0. In this way we obtain the logarithmic derivative at R . By continuity we must be able to match the logarithmic derivative for the inside and outside solutions at R :
r
r
r
.B, I
inside solution
=
=
(7.6 .39)
/3/ l outside solution'
where the left-hand side is obtained by integrating the Schrodinger equation up to = R , while the right-hand side is expressible in terms of the phase shifts that characterize the large-distance behavior of the wave function. This means that the phase shifts are obtained simply by substituting {31 for the inside solution into tan81 [(7.6.35)]. For an alternative approach it is possible to derive an integral equation for from which we can obtain phase shifts (see Problem 8 of this chapter).
r
A1(r),
Hard-Sphere Scattering
Let us work out a specific example. We consider scattering by a hard, or rigid, sphere for < R V = { ; for (7 .6.40) > R. In this problem we need not even evaluate /31 (which is actually oo). All we need to know is that the wave function must vanish at R because the
rr
r
=
7.6. Method of Partial Waves
407
sphere is impenetrable. Therefore, (7.6.41) or, from (7.6.33),
j1( kR ) cos 81 - n ( kR ) sin81 = 0 1
or
=
j1( kR ) tano, n,(kR ).
(7 .6.42) (7.6.43)
Thus the phase shifts are now known for any /. Notice that no approxima tions have been made so far. To appreciate the physical significance of the phase shifts, let us consider the l = 0 case (S-wave scattering) specifically. Equation (7.6.43) becomes, for I 0, kRjkR = - tan kR, tan 80 = _sin (7 .6.44) cos kR I kR =
or 80 - kR. The radial-wave function (7.6.33) with e'60 omitted varies as =
Therefore, if we plot rA 1�0(r) as a function of distance r, we obtain a sinusoidal wave, which is shifted when compared to the free sinusoidal wave by amount R; see Figure 7.7.
R=-6ol k
'
r ........
... ...
..... .... _ _
FIGURE 7.7. Plot o f rA1_0(r) versus r (with the e'60 factor removed). The dashed curve for V � 0 behaves like sin kr, while the solid curve is for S·wave hard·sphere scattering, shifted by R 80 /k from the case V 0. �
-
�
Scattering Theory
408
Let us now study the low and high energy limits of tan81. Low energy means kR small, kR 1 . We can then use (see Appendix A) «
r) I ( k . kr ) Jt ( - (2/ + 1 ) ! ! n ( kr ) "" - (2/ - I1+) !1! ( kr )
(7 .6.46)
I
to obtain
tan 81
( kR ) 2 t + l { (21 + 1 ) [ (21 - l ) ! !P } _
=
(7 .6.47)
-'-----'-----
-
It is therefore all right to ignore 81 with I * 0. In other words, we have S-wave scattering only, which is actually expected for almost any finite-range potential at low energy. Because 80 = - kR regardless of whether k is large or small, we obtain da d fJ
(7.6.48)
It is interesting that the total cross section, given by
fd (7.6.49) (Jtot = d fJa d •• = 4 7TR 2 ' is four times the geometric cross section 7TR 2. By geometric cross section we mean the area of the disc of radius R that blocks the propagation of the "
plane wave (and has the same cross section area as that of a hard sphere). Low-energy scattering, of course, means a very large wavelength scattering, and we do not necessarily expect a classically reasonable result. One might conjecture that the geometric cross section is reasonable to expect for high-energy scattering because at high energies the situation might look similar to the semiclassical situation. At high energies many /-values contribute, up to /max "" kR, a reasonable assumption. The total cross section is therefore given by (7 .6.50) But using (7.6.43), we have 1 ( j1( kR W 2 sin 77 kR 2 ' [ )1 ( kR ) ] 2 + [ n 1 ( kR ) ] 2 ""
(
)
(7.6.51)
409
7.6. Method of Partial Waves
where we have used
: (
n 1 ( kr ) - - r cos kr
;).
1 -
(7 .6.52)
We see that 81 decreases by 90° each time I increases by one unit. Thus, for an adjacent pair of partial waves, sin281 + sin281+1 = sin281 + sin2 (81 - TT/2) = sin281 + cos 281 = 1, and with so many /-values contributing to (7.6.50), it is legitimate to replace sin281 by its average value, t. The number of terms in the /-sum is roughly kR , as is the average of 2 1 + 1. Putting all the ingredients together, (7.6.50) becomes 47T ( 7 .6 .53) o101 = f:i ( kR ) 2 2.1 = 21rR , 2
which is not the geometric cross section 1rR 2 either! To see the origin of the factor of 2, we may split (7.6.17) into two parts: 1 ' L (21 + 1 ) PJcos O ) L (21 + 1 ) e 2 8 P, ( cos 0 ) + /( 0 ) = 2ik 1 � 0 2k 1 � 0 (7 .6 .54) = /reflection + /shadow AR
1
.
'
kR
·
In evaluating /lf e I 2 d!J, the orthogonality of the P1(cos O)'s ensures that there is no interference amongst contributions from different /, and we obtain the sum of the square of partial-wave contributions: r
H
7T f �a:x
TTR 1
(7 .6.55)
k2 Turning our attention to fshad• we note that it is pure imaginary. It is particularly strong in the forward direction because P1(cos 0 ) = 1 for fJ = Q, and the contributions from various /-values all add up coherently-that is, with the same phase, pure imaginary and positive in our case. We can use the small-angle approximation for P1 [see (7.6.24)] to obtain = -- =
"" ik
f bdbJ0 ( kb O ) u
=
iRJ1 ( kRfJ ) e
( 7 .6.56)
But this is just the formula for Fraunhofer diffraction in optics with a strong
Scattering Theory
410
peaking near (J ::: 0. Letting � = kR(J and dU� = d(J/8, we can evaluate
f1 /shadl
2
dQ = 2-lr
f
+1
-1
= 2wR2
R 2 [ 11 ( kR(J) ] 2
{ "'
lo
(j
2
[ 11(0] 2 d � �
d(cos 8 )
Finally, the interference between /shad and freft vanishes: Re( /shad frefl ) '"' 0 because the phase of freft oscillates (281 + 1 = 281 ing to zero, while /shad is pure imaginary. Thus
(7 .6.57) (7 .6.58)
w), approximately averag (7 .6.59)
The second term (coherent contribution in the forward direction) is called a shadow because for hard-sphere scattering at high energies, waves with impact parameter less than R must be deflected. So, just behind the scatterer there must be zero probability for finding the particle and a shadow must be created. In terms of wave mechanics, this shadow is due to destructive interference between the original wave (which would be there even if the scatterer were absent) and the newly scattered wave. Thus we need scattering in order to create a shadow. That this shadow amplitude must be pure imaginary may be seen by recalling from (7.6.8) that the coefficient of eikr/2ikr for the /th partial wave behaves like 1 + 2ikf1( k ), where the 1 would be present even without the scatterer; hence there must be a positive imaginary term in f1 to get cancellation. In fact, this gives a physical interpretation of the optical theorem, which can be checked ex plicitly. First note that 4w 4w [ = im k f (O) k im fshad (O) ]
(7 .6.60) because Im [ fre (O)] averages to zero due to oscillating phase. Using (7.6.54), we obtain 4 · kR 4 1 ; Im f ad ( O) = ; Im 2 k 1�0 (2 / + 1 ) P1 (1) = 2 1TR 2 (7.6 .61 )
[
ft
sh
which is indeed equal to
o,ot·
]
7.7. LOW-ENERGY SCATIERING AND BOUND STATES
At low energies-or, more precisely, when A = 1/ k is comparable to or larger than the range R -partial waves for higher I are, in general, unim portant. This point may be obvious classically because the particle cannot
7.7. Low-Energy Scattering and Bound States
411
penetrate the centrifugal barrier; as a result the potential inside has no effect. In terms of quantum mechanics, the effective potential for the /th partial wave is given by 11.2 1(/ + 1) (7.7.1) Verr = V(r ) + 2m r2 ; unless the potential is strong enough to accommodate I * 0 bound states near E 0, the behavior of the radial-wave function is largely determined by the centrifugal barrier term, which means that it must resemble j1(kr ). More quantitatively, it is possible to estimate the behavior of the phase shift using the integral equation for the partial wave (see Problem 8 of this chapter): e •sm u1 2m joo . ( kr ) V( r ) A,(r)r 2 dr. (7.7.2) k = h2 0 JJ If A1(r) is not too different from j1(kr) and 1/k is much larger than the 1 2 range of the potential, the right-hand side would vary as k ; for small 8 1, the left-hand side must vary as 81/ k. Hence, the phase shift k goes to zero as 8, - k 2/+ l (7.7 .3) for small k. This is known as threshold behavior. It is therefore clear that at low energies with a finite range potential, S-wave scattering is important. ""
.s
.
�
Rectangular Well or Barrier
To be specific let us consider S-wave scattering by repulsive V = { :0 = constant for r < R attractive (7.7.4) otherwise Many of the features we obtain here are common to more-complicated finite range potentials. We have already seen that the outside-wave function [see (7.6.33) and (7.6.45)) must behave like ei"'sin(kr+ 80) . . e'80[J0(kr) cos &0 - n0(kr)sm &0] (7.7.5) ·
=
kr
.
The inside solution can also easily be obtained for V0 a constant: (7.7.6) sin k 'r, u = rA1_ 0 ( r ) with k' determined by (7 .7 .7) where we have used the boundary condition u = 0 at r = 0. In other words, a:
Scattering Theory
412
the inside wave function is also sinusoidal as long as E > V0 . The curvature of the sinusoidal wave is different than in the free-particle case; as a result the wave function can be pushed in ( 80 > 0) or pulled out ( 80 < 0) depend ing on whether V0 < 0 (attractive) or V0 > 0 (repulsive), as shown in Figure 7.8. Notice also that (7.7.6) and (7.7.7) hold even if V0 > E, provided we understand sin to mean sinh-that is, the wave function behaves like ( ) sinh[ ] (7.7.6' ) where u
r
ex:
Kr
,
(7.7.7 ') u(r)
0
I I
,
,
,
""" - - .... ...
'
No potential
I
I
/ /.,.,.
I
', I \I
'
I I
_ _ _ ...
;"
... (a)
bigger
,-
- - ....
r
.....
r
(b)
Vo
u(r) /
6ol k V o >O
0
I
R
/
sinh function if Vo>E
"
I I I
, - - - .....
.. ...
(c)
r
FIGURE 7.8 Plot of u( r) versus r. (a) For V 0 (dashed line). (b) For V0 < 0, &0 > 0 with the wave function (solid line) pushed in. (c) For V0 > 0, 110 < 0 with the wave function (solid line) pulled out. =
7.7. Low-Energy Scattering and Bound States
413
We now concentrate on the attractive case and imagine that the magnitude of V0 is increased. Increased attraction will result in a wave function with a larger curvature. Suppose the attraction is such that the interval [0, R] just accommodates one-fourth cycle of the sinusoidal wave. Working in the low energy kR«-1 limit, the phase shift is now 1>0 = 7r/2, and this results in a maximal S-wave cross section for a given k because sin2&0 is unity. Now increase the well depth V0 even further. Eventually the attraction is so strong that one-half cycle of the sinusoidal wave can be fitted within the range of the potential. The phase shift 1>0 is now 1r ; in other words, the wave function outside R is 180° out of phase compared to the free-particle-wave function. What is remarkable is that the partial cross section vanishes ( sin21>0 = 0), (7 .7 .8) a1 � o = 0 , despite the very strong attraction of the potential. In addition, if the energy is low enough for l � 0 waves still to be unimportant, we then have an almost-perfect transmission of the incident wave. This kind of situation, known as the Ramsauer-Townsend effect, is actually observed experimen tally for scattering of electrons by such rare gases as argon, krypton, and xenon. This effect was first observed in 1923 prior to the birth of wave mechanics and was considered to be a great mystery. Note the typical parameters here are R - 2 x 10- 8 em for electron kinetic energy of order 0. 1eV, leading to kR - 0 .324. Zero-Energy Scattering and Bound States Let us consider scattering at extremely low energies (k 0). For r > R and for I = 0, the outside radial-wave function satisfies d2 u (7.7.9) - = 0. dr2 The obvious solution to (7.7.9) is u ( r ) = constant ( r - a ) , (7 .7.10) just a straight line! This can be understood as an infinitely long wavelength limit of the usual expression for the outside-wave function (see (7.6.37) and (7.6.45)), =
] ) , k
[(
lim sin( kr + S0 ) = lim sin k r + So
k-o
k-o
[ ( ') ]
which looks like (7.7.10). We have u' u
k -o
(7.7 . 1 1 )
(7.7.12) k r-a Setting r = 0 [even though at r = 0, (7.7.10) is not the true wave function], we obtain k-o 1 (7.7.13) lim k cot 80 -
=
k cot
k-o
k r+
80
-
--+
--+
a
1
-- .
Scattering Theory
414
The quantity a is known as the scattering length. The limit of the total cross section as k 0 is given by (see (7.6.16)) ......
" (7.7.14) .k = 4 �a 2 . cot Even though a has the same dimension as the range of the potential R , a and R can differ by orders of magnitude. In particular, for an attractive potential, it is possible for the magnitude of the scattering length to be far greater than the range of the potential. To see the physical meaning of a, we note that a is nothing more than the intercept of the outside-wave function. For a repulsive potential, a > 0 and is roughly of order of R , as seen in Figure 7.9a. However, for an attractive potential, the intercept is on the negative side (Figure 7.9b). If we increase the attraction, the outside-wave function can again cross the r-axis on the positive side (Figure 7.9c). The sign change resulting from increased attraction is related to the development of a bound state. To see this point quantitatively, we note from Figure 7.9c that for a very large and positive, the wave function is essentially flat for r > R . But (7.7.10) with a very large is not too different from e - •r with essentially zero. Now e - "' with 0 is just a bound state-wave function for r > R with energy E infinitesimally negative. The inside-wave function ( r < R ) for the E = 0 + case (scattering with zero kinetic energy) and the E = 0 - case (bound state with infinitesimally small binding energy) are essentially the same because in both cases k ' in sin k 'r [(7.7.6)] is determined by li 2 k ,2 -y,;; = E - V "" V I (7 .7 .15)
rrtot =
ll·m
k-o
1
k
� u0 - I
2
K
K ""
o
i o
with E infinitesimal (positive or negative). Because the inside-wave functions are the same for the two physical situations ( E 0 + and E 0 - ), we can equate the logarithmic derivative of the bound-state-wave function with that of the solution involving zero kinetic-energy scattering, =
=
-
or, if R « a,
K e - "' e � Kr
The binding energy satisfies
I
r
=
R
=
(
1 r-a
K "' -1 . a
)I
r =
R
'
(7 .7 . 1 6) (7.7.17) (7.7.18)
415
7. 7. Low-Energy Scattering and Bound States
(a)
(b)
R a
r
(c)
0
FIGURE 7.9.
Plot of
a>O
v
u ( r)
R a
versus
deeper attraction. The intercept shown for each of three cases.
a
I I
I I
.
r
-
(b) attractive potential, and (c) of the zero-energy outside-wave function with the r-axis is r
for (a) repulsive potential,
and we have a relation between scattering length and bound-state energy. This is a remarkable result. To wit, if there is a loosely bound state, we can infer its binding energy by performing scattering experiments near zero kinetic energy, provided is measured to be large compared with the range R of the potential. This connection between the scattering length and the bound-state energy was first pointed out by Wigner, who attempted to apply (7.7.18) to np-scattering. Experimentally, the 351-state of the np-system has a bound state, that is, the deuteron with a
EsE = 2.22 MeV.
(7.7. 19)
Scattering Theory
416
The scattering length is measured to be
a triplet = 5 .4 X 10 - !3 em,
(7.7.20)
leading to the binding-energy prediction
= (938 MeV)
(
2"1 X 1 0 - 14 em 5 .4 x l0 - 1 3 em
)2
=
1 .4 MeV
(7.7.21)
where J..l. is the reduced mass approximated by mn.p/2. The agreement between (7.7.19) and (7.7.21) is not too satisfactory. The discrepancy is due to the fact that the inside-wave functions are not exactly the same and that triplet » R is not really such a good approximation for the deuteron. A better result can be obtained by keeping the next term in the expansion of k cot 8 as a function of k, a
k cot 80 =
-
2
1 1 rk a+2 0 ,
(7 .7 .22)
where r0 is known as the effective range (see, for example, Preston 1 962, 23). Bound States as Poles of S1 ( k)
We conclude this section by studying the analytic properties of the amplitude S1( k ) for I = 0. Let us go back to (7.6.8) and (7.6.12), where the radial wave function for I = 0 at large distance was found to be proportional to (7.7.23)
Compare this with the wave function for a bound state at large distance, e - Kr r
(7.7 .24)
The existence of a bound state implies that a nontrivial solution to the Schrodinger equation with E < 0 exists only for a particular (discrete) value of We may argue that e - "';r is like e'k'/r, except that k is now purely imaginary. Apart from k being imaginary, the important difference between (7.7.23) and (7.7.24) is that in the bound-state case, e- "'/r is present even without the analogue of the incident wave. Quite generally only the ratio of the coefficient of e'k'/r to that of e- ikr;r is of physical interest, and this is given by S1( k ). In the bound-state case we can sustain the outgoing wave (with imaginary k ) even without an incident wave. So the ratio is which means that S1_ 0 ( k ), regarded as a function of a complex variable k, has a K.
oo ,
417
7.7. Low-Energy Scattering and Bound States
lm k
+ X
pole
__/
l·
k>O (real)
Region of physical scattering Re k
FIGURE 7.10. The complex k-plane with bound-state pole at k
=
+ iK.
pole at k = iK. Thus a bound state implies a pole (which can be shown to be a simple pole) on the positive imaginary axis of the complex k-plane; see Figure 7 . 10 . For k real and positive, we have the region of physical scattering. Here we must require [compare with (7.6. 14)]
(7.7.25) with &0 real. Furthermore, as k -+ 0, k cot &0 has a limiting value - 1/a [(7.7. 13)), which is finite, so &0 must behave as follows: &o -+ 0, ± 'IT, . . . . (7.7.26)
Hence
S1�o
= e2;80 -+ 1 as k -+ 0.
Now let us attempt to construct a simple function satisfying: 1 . Pole at k iK (existence of bound state). 2 . ! St �ol = 1 for k > 0 real (unitarity). 3 . S1�o = 1 at k = 0 (threshold behavior). =
(7.7.27)
The simplext function that satisfies all three conditions of (7.7.27) is
- k - iK St= o(k) = . . k - lK
(7.7.28)
[Editor's Note: Equation (7.7.28) is chosen for simplicity rather than as a physically realistic example. For reasonable potentials (not hard spheres!) the phase shift vanishes as k -+ oo.J
An assumption implicit in choosing this form is that there is no other singularity that is important apart from the bound-state pole . We can then use (7.6. 15) to obtain, for ft =o(k), St=O - 1 1 (7.7.29) ft =O = 2ik Comparing this with (7.6. 16),
ft=O
=
1
k cot uo - l"k' <:
(7.7.30)
Scattering Theory
418
we see that
lim k cot 80 - _!_a = =
k-o
(7 .7 . 31 )
K,
precisely the relation between bound state and scattering length [(7.7.17)]. It thus appears that by exploiting unitarity and analyticity of S1( k) in the k-plane, we may obtain the kind of information that can be secured by solving the Schrodinger equation explicitly. This kind of technique can be very useful in problems where the details of the potential are not known. 7.8. RESONANCE SCATIERING
In atomic, nuclear, and particle physics, we often encounter a situation where the scattering cross section for a given partial wave exhibits a pronounced peak. This section is concerned with the dynamics of such a resonance.
We continue to consider a finite-ranged potential V(r). The effective potential appropriate for the radial wave function of the lth partial wave is V(r ) plus the centrifugal barrier term as given by (7.7.1 ). Suppose V(r) itself is attractive. Because the second term, ..!'.!_ 1( 1 + 1) 2m ,2
is repulsive, we have a situation where the effective potential has an attractive well followed by a repulsive barrier at larger distances, as shown in Figure 7 .11. Suppose the barrier were infinitely high. It would then be possible for particles to be trapped inside, which is another way of saying that we expect bound states, with energy E > 0. They are genuine bound states in the sense that they are eigenstates of the Hamiltonian with definite values of E. In other words, they are stationary states with infinite lifetime. v.,. (or V for £=0)
Quasi-bound state
(
barrier
r
2 2 V.cc � V( r ) + ( li j2m)[/(l + l)/r ] versus r. For I * 0 2 2 ( li /2 m )[ /(I + 1)/ r ] ; for I � 0 barrier must be due to V itself.
FIGURE 7.1 1 .
to
the barrier can be due
7.8. Resonance Scattering
419
In the more realistic case of a finite barrier, the particle can be trapped inside, but it cannot be trapped forever. Such a trapped state has a finite lifetime due to quantum-mechanical tunneling. In other words, a particle leaks through the barrier to the outside region. Let us call such a state quasi-bound state because it would be an honest bound state if the barrier were infinitely high. The corresponding scattering phase shift o1 rises through the value Tr/2 as the incident energy rises through that of the quasi-bound state, and at the same time the corresponding partial-wave cross section passes through its maximum possible value 4Tr(2/ + 1)/k2 . [Editor's Note: Such a sharp rise in the phase shift is, in the time-dependent Schrodinger equa tion, associated with a delay of the emergence of the trapped particles, rather than an unphysical advance, as would be the case for a sharp decrease through Tr/2.] It is instructive to verify this point with explicit calculations for some known potential. The result of a numerical calculation shows that a res onance behavior is in fact possible for l ¥- 0 with a spherical-well potential. '
\
287T
!?
\ /-
ch(k) 1T
\
\
\
'
for £=3
kres
k
(a)
- - - - - - - - :;;..::=--I
_
1
(47r/k �., ) x 7 � 287T/k;" FIGURE 7.12.
2 m V,1 R2jh2
�
Plots of (a)
5.5.
/R
o1 � 3
kres versus
_
...1... I I I I I
_
_
_
_
_
k
(b)
k , where at resonance o3 ( A "J
�
'TT/2 and
o1� 3
�
and (b) 83 ( k ) versus k . The curves are for a spherical well with
Scattering Theory
420
To be specific we show the results for a spherical well with 2m V0R 2j h 2 = 5.5 and I = 3 in Figure 7.12. The phase shift (Figure 7.12b), which is small at extremely low energies, starts increasing rapidly past k = 1/R , and goes through /2 around k = 1.3 IR. Another very instructive example is provided by a repulsive o-shell potential that is exactly soluble (see Problem 9 in this chapter): 1r
2m 2 V( r ) = yo ( r - R ) .
(7.8.1)
h
Here resonances are possible for I = 0 because the o-shell potential itself can trap the particle in the region 0 < r < R . For the case y we expect a series of bound states in the region r < R with kR = TT , 2 1T , 0 0 0 ; (7 .8.2) this is because the radial wave function for I = 0 must vanish not only at r = 0 but also at r = R - in this case. For the region r > R , we simply have hard-sphere scattering with the S-wave phase shift, given by 80 = - kR . (7 .8.3) W i th y = there is no connection between the two problems because the wall at r = R cannot be penetrated. The situation is more interesting with a finite barrier, as we can show explicitly. The scattering phase shift exhibits a resonance behavior whenever (7.8 .4) £ incident E Moreover, the larger the y, the sharper the resonance peak. However, away from the resonance o0 looks very much like the hard-sphere phase shift. Thus we have a situation in which a resonance behavior is superimposed on a smoothly behaving background scattering. This serves as a model for neutron-nucleus scattering, where a series of sharp resonance peaks are observed on top of a smoothly varying cross section. = oo ,
oo ,
==
quasi-bound state ·
Coming back to our general discussion of resonance scattering, we
ask how the scattering amplitudes vary in the vicinity of the resonance energy. If we are to have any connection between being large and the quasi-bound states, o1 must go through rr/2 (or 3rr/2, . . . ) from below, as discussed above. In other words cot o1 must go through zero from above. Assuming that cot &1 is smoothly varying near the vicinity of resonance, that is, (7.8.5) E E,, we may attempt to expand cot 81 as follows: cot 81 = cot 81 1 £ � c ( E - E, ) + 0 [ ( E - EY] . (7 .8.6) a1
==
0
E, -
421
7.9. Identical Particles and Scattering
This leads to
1
.!._
1
l, ( k ) = k cot 81 - 1k = k [ c E - E, ) - i ] - ( .
f/2 where we have defined the width r by d (cot 8 ) dE
1
1
(7.8.7)
f
(7.8.8)
= - c = - 3_
Notice that r is very small if cot 81 varies rapidly. If a simple resonance dominates the /th partial-wave cross section, we obtain a one-level reso nance formula (the Breit-Wigner formula): E � E,
(7 .8.9) So it is legitimate to regard r as the full width at half-maximum, provided the resonance is reasonably narrow so that variation in 11 k 2 can be ignored. 7.9. IDENTICAL PARTICLES AND SCATTERING
As an example to illustrate the phase of the scattering amplitude, let us consider the scattering of two identical spinless charged particles via the Coulomb potential (which we discuss further in Section 7.13). The space wave function must now be symmetric, so the asymptotic wave function must look like e ik - x
e - rk -x
+ [ I ( 8 ) + I( 'TT - 8 ) ] � , rkr
(7.9.1) where = is the relative position vector between the two particles 1 and 2. This results in a differential cross section, x
+
r
x1 - x 2
dcr fJ d
= 11(8 ) + I( 'TT - 8 ) 1 2
= I I ( 8 ) 1 2 + I I ( 'TT - o) 1 2 + 2 Re [ I ( 8 ) I * ( 'TT - 8 ) ] .
(7.9.2) The cross section is enhanced through constructive interference at 8 "' 'TT/2. In contrast, for spin ! - spin ! scattering with unpolarized beam and V independent of spin, we have the spin-singlet scattering going with space-symmetrical function and the spin triplet going with space-antisym metrical wave function (see Section 6.3). If the initial beam is unpolarized,
422
Scattering Theory
we have the statistical contribution t for spin singlet and � for spin triplet; hence da 1 3 = f( 8 ) + - 8 ) 1 2 + 1!( 8 ) - j( 'IT - 8 ) 1 2 dQ
41
1('1T
4
= I I ( 8 ) 1 2 + I I ( 'IT - 8 ) 1 2 - Re [ I ( 8 ) I * ( 'IT - 8 ) ] .
In other words, we expect destructive interference at fact, been observed.
8
=
(7.9.3) j2. This has, in
'IT
7.10. SYMMETRY CONSIDERATIONS IN SCATTERING
Suppose V and H0 are both invariant under some symmetry operation. We may ask what this implies for the matrix element of T or for the scattering amplitude f(k ', k). If the symmetry operator is unitary (for example, rotation and parity), everything is quite straightforward. Using the explicit form of T as given by (7.2.20), we see that u vu t = v (7.10.1) implies that T is also invariant under U-that is, urut = T. (7 .10.2) We define (7.10.3) lk ') U lk ') . lk) = Ulk) , Then =
(k'ITik) = (k ' lu turutulk) = (k'ITik) .
(7 .10.4) As an example, we consider the specific case where U stands for the parity operator (7 .10.5) 'IT I - k) = lk) . 'IT ik) = 1 - k) , Thus invariance of H0 and V under parity would mean (7.10.6) ( - k'I T I - k) = (k'ITik) . Pictorially, we have the situation illustrated in Figure 7 .13a. We exploited the consequence of angular-momentum conservation when we developed the method of partial waves. The fact that T is diagonal in the I Elm ) representation is a direct consequence of T being invariant under rotation. Notice also that (k'ITik) depends only on the relative orientation of k and k', as depicted in Figure 7.13b. When the symmetry operation is antiunitary (as in time reversal), we must be more careful. First, we note that the requirement that V as well as
423
7.10. Symmetry Considerations in Scattering
--�
k
/ k'
/ _k ' _____..A rotated k
-k
...1---
/_ -k '
Equality of
H0
T
/
/
/
-
(a)
FIGURE 7.13.
k '/
k
(b)
(a) Equality of
T
matrix elements between k .... k' and
- k .... - k'. (b)
matrix elements under rotation.
be invariant under time reversal invariance requires that ere - 1 = rt .
This is because the antiunitary operator changes 1 1 into E - H0 + iE E-H
0 - iE
(7.10.7) (7.10.8)
in (7.2.20). We also recall that for an antiunitary operator [see (4.4.11)], (7 .1 0.9) < .B i a) = < ii i ,B ) , where (7 .10.1 0) lii ) = eia) and i .B ) = ei .B ) . Let us consider (7.10.11) Ia) = T ik ) , <.B I = (k 'l : then Ia> = erik> = ere-1E>Ik> = rt l - k>
(7.10.12)
1� > = elk> = 1 - k'> .
As a result (7.10.9) becomes
(7.10.13) Notice that the initial and final momenta are interchanged, in addition to the fact that the directions of the momenta have been reversed. It is also interesting to combine the requirements of time reversal [(7.10.13)] and parity [(7.10.6)]: ( k'I T ik ) = < - kiTI - k ') .
under 8
=
< - k i TI - k')
that is, from (7 .2.19) and (7 .2.22) we have
under
=
/(k , k ') = f(k ', k) ,
'71'
(7.10.14) (7 .10.15)
Scattering Theory
424
which results in
da (k dQ
�
k') =
da (k' � k) . dQ
(7.10.16)
Equation (7.10.16) is known as detailed balance. It is more interesting to look at the analogue of (7.10.14) when we have spin. Here we may characterize the initial free-particle ket by lk,m5), and we exploit (4.4. 79) for the time-reversal portion:
= =
< - k, - m51 TI - k', - m'5) i-Zm,+ 2m,.
i-2m,+Zm,.
'
s
'
s
.
For unpolarized initial states, we sum over the initial spin states and divide by (2s + 1); if the final polarization is not observed, we must sum over final states. We then obtain detailed balance in the form
da (k dQ
�
k ')
=
da (k ' drl
�
k)
•
(7 .10.18)
where we understand the bar on the top of dajdrl in (7.10.18) to mean that we average over the initial spin states and sum over the final spin states.
7.1 1. TIME-DEPENDENT FORMULATION OF SCATTERING Our discussion of scattering so far has been based on the time-independent formulation. It is also possible to develop a formalism of scatto;:ring based on the time-dependent Schrodinger equation. We show that this formalism leads to the same Lippmann-Schwinger equation, (7.1.6). In the time-dependent formulation we conceive of a scattering pro cess as a change in the state ket from a free-particle ket to a state ket influenced by the presence of the potential V. The basic equation of motion IS
(7.11.1)
where I .V; t ) is the time-dependent Schrodinger ket in the presence of V. The boundary condition appropriate for the scattering problem is that in the remote past ( t - oo ), the particle was free. This requirement is automatically accomplished if we turn on the potential adiabatically-that is, very slowly-as in Section 5.8: �
(7 .11.2) Just as the partial differential equation with an inhomogeneous term is solved by introducing a Green's function [see (7.1.18)], the operator
7. 1 1 .
Time-Dependent Formulation of Scattering
425
Schrodinger equation (7. 1 1 .1 ) is solved by introducing the Green's operator G + ( t, t ') satisfying ( 7 . 1 1 .3)
The causality requirement we impose on G + is that the interaction of a particle at t ' has an effect only for t > t'. We therefore impose the retarded boundary condition ( 7 . 1 1 .4) G+ (t, t') = O for t < t '. We claim that the solution to (7.1 1 .3) and (7. 1 1 .4) is
( 7 . 1 1 .5) G ( t ' t ' ) = - � (} ( t - t ' ) e - ( I - ' ' )/ h . For t > t' , a constant times e - iHo(t- t· )th clearly satisfies the differential equation (7. 1 1 .3) because the &-function on the right-hand side is inoper ative. For t < t', we obviously have the desired condition (7. 1 1 .4) because G (t, t') here is identically zero. At t = t', we note that there is an extra contribution due to the discontinuity of the e-function that just balances the right-hand side of (7. 1 1 .3): 'Ho
+
+
in
where
%r [- � B(t - t') ]
=
o(r - r').
( 7 . 1 1 .6)
The solution to the full problem can now be written as follows: > ( 7 . 1 1 .7) I "' < + ; t) = I > ; t) + f + oo G ( t, t ' ) v I "'< + > ; t ') dt ', - oo
+
( 7 . 1 1 .8)
and the upper limit of the integration in (7. 1 1 .7) may as well be t because G vanishes for t ' > t. As t -+ - I "'< + >; t) coincides with I
+
oo ,
=
IS
oo
Scattering Theory
426
switched on adiabatically according to (7 .11. 7), evaluated at t 0, now yield
= I t( +)) = I.P) - .!..h r -
oc
(7.11.2).
Equations
dt ' e' Ho t'l he - t Et'lh V I t ( + ) > ·
(7.11.5)
and
(7 .11 .10)
The integral may appear to oscillate indefinitely. However, we recall that V is really to be understood as Ve�'. As a result, the time integration is straightforward :
(7.1 1 .11) this is just the Lippmann-Schwinger equation (7.1.6).
But as t " -4 - oo , The reader should note carefully how the if prescription appears in the two formalisms. Earlier, using the time-independent formalism we saw that the choice of positive sign of the if-term (see Section 7.1) corresponds to the statement that the scatterer affects only outgoing spherical waves. In the /-dependent formulation presented here, the presence of ie ( e = T/h) in (7 .11.11) arises from the requirement that the particle was free in the remote past. One may argue that the slow switch-on of the potential (7 .11.2) on which we have relied is somewhat artificial. But let us suppose that we are using a wave-packet formalism to describe scattering. When the wave packet is well outside the range of the potential, it does not matter whether the potential is zero or finite. In particular, V may be zero in the remote past, which then leads to no difficulty.
Connection With Time-Dependent Perturbation Theory Inasmuch as scattering can be discussed using the time-dependent formulation, which is based on the time-dependent Schrodinger equation (7 .11.1 ), we should be able to apply an approximation scheme based on (7.11.1). In particular we should be able to deploy the methods of time dependent perturbation theory developed earlier in Section 5.6 to the scattering problem-provided, of course, the potential can be regarded as weak in some sense. First we will show how the golden rule can be applied to compute dajdrl, leading to the results of the Born approximation of Section 7.2. We assume in the remote past that the state ket is represented by a momentum eigenket lk). When the interaction is slowly switched on, as in (7.1 1.2), momentum eigenkets other than lk) -for example, lk') -are
427
7 . 1 1 . Time-Dependent Formulation of Scattering
populated. As we have seen (see Section 5.6), the transition probability is (to first order) evaluated in the following manner. First we write
(k'IUPl( t, - oo) ik) - � { (k'JV,( t') lk) dt'. (7 .11 .12) =
where VJ(t') eiHot·tAve - iHoi'IAe��' and Ek = ability of finding l k') at time t is then =
oc
fi?l2!2m .
The transition prob
2�1 e z VI k )l (k'J l 2 oo . , l(k'IUP l(t ) lk)l 112 2 2 2 [ ( E k - E k- ) /11 + 17 ] =
(7.11 .13) finite), the transition rate is just what we expect from the golden
As 17 --> 0 (t rule: � l(k'IU} l )(t , - oo ) lk )l 2 211"' l ( k'JVIk )l 21l ( Ek - Ek.) , (7.11 .14) where we have used (5.8.5). Note that the right-hand side of (7.11.14) is independent of t . It is important here to review the normalization conventions used for plane-wave states. In Section 1.7 we used the ll-function normalization (7 .11 .15) (k'lk) 1)<3l(k-k') . The completeness relation is then written as =
=
(7 .11.16) In applying !-dependent perturbation theory to scattering processes, it is more helpful to use box normalization, (7.11 .17) (k'lk) = ll k. . k ' where the allowed values for k are given by kx , y , z =
The wave function is given by
2 7r n x. L.
r.z
(7.11 .18)
l_erk·x (x lk) _ L312
(7.11 .19)
1 = Llk)(k l,
(7.11 .20)
=
instead of [ 1 /(27r)312]e'k·x. The completeness relation is k but for a very large box we might as well treat the variables
kx. ,
..
z
as
428
Scattering Theory
continuous, so
3
1 = ( 2� ) f d 3klk )(kl. (7.11 .21) This looks like (7.11.16) except for the presence of (Lj2 7T ) 3 . A very useful relation that enables us to go from the 8-function normalization convention to that of box normalization is (7 .11 .22) 8 - fn box normalization
normalization
as the reader may easily verify by inserting a complete set of states in the position representation. The box normalization is very convenient for evaluating the density of states. For this reason it is used more often in a treatment of scattering based on time-dependent perturbation theory. Thus if we are interested in scattering into the solid angle dQ, the relevant formula is
n 2 dn drl. = ( 2� r :� dEdQ
(7.11 .23)
2 ) 3 km d r!. w = h7T IWI V1k )12 ( _f_ 21T tz2
(7.11 .24)
[see (5.7.31)]. Coming back to scattering, which we view as a transition rate w from lk) into a group of states lk') subtending the solid-angle element dQ, we have w given by (for elastic scattering k ' = k) from the golden rule (5.6.34) and (7.11.14). This must be equated to (7 .11 .25) (Incident flux) X ;� drl.. As for the incident flux, we obtain [from j = ( n I m )1m(>/;* V'>/1 )]
(
- tk · x
e h U l = m Im L312
tk•x
V' �
) = hk
(7.11 .26) mL3 . Alternatively, incident flux = velocityjvolume = hkjmL3 . Putting every
3/2
thing together
da = ( mL3 ) ( 2 7T )( _f_ ) 3 ( km ) drl. hk h 21r tz2 = 41T1 2m tz2 j d 3 V( ) e
�-
X
X
_!_ j d 3x V( x) e ' (k - k') · x 2 L3 2
1(k - k') · x
(7.11 .27)
But this is precisely the result of the first-order Born approximation [(7.2.22)]. Higher-order Born terms can be obtained in a similar manner.
429
7. 1 2. Inelastic Electron-Atom Scattering
To sum up, the time-dependent formulation based on the time dependent Schrodinger equation [(7.11.1 )] enables us to derive easily the results we obtained earlier using the time-independent formalism-the Lippmann-Schwinger equation, the Born approximation, and so on. Fur thermore, it turns out that the time-dependent formalism is more suitable for discussing more-general reaction processes other than elastic scattering. As a concrete example to illustrate this point, we now turn to a discussion of the inelastic scattering of electrons by atoms. 7.12. INELASTIC ELECTRON-ATOM SCATTERING
Let us consider the interactions of electron beams with atoms assumed to be in their ground states. The incident electron may get scattered elastically with final atoms unexcited: e - + atom (ground state) e- + atom (ground state). (7 .12.1) This is an example of elastic scattering. To the extent that the atom can be regarded as infinitely heavy, the kinetic energy of the electron does not change. It is also possible for the target atom to get excited: e - + atom (ground state) -> e - + atom (excited state) . ( 7.12 .2) In this case we talk about inelastic scattering because the kinetic energy of the final outgoing electron is now less than that of the initial incoming electron, the difference being used to excite the target atom. The initial ket of the electron plus the atomic system is written as ->
Jk,O)
(7.12 .3)
where k refers to the wave vector of the incident electron and 0 stands for the atomic ground state. Strictly speaking (7.12.3) should be understood as the direct product of the incident electron ket Jk) and the ground-state atomic ket JO ). The corresponding wave function is (7 .12.4)
where we use the box normalization for the plane wave. We may be interested in a final-state electron with a definite wave vector k'. The final-state ket and the corresponding wave function are Ik' n > ,
and
1 L
312 e
I k'
lfn ( xl , . . . , x, ) '
·X
(7 .12.5)
where n = 0 for elastic scattering and n * 0 for inelastic scattering. Assuming that time-dependent perturbation theory is applicable, we can immediately write the differential cross section, as in the previous
430
Scattering Theory
section: du - (0 ---> n) df!
( )( )
2 L (k , n I V I kO) l 21T I (hklme L 2 1 2me k' k L 1T ---,;z (k , , n I V I k ,O) 4
1
()
3
1T 2 )h
3
k ' me 7
(7.12.6)
6
Everything is similar, including the cancellation of terms such as L3, with one important exception: k ' o= lk'l is not, in general, equal to k o= lk l for inelastic scattering. The next question is, What V is appropriate for this problem? The incident electron can interact with the nucleus, assumed to be situated at the origin; it can also interact with each of the atomic electrons. So V is to be written as Ze 2 + -e2 V= - [ r 1 lx - x 1 1
(7 .12.7)
Here complications may arise because of the identity of the incident electron with one of the atomic electrons; to treat this rigorously is a nontrivial task. Fortunately, for a relatively fast electron we can legitimately ignore the question of identity; this is because there is little overlap between the bound-state electron and the incident electron in momentum space. We must evaluate the matrix element (k'n ! VIkO), which, when explicitly written, is
z
1 -L 3 j d 3xe q·xn1. jd3Xi ·"* ( X 1
with q
X l/Jo(x l , . . . , x z )
o=
k - k '.
't' n
I'..•
' Xz
[ )
2e 2 __
r +
J (7.12.8)
e2 "L..1 :-1x - X-1..,.I
Let us see how to evaluate the matrix element of the first term, - Ze 2jr, where r actually means 1x 1. First we note that this is a potential between the incident electron and the nucleus, which is independent of the atomic electron coordinates. So it can be taken outside the integration in (7.12.8); we simply obtain
(7 .12.9) for the remainder. In other words, this term contributes only to the
7.12. Inelastic Electron-Atom Scattering
431
the elastic case we must still integrate e • q · x1 r with respect to x, which amounts to taking the Fourier transform of the Coulomb potential; this can readily elastic-scattering case, where the target atom remains unexcited. In
be done because we already evaluated in Section 7.2 the Fourier transform of the Yukawa potential [see (7.2.6) in conjunction with (7.2.2)]. Hence
--
f d 3x e•q·x = hm d 3xe iq · x -Ju 47T . . = 2 J q r
r
p. � o
(7 .12.10)
As for the second term in (7.12.8), we can evaluate the Fourier transform of 1 1 lx - x ,1. We can accomplish this by shifting the coordinate variables x -> x + x ,:
(7 .12.11)
Notice that this is just the Fourier transform of the Coulomb potential multiplied by the Fourier transform of the electron density due to the atomic electrons situated at x ,: Patom( x) =
2:8 <3l( x - x , ) .
(7.12.12)
We customarily define the form factor Fn(q) for excitation follows:
I O) to I n >
as
(7 .12.13) which is made of coherent-in the sense of definite phase relation ships-contributions from the various electrons. Notice that as q --> 0, we
have
for n = 0; hence the form factor approaches unity in the elastic-scattering case. For n i' 0 (inelastic scattering), F,(q) --> 0 as q --> 0 by orthogonality between I n ) and 10). We can then write the matrix element in (7.12.8) as
I . I (- - , d3xe•q
Ze' r
'(n
+L
e2
Ix
_
�
)I I
0) =
41T Z ' q
2
e
-
[ o,o + F,( q ) ]
(7.12.14)
We are finally in a position to write the differential cross section for inelastic (or elastic) scattering of electrons by atoms: du dO
(0 � n )
=
(k'k)
2me 41TZe2 q (;2 1T 4 2 [ - ono + 4m; (Ze2)2
= 7
1
q4
(k'k) 1 -
Ono
+
Fn{q) ]
Fn ( q) I
2 ·
2 (7 . 12. 15)
432
Scattering Theory
For inelastic scattering the ll"0-term does not contribute, and it is customary to write the differential cross section in terms of the Bohr radius, p, 2
as follows:
ao = , e 2me
da ( 0
d fJ
(7 .12.16)
n ) = 4Z 2a 02 ( kk ' ) ( qa1 ) 4 1 Fn (q) I 2 .
(7 .12.17)
o Quite often dajdq is used in place of dajdfJ; using q 2 = lk - k '1 2 = k 2 + k '2 - 2 kk 'cos 8 (7.12.18) and dq = - d(cos 8)kkjq, we can write da 2 7rq da (7 .12.19) dq = kk ' dfJ The inelastic cross section we have obtained can be used to discuss stopping power -the energy loss of a charged particle as it goes through matter. A number of people, including H. A. Bethe and F. Bloch, have discussed the quantum-mechanical derivation of stopping power from the point of view of the inelastic-scattering cross section. We are interested in the energy loss of a charged particle per unit length traversed by the incident charged particle. The collision rate per unit length is Na, where N is the number of atoms per unit volume; at each collision process the energy lost by the charged particle is En - E0. So dEjdx is written as -->
0
dE dx
N L (E,
Eo)
N L (E,
Eo)
n
n
I �; (0 -'> n) I""'·" k' 1 I4m.o> (n l � e'q
4Z2 ao
2
dq
k
qrlllll
2 'Trq
( l dq q kk' I F, q) 4 2
dq '' (7.12.20) O) '•I (E q k"a� � , There are many papers written on how to evaluate the sum in (7.12.20). * The upshot of all this is to justify quantum mechanically Bohr' s 1913 formula for stopping power, dE 4 7r NZe4 n 2m ev 2 (7.12 .21) ' I dx m ev 2 I where I is a semiempirical parameter related to the average excitation 81rN
-
_
-
Eo)
z
'lmm
(
2
)
*For a relatively elementary discussion, see K. Gottfried ( 1 966) and H. A. Bcthe and R. W . Jack:iw ( 1 968).
433
7.12. Inelastic Electron-Atom Scattering
energy ( En - E0). If the charged particle has electric charge ± ze, we just replace Ze 4 by z 2Ze4• It is also important to note that even if the projectile is not an electron, the that appears in (7 . 1 2.21 ) is still the electron mass, not the mass of the charged particle. So the energy loss is dependent on the charge and the velocity of the projectile but is independent of the mass of the projectile. This has an important application to the detection of charged particles. Quantum mechanically, we view the energy loss of a charged particle as a series of inelastic-scattering processes. At each interaction between the charged particle and an atom, we may imagine that a "measurement" of the position of the charged particle is made. We may wonder why particle tracks in such media as cloud chambers and nuclear emulsions are nearly straight. The reason is that the differential cross section (7.12.17) is sharply peaked at small q; in overwhelming number of collisions, the final direction of momentum is nearly the same as the incident electron due to the rapid falloff of q�4 and F"(q) for large q. me
an
Nuclear Fonn Factor
The excitation0 of atoms due to inelastic scattering is important for q - 109 em � 1 to 101 em� 1 . If q is too large, the contributions due to F0(q) or F"(q) drop off very rapidly. At extremely high q, where q is now of order 1/R nucleus - 10 1 2 em�\ the structure of the nucleus becomes important. The Coulomb potential due to the point nucleus must now be replaced by a Coulomb potential due to an extended object, 2 - ze r
__.
-
ze 2 f d 3x 'N ( r ') , lx - x'l
(7 .12.22)
where N( r ) is a nuclear charge distribution, normalized so that (7 .12.23)
The pointlike nucleus can now be regarded as a special case, with N ( r ') = 8 (31 ( r ' ) .
(7 .12.24)
We can evaluate the Fourier transform of the right-hand side of (7.12.22) in analogy with (7.12.10) as follows:
4
'1T = Ze 2 2 Fnucleus (q)
q
(7 .12.25)
434
Scattering Theory
where we have shifted the coordinates x --> x + x ' in the first step and
-
Fnucleus = jd 3xe iq• xN( r ) .
(7 . 12.26)
thus obtain the deviation from the Rutherford formula due to the finite size of the nucleus, We
da dQ
-
= ( ddaQ ) Rutherford I F(q) \ 2 ' -
(7 .12.27)
where ( daIdQ) Rutherford is the differential cross section for the electric scattering of electrons by a pointlike nucleus of charge Z j e j. For small q we
have
= 1 - (; q 2( r 2 )nucleus + 1
··· ·
(7 .12.28)
The q ·x-term vanishes because of spherical symmetry, and in the
q 2-term we have used the fact that the angular average of cos 28 (where 8 is the angle between q and r) is just t : (7 .12 .29)
The quantity ( r 2)nucleus is known as the mean square radius of the nucleus. In this way it is possible to "measure" the size of the nucleus and also of the proton, as done by R. Hofstadter and coworkers. In the proton case the spin (magnetic moment) effect is also important. 7.13.
COULOMB SCATIERING
This last section was not written even in preliminary form by Professor Sakurai, but it was listed as an item he would like to consider for Modern Quantum Mechanics. Professor Thomas Fulton of The Johns Hopkins Uni versity graciously offered to contribute his own lecture notes on Coulomb scattering for this section in honor of Professor Sakurai's memory. The editor deeply appreciates this gesture of friendship. However, the reader needs to be alerted that there could be stylistic and even notational changes here from the rest of the book. In this revised edition, the material is somewhat reorganized; the principal results previously obtained are still given in the main text, while the mathematical details are now presented in Appendixes C.l and C.2. Much of this material is a slightly expanded version of Gottfried's treatment (Gottfried 1966, pp. 148-153) and uses the notation of thi!
7.13.
435
Coulomb Scattering
text. The treatment of Coulomb partial waves is original. The present discussion of Coulomb scattering differs from the standard treatment (see, for example, Bohm 1951, and Schiff 1968) by dealing directly with the asymptotic forms. The differential equation (7. 13.7) is the standard form, which can also be obtained by separating variables in parabolic coordinates. It can be recognized as the differential equation for the confluent hyper geometric function,* and so the properties of this function (including asymptotic forms) can be applied to analyse this case. However, the present alternative approach can pass directly to the asymptotic limit by using Laplace transforms and complex variable techniques (see Appendixes C.1 and C.2), and thus avoid referring to the detailed properties of the confluent hypergeometric function. Clearly, this simpler approach, which can be understood without any reference to material outside the present textbook, loses information about the scattering wave function at the origin. Thus, for example , we can say nothing about the Gamow factor/ which is relevant for the study of many nuclear reaction rates. The interested reader is therefore encouraged to supplement his knowledge here with reading from other texts (e.g. , Schiff 1968) that deal with the behavior of the Coulomb wave function at the origin via power series expansion of the associated confluent hypergeometric function. Scattering Solutions of the Hamiltonian of the Coulomb Interaction
We start with the three-dimensional SchrOdinger equation with Cou lomb potential V(r) = for
a collision between particles of. charges Z1e and
( :� V2 -
or
z _ -_z_,I_ Z-=. ze_ r
-
z1
�2e}!J(r) = Eljl(r),
(7.13.1) - Z2e ,
as follows:
E>O
(7.13.2a) (7. 13.2b)
where m is the reduced mass m
= m1 m2/(m1 + m2) for the
particles Z1e
* See, for instance, E. T. Whittaker and G. N. Watson, A Course of Modern Analysis, 4th ed. (London: Cambridge University Press, 1935), Chapter XVI. t G. Gamow, Z. Phys. Vol. 51 , no. 204 (1928); R. W. Gurney and E. U. Condon, Phys. Rev. Vol. 33, no. 127 (1929).
Scattering Theory
436
and - Z2e. The -yk in (7. 1 3 . 2b) is given by -yk =
'Y
or
=
a
v
=
e2 -
hc
c --. = ----c:-:-
1 137.03608
(7. 13.3)
Note that 'Y > 0 corresponds to attraction. As long as we are interested in a pure Coulomb field , it is possible to write our solution I)Jk(r) to (7. 13.2) as ... I)Jk(r) = e' •x(u)
u
k·r
= =
ikr(1 - cos kz.
e)
ik(r - z)
=
ikw
(7.13.4)
The separation of variables for I)Jk(r) in (7 . 1 3 .4) is plausible if we recognize ( 1 ) that the solution will not involve the azimuthal angle because of the axial symmetry of the problem and (2) since I)Jk(r) represents the complete Coulomb-wave function (incident plus scattered wave) , terms must exist in its dominant asymptotic form that contain e•"k · r and , - l eikr . We will demonstrate shortly [see Equations (7. 1 3 .9a), (7. 1 3 . 13), and (7. 1 3 . 14)] that, with the choice of independent variables we are about to make, this will indeed be the case. (Another argument for our particular choice of variables is given by Gottfried 1966, 148.) Let us choose independent variables (z , w, A) where w = r - z and A can be taken to be
-
-
=
-
-
-
x a aA a + -r aw ax ax. -
-
-
(7. 13.5)
Note that because eikzx(u) is independent of A, the operation a/a;\ makes no contribution. The reader will readily verify that, operating on eikzx(u),
(7. 13.6)
437
7. 1 3 . Coulomb Scattering
Using (7. 1 3 .4)-(7. 13.6) in (7 . 13.2), we have
·]
[ tf
(7. 13.7)
d u d 2 + ( 1 - u) "' x (u) = 0 . du - ' u
Our first attempt at the solution of (7 . 13. 7) considers the asymptotic case-that is, solutions for large r - z (away from the forward direction, since r - z = r( 1 - cos 9) = 0 for 9 = 0) . Let us try two types of solutions, (1) x - u' and (2) x - eu . The first type of solution requires and, hence,
( - X. - i"')u' = 0, or X. x (u) _
U - i>
_
e - i>
4Jk (r) _ ei[k r - ,
In
=
- 1"1
(7. 13 .8)
(7. 1 3.9a)
k( r - z )
(7. 13 . 9b)
In k(r - z)] _
Equation (7. 1 3 .9b) clearly represents the incoming plane-wave piece. The second type of solution, x - eu , manifestly satisfies u(d2xldu2) u(dxldu) = 0, so heuristically we try for a solution that combines features of both solutions in product form, x(u) - u'eu ,
(7. 13 . 10)
and substitute into (7 . 1 3 . 7). The coefficient of u> + 1eu obviously vanishes, while the coefficient of the u'eu-term leads to the following relation: 2>.. + ( 1 - X.) - i"' = 0, or X.
Hence,
=
(7. 1 3 . 1 1 )
- 1 + i"'.
(7. 13 . 12)
and
4J(� -
ei[kr + >
In
k(r - z))
r - z
(7. 13 . 13)
'
clearly an outgoing spherical wave. Combining the two solution pieces given by (7. 13.9) and (7. 13. 13) linearly and noting that r - z = r(1 - cos 9) = 2r sin2( 9/2) while ln[2kr sin2( 9/2)] = ln(2kr) + 2 ln sin (9/2), we write the normalized wave function as ,,, (r) � 'l'k l f-.oo u
1 (21Tf/2
{ ei[k - r - , ln(kr - k - r)) +
ilk_r_ rl ln_z_k_ _'>_ +_ (k , -9')_e_ f.c� '-'
r
}
.
(7. 13 . 14)
There are extra phase effects at large r, which is due to the long range of the Coulomb interaction.
438
Scattering Theory
Now we must find the Coulomb scattering amplitude fc(k, 6). In terestingly, aside from a factor of 'Yik, the magnitude is just from (r - z): (1/2k sin2 ll/2), which appears in the asymptotic form (7. 13. 13). But we have to work hard to get it! Because the details here are quite involved mathematically (c.f. Appendix C . l ) , the first-time reader is encouraged to focus only on the final results summarized below. The physical basis of our results are discussed later on in this section. For the normalized ljlk(r) we obtain ljJk(r)
1 i(k -� 312 e z (21r)
[
_
In k(r-z)]
+
ei(kr+� In k(r-z)] g *I ( 'Y) 'Y r g 1('Y) 2k sin2(8/2)
]
·
(7. 13. 15)
This exhibits explicitly all the r- and k-dependent phases and leads to the identification from (7. 13. 14) that fc(k, ll) = eie(,)
=
In sin(e/ )] 2 2
'Yeile(,) + 2�
2k sin (6/2)
gf <'Y) g,('Y) "
Thus, as in (7 . 1 . 36) , the differential cross section is
I
dcrc(k, 6) = fc(k, ll) 12 = 2 -y24 dO 4k sin (6/2)'
(7. 13 . 16)
(7. 1 3 . 17)
where 'Y is given by (7. 13.3)-and we are back to the Rutherford formula of (7.2 . 1 1) . For the aficionados, we remind the reader of Hankel's formula (which checks for g
z =
I
cl
ess - zds
1 and f(l) 1
=
) h - f(1 - i'Y)'
0!
=
2'1Ti f(z)
--
=
eie(,)
1 ) . Thus =
f( 1 - i'Y) f(1 + i'Y)"
Partial-Wave Analysis for the Coulomb Case
We can rewrite ljl(r) from (7. 1 3.4) and (C. l . 5) as ljl(r) = e'l< · •x(u)
(7. 13 . 18)
(7. 13 . 19)
439
7.13. Coulomb Scattering
= A =
A
where
I I
c
c
eik · reik( r - z )t(l eikrteik·r( l - t) ( l
- t)d(t , 'Y)dt - t)d(t, 'Y)dt
(7. 13.20) (7. 13 .21)
We perform the partial-wave expansion of the plane wave analogous to (7.5. 18) as
L (2! + l)i1Ncos O)At(kr), 00
lf!(r) =
where
A t(kr) = A
(7. 13.22)
t�o
I
c
eikrth[kr(l - t) )(l - t)d(t, 'Y)dt.
(7. 13 .23)
Now, h = � (hPJ + hPl*) , where the hPl are spherical Hankel functions (see Appendix A and Whittaker and Watson 1935), ht(p) = £1(p)e,."/p, I
1
0
2
( �)
(
and, asymptotically,
Et(p)
1 = ( - i) + 1 +
A t(kr)
=
)
3 3" il +� p - p2 .
- 1 +
(7. 13.24)
O(p- 1 ). Hence, we write
(7. 13.25)
APl(kr) + Af2l(kr),
where A}1 l and A}2l correspond to the contributions from h1 and hi , re spectively. The explicit evaluation of APl(kr) and Afl(kr) is given in Appendix C.2 and leads to the following results
A}il(kr)
and
A e,...,12 [2'1Tigb)] 2ikr
X
{e - i(kr- (l,f2) + > In 2kr)
_
=
(7. 13.26)
0
e2i�t(k)ei(kr- (l,f2) + > In
2krJ} ,
(7. 13.27)
where we have optimistically introduced a real phase TJAk):
(7. 13.28)
440
Scattering Theory
Note that we write TJt(k) because 'Y depends on k [see (7.13.3)]. Let us look briefly at some simple examples to demonstrate that TJlk) is indeed a real phase. For I = 0, Eo = i,
e2i�o
. 2 g3<'Y) '!T! =
f(l - i-y)
= r(l + i-y)"
I
c,
2'1Ti . s - i - lds = 2'lTZg *J ('Y) = r(l + i-y)'
es
'
(7. 13 .29)
For I = 1 , we have
- 2'1Ti
-
[
2 1 f(l + i-y) f(2 + i-y)
- 2'1Ti 2'1Ti(l - i-y) [(l + i-y) - 21 = f(2 + i-y) f(2 + i-y)
J
(7. 13.30)
g3(-y) = (1 - i-y) r(l - i-y) = f(2 - i-y) . f(2 + i-y) f(2 + i-y) g J('Y) The same pattern repeats for I = 2, 3, . . . . Thus for a general I, we e2;� 1
=
conjecture
e2;�I
f(l + I - i-y) = ---'------'-'f(l + I + i-y)"
(7.13.31)
As a final summary we remark that the Coulomb partial-wave scattering case differs from the general partial-wave case (see Section 7.6) only through the modification ei(kr - (/.,12)) ei(kr - (/"12) + > In 2kr) 1 � - ----- - hPl(kr) - 2 2ikr 2ikr f(1 + I - i-y) e2;aI � e2;�I - � = c=-'-f(l + I + i-y) "
(7. 13.32)
Let us note that the asymptotic form of the Coulomb scattering wave function, Eq. (7. 13. 15), is different from that which appears in Eq. (7. 1 .33), written for large r as tfJ �
eikz
+ /(9)
-. eikr
r
(7.13 .33)
The latter form is valid only for short-range forces, while the Coulomb force is a long-range one. The form (7. 1 .33') is modified as follows for the Coulomb interaction: The first term on the right-hand side of (7.13. 15)
Problems
441
contains not only the factor eikz as in (7. 13.33), but also a modifying, coordinate dependent, multiplicative phase factor e _ ;, tn k(r-z) . Hence , the incident plane wave is slightly distorted , irrespective of how far away the particle is from the origin. Analogously, the second term on the right-hand side of (7. 13. 15) , corresponding to the outgoing spherical wave, contains the r dependent phase factor ei> tn Zkr. Nevertheless, in spite of these long range effects leading to phase factors that vary with distance, we can still define a scattering cross section for short-range forces because the dis torting terms, although they alter physically observable quantities (e.g. , the mean current) , lead to alterations that vanish as r goes to infinity.
PROBLEMS 1.
The Lippmann-Schwinger formalism can also be applied to a one dimensional transmission-reflection problem with a finite-range poten tial, V( x ) * 0 for 0 < 1x1 < a only. a. Suppose we have an incident wave coming from the left: ( xl>) = e'kx/V2'1T . How must we handle the singular 1/( E - H0 ) operator if we are to have a transmitted wave only for x > a and a reflected wave and the original wave for x < - a? Is the E ---> E + if prescrip tion still correct? Obtain an expression for the appropriate Green' s function and write an integral equation for (xl>f< + l ) . b. Consider the special case of an attractive 8-function potential 2) ( yn V= 2m
2.
(y > O) . Solve the integral equation to obtain the transmission and reflection amplitudes. Check your results with Gottfried 1966, 52. c. The one-dimensional 8-function potential with y > 0 admits one (and only one) bound state for any value of y. Show that the transmission and reflection amplitudes you computed have bound-state poles at the expected positions when k is regarded as a complex variable. Prove 2 m2 j d 3x j d 3x ' V( ) V( ) sin k l - 'l atot ==
3.
4 '1T n
8(x)
r
r'
x
x
k 2 1X - X '1 2
in each of the following ways. a. By integrating the differential cross section computed usmg the first-order Born approximation. b. By applying the optical theorem to the forward-scattering amplitude in the second-order Born approximation. (Note that f(O) is real if the first-order Born approximation is used.] Consider a potential V = 0 for > R , V = V0 = constant for < R , r
r
442
Scattering Theory
where V0 may be positive or negative. Using the method of partial waves, show that for ! Vo l « E = h2k2j2rn and kR « 1 the differential cross section is isotropic and that the total cross section is given by ()"tot =
4.
( 16w ) rn 2 V02R6 9
n4
Suppose the energy is raised slightly. Show that the angular distribution can then be written as da d fJ = A + B cos (J . Obtain an approximate expression for BIA . A spinless particle is scattered by a weak Yukawa potential Voe ��r V = ---"-p.r
where p. > 0 but V0 can be positive or negative. It was shown in the text that the first-order Born amplitude is given by a. Using /( 1 )( 0 ) and assuming 1811 « 1 , obtain an expression for 81 in terms of a Legendre function of the second kind, b. Use the expansion formula Q,(
r) =
I!
1 · 3 · 5 . . . (2/ + 1) x
{ '+ l + r
+
1
( / + 1)( 1 + 2) 2(2/ + 3) '+3
r
1
( / + 1 ) ( 1 + 2) ( / + 3)(/ + 4) _1_ + ··· �/+ 5 2 · 4 · (2/ + 3)(2/ + 5)
}
{ irl > l )
to prove each assertion. (i) 81 is negative (positive) when the potential is repulsive (attrac tive). (ii) When the de Broglie wavelength is much longer than the range of 1 1 the potential, 81 is proportional to k 2 + . Find the proportional ity constant.
443
Problems
5. 6.
Check explicitly the Px uncertainty relation for the ground state of a particle confined inside a hard sphere: V = for r > a, V = 0 for r < a. ( Hint: Take advantage of spherical symmetry.) Consider the scattering of a particle by an impenetrable sphere V( r ) = { � for r > a for r < a. a. Derive an expression for the s-wave ( I = 0) phase shift. (You need not know the detailed properties of the spherical Bessel functions to be able to do this simple problem!) b. What is the total cross section a [ a = f(dajdQ) dQ] in the extreme low-energy limit k 0? Compare your answer with the geometric cross section 1ra 2• You may assume without proof: x -
oo
--->
7.
8.
:� = 1 / ( 8 ) 1 2 • /( 0 ) = ( �) I (21 + l ) e' 8'sin i>1P1(cos 0 ) . I�0
Use 81 = �(b)lh� J;k to obtain the phase shift 1>1 for scattering at high energies by (a) the Gaussian potential, V = V0exp( - r 2ja 2 ), and (b) the Yukawa potential, V = V0exp( - w )/J.tr. Verify the assertion that 81 goes to zero very rapidly with increasing I ( k fixed) for I » kR, where R is the "range" of the potential. [The formula for �(b) is given in (7.4.14)].
a. Prove
where r ( r ) stands for the smaller (larger) of r and r '. b. For spherically symmetric potentials, the Lippmann-Schwinger equa tion can be written for spherical waves: <
>
! Elm ( + ) ) = ! Elm ) +
E
1 -
.
Ho + If
V I Elm ( + ) ) .
Using (a), show that this equation, written in the x-representation, leads to an equation for the radial function, A 1 ( k ; r), as follows: 2mik A 1 ( k ; r ) = j1 ( kr ) - -----,;2 X
{""J,( kr 0
<
) h Pl( kr ) V( r') A 1 ( k ; r ' ) r '2 dr '. >
Scattering Theory
444
By taking r very large, also obtain sin I) /, ( k ) = e' 1 k 6
=-
9.
1
-
( �7 ) f'}1 ( kr) A ( k; r) V( r) r 2 dr. 1
Consider scattering by a repulsive 1>-shell potential:
( �7 )v(r) = yl>(r - R), (y > O). a. Set up an equation that determines the s-wave phase shift 1>0 as a function of k ( E = n2k2j2m). b. Assume now that y is very large,
y » R1 , k.
Show that if tan kR is not close to zero, the s-wave phase shift resembles the hard-sphere result discussed in the text. Show also that for tan kR close to (but not exactly equal to) zero, resonance behavior is possible; that is, cot 1>0 goes through zero from the positive side as k increases. Determine approximately the positions of the resonances keeping terms of order 1jy; compare them with the bound-state energies for a particle confined inside a spherical wall of the same radius,
V= O, r < R; V= oo, r > R.
Also obtain an approximate expression for the resonance width defined by r= 2
f
d(cot «50 )/dE ] i i> E,
..,.--.,----__-,---
[
10.
11.
and notice, in particular, that the resonances become extremely sharp as y becomes large. ( Note: For a different, more sophisticated approach to this problem see Gottfried 1966, 131 -141 , who discusses the analytic properties of the Drfunction defined by A1 = j1jD1.) A spinless particle is scattered by a time-dependent potential Y"" ( r, t ) = V(r) cos wt. Show that if the potential is treated to first order in the transitiOn amplitude, the energy of the scattered particle is increased or decreased by nw. Obtain dajdrl. Discuss qualitatively what happens if the higher-order terms are taken into account. Show that the differential cross section for the elastic scattering of a fast
445
electron by the ground state of the hydrogen atom is given by
12.
:� = ( 4;�:4){1- [ 4 + (�:o)r
r
(Ignore the effect of identity.) Let the energy of a particle moving in a central field be £ ( 11 12 13 ), where ( 11 , 12 , 13 ) are the three action variables. How does the func tional form of E specialize for the Coulomb potential? Using the recipe of the action-angle method, compare the degeneracy of the central field and the Coulomb problems and relate it to the vector A. If the Hamiltonian is H=
ip.2 + V( r ) + F(A2 ) ,
how are these statements changed? Describe the corresponding degeneracies of the central field and Coulomb problems in quantum theory in terms of the usual quantum numbers (n, !, m) and also in terms of the quantum numbers (k, n). Here the second set, ( k, m, n), labels the wave functions !!2/:'"(a{Jy). How are the wave functions �/:'"(a{Jy) related to Laguerre times spherical harmonics? m,
APPENDIX A
Brief Summary of Elementary Solutions to Schrodinger's Wave Equation
'
Here we summarize the simple solutions to Schrodinger's wave equation for a variety of soluble potential problems. A.l. FREE PARTICLES ( V = 0)
The plane-wave, or momentum, eigenfunction is where
1 "' k (x t ) = e ik•x - i wt 2 ' , ( 2 77 )3/
E
k = hp '
w = - =
n
and our normalization is
(A.l .l)
n k2 p2 = --
(A.l .2)
2m
2mn
(A.lJ)
The superposition of plane waves leads to the the one-dimensional case, 1 V2 7T
.ji ( x , t ) = --
J
oo
- oo
wave-packet
dkA ( k ) e i( k x - wt )
(
w =
description. In
) 2m
nk 2
.
(A.1 .4)
For IA(k)l sharply peaked near k "' k 0 , the wave packet moves with a 446
A.2.
447
Piecewise Constant Potentials In One Dimension
group velocity
) v , ( dw dk g
[ ( X o ]1/4 oo 3 J
k0
=
m
hk 0
•
The time evolution of a minimum wave packet can be described by 1/; ( x, t ) =
.l
)2
2 '1T
- oo
)
e - ( Ax)5( k - ko)'+t k x - tw (k) t dk , w ( k =
(A.l .5) hk 2
2m ' (A.l .6)
where
(A.1 .7) So the width of the wave packet expands as at t > 0.
(A.l .8)
A.2. PIECEWISE CONSTANT POTENTIALS IN ONE DIMENSION
The basic solutions are:
- /2m(E-vo ) . ( )
kE
< V = V0 (classically forbidden region) : ·'· 'f' E
-
must be set equal to 0 if x discussion).
(c ±
=
± oo
A.2.1
(A.2 .2)
( x ) = c e"x + c e - u ' +
h2
is included in the domain under
Rigid-Wall Potential (One-dimensional Box)
Here
O < < L, v= {� for otherwise. x
(A.2.3)
Brief Summary of Elementary Solutions to Schrodingcr's Wave Equation
448
The wave functions and energy eigenstates are t/JA x ) = {i sin( nZx ) . n = 1 , 2, 3 . . . , ;, 2 n 2w 2 --E=
2mL 2
(A.2.4)
Square-Well Potential
The potential V is
{
for Jxl > a - Vo for Jxl < a ( V0 > 0) The bound-state ( E < 0) solutions are: for Jxl > a , t/1 coskx (even parity) for JxJ < a , sin kx (odd parity) where
v= o
(e-•lxl
}
E -
(A.2. 5 ) (A.2.6) (A.2.7)
The allowed discrete values of energy E - n 2K 2j2m are to be determined by solving ka tan ka = Ka (even parity) (A.2.8 ) ka cot ka = - Ka (odd parity) . Note also that and k are related by 2mVo a 2 ( A.2 .9 ) - ( k 2 + K2 ) a2. =
K
---2 -"
ll
=
A.3. TRANSMISSION-REFLECTION PROBLEMS
In this discussion we define the transmission coefficient T to be the ratio of the flux of the transmitted wave to that of the incident wave. We consider these simple examples. Square Well ( V 0 for Jxl > a, V - V0 for Jxl < a.) 1 - ----.,. ---T = ---,----2 ] sin2 2k'----,{ I + [ ( k '2 - k 2 ) 2 /4k 2 k'---:;a} 1 (A.3.1) =
=
449
A.3. Transmission-Reflection Problems
where k=
J
2mE
(A.3.2)
!z 2 '
Note that resonances occur whenever n
=1 ,2,3, . . . .
(A.3.3)
Potential Barrier ( V = 0 for lxl > a, V = V0 > 0 for lxl < a.) Case 1 : E < V0• T=
1
{ 1 [ ( k 2 K 2 ) 2j4 k 2K 2 ] sinh22 K a } +
+
1
Case 2: E >
replaced by - V0•
V0• This case is the same as the square-well case with V0
Potential Step ( V 0 for x < 0, V V0 for x > 0, and E > V0.) =
=
(A.3.5)
with
k
=
J
2mE !z 2 ,
(A.3.6)
More General Potential Barrier [ V(x) > E for a < x < b, V(x) < E outside range [a, b).)
The approximate JWKB solution for T is (A.3 .7 )
where a and b are the classical turning points.* *JWKB stand for Jeffreys-Wentzel-Kramers-Brillouin.
Brief Summary of Elementary Solutions to Schrodinger's Wave Equation
450
A.4. SIMPLE HARMONIC OSCILLATOR
Here the potential is
V( x ) = m w22x 2 '
(A.4.1)
� = V. f!!!E n x.
(A.4.2)
and we introduce a dimensionless variable The energy eigenfunctions are
>/;£ = (2"n ! ) - 1 /2( :; r/4 e - e;2H" ( 0
and the energy levels are
( �) ,
E = nw n +
(A.4.3)
n = 0, 1 ,2, . . . .
(A.4.4)
The Hermite polynomials have the following properties: a"
Hn(�) = ( - lte' a�" 2
e-<
2
r. Hn· m H" (�) e - <2 d� = 11'1122" !5""'
d2 - Hn de
dHn + 2 H n 0 ., d� Hom = 1 , H1m = 2�, -
2c
-
n
=
H2m = 4e
(A.4.5) 2, H3 m = se
-
Hi�) = 16�4
-
-
12�,
4se + 12.
...
A.5. THE CENTRAL FORCE PROBLEM (SPHERICALLY SYMMETRICAL POTENTIAL v V(r).)
Here the basic differential equation is
1 _i_ ( sinO N£ ) B [ -� _!_ i_ ( lfiE ) 2 + r 2 m r 2 ar ar r 2 sin8 ao ao 1 B 2l/l£ ] + 2 · 2 -2 + V(r) >/;E = E>/;E
sm (} B
(A.5.1)
(A.5 .2)
451
A . 5 . The Central Force Problem
The method of separation of variables, 'I' E(x)
=
R(i) Y{ (0,),
leads to the angular equation
- [ si�O :o (sin O :o )
+
si�2 0
::2 ] Y1"'
(A.5.3) = I ( I + 1 ) Y/'' ,
where the spherical harmonics Y1 "' ( 0, cp), I = 0, 1 , 2, . . . , m = - I, - I + 1 , . . . , + I satisfy - i _i_ YI"' = mYI"' acp and Y1"'( 0, cp) have the following properties: }/"'( IJ , cp ) = ( - 1 ) "'
(A.5.4)
(A .5 .5)
(A.5 .6}
2�:1 ��::�: P/''(cos iJ ) e' "'
for m 2: 0,
Y/'( 0, cp} = ( - 1} 1"' 1}/l ml*( 0, cp} for m < 0, ' P1"'(cos !J ) = (l - cos 2 1J ) "' 12 d " m P1(cos iJ ) for m � 0, d(cos 0 ) 1 d 1 {1 - cos 2 1 1} ( 0) , PI ( cos IJ ) = I 2 1! d(cos iJ ) I Y1 - V/3 4; cos O, 0 _
(A.5 .7)
For the radial piece of (A.5.3), let us define u A r ) = rR ( r ) .
(A.5.8)
Brief Summary of Elementary Solutions to Schrodinger'' Wave Equation
452
Then the radial equation is reduced to an equivalent one-dimensional problem, namely, _!f_ d2u£ + r + /(/ + 1)21l 2 u£ - Eu£ 2 m dr 2 2 mr subject to the boundary condition
[v( )
_
]
u E (r) l,-0 = 0 . (A.5.9) For the case of free particles [V(r) = 0] in our spherical coordinates: R ( r) = c1iJ(p) + c2n1( p) ( c2 = 0 if the origin is included.) (A.5. 1 0) where p is a dimensionless variable p kr, k = / 2:2£ . (A . 5.1 1) =
We need to list the commonly used properties of the Bessel functions and spherical Bessel and Hankel functions. The spherical Bessel functions are
. sm p Jo ( P ) = -p , . sm p cos p } ( P ) = -- - -- , p p2 !
(�- ) ( ]_p3 .!_p ) cos p - ]_p2 sin p.
(A.5 .12)
p, }2 ( p ) = 3 .!.p sin p - �cos P P2
n 2( p) = -
-
For p -+ 0, the leading terms are p . Jt ( P ) -;;:Q (2/ +' 1)!! ' where
n ( p ) - - (2/P-I+!1)!! , (A.5 .13) '
p�O
(2/+1)!! = (2/+1)(2/-1) , , , 5·3·1. In the large p-asymptotic limit, we have }, ( p) - -1p cos [ p - (/ +l)'IT 2 ], p � oo
(A.5 .14) (A.5.15)
A.5. The Central Force Problem
453
Because of constraints (A.5.8) and (A.5.9), R(r) must be finite at r 0; hence, from (A.5. 10) and (A.5. 13) we see that the nt{p)-term must be deleted because of its singular behavior as p � 0. Thus R(r) c1Mp) [or, in the notation of Chapter VII, Section 7.6, At(r) R(r) cd1( p)] . For a three-dimensional square-well potential, V V0 for r R (with V0 > 0), the desired solution is (A.5 .16) R ( r ) = A1( r ) = constant Jkxr ) , where 2m ( V0 - l E I) = , r < R. ( A.5. 1 7 ) 2 =
=
a
[
-
=
=
=
<
1112
h
As discussed in (7 .6.30), the exterior solution for r > R , where V = 0, can be written as a linear combination of spherical Hankel functions. These are defined as follows: which,
h}1 > ( p ) = it ( P ) + in t ( P ) h}1l*( p ) = h}2l ( p ) = it ( p ) - int ( P ) from (A.5.15), have the asymptotic forms for p --> oo hl1 l( P )
- .!..p e ' I P - U +
1 ) "/21
(A.5.18) as follows:
p � oc;
(A.5 .19) If we are interested in the bound-state energy levels of the three dimensional square-well potential, where V( r ) = 0, r > R , we have u1( r ) = rA 1 ( r ) = constant e - "1 ( : ) , (A.5.20) 1/2 2 £ K = �� j
(
)
To the extent that the asymptotic expansions, of which (A.5.19) give the leading terms, do not contain terms with exponent of opposite sign to that given, we have-for r > R -the desired solution from (A.5.20): A 1 ( r ) = constant h�1l( iKr ) = constant [ j1 ( iKr ) + in 1 ( iKr )] , (A.5 .21) where the first three of these functions are h<1> O ( iKr) = -
-1 e - "' Kf
(A.5.22)
454
Brief Summary of Elementary Solutions to Schroding,er's Wave Equation
Finally, we note that in considering the shift from free particles [V(r) = OJ to the case of the constant potential V(r) = V0, we need only to replace the E in the free-particle solution [see (A.5.10) and (A.S.ll) by E - V0 . Note, though, that if E < V0, h �1· 2l(iKr ) is to be used with K
= J2 m(V0 - E)j h 2 •
A.6. HYDROGEN ATOM
Here the potential is
Ze 2 V(r) = - r
(A.6.1)
and we introduce the dimensionless variable p
=
(Sm�l£1 )
1/2
(A.6.2)
r.
The energy eigenfunctions and eigenvalues (energy levels) are o/nlm
R ., 1 ( r ) �"' ( 0 , > )
n - / - 1)! } R "1 (r) = - {( 2Z ) 3 ( na o 2n [( n + I ) ! ] 3 =
1/2
e - p;2p1L2'+ n + l' ( p )
(independent of l and m)
(A.6.3)
h2 a 0 = Bohr radius = -m ee 2 2 Zr . p= 2 1+1, na 0 n
The associated Laguerre polynomials are defined as follows: where-in particular-
dq ) ( p L% = dp q Lp ( p ) ,
( A .6 . 4
)
(A.6.5)
and the normalization integral satisfies ]2 2d t fe - pp2' [ n2/+ L +l (p) p p
= 2 n [ (n + /)!] 3 (n - / - 1)!
( A .6.6 )
455
A.6. Hydrogen Atom
The radial functions for low n are:
R 21 ( r ) =
The radial integrals are (rk)
=
2 / 3 �e-Zrj2 a0• (� ) 2a a0 y'3 _,
o
J: dr r2+k [Rnt (r)f,
(;_i) [3n2 (a�n2) [Sn2 m n�o' (r)
=
=
=
(A.6.7)
-
zzz
l(l + 1)]
+ 1 - 31(1 + 1)]
(A.6.8)
APPENDIX
B
Proof of the Angular-Momentum Addition Rule Given by Equation ( 3.7.38)
It will be instructive to discuss the angular-momentum addition rule from the quantum-mechanical point of view. Let us, for the moment, label our angular momenta, so that }1 :;:; }2 • This we can always do. From Equation (3.7.35), the maximum value of m , m max is m max
=
m Imax + m max 2
=
j"
I
+ }.2 •
(B . l .l )
There is only one ket that corresponds to the eigenvalue m max, whether the description is in terms of liJ h; m 1 m 2 ) or l iJ h; jm ) . In other words, choosing the phase factor to be 1, we have
(B.l .2)
In the I J1 j2 ; m 1 m 2 ) basis, there are two kets that correspond to the m eigenvalue m max - 1, namely, one ket with m 1 = m ;"ax - 1 and m 2 m �ax and one ket with m 1 m ;"ax and m 2 = m �ax - 1. There is thus a twofold degeneracy in this basis; therefore, there must be a twofold degeneracy in the I J1 j2 ; jm ) basis as well. From where could this come? Clearly, m max - 1 is a possible m-value for j = }1 + }2 . It is also a possible m-value for j j 1 + )2 - 1�in fact, the maximum m-value for this j. So }1• j2 can add to j 's of }1 + }2 and }1 + }2 - 1. We can continue in this way, but it is clear that the degeneracy cannot increase indefinitely. Indeed, for m min = - )1 - j2 , there is once again a single ket. The maximum degeneracy is (2}2 + 1)-fold, as is apparent =
=
=
456
457
Proof of the Angular-Momentum Addition Rule Given by Equation (3. 7 .38)
TABLE B. I . m
Special Examples o f Values o f m . m 1 , and m 2 for the Two Cases ii � 2 , ), � 1 and h � 2, ;, � � , Respectively
ii � 2 , h � l
3
( m1 , m2 )
( 2 , 1)
Numbers of States
1
h � .
2
. - h �2 m
I
( m1 , m 2 ) Numbers of States
(2 . ! ) 1
2
(1, 1 ) (2 , 0) 2 l 2
1
0
(0, 1 ) ( - 1 , 1) (1 , 0) (0,0) 1 2 1 ( , ) ( , - 1) 3
3
-1
-2
-3
3
2
1
( - 2 , 1) ( - 2 ,0) ( - 1 , 0) (0, - 1) ( - 1, - 1 ) ( - 2 , - 1)
3 - ,
(O, J) ( - l, J ) ( - 2, J) (1 . ! ) ( 2 , - t) ( 1 , - ! ) (0, - ! ) ( - 1 , - ! ) ( - 2 , - ! ) 1 2 2 2 2
from Table B.1 constructed for two special examples: for }1 = 2, }2 = 1 and for }1 = 2, }2 = 1 . This (2}2 + 1)-fold degeneracy must be associated with the 2 }2 + 1 states j: (B l .3) i1 + iz . J1 + iz - 1 , . . · . i1 - i2 · If we lift the restriction }1 � )2 , we obtain (3.7.38). .
APPENDIX C
Mathematical Details of Coulomb Scattering Formalism C.l.
THE COULOMB SCATTERING AMPLITUDE/c(k, 9)
Let us go back to the differential equation (7. 13.7). The theory of differ ential equations tells us that the only singularity in the finite u-plane is at u 0.* (We want the solution that is regular at u = 0.) We examine x(u) in the Laplace transform space, t:
=
x(u) =
--+
tI z '•
eu'f(t)dt,
(C. l . l)
where the path of integration t1 t2 in the complex t-plane will be decided later. If we substitute x(u) given by (C . 1 . 1) into (7. 13.7), we obtain
tI z ,,
[ ur + (1 - u) t - i-y] feu'dt
=
J:>[
(t - i-y) + t(t - 1)
:J eu1dt
= t(t - 1) eu'll :� + r: {(t - i-y)f - :t [t(t - 1)f(t)] } eu'dt = 0,
(C. 1.2)
d . (t - t-y)f(t) [(t - 1 ) tf(t)] dt
(C. 1 .3)
where we have integrated by parts. If we assume that surface terms vanish, evidently and we can easily obtain the solution for f(t) as
f
=
= 0,
Al'- 1 (1 - t) -i>,
(C. 1 . 4)
where A is some constant. As an interim summary we have [from (C. l. 1)] x(u)
=
A
I
c
eu't'- 1 ( 1 - t) -i'dt,
I'z --+ I �
c
.
(C. 1.5)
*For the mathematical background, see E. T. Whittaker and G. N. Watson, A Course of Modern Analysis, 4th ed. (Cambridge, London: 1935), Chapter XVI. 458
C.l. The Coulomb Scattering Amplitude f, (k, 6)
459
Now we must choose C. Note that the integrand in (C. 1 .5) has branch points at t = 0, 1 in the complex t-plane; see Figure C. l . For the surface term in (C. l .2) to vanish, we require that the t at the end points contain an imaginary part that goes to + oo. This is because u = ik(r - z) = ikr ( 1 - cos 6) = iK and K > 0. Hence, and surface terms vanish. But there are lots of ways of reaching t = a + ioo! The choice of C indicated in Figure C . 1a will give the correct asymptotic forms, as the actual calculations show [see (C. l . lO)], and we will adopt this specific contour. Let s = ut. Then (C. l .5) becomes - 1 u s i> r (� es = A (u c x )
=A
(:r
I� I c
ds e'i' - 1 (u - s) -i\
(C. 1.6)
I-
(C. 1 .7)
where C is the s-plane contour depicted in Figure C.2a. We require that x(u) is regular at u = 0; in fact, it is because x(O) = ( - 1) - i'A
c
e'
s
ds = ( - 1) -i'A 21Ti.
We now evaluate (C. l .6) for u � ao-the asymptotic limit-and show that the correct form is obtained. For the curve C in Figure C.2b and for a 1 )/iK, but this is small for fixed s, s = - (s0 ± iE) and s/u = - (s0 ± iE K � oo . So, we can expand (u - s) in powers of s/u. For the curve C2 in
I- plane
1
0
FIGURE
(b)
(a)
C.l. (a) Branch point at t 0,1 and simple straight-line branch cut between (0, 1) in t-plane. (b) "Rubber-band distorted" branch cut between (0, 1) and a possible contour C for (C. l.5). =
460
Mathematical Details of Coulomb Scattering Formalism
s-plane c
/ I
{
\
/
'
,..
'
.... _
- - -...------;--....
-----� s=u=ikr(1 -cos 8)
--
_::::::====t-:5 (a)
s-plane
!JGURE C.2. (a) s-plane contour C into C1 plus C2.
(b)
c,
C for Equation
(C. 1 .6). (b) Rubber-band distortion of
Figure C.2b, s = - s0 + i(K :±:: e); hence, s/u = 1 - (s0 :±:: ie)IK ;;t00 1 , so we can not expand in powers of s/u . However, if we change variables s � s' here, where s' = s - u, then expansion about s'/u is possible. Hence, the contour C is now replaced by the contour given in Figure C.2b and, further, s � s' takes C2 � C1 :
J_ds J c
and
=
C1
ds +
J ds = J ds + J ds' ts C2
C1
�
s +
C1
(C. l .8)
u
(C. l .9)
C. I . The Coulomb Scattering Amplitude f, (k, 6)
461
Here (s' + ut- 1 is clearly expandable in s'/u, and-since s' is a dummy integration variable for the C1-contour-we may write (C. 1 .9) more com pactly as
Note that the coefficients of the two integrals in (C. l . lO), u-i> and e"ui'- 1 ( - 1) -i', do have the correct asymptotic forms [see (7. 13.9), (7. 1 3 . 12), and (7. 13. 14)] for Coulomb scattering. The leading terms in (C. 1 . 10) are obtained when we let s/u � 0; then x (u) - 2'1TiA [u- i'gJ{'-() - ( - ui' - 1 e"gz('Y)]
2'1Tig1('Y) =
J
e'si'- 1 ds
c
(C. l . l l)
i
Integrating the expression for 2'1Tig2('Y) by parts, we have
= + i'Ygi ('Y) 2'1Ti.
Noting that - u = u * , we find
. [+ [
x(u) :::: 2'1TiAu - "g1 ('Y) 1
(C. 1. 12) (u*Vi' g*('Y) ( ._ i'Y ___.!. _ ;, g i 'Y) u
= 2'lTiAu - i'g1 ('Y) 1 + ei
::l
]
e" u
[u-i'g1 ('Y)]* (u* ) + i> gi('Y) - [u -i'gt ('Y )] u- i> g 1 ('Y)
(u* ) i' (u) _ ;,
= ( - i)i' [k ( r - z) t = e + >wl2ei>l nk( r-z ) = (i) - i' [k ( r _ z) ] - i> = e+�l2e-i>lnk(r-z)
(C. l . 13)
(C. 1 . 14)
(C. 1 . 15)
462
Mathematical Details of Coulomb Scattering Formalism
in going from X � 1fJ [multiplication by eikz as in (7. 1 3.4)), we obtain the normalized asymptotic lfldr), which is given by Eq. (7. 13. 15). C.2.
EVALUATION OF A)l l (kr) AND A}Zl (kr) NECESSARY FOR mE PARTIAL WAVE ANALYSIS OF COULOMB SCATTERING
Let us note that
eikrt eikr(1 - t) = eikr ' eikrt e -ikr(1 - t) e -ikr e2ikrt =
so that
Aeikr A)ll (kr) = Zkr
I
c
(C.2.1) '
(C.2.2)
E1 [kr(1 - t)Jd(t, -y) dt.
Since d(t, -y) t- 2 for large ltl and E1 t 1 and the only singularities in the t-plane are branch points t 0, 1 and possible poles from E1 when its argument kr(1 - t) vanishes, we close the contour C in the manner shown in Figure C. lb. This leads at once to �
0
�
�
=
A)ll (kr)
=
(C.2.3)
0,
since-thinking of a stereographic projection of the complex plane on the unit sphere-the contour C can be viewed as surrounding the region ex terior to it, where there are no poles. For A �2l (kr), we have
AFl (kr) =
A;��kr I
c
E{ [kr(1 - t)J e2ikrt d(t, -y) dt.
-
-
(C.2.4)
We play the same game as we did between (C. l .5 ) and (C. 1 .6). Let s = ut, where u 2ikr; then t s!U and 1 t (u s)IU, so =
A�2l (kr)
=
�
iA m2
Ic e(srut-1 (u : r =
s
=
i> - 1
E{
(u � ) �' s
(C.2.5)
where C is like Figure C.2a except that now the ends of the branch point in the s-plane are s = 0 and s = 2ikr. We recall that u* - u, and therefore =
C.2. Evaluation of Ai'> (kr) and Al'> (kr) +
:::
�
i
U
[
(;t)]ds (u.) ds
u) i� - 1 ( - s) - i� - 1
tf + " (s + e - ur2 -u - i�
J
CI
e'si� - 1
E;
463
E;
21
(C.2.6)
where in the last equation we have used the asymptotic form in for E; (U/2i) as given in (7. 13.24) . Note next that e -ut2 u - i� e��12 e - i(kr+ � 1n2kr) ; hence , ( C .2. 6) become s AFl (kr)
where
=
_
:
A �,12 - i[kr- (M2)+ �ln 2kr] {e 2'1Tig ('Y) 1 2 kr
_ ei[kr - (t�t2) + �ln2kr] 2'1Tig3('Y)
. ()=J
2mg "{ 1
c1
2'1Tig3(-y) = l - 1
e..ss i� - 1 ds
J
c1
=
2'1Ti
i-y )
( ;t) ds
r(l
e's - i� - 1 E;
},
_
.
u =
(C .2.7) (C. 2. 8)
Note g1 (-y) and g3(-y) are used in Eq. (7. 13 . 28), and indeed lead to a real phase TJ1(k).
SUPPLEMENT I
Adiabatic Change and Geometrical Phase When the author died in 1982, this book was left in manuscript form ; subsequently, there have been some new developments in quantum me chanics. The most important development is a definitive formulation of geometrical phases, introduced by M. V . Berry in 1983. The phase factors accompanying adiabatic changes are expressed in concise and elegant forms and have found universal applications in various fields of physics, thus giving a new viewpoint to quantum theory . We review here the physical consequences of these phases, which have in fact been used unconsciously in some cases already, by adding a supplement to the Japanese version of the text. (Here in the new English edition of Modern Quantum Mechanics we are providing a translation from Japanese of this supplement, prepared by Professor Akio Sakurai of Kyoto Sangyo University for the Japanese version of the book . The Editor deeply appreciates Professor Akio Sak urai's guidance on an initial translation provided by his student, Yasunaga Suzuki, as a term paper for the graduate quantum mechanics course here at the University of Hawaii-Manoa.) ADIABATIC THEOREM
Let us first consider a spin with magnitude S in a uniform time-independent magnetic field. Due to the magnetic moment associated with the spin, the energy levels of this quantized system are split into 2 S + ! levels. The state ket I n) with the nth energy eigenvalue En undergoes a change in phase factor as (for a Hamiltonian constant in time)
I n) ---> e -iEntlh I n)
(S . l)
in time evolution, as we learned in Chapter 2. This is the time development of the stationary state . We next change the direction of the magnetic field very slowly. Following the slowly varying magnetic field, the direction of the spin changes also. We keep its component in the direction of the magnetic field constant in time, however. Thus the initial quantum number n does not change at all with the changing magnetic field. This is a typical example of the adiabatic theorem in quantum mechanics. Generally speaking, the adiabatic theorem asserts that there should be mechanical invariants under the slow variation exerted externally on a mechanical system . The entropy in thermodynamic systems and action integral in classical mechanical systems are, like the quantum numbers in quantum mechanical systems, examples of adiabatic invariants. Here we
464
Berry's Phase
465
simply regard an adiabatic change as a time change that is very slow com pared with the time for proper motion in each system (such as periods of oscillation) . t We do not discuss how slow change can be regarded as an adiabatic change (see A. Messiah, Quantum Mechanics, New York: In terscience, 1961 , Chapter XVII). Our problem is to discuss how the state ket I n) of a quantum mechanical system changes under adiabatic changes. We find that apart from the simple extension of (S. l) , there is another phase factor dependent upon the path of an adiabatic process. The general expression was clearly given by Berry (M. V. Berry, Proceedings of the Royal Society of London. Vol. A392 (1984), p. 45) in his seminal paper. BERRY'S PHASE
We consider the Hamiltonian of the system with an external time-depen dent parameter R(t) by denoting it as H(R(t) ) . The ket l n (R(t)) ) of the nth energy eigenstate corresponding to R(t) satisfies the eigenvalue equa tion at time t, H(R(t)) l n(R(t)) ) = En (R(t)) l n(R(t)) )
(S.2)
H(R(t)) l n (R0) , t0; t) = ih i l n (R0) , t0; t)
(S.3)
where the ket ln (R(t))) has already been normalized. Let us make R evolve in time from R(O) = R0. Suppose that the state ket is ln(R0) , t0 = O;t) at time t. The time-dependent Schrodinger equation that the state ket obeys IS
at
where t0 = 0. When the change of R(t) is slow enough, we expect from the adiabatic theorem that l n (R0) , t0; t) would be proportional to the nth energy eigenket l n (R(t)) ) of H(R(t)) at time t. Thus we represent it as l n(Ro) , t0 ; t) = exp
{- � I:
}
En (R(t' ))dt' exp(i'Yn(t)) l n (R(t)) ).
(S.4)
The first factor on the r.h.s. is the usual dynamical phase factor summing up all phase changes in stationary states [c.f. (2.6.7)]. On the other hand, by substituting (S.4) into (S.3), the second phase factor exp(i-yn(t)) is shown to be determined by
tFor adiabatic invariants in classical periodic systems and Ehrenfest's adiabatic hypothesis, see S. Tomonaga, Quantum Mechanics /. (Amsterdam: North-Holland Press, 1962).
Adiabatic Change and Geometrical Phase
466
d n(t) dt -y
=
i(n(R(t)) I VRn(R(t)) )
d R(t). dt
(S.5)
Therefore, 'Yn (t) is represented by a path integral in the parameter (R - ) space: R (t)
J
"fn(t) = i R (n(R(t' )) I VRn(R(t' )) ) dR(t' ) 0C
(S.6)
(n(R(t)) l n(R(t))) = 1
(S.7)
where the path C of integration is that of the adiabatic process as the external parameter R changes from R0 to R(t). We can see that 'in is a real number by differentiating with respect to R both sides of the nor malization condition: ((n I VRn) is pure imaginary. ) Since the integral (S.6) contains the derivative of the eigenstate I VRn), the practical evaluation here might be expected to be complicated. However, if R describes a closed loop in parameter space returning to the starting point after time interval T (the periodicity) , as is in the case of R(1) = R0, the path integral becomes greatly simplified. First applying the Stoke's theorem, we transform the line integral along the closed loop C to a surface integral over the surface S(C) enclosed by the closed loop C: 'in (C) =
vn (R)
=
J.:rC (n(R) I VRn(R))dR = Im VRn
X
-
(n(R) I VRn(R))
JJ
S(C)
Vn(R) · dS ,
= Im ( VRn(R) I X I VRn(R)) =
Im
� ( VRn(R) l m(R)) x (m(R) I VRn(R)),
m>'n
where we have used the vector identity:
V.x [f(x) Vg(x)] = ( Vf(x)) x ( Vg(x)).
(S.8)
(S.9) (S. lO)
Suppose ln(R)) is modified by some phase factor exp[ix(R)]. Then, even if the integrand of line integral (S.S) changes as Vn (R) does not change because Vx Vx = O (see Section 2.6 on gauge in variance) . We have omitted the term m = n in the last line of (S.9) because
Degeneracy Point and Solid Angle-Geometrical Phase Factor
467
(n l VRn) is purely imaginary. On the other hand, the off-diagonal matrix element (m I VRn) is represented as follows. Multiplying (m I from the left to the equation
( VRH(R)) ! n(R)) + H(R) ( VR i n(R))) = ( VRE(R)) l n(R)) + En(R) I VRn(R))
(S. l2)
obtained by differentiating the eigenvalue equation (S.Z) with respect to the external parameter R, we get (S.13)
Therefore the integrand of the surface integral (S.8) is given by
Vn(R) = Im L m#n
J
(n(R) I VRH(R) l m(R)) X (m(R) VRH(R) l n(R)) (Em(R) - En(R))
(S . 14)
Here we notice that the vector Vn(R) passing through the surface of the integral S(C) is expressed by the differential form of degree 2 of the Hamiltonian H(R). Knowledge of the state ket ln(R)) itself is usually not needed to calculate Vn as we shall see later. To sum up, the state ket after one complete turn is expressed as
ln(Ro), to = 0 ; 1)
{ � J� - If
= exp(i-yn(C))exp -
"Yn(C) =
S(C)
}
En (R(t'))dt' l n(Ro)),
Vn(R) · dS.
(S.l5 )
(S. 16)
Here, "Yn(C) is called the Berry's phase, and comes from the adiabatic change of the external parameter. We shall next show that this phase is determined by the geometric properties in parameter space. DEGENERACY POINT AND SOLID ANGLE-GEOMETRICAL PHASE FACTOR
Suppose that the nth energy eigenstate l n(R)) is degenerate with the mth eigenstate I m(R)) for an external parameter R*. Although the degeneracy
468
Adiabatic Change and Geometrical Phase
point R* in the parameter space is by no means on the path of the actual adiabatic change, the existence of R* is essential for the value of 'Yn(C).
This is because the integrand Vn(R) of 'Yn(C) is singular at R * , as is evident from (S. 14). Let us continue our discussion by taking as an example, the problem of the spin in a magnetic field mentioned earlier (spin magnitude S , and magnetic moment gJJS!Ii . t When the magnetic field B(t) slowly changes its direction while keeping its magnitude constant, we denote the spin com ponent as Sz· = mli ( m = - S, . . . , + S) along the direction of the magnetic field at each instance (taking this to be the positive direction of the z'-axis). Writing the Hamiltonian of this system as the energy eigenvalue is
H(B) = - g JJS · B(t)/li,
(S. 17) (S. 1 8)
and
(S . 1 9)
The degeneracy point is at the origin of the parameter space, B = 0, where there are 2S + 1 fold degeneracies. When we rotate the direction of the magnetic field B keeping its magnitude constant and bring it back to the initial direction again, the vector B goes around the closed loop on a sphere of radius B . Then the vector Vm(B) can be expressed from (S. 14) as Vm (B) = I m
(m (B) JS!Ii l m ' (B)) x (m' (B) JS!Ii J m(B)) B2( m ' - m)z ,f;m "'
(S . ZO)
In order to make the calculation easier, let us take the z-axis of spin space to be the direction of the magnetic field (z ' -axis direction). Since only the terms with m' = m + 1 contribute to the intermediate states, we use the expression (3.5.41) for the matrix element of the spin operator: (S, m ± l J S)Ii J S, m) = �V(S+m)(S + m + l),
(S, m ± 1 J Syifl l S, m) =
�i V(S+ m)(S + m + I ) .
(S.21)
B y substituting (S.21) into (S.20) , we obtain the components of the vector Vm(B) as follows: = 0, (Vm(B))y' = 0 , (Vm(B))z . =
m · B2
tHere spin magnitude S is in unit of h, while vector S has dimension of h .
(S.22)
Experimental Verification
469
A round adiabatic change
it -space
R"
0
Degeneracy point
R
A round adiabatic change
FIGURE S.l. External parameter R-space. For the adiabatic change of one turn along the closed loop C, the geometrical phase is given by the surface integral of the vector V(R) penetrating through the surface S(C) surrounded by C (right). This is proportional to the solid angle that the closed loop C subtends at the degeneracy point R * (left).
Thus the vector V(B) is parallel to the magnetic field B and penetrates normally through the spherical surface of the B-space. Hence the phase "Ym(C) is given by "Ym(C)
=
-m
ff
S(C)
f
B2 · dB = - m cd!.l = - m!.l(C)
B
(S.23)
where !.l(C) is the solid angle which the closed circuit C subtends at the degeneracy point. This simple expression for "Ym (C) is dependent neither on the magnitude of the magnetic field nor on the spin which determines the dynamics of the system, but exclusively on the dimensionless quantum number m and the solid angle !.l(C). The phase factor exp(i"Ym (C)) accom panying the adiabatic change along the closed loop is regarded as a geo metrical phase factor associated with the degeneracy point that is singular. Figure S . 1 illustrates this integral for the geometrical phase in the parameter space. According to (S.23), the state ket changes its sign if m!.l(C) = ± 1T after one round trip of an adiabatic process. We notice that this is the case not only for fermions with m = half integer but also for bosons with m = integer. EXPERIMENTAL VERIFICATION
The experiment to verify (S.23) has been performed by letting polarized neutrons pass through a coil which produces a spiral magnetic field inside. The neutrons, having passed through one complete period of the spiral
470
Adiabatic Change and Geometrical Phase
field after time interval T, feel the same field as the initial ones. The solid angle on the r.h.s. of (S.23) can be continuously changed by changing the ratio of the magnitude of the spiral field to the magnitude of another field applied along the direction of the spiral axis. Let us assume that the initial spin of neutrons is in a mixed state of two eigenstates: parallel and anti parallel to the field. Under an adiabatic change the ratio of the numbers of these two states is preserved. The neutrons passing through the inside of the coil proceed while rotating their spins in the plane perpendicular to the field whose direction changes along the course [see (3.2.20)]. After passing through one period of the spiral field, the Berry's phase +0/2 is added according to (S.23) to the phases of the spin eigenstates parallel or anti-parallel to the field. As mentioned in Section 3.2, twice as many of them contribute to the rotational angle ofthe spin expectation value. There fore, besides the dynamical phase, the contribution from geometrical phase is added in the total angle of spin rotation, <j>, during one period of the field. To wit, = g�J.BT/h +
!1.
(S.24)
The value of is obtained from the measurement of the neutron polar ization. The measured value of il = - g�J.BT/h closely agrees with n determined by the experimental condition (the ratio of the fields for two directions) . Thus, the validity of (S.23) for the Berry's phase was confirmed (see Fig. S.2). This experiment was performed by T. Bitter and D . Dubbers in 1987. c:
21t
·a. <JJ
-
0
Cl)
Cl c: C1l
.Q
c:
e
-
TC
•
C1l
-
c:
-
:!:: .s:: <JJ
-6<J
FIGURE S.2.
•
•
•
� Theory prediction
•
1t/2
1t
31t/2
Solid angle n (steradian)
21t
Experimental data showing the existence of the geometrical phase. The shift ll
General Consideration of Two-Level Crossing
471
GENERAL CONSIDERATION OF TWO-LEVEL CROSSING For general cases with external parameter R = (x,y,z) , the same situation as the above example of spin in the field can be realized around the de generacy point R * if there are some crossing levels. Choosing R* as the origin of the parameter space, we suppose that the energy eigenstates I + (R) ) and 1 - (R)) are degenerate at R* = 0. Taking each eigenvalue to be such that E+(R) > E _ (R), we measure energy from the degeneracy point. We expand the Hamiltonian H(R) around the origin up to the first order in R, and perform an appropriate linear transformation. Then the general Hamiltonian connecting the two levels is expressed by a 2 x 2 Her mitian matrix ;
H(R)
1
= 2
(
Z
X - iY z X + iY -
)
1
= 2
R · a.
(S.25)
The energy eigenvalues are E± (R) = ± � R and the two levels conically cross at the degeneracy point. VRH(R) is equal to !a. These expressions are the same as those of the case m = ± ! for the spin in the field. Therefore, by integrating over surface S
v± (R)
(S.26)
the geometrical factor for general two-level crossing is obtained as:
As before, il(C) is the solid angle of the closed loop (adiabatic path) C, at the degeneracy point R*. We notice here that the expression (S.26) for the vector V ± (R) is equivalent to the magnetic field (2.6.74) produced by a magnetic monopole placed at the origin. The singularity of the degeneracy point in the param eter space is structurally the same as that of a magnetic monopole , and the magnetic field by a magnetic monopole is regarded as determining the geometrical phase. We can exploit further the analogy to the magnetic field ; for example, in comparing the two following relations
B(r) - V. x A (r) V(R) - Im VR x (n(R) I VRn(R)),
(S.28) (S.29)
472
Adiabatic Change and Geometrical Phase
we can interpret (n(R) I VRn(R)) as a kind of vector-potential caused by the singularity at the degeneracy point. As we have shown in (S. l l), this quantity is gauge dependent, but V(R), which corresponds to the magnetic flux density, B, is gauge-invariant. AHARANOV-BOHM EFFECT REVISITED In Section 2.6 we studied some examples where integrals along paths de termine the phase factors. Among them the effect of electric as well as gravitational fields gives rise to dynamical phase factors. On the other hand, the Aharanov-Bohm effect due to the magnetic field can be shown to be just a consequence of a geometrical phase factor. Let a small box confining an electron (charge e <0) make one turn along a closed loop C, which surrounds a magnetic flux line <1>8 (see Fig. S.3) . We take as R the vector connecting the origin fixed in the space and a reference point in the box. In this case the vector R is an external parameter in the real space itself. By using the vector-potential A to de scribe the magnetic field B, the nth wave function of the electron in the box (position vector r) is written as
(r l n(R))
= exp
{�: J�
}
A (r') · dr' IJ!n (r- R)
(S.30)
where IJ!n(r') is the wave function of the electron at the r' position coor dinates of the box in the absence of magnetic field. Then, in the presence
Mag n etic flux line
FIGURE S.3. The Aharanov-Bohm Effect. Electron in a box takes one turn around a magnetic flux line.
Complex System and Adiabatic Potential
473
of magnetic field, we can easily calculate the derivative of the wave function with respect to the external parameter to obtain
(n(R) I VRn(R)) X
{- �:
=
I
d3 x\fl: (r- R)
A (R) I!Jn (r- R) + VR lfln (r - R)
}
=
ieA(R) fie ·
(S.31)
The second term under the integral vanishes for the electron in the box. From (S.6) and (S.31) we see that the geometrical phase is given by
e e "Yn (C) = lic r,( c A · dR = fic
II
B(R) · dS S(C)
=
e fic B·
(S.32)
This result is just the expression (2.6.70) of the Aharanov-Bohm effect obtained earlier in Chapter 2 . COMPLEX SYSTEM AND ADIABATIC POTENTIAL Up to now, the quantity R that changes adiabatically is an external pa rameter. A more interesting question is to treat quantum mechanically some complex system made up of two constituent systems, where an adi abatically changing quantity is contained in the system as an internal dy namical variable . We shall discuss how the appearance of the Berry's phase in one of the constituent systems reversely influences the other constituent that moves slowly. The former can respond quickly to an adiabatic change caused by the latter. For background, let us first consider the problem of a polyatomic molecule which is composed of electrons and nuclei, a typical example of a complex system. When calculating the energy levels of ro tation and vibration of a molecule , we shall apply a method that has been used for many years. Consider the case of a diatomic molecule ; we write the Hamiltonian of the complex system as
where R is the relative position coordinate between the two nuclei , r1 , r2, . . . , rN the N-electron coordinates, and the center of mass of the molecule is chosen at the origin. The first, second, and third terms of the Hamiltonian correspond to the kinetic energy of nuclei, the energy of the electron system, and the interaction energies among nuclei themselves and between nuclei and electrons respectively. Since the nuclei with heavy mass move relatively slowly and electrons with light mass move quickly, we can use an adiabatic approximation here. We first fix the position of the nuclei
474
Adiabatic Change and Geometrical Phase
and determine the eigenstate of electrons I n(R)) corresponding to the fixed configuration of the nuclei
- Un(R)
l n(R)),
(S.34)
where Un(R) is the energy of the nth eigenstate. We next approximate the state ket of the complex system in the form of a product of the nucleus ket lA) and the electron ket 'I'
=
lA) l n(R)).
Substituting this into the Schrodinger equation H T 'I' system, we obtain
(S.35) =
E'l' of the complex (S.36)
Since H,j_R) is the differential operator with respect to coordinates of the nuclei R, it will operate on both the nucleus state ket l A) and the electron state ket l n(R) ) . However, we can ignore the derivative term of electron state ket with respect to R , as long as the amplitude of the relative motion of the two nuclei is small compared to the equilibrium distance between them. This approximation is the so-called Born-Oppenheimer approximation. Equation (S.36) then leads to a simple eigenvalue equation that involves only the coordinates of the nuclei, as follows: (S .37) In the above equation the effective potential Un(R) that determines the motion of the nuclei is called adiabatic potential. It is the sum of the Coulomb repulsive force between nuclei and the energy of electrons for fixed R. Un(R) is obtained for each level of the electron system. The minimum Un(R) gives approximately the equilibrium distance between nuclei about which the molecule vibrates and rotates. The corresponding energy levels are given approximately by
En - "nWn(N + !) 2 + ---'----'2/n N = 0,1 ,2, . . . . , L = 0,1 ,2, . . . . , =
h2L(L + 1)
(S.38)
where wn stands for the angular frequency determined by the curvature at the minimum of Un(R) and In is the moment of inertia of the molecule. If the two nuclei are identical, L is further limited to be an even or odd number by permutation symmetry.
Dynamical Jahn-Teller Effect
475
DYNAMICAL JAHN-TELLER EFFECT Our object is to investigate the role of the geometrical phase in a complex system. To this end, let us consider the case in which the electronic states are degenerate for a certain space configuration R * of the nuclei; that is, for a case such that the adiabatic potentials satisfy (S.39) and cross one another at R * . Let us remember that an adiabatic change of one turn around the degeneracy point R* in the R-space causes a geo metrical phase. Thus for the case of the diatomic molecule where the distance associated with the nuclei is a single one-dimensional parameter in determining the electron state, this is not enough to discuss the Berry's phase. We need to consider tri-atomic molecules (ions) or a group of atoms in a crystal. Here we use the latter one as an example for discussion purpose. Generally in crystals the electronic states in the crystal fields (e.g . , a pair of d-orbital states) are often degenerate in energy for certain atomic configurations of high symmetry. Then, in most cases, the crystal becomes slightly distorted to assume a lower symmetric configuration, and thus moves in a more stable state by lowering its electronic energy. This phe nomenon is what is called the Jahn-Teller effect. To make the argument simpler, consider the electronic state of the central ion surrounded by four atoms in a plane . Suppose that the four atoms are placed in a square configuration, for which electronic states I a) and I b) extending their orbitals in different directions respectively are degenerate. The deformation of the lattice then removes the two-fold degeneracy. Displacements of atoms from the equilibrium position, taken on the square lattice points, are expressed in terms of the normal modes of vibration. Among the several possible normal modes, we look at the two independent modes A and B as shown in Fig. S.4, which couple with the degenerate electronic states. Suppose that A and B have the same eigen frequency, w, and we denote their normal coordinates as Qa and Qb, respectively. t On the other hand, the
FIGURE S.4. Distortion of lattices causing dynamical Jahn-Teller effect. Two normal modes, Q. and Qb, are shown.
tour explanation for the Jahn-Teller effect is not exact but conceptual.
476
Adiabatic Change and Geometrical Phase
electron system is described by a linear-combination of the two degenerate levels
(S.40) Then the effective Hamiltonian of the normal vibration, which couples with the electron system , can be written in the form of p� + p � f.LW2 ( � + Q + Q�) - K - Qa Qb (S.41)
2J.L
(
2
Qb Qa
)
where Pa and Pb are the conjugate momenta to Qa and Qb, respectively; f.L is the effective mass of the normal vibration and K is the strength of the coupling between electrons and lattices. The matrix that represents the interaction operates on the electron state vector (c") . We have expressed Cb the interaction by the first order terms of the normal coordinates, Qa and Qb, assuming that the vibration of the atomic displacement is small. For normal coordinates written as
Qa
the eigenenergy solving
e
p cos 6 , Qb = p sin 6 ,
(S.42)
and the eigenket of the electron system are obtained by
- Kp as
=
(
- cos 6 sin 6
E = :!: Kp
� I a> - sin � I b) sin � I a> + cos ��b).
(S.43)
(S.44)
I + (6)) = cos
1 - (6))
=
(S.45)
From (S.41) and (S.44) the adiabatic potential for the normal coordinates is given by the two-valued function
(S.46)
If one illustrates the surface of the adiabatic potential as a function of Qa and Qb, a "Mexican hat" shape like Fig. S.5 emerges. The potential consists of two axially-symmetric surfaces and assumes a minimum value at p0 KIJ.LW2 .
-
477
Dynamical Jahn-Teller Effect
Qa FIGURE S.S. Adiabatic potential surface related to dynamical Jahn-Teller effect (S.46).
Here we notice that the electron state kets (S.45) change sign for rotation by 21T around the symmetry axis:
1 ± (6
= 21T)) = (
1 ) 1 ± (6 = 0)).
(S.47)
Formally, this is the same as the rotation of the spin with S = � discussed in Section 3.2, since the sinusoidal dependence of the ket on the rotation in (S.45) is given by just one half of the rotation angle, 6/2. Naturally, in our case , it is the orbital state ket of the electron system that has changed sign and thus has nothing to do with spins. The factor ( - 1) on the r.h.s. of (S.47) is easily shown to be just the Berry's phase factor. Here the origin of the QaQb-plane is a degeneracy point of the electron system. When the vector (Qa,Qb) describing the state of the lattice deformation circulates once around this origin, the corresponding "solid angle" amounts to 21T. Therefore, from the general expression (S.27) the electron state ket is multiplied by the geometrical phase factor exp(+i21T/2) = - 1 . It is to be noted that as early as in 1963, this kind of. Berry's phase was already taken into account in a very natural way by G. Herzberg and H . C. Longuet Higgins, when they solved the problem of a coupled system with degeneracy while studying poly-atomic molecules. As shown here, for such an electron-lattice system in which degen erate electronic states couple to two lattice vibrational modes, a minimum of the adiabatic potential does not form a well but a continuous valley. Thus, the dynamical deformation of. the lattice and the corresponding adi abatic state change of the electron system are both incorporated. This is the so-called dynamical Jahn·Teller effect. Now, let us consider the dy namical motion of the normal coordinates (Qa,Qb) (that is, (p , 6) ) under the adiabatic potential (S.46) and find the influence of. the Berry's phase on the dynamics. The two-dimensional Schrodinger's equation to be solved IS
a { (-iJp -a2 ) p iJp p iJ 6 li? iP 1 -z + 2 l.l
+
2
1
z
-
+ v:d
}
(p) 'l'(p,6)
=
E'l'(p,6) ,
(S.48)
478
Adiabatic Change and Geometrical Phase
where the wave function 'l'(p,e) is expressed by the product of the nucleus piece
'l'(p , e) = + (p,e) I + (e)) + <1>-(p , e) 1 - (e)) .
(S.49)
The adiabatic potential written as V�d in (S.48) is to be understood as acting on + in 'l'(p,e) for v;d and on <1>- for V,;d. Since (S.48) is a dif ferential equation with separable variables we can solve it by choosing the nucleus wave function as (S.SO)
Here the most important point is that the state function 'l'(p,e), which describes the whole coupled system consisting of electrons and lattices, must remain a single valued function. The state ket of the electron system changes sign when e is rotated by 21T. Therefore , in order to cancel this sign change ,
-
,
j = (/ +
!)
= half integer,
(/
=
integer).
(S.Sl)
The presence of the Berry's phase has changed the quantum number that characterizes the motion of atoms, from the usual integer number to half integer one. t If two adiabatic levels are well separated, we are allowed to solve the equation (S.48) by ignoring the mixing between + (p,e) I + (e)) and <1>- (p,e) 1 - (e)). Then the low energy excitations in the potential valley is approximately expressed by
Enj
-
=
h2(/ + i) (n + 1z) hw + 2 2 1-LPo
(n: integer, j: half integer)
(S . 52)
The result is just the sum of the energies of vibration along the radial direction and those of rotation in the plane of the normal mode coordinates. (The factor � in the numerator of the second term comes from the electron system . ) In the picture of lattice deformation they are the energies related to oscillation of the magnitude of the atomic displacement and transfor mation of the deformation patterns, respectively. These kinds of energy levels have been well analyzed experimentally through spectral measure ments for ions in crystals and poly-atomic molecular gases and also through several physical measurements on matter. So scientists in these fields have general discussion is given using path integral method and action in H. Kuratsuji and S. lida, Prog. Theor. Phys. 74 (1985), p. 439 concerning how Berry's phase modifies the quan tization condition of dynamical variables. tA
Universality of Geometrical Phase
479
been familiar with the half-integer quantization which appears in rotational levels for sometime. After Berry's paper, the precise spectral experiment on a tri-atomic molecular gas, Na3 was repeated again in 1986, and the half-integer quantization was reconfirmed by G . Delacretaz et at.
UNIVERSALITY OF GEOMETRICAL PHASE So far we have discussed a geometrical phase called the Berry's phase by taking quantum-mechanical systems as example. However, let us notice that the Planck's constant is lost in the general expression giving the phase in (S . 27). In this respect it is totally different from the dynamical factor (2.6.7) discussed in Chapter 2. Therefore, although the original derivation by Berry is based on quantum mechanics, the geometrical phase factor itself is expected to be universal, regardless of quantum-mechanical systems or classical-mechanical ones. The question is then, where can we find a geometrical phase in a classical system effected by an adiabatic change? Detection of the Berry's phase has been done for light , which is on the one hand very quantum-mechanical and relativistic, but on the other hand also describable by classical electrodynamics . By letting linearly po larized light pass through the inside of a spirally-twisted optical fiber, one observes the rotation of its polarization plane. According to quantum the ory a photon is a particle with spin one, and the spin in an eigenstate is either parallel to the direction of propagation or anti-parallel to it. (This leads to the helicity eigenvalue being + 1 or - 1 respectively.) Each ei genstate is a circularly polarized light with a rotational direction opposite to each other. When the linearly-polarized light, which is a combination of these two states, passes through the inside of a once-twisted optical fiber, the propagation vector (the relevant quantity for an adiabatic change here) describes a closed circuit. (At the same time the spin also rotates.) As a result the different Berry's phase is added to each component of the combination state, so that the polarization plane of light rotates. In the 1986 experiment performed by Chiao, Wu , and Tomita, the agreement between the rotational angle of the polarization plane and the solid angle determined by the degree of twist of the optical fiber was confirmed. On the other hand, the effect can also be explained by classical electrodynamics as a result of integration of continuous parallel transformation of the electric field vector inside the optical fiber. Although the directions of light at the entrance and the exit are parallel, a difference between the polarization planes is produced for the optical fiber which is twisted in the middle . Furthermore, even for a classical-mechanical system a geometrical phase factor emerges when we oscillate a conical pendulum on the revolving earth. Seen from the rest coordinate system the time needed for change
480
Adiabatic Change and Geometrical Phase
in the gravity direction at a certain place on the revolving earth is very much larger than the period of the conical pendulum. That is, the direction of the gravity acting on the pendulum adiabatically changes as the earth rotates. Let us take the rotational angular velocity of the pendulum in the orbital plane to be w . In the total rotational angle e of the conical pendulum after one day (earth period of revolution = 1) a deviation by �6 from the simply predicted value wT, a dynamical angle, is realized. That is,
6
= wT
+
M.
(S. 53)
The shift M is dependent on the latitude of the measurement point but is independent of the gravity and the length of the string which govern the motion of the pendulum. This is after all a geometrical phase and is gen erally called the Hannay's angle. So the phenomenon is quite similar to the rotation of a quantum spin in a varying magnetic field. Let us compare the two expressions for the total rotational angles in these two cases, (S.24) and (S.53). We may speculate that �6 = n. For the conical pendulum, this is the case . The Hannay's angle is equal to the solid angle subtended at the center of the earth by the closed circuit which the gravity vector acting on the pendulum draws following the rotation of the earth. This is a geometrical phase in classical mechanics corresponding to the Berry's phase in quantum me chanics. In this way geometrical phases accompanying adiabatic changes ap pear in various systems such as oscillating systems, optical systems, spin systems, molecular systems, and so on, regardless of whether we are dealing with classical or quantum mechanics. In solid state physics, this kind of argument has also been used to explain the quantum Hall effect besides the dynamical Jahn-Teller effect discussed above. Last but not least Berry's phase has also impacted field theory. Without prior knowledge of the geometrical phase, it has nevertheless been possible to accommodate such a phase naturally into theories if one treats precisely Schrodinger's equa tion, Newton's equation, Maxwell's equations, and the path-integrals. However, we can conclude by saying that our understanding of quantum mechanics has been deepened since the existence of geometrical phases was explicitly revealed.
SUPPLEMENT II
Non-Exponential Decays Radioactive nuclei furnish us with one of the best known examples of unstable, or decaying, systems. The assumption that the radioactive decay of any nucleus is a random process, and, furthermore, that even in a large sample each nucleus decays independently, leads to the familiar exponential
decay law,
N(t)
=
N(t0)e-•(t-to),
(T.la)
where N(t) is the number of nuclei that have not decayed after a time t, and >.. is a constant determined by the properties of the individual nucleus. Since the fluctuations in (T. la) are small only when the number of nuclei is large , we can rephrase the radioactive decay law as a probabilistic state ment: The probability that any particular nucleus will not undergo a decay in a time interval between t0 and t is given by
P(t) = e -•(t-to), t > t0
(T. l b)
The quantity P(t) may be called the survival probability. Notice that P(t0) = 1 ; this is the statement of our knowledge that the nucleus under examination had not decayed at t = t0. Here, we investigate the form of P(t) for quantum mechanical systems. This is necessary because within the framework of quantum mechanics, probabilities are derived from funda mental probability amplitudes, so that it is not a priori obvious that the assumptions leading to (T. l ) can be consistently incorporated. In keeping with our intuitive notions, we will say that a quantum system is in an unstable state if the survival probability for that state reduces to zero monotonically with time. t If the state vector of the system at time t is given by l qi, t) (here, qi denote all quantum numbers necessary to specify the state) , the survival probability P(t) is given in terms of the survival amplitude (c.f. the analogous discussion in Chapter 2 of text) , (T.2) by
P(t)
(T.3)
t it is important to stress the requirement of monotonicity, since it is possible to find examples where the survival probability exhibits an oscillatory behavior (sec, e.g., Rabi oscillations, Sec. 5.5 of text. Notice that the oscillatory behavior persists even when the external potential is time independent). 481
482
Non-Exponential Decays
For an isolated system, the Hamiltonian is time independent, and the unitary operator U(t, t0) in (T.2) that evolves the states from an initial time t0 to a later time t is given by U(t, t0) = e- iH(t - to)lh_ We thus see that if I q;, t) is a normalized stationary state so that it has a definite value of energy, a(t) is a pure phase and P(t) = 1 . Although an unstable state of a quantum mechanical system cannot be represented as one of the energy eigenkets, the assumption of com pleteness of the eigenkets of the Hamiltonian tells us that we can represent the unstable state as a linear superposition over these eigenkets with time independent complex coefficients; that is, (T .4) Here, S denotes a summation (integration) over the discrete (continuous) part of the spectrum of H. We can thus write (T.2) as,
a ( t)
=
S,e - h
(T.5)
It is reasonable to suppose that the unstable states are orthogonal to the (normalizable) discrete eigenstates of the Hamiltonian which, as we argued above, cannot decay. We will, therefore, assume that unstable states are associated with the continuum states of the Hamiltonian. This is in keeping with our intuitive expectation from the fact that an unstable particle decays only to continuum states; that is, the unstable state has non-van ishing projections only on continuum states into which it can decay. In this case, the survival amplitude in (T.5) can be written as the Fourier trans form , (T.6) of a real non-negative function (T.7) where (for later convenience) we have set t0 = 0. Notice that (E I q;) is just the wave-function for the decaying state in the energy representation, so that p( E) is just the probability density that an energy measurement of, this state yields a value E . If p( E) has the familiar Breit-Wigner form, (T.8a) we leave it to the reader (c.f. the discussion (5 .8.21) to (5.8.24) of text) to show that the corresponding survival probability has the form PBw(t)
=
e- rl t -toll' ,
(T.8b)
so that the decay of the state is indeed described by the exponential decay law.
Non-Exponential Decays
483
We will now show that with some very reasonable assumptions, Eq. (T. l b ) can never be exact regardless of the nature of the quantum system. In particular, we will show that for very long times, the survival probability must decrease at a rate that is slower than exponentiaL To see this, we first remark that any interacting quantum-mechanical system must have a ground state; that is, there is a state I E0), with H I E0) = Eo I E0) such that Eo is the smallest eigenvalue of H. If this was not the case, it would be possible to extract arbitrarily large amounts of energy from the system by inter actions that send it to eigenstates of increasingly lower energy. The exis tence of the ground state obviously implies that the probability density function p(E) introduced via (T.7) above vanishes if E < E0. To proceed further, we introduce a theorem due to Paley and Wie ner (R. Paley and N. Wiener, "Fourier Transforms in the Complex Do main," Am. Math. Soc. Pub. VoL XIX, 1934), which can be stated as follows: If a(t) and p( E ) are Fourier transforms of each other such that p(E ) = 0 for E < Eo and f':-oo dtla(t) ! Z is finite, then f':.oo dtl ln l a(t) l l!(l + f) is also finite . We will see below (where we follow essentially the argument of L. A. Khalfin, Sov. Phys. JETP VoL 6, pg. 1053, 1958) that this remarkable result strongly constrains the long time behavior of the survival probability. First, we note that the fact that p(E) is real implies a*(t) = a( - t) so that if the survival probability decays sufficiently fast for large times, J'>Ooo dtla(t) i1 is finite, and the Fourier transform pair a(t) and p(E) in (T.6) indeed satisfies the conditions for the Paley-Wiener theorem. Thus, for large times, l ln l a(t) I I is required to increase slower than linearly, and hence the exponential behavior for I a(t) I is precluded. We point out that while our general argument implies that the long time behavior of the survival probability cannot be exactly exponential, it does not shed any light on when the non-exponential behavior sets in or on how much the survival probability deviates from the exponential form. The former requires the specification of a time scale other than the lifetime, and so is probably dependent on the details of the dynamics that causes the decay. We also remark that the Breit-Wigner form P nw(E) in (T.Sa) (which led to the exponential form for the survival probability (T.Sb)) is non-vanishing for all values of E , and so violates the conditions of appli cability of the Paley-Wiener theorem. Physically this form of p(E) violates our assumption of the existence of a ground state, so that the resulting exponential behavior can be, at best, an approximation to the real situation. It will be left as an exercise for the reader to check that if we cut off the Breit-Wigner form of p, the survival probability P(t) decays according to a power law for large times. We stress here that the fact that we have not seen deviations from the exponential decay law at large times does not vitiate our general ar-
484
Non-Exponential Decays
guments that do not tell us when the non-exponential behavior sets in. It could be that non-exponential decay occurs only after many lifetimes in which case it would be very difficult to observe. There are, however, equally general arguments that imply deviations from the exponential decay law for very short times (defined precisely below) which we turn to now. We begin by noting (our discussion essentially follows that of B . Misra and E . C . G . Sudarshan, J . Math. Phys. Vol. 18 , 756, 1977) that the time derivative of the survival amplitude (T.6),
daldt
=
J dE( - iE/Ii)e - i''1hp(E),
(T.9)
is a well defined, continuous function of t if we further assume (as is physically reasonable) that the decaying state has a finite energy expec tation value. We can, therefore, evaluate the rate of change of the survival probability,
dP(t = T)/dt
=
(da* (T)/dtja(T) + a*(T)[da(T)/dtj = - ti( - T)a(T) + a( - T)ti(T)
(T l O) .
where we have used a*(t) = a( - t) and the dot denotes differentiation with respect to the argument. We thus see that,
dP(t = O)Idt = 0,
(T. ll)
which of course means that the survival probability cannot be an expo nential at t = 0! We remind the reader that in (T.6), t 0 was defined as the latest instant of time when we knew that the system had not yet decayed. This implies that t = 0 is also the time when the system was last observed or, in Dirac's terminology, the time of preparation of the state of the system. We remark that while our argument clearly precludes the possibility of exponential decay at t = 0, it sets no limitations on when exponential decay may set in. Taking this analysis a step further, we can consider successive ob servations of a system. As discussed above, (T. l l) must hold after each of these observations. The implications of this are schematically illustrated in Fig. T. l . The survival probability for a system prepared at t = t1 is shown by the curve labeled 1 in Fig. T. l . Notice that, in keeping with t1. If this system is allowed to (T. ll), the slope of the curve is zero at t evolve undisturbed, the survival probability will follow the first curve which may rapidly take an exponential form (of course, for very large values of t, not shown in the figure, we have seen that the decay will be slower than exponential). The size of the flat region at t = t1 is not determined by our general arguments and must be dependent on dynamical details. If, however, an observation is made on this system at t = t2, our arguments above imply that (T. ll) must hold at t = t2, so that the system =
=
Non-Exponential Decays
485
t FIGURE T .1. A schematic of the short time behavior of the survival probability P(t) of a state of a quantum mechanical system prepared at I 11 is illustrated by the curve labeled 1 . The curves labeled 2, 3, 4 show how P(t) is modified by observations of the state at times 12, 13, 14. Notice that each successive measurement increases the survival probability from what it would have been had the system not been disturbed by that measurement. =
now follows the curve labeled 2 . Successive observations at t = t3 , t = t4 , etc . , would make the system evolve along the other curves in the figure. We see that the qualitative effect of these successive observations is to increase the survival probability of the system being observed from what it would have been if the system was allowed to evolve undisturbed after its preparation at t t1 . This inhibition of the decay appears to be a feature of quantum mechanics, once we grant that each observation of the system corresponds to the preparation of the quantum mechanical state. These arguments imply that it should be possible to slow down the decay of a quantum mechanical system by making successive rapid obser vations on it. In the limit of continuous observation (provided that this can be operationally defined), an unstable system would not decay. This has been referred to as the quantum Zeno effect. Deviations from the exponential decay law have never been ob served for radioactive nuclei or for the well measured KL mesons of particle physics. W. M . ltano et at. , Phys. Rev. Vol. A41, 2295 , 1990, have how ever recently demonstrated that the rate for transitions between two dis crete levels of a quantum system is indeed inhibited by successive mea surements of the state of the system. These authors have further shown that the amount of inhibition increases with the frequency with which the system is monitored. They argue that their results are completely consistent with the expectation from quantum mechanics assuming, as usual, that each measurement is effectively equivalent to preparing afresh the state =
486
Non-Exponential Decays
of the system . This serves to bring home the fact that while this inhibition of transitions may be somewhat surprising, it is a consequence of the structure of quantum mechanics. At this point, it is fair to warn the reader that we have only touched upon the basic question in our discussion of the quantum Zeno effect. There is considerable (and somewhat controversial) literature on the phys ical implications of this result. It is not our purpose here to discuss these in detail, nor is it our purpose to enter into various subtleties (both physical and philosophical) of the issues involved. For this, we refer the reader to the original literature. To sum up, we have presented very general arguments to demon strate that the decay of any quantum mechanical state must deviate from the familiar exponential decay law, both for very long and for very short times as measured from the instant of preparation of the state of the system. These deviations have not been experimentally observed for systems that exhibit (at least approximately) exponential decay. Since these arguments refer only to asymptotically long and short times, they do not preclude the possibility that a decaying quantum mechanical system may exhibit (es sentially) exponential decay over most of its lifetime . Indeed by making suitable approximations, the exponential decay law has been derived (see, e.g., Merzbacher [1970), Chapter 18) for model systems whose decay can be described in perturbation theory. It is instructive to study these ap proximations and trace why they are invalid for very long and very short times (as they must be in view of our general considerations).
Bibliography
The literature in quantum mechanics is very large and very diverse. We present here a bibliography of those books to which we refer specifically in our text as well as a reference section of more-elementary texts in quantum mechanics (generally at the undergraduate level) that provide the back ground to our book (addressed primarily to the first-year graduate student). Since many books have been written about elementary quantum mechanics, the list in this reference section is, of course, not meant to be complete. The editor has studied some of the suggested books, read others, glanced at a few, and likely missed others. The editor apologizes in advance if, as is all too probable, there are glaring omissions. Throughout the book little attempt has been made to quote original papers; the few that have been, quoted are noted in the footnotes or the text itself. This we believe to be appropriate in a book whose primary purpose is to serve as the textbook to a basic beginning graduate course. Baym, G. Lectures on Quantum Mechanics, New York: W . A. Benjamin, 1969. Bethe, H. A., and R. W. Jackiw. Intermediate Quantum Mechanics, 2nd cd., New York: W . A. Benjamin, 1 968. Bieden am , L. C., and H. Van Dam. (eds) Quantum Theory of A ngular Momentum , New York : Academic Press, 1965. Bj orken, J. D., and S. D. Drell, Relativistic Quantum Fields , New York; McGraw-Hill,
h
1965.
Bohm, D. Quantum Theory , En glew d Cliffs, NJ: Prentice-Hall, 195 1 . Dirac, P . A . M . Quantum Mechanics, 4th ed . , London: Oxford Univ. Press,
oo
1 958. 487
488
Dirac, P. A. M.
Bibliography
Quantum Mechanics ,
4th ed., London: Oxford Univ. Press, 1958.
Edmonds, A. R. A ngular Momentum in Quantum Mechanics, Princeton, NJ: Prince ton University Press, 1960. Fermi, E. Nuclear Physics , Chicago: University of Chicago Press, 1950. Feynman, R. P., and A. R. Hibbs. Quantum Mechanics and Path Integrals, New York: McGraw-Hill, 1 964. Finkelstein, R. J. Nonrelativistic Mechanics, Reading, MA: W. A. Benjamin, 1973. Frauenfelder, H., and E. M. Henley. Subatomic Physics, Englewood Cliffs, NJ: Prentice-Hall, 1974. French, A. P., and E. F. Taylor. A n Introduction to Quantum Physics, New York: W. W. Norton, 1978. Goldberger, M. L., and K. M. Watson. Collision Theory , New York: Wiley, 1964. Goldstein, H. Classical Mechanics , 2nd. ed., Reading, MA: Addison-Wesley, 1980. Gottfried, K. Quantum Mechanics, vol. I, New York: W. A. Benjamin, 1966. Jackson, J . D. Classical Electrodynam ics, 2nd ed . . New York: Wiley, 1975. Landau, L. D . ; and E. M. Lifschitz. Quantum Mechanics, Reading, MA: AddisonWesley, 1958. Merzbacher, E. Quantum Mechanics, 2nd ed. , New York: Wiley, 1970. Messiah, A. Quantum Mechanics, New York: Interscience, 1 96 1 . Morse, P . M., and H . Feshbach. Methods of Theoretical Physics , (2 vols . ) , New York: McGraw-Hill, 1 953. Newton, R. G. Scattering Theory of Waves and Particles , 2nd. ed., New York: McGraw-Hill, 1982. Preston, M. Physics of the Nucleus , Reading, MA: Addison-Wesley, 1962. Sakurai, J. J. A dvanced Quantum Mechanics, Reading, MA: Addison-Wesley, 1967. Sargent III, M., M. 0. Scully, and W. E. Lamb, Jr. Laser Physics, Reading, MA: Addison-Wesley, 1974. Saxon, D. S. Elementary Quantum Mechanics. San Francisco: Holden-Day, 1968. Schiff, L. Quantum Mechanics, 3rd. ed. , New York: McGraw-Hill, 1968. Tomanaga, S. Quantum Mechanics /, Amsterdam: North-Holland, 1962. Whittaker, E. T. , and G. N. Watson. A Courst of Modern Analysis, New York: Cambridge University Press, 1965 . REFERENCE SECTION
Dicke, R. H., and J. P. Wittke, Addison-Wesley, 1 960.
Introduction to Quantum Mechanics,
Reading, MA:
A very readable but terse book at the junior or senior level. It discusses a few topics, notably quantum statistics, not usually treated in textbooks at this level. Feynman, R. P., R. B. Leighton, and M. Sands, The Feynman Lectures vol. 3, Quantum Mechanics , Reading, MA : Addison-Wesley, 1965.
on Physics,
This is an introduction to quantum mechanics that approaches the subject from the viewpoint of state vectors, an approach that- together with Dirac's classic - greatly stimulated the spirit and substance of the earlier chapters of our own book. The many fascinating examples (for example, the Stern-Gerlach experiment) discussed with a minimum of formalism are to be noted in particular. Gasiorowicz, S.
Quantum Physics,
New York: Wiley 1974.
Reference Section
489
This book is at a level comparable to that of Dicke and Wittke 1960. Gasiorowicz is known to be a very effective teacher, who makes difficult concepts simple and clear even in advanced research areas of particle theory. Hence it is not surprising that his quantum physics book reflects that clarity of purpose that enhances the readability of the book. Mandl, F.
Quantum Mechanics,
London: Butterworths Scientific Publications, 1957.
This book contains a clear and compact presentation of the foundation and mathematical principles of wave mechanics at a level comparable to Gasiorowicz's book. Matthews, P. T. 1 974.
Introduction to Quantum Mechanics,
3rd ed., London: McGraw-Hill,
This third edition is lucid, concise, self-contained, and addressed to the undergraduate. It is a mathematically oriented introduction to development in wave and matrix mechanics, based on clearly defined postulates. Matt, N. F. 1 952.
Elements of Wave Mechanics,
London: Cambridge University Press,
A short but delightful treatment of wave mechanics at an elementary theoretical level by one of the pioneers. Park, D. 1 974.
Introduction to the Quantum Theory,
2nd. ed., New York: McGraw-Hill,
Intended for advanced undergraduates, this attractive book by an outstand ing pedagogue has a general introduction to quantum theory, followed by selected applications to various branches of physics, illustrating different aspects of the theory. The applications appear to require little teaching, and hence students can use the book as a reference. Pauling, L., and E. B. Wilson, McGraw-Hill, 1 935.
Introduction to Quantum Mechanics ,
New York:
Though one of the older texts, it is still well worth studying. There is a surprisingly complete treatment of the hydrogen atom and harmonic oscillator from the wave mechanics viewpoint. Rojansky, V. 1938.
Introductory Quantum Mechanics,
Englewood Cliffs, NJ: Prentice-Hall,
Despite its vintage, this book remains a classical text, especially valuable for its detailed coverage of the fundamental principles of wave and matrix mechanics.
Index
Abelian, definition of, 49 Absorption, in classical radiation fields, 335-37
Absorption-emission cycle, 321-22 Adiabatic potential, 474 complex system and, 473-74 Adiabatic theorem, 464-65 Aharonov-Bohm effect, 136-39, 472-73 Airy function, 108-9 Algebra, bra-ket, 60 Alkali atoms, fine structure and, 304-7 Angular frequency, Planck-Einstein relation and, 71 Angular integration, helium atom and, 367 Angular momentum angular-velocity vector and, 6 eigenvalues and eigenstates of, 188-95 orbital, 195-203 eigenvalues of, 31 parity eigenket of, 255 quenching of, 284 rotation generation and, 196-98 spherical harmonics and, 198-203 Schwinger's oscillator model of, 217-23 of silver atoms, 2 uncoupled oscillators and, 217-21
Angular momentum addition, 203-17 examples of, 203-206 formal theory of, 206-210 Angular momentum addition rule, proof of, 456-57 Angular momentum commutation relations, 187-89
ladder operators and, 188-89 rotations and, 152-58 2 x 2 matrix realizations, 165 Angular momentum operator, 156-157 Angular momentum operator, matrix elements of, 191-92 Angular velocity vector, angular momentum and, 6 Annihilation operator, 90-91, 147, 218 Anomalous Zeeman effect , 307 Anticommutation relations, 28 Antilinear operator, 269, 273 Antisymmetrical states, 257 Antiunitary operator, 269, 273, 277, 422-23
Argand diagram, 403 Argon, Ramsauer-Townsend effect and, 413
Associative axiom of multiplication, 16-17 491
492
Index
Atomic spectroscopy, 159 Axial vectors, 254 Baker-Hausdorff formula, 96 Balmer splittings, fine structure splittings and, 307 Base kets change of basis in, 36-37 eigenkets as, 18-20 spin 112 systems and, 22 transition amplitudes and, 87-89 Baym, G . , 236 Bell's inequality, quantum mechanics and,
229-32 Berry, M. V . , 464, 465 Berry's phase, 465-80 and polarized light, 479-80 Bessel functions, properties of, 452-53 Bethe, H. A . , 432 B iedenharn, L. C., 217 Bitter, T., 470 Bloch, F., 432 Bloch's theorem, 265 Bohr, N . , 76, 391 , 432 Bohr atom, 1 Boltzmann constant, 184 Born, M . , 50, 90, 100, 102, 187 Born amplitude, first-order, 386, 393, 442,
446 Born approximation, 386-90, 426, 428, 441 Born-Oppenheimer approximation, 474 Bose-Einstein condensation, 363 Bose-Einstein statistics, 361 Bosons, 361-63 Bound states low-energy scattering and, 410-18 quasi-, 419 zero-energy scattering and, 413-16 Bra, matrix representation of, 21 Bra-ket algebra, 60 Bra space, 13 Breit-Wigner form, 482, 483 Breit-Wigner Formula, 421 Brillouin, L., 105 Brillouin zone, 265 Canonical commutation relations, 50-51 Canonical ensembles, 185-87 Canonical momentum, 130, 135 definition of, 249 Cartesian tensors, 234-36 Cayley-Klein parameters, 170
Central force problem, Schrodinger wave equation and, 450-54 Classical physics, symmetry in, 248-49 Classical radiation fields, absorption and stimulated emission, 335-37 Clebsch-Gordan coefficients, 206, 238-40 properties of, 208-10 recursion relations for, 210-15 rotation matrices and, 215-17 Clebsch-Gordan series, 216 Clebsch-Gordan series formula, 237 Closure, 19 Collective index, 31, 296 Color singlets, 377 Commutation relations, 28 Commutativity, angular momentum and,
152-58 Commutators, 66 , 86 Poisson brackets and, 50-51 Completeness relation, 19 Complex conjugate transposed, 20 Complex numbers, quantum mechanics and, 27-28 Complex systems, adiabatic potential and,
473-74 Complex vector space, spin states and, 10 Compton effect, 1 Constant potentials, gauge transformations and, 123-25 Continuous spectra, 41-42 Correlation amplitude, energy-time uncertainty relation and, 78-80 Correlation function, 146 Coulomb case, partial-wave analysis and,
438-41 Coulomb interaction, Hamiltonian of, scattering solutions of, 435-38 Coulomb scattering, 433-41 Coulomb scattering amplitude, 458-62 Coulomb scattering formalism, mathematical details of, 458-63 Creation operator, 90-91 , 147, 218
Dalgarno, A., 298 Davisson-Germer-Thompson experiment, 1 de Broglie, L., 47, 68, 100 de Broglie's matter waves, 1 Decay width, energy shift and, 341-45 Decays, non-exponential, 481-86 Degeneracy, 29, 61, 250-51 exchange, 358 Kramers, 281
493
Index Degeneracy point, geometrical phase factor and, 467-69 Delacretaz, G . , 479 Density matrix, 178 of completely random ensemble, 183 continuum generalizations and, 182 Density operator definition of, 177 ensemble averages and, 176-81 Hermitian, 178 pure vs. mixed ensembles and, 174-87 time evolution of, 243 Detailed balance, 335, 424 Diagonalization, 39-40, 66, 90 Diamagnetic susceptibility, 350 Dicke, R. H . , 488 Differential cross section, 385 Dipole operator, 338 Dirac 8 function, 41 Dirac, P. A. M . , 1 , 1 1 , 23-4, 50-52, 84,
107, 118-19, 143, 325, 332 Dirac bra-ket notation, 273 Dirac notation, 8 Clebsch-Gordan coefficients and, 209 Dirac picture, 319 Direction eigenkets, 199 Dispersion, 34 Double-bar matrix element, 239 Dual correspondence, 13 Dubbers, D . , 470 Dyadic tensors, 234 Dyson, F. J . , 73, 326 Dyson series, 73, 325-26
Ehrenfest, P., 87 Ehrenfest's theorem, 87, 126, 131 Eichinvarianz, 135 Eigenbras, eigenkets and, 13 Eigenfunctions, 53 Eigenkets, 12 angular momentum, 189-91 as base kets, 18-20 direction, 199 eigenbras and, 13 energy, 74-75 simple harmonic oscillator and, 90-94 Hermitian operators and, 61 observables and, 17-18 parity, 255 position, 42-43 simultaneous, 30 zeroth-order, 298
Eigenspinors, 277-78 Eigenstates, 12 angular momentum, 188-95 time reversal and, 279-80 Wigner-Eckart theorem and, 239 energy, parity properties of, 255-56 zeroth-order, 347 Eigenvalues, 12 angular momentum, 188-95 degenerate, 29 energy, simple harmonic oscillator and,
90-94 energy eigenkets and, 74 expectation values and, 25 Hermitian operators and, 17 orbital angular momentum squared and,
31 Eikonal approximation, 392-94 partial waves and, 404 Einstein, A ., 226-27 Einstein-Debye theory, 1 Einstein-Podolsky-Rosen paradox, 277 Einstein's locality principle, 226-29 Elastic scattering, 444-45 Electric dipole approximation, 337-39 Electric fields, charged particles in, 280-81 Electromagnetic fields, polarization vectors of, kets and, 10 Electromagnetism, gauge transformations in, 130-36 Electron spin, magnetic moment and, 2-4 Emission, in classical radiation fields,
335-37 Energy eigenkets, 74-75 simple harmonic oscillator and, 90-94 Energy eigenstates, parity properties of,
255-56 Energy eigenvalues, simple harmonic oscillator and, 90-94 Energy shift, decay width and, 341-45 Energy-time uncertainty relation, correlation amplitude and, 78-80 Ensemble average definition of, 177 density operator, 176-81 Ensembles canonical, 185-87 completely random, 175 density matrix of, 183 mixed, 176 pure, 24, 176 time evolution of, 181-82 Entropy, 184
494 Equation of motion Euler, 243 Heisenberg, 83-84, 95, 243, 249 Euclidian space, 35 Euler angle notation, 221 Euler rotations, 171-74, 243 Exchange degeneracy, 358 Exchange density, 365 Expectation values, 25, 160 Hermitian operators and, 35 time dependence of 75-76 Experimental verification geometric phase of, 469-70
Feenberg, Eugene, 390 Fermi-Dirac statistics, 361, 376, 378 Fermions, 361-63 Fermi's golden rule, 332 Feshbach, H., 1 13 Feynman, R. P., 1 16, 1 18, 488 Feynman's formulation, 1 1 7-23 Feynman's path integral, 121-23, 138 Filtration, 25-26 Fine structure , spin-orbit interaction and, 304-7 Finite rotations infinitesimal rotations and, 152-55 noncommutativity of, 153 spin 112 systems and, 158-68 Finkelstein, R. J . , 149 Form factor, 431 Fourier decomposition, 345 Fourier inversion formula, 345 Fourier transform, 431 , 482, 483 Fractional population, 175 Franck-Hertz experiment, 1 Frauenfelder, H . , 279 Free-particle states, scattering and, 395-99 Free particles Heisenberg equation of motion and, 85 Schriidinger wave equation and, 446-47 Fulton, T . , 434 Fundamental commutation relations, 50-51
Gasiorowicz, S . , 488-89 Gauge invariance, 135-36 Gauge transformations constant potentials and, 1 23-25 electromagnetism and, 130-36 Gaussian potential, 443
Index
Gaussian wave packets, 57-59, 64, 67, 100, 1 12 Gauss's law, 140 Gauss's theorem, 401 Geometric phase experimental verification, 469-70 universality of, 479-80 Geometrical phase degeneracy point and, 467-69 solid angle and, 469 Gerlach, W., 2 Goldstein, H ., 38, 17 1 , 250 Gottfried, K . , 26, 147, 3 1 1 , 348, 349 Gravity, quantum mechanics and, 126-29 Green's function , 1 1 1 , 390, 424, 441 Green's operator, 425
Hamilton, W. R . , 100 Hamilton-Jacobi equation, 149, 392 Hamilton-Jacobi theory, 103 Hamiltonian, of Coulomb interactions, 435-38 Hamiltonian matrix, for two-state systems, 348 Hamiltonian operator, 7 1 , 90, 98, 99, 143-45 time-dependent, 73 time-independent, 72-73 two-state systems and, 62 Hamilton's characteristic function, 103-4 Hankel functions, 405, 439, 453 Hankel's formula, 438 Hannay's angle, 480 Hard-sphere scattering, 406-10 Harmonic oscillator. see also Simple harmonic oscillator isotropic, 346 Harmonic perturbation, 333-35 Heisenberg, W . , 1 , 48, 50, 100, 187 Heisenberg equation of motion, 83-84, 95, 243, 249 Heisenberg picture, 80, 1 14-15, 143-44, 181, 318-19 base kets and, 87-89 state kets and observables and, 82-83 Heisenberg uncertainty principle, 3, 58 Helium atom, 366-70 Helmholtz equation, 390 Henley, E. M . , 279 Hermite polynomials properties of, 450 Hermitian adjoint, 15
Index
Hermitian operator, 27, 45, 66, 71, 85, 90, 96, 98, 144, 156-57, 178-79, 274, 279 anticommute, 63 eigenvalues of, 17 expectation values of, 35 Hermiticity, 40, 178, 391 Herzberg, G., 477 Hilbert, D. , 1 1 , 100 Hilbert space, 1 1 Hooke's law, 90 Hydrogen atom ground state of polarizability of, 298 linear Stark effect and, 302-4 Schrodinger wave equation and, 454-55 Hydrogen-like atoms fine structure and, 304-7 Zeeman effect and, 307-1 1
Identical particles, scattering and, 421-22 Identity operator, 19, 29 spin 112 systems and, 22 Incoherent mixtures, 175 Inelastic scattering, 429-34 nuclear form factor and, 433-34 Inertia, moment of, computation of, 6 Infinitesimal rotation operator, 156-57, 196 Infinitesimal rotations commutativity of, 155 finite rotation and, 152-55 quantum mechanics and, 156-58 vector operator and, 233 Infinitesimal time-evolution operator, 70 Inner products, 13-14 Integral equation, 380 Interaction picture, 318-19 Irreducible tensors, 234-36 Isospin, 221 Itano, W. M., 486
Jackson, J. D . , 339 Jacobi identity, 51 Jahn-Teller effect, 475-79, 480 Jordan, P . , 50, 100, 187
Ket equation, 380 Ket space, 1 1-12 bra space and, 13 operators and, 14-15 orthonorrnality and, 65
495
Kets, 8-9 base change of basis in, 36-37 eigenkets and, 18-20 spin 112 systems and, 22 transition amplitudes and, 87-89 definition of, 1 1 electromagnetic field polarization vectors and, 10 normalized, 14 null, 1 1 operators and, 14 spin, 160-61 orbital state, 477 perturbed, normalization of, 293-94 state Schrodinger and Heisenberg pictures and, 82-83 time and, 69-71 vacuum, 218 Khalfin, A., 483 Kinematic momentum, 130, 135 Kramers, H. A ., 105 Kramers degeneracy, 281 Kronecker symbol, 4 1 Krypton, Ramsauer-Townsend effect and, 413
Ladder operators, angular momentum commutation relations and, 187-89 Lagrange equation, 249 Lagrangian, classical , 1 16-17, 138 Laguerre polynomials, 454 Lamb shift, 303, 349 Lande's interval rule, 306 Laplace-Fourier transform, 1 14 Laplace transform , 435 Laplace transform space, 458 Laporte's rule, 260 Lattice translation, as discrete symmetry, 261-66 Lattice translation operator, 262-64 Legendre function, 442 Leighton, R. B . , 488 Lewis, J. T. , 298 Light circularly polarized, mathematical representation of, 9-10 polarized and Berry's phase, 479 polaroid filters and, 6-10 x-polarized, 6-10 y-polarized, 6-10
Index
496
Linear Stark effect, 302-4 Liouville's theorem, 181 Lippmann-Schwinger equation, 379-86, 389, 424, 426, 447 Lippmann-Schwinger formalism, 441, 443 Longuet-Higgins, H. C . , 477 Lorentz force , 131, 139 equation of, 267
Magnetic fields charged particles in, 281-82 Stern-Gerlach experiment and, 2-4 Magnetic flux, fundamental unit of, 139 Magnetic moment electron spin and, 2-4 Magnetic monopoles, 140-43 Mandl, F., 489 Masers, 324-25 Matrices. see specific type Matrix elements angular-momentum operators, 191-92 double bar, 239 reduced, 241 tensors, 238-42 Matrix mechanics, 50 Matrix representations, 20-22 Matter waves, de Broglie's, 1 Matthews, P. T., 489 Maxwell-Boltzmann statistics, 362 Maxwell's equations, 140, 267 Mean square deviation, 34 Measurements quantum theory of, 23-26 selective, 25-26 Merzbacher, E . , 298, 347, 349, 350 Messiah, A . , 465 Minimum uncertainty wave packets, 58 Misra, B . , 484 Momentum canonical, 130, 135 definition of, 249 definition of, 54 kinematic, 130, 135 translation generation and, 46-50 Momentum operator, 59-60, 67 position basis and, 54 Momentum-space wave function, 55-57, 146 Morse, P. M . , 113 Motion, Heisenberg equation of, 83-84, 95, 243, 249 Mott, N. F. , 489
Multiplication associative axiom of, 16-17 noncommutative, 15-16 Muons, spin precession of, 161
Neutron interferometer, 150 Neutron interferometry 21T rotations and, 162-63 Newton's second law, 87, 123, 139 Non-Abelian, definition of, 158 Non-exponential decays, 481-86 Nonstationary states, 76, 257 Norm, 14 Nuclear form factor, inelastic scattering and, 433-34 Nuclear magnetic resonance, 159 Null kets, 1 1
Observables, 1 2 compatible, 29-32 eigenkets of, 17-18 incompatible, 29, 32-34 transformation operator and, 36-37 matrix representation of, 22 Schrodinger and Heisenberg pictures and, 82-83 unitary equivalent, 40-41 Operator equation, 233 Operator identity, 46 Operators, 12, 14-15. see also specific type definition of, 34, 65-66 time reversal and, 273-75 trace of, 38 Optical isomers, 259 Optical theorem, 390-92 Orthogonal matrices multiplication operations with, 169 Orthogonal groups, 168-69 Orthogonal matrices, 153-55 Orthogonality, 14, 17, 27, 52, 54, 209, 431 spherical harmonics and, 216 Orthohelium, 369-70 Orthonormality, 18, 19, 23, 30, 37, 61, 120, 209 ket space and, 65 Oscillation strength, definition of, 338 Oscillators. see also Simple harmonic oscillator uncoupled, angular momentum and, 217-21
Index Outer products, matrix representation of, 22
Paley, R., 483 Paley-Weiner theorem, 483 Parahelium, 369-70 Parity, 251-54 wave functions under, 254-56 Parity nonconservation, 260-61 Parity operator, 251 Parity-selection rule, 259-60 Park, D . , 489 Partial-wave amplituted, 400 Partial-wave analysis for Coulomb case, 438-41 Partial-wave expansion, 399-401 Paschen-Back limit, 310 Pauli, W., 163 Pauli exclusion principle, 265, 362 Pauli matrices, 164-65 Pauli two-component formalism, 163-65 rotations in, 165-68 Pauling, L., 489 Permutation operator, definition of, 358 Permutation symmetry, 357-61 Young tableaux and, 370-77 Pertubation , 286 constant, 328-33 harmonic, 333-35 Pertubation expansion, formal development of, 289-93 Perturbation theory time-dependent, 325-35, 353, 355 scattering and, 426-29 time-independent degenerate, 298-304, 354 nondegenerate, 285-98 variational methods in, 313-16 Phase shifts determination of, 405-6 unitarily and, 401-4 Photoelectric effect, 339-41 Planck, M . , 89, 107 Planck-Einstein relation, angular frequence and, 71 Planck's constant, 479 Planck's radiation law, 1 Podolsky, B . , 227 Poisson bracket, 66, 84 commutators and, 50-51 Polar vectors, 254 Polarized beams, 174-76
497 Polarized light and Berry's phase, 479 Polaroid filters light and, 6-10 Polyatomic molecules as complex systems, 473-74 Position eigenkets, 42-43 Position-momentum uncertainty relation, 48 Position-space wave functions, 51-53 Positive definite metric, 14 Potassium atom, fine structure and, 304-7 Potentials, Schrodinger wave equation and, 447-48 Preston, M . , 416 Probability conservation, 402 Probability density, wave mechanics and, 101 Probability flux, 101 Projection operator, 20 Projection theorem, 241 Propagators transition amplitude and, 1 14-16 wave mechanics and, 109-14 Pseudoscalar, examples of, 254 Pseudovectors, 254 Pure ensembles, 24
Quadratic Stark effect, 296 Quantum electrodynamics, covariant, 326 Quantum Hall effect, 480 Quantum interference , gravity-induced, 127-28 Quantum mechanics Bell's inequality and, 229-32 complex numbers and, 27-28 gravity in, 126-29 infinitesimal rotations and, 156-58 symmetry in, 249-50 tunneling in, 258 Quantum statistical mechanics, 182-87 Quantum Zeno effect, 485, 486 Quarkonium, 109 Quenching, 284
Rabi, I . I . , 320, 324 Rabi's formula, 320 Radial integration, helium atom and, 367 Radiation law, Planck's, 1 Ramsauer-Townsend effect, 413 Rayleigh-Schrodinger perturbation theory, 285, 311
498
Recursion relations, Clebsch-Gordan coefficients and, 210-15 Reduced matrix element, 241 Resonance, 321, 418 spin magnetic, 322-24 Resonance scattering, 418-21 Rigid-wall potential, Schriidinger wave equation and, 447-48 Rojansky, V . , 489 Rosen, N . , 227 Rotation generation, orbital angular momentum and, 196-98 Rotation matrices Clebsch-Gordan coefficients and, 215-17 formula for, 221-23 spherical harmonics and, 202-203 Rotation operator, 156-58 effect on general kets, 160-61 irreducible representation of, 174 representations of, 192-95 2 x 2 matrix representation of, 165-66 Rotational invariance, 402 Rotations. see also specific type 2'TI', neutron interferometry and, 162-63 angular momentum commutation relations and, 152-58 finite vs. infinitesimal, 152-55 Pauli two-component formalism and, 165-68 Rutherford formula, 434, 438 Rutherford scattering, 388
Sakurai, Akio, 464 Sakurai, J. J . , 305, 434 Sands, M . , 488 Saxon, D. S . , 113 Scattering length, 414 Scattering processes Born approximation and, 386-90 Coulomb, 434-41 eikonal approximation and, 392-94 free-particle states and, 395-99 hard-sphere and, 406-10 identical particles and, 421-22 inelastic electron-atom, 429-34 Lippmann-Schwinger equation and, 379-86 low-energy, bound states and, 410-18 low-energy, rectangular well/barrier, 41 1-13 optical theorem and, 391-92 partial waves and, 399-410
Index
resonance, 418-421 symmetry and, 422-24 time-dependent formulation of, 424-29 Schiff, L. , 106 Schriidinger, E . , 1 , 68, 99, 101 Schriidinger equation, 56, 71-73, 126, 129, 138, 181, 281 , 299, 366, 406, 424, 425, 435 complex systems and, 474 time-dependent, 426, 429, 465 Schriidinger picture, 80, 144-45, 181, 318-19, 327, 343 base kets in, 87-89 state kets and observables in, 82-83 Schriidinger wave equation, 97-109, 131, 134, 267 Feynman's path integral and, 121-23 semiclassical (WKB) approximation of, 104-9 solutions to, 446-55 time-dependent, 97-98 time-independent, 99-100 Schriidinger wave function, 276 interpretations of, 101-3 Schriidinger's equation two-dimensional, 477 Schwartz inequality, 35, 64 Schwinger, J . , 26, 46, 217, 221, 324 Schwinger action principle, 149 Selective measurement, 25-26 Silver atom beams, polarized vs. unpolarized, 174-76 Silver atoms spin states of complex vector space and, 10 representation of, 8-10 Stern-Gerlach experiment and, 2-4 Similarity transformation, 38 Simple harmonic oscillator, 89-97, 145, 146, 188 energy eigenkets and eigenvalues of, 90-94 ground state of, 92 one-dimensional, ground-state energy of, 350 parity properties of, 256 perturbation and, 294-96, 345-46 Schriidinger wave equation and, 450 time development of, 94-97 Simultaneous eigenkets, 30 Singlets, 354, 377 Sodium atoms, fine structure and, 304-7 Sodium D lines, 306
Index
Solid angle geometrical phase factor and, 469 Sommerfled, A., 107 Space inversion. see Parity Space quantization, 4 Spatial displacement. see Translation Specific heats, Einstein-Debye theory of, 1 Spherical harmonics helium atom and, 367 Laguerre times, 445 orbital angular momentum and, 198-203 orthogonality of, 216 Spherical tensors, 234-36 Spherical-wave states, 395 Spin 112 particles, spin operator for, 204-5 Spin 1/2 systems, 22-23, 26-29, 61 anticommutation relations and, 28 canonical ensembles and, 186-87 dispersion in, 35 eigenvalues-eigenket relations in, 12 rotations of, 158-68 2 x 2 matrix and, 170 spin precession and, 76-78 time-evolution operator and, 69-70 time reversal for, 277-80 Spin-angular functions, definition of, 215 Spin correlations, spin-singlet �tates and, 223-26 Spin magnetic resonance, 322-24 Spin operator for spin 112 particles, 204-5 of spin 1/2 systems, 160 Spin-orbit interaction, fine structure and, 304-7 Spin precession , 76-78, 161-62, 305, 323 Spin-singlet states, spin correlations in, 223-26 Spin states complex vector space and, 10 of silver atoms, 8-10 Spinors, two-componen t , 164 Square-well potential, Schriidinger wave equation and, 448 State kets Schriidinger and Heisenberg pictures and, 82-83 time and, 69-71 State vectors, 1 1 Stationary states, 76 Stern, 0., 2 Stern-Gerlach experiment, 2-10 description of, 2-4 light polarization and, 6-10
499
sequential, 4-6 Stoke's theorem, 137 Stopping power, inelastic-scattering and, 432 Sturm-Liouville theory, 202 Sudarshan, E. C. G . , 484 Suzuki, Yasunaga, 464 Symmetrical double-well potential, 256-59 Symmetrical states, 257 Symmetrization postulate, 361-63 Symmetry in classical physics, 248-49 discrete lattice translation as, 261-66 time-reversal, 266-82 permutation, 357-61 Young tableaux and, 370-77 in quantum mechanics, 249-50 scattering and, 422-24 Symmetry operator, 249
2 x 2 Hermitian matrix, 471 Taylor expansion, 195 Tensors, 232-42. see also specific type matrix elements of, 238-42 product of, 236-38 rank of, 234 Thomas, L. H . , 305 Thomas precession , 305 Thomas-Reiche-Khun sum rule, 338 Threshold behavior, 4 1 1 Tight-binding approximation, 263, 265 Time-dependent potentials, 316-25 Time evolution, of ensembles, 181-82 Time evolution operator, 69-71, 250, 326 Heisenberg equation of motion and, 84 infinitesimal, 70 Schriidinger equation and, 71-73 Time reversal, 266-82 Kramers degeneracy and, 281 spin 112 systems and, 277-80 Time reversal operator, 271-75 wave function and, 275-77 Trace, definition of, 38 Transformation functions, 55-56 Transformation matrix, 37-39 diagonalization and, 66 Transformation operator, 36-37 Transition amplitudes base kets and, 87-89 composition property of, 1 1 5 propagators as, 114-16
500
Transition probability, 327-28 Transition rate, 332 Translation, 44-46 infinitesimal, 44 momentum as generator of, 46-50 Translation operator, physical interpretation of, 188-89 Transmission-reflection, Schrodinger wave equation and, 448-49 Two-electron systems, 363-66 Two-level crossing, general consideration of, 471-72 Two-state problems perturbation theory and, 286-88 time-dependent, 320-22 Two-state systems Hamiltonian matrix for, 348 Hamiltonian operator for, 62 Stern-Gerlach, 2 Uncertainty principle, Heisenberg, 3 Uncertainty relation, 34-36 energy-time, correlation amplitude and, 78-80 Unitarily, 401-4 Unitarily relation, 402 Unitary circle, 404 Unitary equivalent observables, 40-41 Unitary operator, 37, 80-82, 249 Unitary transform, 40 Unitary unimodular matrix, 169-71 Unpolarized beams, 174-76 Unsold, A., 369 Vacuum ket, definition of, 218 Van Dam, H . , 217 Van der Waals' interactions, 311-12 Variance, 34 Vector operator, 232-33 defining property of, 233 Vectors. see also specific type definition of, 232 transformation properties of, 166 Virtual transitions, 333 von Neumann, J . , 176 Watson, G. N . , 442 Wave equations Feynman's path integral and, 121-23 Schrodinger's, 97-109, 13 1 , 134, 267 Semiclassical (WKB) approximation of, 104-9
Index solutions to, 446-55 time-dependent, 97-98 time-independent, 99-10 Wave functions momentum-space, 55-57, 67, 146 under parity, 254-56 position-space, 51-53 renormalization of, 293-94 Schri\dinger, 101-3, 276 time reversal and, 275-77 Wave mechanics, 98 classical limit of,, 103-4 probability density in, 101 propagators in, 109-14 Wave packets Gaussian, 57-59, 64, 67, 100, 112 minimum uncertainty, 58 scattering and, 385-86 Wentzel, G . , 105 Weyl, H . , 100 Whiskers, Aharonov-Bohm effect and, 139 Whittaker, E. T. , 442 Wiener, N . , 90, 483 Wigner, E. P., 192, 221, 227, 260, 281, 345, 415 Wigner-Eckart theorem, 239-42, 247, 280, 297, 399 Wigner functions, 192 Wigner's 3-j symbol, 210 Wigner's formula, 223 Wilson, E. B . , 489 Wilson, W . , 107 Wittke, J . P., 488
Xenon, Ramsauer-Townsend effect and, 413
y-axis, space fixed, Euler rotations and, 171 Young, A ., 370 Young tableaux, permutation symmetry and, 370-77 Yukawa potential, 431 , 442, 445, 447 scattering by, 387-8
Zeeman effect, 307-11 Zeeman splitting, 346-47 Zero-energy scattering, bound states and, 413-16 Zeroth-order eigenkets, 298 Zeroth-order energy eigenstates, 347
Solutions to Problems m
Q1mntlm1 Mechanicf->
P. Saltsiclis. additions by B . Brinnc 1995 . 1 999
0
\lost of the problems presented here are taken from the hook Saknrai. J. J . . Modern (jurmtum :\1eclumics, Reading, \1 !\: Addison - \Vesley, 1 98:5.
Contents
I
Problems
1
2
:3 1
5 II
1
2
:3 1
5
3
Fundamenta1 Concepts
Quantum Dynamics . Theory- of Angu l ar \lomentmn . Symmetry in Qna.ntnnl \lerhanirs Approximation \let hods Solutions
5
H I
17 19 23
Fundamental Concepts Quantum Dynamics . Theory- of Angu l ar \]omentum . Symmetry in Qna.ntnnl \lerhanics Approximation \let hods
1
25 36
75 9>1
l iB
2
CONTENTS
Part
I
Problems
1.
1
FUND'4.MENTAL CONCEPTS
5
Fundamental Concepts
1 . 1 Consider a ket space spanned by the eigenkets { Ia' ) } of a Her mitian operator A . There is no degeneracy. (a) Prove that IJ( A - a') <]_
is a null operator. (b) What is the significance of
" IT ( A - a ) ?
" , ' ·J. T'J.
a
'
- a
"
(c) Illustrate (a) and (b) using A set equal to S, of a spin � system. 1.2 A spin � system is known to be in an eigenstate of /] il with eigenvalue n('., where fl is a unit vector lying in the Tz-plane that makes an angle ;· with the positive .c -axis. (a) Suppose 5., is measured. What is the probability of getting + h/2? (b) Evaluate the dispersion in S.n that is, ·
( ( 5, - (5, ) ) ' ) . (For your own peace of Inind check your answers for the special cases '"'/ = 0, rr /2, and IT . )
1 . 3 (a) The simplest way to derive the S chwarz inequality goes as follows. First observe
( (u l + .\' (3 1 ) · ( I n) + .\13))
20
for any cmnplex number .\; then choose .\ in such a way that the preceding inequality reduces to the S chwarz inequility.
(b) Show that the equility sign in the generalized uncertainty re lation holds if the state in question satisfies with ,\ purely imaginary. (c) Explicit calculations using the usual rules of wave mechanics show that the wave function for a Gaussian wave packet given by . ' I O) \:r '
=
(•2rrt,i")-' 1
'4 cxp
[i(p):n c' - (:IJ -.d"(:I:))" ] --
4 . .,
satisfies the uncertainty relation
Prove that the requirement ( "/ I "" :J: In) = (imaginary number)(.r' I ""Jiln) is indeed satisfied for such a Gaussian wave packet, in agreeinent with (b) .
1.4 (a) Let :t and Px be the coordinate and linear momentum in one dimension. Evaluate the classical Poisson bracket
(b) Let :r and fix be the corresponding quantum-mechanical opera tors this time. Evaluate the commutator
[.r, cxp (1]1,.(1 )] . T
(c) Using the result obtained in (b), prove that
(lf'.rf1)
cxp h l .r ) , 1
2.
Q UANTUTvi D YNAMICS
7
is an eigenstate of the coordinate operator sponding eigenvalue?
1 . 5 (a) Prove the following:
"' ·
What is the corre
w. .- II'
. , () I ' I n ) ' (lp'
(i ) (ii)
1 1 ' '" ' / • t ()
· f '- I"" " I, I' I 1 " . ID I, I' ) . . ·u . i)p' . " •
.
where (D,, ( y/) = (p' ln) and ,:Y,,(y/) = (rl l(i) are momentum-space wave functions. (b) What is the physical significance of •
( u: :::. \
exp . -,- } , \ {}, /
where x is the position operator and ::=: is some number with the dimension of momentum? Justify your answer.
2
Quantum Dynamics
2 . 1 Consider the spin-procession problem discussed in section 2 . 1 i n Jackson. It can also be solved in the Heisenberg picture. Using the Hamiltonian
Il = - ( rnc ' 8 ) s, = w8z,
write the Heisenberg equations of motion for the time-dependent operators S'.r( l ) , 5'11 ( 1 ) , and Sz ( t ) . Solve them to obtain S.r.y.z as func tions of time.
2.2 Let 1:(/) be the coordinate operator for a free particle in one dimension in the Heisenberg picture. Evaluate [ r ( l ) , r ( O )] .
.
.
R c
2.3 Consider a particle in three dimensions whose Hamiltonian is given by ![ ·
=
By calculating [:r · r), II] obtain
-ddi ( x- · PI. .,
=
1' + 4,1
'2rrt
•1 (' :r-) .
l "
( -Pl). - ,:r
�-
m
·
""'
.)
v' ,,. .
To identify the preceding relation with the quantum-mechanical analogue of the virial theorem it is essential that the left-hand side vanish. Under what condition would this happen?
2.4 ( a ) Write down the wave function ( in coordinate space ) for the state IO) . c xp
( �:m) -
You n1ay use
(.r. , I O) . . .
1 1 � _;r - f4_ - /'2
r0
cxp �
[
1 -2
;r
( :ro I
)] ( 2
To
=
(�) '(([,(..;.)
l/2
)
.
(b ) Obtain a simple expression that the probability that the state
is found in the ground state at i for i > 0 ?
=
0. Does this probability change
2 . 5 Consider a function, known as the correlation function, defined by C(t) = (T(i )T( O ) ) ,
where :r(f) i s the position operator i n the Heisenberg picture. Eval uate the correlation function explicitly for the ground state of a one-dimensional simple harmonic oscillator.
2.
Q UANTUTvi D YNAMICS
9
2.6 Consider again a one-dimensional simple harmonic oscillator. Do the following algebraically, that is, without using wave func tions.
(a) Construct a linear combination of IO) and l l ) such that ( :r ) large as possible.
IS
as
(b) Suppose the oscillator is in the state constructed in (a) at t = 0 . What i s the state vector for t > 0 i n the Schrodinger picture? Evaluate the expectation value (:r) as a function of time for t > 0 using (i) the S chrodinger picture and (ii) the Heisenberg picture. (c) Evaluate ((t,.:r)2) as a function of time using either picture.
2. 7 A coherent state of a one-dimensional simple harmonic oscil lator is defined to be an eigenstate of the (non-Hermitian) annihi lation operator a :
where ,\ is, in general, a complex nmnber. (a) Prove that is a normalized coherent state.
(b) Prove the minimum uncertainty relation for such a state. (c) Write 1 >-) as 1 >-)
= I: .f( n ) ln). n=O
Show that the distribution of lf (nW with respect to n is of the Poisson form. Find the most probable value of n , hence of ;,; . (d) Show that a coherent state can also be obtained by applying the translation (finite-displacement) operator c:,t ;r, (where p is the mon1entmn operator, and I is the displacen1ent distance) to the ground state.
10 ( e ) Show that the coherent state 1 >.) remains coherent under time evolution and calculate the time-evolved state 1 >- ( 1 ) ) . (Hint: di rectly apply the time-evolution operator.)
2.8 The qnntnn1 n1echanical propagator, for a particle with n1ass moving in a potential is given by:
m,
, - ( - , y ; r,, = J\ .T: ,)
lax· o
T."(. :L y ; 1 , O) - · dl f.-i Ht;-n. J\
=
• '\" -" L.. n
where A is a constant. (a) What is the potential?
"'
th' .
2 rJ - 2mfl
sin(nT.r) sin(nT,IJ )
(b) Determine the constant A in terms of the parameters describing the systen1 ( snch as m , etc. ) . ,.
2.9 Prove the relation
dO( a: ) (h
-- =
,\ ( :r. )
.
where O ( :r ) is the (nnit) step function, and S(:r ) the Dirac delta function. (Hint: study the effect on testfnnctions.)
'{7{.(.,1.) i 2 · 2 , , ! (:1:0 + h:y) COS \ wT) - .r0:J:y , 2 s111 (w f ) for the classical action for a harmonic oscillator moving from the point :r0 at t = 0 to the point :J:y at t = T.
2.10 Derive the following expression
.'\ I ,...,
=
1
,
•
1
·2
1
2 .r2
-
l
2 . 1 1 The Lagrangian of the single harmonic oscillator is L = •
2
-rn.r:
-
2
- rnu.)
2.
Q UANTUTvi D YNAMICS
11
(a) Show that
where S'co� is the action along the classical path ( :�:6, 1 6 ) and G is G(O, lc.; 0, L,) =
\in� J dy, . . . dyv C� r. c)
i"'V --+ - :J
.
rn
_ ,,
l L__
2
E±.!l
{
. �v u: p , 2:: ,,_
}. t =O
"'d
nl
fr01n ( :�:", 1") to
·] }
., 1 l'� [ .....)..,. (!h+ ' - u )"., - -:cmw" 2 �
'1 , .)
.1 ]
where c = t��r-.:{') . (Hint: Let y(l) = :c( l ) - :1: d ( l ) be the new integration variable, :r,,t ( l ) being the solution of the Euler-Lagrange equation.) (b) Show that G can be written as
where
n
=
and nT is its transpose. Write the symmetric
matt·ix u. (c) Show that
[Hint: Diagonalize
rJ
by an orhogonal matrix.]
(d) Let ( ";�·') v ddu = ddu;v = fiN . Define .i x .i matrices uj that con sist of the first .i rows and .i columns of u;v and whose determinants are l'.i · By expanding uj+1 in minors show the following recursion formula for the /1.1 : .J = L . . . , :V
( 2. 1 )
12 (e) Let q)( t ) = E!ii for t = t,, + j.,: and show that ( 2 . 1 ) implies that in the limit r:: � 0, \D( I ) satisfies the equation (P(i
.2 ' l) . - = -w oi . \
d/2
" t!1 m1 • •t•Ial con d 1" t "wns qJ. ( I = t,, ) =
WI
(f) Show that
' I.o ) - \/f ( ·.,·b t b I •·Iu, ,
where
-
T
=
dc'(t-t ) () , · , ,-; "
= l.
i oJ C'Io "·1'2 ··1·'2) ,,,,· a .·1o' b ] f l ,..,. 11 . "'"' , . .m , a . ,,,,d · ' •'"• T). - �· .hczh sm(u.•.T) 2h sm( oJ T) b + .
•
.
..
{
l; - / 0 •
2 . 1 2 Show t h e composition property
where Kr(:r 1 , 1 1 ; :r0, 10) is the free propagator (Sakurai 2 . 5 . 1 6 ) , by explicitly performing the integral ( i . e . do not use completeness) .
2 . 1 3 (a) Verify the relation
where n
= m
ft
=
d2:l'
m.
d/1
ii -
=
e/ and the relation
dtl -
dt
=
c
[-+ ( F
l
d:J'
-
-
2c
dt
x
(b) Verify the continuity equation ap iJt
-,
-
. + v . .,· = o
R - R
-
-
x
d.J' -
dl
)]
.
2.
13
Q UANTUTvi D YNAMICS
with .J given by
J
=
(-n ) s ( ,·'v'l/;) � - (-c ) 4�1 '1'' 1.'. 1rt
1rtC
2 . 1 4 An electron moves in the presence of a uniform magnetic field in the z-direction ( f} = FJ'i ) . (a) Evaluate
[1 1.,, I I ,J
where
fl, = /l.r
eA,··.· : - -c
(b) By comparing the Hamiltonian and the commutation relation obtained in (a) with those of the one-dimensional oscillator problem show how we can immediately write the energy eigenvalues as Ek · n
·
_
�
e D I h ) (. + -1 ) h k' (l--+ 2m me . 2. '
"·
where hk is the continuous eigenvalue of the nonnegative integer including zero.
fiz
�
operator and
u
1s a
2.15 Consider a particle of mass m and charge q in an impenetrable cylinder with radius R and height Along the axis of the cylin der runs a thin, impenetrable solenoid carrying a magnetic flux
a.
2.16 A particle in one dimension ( -oo < :r < oo) is subjected to a constant force derivable from v
=
.\.t.
( .\ > 0 ) .
14 (a) Is the energy spectrum continuous or discrete? Write down an approximate expression for the energy eigenfunction specified by E.
(b) Discuss briefly what changes are needed if \ ' is replaced be
3
Theory of Angular Momentum
3 . 1 Consider a sequence of Euler rotations represented by
Because of the group properties of, rotations, we expect that this sequence of operations is equivalent to a single rotation about some axis by an angle (). Find ().
m=
=
3.2 An angular-n1omentum eigenstate I.J, m n1ax j) is rotated by an infinitesimal angle c about the y-axis. Without using the explicit form of the d�D"' function, obtain an expression for the probability for the new rotated state to be found in the original state up to tern1s of order c 2 •
3.3 The wave function of a patricle subjected to a spherically symmetrical potential 1/(r) is given by
v·( l')
= ( .!: + if + :3.z ) f(r).
:3.
15
THEORY OF A NUULA.R TviOMENTUTvi
(a) Is ,:. an eigenfunction of L? If so, what is the /-value? If not, what are the possible values of I we may obtain when f' 1s measured? (b) What are the probabilities for the particle to be found in various states?
m1
(c) Suppose it is known somehow that t;'(.i) is an energy eigenfunc tion with eigenvalue ;.,· . Indicate how we may find V ( r·) .
3.4 Consider a particle with an intrinsic angular momentum (or spin) of one unit of h. (One example of such a particle is the e meson). Quantum-mechanically, such a particle is described by a ketvector l e) or in :i' representation a wave function
e: (:i) = ( i'; i l e)
where I Z, i) correspond to a particle at rection.
:l'
with spin in the i :th di
(a) Show explicitly that infinitesimal rotations of e'(:f) are obtained by acting with the operator
:-
where /,
=
t, "-[· '
x
-
v . Determine
(b) Show that i' and
S !
S commute.
(c) Show that ,<; is a vector operator.
(d) Show that '¢ator.
x
( :3.1 )
if(.f)
=
1�" (S
·
ji)if where
fi is the momentum oper
I to form 3.5 We are to add angular 1nomenta j1 1 and j2 j 2 . 1 , and 0 states. Using the ladder operator method express all =
=
=
16 (nine) j, m eigenkets in terms of lj1 j1 ; m 1 m1 ) . Write your answer as
( 3.2)
�1+. 0 ) � �1 0 . +), . . . , where + and 0 stand for m 1 ,, = 1 , 0 , respectively. lj = L m = 1 ) =
v -
v �
3.6 (a) Construct a spherical tensor of rank l out of two different 1 -c veet ors {,_,, = \1 1 . 1·· rj'·._: . and ,.,· = ( '.,·.�· � ,v'u. � ,.,·.7 . • E-xp1·I CI" tly wr1" t e ''/ '1±l',1 o 1• 11 tern1s of Ux.:,,z and 1/;.,,, , . ' ../ :n
)
"'u�
.
)
(b) Construct a spherical tensor of rank 2 out of two different vectors (i and �-;. Write down explicitly ?ii±l .O in terms of U,, and 1/;.,,, , . ·H·•
3 . 7 (a) Evaluate rn=-.7
for any j (integer or half-integer) ; then check your answer for j
=
}.
(b) Prove, for any j , "' m ' l f1!''. Uli l ' = � i ( i + 1 bin _i! + m'2 + � ( :) cos2 i] L .:. , J
rn.
"
.
•
..:·
•
/
� 1 ).
[Hint: This can be proved in Inany ways. You Inay, for instance, examine the rotational properties of .r; using the spherical (irre ducible) tensor language.]
an d ( rc, (irreducible) tensor of rank 2 . • 3.8 (a) U< vvnte •
.J:y,
.1: ::: ,
� if2 )' as cmnponents of a spherical
4.
SYMMETRY IN Q UA NTUM MECHANICS
17
(b) The expectation value
Q = t (a , j, m = j l ( 3z2 - r2 ) laJ m = j ) is known as the quadTupole rnornent. Evaluate (where m ' = j, j - L j - :2 , Gordan coefficients.
4
.
.
.
) in terms of Q and appropriate Clebsch
Symmetry in Quantum Mechanics
4.1 (a) Assuming that the Hamiltonian is invariant under time reversal, pt·ove that the wave function for a spinless nondegenerate system at any given instant of time can always be chosen to be real. (b) The wave function for a plane-wave state at l = 0 is given by a complex function t '1'" ''11' . Why does this not violate time-reversal invariance?
4.2 Let tp(jJ) be the momentum-space wave function for state lo), that is, \C,(z7 ) = (J7 1o ).Is the momentum-space wave function for the time-reversed state B i n) given by ,\(z7, tp( -jJ), t/ (z7), or 1)"( -J?)? Justify your answer.
4.3 Read section 4.3 in Sakurai to refresh your knowledge of the quantum mechanics of periodic potentials. You know that the en ergybands in solids are described by the so called Bloch functions lf'n.ic fullfilling,
18 where a is the lattice constant, n labels the band, and the lattice momentum /;; is restricted to the Brillouin zone [-;rju. 11-ju]. Prove that any Bloch function can be written as, '.'. 'Fn. k ( 1· ) •
"\' ,-i--'n ( 1' �
H;
.•
•
kH , fi' 1. ) fi -
where the sum is over all lattice vectors H1• (In this simble one di mensional problem H1 ia, but the construction generalizes easily to three dimensions . ) . The functions 'ion are called \Vannier functions, and are impor tant in the tight-binding description of solids. Show that the \Van nier functions are corresponding to different sites and/ or different bands are orthogonal, i . f . prove =
./ d:crp�, (l: - R ;)¢in(;c - R1 )
�
cl,J clm n
Hint: Expand the ¢ns in Bloch functions and use their orthonor mality properties.
4.4 Suppose a spinless particle is bound to a fixed center by a potential \ '(:f') so assymetrical that no energy level is degenerate. Using the time-reversal invariance prove
for any energy eigenstate. (This is known as quenching of orbital angular mOinemtutn.) If the wave function of such a nondegenerate eigenstate is expanded as
L L hn(r )l/"' ( II, 'P) . l
m
what kind of phase restrictions do we obtain on Ftm ( r ) ?
4.5 The Hamiltonian for a spin If
=
1
system is given by
A s; + H(s; - S,� ) .
a.
APPROXIMATION TviETHODS
19
Solve this problem exactly to find the normalized energy eigen states and eigenvalues. (A spin-dependent Hamiltonian of this kind actually appears in crystal physics. ) Is this Hamiltonian invariant under tin1e reversal? How do the nortnalized eigenstates you ob tained transforn1 under titne reversal?
5
Approximation Methods
5 . 1 Consider an isotropic harn1onie oseillator in two ditnensions. The Hamiltonian is given by llo
�
=
P.;
�
rxy
+-+
2m 2m
'J
rrtu.r 2
. -
( :r ·
_ ::!
·
::!
+ If )
(a) What are the energies of the three lowest-lying states? Is there any degeneracy? (b) We now apply a perturbation '
0/Tlu.)-;f:l}
'1'.• = r
where J is a ditnensionless real nun1ber n1uch stnaller than unity. Find the zeroth-order energy eigenket and the corresponding en ergy to first order [that is the unperturbed energy obtained in (a) plus the first-order energy shift] for each of the three lowest-lying states. ( c) Solve the !!0 + V problem exactly. Compare with the perturba tion results obtained in (b). �-[You may use (n' l:r ln) = Jh/2 mc.J( JfiTIJn'.n+l + yno,•.n-1 ). ]
5 . 2 A system that has three unperturbed states can be represented by the perturbed Hamiltonian matrix
20 where E, > E1 • The quantities and b are to be regarded as per turbations that are of the same order and are small compared with E, - E 1 • Use the second-order nondegenerate perturbation theory to calculate the perturbed eigenvalues. (Is this procedure correct?) Then diagonalize the matrix to find the exact eigenvalues. Finally, use the second-order degenerate perturbation theory. C01npare the three results obtained.
a
5.3 A one-dimensional harmonic oscillator is in its ground state for l < 0. For t ?': 0 it is subjected to a time-dependent but spatially uniform force (not potential! ) in the x-direction,
(a) Using time-dependent perturbation theory to first order, obtain the probability of finding the oscillator in its first excited state for l > 0). Show that the l --t x ( T finite) limit of your expression IS independent of time. Is this reasonable or surprising? ( b) Can we find higher excited states? [You may use (n'l-r ln) = Jtlj2mw( .;n+"T!i,.' n.+ J + yniJ,,_, _ , ) .] 5.4 Consider a composite system made up of two spin � objects. for I < 0, the Hamiltonian does not depend on spin and can be taken to be zero by suitably adjusting the energy scale. For t > 0 , the Hamiltonian is given by J·[ -
-
(/ 4�) " -n
2
,�,
. 82 .
Suppose the system is in I + -) for l <:.: 0 . Find, as a function of time, the probability for being found in each of the following states I + + ) , 1 + -) , 1 - +) , 1 - -) : (a) By solving the problem exactly.
a.
APPROXIMATION TviETHODS
21
(b) By solving the problem assuming the validity of first-order time-dependent perturbation theory with H as a perturbation switched on at I = 0. Under what condition does (b) give the correct results? 5.5 The ground state of a hydrogen atom ( n to a time-dependent potential as follows: Y(x, i ) = %ms(h - w l ) .
=
1 ,I
=
0) is subjected
Using time-dependent perturbation theory, obtain a n expression for the transition rate at which the electron is emitted with mo mentum iJ. Show, in particular, how you may compute the angular distribution of the ejected electron (in terms of IJ and q defined with respect to the :.-axis) . Discuss briefly the similarities and the differences between this problem and the (more realistic) photo electric effect. ( note: For the initial wave function use
If
you have a normalization problem, the final wave function may be taken to be 'V
:( :f')
.. .
=
(_2,) u
,.ifi·.i'fn
with /, very large, but you should be able to show that the observ able effects are independent of L)
22
Part
II
Solutions
1.
1
FUNDAMENTAL CONCEPTS
25
Fundamental Concepts
1 . 1 Consider a ket space spanned by the eigenkets { I a') } of a Her mitian operator A . There is no degeneracy. (a) Prove that
II(A - a') <]_ I
is a null operator. (b) What is the significance of
5,
of a spin � system.
"' . , II ( Q " - (1.'l I (J."\ ""'lin'} o.
( 1.l )
(c) Illustrate (a) and (b) using A set equal to (a) Assmne t hat In) is an arbitrary st ate ket . Then
il( A - u ') l n )
� (.rl··
I
(h) Aga.i n for a11 arbitrary state In) we wi I I have
[
a" II ( A - I!) o/' ;f;.o/ a - a
]� � � (/.111
1
({
"'\ I\ (J. "' I Ct)' 1
{ !!! " > ( a"' I ct,1 II , a - a ) I a"'1 = (/Ill o/' ;f;.o/ a - (1 " l::: (a"'ln:)h,l m,] .�la"') = (a' ln') l a') ::::} 1
�
(/.Ill
la')(a' l So it projects to the eigenket I a') .
,
= A ., ' .
1
( 1. :?)
26 (c ) It is 8 ,
ft/2 ( 1+)(+1 - 1-)(-1 ). This operat or has eigenkets I +) and 1-) with eigenvalues ft/2 and -tt/2 respectively. So ms, - ,z'] ) ==
[% ( 1+)(+1 - % (1+)(+1 + HHl] X [� ( 1+)(+1 - HHl + � ( 1+)(+1 + ] HHl -
HHl
u �
[-17. 1-)(-l][n.l+)(+ll = -17?1-) H+)(+l = o , where we have used t hat 1+)(+1 + 1-)(-1 1 . for a.1 ft/2 we have " II ( - · " ] )' ,_, , + :;- 1 /1 /·) - a 1l./') + / 1'> =
=
0
,oz
a
=
c
* [� ( 1+)(+1 - HHl + � ( 1+)(+1 + HHl]
, II ..J.. -r, ") :t -t-·' I -
f �
. /1
,__
I1
.....,
-hI n. l +)(+l = 1+)(+1·
Similarly [or a' =
II
a_ll;j::. a_ l
( 8,
" - a )
a -a I
( 1. :3)
II
-tt/2 we ha.ve
( 1.1 )
-tt/2 - h/2
-* [� l l+)(+l - HHl - � ( 1+)(+1 + H(- l l] - * ( - h l-)(-1) = 1-)(-1 .
( l .'i)
1.2 A spin � system is known to be in an eigenstate of .� ii with eigenvalue 1>/2, where t) is a unit vector lying in the :rz-plane that makes an angle r with the positive .:- axis. ( a) Suppose S, is measured. What is the probability of getting ·
+ ft/2?
1.
27
FUND'4.MENTAL CONCEPTS
(b) Evaluate the dispersion in Sc, that is, ((Sc - (8,c ) ) 2 ) .
(For your own peace of mind check your answers for the special cases �f = 0, Tt /'2, and Tt . ) Since the nnit vector n rn ak es an angle '"'' / \Vith the positive .z-axis and is lying in the .rz - plene, it can he written in the fol lowing way fl
So
. n
8
(
=
COS ·'"'/ + (
2
[(S- 1 .3.36),(5- 1 .4.1 8)]
[� ( 1+)(+1 - H H l]
8, cos !' + 8x sin AI
( l .G)
sin ·j
j_.
=
cos
,
[� ( 1+)(-1 + 1-)(+ll]
+
siu ;(.1 . 7 )
Siuce the system is iu au eigenstate of ,<; · il with eigeuvalue fl/2 it has to satisfay the following equation
( I.S )
From ( 1 . 7 ) we h ave t h at
(
t'lll i
oc; fl . cos ; ,'-, . n = •
.
:2
Sill }'
- COS )
)
.
( 1. 9)
The eigenvalues and eigenfuncions of t h i s operator can he found if one solves the secular equ at i on •
1
n.:.:.
-4 cos ·
n_:.:.
1
1
s m+ ). - - 4
'
·
•
•
·
( h/2/ '
det ( .'>oc; · n - A l) = il =? det '
cos ; - ).
' /l.
=
•
fl =? . .
Siuce the system is iu the eigenstate h ') -
h •
(
'
=
cos !'
siu 1 a
'
-
""/
sin -y
- cos ...y
COS j'
Slll ._,,.· .
=
)( ) a
b
•) ...., c;: �lll . �.
=
"::) � a 2 Slll ....:.2 . COS .
'J ";1
�
..L
2
') ' ...., < �lll
T:_
.
;.
,
n :.:.
). - - 4 ,
l ·
2
=
(
a
h
a
h/2 sin -y
-'/') fl. CO· < •! - A' ,__
0
=? ). .
(�)
=
tau
'I ...!....
'2
.
a
=
n_
±-.
) = 0 =?
( 1 . 1 0)
2
we will have that
cos '"'/ + l; sin ...r '"'/ - (, cos ...Y
) { a sin =}
�
= =
a
b
} =}
(1.11)
28
I ,.:; · Fl ; +) to be norn1
Tint we w
2
+ h"
=
1
=?
/' u � = co:-;. � <")
<")
2
=?
a =
i ±y1ros-,-, -
2
=
IS
/' � cos 2
(1 .12)
where the real positive convention has been used in tlw last step. This 1neans tha.1. th<' sta.l.e in wlddJ th<' systern is i n , is gi ven in f.('nns of the <'ig<'nst.at(-'S of the S, op nat or hy
,. '": I -1· . -ry I ,)c; · n• ; +;I = cos -ry I + ) + sm I
( 1. 1 :3 )
(
1 ,9, ; +)
=
\, I +) + V\-, 1 -- ) .
V-
( 1 . 14)
So the propability of getting +r1j'! wlwn ,(), is Ine
For 1 for ;
=
( l . Ei) 0 which means that the system is in the IS, ; + ) eigenstate we have
1 ( .c,. ,, . ; + 1 S·· , ; + ) I ' = -,' ( 1 l = -', .
( 1.16)
I (S, ; + I S, ; + J I " = 1.
( 1. 1 7 )
have
( 1 1 8)
1.
FUND'4.MENTAL CONCEPTS
29
(b) \Ve have that "'\ (( Sx ., - (,')xJ .' ' ) ''} = (, 'J.r/
As know \V(-�
2n (
�
( /,'Jx " \; ), ' ·
' 1 I ') I +;\' 1 - I + I - ;\+ 2 n1 ( 1+)(-1 + 1-)(+ll (1+)(-1 + 1-)(+ll t�' ( 1+)\ + 1 + HHl t�' .
( 1 . 1 9)
=?
=
=?
( 1 . 20)
So
[cos � (+I + sin �H] i (1+)(- 1 + 1 -)(+ll [cos � I+) + si n � H] -r,2 cos ...!."2 ... sin ...!."2 ... + -n2 siTt ...!."2 ... cos ...!.2 ... = -2 sin '"'·' h'.! 2 ' a.t td 4 [cos �(+ I + sin �H] t�2 [cos �I +) + sin � H] -[cos" ( 1 .2 1 ) -2 + sm" -]2 4 4 So substituting in ( 1 . 1 9) we will have "
-
±''
n,M
.
SUI
fl
I
=?
_...,.
,, I'
. ,, I'
=
±''
H�
- .
( 1 .22)
and finally ( ( ::,.sE )2 )Frr/ 2,jS, ;+)
( ( ::,_ s, )2 )·,=0;iS>)
(1 .2:l ) (1 .24) ( 1 .2:5)
:30 1 . 3 (a) The simplest way to derive the Schwarz inequality goes as follows. First observe
( (ol + ,\ " (fJ I ) · ( In) + A I/])) ::0: D for any c01nplex nmnber A ; then choose ,\ in such a way that the preceding inequality reduces to the Schwarz inequility. (b) Show that the equility sign in the generalized uncertainty re lation holds if the state in question satisfies
with A purely imaginary. (c) Explicit calculations using the usual rules of wave mechanics show that the wave function for a Gaussian wave packet given by
satisfies the uncertainty relation
V(P�: J ")j(( �p)")
=
Prove that the requirement ( /l :
�
:t
�·
l o) = (imaginary number)( .t' l�plo)
is indeed satisfied for such a Gaussian wave packet, in agreen1ent with ( b ) .
( a) vVe know that for an arbitrary state lc) the following relation holds (1 .2G) ( clc) 2 0. This means that if we choose lc) = In) + ,\ I J) where A is a complex number, we will have ((nl + A" (.· Jil · ( In) + A I ) ) ::0: 0 =} ( 1 .27) O
(olo ) + A(c�. l!l) + A"(!JI<>) + I A I 2 (iJI;J) ::0: 0. ,
( 128)
1.
FUNDAMENTAL CONCEPTS
If we now choose .\
=
- (,B in)/ ( ,6 1 ,6 ) the previous relation will be
\ '' 1 ''I >
·
31
(;3Jo)(oj(i) (;3Jo)(oj(i) (:ilti) ltii.:J)
+ 1(;31o)l' Plti)
> l\ o l o.I\ 1!31!3 i \' ' \ I � I ( 'B o I\ l ' ·
.\
-..,
"'-
0 =? ( 1.29)
Notice that the equality sign in the last relation holds when
l c)
=
l o)
+ .\1 ;3)
that is i f In ) and 1/ll are colinear.
=
0 =? l o )
=
- .\ 1:3)
( Ll())
(h) The unce1tainty n'lation is
I Lll ) To prove this relation we use the Schwarz inequality ( 1 .29) for the vectors In) = i'>.Aia) and l/1) = i'>. R ia) which gives ( 1. :3 2 )
·n ,e equality sign i n this relation holds according t o ( 1 .30) when
i'>.Aia)
=
( Ln)
.\i'>. R ia).
On the other han d the right-hand side of ( 1 .:32) is
( U1 ) which means thai the equal i l y sign in the une<'rtainty relation ( LH) holds if
� I ({L',.. L i'>.R})I"
=
() =? ({i'>.ki'>.R})
=
()
(al i'>.Ai'>.D + i'>.Ri'>.iila) = 0 1 '�') .\"(al(i'>.R J' Ia) + .\(al(i'>.R J"Ia) = 0 =? (.\ + .\" )(ul(i'>.R)"Ia) = 0. ( Ll:i) =?
Tlms the equality sign in the uncertainty relation holds when
with
A
( Llfi) pnrely irnaglna.ry.
(c) We have
( .r' I Ll.:r l n)
(.r'l (:r - (.r) ) l o-) ( 1/ 1 <· '; ( ".JJ - 1\ '1·)'1 . "
'
Ol! the other halld
t
=
.
:r'(.r' lo:)
( r' I ( P � ( p) ) I n ) I J 0 ."c . () \.L -U�l
d:r'
Bnt
' d -.-1 :r I o: II
[
J
() i(p):r' ' \·I. o ; I .II . . . d:r:1
d:r:' \ .
=
[
l\·r ' l OJ' i(p)
So sn hstitnting in ( 1 .38 ) we have
' , ) II "·,I n ) + \1' I ;r, . , '2d' ( :r �
'•)
i''
-�-� -
n.
�
h
--
T
�
-
�
�
)
,
(:1: ) (:r I n )
( 'I Ll.p I or1 .
( 1 .37)
(Jl · ) I\•L Jl 0:)
(.r'
�
( :r ) d2 4
I I 2d2 ( :r � (.r ) )
i h (. .1.,I � ( .l.·\r .) .1.,I or\ ( I
2d'
(:r) (:r' lo:)
=
in
( l .:l8)
) ']
] \ ,I I n I;
� ( l'r ( .1.
,1 '!d' (:1: Ll.:r l n )
:�:
(1 .39 )
=?
( 1 .40)
1.4 ( a) Let :t and Px be the coordinate and linear momentum in one dimension. Evaluate the dassieal Poisson bracket [.1:, F ( px )Lta.,,,;,;a/ ·
(b) Let :r and fix be the corresponding quantum-mechanical opera tors this time. Evaluate the commutator
[
.,. PX . .)I
(iJI,. .a)] . -�
. ' tt
"'•
(e) Using the result obtained in (b), prove that exp
( lf!.r ll) I
h ·" ) , 1
1.
FUND'4.MENTAL CONCEPTS
33 "' ·
is an eigenstate of the coordinate operator sponding eigenvalue? (a ) vVe h a ve
[ c Fir,.r .l] classtc-al . <
l
iJx iJF(!'x) iJ.r dpx
\
iJF(px) dpx
What
IS
the corre
iJx i1F(11x ) Up,. U.r
( 1 .4 1 )
(h) 'When x and fJx are treated as <j llantum-rnechanical O]Wrators we have
[.
[ . >Y• (-ia)" p"_· ] _ �•·
_
( .a) ]
p, J', exp i
T
r · '
� L..J
n=O
__
,, a - I: n=l
( c ) We h a ve now
[ (II'T·")]
H
. __
=
n. 1 \'·ia) '·] - ' r :· ' px tn rl
__
In - 1 ) 1 \
a ) ( i p,o .r !-r' ) - a exp ( i p ) l.r') h h a x p ( .r' exp ( i ,. ) l.r') - a. <'X]l t fl rl J I"J) h h ' 'flx \ , ( - a ) exp ( rl } 1 x ) . T
exp
,"
'
1:
So exp (ip;,] ) 1 .1�' ) So we G1 I t writ P
IS
an (-'ig(-'nst ate or t he operator ,r wit.h eigenvahw
I .r wh(-'r<::
_·
� L..J 1 n=O 'fl ._
I
( !•}
J' exp
'fl .
__::_ 1
,
.)
- a
=
, exp r-:'""a.
c
( ) I .r') '
( 1 .4:3) .1:1 -
C i s a constant w h i c h chw to norrna ] i:;,ation can h(-' ta k(-'n to be 1 .
a.
1.5 ( a) Prove the following:
. , c;- vP' I 0 ) , 111 ' I
a
u]l
(p' l:ci<x)
(i)
!. dP
, 'P.a •
.
. ( P' l , " /"" ( P' l t. il, up
a
where (,i,_, (p') = ( p' l o_) and �J(3(p') = (p'I_/J) are tnon1entun1-spaee wave functions. ( b ) What is the physical significance of exp
(i:�:=.) . -
.
n.
where :r is the position operator and =. is some number with the dimension of momentum? Justify your answer.
( a) We have (i )
(;/ l :r l n )
1
j d:/ 1:/) (.t:'l o) = j d:/(p' l.t l .t')(:r' lo) 1 " \ 't ' I o:_) J t.:r :t \P' I :t'_) (:r. ' I o__) (5- =.7.32) I c .t !• d:t -:---;a ( -,-) (th) (:r In) = th -a:-; [ !' A,- -,- (:r In)l . up up . " () . - [
:l .
.
(ii)
I
!}
----.
I
II I
,-
h e;-' (f!' I ) up
I
u
<>
.
',/"' '
•
.
"
l
.
I
:,/., ' I 1 .r /- (---
1h
.
I
.
I
"
1
:,ioc'
1
'*
(1 .45)
( 1 .46) where we h a ve used (l .il5) and that (;Jiri) = o(, ( p1) and (p' lo) = o,, ( Ji) .
1.
FUND'4.MENTAL CONCEPTS
35
(b) The operator exp ('·;,") gives translation in momentum space. This ca.n he justified by calculating the following opera.i.or i/=')] [p,exp (h ,
2:: I n - ] ) ! •X•
n= l \
I
·
( l.4 7)
011 c-n 1
whf-�n this conlnlntat.or c1cts (-:igeTISt!-l1e I P' ) of the TTJOTTJ(-'Tit11nl oper ator we will lta.ve l.r:=) ] I 1) p [exp [p,exp (h P c�::: ) I P1)] [exp c�=)] / I P1) � ::=: ex p ( ) p [exp (';, ) 1 /) ] - p' [exp c·;,::: ) ] l l ) � l:C:.) I P1)] ( p + ::=: ) [exp (z:r=) h I "] (HS) [exp ( h Thus we have t lmt i:r ::=: ) I ) 1 ) ( 1 .49) exp (,h p = A. I p + .::. , wltere A. is a con ta nt. wltidt due to norma.l ization ca.n t k n to he So
t ·t r,� -
1'
::::
I
1
s
1' I
•
.
_
lw
a
e
!.
36
Quantum Dynamics
2
2 . 1 Consider the spin-procession problem discussed in section 2 . 1 in Jackson. It can also b e solved in the Heisenberg picture. Using the Hamiltonian (H ., H=- . ,., , = cv.",
( )
.,
'fllC
write the Heisenberg equations of motion for the time-dependent operators 8.,( 1 ) , 8,JI ) , and 8z (l ) . Solve them to obtain 8,q,z as func tions of time. Let
ns first prove the fol l owing (2. 1 )
Indeed we have
[u
t A8U, Ut HsU
U1 [ A s , Bs] U
=
]=
Ui l s HsU - ut H s A sU
U1 C.,U
The Heisenberg eqnat.ion of motion gi ves
=
Cn .
dS." dl dSy dl d8:: dl Di fferentiating once more eqs. (2.3) and (2.·1 ) we get (p. 8,1.'
d"'! ,
d/2
.,
d/2 But on the other hand d8.); dt - Aw >'in wl + Bw co" wl
-Cw cos wl
C = - H.
- Ocv "in w l =?
(2.2)
(2.3) (2.4) ( 2.!:i)
2.
Q UANTUTvi DYNAMICS
37
So, finally Sr ( l ) S,J l )
Sr (O) cos wl - S,JO) sin ce!/ Su(O) cos ec!/ + 8,.(0) sin wl
8, ( f )
S, (O).
2.2 Let :r(f) be the coordinate operator for a free particle in one dimension in the Heisenberg picture. Evaluate [.T( t ), :r (O) ] .
The Hamiltonian for a free particle in one dimension is given by p' H=-
( :2 . 1 0 )
2m
This means that the Heisenberg equations of motion for the oper
p( I ) ih(f) at :r
l -,- [p(t), JJ (f)] 1, ,l
= -:./ 1 1.
l
[
p2 ( l ) p( t ) , . -
l
=
:r
0 =}
(2 . 1 1 ) p(O) 1 . . p2 (f ) J>(f) 1'�' 1 p( D ) I =? � [.T , JJ] = � :<:( f ) , -,- = �. 2p(i)ih =
(I)
Ul
1 rl
[
l
2Hl
_!__ p(O) + :r(O).
2-m
2rru.tl
rn
(2. 1:?)
m
[
. . . . illt t . t . ( 0 ) , :r(O) ] = - - . [.T ( i ) , :r(O) ] = -p(O) + :�.: ( 0 ) . :r(O) = lP Hl Hl rn
Tlms fin a l ly
tn
. ·]
-
(:2 .1:3)
2.3 Consider a particle in three dimensions whose Hamiltonian is given by ll =
_!!____ + \i ( i) . �2
2m
By calculating [ l · rY, II] obtain
( -.p? ) - (:r---; . \'1----; ,') .
d x-. . ---... -( di PI =
fli
To identify the preceding relation with the quantum-mechanical analogue of the virial theorem it is essential that the left-hand side vanish. Under what condition would this happen?
Let
m
ftrst c
[7 · jl, II]
=
[l · JJ. II]
[ -'2 l [I:" :r,p,.. 2:: 2::. [ ] 2:: p + � ( .r-) - · p,- �
=
.•
1:
_
rn
: ri, ,,PJ p; + ..A n
IJ
I
i= l
J';
.i =l :1
[p; , V(:C) ] .
l;
�+\ � rn ,"
.
( .r-)
]
( 2.14)
Tlw first cornrnut
1 1 1 ,,....., m [ T;, pj ] = .,....., rn (p, [ :r ; ,p1 ] + [T;, f'JP;;) = ., rn (p1iliSi1 + iMi1p1 ) l ., . irt r = - u , 7 P7 · -'2ul0qp (2.1:i ) 7 n1 2m Tlte second cornrnutator can b e calculated i f w e Taylor expand tlte fun ction \'(:C) i n terms of :z:; wlticlt means that we take V(:C) = Ln an:C:' with an i n depen dent of 1· , . So _
[p; , \/( f)]
·- -
=
I: an [Pi : .T7]
=
I: an 2:: r 7 [Pi : T:] .1'7- k- l n-1
k=O
n . . n-1 . l...: an L (-l.h).ri , (J- L a,_:r i = -z,h L a,_n:rin-1 = -Lhn-1 -
k=O
n
' " d "I -·
, .r ) . i):r, . .
' ih \
L.. - c ;1 p1 p;
-;n �j
m
-p '{71.
n
(2. 1 6)
'fhe right-hand side of ( 2 . 1 /1 ) now becomes =
dJ' I
n.
- Ill .- '
[i · fi, II]
n
_
___.2
'. I
.
--: - du · v\ ( x ). •
_---->
't ) :Ci-.·{)-V(:r )
+ L.., ( -m 7
____, ,
cl:r
1
-
(2. 1 7)
2.
Q UANTUTvi DYNAMICS
39
The Heisenberg equation of motion gives
-l p- - :r� .r, � H (2.=1 7) -:- [ :r� · p, · '' 1', (:r�) ]
d di d � :11 - X ( . P; di
- :r . }J -+
,.r,
.....
-0
'
p - ( r · vq ,
fli
( ) tn
=?
(2. 1 8 )
�
where i n the l ast step we used the fact that the state kets i n the Heisenberg picture are i n dependent of time. The left-hand side of the l ast equation vanishes for a stationary state. Indeed we h ave
d (n Ji · J�n) = � (n l [ r · p, H ] In) = ..� ( En ( n l r · P1n) - E,, ( n l r · J�n ) ) = O.
(lf
lH
'f.Jl
So to h.a.ve the quanturn-rnecha.nica.l analogue of the virial theorern we c.a.n ta.ke the expectation values with respect to a stationan1 state . 2.4 ( a) Write down the wave function ( in coordinate space) for the state
You may use
( :r IO) ,
.
= "
-
1 / t - 1 ·/2 ·
·
.
.r0
exp .
[ 1 ( I)'] ( -2
:r
,
:r o
) ) (m� . f
1
:ro
:=
I /2
.
(b) Obtain a simple expression that the probability that the state is found in the ground state at I = 0. Does this probability change for I > 0?
(a) vVe h ave
In, t =
0) ( :r'ln , t = 0 )
-IJW IO) exp ( ---y;) ( :r
1
. . (-'Xl) .
(
-i a
p --
n
.
.
.·L••l
) 1 0. ) (1'1.=1
1 /4 . . - 1 / 2 _t.Xp ' .1 0 11 •
=?
[
- :2 1
1:r 1
1
('"1 -- ) ] a
-..1 0
'
'
.
a
1 0. ) (2. 1 9)
40
(b) This probabil i t y is given by t he expressiou
. \ ( -zpa IO)I2 n. , .-}
I (Oi n , l = 01 l' = l (<'xp 1
(exp
(-zpa) ---y;- IO)
•
- tpa\ (---y;)I
d:cJ (0 1·.c.')11 .c/1 exp .
J j d:c
[ . I)'] ( . [ ( ) '] [ (·/' + , /2 + a' [ ') ( , ,a;- a' { ( ) ( ) ( ,; ) :1:0 1 exp
t � l 1.:t - 1 1'2
X
. .I
K
exp j
1
1
-1
-
----;:;;-
1 1
'
,1_ . . -1 (-_' xp 11 -1 /1.1.0
1
- ·) - . '2
�J. o
L
a
�
So
For I > 0
I (Oi n , t ) l '
'
4 -.,-.2o
J.
.
�
1
- -·-.
.
.r J.· o
1 1'2 d:c e x p - � :r y Tt:r0 . 2:;:0
ex p
0)
:c' - a
( J
r:;
( 2.20)
-
2:!:
a2
· . JT.:r0 = ex p - -'V111 ·,·· 0 ' 4' ·J'' ·o
I
,
'
.....Lu
I (O IU( t )ln, l = D) I ' = I (Oi exp
....•)a.L
.
)]
,1
a +-+2 4 4
�
I (Oin, I = D) I ' = ex p - ',
-
')]
(2.21)
.
(
-
i
�l ) lo, / = D)l'
1,- iEot/I• (O i n , I = o) l ' = I (O i n , I = o)l ' .
( 2 . 2:1 )
2 . 5 Consider a function, known as the correlation function, defined by
C(l ) = (11 ( i ) 11(0)),
( 2 . 24)
where .t( t ) is the position operator in the Heisenberg picture. Eval uate the correlation function explicitly for the ground state of a one-din1ensional sin1ple hannonic oscillator.
2.
Q UANTUTvi DYNAMICS
41
The Ilarniltoni
lT
( 2.25)
So t.he H eisen h erg eqnation:-: . of rn ot. i on vvill give d:c(J) dl
-:-,; [.C ( / ) , lf
ta _
dp(l ) dt
1 l = -:-,; 1-!J.
[·
.C ( / ) , -.)..... m_
,l
:c \ / ) � .tt r:c ( t ) , p' (J. ) j + �m w ' .� r.c (l. ), :c'(i)] lrn1n. ') · . I'( ,. ) 1
:...
w.
p' ( l. )
-
[
+
1.
'i ITlW
2
,
-
J J7, :...
,., l
.. ) .-p( l ) = ,_,J.furt m 1 [ ' 1 ' '''' p' ( l ) -:- p(l ) , II ] = -:- p(l ) , -.- + 0- mw - r(l) tft dt 2m
[
, ]
l
'
(:?.26)
_
.
' c ( !. ) .
ntw2 ., rn0..• 2 . -.-.- p(t ), :qi. ) = � [ ....·2r(c:c ( t )] = .... ,, w 2.11 , r,
(2.27)
Differentiating once more the eqn at.i on s (2.26) and (2.27) we g<'f
l dp( t ) - -..m dt 1 d:r(l) - --· m d. /
(2.27) . 2 = -w :r ( l ) -:- .r: ( t ) = /•, cos wt +
,
,
(2.2G) , = -w-p ( l )
•
.
=?
p(l ) =
C
,
R
.
. "n w/ =c>. :r (ll) = ;•,
cos w / + D smw/ •
.
=?
.
JI(O) = C, .
Bnt on the other hand from (2.:?6) we have d.r(l ) d! -w:r( 0) sin wt + Rw cos wt R
=
So
p( IJ) nu..,�.)
:ri' l ) = .r( O\ cos w/ ·
'
and t h e correl
p(l )
.
-- =?' m
p(IJ)
-- cos 1.1.)t rn
TJ
f) .
+ - sm wt '{fl.
::::;>
= -mw:r(ll ).
+ -- sinwl p(O) rnw
(:?.29)
l • (2.::m) ') ' ' ' ' I I C ( t ) = (.r (l. ).r (IJ); = (x-(O)) cos Lc' l + (p(IJ).r (IJ);- sm wl . I
•
(2.28)
.
rTU..;.)
(2.30)
42
Since we are interested in the ground stale the expectation values appearing in the last relation will be
2m_u)
h .
(p( O) .r ( 0))
"
2m_u) h
( 0 1 -.- ( a + a ' ) ( a + a1 )IO) -.- (O iaa1 IO) . /mr1w �. -' V 2 y 2m w (OI(a · - a )(a + a J IO) =
+
Thus
C(l) =
;, --
2rnw
:!
''
coswl - i. - sin wt = . '2mw
h
;;---f2.:n) 2rrt_W
1
-i!!_i Oiau1 IO.'; = -i!!_. 2\
=
;,
_ ,-
:!mw
�·, .
;
(:2. 3 2)
(:2 . 33)
2.6 Consider a one-dimensional simple harmonic oscillator. Do the following algebraically, that is, without using wave functions.
(a) Construct a linear combination of IO) and l l ) such that ( :r ) is as large as possible.
(b) Suppose the oscillator is in the state constructed in (a) at t 0. What is the state vector for I > 0 in the Schrodinger picture? Evaluate the expectation value (:c) as a function of time for I > 0 using (i) the Sehrodinger picture and (ii) the Heisenberg picture. =
(c) Evaluate ( ( �:t f) as a function of time using either picture.
( a) \Ve want to find a state In) = co lO) + c1 l l ) such that ( :r ) is as large as possible. The state In) should be nonnali:oed. This means \Ve can write the conslands co and
( 2.:l4) c,
in the following form
( :2.3 5)
2.
Q UANTUTvi DYNAMICS The average (:r:) in a OlH:�-d irneiJsioiJaJ sin1ple harnlOIJic oscillator if-1 given
by
+c�Co \,/� --.- ( ! I a + a1 1 0;' + Ic1 I ' \,J� --.- (· ! I a + a1 1 1 '; _ _ + c�cu) = 2 \',/ - J?(c�c1 .) ,2rru..,. -(c2,c1 .2rrtu..J .J _ ljj Vljj ,--2,/-- c (· J, Jo)l col \/1 ! col ' , V ljj 2rnu.) _ y 2m�
y 2m�
.
.
m
.
····
where we h ave used that :r: = y/ rnw 2 1' ( a + u.'� ) . \Vhat we need is to find the values of average (:r:) as large af-1 possible. '
·
····
( 2 . :lfi)
! co l and 1\1 rio that 1nake the ····
=> Jl - I eu I '. - Jl. ! c-u il' · l ' l·•o=>l i" ' l - I eu I' - I cu I' = 0 ro
(2.37)
A
a (:r:) =0
=> - sin(o1 - ou) = 0 => o1 = u +
JJ,
rn,
n
E :2. .
� . •'" (•) )<_- ) "
Hnt. for (.r) nuJ.sirmnn \V(-' \Vant also
So \V<-'
n:u 1
k: E Z .
\Vrite that . ..
1
1
(2. :j9 )
1
., , . I n '> = e "'" - 1 0) + e'· r•.'o+ "' - I I '> = c "'" - (· IO) + I ! '> ) . I
y'2
\Ve can always take 50
.
= 0. Thns
v0
•
I
/ ,
y'2
(2.4 1 )
44 ( b ) We h a ve
I n,
lo; t)
In, lo) = In). So 1 . -iE.,t/t' I O) + _1 -• E,t /�< l t ) U(t : I = O l l t ) = t-•H'i'' l ) = _ 12t. ;2' . � (t - /wl/2 1 0 ) + ,-iw:OI/ 2 1 1 )) = � ,-iwt/2 ( I O-) + , - iw/1 1 )) .(:2.il 2 ) .
v :?
()
(L
•
0
·
·
( i ) Ttl t he Sch riidinger picl me
)
[ � (t'wl/2(01
f.t
( n, lo; ll:csln, lo; l)s
I . s\ '1' ·
v :?
[�
/
,•w ll/2 ( 1 1) ] (, - •A/2 10) + , -•wll/2 1 1 )) ] .!. J(wl/ 2 - w31/ 2 )1 Q ·· 1 ) + .!. i(w:JI/2-wl/2)(·1 1 · ll I ) I I 2t n 1 -iwt \1 n. + 1 iwt \1I-I ( 2 43) 1 · 'J f_
-
-t
2
I
,--
+
\
--
T
·L -
�
2rnu.J ;.: -
2-tnw
-t
=
\
V
,.
.L
17 nl. ' W --
�--
2-tnw
.
·
( i i ) In the Heiset,herg picture we hav<> rrorn ( 2.29) t hat
:t H (. / .)
=
So
(:r )H
[�
(ni:J:H I0)
(OI +
:r (fl) "'" wl + -- "11
p(rnvO ) . wl.
•
..J
� ( I I] (:c(O) cos wt + mw0) p(
.
1 +:;s111. w. / I1 1 Ip I ) � rru..tJ
1
.
-
S i l l WI
.
) [ r.>. IU) + � I I )l 1
v :.!
·
I
V2
0
h- coswl + -.-] - sm wl\ "I \f�� .2rnw cos wl + c- \ ��V 2ntuJ 2rnv.) �
.
.
J
-. SI I I W / q f ..
+. 1 . 2rrtw'
�
:l
-
-
. lm nLc'
, / - cos Lc•/ .
V
2
.
.
.
.
1
.
-t
)\ .-
;V i·Jn2huJ
v 2mw
(c) T1 is known 1 ha1
( ( !i. .t.,· ) 2 ' ( _ / -
.r
"
)
-
I \ ·" 1
'2
( :2.•15)
2.
Q UANTUTvi DYNAMICS
45
]
In the Schorlinger picture we ),ave
which rnear1s that
[
h 2 + at 2 + ao. t + at o.). . . - ( a + at ) - , -\a f\ (h 2rni.J.J 2ruw
2
'
(2.46)
So
n. - n cos2 w l ( (. :'l r ) 2 ) s (2=.4:lJ ,.2rrtw
,) [
2rnw
In the Heisenberg picture
"'n(!
..
p(O)
=
:r(O) cos w/ + -- sin wl mw .
.T
=
n
.
-
2rrtv..J
.
suJ
2 wl.
]2
O ). sin 2 wl p'.(· cos 2 wl + 2 (0) ntJw2
. wl + p(O)l·(O) cos wl sm . wl + :r(O)p(O) cos wl sm
rn:w
m_w
n. . 2 + at2 + aa ' + ata .) cos2 w l -.-(a '2rrtw lfl flu). a2 t2 + a - rw t - a T. a ) s i n 2 w I c::--::--:c I 2m. 2w2 ' 1 /"�-.rr;-,t;,--,u--'(o. . + at )(a', - a .) sin ,'!.wl + - \.f rrtw 4 rrtw 2 . • sin '2wl i /runr1w . t (a - a ) (a + n.t ) . +y 2 m w 4m.w fl . 2 -( a + a t' + aa+• + at n. ) cos2 wl 2m.w ' •
.
.
(2.48)
46 , . ' ;r, !2 - a") sm :?wl , -(a" wt + -2rrtw . -(a --"2rnu.) + a t2 - aa t - a 'a) sm" h . h ' h t2 ' + a t a + --a" --(aa' cos '2wt + --a cos :?wl ) 2m_u) 2m_u) 2rnw in. ,2 +-('2 . 49) '2rrtw ( a' - a" ) sm '!wt. ,
'
0
"
'
•
'
'
• .
.
'
--
-
which means t hat \. o I :rff I o. '; H '
[
]
_ � (OI + \1 (1 1 � V�m"' v � [aat + a1o + a2 cos 2wf + a1 2 cos 2wt + ·i ( a.t' - ) sin 2wl x
o
2
( .,� . ·')I So
( ( L"n: ) 2 \ .
, Ff
.44) (2=
n - -.'' '' • 2 2 -2mw '2rnu.) cos wl = -'bnw- sm wl. .-
(2 . 51)
2 . 7 A coherent state o f a one-di1nensional sin1ple hannonic oscil lator is defined to be an eigenstate of the (non-Hermitian) annihi lation operator
a:
where
.\
a. I.\) = .\I.\ ) ,
is, in general, a con1plex nmnber.
(a) Prove that is a normalized coherent state. (b) Prove the minimum uncertainty relation for such a state.
2.
Q UANTUTvi DYNAMICS
47
(c) Write I .A) as I.A) = I: f( n ) l n) . Show that the distribution of 1/( n ) l' with respect to n is of the Poisson form. Find the most probable value of n, hence of E .
( d) Show that a coherent state can also be obtained by applying the translation (finite-displacement) operator , -ipl/n (where p is the momentum operator, and I is the displacernent distance) to the ground state. (e) Show that the coherent state I .A) remains coherent nnder time evolution and calculate the time-evolved state I.AU)). (Hint: di rectly apply the time-evolution operator.)
( a ) vVe have
since
a i O) = 0. The
commutator is
� ( o l)k-1 � _I_ A\ n L _I_ ' " �- l n 1 , ,!a , a t] 1·\ a t ,) n-k = L....,; ..... � L A L( o ) -
n=1 Co> ). S o frotn ( ')�··)..0
·n · 1
k=l
L
n=l n ·
f1 (n 1 l ) , .An ( a t)"- ! = ). f _l_n .1 pat ) n = n=O ,,
•__ __
·····
.
1
k=l
). , ·'"t
( ') ;' ,ry) ) "'"" '
)
which means that I .A) is a coherent state. If it i s normal i7,ed, it should satisfy also ( .A I .A) = 1. Indeed
(.AI.A) = (01 ·' c - i ' l' c·' ·' ' I O ) = c-1\' (O ic' '" c·' ·' ' I O )
48
f
- ' A I2 1:>.1' f;.
'
=
.·1
- ·
(b) A ccord i n g t o problem (1.3) the state should satisfy the followi n g relation
c3.J:
where Tnnnber. Since
==
J: - (.\ IJ:I.\ ), c3.p
nm
Js a purely nnagJTJary
ll�h/h( ' .\) = \1 - .\ + at )I.\) o :rl.\) = \v -(a + )l t 2ntw lrnu.)
(.\l.r l.\) =
+
\fl
1'
2n?w
"- ,\ ,\ ( + ") \.1 '2 m w ,.--;-
and so
c
wri te
.
(.r)
and
I.\ ) is a coherent slate we have
Using t h i s relation we
and
p - (.\ l p l .\)
==
(' '....) . <..)" fi- )
(.\I( a + tzl)l.\) =
Sirnila.rly for tl1e IIHnnenhnn
1'
, 2ntw
((.\ l al.\) + (.\ la11.\ )) (2.09)
c3.:rl.\) (:r - (J:))I.\ ) =
\1.
=
n \j_ _. (a1 - .\" ) 1 .\) . 2utw �
p = i �(a T - a) we lta\'e
jim hw . :; pl. \ ) = v ' \ � (a - a)l. \ ) = '. Vr;;;r: � (a - .\ )I. \ ) r:
(2Ji8)
t
t
(2.60)
(2.61 )
2.
Q UANTUTvi DYNAMICS
49
and . /m!i.vJ '\ qvt - ( A - A ')
) ( •) (''))
. , , '
'-'•
/
and so . , ' ( p - \Jl! ) I AJ' =
, t ' iv/mliw � I" -· A' ) I AJ
f.) - i tV/ � 1. :-. pi A ) . rn lW
�
=;..
(2.13:3)
So using the last relation in (2.GO) 1
:-..r i A\ =
. � - ( -t)\1 -.- :-.f!IA ) = y� .2uu...r. .J V r hw n
IOW '-..--'
(2.G 1 )
pmcly imagin ary
and thus the minimum uncertainly condition is satisfied.
(c) The coherent stale can be expre"ed as a superposition of energy eJgen slates ,-;>;:)
n=O
<X>
n=O
l A) = I: l n ) ( o i A ) = I: f(n) l n ) . for the expansion coefficients f(o) we have
.f·r,, \ )
( 'fl. I 1\' ) = \' (1_ I f -1.11'/" t ·"' 1 10 ) = t -1 1!'/";\ fl I f� ·'"1 10) .
t - i ll'/' ( o l
. I X"' (nl(nt )"' IO ) f � (An.t )"' IO ) = t- i ll'/2 f -
m.=O rn .
I m.=O rn .
(2.66 I )I ('•)""' · ('�)
which means that the d i stribution of l.f( n ) l2 with respect to o i s of the Poisson type about smnc tncan va l ue n = lA 1 2 .
.50 The most probable value of 11 is given by the maximum of l11e distribution
l.f( n ) l' w l1 i cl1 can be found in tl1e following way
(2.68)
11(11 + l ) l' lf(n ) l'
of
is I .A I '· (d ) We should check if t l � e state exp ( - i r1l /h) I D ) is an eigenst ate of tl1e an ni hilat ion operator a. \Ve l1ave which means that tl1e most probable value
a exp (-i pljrt ) IO)
=
n
[a , ,l - •pl/h J] IO)
( 2.69)
since a i D ) = 0. For the romrm1tator in the last relation we h ave
I: :-! "' 1
n=O fl .
f:l �fl .
n=
E 2
iV I
.. '" 1 - i/ " -i / " n ( -) [u . p"] = I: :-! ( -) I: r/- 1 [u . p]p" - k . fl fl n =l 1 1 . k= I -i ( f ) " t 1'"-' i fl \ '-' k= l , i/ 1 - i / p n- 1 (- ) � ( ) n . n � (n - 1)1
/mC
( ·.
llll:uJ
uq '/ - ( - 1P' I .
\r 2 h
where we l �
_ _
( 2. 7 0 )
.
t . /m rtw ' V --:;:- r a, a - aJ -
_
rtw --:,;:-· 'v/m .·
So substit uting ( 2.70) in (2.fi'l) we gel a [exp ( - i pl/ h ) IO) ] =
'v 2n
(2.7 1 )
.
lexp (-ipljh) IO) ]
{[flu) -
( ').... . �/ ·...) )
which means that the st at e exp ( - ipljn) IO) is a coherent state with eigen /mw v a ] uc / V l. tJ . •
( e) Csing the l 1int we l1 ave I W J)
=
U ( l ) I .A )
=
, - iiiti" I .A ) C2�6l
,-ditJt.
f
n =O
, - 1 v1'12
;,.A " I u )
V 1l !
2.
Q UANTUTvi DYNAMICS
- j' '2 • -i-d /2 '\' . -l.\c -•d '
{ ).
Lc r;::;
c
51
·n.:::::O
Thns
_. - iu.;t ) n
(.
-vQ
In)
=
( 2 .66)
e
-i�t'2 ' 1 A' f' -iA )
\ ,-i-_,_·f )- = /\C �-1-l..t.lf/2 U AL \ ,-i-_,_·f L._.--id/2 1 /\L \ ._. - -icd ) , I _x, id i .X (l)).
a i.X(l))
C
( ')""" ' -'J) '' '
(2.74)
2.8 The quntun1 1nechanical propagator, for a partide with 1nass m,
moving in a potential is given by: '\"
( .T , 1J , r� ) .
t. '
_ -
ico il -J F;t j h /- -t. o
L.(
where :\ is a constant. (a) What is the potential?
I
( .I' , !J ·, I , i'J .) _ - . \ '\' L.. " L
, ,
sin ( nr;r.) sin ( ru",lt') E -
!3:... .!:.._ n 2 :1-rn
(b) Determine the constant :\ in terms of, the parameters describing the systen1 (such as m , r etc. ) . Vve h ave
l·)...... . "{ d) "
So
r;>, ( ;r; )
=
\(J !17.
-
sin ( n r:r),
En -
h 2 r2
--
2m_
fl_. 2 .
l•) "10) .... . I
)
For a one d i nwnsional i n fi nite square wel l poten l ial with si1-e [, t he energy eigenvalue Rn and eigenfllnctions qn (:r) an� gi v�n hy En = Comparing wilh (:?.76) we gel y =
r =}
r
tc 2 ll
-
'2 m
= �
for 0 < .r: < ol herwise whi le
:? r
- = - =} IT ih
.4
2.9 P rove the relation
/1
( ) ·
" ---:-
/,
·
2
fl 2 .
l•) "") .... . I I
l•) " " )
E. r
.... . t o
2rn·
= i IT
( :2.7 9)
•
d(} ( ;r: ) = o(r:) d:r
where li(:r) is the (unit) step function, and o(.r) the Dirac delta function. ( Hint: study the effect on testfunctions.)
For an arbitrary tesl fn nction f(:r) we have
1+�· d(}. ( :r)-j (:r)d:r d.r.
·
-rx'
. .
r+.
df(.:r) L.. (} ( " ! ----;r;- d" 0:
.
.
2.
Q UANTUTvi DYNAMICS f! ( r)f ( r )
I
.
: :;;
lim J(.r) .?:--t + rx_.
l:"
dO(;c) d.c
53 .
-
/- +o.c d[( :c)
.o .
- f(.r)
l
· (Lc
-
+"'
=
0
rLr
.f(O)
J(x )J(.e )d.t• =? ( 2 . 80 )
J( r).
2
2.10 Derive the following expression
2
,, .o,t
=
.
(
mw .
Slll , LJ
.T)
!( r + r 0' .
'
. '7 ') . r r' ) cos ( w .
-
.r 0 .rT
l
for the classical action for a harmonic oscillator moving from the point .t•o at l = 0 to the point xr at l = T.
The Lagrangian for the one diim·nsional harmonic oscillator is given by
(
�' '-' . r ,
· :r
)
=
·2 I -:;_ n Lr
From the Lagrange equation we lmve
·····
. 2 2 1 2rnw .1:
d .. . d OL ('."I) 2 .. . = 0 =? -utw x - - (ut x) dt o.c dt x + w 2 J' = 0. OL o.c
·····
-
(2.81 )
•
. -
=
0 =? ( 2 .S: 2 )
which i;; the equation o f motion for the system. Thi;; can be ,;olved to give .t(t)
=
,1 ws c.Jt
+ R sin wt
( 2.83 )
witll boundary conditions J:(t = 0 ) .r (i = T )
J:T
=
.I' o cos I.;.} T +
n sin I.;.} T
::::}
n sin I.;.} T =
J:T
-
(2.84) X o co:-; wT ::::}
.54 So - .r0 cos cJT . Sill u}f . Slll i.;;..•T . sin u...•T . + :ry sin ;..Ji - :r0 cos wT sin ;..Ji x0 cos u...•t
r ( t)
.To COS Wf +
rT sin =
:i· ( l )
;tT;.;J
sin u.fT
wl + .To sin w(T - t ) . Slll W T
·T
2 ( : .86)
=;.
( ·I . R") I
cos ;..J[ - .c0u....• cos .......'( T - l ) sin
'vVith these at hand we have I
"'T
I
·T
....
cJT
.� .� 1 1 l dt � m '1 (ri·) - � m :r:r - �mw2 .J:2
s
[
dt£(.T, .i:)
Jo
=
dt (�m.i:2 - �rnw 2 cr 2)
'l
1 -�m f dt.T [x + , ..':r] + .�) •
{2.1:1>2 )
: 111. 2
��� :r:r i'T
[:r ( T ) i. f/ ' ) - .7:(0).7 (0 ) l
[
]
0
�
.l'TW .:cow' (:rT - l'o COs wT) (;,·Tcos wT - J·o) - . . 2 sin wT sin wT rnw :r T cos w F - :r0:rT - ;roTT + ;r 0 cosu.//' , . 2 8111 V..J T
rn
-;c-
. . , 2 slll u..' / TJt...J
[2 [/ 2 + \ :r T
�,
]
2
I
:r0) cos wT - 2:r0:r T .
2-
,-
l
(2.88)
2 . 1 1 The Lagrangian of the single harmonic oscillator is 1 ·2 1 2 2 £ = :2-rn:r - - rnv ;r· 2 ...J
(a) Show that
where (;,·b. l b )
Sc-�
and
1\· r· b l· b l ._·r· () I-·() ,\ - \..-"-·-'"t' ""
[1..n. l · sd
(T' ( (') , l b ) () � I· o ) ·
is the action along the classical path G is
G(O. tb ; O, I,, )
=
J' d
from (.I:,, i,) to
2.
Q UANTUTvi DYNAMICS .
i l m.
.
m
j dy, . . . dyN ( .
J ·t fo�
. -
1\; -+�
)r ·� ',
{ . u:p �. [.
! "+ ' )
"� I
!\i
{-,,_ j=U
2
?
-
55 (Y1+1 - y, ) - .,-unu.ry-
rn ..
)c ·�'-
.,
.
.
1
) ·�
., ., .I
]}
where s = t',��{r [Hint: Let y(t) = x(t) - xc�(t) be the new integration variable, xc�(i) being the solution of the Euler-Lagrange equation.] (b) Show that G can be written as G
where matrix
n =
a.
2\ -t·::vJ
= lim
[ ] �1
(j:y
(.
m'
J
2 ttlflC J
and
11 T
(S+ I ) 2
;· dy1 . . . dyxcxp( -nTan )
1s its transpose. Write the symmetric
(c) Show that
[Hint: Diagonalize
a
by an orthogonal matrix.]
(d) Let (";:,'')N dtla = dtla't; = fJN · Define .i x .i matrices aj that con sist of the first .i rows and j columns of a't; and whose determinants are p1 • By expanding aj+l in minors show the following recursion formula for the p7 : j=
L.
.
.
•
N
(2.89)
(e) Let <,i(t) = cp; for t = ta + .ic and show that (2 .89) implies that in the limit E --+ 0,
.
.· . I1 uuba . . . I con d 1t10ns " . q( t w•t
=
t., ).
=
d•\(I=L ) l) , · dt '
I.
where T
=
lb - 1 ,, .
(a) Bt>caust' at ally givt'll poilll tht' posilioll kets ill Lht> IIt>ist>lllwrg piclurt' form a complete set, it i s legitimate to insert the i dentity operator written as
So
,.\Jim ./
./ d:r l :rt) (:rtl = 1
d:c , d:�:, . . . d:J: cv(:I;b/bl:rNIN) (.r N I N I :I: N- 1 / N- i )
( :r i+ , I i+ , I :c; l ;) . . . ( :�; , I , I:�;,, l ,, ) . -tr:-·_,
(:2.90)
··· X
( 2. 9 I )
It is
(2.92) For the second term in this last eq11ation we have
2.
Q UANTUTvi DYNAMICS
57
( 2. 9;l ) Substituting this in (2.92) we get (2.9 1 ) and this into ( 2.91 ) :
(.r:,,l& l:r,J") :\i -+X>
l1111 •
r
-
j d:r1
= •
•
j D:r exp { f,s[:r]} cr p d.r:N ( ,)-:-;,fu-� ) (N+ ,-
'{7/_ -. -
.
....... 1
•
,__
L et y ( l ) .r:(l ) - :rc� ( l ) =? :r(l) boundary conditions y(l " ) y (!6 ) have D:r Dy and =
=
=
8 [:1:] = S[t! + ret] =
1 ,,. !,1
[
[
''
[ ( !! +
d.I.
,s _o
11
1"
!,
+
.
.L
. t ,.
=
=
:J:c�,
(i))[ 1 i)� 1 [" [� . t,.
c.. ( .r ..t . :i:c�) +
S',d
'I { =
(}.I.
·
-
fl
'\' L.....J J=O iV
'
l
[-, '{7/_
- .r 1)2 - ;-·1 cmw2
( .r;+l
�).;c
)
·] } .
2
.r: .I
,_,
=
.i; + .i:,,t )dl
y+ -
1-
y(l) + .rc� ( f ) =? i( l ) 1i( l ) + .i:c� ( l ) with 0. For this new i ntegration variable we
x,;
[
-
d dl
(i))[ 1 o):� I. i)2.; 1 !/] i)( �. ') JI. y [''· � �m1/ - �m,,?y. 2] .L
x,;
.i;
+1
d.I.
.1_.
(
XcJ
y' + 1
f .L
X<e�
+
"· '
•
"
dl .
.
·
(2.9.5)
with G(O, 16: 0, 1 , ) =
!' }0.�. dy, .
_
·
·
·
_
dt/N _
(
Hl
')r! l·r,?.,_
,_, ,
c
)
t ,"\'ilJ
{
i
�
u:p /j L., _
_
7- ./ =0
[
Hl
) .c
'
,_,._
f Y;+ l _
- !h ) _
_ 2
. ]}
J 2 2 - ::;e mu! y1 _
,_,
•
.58
[m
(b) For the argument oi t he exponential in t he last relat ion we l 1ave
i� 1 2 2 2 f L..J '' ( 1/i +l - Y.t ) - -ry " mcJ !li l J=O
· N '
·t
�'--
')
Jj L..J 2 " ( !/]+t 7'
.i=O
·rn . ·--
N
- 2 ,� "f, I; rn
-
._ t � i.j=l
] (y, =Ol =
N 1 1.· ' 1 !Jj - Yi+ ' !h - !h!h+' ) - Jj L..J -ryc mw !li'l;1y1
"
�
')
+
-
_
}.
1.j=l -
('2y;i5ii!li - !Jii5i,1+t!J1 - y;S;+t.j!ij ) -
� .,nw
i,::- r n· .... �
,'2.
LY.Y +l =D) =
I\i
I; !J;O.;jff:,-(2.96) i,J=l
where t he last st ep is wri t t en in SlJ(:h a form so t h a.t t he m a t ri x synnnf'tric. Th ns w� h a.v� G
with
=
N -+ ·:_.:,
-1
rn
. . �
0 0
,,
m
'
(/ 27rzhc ) ";''
0 -1 -1 2 I) - I ,,
(J =
. lim
I)
;·
d.11 1
I)
I) I)
I) I)
a
= r /·t an [/'
=? a
So we ca.n d i a.gonal iy,e
T
rr
wh ich means t h a l T d;V n c_, ""
a
(J
.
1 0 0 0 1 0
I) I)
i::.-rnu.· 2 :!n
2 -I 2 -1
I) I)
(c) \Ve ca.n dia.gona.l i;�,e following will hold
.
+
2:cin.
=
I) I) I) I) I) I)
by a. 1111 i 1 a.ry rna.t rix
[/'Tan ( f/'+' ) T
=
U. Since
[/'Tan [/"
by all ort hogonal ma.t rix
= RT (Jn R
will be
.
.
0 0
--
dy N c.rp( - n T (J I> )
(J
a.nd det
R =
1
=
R.
a
a =?
So
i � synnnet ric !.he
[/' = [/'' .
( 2 .99) (2 . 100)
/
( '2.1 0 1 )
2.
Q UANTUTvi DYNAMICS
where
a;
59
are the diagonal elements of the matrix
(d) from ( 2.9S) we have --
dt.!o- =
ct;-r �l
7,c
1rt
2 -1 0 -1 2 -I 2 0 -1
dct
o-n .
0 0 0
0 0 0 �
0 0
0 0
c: 2w 2
2 -I -] 2
0 0
0 0 0 1 0 0 0 I
0 0 0 0 0 0
0 0 0 0 0 0
0 0 1 ( 2. 1 02 )
det u;v = PN · vVe define j ' aN . ,S' o
x
j matrices uj that consist of the first j rows and j columns of 2
del uj+ 1 = del
c_2 u)2
-I
�
-I
t: 2 W 2
0
-I
0 0 0
0 0 0
2
�
0 0 0
0 0 0
0 0 0
2 - C 2i..�.,2.. -1 0 -1 2 - f 2i.JJ 2 -I 2 - C: 2w 2 -1 0
From the above it is obvious that det uj+ t
PJ+ t (e)
We
w
_
�
•V
(2.10 :3 )
have
q( t ) = <;i(l, =?
( ·�) - c- 2 • 2 1 det. crj' - det. <7.'i- l -----r ( 2 - c 2 w 2 )pj - Pi - t for j = 2, :), . . . ,
+ jo:)
r;;(t, + (j + 1 ) 2: )
E p, . l = \")� - �- " uJ " ) �p1 - · - �PJ-1 c- JI;+ 2 2r;;(i,, + j::) - :: v}q(l, + j::) - r)(l,, + (j - 1 k ) 2 2 (2. 1 0 1 ) 2r:>( t ) - c: w r:>( t ) - q(t - c: ) .
60 So
q(t + c:) - q(t) = q( l ) - r?( l - c ) - c:2w2 o(l) � 0(t)-6(t-e:)
O(t+e:)-O(t)
----'--c---'----
Q'{f ) - d'/(t - F) - = li1n0 · '
-
e
e
= -w
Fmm
( c ) we have also that ,:: -;.
•
.1 -u.J""
c
c;
e
,. ,
'i')
(..? t.
I
o( l I
c
{p�� - =
.,
====>
::::}
dt2
-w·-
,
o( l II . •
.
( 2 . 10:i)
(2. 1 06) and
,f;(t,, + c) - ( t .,)
2 - c:"u/
-1
(2. 1 07)
---+ 1 .
The general solution to ( 2.10.5) is o(t ) = A sin("-'t + o)
(2. 108)
and from the boundary conditions (2. 106) and ( 2. 107) we have
+ o) = 0 � o = -wl,, + mr
which gives that q( l ) = A sin '-<. i(/ - L,), while
dr'l dt
·
=
Aw cos(t - L,) '
1 Aw = 1 � A = -
o'(L,) .
�
'
·'
'
"'
[(
Jj] e
(f) C athering all the previous results together we get lim
N -+:::o
)
' (N+ I )
'2n ·1 tu:. ? Ill
-.-
(2. 109)
(2. 1 07) �
( 2 . 1 10)
. sin vJ(t . - fc�) . q(t) =
'fhus
.
A = .·tW
n E Z
"
N
/dftcr
(2.111)
2.
Q UANTUTvi DYNAMICS
.[
) 1 12 lim (,2:nrn ") 1/'1. ( (.lim zJL
m
'
.
2 -n l
.
(2._!_1 1 )
1
!\; -+ '::0
/
.
!\; -t'X'
l/
( ) .
c:
v
2irtc "
-
EJlN
rn
.
.n
)-l/'1. � (
hrt t
.
.
.
[
u:p . n sm(wT) 21
mcc
( -}
smwT
.
1mcc .
] -1 /2
de!rJ
So from ( a ) (2.88)
61
rn
") 1 11 [r:>(hlr' /'
) . 1. �nl l/
'
-
(2. 1 1 2 )
[(:r:,,2 + 2 ) cos ccT - ,2:rp:, ] . :r,
.
J
2 . 1 2 Show the composition property
where Kr(:r 1 , 1 1 ; :J:0, 10) is the free propagator (Sakurai 2 . 5 . 16) , by explicitly performing the integral (i.e. do not use completeness) .
\:Ve h ave
62
J:� ] } X ) _ �2 r [ _ .c� { + ( 11 - 10 ) ( 12 - 11 2r. r r (1 2 - 11) 11 - 10) [ ]} [ fr�r, { � u , � 1, ) + (1, � �J .� ; - i;:'z, u , -� 1, ) + (!, � to) im- [ :r3 + .r6 ] } X { m '2fr ( 1, - 1 , ) ( 1 , - 10 ) '2r.tfr y ( l , - 1 1 )( 1 1 - 10 ) J. dr, { -'2ifrm [ ( / 2 - t,1 1 -)( 1lo1 - 10 ) l [ 2 - 2 1, - fo im [.r,(t, - lo) ·+ :ro (l, - 1 ,)]] } ( 1 , - 1,)(1, - l o ) "' -;;;; (12 - 1 1 )( 1 1 - 10) fr "' im [:r� (l, - lo) + .cc(t, - ti)] } m /t { '2ft (1, - 11)(11 - lo) '2r.rfr V ( 1 , - 1 1 )( 1 1 - 10 ) [ / dJ: , { -m [ (12 - 11)(11- lo- 10) l [.�: , :r, ( l , - lo)(12 +- J:/0)o(t, ] }] X { im'2{l ( l , - - lo) [:r,(l, - lo) + .cloo(f,) - ti)]'} r.2ili. ( 1 2 - 11)(11 - t0) { im m '2/i (1, - 11 ) ( 1 1 - 10) rn( l , - fo) '2r.itr V (!, - 1 1 ) ( 1 1 - lo) [ z;3 (1, - lo)(l, - lo_l J-:f, (l, - 1,)(1, - lo) lo) ( 11 - 10 )- - .r0 12 - 11r - :r2 :r0 11 - 10 ) (1 2 - 11 )]} ( 1 2 - fo) Y 2r. 2 - X + lo) + :I:G( / 2 - 1 ,)(1, - lo - t, + 1 , ) · {i2Trr_ t [J:j (i, - l o ) ( l , - lo( 1-2 -t, 10)(1 2 - 1,) ( 11 - 10) 2.r�:o2(1, -_lo)(l, -_1 , ) l } ( 1 2 1 0 ) ( 1 2 1 1 ) ( 1 1 10) [im(:1:2 - To)' ] 2Tr ( l 2 - 10 ) hrn(/ 2 - lo) ( T\1 .r2 , 12; .r0, lo) ,
rn
_
1/
1 V
exp
1
1
-.
_,
'1. (
r rr
exp
1
.
exp
1
.
exp
exp
1
_
.
. -
1i.
,
_ -
exp
•
1 I , )(I,
·"1'
1 \I , y
exp
+
(I, -
'
''
11l
exp
(
- I,)
'
(I, -
1
--
I
_
I,
'211i.
x
----,-----,----'-'--- - .,-----,.-,------,- "-----'--
ex p
I
-
in. ( I
-
.
.
111
(
,,
1
X
'2(
'2
I,)
. exp
,
.
'2' 1
•,;.,. 7
Q_ " '·[ DV''4. '·fi('C' L -4. �''Tf' '\' -- �l' 1 � -L �\ _.,J
63
2.13 (a) Verify the relation
where 11
=
rnf, = fl - c�f (P. J .-:
dt 2
m-
[
and the relation
dll
== dl
c
IT + -
1 2c
�
(-d.i dl
R - R �
x
d:f dt
�
x
-
)l .
(b) Verify the continuity equation up
�,
..,
cJL + v · y = O
with .J given by
j
_,
(a) vVe have
=
( tt ) .�.( . ,.c=,, ) - ( t 'rrJ
'l/)
_
tA; . . p:l
-
8 1; o:c; .
vVe have abo that
1 H] iTt . .
- [ :r • . �
-
c•A1
- -
_
R �� .
(' 8A1 o.r;
.'- 'lp
) -1.1 ' 1 '
'
.
l = - -f [p· A. 1 ] - -c lA;, p1 ]
i1i.c (
,
c
=
c
8
.
·
4;
.
o:c; .
_
c 8A ; o:rj.
r
)
(2. 1 1 4 )
[ IT' ] [ IT']
1 1 = :r ;.. =: ;. + up r iTt . 2m iTt 2m
1 {[:r;, n,] ni + nJ tabn 2itt TI; . .- l l j
Ul C
'
v '1;_'
[ , (' i1ir i1ir c c (i1ir' ) ----;;- -.:iJh p,
d:r; dt
-..5
·
. {[:r;, vil n, + ni [:r., p,]} [:r;, nJ]} = � lll21n.
1
64
['l.r:; Hl
dl. .r:i
� ih dt
dt1
· r, ·l 1 l 2rn
J.
d2
=
'
[II
I
ihrn
_
"
!1 2
2m
+
]
uo
·
{ [II;, II,] IIj + IIj [II, IIj]} +
-{--. [II, r;o] tan1
£
d/ 1
(2. 1 15)
m
(b) The time- dependent Schrodinger eqnation i s
. i)
I ,1.h " · (:z:' I cr, t0; t ) = (:z:' I H I cr, i0; l ) = (o:' I .!. m ul �
1
..., rrt
[ [
. ., -r,, - ·ul '"
-
..
t .4'(.r') r:
][.
., -r,, -ttl v -
·
1 (:_ . . - -n.2 'Y- , \7..... , + -!IN- , 2m c
+cr;o(:z:....,,. ) <, · ( :z:....,. . t ) ·
[ ., ,..,,, v
.
1
+
2rn
. -
.
r:
.
-a
. t. -+ , ,{, - .4 (.r )
[
_ _
v( .
c
1
u\ p - �
.
( :£
c
,
n�
-,
l0; l t2
+ n:) I o, t0; t ) + t"!';:'J(:r� ) (; x
2m
.
.
il(.r ) + 1n- i\ (.r ) \7 + --,-2 /\ (.r ) v (.r - . ___.,
.
c
f. -+ -+!
·
c
2 • -+f
.
��
.2
c
t.
1A
· · -+f ,t) (J· )i;:(J·
'2 · -+f
� �)
l
,.
· �'
,
+ up(.r ) v(.r , I )
��
_
_
_
_
l
1 t t 1 2 f ( + --,A'v + . - -tt'\7' ,:. + -it, \7' · A v + 2in - A V\· 1 ·
l
�
'I n� l0; l )
' ' -+!
, t)
e u·t 4� ) -r, �) ip. ( .r�� , l ) + :1: . / ) ' + -ce !a., (n' v · ,4 L (J· · v , I;J. ( x , l ) c
. ....., V v(J· ,t) + ..... ,
·
t ..t(.r'J
( �) ]; 1 )
c
J\lultiplying the last equation by
c'•"
c
w e get
·
c
•
��
t ¢,,:. .
(2.116)
2.
65
Q UANTUTvi DYNAMICS
The complex conjugate of this eqntion is , . () ." �'f .111:�·--w
•t
• ()I
[ -2
=
-- - h •r'Vt2 , 2rrt •
·
·
I; '
>< -
. C
-
c
, I
ii
(v --.o
I
)
___,
.
A 1
,
(
___,
2!1'1 -A. . c
Thus subtracting the last two equations we get
'i·'V ·
___, I
· '!'
"'
2 l
. 2 2 · · 2 + -, A. 1 1'' 1 + c r;) l
·
2 . 1 4 An electron m ':?ves in the presence of, a uniform magnetic field in the :-direction ( F3 = F3 : ) .
(a) Evaluate
[ H,., II,J
where
66 (b) By comparing the Hamiltonian and the commutation relation obtained in (a) with those of the one-dimensional oscillator problem show how we can immediately write the energy eigenvalues as l'\ . ,,
h"F (' ltfllh) ( " + -1 ) /TIC
-- + .2m
_
.....
,. '
--
.
where hk is the continuous eigenvalue of the nonnegative integer including zero.
The Inagen tic field iJ of the form
.
.
:2
f!cc
[.· . tA.,.
.• �c-t,;
B.r
_
(2.118)
2
�
t A,; ] (2._1:1 ") [·Jlo· + tfly . . c c 2c tB [ tB [ ihtB ihefl --- - p . r J + - 'if· J = -- + --
[ I I , , I l u]
_
l'o·
is a
= flz can he derived frmn a vector petential .4(:1')
A.,. = --- By 2. Thns we have
71
operator and
2c
eH ih-·-.
.
,
· l'y
.
_
.
2c
.. . "' ·
( --- c..l) = 2rrt1
'
....
· I'
2c
li
_
2c
tfl.•r ]
. {ly
'
,
2c
(2.119)
(h) The Hamiltonian for this system is given by
W I JerE'
-
1 H=2rn
-
:
., �m.
.,
-
.
1 - 1 IT ' + 1 IT ' am
.. l[
[
p
t
=
., �m.
II, , ll,
l
.,_
1 = -4rn� -, .
�,�
[(
,
2
_; 2 I ll
·
P.r
-IT + - IT- + -p: :.r
=
·
)
Y
,,
(
"C''C
eH11 '
+ -,-_ 2c
1
1 ., 1 ., = II, + rr, (2.1:?0) 2m_ 2rrt ""
.:.m. r "'
+
,
.
g,· ,., ,._,._
• .
l'y -
e H.r ' -,2c
)
•
·] = 0
p;
(2.121)
there existt-' a. set. of t-'itnnlta.JJ(-'OllS eigent-'t.a.tes n ) of tlw optTatort-' TI1 diJd H,. So if is the continions eigegenvalne of the opnator p, and n) it' eigent-'t.a.te we will l1ave
lk�
hk
lk,
112lk,
71) \
.,
=
p; . l A:, n ) 21n
=
Il�''!,;�., . -.
21n
(2. 1 22)
2.
67
Q UANTUTvi DYNAMICS
On the other hand II1 is similar to the Hamiltonian of the one-dimensional oscillator problem which is given by H
=
1
2m
-
p + -2I muJ"r" ')
·)
"
(2.1 2:3 )
.-,
·
with [:r, p] = ill. In order to use the eigenvalues of the harmoni c osci l l ator F, = n.w ( n + j) we shou l d have the same comrm1tator between the sqr1ared operators in the Hamiltonian. From (a) we have
So
H1
[II, , l l y]
=
t:B c
in. -
'*
( IIcB.c)
[ -'- , I l yJ
can be written in the fol lowing form l __ H ' I 2m 1 H' + 2m. Y -
r
(H"c) ' i
l l __ H ' = __ H ' j 1 2m 2 rn \ 2rn
1 7m '
( l cB. 1 )' ( II.xc ') ' !t
11
--
me
-- --
d3
--
In this form it is obvious that we can replace
uJ
(2.1 :2 1 )
in. .
=
with
cB
�
���� to have
k ) Ik ) -h2rn2-lF k, ) ( l c B i h ) ( [t:'k.' ( l rH i n ) (n �2. ) ] l k, n ) .
JJ, I
.
2m
u
+
+
JJ,
.
rnc
u
=
,
+
u
+
.rn. c.
" +
(2.125)
;:;-
1\
J
� ?
lk, rt) (2.126)
2 . 15 Consider a particle of tnass m and charge q in an itnpenetrable cylinder with radius H and height a. Along the axis of the cylin der runs a thin, itnpenetrable solenoid carrying a tnagnetic flux . Calculate the ground state energy and wavefunction. In the case where R cylindrical coordinates is
0 the Schrodinger equation of motion
1ll
the
(2.1:27)
68 If we write W (p, q, : ) = ( q ) R (p ) Z ( : )
am!
F = ";,�F: we will have
R(p) Z ( z ) d" I dR .. - . z .. - . z . d" R + ( ) ( ) , + '±' ( c,>j � .c '±' ( c,>J. � " -dIY' dp p i,p p -' ·
.
+ R ( p )
. .
,
_ l _ rP R
re z
, +
R i p ) <Jpc
with initial cond itions
So
·
.
, e" + k- R(p)il>(q)Z( : ) = 0 =? .
d __ l dR ,
Ri p)p <Jp
\TJ (p., c[J , z )
+
7_-" _ •J 'J _ l_ d2 Z I d2 il> 0 ( � . L8) , + �c , + , ( . I pc if> JP)
= <]) ( R , q, : ) =
w i t !J Z(O)
0
Z(a )
=>
A1 + D1 = 0 =? Z ( : ) = A1
0 =? C sin la = 0 =? Ia =
So
fl7f
'� (p, 0, 0)
((110 ---
( - ilz
'� (p, 0, a )
= 0.
) = C sin /:
=? I = 1, = ��� ([
=
n
= ± 1 . ±2 . . . .
Z( : ) = C' sin l., z
( 2 . L!O)
Now we will lmve
I dR I tl'
d' R
-- --
=? =>
1
rP
-. -
. ., =
(q) do-
I.
- ·rn 2 ==>
. (
wit!J if>( s> +
:ZD)
.
=
c
±•mel·
.
=
So tlw Schr()dinger equation is reduced lo
dR rP R . • 7_ __ P -- - m 2 + p2 I '-'2 - 1 2 ) = 0 + -· 2 li(p) dp H(p) dp p2
-- --
( 2 . Lll )
( 2 . J :l2)
2.
Q UANTUTvi DYNAMICS
69
[( 1•2
d" R l dR p - I) + - - + " - !" ) - p2 dp2 p dp J" R l dH m2 '* + + 1 d( vfk" - l"p)" - (h:" - 12)p2 v!F - l"p d( vik" - l"p) =? R(p) = A,Jm ( VF - i2 p) + R" ;Vm ( Vk2 - l'p)
=?
m" ] R( ) -
[
In the case at h and in which Pa --+ 0 we should take n, = 0 since when p --+ 0. From the other boundary condition we get R(R) = 0 =} AJm ( RVk2 - 12) = 0 =} RVk' - I' =
Kuw
lR
(p )
=
0
(2 . 1 33)
Nm
--+
oo
(2.1 34)
Knw
\vhere is the u-t.h zero of the rn-th order Bef-lsel function .Iul. · 'T'h if-1 rneans that the energy eigenstates are given by the equation
=}
- 2m [ K:.·nv
fl" E - -
--
R2
+
(· '.' )']
(') l dd •H J
11a
._ .
while the corresponding eigenfunctions are given by
(2. L36)
with n = ± 1 , ±2, . . . and m E: Z . '\ow suppose t!Jat B = Bz. \Ve can then write .i I"I
=
( B. .� ) .' (. q, ) ' p '!.p
__
C:.J
=
211 {J
--
( 2.137)
<.-'J .
'I'he Schriidinger eqnation in the presence of the magnetic field B ca11 be w rlt.ter1 a.s follows
[
. - -111 V -
1
2rn
"
�
t.4( £J c
--
][ ·
-111 V "
�
t.4( :fJ c
--
]
•r( :r J .
�
=
fc· •r ( :r J .
.
�
(2. UR)
70
-: _ak.·1ng .\1
· D,_�; = IJO\V t)Je tran,'·aronnalton .
[ ') •' [ ')' �m
. r') r p. + z. � m rip ric
' - :;---
12
- :) 17,
.
( -,----()p'
\VI Jere D2 = ( /� ,-Jd> .
0
0,
'1 =
-
( -ihpil_'2
+
·I (')
__
. 1
p
.I
·
pcp
p
.
·;' ]
(
, 2-r. ) 2 . Leting .
2 ,. t _
_ _
_
(
A=
ic tJ.: 2 Jf
\Ve get.
' . c) . r') ' 1 . + z. p+ q;- D6 rip r)z p
[
2 1 + ---, J)'' + -,--' -- -:c)
:F--:- � _
l
+ o- D6
'l 00, -
z
ljJ ( r ) --->
t') ('_dfio_' '
=
0
_
-
"i
1
()2 .. drp2 .
[-
.A d
- 21 '
iJ,,,
. ,
-
A
']
= -m
2
=?
-d'-"'
dr;)
_
d
. - 2 L l -. .
The wlution to this equation is of the form tid'. So
!' e '" - '>�l' ..'\/ r !'' + ( m_ ' .
- ..'l ' 'J e ''' .
=
�-7 l' ll ---_
_
-
_ ''V " (/). ) -
(
But
'fhis
=?
nwans
but TJO\V be
rn
m
=
=
-
)
=
E �·(:r)
2i \ ± 2im 2
�
=
(2. 1 40)
.
A. 2 )
=
0.
- ./-l' ' )
i(A ± m )
i(A±m)o . (' -·2 t
=
·
R(p )( rp ) Z ( z ) )
.
A'
' . ./-l' I. + (.rn ' ·L.l
----
\vllicll 1neans t.lla.t
�
(2. 1 :39)
fi
Following tlw same procedure we used before (i.e. �·(p, ��. z) we will get the same equation> with tlw exception of ·
.
v(l')
--j
=
,,- � 2 -r: we 1ret
;"';
' f. ljJ ( .T ) .
l
(2.141) m' E
Z
(2. 1 42)
that the energy eigenfunctions will he �\nwf:C) =
AJm(
h'�y p )einc¢ sin (n �12)
is not. a.n integer. At-' a. r(-'snlt the t'TJ(-'rgy of the ground �tate \vi11 r,'. E- 2m
[ ";nv
-R'
'' + n. .
(a
)']
(2. 1 44)
2.
Q UA.NTUTvi DYNAMICS
71
where now 1 n = tn ' A_ .is nol zero in general but it corresponds to n/ E Z such that 0 S: m' - A < l . �otice also that if we require the ground slate to be unchanged in the presence of IJ, we obtain .flu:c IJU a nli::alion
-
,
m -A=0
tu: r
271
=
=? - --
m
1
il\
=?
2 . 1 6 A particle in one dimension ( constant force derivable fro1n
-
(,\
v = :u:,
=
'.!Ttrn'tu·
= >
(-
< .r <
' rn
=
E Z.
' 1 4 ;)" \ '-'• ) ''
) is subjected to a
0).
(a) Is the energy spectrum continuous or discrete? Write down an approximate expression for the energy eigenfunction specified by
E.
(b) Discuss briefly what changes are needed if � · is replaced be v
>- 1 4
=
( a ) In t he case nndcr ronstrnction t here i s only a rontinnons spcctnnn and
the eigenfunctions arc non degenerate. From the discussion on \V l\ H approximation we had t hat for r�: > V(:r)
1/'J(:r)
=
(.!_ j ,f2mW - �'(r\Jb) B( .r ']' exp (-� / J'2m [ E - Y( J·)]d:r) · sin ( �n. j'' J2m lE - V(:r )]d:c - �)I ��( .1· ) ] 1 ;4 ) 1 /2 d:r· - -) . (�lo·'=J;/' ( F c '
, cxp [ �· ( ;r )] 1 r4 h,
·
1
E ·
, . .Jc·
[ r?
[
A
_
- \•
.1
,4
lll •'( · E - � ..c )] 1 /4 S
w - �'( :r )] 1 /4 Sill c
.
1
.
[
'
'
11 .
,
,
r il
' \ -
J
E ) 3/' {2;;:): - 3 ( ".\ VT - 1 2
T
' "
·
"',
"
]
"
4
('2. 1 46)
72
( \)l j:l
R - .c and n = ' where q = n -J: . ;�On the other hand when E < �\e ) Vn(;r )
[
=
l
(2. 1 47) \Ve can find an exact solution for this problem so we can compare with the approxim<1lc solntions we got with the \VKB method. We have
=>
(pi Hio) = (pl£1o) (pl . - + >..rio) = E (plo ) 2m Hlo) = Elo) p'
=?
p' -n(p ) 2m
=?
_!}_ o(p)
=?
n (p )
dp do(p) --
o(p. ).
=?
In
=?
' ) nJ.. (Jl
\Vc also have
=?
d dp
+ in>.-o(p) -i -
= =
' -i ' . ;,; - ." .
Ho(p)
( ) o(p) ( ) 11>. -..- (fr..'p - ) [ i ( p" r.p) ] .
=
=
=
h).
f, -
,
2m
p2 . - dp 2m
,bm
p"
-i
h ).
c exp
lz).
G
m -
+
c1
( 2 . H 8)
'·'
( E I E') = 1 dp(Eip)(piE') = 1 n/;; (p)nR' (p)dp i " ( £' E )p 1 I ' 2T
J(E - £')
c
1
So o R(P)
=
[
-
[
1 i � exp " aA v 2-:rn.>.
I
v' ' , -
(-Gm-
]
I
c
(2. 1 49)
)]
Ep
.
(2. 1 'JO)
2.
Q UANTUTvi DYNAMICS
73
Tlwse are tlw IIa1niltonian eigenstat.ef-1 in rnornent.lnn space. For tlw eigen funct.ionf-1 in coordinate f-lpace we lEtve
(J . I SO)
,- f !lj.JC.;,�-' ,._ ,/A (i::, -F:p)
1 , .. 2r.r / A 1 p" i 'E i·-.- - ,..,dpexp . . h hA6m A 'J.r.n /) .
=
' ( :r )
j' 'lp(:r' I p) i
: J> I E)
=
[
I
(
n
l
"
.r
- -
,J \
( '2. 1 5 1 )
p
Using now the substitution 11 =
p . ---"-.,. p" = u" (h2m A ) 1 i1 hA6m 3
.
, . ,..- j ;+···• duexp 2r. J:\ -·· · , . ' and q = rE = ( -;-�- ) _ +r:..·,
(.,fl'2 (flA' ) 1 I"'" 2r.h V A
-0.
o
--
where
n
. .
since J!��· sin
('
'
('�;'
-
n
uq) dv = D .
Ai(q) =
I
�:;:
we will lmve
[.
., (:L(' _-
i u"
3
- -
-
3
.T
l.
l
-
iv.q
.
E
( J. h A l
/
...,-
"'
- -
.r
u
( li'J. m ,\ ) 1 1 ·
l
·..;
('2.1 5:j)
So
In tenn,; of the Airy function,;
!.+ex•
yK . o ,-
[
du exp
('2.152)
--
�
co,;
(
""
.
.3
)
- vq
du
( :?.154)
n ·'I ' ( -q ) . r--7'
y r. .\
.
(2.155)
For large lql, leading terms in the asymptotic series are as follows
Ai ( q)
A. i ( q)
" q1 .14
·) r,r �v
I
;:::; ""
yi7r(
(
:112)
. 2 exp -...,. q · 3 2
[
f SITI ( -q) -q) ' 4 J I
.
,
"" •
q>0
+4 :
"]
(2.156)
q
...., , '[ ,�) 7I ) \''J
74 Using tl1ese <tpproxirnations in ( 2. 1 55) we get
�
I
!\l74 "lll 7r V .\ '1 o
11
� ·
.
I
·
hy.\ 1 -q) •
[2 2 :;; [ "l4 for E > \ "( :c) f. · E· ' " :c ) - -q 312 -;:;'1 )
·
+- ,
1 1 1 exp
-(
'> -
3
)
l,
as expected from the \'VKB approximation.
(h) 'When V
=
or
c < v· (
" ' ( :..'! . I :>1>)
.\ l:c l we have hound states and therefore the energy spec
trurn is discrete. So in this ca.se the energy eigenstates heve to satisfy the consistency relation
1,, d:c .J2m[E - .\ l.r l] :l '2
The t.nrning points a.re
(n + �)
rrli.
=
;�: 1 =
=
(n + �) rrh.
� 4 a.n d :r2
=
lf.
11 =
So
0. L 2, . . .
(2. 1 59)
:3.
3
THEORY OF ANUULA.R TviOMENTUTvi
75
Theory of Angular Momentum
3.1 Consider a sequence of Euler rotations represented by eXjl
. :!
(-icr:;n) 2 (· (- -
.
f'XJl
f'XJl
i(.) +,..- J/ c·c)s - , _ !}_ ?
. :!
( -icr2,i3 ) (-icr:n ) 2 -e-i(,:<-�,)/2 ) ,s1 _ · 1.1 !1
ti(,, +�>')j '2 cos 2
2
�
ti(o.--�>')j '2 sin !}_ . '2
'2
·
• '2
-
Because of the group properties of rotations, we expect that this sequence of operations is equivalent to a s ingle rotation about son1e axis by an angle <(J. Find <(J.
.
·
In the case of Euler angles we have . ·vt' l'l ( n. , o.J ,
�·
1
,-i(nh)/2 COS f!. . ' rl (-J ( o - •I I · -
.
)
.....
-
(
( · .) 2 •
( :.;, . I ,
)
2
SJJI � ,_ 2
while the sarne rotation will he represented by ( 1 / 2 ) .. [) '�-
( n) ·
·
"
(S-:�. 2 ..1ii)
=
( �1\' · · · ( '• ) (·· · inx · · · n. y) sin . (�) (·· · in,. + 11u) sin (%) (d·). + . ( ) /
COt\
.
. f:\1. 11 '!'l12
.
'.2
COS
2
. t1l -::; Sill
'' 2
) ('' 2. .
•).
)
Since these two operators must lJ
=}
COt\
( .)
'
2.
9
COt\ CJ
=?
(�)
.
()
= cos
-:-
::::}
.
.
;i . )/ 2 cos :... ' 2 ._ = cos - - rn -,. sin
(; _ ,'_ ( _) -r�r _
•
2
=
;i (
2 -
2 COS - COS
•
I
:?
2 ( o + l)
-
[·
2
2J
:2
-
.
"J
.3._ cos2 ( (} + i) - 1 . = arccos :? cos2 :...
2
2
((!)2 ) _:__
l
3.2 An angular-n1mnenttun eigenstate l_j , m = mu1ax = j) is rotated by an infinitesimal angle E about the y-axis. Without using the
76
explicit forn1 of the d���m function, obtain an expression for the probability for the new rotated state to be found in the original state up to terms of order E ' . The rotated slate is given by
l.i, j ) R
=
[
R(c, iJ ) Ij, j)
1 -
· 1
=
l
[
dU1(c:)ij,j) = exp
't + ( -;;,"22c:2 J�
li .i)
(-- i -�'6 ) ] l i, j ) .
(:3.4)
up to terrns of order c 2 • \Ve can write J!l in terrns of the ladder operators J,. + i J,, J J 'j _ _ Subtitution of this in ( :l.4), gives
J+
=
y - - = · ;j· ·--- t ,
[
ILi) n = 1 -
}
J _ � ', __._
: ( -h - L ) +
R
, , -ll
�
!
l
2
J+ --- L
'21..
(3. 5)
.
( -h - J_ )' lj. j)
]
vVe know that for the ladder operators the following relatio11s hold
-h l j, m ) J_ lj, m ) So
V h V(j + m )(j - m + 1 ) lj, m - I ) h (j - m ) (j + m + 1 ) lj, m + 1 )
- L I.i . .i)
( ,h -- L ) lj. j)
-t,
=
fiJ I.i. .i -- 1 )
fij (J+ - L ) l.i .i -- 1 ) - n fij ( J+ I.i .i - 1 ) - -'- l.i..i -- I ) ) -h fij [ fiJ I.i. .i) - /2\2.i - 1 l l.i. .i - 2)]
( J+ -- L ) " lj. j )
-n
(:3.6) I) ('3 � •
.
(:3.8)
( ;1 .9)
•
•
and from (:Ui)
)
)
.)....., yr::;-: �J 1J,. J· - 1 I' - ,_<...:- ')-J_. 1.J,. .J' ) + ;:'<...:-,_' ')-\/.l ( ':>-.l - 1 ) 1J; , J; _ ')' -1 I J,. J 'i + ::_ . 2 J- I .J'.. .J' - 1 I1 + 62 \i' .J' ( 2J' 1 - 6 i l i- i1I + = Vr:;:: . - I ) I .i-. .i --- 2\I �.! 2 --1 --1 . . . .
(
'
"'
)
'
:3.
THEORY OF ANUULA.R TviOMENTUTvi
77
Thus the probability for the rotated s tate to be fonlld i ll the original state will be (3. 1 0)
3.3 The wave function of a particle subjected to a spherically symmetrical potential 1/(r·) is given by
<;·(:!')
=
(:r + y + 3�).f(r ) .
(a) I s ,;; an eigenfunction o f 1:? If so, what I S the /-value? not, what are the possible values of / we may obtain when [2 measured?
If IS
(b) What are the probabilities for the particle to be found in various m1 states?
(c) Suppose it is known somehow that \'·(:!') is an energy eigenfunc tion with eigenvalue E. Indicate how we may find V ( r ) .
( a ) \Ve have
So
<;·( I') = (:l'l v) = (:r + y + 3.: )/( r ) .
, ._e 1 L-2 1 . ) cs-.3.6. = 1 5) r·
1
,2
-a
( 3. 1 1 )
[ 1 iP 1 iJ (sm. ., iJ )] --,--,11 -- 72 d(;'J SHI
+
....,11 - -,--11 SITI (}
'
!1
')(!.
,(_
)
v .e .
(:u:?)
If we write �·· (:1') in terms of spherical coordinates (l' = r si ll li co>c!;, y = r sin (! si n i/>, � = r cos (! ) we will have
1 (Jl
�·' ( :1') = r .f( r ) (sin (! cos
Then
iJ
C
i/> + sin II sill i/J + 3 cos (! ) .
(H:3)
r f(r) . . . . - sm. rp). = -, . r .f ( r ) sill li --2- -,-( > (cos rp + sm (' > ( ) ,\. 14) . , . co1' (' e) = . . . v(: . , 1 1 (} . {) Sill {) Sl1J C C.'J2
_
C (/)
Slll
7S and
ij ( · d ) · ( �
-,--- Slll f) -,--Sill 0 00 00 --:-1
i;J
. .
.
]
r {( r) a . -,--- [-;j Slll /) + ( COS r) + Slll (Y ) Slll /) COS /) Sill 0 00 r {(r) c . . . q) (cos"·• II - Sill 2 fi) j 3. .I :) ) -·. · ! -6 Sill fi cos I! + ( c os rp + s1n
!C
)
.•
.
=
--':-'----
"
Slll I) '
.
,
Substitution of (:l.14) and (:1 .15) in (:1 . 1 1 ) gives
[--sin . -1-(cos + sin o){1 - ws2 /) + sin2 /)) - 6 cos I)] · · · li ' · 1-2 sin2 B(cos + sin r,)) + 6 cos /)] h 2rf( r) [-.Sill 0 · ·
- h2r f(r)
o
2h 2 r { ( r ) [sin 0 cos rp + sin 0 sin �' + :1cos OJ = 2h 2 ,;:. (:C) 'J.h' l!'(.i) = 1 (1 + 1 )h2i;•(.T) = I ( I + l)h2i;"(.T) q
!. I,'" ( 2
)
� .T
=?
( :3 .1 6 )
which means that lj;(:i') is en eigenfunction of L2 with eigenvalue I = L
(b) Since we already know that I = 1 we can try to write 1:-( :f) in terms of the spherical harmonics �m (0, q ). \Ve know that yI+
1
=
y-1 1 So we can lvritc
v(.f) =
/''f \y/ .::.::_ f i r ) L :l\1'2Y 0 + y - l �
i
'
3
·
•
1
1
v r
r,
/2rr
(1· '-1 17+1 ) 1 - 1
i yl�r (1�- 1 r/ 1 ) !1
+
- r-1+ ' + i YI+1 + ;yI- l l
v{2; �;' rf(r) l3\1'2Y,0 + ( 1 + i)}�- 1 + (i - 1 ) 1/1 l .
(3. 1 7 )
But this means that the par( of the slate t hat depends on t he values of can be written in the following way l �·)m
=
N
[:3/:211
=
Lm
=
0) + ( 1 + i ) ll = L m
and if we wan( it normali�ed we will have I N I 2 ( 18 + 2 + 2 )
=
1
=}
N
=
=
-1) + ( 1 - i ) l l
I y22
�·
=
1, m
m
=
1)
(3.18)
]
:3.
THEORY OF ANUULA.R TviOMENTUTvi
So
9 2 I ll = l. m = O l c· ) l ' = · = _!!_ . \ . . 22 11 . ') 1 1 1 1 = l . m = + 1 1 '··· )1' = ....:... = - . \ 22 II .
P( m = 0) P(m = + 1 )
79
_
T
P(m = -1 ) -
1 ( 1 = l . m = = 1 I ,;W =
( :) .19 ) ( :].20)
I
2; = u· ·J
( :].21 )
(c) H ,::r;, ( .i) i s an energy eigenfunction then i t solves the Schriidinger equation 2 a 12 -h2 ,-p (h·' �:·r;:( Z) + -; (J �·r;; ( l) - ; , . , c·r;(l) + V (r) �·r; ( :l') = ;, 2m r
[
E if•t;( l')
�
V(r) V( r )
]
2 d 2 -h 2 -m d2 . . lrJ(r)] + - - lrJ(r)] ·-- --;:- lrf(r)] + 2m dr' r dr fi:r {(r ) Y"' � I
. - Y1
·
c
c
E + -.-1 - ,11 ' . .
\ ' (I' )
[
]
•
r .f(r) 2m
[_!_i. [ dr
f(r) +
/' ( r
r
r' )J + �[f (r) + rf'(r)] - � f(r)] c
r
1 ;, ' [J' r r ) + I' ( r ) + r r" ( r ) + 2 r'f,. lll + -rf(r) :!m h2 rf"(r) + 4f' ( r) E+ 1'/( r ) 2m !i:
·
·
\
·
·
·
·
·.
·
,
( r ) r .f ( r ) 1 1 •
·
,-n;
=
�
r
�
·
...., ...., (• 7 •) , ')') ;
3.4 Consider a particle with an intrinsic angular 1non1entun1 (or spin) of one unit of fi . ( One example of such a particle is the g nleson). Quanttml-Inechanically, such a particle is described by a ketvector lc) or in :1' representation a wave function o· r. :r�l' = \� :r�·) .,. I '-o,I1
'-
where 1 :1', i) correspond to a particle at Z with spin in the i:th di rection.
(a) Show explicitly that infinitesimal rotations of g'(:f) are obtained by acting with the operator
)
80 -.
f. where L = _,, ,,
'
x
(b) Show that
v. Determine S -.
l
.....
!
S commute.
and
(c) Show that S is a vector operator.
(d) Show that � ator.
x
q( l') = �' (S · fl)q where iJ is the momentum oper
(a ) We have :)
l q) = I: t=l
j li; i)(.i; i lq) = I:1 j I F; i)q' (.i') re r. :'\.
.
�
1-=
Under a rotatioll R we will have
I !!') = U( R ) I !!) = I: :J
!=l '
I U( R) [ 1;1')
:2:
li)] 1!\i')d':r
I: I I R:r) ex·· li)Dit1 ) ( R)qtci'JrP ,· ""'Jl�' I: I I :C; i)D)il ( R)l( W ' :f)d1:r J
t=l '
(! I T ) ,, �
3
I: r=l
J
r=l '
I l i; i)l!" r)d3 :�· =?
Di11 l ( liJl(n- ' r) <
=?
i/(r) = r!!l( n- ' r).
(:3.25)
Under an illfillitesimal rotatioll we will have s
.. u
(:3.26)
o"' (' r�)
�
'
On t he ot her hand
�· R( o'rp ) I!�·( h,_ , :r) = R(. o'rp ) I!�·( :r� - E� x r�) -. q( t - ,E X :r ) + E� X [!�( :r� - E� X t�) , .
- .....
.....
.....
(:3.27)
ili i'J - ( f x rJ . �iliTJ = !l(.i) - f. (r x �J!l( r) l ( ' �· � � [� �
)
q(:r - 1 E .
'
.
•
•I! :r)
(:3 . 28 )
:3.
81
THEORY OF ANUULA.R TviOMENTUTvi
where '¢§'(.1')
=
['\"g;(;l')] li).
Fsing this in (:!.:?/) we gel
i!U'J - * "'. li!( .CJ + E' x [i!(.C'J - * "' fi!U'l]
?'(.C'J
.
Tint
( 3.:50) or in m<'lt.rix forrn
[Er ( � 1 -1 ) -* [Ex ( � 0 0 0 0 0 0 ih
+ cy
0 - if>. 0
which means that
-10
( -1 n u n J ( �: ) y + E, ( ig E ( � ) nJ u� ) ) 0 0 0 0 0
+
c,
0 0 in 0 0 -ih 0
+
o
with (S,,)u - in.te,,u. =
Tlms we w i l l h ave that
g(.C)
=
Ud(.f)
=
[l - *( · ( l
+
s) §'(.f) =} u,
]
(b) From their definition it is obvious that only on the I I') ba sis and S only on li). (c)
l
=
0
=
and
1 - *t·
- in 0 0
( l + S). (:ul)
S commute since l
acts
S is a vector operator since
[S,. · · S I·],-nn .
[S;S:i - SjSi]!nn
=
I; [( -itt)q.t( -in)tjtm
- ( -itt)tjht( -ih)tiim]
[1l2tiktt_jml - 1l2t_jt:!Eimt] t?· 2:::: (Ji_jJhn - JimJJI: - JijlJinn + JJmJb ) 112 I: (J,m shi cl;m clj;, ) n2 I; C1jtCkm! L in.Cijt ( - itu:kmd L i fl.c,jt(St hm . L
-
=
=
(3.:3:2)
(d ) It is
�
X
i.l(i')
1 and j2 3.5 We are to add angular momenta j1 1 to form j 2 , 1 , and states. Using the ladder operator method express all (nine) j, m eigenkets in terms of IJ1 .h : m 1 m 2 ) . "\Vrite your answer as =
0
=
=
L m = I ) = �1+. 0 ) - � 10. + ) , . . . , where + and 0 stand for m1 ,2 1 , 0, respectively. lj =
.
v-
v-
=
\Ve want to add t h e angnlar momenta j1 = 1 and .i2 = I to form .i = . h = 1 , 2 slates. Let us take lirsl the slate j = 2, m. = 2 . This stale i s related to l .h m1 ; j2m2 ) throngh the following equation
l.h - .i2 l , .
.. h + .
0,
.
(:UG) So "'Uiug j
=
2,
m
=
2 in (:3.%) we get
I � we apply t h e .]_ operator on th is statd we will get .I- I i = :2, m = 2 ) = (.J, _ =?
=? '*
+ -h -) 1 + +)
l>v(j + m )(j - m + 1 l l.i 2 , m = 1 ) l>v(.h + m l ) Ch - m, + 1) 10+) + �z v� Ch + m , ) (j, - m , + 1)1 + o ) v'41.i = 2. m = 1 ) = /21 0 +) + Y2'1 + 0) (:u 7) l.i 2 , m 1 ) V1.?- 10+ ) + V1'?- I + o ). =
=
------
=
=
=
:3.
THEORY OF ANUULA.R TviOMENTUTvi
L lj = 2, m = 1 ) VGIJ = 2, m = O) VGij = 2, m = o) -
83
� (J1- + J2-)ID+) + � (JJ- + J,_)I + l ) =? � [01 - +) + 0100)] + � [0100) + 121 + -)] =? 2100) + 1 - + ) + I + - ) =? ( 3.:�8)
. L lj = 2, m = ll) = /23(J!+ .}2- J IOO) .
V
1 -) ..., ( .11 - + .;,_ ) 1 - +) =? + y�(.]1-h-)1 + + + 6 yi(, - -- o) 1210 ---). V6 1 1 VGI.i = '!., m = --· 1 ) = \ 32 [ v'21 + ] + 010 -- ) + V6 121 -- o) 1 - o1 =? - - o)· -I V21o-) -01 IJ = 2, m = - 1 ) - =-010-) =v21 + + + G G fi G ' . 1 1 IJ = 2, m = - 1 ) - ;;; Ill -) + 0 I - OJ ( :U9 ) v-
/
=?
·>
'>
v�
L lj = '2, m = - 1 ) - �U1y'2 · + J,_)I0-, - 1 + �(J12 + J,_ ) l - 0) =? 1 121 - -) 1 01 - -) =? v41J = 2, m = -2) +v0 v0 IJ = 2, m = -2) I - -). (3.40) Now let us retmn to equ
1!.
=
=?
=? (1. =
=?
84
l.i = L rn = 1 ) should be !lorrnalized so
I ,, ' '' ' ) ') . =? _ I a I - = I 7 I a I = (J = I , m = l Ij = l , m = l ; = l 7 I a I -) + I i, i -) = 1 (:JAl
In
I
'
By convention we take a to be real a!ld positive so a = �� and That is
T
�
�
tl"'
1
b
= - �S" · v-
IJ = L m = l ; = l2 I + 0; - l2 I 0+; . .
'
Using
1
v -
12
I
I
I
same procedure we used before
IJ- = 1 ; m = ] \ ,
I
121.i = l , m = O) I J = I , m = D) L lj = L m = O) 121J = L m = - I )
I,J- = L. m =
--- 1 \ I
_
�(J1- + h - l l + -) - �(J1- + lc- ) 1 - +) =? I
v-
12
-
.
I
12
-
v -
- v21o-) - - v21 - o; 7 I
.
I
f'5
I D-) - - I --- 0)' -= . I') v-
(:!.45)
'
v -
Retnrnillg back to ( :L:l5) we see th
l.i = O, m = 0) = c, IOD) + c2 l + - ) + c,l --- +). This state should h e orthogm1
f2
1 I :J"' + Ji(" + Ji(" V I
I
(J = I , m = 01J = O, m = 0) = 0 =? 0c2 - m r:z· "-1 .
_
_
v-
v
(:!.4 7)
:3.
85
THEORY OF ANUULA.R TviOMENTUTvi
U.sing tl"' ];wt relation in (:3.47), we get ( 3.49) The st ate lj
= 0, m = 0)
shonld he normalized so
(j = O, m = O lj = O, m = 0) = 1 =?
1
h i = 13'
By conv<'nt.lon vve take c
c
1
2
= - v�3 . Thus
IJ =
0, rn
t.o he
T"('a.l
=?
jc, l 2
+ hl2 + i cY = 1
arHl positive so
I
c2
1
1
v ,)
v •)
=?
:J 1c2 l 2 = 1 (:3.50) 1
and
-::7\
= 0) = I - +) - I OO) . I + -) + 10 10 10 v ·)
So gathering all the previous results together
= 2, m = 2) = 2, m = 1 ) lj = 2, m = 0) IJ = 2, rn = - 1 ) IJ = 2, rn = - 2) lj lj
l.i = L m = 1) l.i = L m = 0) lj = 1 , m = - 1 ) IJ = 0, rn = 0)
I + +) +IO+) + + I + O) 1 I + -;' + 1 I - +;' V�:i I OO) + fo fo 1 j - 0) ....1. ., 0-) + 1 . .;2 .;2 v �
v -
-
1 - -) ,fi l + O) - )2 10+ ) }; 1 + - ) - v�-. 1 - + ) + l o- ) - .�, 1 - 0) l - +) � -) + -k v3I + v3
(:3.:52)
v ""
v -
v-
100). - --',.. y .3 '
3.6 (a) Construct a spherical tensor of rank 1 out of two different -, ·r( ' J,o m · 1 y wr1te · vectors [_, = ( r· .'r. r·' y , r·' . ) and 1. . = ( '' ' ' 1· ·y , '' '' ·) . Exp 1·1nt ±l tertns of Ur.y,, and v: y . c o ·
•
•
"
•.
(b) Construct a spherical tensor of rank :? out of two different (2J vectors F and 1' Wri't e d own exp 1·IC-I'tly ·.,,±:2 , ±LO In t ertns of I_ ':�:,y , z and �� y . • . •
I
•.
•
r
86 (a) Since (} and r are vector operators they will sat isfv t he following com Tnut atioH relatioHs
Front llte com ponent s of a Vf'ctor opPraJor Wf' can consln1cl a srJlJPri<·al tensor of rank 1 in 1 he following wa.y . The defining properties of a spherical tensor of rank I are the following
( :3.G4)
Jt I S
( :3.GG)
-ihU, + i(ih)U,. = - h ( U.,. + iU,) =;1 u+, = - 12 ( 1/, + iU,, ) -./211U_,
=
[L , U. ]
=
[J, - iJ,, Uc ]
-inC, - i(ih)F,. = h (U,. - iU,) =;1 u_, = w. - ;u" l
12
( :3.G6)
( 3.G7)
So from the vect or operators (} and f· we can cmtSt ruct spherica.l 1 ensors witlt COifl ()OJtf'nt s Uo u+ ,
u_ ,
....... ....!. -· !1 ) -· T + iU .Jj... ( U ·r· l ' ( r· 'l /2
u,
',i' -
IH
� +r %
( :3.G8)
1/_,
It is known ( S-:l. l 0.27) 1 hat i f x-,;;•, ) a11d 7-.);J are irreducible spherical ! ensors of ra11k k1 a11d /;:2 respect i vely t hen we ca.n construct a spherical t en sor of ra11k k
( :).G9)
:3.
THEORY OF ANUULA.R TviOMENTUTvi
87
In thi.s c
(3.6 0 )
( I I ; 00 I I I ; 1 0) U0 � o + ( I I ; - 1 + 1 1 1 1 ; li J) U-1 � + 1
+ ( I I ; + I - 1 1 1 I ; 1 0) U+ 1 �·:_ 1 1 1 -. u_ , v:,_. , + - U+ , v_ , 12
y'}
"'H Ux - i U") (�� + d•v; ) - "' �" (Ux + i U, ) ( l �- - i f ;, ) v2 · v2 I [r · ' · ·r· ' · ·r· , · + r .iu· ,v'-u· - r/T· ,r':r:· + l·r-/1; ,r'-11· - 1·r·.r11 ,r':r:· )_1; 1''·1; + 'I )_r r'y - 'l /y r'1: · · · 1
1
·
., (r · ' · �'> . /x v'll . -
� 2v 2 v :!
r · "· ) /y v x ,
·
·
·
( 1 1 1· - 1 0 1 1 1 1· 1 1 11 (1-· -1 I··0 + ( ll ·1 O - l l ll '·. l l )' (-·i0 \•'� 1 1 1 I'> I/ -1 h + I'> I/o �:_1 v2 v2 r _ .!. ( [ T - _ :l7 I y ). . ::! [-·T.? ( ')· X X _ :l -·cy )l7:? + .!. '::! •
(b) In the same manner we will h
{ '' )
T,(
( I I ; 00 I I I ; 20) U0 � 0 + ( I I ; - I + i l l I ; 20) [f_ , l'+ 1
+ ( 1 1 ; + 1 - 1 1 1 1 ; 20)1+1 li- 1
·
·
·
r· ' · ]
- --11i r'-11 · ·
(· 3 . o-· t ·)
(3 . 62)
88
( 1 1 ; o - 1 1 1 1 ; 2 - 1 )Uo 1:.. 1 + ( 1 1 ; - 1 0 1 1 1 ; 2 + 1 )U-I 10 1 , 1 , 1. "' /o v - 1 + "' /_1 v·o v 2· v 2· -
T .
-
� (Uz\� + U" \� - il/, 1 ;, - i U" V, )
2 - l ( [i - l"['· y ) (". ',· i\ 1 1 ·) - 1 - 1 l 1 1 ,· 2 - "7- ) t-'i-1 1.'1 -1 (:l.:"_ - ) [T-· -1 1/ ! y) 1 -1 l _r _ l'1/ 2 ( :l.6 7) � ( U,i;. - U,1 i;1 - iUr \ �1 - i U,y,). 'X
3. 7 ( a) Evaluate
for any .i (integer or half-integer) ; then check your answer for j = � · (b) Prove, for any j, '\' L...t :7
· ,i_J + rn t2 + 2I (_,, cos2 /-'•J - l ) . rn 2 1 (/(i) -;1t_m ( (rJ ) 1 2 = 2,.J() + 1 ) snl I
. .
·
J
0 .
[Hint: This can be proved in n1any ways. You Inay, for instance, examine the rotational properties of .!'; using the spherical ( irre ducible) tensor language.]
:3.
THEORY OF ANUULA.R TviOMENTUTvi
89
(a) \Ve have .I
L nt=-J
l d:�:n, (;J ) I 'm
.I
L m i (J m lt-il,S / " IJm ' ) l ' '\' L..t ·m (.J. Ut I t � J
iJY''l''l• l.J·nt')
· I t-- ,'J?I-:J1'n l}·trl )) (t\.rm'
'
.I
L m(.jm lt -i.J, !ijhl.im') (.jn/lei.!', l/"lim )
m. = -.t .I
L (J rn' lf·i.l:;i-1/ hrn Lim.) (j rn lf - d !)/ h Li rn')
f,• !Jfll' I f>i.lI
·
y
') Jt;. t
y• 1
• z
"' 1 · . m�j .J'Iri ) (.Jfll I
[
* (jm'l ri.J,!l h./' -i.!,:J h l.i m') I
:t
e
I
]
t
· ') · Y'. ,•1-n· l.JITI
-iJ
� (.jm'l f�' (0; ty ) J, D( J; t y ) IJm') . ;
Bnt the rnornentmn .! io a vector operator oo rrorn ( S-3.10.:3 ) we will have that
D" (3; cy)J,D(3; cy) = I; R,1 ((J; Ey)J1 .
(
J
On the other h a 1 1d we know ( S-:3 . 1 .::Jh) tltat
TI(;J; t y ) = So j
L IJ�::n, (!JJI' m
*
cos 3
0 sin !1 0 I - si l t /3 0 cos j3 0
[- sin ;l (.j m' I.J,.I.i m')
)
.
m' cos H.
·
2
(UO)
+ cos J(.jm'l./ , l.i m')]
+ J_ y. ' + mn I I . ' ·h . ;'] 1J y;I - SJTI ml I m ) ' cos 1'J
[
( 3 . G9)
]
(:3.71 )
90
)
For j = 1 /2 we know from (S-:3.2.44) that
(
. )) d(1 /' 1 (_i-' -
•
nun'
So for m' = 1 /2
cos ts - S111 t .3 cos 13 sm 1
1/'2 ' l d(J) (}1)1 2 rn ·mt/L L.....t Jn=-1/'2
while for m' = - 1 /2
Fi
I I f ,m 1 / 2 (,i/3)1 2.: m=-1 /L
1 /2
2
'1
•
•
.
- � cos j] = n/ cos ,d
-
n
�'' )
. . ( •)'
;) · 2--:-1 c'o:-;, , /3 + -1 :-;n1 2 2 :2 2 1 · I 7 -2 co:-; (· j = t co:-; . u .
m
.
(h ) We have
('l
.
( :). ' -/'1· ·)
,
' rn '-' l lu1 l_i-') ) I ' - 'rn (m L rn=-J J ' ' . ' r · ' 1 - '·/11·'''r'" l .J'T , n; I L..J rn l u nt J
-
t:
-m=-J
,
I ' I -i 1 3/'. IJ fH ) \J I ' rn -) \J I i 1 •i 't, I . 'fn. e · ·rn. f: · !F- t J ffl ' ) L.....t -m=-J J
.
;
rn=-J
h
� (jm'l c'JuPi>·.;;,
·
.
[t
m =- J.
I
.
� rJ· rn ' l t: -i.lyt3/ n .]'2' t:. i.J,JJjt-, IJ: rn. 'I\ I
• 1 1 J rn • l 12"'t rJ/). l 1 (:11• y;1 (J rn ' l "'fJ.lJ1. (:11
h)
·
.
u
•
z L'
]
l .im ) (j m l e i J,_J / t. lj m ')
·
l·
"
(. .,,) . _/ ;),)
:3.
91
THEORY OF ANUULA.R TviOMENTUTvi
From (:l .GG) we know th
where
rJ'� is the 0-cornponent of a second rank tensor. J'' =
2 j6 :) r,(0 ) +
( :3. ?G)
So
� .!' ;J '
" .1 Ln<= -J
'vVe
=
1 1 ;· . I I ' ··· (J fll . T lJ fll
tt :3
� 3-
1 ( ··· . I I ' "· , . J ' J) l ( " ; + yfi,-,. J fll D I.· /> ·, t...' ) • ·. i ' .· '·•;. J lJ:.ill· '�. 0'' . 1 1> ) :) ;,
n
·
•.
� ··
,
•
.
know that for " spherical tensor (S-:3. l0.22b) 'D( R ) T:) ' l vt ( H) =
( •0) -g ( · )
·'
L v;�; ( TI)T�;,'I
q l=-k
•
.
,
2
( jm' I 'D(B; cy ) J; 'D1(!3; i'v ) ljm') = (jm'l L r;.'3 'D,�76(!3; i'y ) ljml)
I: 2
·1' = - 2
q'=-'2
. o • • , . ' I ·TI'l 1)(21 , - J· nl '\I . q , 0· (. ,.i J : t. Y J , ! rn q l
.. . -
.
·
,._
( :3 .80)
·
I J.n/\ Bnt we know from the 'vVigner-Eckart theorem that ! 1··m ' I T1,'.J. '1 -r- 0 . !
\•
(.i) '2 L ,01 ldmm• (} ) l J
L
.-
. ·l ' 1 1 ·p; . · - c'l · .. l'· 'J r J· + 1 l + , v v,,,, r ,i; ; • ) i··J ill' l 1"c'J . . 0 I)· Til I ;, :3 :ln . < 1 (2) . I . ' I 2 1 ' I 1 . . 1 ) + -d00 ( dhym + -:3·J (J ./..• - ,-a J ·Jill i 2 ·· · 1 . . 1 (2) . ,, 1 1 ) + 2 d00 (d) m J(J + JJ(] + 1 ) J
rn=-J .,----
·
· ·
·
.
.
[
·
·
t ..
.
n
.
·
-
..
l
•
n
=
D So .
92
)[
1 . . . . -I '7 ( 7 + I ) + - ( :kos- (, - I m - - -I 7· (. .7· + I ) 2 3 . 3 I . . . I .. . . I . -- 7 ( 7 + I ) co s - (1 + 6 '7 I..7 + I ) + - 7 (. .7 + I ) + ' . ,
.
,
o
�''
:1
o
.. ..
l
hei + 1 ) sin' ,3 + m 12 &(:1 cos' ,3 - 1 ) 2'
where we have used
d�,ilUIJ
r, (cos .d )
=
=
!,(:3 cos' (J
-
m1
2
1
-
.
( :3 cos- .;, - I ) ,
,
(:1 .8 1 )
tJ
.
3.8 (a) Write :ey, :rz, and (:r2 - y2 ) as components of a spherical (irreducible) tensor of rank 2.
(b) The expectation value
Q == c:(o J m = J l ( 3:2
-
r') lnJ m
=
j)
is known as the quadrupole moment. Evaluate
f (n,}, . .
m
' I ( "' - - !r' l O',),. nl· I .,
}. )
=
•
(where n11 = .i. .i - l , j - 2 . . . . )in terms of Q and appropriate Clebsch Gordan coefficients. (a) Csing the relations (3.63-3.67) we can find that in the case where (r
�-:. = :r the compouenls of a spherical tensor of rauk 2 will be 1 ( ;c�' - :�r' ) + t•:ry - ( :r.:: + i:y) ,- . ·-) - r·) - y-'' ) y';; -,.. , (:Cc
I'
2
So from the above we have
=
'' ) y';;l' r;- -3 .c -·) - r-.
:ry =
( b ) We have
Q
1:(o .j . m
=
l2i T+�
J l ( 3 c 2 - r2 ) lo . .i.
rl'l2 - - -
2i
m =
j)
T:_;) I ') r_· -,
:r::::- =
& (:r2 - y2 ) - i:ry
:r.:: -
I' r·- 1)
cy
I ') - +1 -T
2
=
(:1 .82)
:3.
THEORY OF ANUULA.R TviOMENTUTvi J(; · . t
(n,.J , m = .J l r,I0 2 J I o. , .J,. m.
. .7
.·
·
})
lo .II T�'l llo .' =
. 1 (rL-E) = .J1 = •
l
93
1 .,!
01
!
,.J �; .J .J�; .J.J ) .
.,
.
.
I1nJ . ' II T
Q (j 2·/2. 1+ VGt ; jOij 2 ; jj) .
i".:""l -
I . ' I TI' ' I . . . - . . e 1n, .7 , m +2 o , ), m - .7 ) .
:2.J J +I
>t
t\ 3.8 1 )
So
t I\fJ:: :.J.: fHI I (J�2 --· y' J I O: � .J. � lrl. = .J' )
1ll 2 l OJ) .. J(;
l + t I1 o. , j,. .m' I T('l _2 o , J, m .
- J' )
94
Symmetry in Quantum Mechanics
4
4 . 1 ( a) Assuming that the Hamiltonian is invariant under time reversal, prove that the wave function for a spinless nondegenerate system at any given instant of time can always be chosen to be real.
(b) The wave function for a plane-wave state at t = 0 is given by a complex function cc'P·.r/1'. Why does this not violate time-reversal invariance?
(a) Suppose that
I n)
i 1 1 a llotHlegell<erat<e e1wrgy eige11stai.t'. Then
( 4. 1 ) So i r we choose at a11y i11sta11t or time S = real.
(h)
r,
_ uj;·' the wave ru11ctio" will he
th" case or a rre<e particl" the Schriidi11g<er equation is
(iJ.2)
The wave fu11 d i ons (?n (:r) = (:_- -'iP·:i"jf;_ and <;)�,, (:r) = (:_- iP·.i:jn. correspond l.o !.he ' �aHH:' t'igt'nvalut' E = J)�r and ::;o tl1tTe i� degt'nerac.y siitce tl 1ese cotTt'SfHH i d t o d i ff<ere11l slate kets l11) and 1 - JJ). S o W <e ca11110l a pply the previous r<esull .
4.2 Let o(f!) be the momentum-space wave function for state lo), that is, o(f!) (J"I o ) .Is the momentum-space wave function for the =
4.
SYMMETRY IN Q UANTUM MECHANICS
95
time-reversed state El lo) g1ven by cp()Y), q( -j!), q"(r1), or q"( -J1)? Justify your answer.
In the rnornenturn spa.ce we h.a.ve
j ,Pp'(J11n) IJ1) I n) = j ,Pp'
�
�
( 4. :l)
For the Tnonwntnnl it is natural to require
(ol m o) = - ( r:i li/lci ) � ( r1 l0i>EJ - ' Ir1) � Elr>e- ' = -il
(4.4)
So (4.5) up to ct phctse factor. So finally
E> lo) = �
j d'p' (J11n)' l - lf) = ./ cflp'(-z11n)" lz1)
(z11E>In) =
(4.6)
4.3 Read section 4.3 in Sakurai to refresh your knowledge of the quanttnn Inechanics of periodic potentials. You know that the en ergybands in solids are described by the so called Bloch functions 11\,.k fullfilling,
where a is the lattice constant, n labels the band, and the lattice momentum A· is restricted to the Brillouin zone [ - ;r / a , ;r f a ] . Prove that any Bloch function can be written as, H;
96 where the sum is over all lattice vectors R, . (In this simble one di mensional problem R, = ia , but the construction generalizes easily to three dimensions. ) . The functions o, are called \Vannier functions, and are impor tant in the tight-binding description of solids. Show that the Wan nier functions are corresponding to different sites and/ or different bands are orthogonal, i . t . prove
Hint: Expand the <;J,s in Bloch functions and use their orthonor mality properties.
The defming property of " TiloclJ fu]](:lion <\. k (.r) is (4.7)
.'k" ' (j)n ' ( r· L · R.l
f_
• -
�
R)· ) C ikR.,
-
(4.8)
w h i ch means that it i s a Bloch fun ction
=
� q), (:r - Ri )tikR , . R,
(4.9)
'f.he last relation gives the Bloch functions in terms of \Vannier fun ctions. 'fo find the expansion of a \Vannier function in terms of Bloch functions we mu ltiply this relation by c -ikR, and integrate over k:. Vn . k (.r:)
=
� ¢>,(.r: - R: )e ikll, R;
4.
SYMMETRY IN Q UANTUM MECHANICS
But
97
.
'> _. slTJ r L7i1I a
where i n the la't >tep we used that R ; - R; =
11 a ,
fl; -
with
11
( R-i
Rj )]
R;
-
E Z.
_
( 4. 1 1 ) So
(4.12) So '"ing the orthonormality propertie> of the Bloch fu]J(:tion'
j d:cq>�,(:r: - R;)qn(:c - R;) =
., il·ll . , -i!' R (•)-)2 e· ·· '' V'lE,k (-. ;r ) t · "-1 0). n.!..-, , ( :r- ',)(fl.f\:(fl. \''f/:r a-
Iff --· jj -._-)-.1 e a-
_
'
'
'
,-'1
)
•
�
=
( 27r -
'
I f ( 2.- ) 2 e '
•) cr
JL
• k ll;-dc'n ,
·
j
·
·,·
. (.. .T ) C·n .f>..(II-' , , .,_ , k' (_. .c· ) (I·.. C (I'·
1;, ,,.'"'"z
i k ll.. -ik 'R r . J o·mnl) ( 1\' ·
.
·
' 1'-h:(.11\'' - 1\'' )(
( 4 . 1:3)
4.4 Suppose a spinless particle is bound to a fixed center by a potential Y (:C) so assymetrical that no energy level is degenerate. Using the time-reversal invariance prove
( l) = o
for any energy eigenstate. ( This is known as quenching of orbital angular n1ome1ntmn.) If the wave function of such a nondegenerate eigenstate is expanded as
I: I: Fi,,( c)1/n(O, l
no
)
m ,
98
what kind of phase restrictions do we obtain on Ftm ( r)? Since the H am i l toni a n i s invariant nnder time reversal rre = e rr.
(4. 1 4 )
So i f I n ) i s an energy eigenstate with eigenvah1e Fn we will h ave
H0ln) = 0 H in) = Fn G i n) .
If
(4. 1 !j)
there is no degeneracy In ) and G i n) can differ at most by a phase factor. Hence
I ii ) = G i n) = , " I n ) .
(4. 1 6)
For the angn lar-momentnm operator we have from (S- 4 .4.5:3)
'vVe
---; ---; ' ---; (4 6) (n i L iu) = -- ( lt i L i it) �1 --· ( n i L in) (n i L i n ) = 0
have
Gin)
0 .
=?
( 4. j 7)
1 d3:c i:r) (.iln ) 1 d'.T ( Zin )'0 I Z) =
1 13 T (:r:�I n ,; . 1 x�) (4=. 16) t.
.
f�
i' l n )
=?
(4. j 8)
ts " '"'"'(IJ " l (· e . (/}. ) = C L..t -Flm(r) It , (p) Im (J' )}'m· L p·
ml
ml
=}
ml
/ };;n'• Ln1l Ft7n (
'
ml
r
)(
-
·
·
·
ml
1 )" y;-m ( 0. ¢> )d\1 = '
m_l
<15
J Y7'" Ln1l Ft,, ( r )Y;"' (
·
0, 0 )d\1
4.
99
SYMMETRY IN Q UANTUM MECHANICS
4.5 The Hamiltonian for a spin
1
system is given by c:;') -
II = AS ; + B (S; - " � ) . ,'1
,'
-
)
Solve this problem e<Eactly to find the normalized energy eigen states and eigenvalues. ( A spin-dependent Hamiltonian of this kind actually appears in crystal physics.) Is this Hamiltonian invariant under tin1e reversal? How do the nonnalized eigenstates you ob tained transfonn under ti1ne reversal?
For a spin have
1
system
I
=
1 and
m
-
- 1 . 0. + 1 .
For the operator S, we
So
For the operator S, we have 8+ +
.
S,, I 1 , m ) ( L n i S,.I i , m )
2 ("-" ·' · '18)
So s.T
11.
•)
r,"
-
4
8_ 1 L m ; I
=
I I 2S+ I L m ) ·-+ I L m ) ·+ 2S_ '
1 ( L n i S+ I i . m ) + � ( L n iS- I I , m ) , 1 1 11 y ( l
(
(
, .. m - m ) ( 2 + m )on +l +
0 V2 0 0 0 0
0
2 0
2
v:
) =?
'r , 2 11 \j
-
_ _ _ _ -
,
1 + m ) ( 2 - m ) on,n• - 1 · .
0 0
V2
(4.21 )
100
(
In t.lw sanw rnanner for the operator S!J S'.r '!' .T
k
h
')_,
-
(
r,'
-4
=
y'2 I)
0
-
y'2
I)
0 -2 0 -4 0 "
� _ ..::,' � we ftnd
)
0
y'2
- 12
•)
=
�2 ) ( J� ) ( ) I)
=?
=
r,'
0 -� l 0 0 I
(4.22 )
.
2
Tfms the Ham iltonian can he T'<'fll'""'Tiled by the matrix rr
�
A 0 B
n2
(
o
o
o A
D
H
.
('1 .23)
To find t.h<' en<'rgy eigenva111es we l1ave to :-;.olve t.he sec111a.r eqna.tion det ( II - .\!) =?
0
=
=?
del
.4h2 - .\
0
0
-.\
0
o
.4tt' - .\
(AN - .\ ) 2 ( - .\ ) + ( Bt,' )2.\
=? .\( .4tt 2 - .\ - Bn')(AI,' - .\ =?
.\1
=
.\,
0,
=
n.' ( A
Bn' =
+ H),
Bh2
)
=
0
o =? .\ [ (At,' - .\ ) 2 - (Btt'J' ]
+ Btt2 ) = D .\3
=
n2 ( A -
H).
=
o
('1 .2•1 )
To find the eigenstate l n \J tlmt corresponds to the eigenvalue .\, we lJave to solve the following eq uat i on ( 4.25)
n'
( ·� �� �.) ) ( � ) { =?
B
0
{
-c
A
c
-c!i
�2 + c.4 ·� a
=
=
=
0
=?
{
=?
a A + cB aB + cA.
a=0 c=0
() () (4.26)
4.
SYMMETRY IN Q UANTUM MECHANICS
So
lno ) lno )
= =
(�) (�) n�n
1 1 0) .
In the same way for .\ = h'( A + R )
({ )( ) A 0 R 0 0 0 R 0 A
� __,.
{]. = b
a
b c
=
(A + R)
c o =
So
l n A+ s) -
l n A +R J
=
For ,\ = n."( A - H) we have
( ·� � � ) ( ) R 0 A
� c
(�)
{ () a
b c
" '�'
=?
�
101
=}
( 4.2 7)
a A + cR
0 a R + cA
=
=
=
( 1 .28 )
(�)
=?
1 1 ---= I L + 1,\ + /') � I L. -1,\ . . /•) v �
= (A - R )
v �
{ () �:
c
=?
a( A + R ) b( A + R ) c(A + R )
aA + c
(4. 29 )
�
:
a(A -
H)
b( A - R ) a R + cA = c( A - R )
(4.:JO )
So
(4.:H )
102
Now we are going to check if the lla1nilton ia.n is in variant under ti1ne reversal 0 Hfr 1
40 '-""z 0 -1 + B r, 0 '-"".,; 0 -1 - 0 '-"y" 0 -1 ) 1 " 0 ' 0 ,, 0-1 oSZ "' ,, 0-1 - - 0 ,, 0-1 + H \' 0 - '- .r - .\,; ii - ·'='z As; + B ( s; - s,;J = H.
-
-, '"-1 0 '-" 'O "- "c'J Syo y
To find fhe traTJsfonna.t.ion of the eigenst.a.tes nnder tirne rever:-;.a.l relation (S, 1 .'1 .58) 0 11 , m )
So 0 lno )
=
(4.:3 2)
\\'(-' US('
fh('
( - l )m ll, - m ) .
0 1 1 0) (4,::_'3) l i D)
(4.:34) (4.:35)
I no)
El ln .IH > )
l
1 M0IL +1) + v2 1 (4.33) - - I L· - 1 I1 0 v�
- l nA +R )
1 I'>El l 1 , - 1 ) v2 1 + M ) I L· l ) v ....
1 1 /? 0 1 1 , + 1 ) - M 0 I L - 1 ) L \ V2 c 4�l J 1 1 I. -I - 1 1, + - I 1 . + 1 )
12
In A - H ) ·
12
( 4 . :lti) ('1 . 3 7 )
('1 . 3 8 )
a.
5
APPROXIMATION TviETHODS
Approximation Methods
5 . 1 Consider an isotropic harmonic oscillator in two dimensions. The Hamiltonian is given by 2 2, . - ( :r + ''I ) TT0 = .-+-+-
P.r
2
"2m
l}'2 y
2m
m_�..r.1:1 .
�
"2
·
(a) 'Vhat are the energies of the three lowest-lying states? Is there any degeneracy? (b) We now apply a perturbation where 5 is a dimensionless real number much smaller than unity. Find the zeroth-order energy eigenket and the corresponding en ergy to first order [that is the unperturbed energy obtained in (a) plus the first-order energy shift] for each of the three lowest-lying states. (c) Solve the !!0 + \/ problem exactly. Compare with the perturba tion results obtained in (b). 2[You may use (n' l:rlrz) = j' /n,� -· ( y rz + lc'in',n+l + fon,•,n-d·l h"' Deli11e st.ep operators: O. r :
t ar :
/'iT/W · 'I f!� ··· ) . - ( .r. + V 2h 1nw ' · fm_w 1:Jl:r , d-.- ( .r - - I . 2fl llmw
( ,v + ,,Y 2f7. Y
ipy . -
fW..r.J /
,
�Vf m•)./w ( .I·t _ mw ) . � z
1TlU) lJ111 '
Fnnn the fuTJdanH-'nta.l coTnTnnl-ation r(-'lations
1.
/ ,
\V e
GUJ
(5.1) �(-'(-'
tl1at
104 Defilling the rmrnher operators
we find
, flo \ ' x + N, = -1 nw nw( N + 1 ) .
N lfu
=}
-
.
( 5.:! )
I.e. ellergy eigellkets are also eigenkets of N: rn l rn � n ) � n. l u;, n ) ::::}
I"v�r l rn l n ) ..'lu I rn , n )
N l m, n ) -
so that
( 5.:3 )
(m + n ) l m , n )
Ho I m , n ) = Em,n I m , n ) = n.w (m + n + 1 ) I m , n ) .
( a. ) Thf' lowf'st. l:ylr1g states are state f�:o,o = tu.JJ H 1 ,0 = F;u_1 = '2 nw H,,0 = F;u_2 = t:'u = :3 n.w• (h) Apply the perturbation
deO'f'TI(-'l'acv 0
1/
1
•
2 3
= Smw ' :J:y.
( Hu + F ) I I ) = FJ I I ) Hu l lu ;, = fJ" I I" )
Ful l problem [npertnrbed problem: :
Expalld tlw ellergy levels alld tlw eigellkets as
E - E" + 2.1 + 2.' + . . . I I ) - I I" ) + I I' ) + . . .
so thai I he full problem bt•com<'!' ( r, pO -
Jfu )
[ I I" .) + .
I I' ) + ·
·
·
·
l
= ( ' - "-'' 1 F
'2
'-'
( 5.4) ) iL I ll) 'l + I I' .) j .
·
·
-
·
- -
·
·
·
l
·
a.
10.5
APPROXIMATION TviETHODS
To l 'sl order:
( F:" - H0 ) 1 1 1 )
Mnltiply with ( /0 I to find
=
( 1/ - � 1 ) I 1° ) .
1
I)
= ( !" I v - � I I" ) =? � 1 = ( !" I v I I" )
( 1° I F" - Ho 1 11 ) �1 ( 1" 1 1" )
( 5 .G)
In the degenerate ca.se this does not \Vork since ,:ve\e not usiTJg the right basis kds. \Vind back to (5.5) and multiply it with another degenerate basis ket
( rn() I
- Ho I ll- IJ �1 ( m0 l l0 )
O = ( m" I V - �1 1 1" ) =? o I 1 rn " l 1,'. 1 1 ',
DO
r.�
1 •
( 5.7)
Now, ( m" l l" ) i s nol m·cessarily 5�ct since only stales corresponding lo differ ent eigen values have to be orthogonal ! Insert a 1 : Ill ...... ,, 1 \I t 0 I I" "' \I I r'll 'v ' ·ll l \I /•() l· I , l IJ t I , ;. L kE/l
t.
I
I
.:....l..
n
I
This is the eigenvalue equation which gives the correct zeroth order eigen vectors! Let us use all thi"
1. The ground state is non-degenerate
::::?
� !10 = ( 0, 0 1 1 · 1 0, 0 ) = OmcJ1( 0, 0 I :ry I 0, 0 )
�
( 0, 0 I ( ax+a� ) ( ay+a�) I 0, 0 ) = 0
2. First excited state i s degenerate I L 0 ) , I 0, 1 ) . \Ve need the matrix elements ( L 0 I V I L 0 ) , ( L 0 I V I 0, 1 ) , ( 0, 1 1 V I I, 0 ), ( 0, 1 1 V I 0, 1 ) .
Fl.Tl (/"
d
I m , n ) = vm I m - I , n )
a� l m , n ) = v m + l l m + l , n )
ri c .
106 Togetl1er this gives
hn ,w
(l )
clttw
The 1/-matrix becomes
( 5.8)
o 1 0
2
'
and so the eigenvalues ( = L'l.1 ) are L'l. 1 =
±
iihw . 2
-
To get the eigen vectors we solve
and get
�( I O. l ) + l l.O)), v-
� ( 1 0. 1 ) - 1 1 , 0 ) ) .
E+ = ftw(2
I
,
+ 2). � ), I
/<)_ = n.w( 2 -
ii
).
,
( '\.9)
:j. The second excited state i s also degenerate :2, 0 ) 1 1 . 1 0, 2 ) so we need the corresponding 9 matrix elements. H owever the only non vaTJlshlng oTJes are: .
.
.
.
],·1 1 ,2o = \ 20, 1 1 = \1 1 ,02 = \'o2. 1 1 =
y'2
iin.w
(5.10)
( where the y'2 e
0 = dd
(
-A
l 0
)l
1 0 -A l -A
= - A ( A1 - 1 ) + A = A (2 - A1 )
a.
107
APPROXIMATION TviETHODS
which meau' that the eigeuvalues are {0 , ±Mlce!}. By the same method as above we gel the eigeuvectors
I . . . )+
�( 1 2 , 0 ) + V2 1 1 , 1 ) + 1 0 , 2 ) ) , 1 ( - l 2, 0 ) + 1 0, 2 ) ) , V2 �( I 2, 0 ) - V2 1 LI ) + 1 0 , 2 ) ) ,
I · · · )o
I . . . 1-
Eo = ;:;,"' ,
L
= h'-" ( 3 - S).
(c) To solve the problem exactly we w i l l make a varia ble change. The poten tial i s
rnuJ2 ['2 ( .r2 + u. 2 ) + <\.ryl = + id + y) 2 ) + muJ2 -
[�((:c
(:c -
�(:c + id - ( 1: - y ) ] . 2
)
(5. 1 1 )
Now i t i s natural to iutroduce
( " .I ')) ._).
Note : [1J, p;,] conjugate.
=
=
[y', p�]
._
i(�, 'o that (.r', p;,) aud (y', p�) are cauouically
lu lhe'e uew variables the problem lakes the form H
=
::,-(p, +
1
_n;
r'2
Py
r'2
)+
2 rn;,.v -, .....
.
[ ( 1 +
�
So we get one osc i l l ator with u.'�. = w yll + S an d another \vith w;1 = w .;f=S. The energy levels arc:
f;o.o E 10
flu•. ,
r1u1 + r1w;, = t'"' ( 1 + v 1 + S) = ., , ') fluJ\ ., ' I + 1 + 'i1 "r + . . . •) = fluJ\ �+
'1 ",.)
+ CJ ( (!c2 ) '
108
hoJ + ttoJ;, = , . . = hoJ('2 - �cl) + O(c\2 ) , = hw(3 + 8) + 0(8' ), P2 0 - hw + 2hw� = hoJ + hoJ� + hw'y = . . . = :lhw + 0(8'), E1 ,1 -2 = hw(3 - o ) + O (
r
.
.
.
.
.
.
r
So iirst order perturbation theory worked !
(
)
5 . 2 A system that has three unperturbed states can be represented by the perturbed Hamiltonian matrix E1 0 a·
0 P1 b-
a b r;,
where F2 > F1 • The quantities a and b are to be regarded as per turbations that are of the smne order and are sn1all eon1pared with F'2 - i-':1 • Use the second-order nondegenerate perturbation theory to calculate the perturbed eigenvalues. (Is this procedure correct?) Then diagonalize the matrix to find the exact eigenvalues. Finally, use the second-order degenerate perturbation theory. Compare the three results obtained.
( a) Fir'L fiud the exad result by diagoualizing the IIamihoui;m : 0
0 E1 - E 0 pl - p ((
-
a b
r;, - r;
fj•
( F'1 - F') [( 1·.'1 - F)( 1':, - 1':) - l bl' ] + a [0 ( P1 - P) 2 ( E, - P) - ( P1 - P ) ( lbl' + l a l ' ) .
a" ( 1'.'1 - /<:)]
So, /<; = /<.'1 or ( I';, - t:) ( /<.'2 - F) - ( lbi ' + l a l 2 ) = 0 i .e. B2 ( E1 + F:,) F: + F:, E, - ( l a l' + 1/JI') = 0 =? r;, + E,
2
±
V
'
E, + P, '
2
-
F F., ''
+
1 · 1 ' + l b l' a
-
=
(5.14)
a.
APPROXIMATION TviETHODS
109
(S.l'i)
Since lof + l h l � is small we can expand the sq11are root and write the three energy levels as :
E
/<,·
E (5. 1 6) (b) Non degenerate perturbation theory to :?'nd order. The b
The matrix elements of the pertm-bittion V ( I I \i I :3 ) Since ::'-.k1 1 shifts are
=
(2) ::'-, 1 ·
=
a,
(kiV Ik)
( 2 1 li 1 :3 ) =
=
b,
=
(
0 0 a 0 0 b a· b' 0
( 1 1 v 12)
=
)
(A I vIk)
=
o.
0 l'st order gives nothing. Dut the 2'nd order
Ia I " ""' l l ic1 l" = I ( :� 1 v 1 1 W £..... E" E1 - E2 E1 - £, 1 - E0k k#1 2 2) 2 IW l \ild = 1( 3 1 1 1 1 L E"2 - E0k E I - E2 kf:-2 IW l a l2 + IW laf + l'.''J. - 1'-' 1 1'-' 2 - 1'-' 1 t:1 - t:2 �
L'
C
C
C
(5. 1 7)
110 The nil perturbed problem h as two ( degenerate) stales 1 1 ) a!ld 1 2 ) with energy E1 and one ( non-degenerate) sla.t.e I :3 ) wit.l1 energy E2 . l_:Tsillg non degenerate pert.nrbation theory we expect. only the conection to F2 ( i .e. D.&2 )) t o give t.h;; correct rt'fmlt. : and iTHleed t.lds t.11rn:-;. o11t. t.o hC' t.he ca.:-;.e. �
(c) To find the cotTect energy shi fts for the two degenerate stales we have to 11se degC'nera.te pert.11rha.tion theory. Tl1(' V - rna.trix for t.he degetJ('l'
( :� :� )
, so
1 'st order pert. thy. will again gi ve !lothillg. \Ve h ave
to go to 2'nd order. The problern we want to sol ve i s ( 110 + V) I I ) nsing t.hC' expatJsioTJ " I I ) = I I" ) + I I ' ) + . . . E = E + D. l'i + D.l2 l + . . .
=
FII)
I.. ·c>. 1 1'\' )
where /1 1 1° ) F0 1 1° ) . '\ol<' that. the snperscripl index in a bra or ket de 0 notes which order it has in the pertnrbation expansion . Di fferent solntions to the fnll problem are denoted by di fferent /'s. Since the ( snb-) problem we are no\v so1ving is 2-dirnensiona.l we expect t.o filld two so1nt.ions corn'spolldillg to I = 1 , 2. Inserting the expallsiolls in (5.18) leaves ns with =
( E0 ····· Ho) [ I I" ) + I I' ) + . .] =
( V - D. cn _ D. I2J . . . J [ I 1" ) + 1 1' ) + .
A t first order in the pnturbation this says:
( F0 - lfu) 1 1' )
(V -
d.I1 I ) 1 1° )
. . .] .
(0.1 9 )
where of course D. it) = 0 as noted above. \Jultiply this frorn the left with a bra ( k0 I from outside the deg. subspace =
( k" I F0 - lf11 1 11 )
=?
I I1 ) .
- '\' - L
kfJ>
,
( k:0 I V 1 1° ) I I/' ) ( k0 I F 1 1° ) =
E. o - Ek
.
•
(0.20)
This expression for 1 11 ) we will use in the 2'nd orrin equation from ( 5.19) To get rid of the left h alld side, multiply with a degenerate bra ( m0 I ( Ho I m0 ) = £0 I m0 ))
( m0 l lc'0 - l/0 I P )
=
0
=
( m0 I V 1 11 ) - d.1 2 1 ( m0 1 1° ) .
APPROXIMATION TviETHODS
a.
Inserting the expression (5.20) for
111
1 11 )
we get
To rnake this look l i ke an eigenvalue eqna.t ion we have to iTJsert a 1 : ._. ._. L L n E J) kiD
( m0 1 V I k:0 )( k" I V I 11 0 ) \1 n. . o 1 1o , l E 0 - Ek
_ �
,, (') • \
�
nl o
1 1o )· .
Maybe it looks more fmni liar in matrix form
"' ;·'\fm n ·'I'-· n ·' ( " ·'1' - W -· u1 L nED
-
where
._. L., k#D
)
( m0 I V I k0 ) ( k0 I V l 11° ) . . E0 - E"k
o ( m l io )
are r•xpressed i n tllf• ba'i' de lined hy su hspace basis /) { I I ) , 1 2 ) } =
1 1° ) .
Fvahmte
iH m
the degenerate
E1 - E2
IW
'With this explicit expression for :'YL solve the eigenvalm• equation ( defi ne /\ � ( 2 ) ( R1 - HJ. and take ont a. con1n1on factor E1�E2 ) =
'Itt
0
=?
( lui'a"b-
)
ab" I W - ,\ ( la.12 - ,\J( I bi' - ,\J - la.121 bl' ,\' - ! lal' + IW J-\ ,\ = O, l a. l' + IW .
A
=
=
( 5.2 1 )
112 From before we knew the non-degener
F (t) = Fo,-t / T (a) Using time-dependent perturbation theory to first order, obtain the probability of finding the oscillator in its first excited state for l > 0. Show that the l -+ ex:· ( T finite) limit of your expression IS independent of time. Is this reasonable or surprising? (b) Can we find higher excited states? [You may use (n'l:rln) = Jn/2mw( y�S,, n + l
(
+ foS,,.,,_J).]
iW .1 = - I 11fJ:cc -tfn ( F1 = - ();c ' At t = 0 the system is in its gronnd stale I n, 0 ) = I 0 ) . \Ve w ant to caknlate 1 ·r t .i /
I n, i )
. i)
·-
"'
/
-
I:, c,(l)<-Ent/1• I n )
wlwre we get c,( t ) from its dilL eqn . (S. 5.5.15): th-,--). ({
K.m
c ( t) K.m ,
'
(nIV Im)
(5.22)
( n i - Fo:n-'1 ' 1 m ) = - Foc-'1 ' ( n l :r l m ) = - Fo<
' ' Vn;;:;--:-:: ( v'mo,_m-1 + v m + lo,.m+ i )·
_, , _
�-
� rn"-'-"
-
a.
APPROXIMATION TviETHODS
Put i t back into ( 5.22) · t (} ' 11/. ,_,l cn. { l ) o
p - 1 /T = - J'u''
( ;:;:-::Lj -d ( vrr;2. rnw -v n + 1 �e "n+l . l ) + v nc en ( !
Pntnrfmtion theory means expanding order this is {) (0) _:_c i)[ n (1) = 0
=?
·
)
'
=
f:: iwt
c};1 1 + c�.' I + . . . , and to 7,eroth
('(n0) ... - J- no
I 1'0 J·t ' I: 1/ (I') . . ·(_ 0) = - Po v· fl [,' t' ,-t/T ( � I +1
To first order we get :t:
_
1-Jl
.
th.
(.
,c
nrn _
m
lrnw . ,
C
u
'�""'1
'
( .m .
,-i�l' .(o) __ Y 'fl. i ! C (.n+l
(
"n-1 (, 1 )' ') .
,i�l' .(o) _ ( I 'J (, I 'J + \r;11 (_ en-] ,
)
\\/e g<'t. 011e TlOTJ - vardsldTJg t.enn for n = 1 , i .e. at flr:-;.t. order in pert11rhatloT1 theory with the ll.O. in the gronTJ
_
-.-,
. VI :... rnu...• tu) �
fi{J j 11.
;_. z
[
(I)
]
I
)
I
(1
-:;.
-
,,(iw-?JI
_ _ _, F,.. t ( l) l o , I ) = L.. cn ( l )c· l n ) = c1 ( I ) < ,'
and
)
-;R1 t
''
1 1 ).
n.
The probability of finding the ll.O. in 1 1 ) is
As t
--t :x;
_1, 1
(I
-----t
F(,
-
\
1!. n.
I
I
-, fl V 2rnw tw - :;:.
.
·
=
con...,l.
This is of cmn·se r<'a.soTJahle smc<' applyh1g a. static force 1neaT1s that the syst.ern asyrnptotica.l1y -finds a Tle\v <'<}111iihri11rrL
1 14 (b) A, remarked earlier t here are no other 11011-vanishiug c, ', at first order, "' 11 0 higher excited state' call be fou11 d . However, goiug t o h igher order m pertmhat iou theory such 'tales will be excited. 5.4 Consider a composite system made up of two spin � objects. for t < 0, the Hamiltonian does not depend on spin and can be taken to be zero by suitably adjusting the energy scale. For t > 0, the Hamiltonian is given by II =
t,'
.S, ·
(46.) -
82 .
Suppose the system is in I + -) for i S 0 . Find, as a function of time, the probability for being found in each of the following states I + +) , 1 + -) , 1 - +) , 1 - -) : -
(a) By solving the problem exactly. (b) By solving the problem assuming the validity of first-order time-dependent perturbation theory with JJ as a perturbation switched on at i = 0. Under what condition does (b) give the correct results? (a) Tht:-� basis we art:-� using is or conrse I S11z � 82z ) . Expand pot ential in thi, basis:
[
•-c 1 Lr·•)o2:.L' r,1'
+
c• o ,:_-lly•1 2y
+ C',.Jiz,.Jc•2 z
=
{ 'Ill t J l ·i S
t.tH:-'
int er act ion
1)(-H'ilS
·}
\ 1 + ) ( - 1 + 1 - ) ( + I J , ( I + ) ( - 1 + l - ) ( + l l2 +
2 + i (- I + )( - I + 1 - ) ( + I ), \- I + )( - I + 1 - )( + I h + +
t�' [
( I + )( + I - 1 - )( - I lt( I + )( + I - 1 - )( - I I + + )( - - I + I + - )( - + I +
+ I - + )( + - I + I - - )( + + I + + i ( I + + )( - - I - I + - )( - + I + - I - + )( + - I + I - - )( + + I l + 2
h]
a.
11.5
APPROXIMATION TviETHODS
+ I + + )( + + I I + )( + I + - I - + )( - + I + I - - )( - - I ···
(
]
···
=
1 1) = I + +) 12) = I + -) 1 :5 ) = 1 - + ) 1 4 ) = 1 - - ) )
In matrix form this is (nsing
� --�l �1 1 l
0
0 2
0
II = !:-.
0 0 0
l
( 5 .23)
.
This basis is nice to use, since even though the problem is 4-dimensional we get a 2-dinwnsional inatrix to diagonalize. Lucky us! ( Of course tl1is luck is due to the rotational in variance of the problem . ) Now diagonalize the 2 0
=
del
(
·····
1
x ..
...
2
( ,\
,\
····
2
=? -J:
=
)
2 matrix to lind the eigen values and eigenkets ·····
l
l
2
·) ·····
\ . A
=
(
.
l
..
...
,\ )1
..
.. .
4
=
,\ 2
+
2 .-\
) ( . ) ) ( l '
.'!
r
=
.'!
r
1 + 2y = J: =? J: = y = /2
-3 :
l . =? -:r + 2y = -:).r =? .r = - y = /2 So, the cmnplde spectrum is:
lJ
I
+ +
) . I - - ). ):,( I + - ) + I - + )
},( I + - ) - I - + )
with em:rgy 1:-. with energy
- :)t-,
..
.. .
il
116 This was a curnlwrsorne but straigh tforward way to calculate the speclrmn. A smarter wav would h ave been to use .'! = .'!1 + .if, to find
'·�... '' Vh
kl·l(';\\' tl·
l
·,
tt
Cil
,_] l
-
bl
a2
-
/l 2 2
I ( �I + 1 ) - 4 oh'
. SO
Also, we know that two spin � systems add up to singlet (spin 0), i .e.
S
=
1 (:; states) '*
S = 0 ( 1 st ate)
'*
.'!1 · .if, =
,Ci,
·
.if, =
mw
� (t/1 ( 1 + 1) - ;' ) = �r/
� ( 3�' ) = in2 -
-
,'
F(spin =O)
1 ) and om·
"
4� 1(/ = .:'-. n.- 4 4 .:'-. - :;; = n. ' 4
F(spin = 1 )
triplet ( spin
.
·
_
3-"'
·
·
(5.25)
{ I + + ) I - - ), },( I + - ) + I - + ) ) } are spin L and },( I + - ) - I - + )) is spin O!
From Clebsch-Gordau decomposition we know that
,
Let 's get back on track and find the dywnn ics. In the new has is H is diagonal and time-independent , so we can nse the simple form of t the time-evolution operator: U(Uo)
=
fl>p
{ 'r, -
ll ( i
-
}
lo) .
I + - ) . In the new basis { 1 1 ) I + + ) . 1 2 ) I - ) . I :q � ( I + - ) + I - + ) l. 1 4 ) �( I + - ) I + ) l }
'fhe initial state was =
=
=
-
-
-
=
117
;j. APPROXIMATION TviETHODS the initial slate is
U(t , 0 ) on that we get l n,t ) �'rp {-i Ht } ( l 3 ) + 1 4 ) ) = � [u:p { -it; t } 1 3 ) + up { '� t; / } 1 4 )] = 1 - i· t;_t } -( t:cp\( _ 1 + - '1 + 1 - + ))+ v0 J! t;t } 1 ( I + - ,: - I - + ) +u:p \r -v0 � [( , -'�� + ,''�' ) I + - ) + ( ,-'�' + c'"�' l I - + )]
Acting with
[
l
;,
l
h
--
.
> I,
I
·]
where
w The probabi lity to find the system in the state
(5.26) I !i ) is as nsnal 1(11 I t )12 o,
I + + I n. l ) I - - I o. l \ 0 I(+ - I t )1 2 = � (2 + ,4:w� + c''wt ) = � (I + coYlwt) I - 4(wt)2 . . . I( - + I t )1 2 = � (2 , -,:�1 ) = 1 ( 1 - cos4wt) 1l(wt)2 . . . (b) First order pertmbation t hcory (nse S . 5.6. 17): \
,
=
'
,
I
=
c::=
o,
o,
14d -
c::=
6ni: .j t: !l
11 fo
/i' C.i�,.;I ' TVm • · (' I I ·) ,
(.
Here we have (usi ng the original basis) Ilo
l i ) = I + - ),
·
= 0, V given b y (5.23)
(5. 27 1
118
I - + ),
If)
En - E; ---'-'-,---, ---.:-
2 i'. ,
0,
=
n
n
{ F,,
=
0}
0,
=
# f.
lr'"'T'ting this into ( 5.27) yields c;
c ! +- ) = l ,
(0)
(0)
(I) cf
(IJ c 1 _+ 11 = - t '
·
i ;,·t
n o
' = - '! tv.) ·· I �w
t·lf'! ..
....
I ·c>-�1> '!')
.•
a� the onl�y non-vcmishing coefficients np to first. oNler. The prohahilit.�y of findh1g t.h<' �y�t.ern in I - - ) o1· I + + ) is t.hn� ohvionsly 7,ero, wh(-'T"('t'IS fm· the othr•1· t wo st ates l
P( I + - ) ) P( I - + ) )
=
lc1I I I I, I ) + c1("I) ( I. ) + . . . 1 2
=
l2td.l 2 •
·
=
1 (d.)
2
to fir st order, in coJTespmHlence with the exact result . The approximation breaks down when wl « 1 is no l onge r valid, so for a given t:
wl « I
=?
i'. « .
tt l
- .
5.5
The ground state of a hydrogen atom (n to a time-dependent potential as follows: qr, l )
=
= l ,l =
0) is subjected
% cos( k:z - wl.).
Using time-dependent perturbation theory, obtain an expression for the transition rate at which the electron is emitted with mo mentum r!. Show, in particular, how you may compute the angular distribution of the ejected electron (in terms of 0 and t!> defined with respect to the :-axis) . Discuss b·riefiy the similarities and the differences between this problem and the (more realistic) photo electric effect. ( no t e : For the initial wave function use
a.
APPROXIMATION TviETHODS
119
If you have a normalization problem, the final wave function may be taken to be \fJ :U) =
c�, )
, ,,,,�;,,
with /, very large, but you should be able to show that the observ able effects are independent of [,. ) 1/
LI
j,;, cos(kz
To begin with the atom is in the n
=
=
0 state . 1\t I
- OJ! )
=
0 the perturbation
is tnJT)(•d on . \Ve want to find the transition rate at which the electron IS emitted with momentum zlj. The initial wave-function is
and the final wave-function is IT'
=
-; )
The pertn rhation is
•
�-T.·-l-) ' . '-
-
_
•Y J l ·l
31<
v f dho-A) I'() i t '
\i '
( .ip· xf!; . .
+
Vt:: i,vt + vte-i'.;t . l
t
-
d loo - �1) '
] "
(5.29)
Tirrlf•- dqwndent pertu r b at ion theory ( 5.5.6.44) gives us the transition rate
:2:-r I ·t I"
w ,_., = -y;
because the a.t cnn a.b sorh:->
and
(, ''! 1
I ei h · I n "
a
=
Vni
photon
1 , I = tJ.
tl'w'J.
The rna.trix
) = J r: x(. k13
f
dernent
IS
I c i)···- I .1' }I \I :r I n
= LI =
0 I1 =
(!i .:)())
120
So
(tib) ,,
;
is the :JD Fourier transform of the initial wave-function (and some
constant ) with q = kf .5 .39)
- h!,.
That can be extracted from ( Sakurai problem
The transition rate is understood to be i ntegrated over the density of states. 'vVe need to get that as a function of fJ; = hkf. As in ( 5.5.7.3 1 ) , the vol ume elemeut i s n 2 rlrulfl
t smg
\VE'
=
dn
n 1 d'!.!1 - dpf . ( Pf
1 2 r 2 P.r 2n ( '21T-fl ) 1
get
which le
=
L'kj
.,-- "-',-dfl.dpr , ( 2"J""
=
and this i s the sought density. Fiually. ·)- "2 "4-2 �" 'o v "
r,
1
4 P a3 [ ,\ + ( kr _ keY ]
- -
·
.•
4
1':lzlf
( 2 ;;-t,p
j() 'jPJ ·
r."
Note th
. 2 = lkt I 2 cos 2 I!-- + k2 - 2klk! l co.siJ + lkf I 2 .m1. II h} + k2 - 2klk! lco.siJ. •
(5. 31 )
In a companson bet ween this problem and the photoelectric effect as dis cussed iu (S. 5.7) \Ve note tha.t siuce there i s no polariza.t iou vector involved� w has no dependence ou the azimuthal augle >;>. On the other haud we did not m