This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
w
:(l,/c) indicates that, on inclusion of the contributions f"d;,g,,), of integral spin fields, the various parts of I would is actually a pseudovector with respect to time reflec- transform difierently, thus emphasizing again the tion. With thd plane surface c chosen perpendicular to general failure of Eq. (2.la) to admit time reflection as the time axis, the components of (P,) are obtained as a unitary transformation. 'three-dimensional To aid in investigating the extended class of transforvolume integrals, mations that is required to include time reflection we f shall introduce some notational developments. The (Po):G/c) d'o(rsn). scalar product of two vectors, Vo and V6, can be written | (3'2) (3.4) (al6) = V"'Va: Voilr.', t,
(Pt):0/c) aolr*), k:r,2,3, |
thereby being regarded as ttre invariant combination of € vector Vr with the dual, complex conjugate vector induces iI,.* We allow operators to act both on the left and on ,"n.ltio., tco---fi0, rl-rr and the ti-" (P)+(P6), Qrl--(Pr), according to the transforma- the right of vectots, V and iP*. Thus, an operator tion properties of tensors. This difiers in sign from a associated with A, the transposed operator .4"; is proper vector transformation. In particular, the energy defined bys does not reverse sign. under time reflection. More (3.5) A*:tAr, V*A:Arg', generally, this property of (P,) is obtained from the pseudovector character of drr, which expressesthe or by (3.6) pseudoscalar nature of a four-dimensional volume (alAlb):v"'Avt:9uAr9"'. element with respect to time reflection. Similarly, the We also define the associated complex conjugate expectation value of the charge operator.4", ff
\Q):0/c)
| dc,(j,):(1/c)
JJ
| d.o(j]
(,4v)..:1-v'.
(3.3)
behavesas a pseudoscalarunder time reflection. Hence, this transformation interchanges positive and negative charge, and both signs must occur symmetrically in a covariant theory. Indeed, for some purposes the requirement of charge slmmetry can be substituted for the more incisive demand of invariance under time reflection. The significant implication of these properties is that time reflection cannot be included within the general framework of unitary transformations. Thus, on referring to the Schrddinger equation for translations (2.107), or the analogousoperator equation (2,110), we encounter a contradiction between the transformation properties of the proper vector translation operator du and of the pseudovector Pu, This difficulty appears most fundamentally in our basic variational principle (2.14). With 8 behaving as a scalar and (d.r) as a pseudoscalar,reflection of the time axis introduces a minus sign on the right side of this equation. However, it is important to notice that the scalar nature of S cannot be maintained for that part of the lagrange function which describes half-integral spin fields. Indeed, such contributions to 3 behave like pseudoscalars
(3.7)
The connection with the hermitian conjugate operator ,4i is obtained flom the definitioir of the latter,
(,4v)':E'7t, Iio-*-0"*.n,al
(3.8)
invarianrof a spin | fretdisw:{/.l!{/.
R*, cat The transformation that represents time re8ection, {': be obtained from its equivalence with a rotation through the Accordingly, angle r in the (45) plane; R: exp?rtro*):ias.
.9,,9,=9tn-tyon{:-W,
which indiates tle pseudoscalar character of the spia | field laqranqe function, witi respect to time reflection. The correspondins behavior oI 6eldswith other spin valuescan be obtained fiom tb6 observationthat a spinor of-rank z contains6elds of baiic invariant and lime reflection spin lz, -{z-1,. .". mnk tr are operator lor a splnor ol"Th..
W={11ft,0ot9, and
f
1"
L .-,
J *..
R:erpl i"| 2" d{"(t) l: fJ idr6*) Therefore,
{/'*' : {t R-tiI r&' R! : (- r)"6V, which shows the pseudoxalar nature of the lagrange function for all haltinteral spin ffelds, eNote how the familiar property of transposition, (,'18)r *B"r{?, foflowsfrom dripdefinition:',4B'l!:A(*BT\:\ltBrAr.
354 JULIAN
namely, t1-
t.T
(3.9)
Conventional quantum mechanics conte;plates transformations only within the V vector space, and contragradient transformations within the dual V* space. We shall now consider transformations that interchangethe two spaces,as in (3.10)
!["+V;:i{r"-' The efiect of Eq. (3.10) is indicated by
( a l a ) :v " - v r : v ; v ; ' : ( 6 l a ) ,
926
SCHWINGER
R will now be chosen to oroduce that linear transformation of the f", RQ"R-L:R'808,
which compensatesthe efiect of the gradient vector transformation. Thus. we have
E:(+)er(e"r, A,O.r),
( 6 1A r I e ) .
(J.2J)
where the (+) sign here refers to the fact that the structure of the lagrange function, for half-integral (3.11) spin ficlds, can be maintainedonly at.the expenseof a change in sign. We now see that if
and
( a l A l b ): v j A * a : v ; , 4 i h ' :
(3'22)
(3.r2)
t': e($"r, apo'r'),
(3'24)
the form of our fundamental dynamical equation rvill have been preserved under time reflection, since Eq. ( 3 . 1 3 ) (3.20) will then differ from Eq. (2.14) only in the V;-:Rt[", substitution of {"? for d" as the appropriate field where R is a unitary operator, we have variable, and in the interchange of or and o2, which ( 3 . 1 4 ) simply reflects the reversed temporal sense in rvhich ( a l b ) : ( 6 1 d . ) , ( e l A l b ) : ( 6 1A l a ) , the dynamical development of the system is to be in which traced. (3.1s) Invariance under time reflection thus requires that A:1ntp-'1, inverting the order of all factors in the lagrangefunction Now, we have leave a scalar term unchanged,and teverse the sign of (RAR-')': 8,4, (s.to) a pseudoscalarterm. This can be satisfied, of course, tn : lAtnn-,1r: (RBR-t)r by an explicit symmetrization or antisymmetrization ol and therelore the various terms in !. When the lagrange function, (3.17) thus arranged, is employed in the principle of stationary @llA, nflb): - (6llA, Blld). action, the variations Dod"will likewise be disposedin a We have here precisely the sign change that is required symmetrical or antisymmetrical manner. We must now to preservethe structure of equationslike Eq. (2.110) recall that the equationsof motion (2.18), which do under time reflection. not depend explicitly on the nature of the field commuWe now examine whether it is possible to satisfy the tation properlies, have been obtained by postulating invariance under time reflection of by requirement the equality of terms in 60"8that difier basically only means of transformations of the type (3.13). When we in the location of 60d", Since such terms appear with introduce the coordinate transformation the same sign in scalar components of 3, and with ( 3 . 1 8 ) opposite signs in pseudoscalarcomponents, we deduce io:-*0, ip:16 h:1,2,3, a correspondingcommutativity, or anticommutativity, in conjunction with the eigenvector transformation between 6od" and the other operators in the individual (3.1e) terms of 6o!. v-(i', o):Rll(l', "), The information concerning commutation properties the fundamental dynamical equation (2.14) becomes that has thus been obtained is restricted to qperatorsat common space-time points, since this is the nature oI 6(i2", orlil', a) the terms in 8. Commutation relations between field quantities located at distinct points of a space-like :(i/ttc)(ti',,,1afatlulfu', oJ, (i.2u) surface are implied by the general compatibility reI J quirement for physical quantities attached to points "L with a spacelike interval. Components of integral spin where fields, and bilinear combinations o{ the components of c : (nsR-'; r : 3r ((ROdR-t)r, + A!(Rd"R-) "). (3.21) half-integral spin fields, are the basic physicial quanIn the last statement, the * sign indicates the efiect of tities to which this compatibility condition applies. By the coordinate transformation (3.18) on the components consideringthe generalpossibilities of coupling between of the gradient vector, while the notation J"( ) the various fields, we may draw from these two expressymbolizes the reversal in the order of all factors sions of relativistic invariance the consequencethat the induced by the operation of transposition. The operator variations 64b(r'), and therefore tle eon=jugatevarialVlore generally, if
355 927
THEO,RY OF QUANTIZED
tions 6llt(*f), iommute or anticommute with d"(r), ll,(a) for all r and *' on a given o, where the relation of anticommutativity holds when both a and b refer to componentsof half-integral spin fields. The consistency of this statement with the general commutation relations that have already been deduced from it is easily verified. By subjecting the canonical variables in Eq. (2.81) to independent variations, wei'bbtain
FIELDS
and statistics of particles is implicit in the requirement of invariance under coordinate transformations.r0
r0The'discussion of ttre spin and statistia connection by W. Pauli fPhvs. Rev. 58, 716 (1940)l is mmewbat more negative in charac'ier,although bawd on closelyrelatedphysiel requirements. Thus, Pauli remarks that Bose-Einstein quantiation of a halfinteeial spin field implie au enerEythat posrcse no lower bound, and-that Fermi-Diric quantization of an integral spin field leads to an alcebraic contradiction with the commutativity of physical oumtitiis located at points wilh a spacelike interval. Anotier dostulate which bas Leen emploved, that of charge symmetry g, i-W. p.uti and F. L Belinhnti, Phvsie 7, 177 (1940)l' suffices -m lO"@), d@6(r')la: [II'(r), a4b1r';1*: determine the niture of tbr commutation relations for suffrci[C{r), 6ilb(r') ]+: [rl'(o), drl6(r')]*: 9, 13.25; entlv simple systems.As we have noticed, it is a consequenceof timi refleition invariance.The commentsof Feynman on vrcuum rvhich is valid for all r, d on r. In addition, Eq. (2.81) polarization and statistics [Phys. Rev.76, 749 (lq+9)] aPpearto suce a properly states that all physical quantities commute at be an illustration of the charge symmetry requirement, contradiction is establishedwhen ttre chargesymmetrical concept distinct points of o. of t-hevmum is applied to a Bo*-Einstein spin | field, or to a We conclude that the connection between the spin Fermi-Dinc spin 0 field,
356
P o p e r2 9
The Theory of Quantized Fields. II JurtaN Scurvttcrn II qrutrd Unioersil,t,, Canbrid.ge, fufassdchuseils (ReceivedFebruary 19, 1953) 'Ile arguments leading to the formulation of the action principle for a general field are presented. In associationsith the complete reductiou of all numerical matrices into symmetrical and antisymmetrical parts, the geaeralfield is decomposedinto two sets,which are identified with Bose-Einsteinand Fermi-Dirac fields. The spin restriction on the trvo kinds of fields is inferred from the time reflection invariance requirement. The consistencyof the theory is verified in terms of a criterion involving the various geireratorsol infinitesimal transformations.Following a discussionof chargedfields,the electromagneticfield is introduced to satisfy the postulate of general gauge invarimce. As an aspect of the latter, it is recognizedthat the electromagneticfield and charged fields are not kinematically independent.After a discussionof the field strength commutation relations, the independent dynamical variables of the electromagnetic freld are exhibited in terms of a special gauge.
tfHE
general program oI this seriesl is the conI struction of a theory oI quantizedfields in terms of a single fundamental dynamical principle. We shalt first present a revised account of the developments containedin the initial paper. THE DYNAMICAL PRINCIPLE The transformation functions connecting various representationshave the two fundamental properties
6W"fi:6W"8; the infinitesimal operators 6trV"0are Hermitian. The 6W"p possessanother additivity property referring to the composition of two. dynamically independent systems. Thus, if I and II designate such systems, (ot' au' IAt'Fn' ) : (ar' I f.r) (ar' I 9n'), and if 6ll"pr and 6IV are the operators characterizing "6rr ininitesimal changes of the separate transformation functions, that of the compositesystem is
f
("' lt't : I la' lP'\lB'@'11').
@'1ts')*:@'1"');
dll
where{d,B' symbolizesboth integration and summation over the eigenvaluespectrumi If 6(a'lB') is any infinitesimal alteration of the tr4nsformation function, rve may write 6(q,lB,):i(q,l6Iy"pll3,), (1) which serves as the definition of the infinitesimal operator 6W"p. The requirement that any infinitesimal alteration maintain the multiplicative composition law of transformation functions implies an additive composition law for the infinitesimal operators, ' 6W (2) "r:51Y "ua51, ur. If the a and B representations are identical, we infer that 6W"":0, which expressesthe fixed orthonormality requirements on the eigenvectors of a given representation. On identifying the o and 7 representations,we learn that 6ilrp": -51y"u. The second property of transformation functions implies that
-i(d'l6w
or
611'"rtr517"utr. "6: Infinitesimal alterations of eigenvectorsthat preserve the orthonormality properties have the form 6{,(a): -iG.V(q'), ffI,(a')I:iV(d')1G",
,
where the generator G" is an infinitesimal Hermitian operator which possesses an additivity propery'yfor the composition of dynamically independent systems.If the two iigenvectors of a transformation function are varied independently, the resulting change of the transformation function has the general structure (1), with 6W"e:6"-6u' The vector V (a')+6.P (d') : (1- iG
(a'), ")v can be characterized as an eigenvector of the operator set a: (l - iG a(1 * iG.) : q- 6a, ") wrtn ule ergenvalues d . Ilere Ea: -ile,
G"l.
This infinitesimal unitary transformation of the eigenvector V(a/) induces a transformation of any operator F such that (a'IFIa"): (a'|FIa").
-i(B' l6w "plp')*: "ptla') :i.(9'l6Wp,l q'),
IJ. Schwinger,Phys. Rev. 82, 914 (1951), Part I ,IJ
357 JULIAN
714
SCHWINGEIT must have the additive form
We write this in the form (a' I F I a") - (a' I F I a") : (a' | (F - F) | a"),
c : f a o c , o ,J{I"^a,' :, c , r , 1 , J,
or, in virtue of the infinitesimal nature of the transformation, 6 ( a ' l F l a ' t ) : ( a ' l 6 FI a " ) ,
where rJo is the numerical measure ol an element of (*) is to be regarded as the timespace-likearea and G10) rvhere the left side refers to the change in the eigen- Iike component of a vector in a local coordinate system vectors for a fixed F, while the right side provides an based on o in order to give the surface integral an invariant form. If one can interpret Gu@) oll tr1, and equivalent variation of the operator F, given by on 02, as the values of a vector defined at all points, -ilF,G"). 6F:F-F: the difference of surface integrals in (6) can be transIf the change consists in the alteration of some formed into the volume integral paramete-.rr, upon which the dynamical variables de?6r pend, and which may occur explicitly in F, we have (d.r)duG,(r), 6ll'r,: | J,, (6F)" F:F:F+6,F- a.F, @ u : a / a x , )' where 6"F is the total alteration in F, from which is A second type of transformation function alteration subtracted 0,F, the change in F associated with the is obtained on considering that the transformation conbe cannot explicit appearance of r, since the latter necting f1, o1,and f2, d2 can be constructed through the produced by an operator transformation. We thereby intermidiary of an inhnite successionof transformations obtain the "equation of motion" with respect to the relating operators on infinitesimally neighboring surparameter r, faces.According to the general additivity property (2), (3) 6,F: a,F+ilF,G,l. For dynamical systems obeying the postulate of local action, complete descriptions are provided by sets of physical quantities, f, associated with spaceJike surfaces,o. An infinitesimal alteration of the general transformation function (fr'orl(r"or) is ch4racterizedby (4) ""oz). Here the indices 1 and 2 refer both to the choice of complete set of commuting operators f, and to the spaceJike surface o. We can, in particular, consider transformations between the same set of operators on difierent surfaces,or between different sets of commuting operators on the same surface, as in 6(lt'orlf z"oz):i(f t'otl6Wplf
6(('oli'c):i.(l'ol6wl('o).
(5)
One type of change of the general transformation function consistsin the introduction, independently on rr and on oz, of infnitesimal unitary transformations of the operators, including displacements of these surfaces. The transformations will be generated by operators G1 and G2, constructed from dynamical variables o1rd1 &nd d2, respectively' and 6W12:5t-6''
(6)
When the transformation function connects two different sets of operators on the same surface, which are subiected to infinitesimal transformations generatedby C ana G, respectively, we have, referring to (5),
oI
6W.rr:2 6W"aa'.,, a modification of the where 6tr4/,16",, .hu.".r""iir.. transformation function connecting infrnitesimally differing complete sets of operators on the infinitesimally separated surfacesa a.nd of-d'a' If the choice of intermediate operators depends continuously upon the surface, we shall have 5W ".,:0, and, referring again to the dynamical independenceof phenomenaat points separatedby a space-likeinterval, with the consequent additivity property, we see that will have the general form 6W "+a",, Votdc
(dr)ac(rt.
6W,va".,: | Therefore 'al
6tvu: | ,
(dr)6JG).
(s)
The combination of these two types of modifications is describedby
'' 6w,r: 6,- 6r* f
1r*;u"ir,,
(7) 6w:G-G. which involves dynamical variables on the surlaces o1, o2, and in the interior of the volume bounded by these points on a spacephenomena at distinct Since physical the like surface are dynamically independent, a generator G surfaces.On the other hand, we can write this as
358 THEORY
OF
QUANTIZED
7t5
FIELDS
where 6llrrr:g(I,trzrz)and the objects of variation here are 61, 62, and the dynamical variablesof which J is a function. { , / r ' ) [ a u c , f] .+r 6 t ( . f r ] . 6ll'rr: | The latter statement is the operator principle of stationaryaction.ft assertsthat l,tr/r:must be stationary which indicates, conversely,that any part of 6A(r), with respect to variations of the dynamical variables possessingthe form of a divergence,contributes only in the interior of the region definedby o1 and o2, since to the generation of unitary transformations on o1 Gr and G: only contain dynamical variablesassociated with the boundariesof the region.This principle implies and o2. The fundamentaldynamicalprinciple is containedin equationsof motion for the dynamical variables,that the postulatethat there existszrclassof transformation is to say, field equations,and providesexpressions for function alterationsfor rvhich the characterizingoper- the generatorsGr ancl Gr. The class of variations to ators 6llrz are obtained by apyrropriatevariation of a rvhich our postulatereferscan nos, be clefinedthrouglr the requirementthat this informatior.rconcerninglield singleoperator IIl12, equationsand infinitesimalunitary transformationsbe 6lV t':5111t ' self-consistent. "1 'I'here existsmuch freedomrvithin this class,as may Of course,this principle must be implernentedb). the be inferred from the remark that trvo Lagrange funcexplicit specificationof that class. tions, difiering by the divergenceof a vector, describe The operator LIlg, the action integral operator, evi the same dynamical system. Ihus dently possessthe form v^l',hp
intaor'l
?61
c (r): s (r) - d'l" (r)'
'ar
It',,: I
J",
(r.r. id.r;"e
frn:If
The Hermitian requirementon 6lI/12is satisfiedif IIlr: is Hermitian, which impliesthe sameproperty for J(r), the Lagrangefunction operator.In order that relations betweenstateson or and o2be invariantly characterized, the Lagrangefunction must be a scalarwith respectto the transformations of the orthochronous: Lorentz group, rvhich preservethe temporal order of o1 and a2. A dynamical system is specified by exhibiting the Lagrange function in terms of a set of lundamental dynamical variables in the infinitesimal neighborhood of the point r. Contained in this Lagrange function n'ill be certain numerical parameters,rvhich may be {unctionsof :r. Any changeof theseparametersmodilies the structure of the Lagrangefunction and is thus an alteration of the dynamical system. Accordingly, inlinitesimal changesof the dynamiczrlsystem are clescribedby 'or
6ll r' -
|
1 d . r t 6 lr . r r ,
n'here 6A:6(J), ancl thc numerical parameters are thc object of variation. This lorm is in agreement rvith (8). For a fixed dynamical system, tr/12 can be altered by displacing the surfaces 6b q2 and by varying the dynamical variables contained in the Lagrange function. 'Ihe transf ormation function ({ r' o rl l r" o r) describes the relation bets'een two states of the given system so that a change in thc transformation functior.i can oniv arise from alterations of the states on or and or. Hencc, for a fixed dynamical system we must have --ff*
yields
6IY tt:1; t- 1;"'
*u. \ras suggestediry H. J. Bhabha, Revs. ]Iodern P h y s . 2 1 , 4 5 1( 1 9 4 9 ) .
n'hcrc
n.
n".h
,"- (tTrWz),
(9)
.".f""" ff
r:ldo,l,-ldoJo. t-
J 6
. \ c c o r t l i r r g l l ' .t h c . t r L i o n a r y a c t i o n p r i r r , i l ' l c f o r [ - 1 : is satisfied if it is obeyed b.v ltrl12,since
6W":Gr-G. Here
6 l l ' , : 6 : , , G , , 6 l lr : 5 . . t ; r ,
dehne dr and dr, rvhich are new generatorsof infinitesimal unitary transformationson 01 i1r]do2Jrespectively. The latter equationspossessthe form (7), and thus characterizetransformationfunctions connecting trvo different representationson a cornrnon surface. Indeed,rvith a suitably elaboralenotalion) t'e recognize in (9) the additivity property of action operators, r) : 1l'(iror, fi or)* IIl ((,o,, l ro "o ") llt;(ist, rvhere,{or example, II/ (f s', i
i:ot),
I t r ' r - 1 Y 1 7 t o r ,f r o r ) : l l " ( f r o r , i r o r ) , and lVz:IL'1.1,or,7rorr. To be consistentwith the postulalc of local action, the field equations must be differential equations of linite order.One can alwaysconvertsuchequationsinto systemsof iirst order equationsby suitableadjunction o{ variables.\Ve shall designatethe fundamental dynamical variablesthat obey first-order field equations by x,(r), which {orm the componentsof tire general lield operator x(.D. With no lossin generality,rve take
359 7t6
JULIAN
SCHWINGER
ment on I is satisfied if JC is a scalar,
x(r) to be a Hermitian operator,
K(Lil:K(i,
Y'(x)t: Y'(x) ' If the Lagrange function is to yield field equations of the desired structure, it must be linear in the first derivatives of the field operators with respect to the space-time coordinates. Furthermore, if these field equations are to emerge as explicit equations of motion for field components,that part of the Lagrange function containing first coordinate derivatives must be bilinear in the field components. With these prelimiuary remarks, we write the following general expressionfor the Lagrange function, s:l(721,0,y-0ux2lux)-K(x),
(10)
in whiclr a matrix notation is employed, a2Iuoua: y, (21,),,a,a ".
and if
/irs)lI
-$l
(1J)
Note that ?Iu" and $* also obey Eqs. (12) and (13), respectively, and that these equations can be combined into Z-'($ r?I,)Z:r,,(S t?I,), in view of the nonsingularcharactero{ $. For an ininitesimal Lorentz transformation, '
ep,: tr:'r_ 'rr'r* ep, The derivative terms have been symmetrized witl.r respect to the operationof integrationby parts, a process the matrix I can be written which adds a divergence to the Lagrange function, and L:l_iie*Sr,, is thus without efiect on the structure of the dynamical syst€m. In order that S be a Hermitian operator, the where generalfunction JCmust possessthis character, Sr,*: -Sr,1 u' v:0,
rc(il1:3c(il' and the numerical matrices ?L,; p-0, 1, 2, 3 (ra:ixs, ? t 4 : t U o Jm u s t b e s k e r v - H e r m i t i a n , 2 l p 1 : ? I r h x :- \ p t
p:(), 1,2,3.
(12)
Lr"WuL:ru,4,,.
lhe same \\'e shall supposell)at the source possesses transformation properties as the field. The condition for the source term of the Lagrange function to be a scalar is then siven bv
-
cvpt
(14) 3.
(15)
The infinitesimalversionof (13) is - S",h: SSr'$-t-
Su,t
or
(ss!,)1: ($s,,),
in rvhich the cornplex conjugate statements refer to the Although we are interested in complete dynamical componentsindicaied in (15). Similarly, systemsJit is advantageousmathematically to employ (16) ?I,s,x- ^t,It?Iu:i (6,r?I,-r,,9Ir) devices based upon the properties of external sources. Accordingly,rveadd to (10) a term designedto describe and 1 ? 1 ,S, " r ] : i ( 6 u i S ' ? I , - 6 , , . S - ' 9 I x ) . the generation of the field 1(r) by an exlernal source [E f(r), which is to be regardedas a field quantity of the If one viervs'a: (1-i\e ,,5) x as a field in the original samegeneralnature as 1(r), coordinate system and thus subject to the same de(i1) pendence upon that coordinate system as 1, it is s"""""":+(€Sx+x$t). inferred that if is operator a Hermitian matrix, This is a Hermitian $ L-rS,,L: r rxr,rSxr. mi-m For inlinitesimal transformations, this reads For the sourceconcept to be meaningful, all componi[^S,,,Sr"] : d,-S,x- 6,^S,x*d,rSr*-durS,-. ents of 1 must occur coupled with the source components in (11), which requires that E be a nonIn performing the variation of the action integral, singular numerical matrix. we shall treat the two types oI quantities, coordinates An orthochronousLorentz transformation and ireld variables, on somervhat the same footing, tx r although the former are numbers and the latter operr : p , r , + IF , ators. We introduce an arbitrary variation of the cort"r:I, ru>-0, ordinates, 6r!, throughout the interior of the region, but subjectto the conditionthat the boundariesremain induces a linear transformation on the lielcl complane surfaces, ponents, tr: ['a:aLv, (r7) d u d r ,{ d , 6 r ' r : 0 , rvhereZ must be a real matrix, f *- r '1. The scalar requireto maintain the Hermiticity of
on o1and or. The field components x,(r) are dependent both upon the coordinate system and the "intrinsic field." Ltnder a rotation of the coordinatesystem, the field components are altered in the manner described
360 THEORY
OF OUANTIZED
by (1a). Accordingly, we write the general variation of the field as the sum of an intrinsic field variation, and of the variation induced by the local rotation of the coordinate system, 6(x) :6x-
where the antisymmetry of S' ensures that only the rotation part of the coordinate displacementis efiective. For the source field, a prescribed function of the coordinates, we have
^f
r
fanc^r
: +(?1,a,+?{,du). ?{(ua,) 68 : 6x?lrapx- d"x?1"0x- arc+ I (dxSt+ f$6x) *6rui (xsauE+ a,t$il * a"[] (x?{u61- 61?[u1)]. Hence, on applying the principle of stationary action to coordinate and field variations, separately, we obtain
(18)
0,T u,: I (yE 6,1* A,fEx), and
We also remark that
6?,C:6xaplpx- A,x2lu6x++(6xst*f$6x),
6(d*) : (d.r)6 ultc,, 6(0): - (a,6r,)a,, whence 1 1q \
The Lorentz invariance of 3 produces a significant simplification, in computing the contribution to d(S) from the coordinate induced variation of 1. Thus, if 316*,were antisymmetrical and constant, its coefficient in the variation of the Lagrange function would vanish identically, save for the source term since the rotation induced change of f is not present in (18). Accordingly, for the general coordinate variation of (10), there remains only those terms in which d"6o, is differentiated, or occurs in the dilation iombination, 0r6x,!0,6x,. Both types are contained entirely in (19), which leadsto r,! 6,6r)| QA u04- 0,yWu7) 6(r) : 6"c- + (Ap6 - i.L(a ra,6r)tx(21,,S,^-pS,.t21";" - r+ (dts6r,)({SS,,x xSu,t$t).
G-
-DxU*r)*1,,6r,]l | do"[ (\U"6x J n
The operator 3Cis an arbitrary, invariant function of the field 1. If its variation is to possessthe form (21), with D1appearing on the le{t and on the right, the latter must possesselementary operator properties, characterizing the class of variations to which the action principle refers. Thus, we should be able to displace 01 entirely to the left, or to the right, in the structure of DJC, 6K : 5x(6r\c/ Ox) : @,tc/ dx) 6x, which defines the left and right derivatives of 5C with respect to 1. In view of the complete symmetry between left and right in the processof multiplication, we infer that the expressionswith dx on the left and on the right are, in fact, identical. The field equations, therefore, possessthe two equivalent forms 2AuOuy: (afic/ai-Et, -ou72A,u: G,K/ax)-tE,
In virtue of the symmetry of the secondderivative, (dud,3rr)x (?IuS,r*S,ri?lu)x : (d" (ddrrf drdr,))1 (21,S,1f S,1t?L)1 +-
where the last step expressesthe result of'an integration by parts, for which the integrated term vanishes, since the dilation tensoris zero on the boundaries(Eq. (t7)). Collecting the coeficients of dudr, into the tensor ?r,, we have fol
J
-
(dr)[6f+(d,6.r.,)r,,1
d,
fol
|
and G can be equivalently written
(d,611{ dldr,) ap[x (?l,Srr+^S,r1?I)x],
6(Wn\: |
Ql)
rvhile the surface terms yield, on oy and o2, the infinitesimal generator
and
6 ( d , x ): d u d ( x ) - ( 0* 3 r , ) 0 , y
and we have employed a notation for the s)'rnmetrical nr"f
The expressionfor df' is
iL(auor,)s",x,
6(f):6*ud,{.
717
FIELDS
(dr)[6"c-6.r,a,ru"lou(7,,6x"\f,
J n,
where T * : s,6* - t Q\I'1no,1v-o pa{',1v) -ii(g$S,a-15",t$f) *lldr[x(9lr,sx,r*sr1,i?I,;)1], Qo)
f
G:
I rlo,[19i,01f I",6i',] (2',2)
:
f a",l-aer,,1f
?u,6r,1.
In keeping with the restriction of the stationary action principle to fixed dynamical systems, the external sourcehas not been altered. If we now introduce an infinitesimal variation of f, and extend the argument of the previous paragraph to df, we obtain the two equivalent expressionsfor the change induced in tr/12,
6ilu:
fot
|
J o,
(d:)D€$r:
fol
|
{a';1559.
Jo'
The correspondingmodification in the relation between
36r 718
JULIAN
SCHWINGER
stateson or and on 02 can be ascribedto the individual states only if one introduces a convention, of the nature of a boundary condition. Thus, we may suppose that the state on 02 i: unaffected by varying the external source in the region between or and qz. In this "retarded" description, Dtltr/r: generates the infinitesimal transformation of the state on or. An alternative, -dgtr/12generat"advanced" descriptioncorrespondsto ing the changein the state on oz,with a fixed state on or. These are just the simplest of possible boundary conditions. The suitability of the designations,retarded and advanced, can be seen by considering the matrlt of an operator constructed from dynamical variables on some surface o, intermediate betrveena1 a.\d o2,
to plane spaceJike surfaces,limiting displacements to infi nitesimal translati4ns and rotations, 6ru: e'!e "ru' with the associatedoperators,the energy-momentum veclor f
p,:
I rtouT'r,,
and angular momentum tensor f
J u,: I doxMx,,. J
Mxuo: :rrTx'* r,Txu.
(.({ o 'l F (a) ll r" 02)
The operator G, evidently generatesthe infinitesimal produced by the - | , l i o l f ' o \ d l ' t f ' o F ( o \ l l " q \ ' ' l f " ( f " o 1 2 " o 2 \ . transformation of an eigenvector, displacement of the surface to which it refers. With the notation An ininitesimal change of the source I produces the aJ, (1,") : (e,6,* |epdu)V (i,o), following change in the matrix element, we have 6t(ir'orIF(o)|lz"oz) l5,v(1/c) : P,!Ir(f/o), -i6,V(f'o) i:{/ (l'o) rP,, : (11o, | (a f (o) { ibsWuF (6) + iF (6)6rW, 2)| f 2"oz) : ( r 1 " ' | ( a f ( o ) * i ( F ( a ) 6t w n ) + ) l l z "qz ) , and '") i6,N (f ' q) : J pN (f ' o), - du,v (f t : v (l' o) tI u,. in which we have allowed for the possibility that F(o) may be explicitly dependentupon the source,and introIf F(o) is an arbitrary function of dynamical variables duced a notation for temporally ordered products. The on o, and possibly of nondynamical parameters dematrix element depends upon the external source pendent on o, we use the notation through the operator F(o), and the eigenvectorson o1 gets for various expressions 6gF(o), D,F(o): (e,6,1 I e,,6,,)F (o), and or. One thereby depending upon the boundary conditions that are a,F (o) : (e,0,1 1,,,A,) F ("), adopted. Thus, if the state on a2 is prescribed, we find to distinguish between the total change on displace6 s f ( o) 1 , " ': 6 , p 1 o ) * i ( F ( o ) D t l Zrr) +- i 6 l / r z F ( o ) ( 2 J ) ment, and that occasionedby the explicit appearance of nondynamicalparameters.On referring to Eq. (3), : atF (o)* ilF (") , 6Jv ',f, we seethat which only involves changesin the source prior to, or 'Ihe 6,F(o): a,F(q)i ilF(c), P,), opposite convention yields the analogous on o. result 6u"F('o): 6,,P 1") -l ilF (o) , l ,'f 12 drF(o)1,a"- dtF(o) {i(F(o)Daltr/r,)1- iF (o)61tr' The proper interpretation of the generating operator : atF (o) _ i.lF ("), dy'{'r"]. G, can be obtained by noting its equivalence with an appropriately chosen inflnitesimal variation of the exNote that ternal source. Consider the following infinitesimal surdsF(o)1."i-6gF(")1.a": i[P("), 6rII/'r]. face distribution on the negative side of o, The operator G of Eq. (22) consistsof two parts, (24)
So€:21,ru"u,r,rr,
G:G,*G,, lvhere
c,:
f"ao,y\t,a": ["rto,6ylr,a,
which is not incompatible with the operator properties of these variations. We have assumed, for simplicity, that the equation of the sur{aceo is r:1oy:0.With this choice,
and
do,T,,a*,:e,P"lle,,J u, G ": f of the restriction The latter form of G, is a consequence
6{v,,:
f
Gx d.oa2Iqo16x: "
The change that is produced in 1 can be deduced from
362 THEORY
OF qUANTIZED
719
FIELDS
decomposition
the variation of the field equatons, 2Wro,62y- 6E(61K/ ai : - E5 t : -?Irordxd(rror).
?I,:?J*tr)f!1,{:r, ?lro)t- -?IP(t),
E:S(t)+S('?), $ori.:9u,, s(2)h: -s(2),
2lP(2)r-?l!(2)' Evidently there is a discontinuity in 6rx, on crossing the surfacedistribution 6f, which is given by The matrices of the first kind are real (p:6, ' '3;, and those of the second kind are imaginary. We shall -2116y61. 2?lrordrxl: not write the distinguishing index when no confusion is In the retarded description, say, 6g1is zero prior to the possible. According to this reducibility hypothesis, the field source bearing surface, so that the discontinuity in 6g1 is the change induced in 1 on (the positive side of) o. equations in the two equivalent forms Thus, the surface variation of the external source 2?1,0,a: (agt /Ax)-EE, simulates the transformation generatedby G", in which -2?It"a &: (A'tc/Ax) -E "t' on o is replacedby ?[
?Iror *?Irord i : 2Irorx * :2lrorx- 12[
separate into the two sets
(2s)
The matrix ?lrorhas been retained in this statement since it is a singular matrix, in general. The number of components of 1 that appear independently in (25) and this is the number equalsthe rank of the matrix ?1101, of independent component field equations that are equations of motion, in that they contain timelike derivatives. The expression of (25) in terms of the generator G* is [?Irorx,G"]:i]?lrordx.
(a &/ ad : @,K/ ad,
221u6 uq: (arclad)-Sf, and
Gfic/aV): - (a,K/a{').
221,6u{: (afic/a,!)-En, Furthermore, the generator G,:J
f-f d"vU,q6r:J do( U.",t'6rtv,
decomposesinto G6fG,1, where
Q6)
The factor of ! that appears in this result stems from the treatment of all componentsof ?Irolxon the same a n d footing; we have not divided them into two sets of which one is fixed and the other varied.* If I is an arbitrary function of ?I1o;1on a, we rvrite
f -. od G ^ : I d o 6 2 ( ,D JJ
f l do{?{'0,6d)d. Q7)
cu: ' J J la,g\ts,a{,:I dor_ltu6,l,t{.
, lF , C,l:;15n7,: i15P
These resulls reflect the form assumedby the Lagrange {unction,
in which the componentsof 2lro;x are the objects of variation. When the field equationsthat are equations of constraintprove sufficientto expressall components of 1 in terms of ?{rorx,we can extend (26) into
a,6l++lvah a,vl- K(4,,1,), J : +{d?I,, +i{rs,0)*4[4$,*].
lx' G,f:iLax. Of course,one must distinguish between these variations, in which only the ?lrorxare independent,and the independent variations of all components of 1 which produce the equations of constraint from the action principle. In order to facilitate the explicit construction of t-he field commutation relations, we shall introduce a reducibility hypothesis, which is associatedrvith the Lorentz invariant process of separating the matrices ?[u,S into symmetrical and antisymmetrical parts. We require that the field and the source decomposeinto two sets, of the first kind lttr:6, €(r):f, and of the secondkind, x@:V, tQ):n, as a concomitant of the * Note addedin hool:-Further discussionof this point rvill be found in a paper submitted to the PhilosophdcalIlagazine.
The equivalencebetween left and right derivatives of the arbitrary function 5C,with respect to field components of the first kind, and of the trvo expressionsfor G6, showsthat D{ commuteswith all fields at the same point. It is compatible with the field equations to extend this statementto fields at arbitrary points,
@),66@')f : 0, lo @),6oG)l : 1,1, provided the source components are included, [l(r'), 6d(r')]:
["r(r), ad(r')] : o.
It lollows from (27) that the relation between ry'and dry'is one of anticommutivity. The opposite signs of the left and right derivatives of 5Cwith respect to ry'is then accountedfor by [ O ( r ) , 6 9 ( r ' ) ] : { , 1 ' Q ) 6, V Q ' ) l : 0 , provided only that JC is an even function of the vari-
363 720
JULIAN
SCHWINGER
ables of the second kind. The inclusion of the source components
which requires that the real, symmetrical matrix illroy{rr-rO" positive definite.
rr('),De(r')t:{n@),6te)t:0, coI;:11i.":?1i:Y:::?ISL"J#:o"i*Jf insures compatibility with the field equations. We have now obtained the explicit characterization of the class of variations to which our fundamental postulate refers. Let us also notice that "'ra.rlt$"agr*. rat -Jd' 'u!'\\us'^' f
r,r 6twrr: I td'rq$ag:
'
r".
into6rwnt6,w12, lvhere crecomposes f"t
f"'
6 r l lr : : I
J o t
and
r d . r r 4 t l a EI :
rd.rtr$69'6.
(dr)r/864- |
rtxll-Sta,g.
J o ,
llror(2),must be even, 2n@. Let us imagine that, by a suitable real transformation, !I1o1o)is brought into diagonal form. If the number of components in r[ is odd, the product of all these componentsat a given point commuteswith t at that point. Thus, as far as a l g e b r ao I o p e r a l o r sa t a s i v e n p o i n t i s c o n c e r n e d ' :nt
*:H:i'::J'-:,il,1i'?li,.'J^^",:1';.:1'J,1 :l:1*T,i:fri':;:lL:i'.it"::lffii:Lthe
The relation betweeninvariance under time reflection, and the connectionbetweenspin and statistics,may be noted here. The time reflection transformation
' 6"1',,: f
assump-
txq:
-14,
tatr:ao,
induces a transformation of the field We can conclude that source variations have the same operator properties as field variations, as already exploited in Eq. (2q. The operatorpropertiesof ?Irotxon a given o can now be deducedfrom (26), with the results
'x: Ltx' such that
Lqr.l(tLr- -lI4
La'"!loLr:s5 r,
(2e)
and ZI"SZ4:E,
K(Lq):K(i,
[Urord("), d(r')?lior]: ri2l10)6"(r- *'),
However, this preservation of the form of the Lagrange (28) [ ? I t o r d ( r )9' ( r ' ) 2 1 , 0 , ] : 0 , function is only apparent,for fields of the secondkind. Since -l9l,rtzl is a non-negativematrix, one can only {?Itor/(r), /(r')?lror} - i}?IrorD,(rc-r'), satisfy the first equationof (29) rvith an imaginary Zai?) ir rvhich 6,(r-r') is the three-dimensionaldelta func- rvhich produces skerv-Ifermitian field components '1('!). tion appropriateto the surfaceo. The numericalforms But the invarianceof the Lagrangefunction is not the of these commutators and anticommutators insures c o r r e c tc r i l e r i o n f o r i n v a r i a n c eu n d e r t i m e r e f l e c t i o n . their consistencywith the operator properties of 6?11qS The reversal of the time senseinverts the order of o1 variables first and The dynamical o{ the 6?I1nyry'. and and o2, and thus introducesa minus sign in the action second kind thus describeBose-Einsteinand Fermi- integral, which can only be compensatedby changing Dirac fields, respectively, rvhich are unified in the the sign of i in (4). We shall describethis as a transgeneral field 1. formation from the algebra of the operators 1 to the Since the rank of the antisymmetricalmatrix 2116;(1)complex conjugate algebra of operators 1*. Since the is necessarilyeven, there are an even number of inde- linear transformation designedto maintain the form of pendenf field componentsof the first kind, say 2n(L). s(0, a,O;{, dury') has efiectively replaced a with One can always arrange the matrix ?l
kinematically invariant under time reflection.In order that it be dynamically invariant, 3Cmust be such that Jc(0, i,l,)*: ut(6*,,t/*). Since 3C is an even function of the componentsof ry', the ialter are to be paired with the aid of imaginary matrices, characteristicof the variables of the second kind. The sourceterm is invariant if sourceand field transform in the same rvay. The correlationbetweenspin and statisticsenterson
364 THEORY
observing that an imaginary Za is characteristic of halfintegral ipin fields. We can prove this by remarking thaiall the transformation properties of Z4 are satisfied by -riS'+)lr, L+: exp(- tniSu)1-r exp(irrSu) : exp( where Zq is the matrix describing the reflection of the irrst space axis. The latter form is a consequenceof
721
FIELDS
OF QUANTIZED we have
f"l
J *ro1t-J r,(o2\: |
(dxl'Ir51'.,r,-x,d,*i'su,){
Jot
+ (tcu0,- s,0 ut i,S,,)€Ex].
In the absenceof an external source,I*, is symmetrical and P,, Ju, are conserved'.For and divergenceless, simplicityfwe shall confine our verification to the situation of no source, in which the infinitesimal 6f is distributed in the region between o1 and o2. Hence
Zr-1S1aZ1:-'!t'' The essential point with regard to the reality oI Ir is that Su:iSro is a real matrlx, whence
6 a P , G 1 t -:
|
(dr')a,YSDt.
Jat
and exP(2riS u) L t' f" ( d . r tl r u d , - . t , 3 , * i . s " , ) \ 5 6 { . 6 2,J, r o r t : - | say' as.9r2' same eigenvalues Now S1amust possessthe Joz spin field, which implies that /-r is real for an integral and imaginary for a half-integral spin field. The re- The consistencyrequirement quiremenl of time reflection invariance thus restricts fol (d.rr(6\)$6t:6'G,' (6Gr),=| fields of the first (B.E') and second (F.D') kind to This corre. t . t spins, respectively. half-integral integral and lation is also satisfactory in that it identifies the doublethen demands that valued, half-integral spin fields rvith fields of the second - (6x),: ,,0"x|!er'@,0 x"0 u-liS,,)x, (30) kind, of which I is an even function. "We have introduced several kinds of generators of which is indeed true in virtue of the equivalence beinfinitesimal transformations. A criterion for consistency tween (d1(*)),, induced by the displacement 6f!, and of the evaluations alternative the from is obtained ' x @ ) - x @ ) , i n d u c e db y the coordinatetransformation generators, such commutator of two '*r: rri_6xu. - i (6Gi. (6G Alternative forms of P, and /n, are convenient {or i G t: bl: lG ", ") the consistencYof G. and Gr. The following testing namely (6G,)b+(6Gt.:0. relationsderived {rom (16), Ir* : exp(riSr) h:
ix (2lI.Sr,- S't2l x)3xx, y2I r7_ all,a "7: "0 - Su't?Il)x, id rx ( ?IxSu, 0,xll,x: 0 uv2l "7-
As a first example, we consider the two generators e , P , ( q ) * | eu , J , " ( . or ) ,
G.:
enableus to write fp, as
and fo'
G= J
|
T. : S6r-
(d'r11$56.
o,
where
in the retardeddescription.In preparationfor the test, rve remark that l',to.t ,,t", -
:
f"'
J.,
syu,: -suy,:if,1(2?I1"Sra*2Srr,i?Irr _ -?|S",-S,,i?[)x, and
t,1.r\3.1",
r*o,a,rra.ttx), /"' tr'o+
P,-
"",
1,,: (dr')[r"3^rr, - r,01TsulT,"- ' r
7,"- T,u: - i+(tES,,x- xS",i$t),
(1U"a,v- a"r?l,r)*p",1. | d ou l s : 6 , , - |
but doesenter in
(dx)0xLIx,, f
: | Since
p,": -iilS u,x@&/ax)+(a;tt/ax)S,,xl.
In virtue of the antisymmetry of sr* in the first two indices, drsr", is automatically divergencelessand does not contribute to the energy-momentum vector P,,
and that J r,(or)-J r,(or)-
t Q}l u0;x- 3 "x?I,x)I d1s1",{Pp,,
1
I
- r,o,l i'9,,) a ^l- ix}Ix(:c,a, x " !!
(r,0,- x
"0,*iSu)1?Lx*runr,+
r,Pxpl
ra',.*,- tlo,x,\!. [
365 722
SCHWINGER
JULIAN
The components of P, in a local coordinate
system are
t
3"(r-r') and therefore vanish when multiplied by x1p1*fr1*1t . Furthermore,
p,o,: -x?I,*,dt,x- I (t8xix$t)], I dafK J
(31) P 1a ,:
y?I1o'd,*'1{p,6'1p,].
while those oI J,
ti(a fic/ 3y)6,(x- r'),
from which we obtain llc, N e1Gll: 2ip(or6r : 0.
arc f | dor1a,[JC | (19{,,,d,r,t J
- o utxllutx)* i(fSx* xSt)l -|i
[rc(*), x (r')]?Iro, : li(a,.tc/ ay)6"(r - x'), ?lror[x(r'), K(r)l:
f I dolJ
J r o , r * r: * ( n , P m -
and
With this information, the proof is easily extended to all componentsof pu,. The consistencyof the generators G, and G* requires that
f I do1(2l,ouS1o,ar*Sror.rrtlror)1, (.i2)
i 6 ^ ( e , P , f l e u , J , , ): -
faofa*),?t,o,a*.
or I o tt,:
t?{to, (*,r' d1ly r11,di*1* t51r114)X I daf
6\P,:
I do6"y22lq,6x,
J
- r1i;p1o;11;]. f r1t;p1oy17y The quantity p", is closely related to the infinitesimal expression of the scalar character of 3C, K (y- ii eu"S,,y) -cc (x) : 0.
f
6\ J u , : ,
do(t,0,- :,0 u|rSu,)1Zlfro,dr,
J
which can now be verified from the expressions (31) a n d ( 3 2 ) ,w i t h p 1 o ; i 1 y : 0 .
We can, indeed, conclude that
CHARGEDF'IELDS
pp":0, if 3C is no more rhan quadralic in the components of various independent lields. We shall also prove this without the latter restriction, but, for simplicity, with the limitation that there are no equations of constraint. The commutation relations equivalent to (30),
la, P,l: lx, J -f:
-ia,y, -i(x,\'-xd,JiS,")v,
imply that [x, tr'u,]:S,,x, where
Our considerations thus far specifically exclude the electromagnetic field (and the gravitational field). We introduce the concept of charge by requiring that the Lagrange function be invariant under constant phase (special gauge) transformations, the infinitesimal version of which is ,r: 11_i.6t6)x. Here 6X is a constant, and 6 is an imaginary matrix which can be viewed as a rotation matrix referring to a space other than the four-dimensional world. The invariance requirement implies that
N r,: J u,-*uP,!x,Pu.
6i:s6s-1,
This enablesone to expressthe scalar requirement on JC or in the form and [3c' N!"]:o' The components and that d l o r l r r:
I d o '( r 1 r ' r 1 r , ' ) [ f f ( r ' ) I ( r ? [ r i , a , r r ' t
- oai x?Iatld- l(4Sx*x$t)
($a;t:55' [6, E-l?I,]:
[6, S,,]:0,
3(.(x-i6\Ex)-K(i:0. We now write the general variation as 6() : 6y - ii @,6x,) Su - i6r.Ey, "a
where 6tr, characterizing a local phase transformation, f - l i d o r ( ? I , o , S , o , , r , * S i o ; 1 r 7 t ? [ , q is ) 1 ,an arbitrary function of r, consistentwith constant I values on or and on oz. The additional contribution to 6(a) thereby producedis do not involve the unknown ptoltrt. According to our jf dts6l- t; (tS6x- xS6€)DX, simplifying assumption of no constraint equations, the commutators (anticommutators) of all field components rvhere at r and *' contain the three-dimensionaldelta function i ,: - ix?Ir8x
366 THEORY
OF OUANTIZED
is the charge-current vector. The stationary action principle requires that
(33)
a,jF-i+(tEEx-xE8$, and yields as the phase transformation generatror f
I dol*6\:Qdl,
f1:
J"
where p is the charge operator. The integral stntement derived from (33),
QG,) - QG,) :
[,','
(d.r)diQ8 6t - t8 8x),
becomes th'i conservation gf charge in the absenceof an external source. If an infinitesimal source is introduced in tne region bounded by o1and o2,we then have, in the retarded descriPtion, 6g0(",):-,
|
723
FIELDS
algebras are involved, the field contains particles with charges0, *a. To present 6 as a diagonal matrix, we must forego the choice of Hermitian field components. Thus, for the example of a charged F.D. fiild, where the field components decomposeinto ry'1i,9(2), corresponding to the structures (34) and (35), the mutually Hermitian conjugate operators rlts; : tltss- i9 et, t et : t ot * i{ pt, are associated with eigenvalues *e and -e' respectively. On introducing these field components, the derivative term in the Lagrange function, the electric current vector. and the commutation relations, respectively, read Il*<-tw,,du*r*rl*1&r+r?Iu, 0r!<-tf, -iei(tePlr{<+t-{+flIu*<-),
(36) (37)
and } {?Iot! e>@), {c+r (o',)?Iror :{2t,0),y'1_;(r),/1-,(*')U10,}:0, (38) {?I
(ditdi$61
J oz
:ilQ@),Gi' W'rence
lx,Ql:Ex. This commutation relation also follows directly {rom the significanceof G1, indicating the consistency of the latter with Gs. We shall supposethat the matrix $ is an element of the algebra generatedby S-121* and Sr,. It follows that $ commutes with 6, and therefore that the latter is explicitly Hermitian, Et: E.
There is evident symmetry with respect to the substitux+-e' tionr!6<+!6; Since/1..yand p1-yare Hermitian conjugate operators' we can arbitrarily select one as the primary nonHermitian field. We shall write ca-l9fa u --;^, 'lPt <
and
te>E:,1,18:0.
to:{,
yields the following forms for (36), (37), and (38): possesses This
Such an antisymmetrical, imaginary matrix real eigenvalues which are symmetricaliy distributed about zero; nonvanishing eigenvalues occur in oppositely signed pairs. Since 6 commutes with all members of the above-mentioned algebra, the charge-bearing character of a given field dependsupon the reducibility of this algebra. Thus, if the algebra for a certain kind of field is irreducible, the only matrix commuting with all members of the algebra is the symmetrical unit matrix. Hence 6:0, and the field is electrically neutral. If, however, the matrix algebra is reducible to two similar algebras, as in
,r:(? ;r),
, ^
. r
tL!^t ,, xdpV)-
i . . ^
i
' l
iLxd tY"t t, V ),
e+10'Yr'!l'
(3e)
and : I0 @)rtot'/(r"')r
{": (-s.-').F,
(41)
(34) and state this ryrnmetry as invariance under the substi-
the matrix 6 exists and has the form (with the same partitioning) r* """'- o' / 0" _ i ", l (35) s: r( ) \i 0 / This describes a charged field, composed oI particles with charges +e, the Jigenvaluesof 6. If threi similar
i.:.ttionrlte{", ee-e. T h e m a t r i c e s" l p i P : 0 , ' ' ' 3 ,
obeY
'Y'i:E7*t-t' and ?ut':
-S7rS-r,
(42)
since they are purely imaginary matrices' ore sbould also recall that E is an antisymmetrical, imaginary matrix. If we were to depart from these special struc-
367 724
JULIAN
SCHWINGER
tures by subjecting all matrices to an arbitrary unitary trairsformation, we should find that the only formal changes occur in (41) and (42), where the matrix !] appears modified by an orthogonal, rather than a unitary transformation. Hence, in a general representation these equations read ,lr":C{, "t,r' : - C-11,C , where C still exhibits the symmetry of $, appropriate to the example of a half-integral spin field,
THE ELECTROMAGNETICFIELD The postulate of general gauge invariance rnotivates the introduction of the electrornagnetic field. If all fields and sources are subjected to the general gauge fra nqfnrm
c tinn
'1: exp(- iI (r)d)x: x exp(iI(r)6), the Lagrange function we have been considering alters in the following manner, 'f : S-firdrX. The addition of the electromagnetic field Lagrange function,
The commutation relations (40) are in the canonical form which corresponds to the division of the independent field components into two sets, such that one has vanishing anticommutators (commutators, for an integral spin field) among members of the same set. The generator of changes in ry'and ry',Eq. (27) in the notation of the charged half-integral spin field example, is C hl',{,) : Li I do (0t, 0,6{- a,i't,",V\.
C " ^ r : i U u , A p l - I { F w , . 0N A , - 0 , A u l ++FN'2+JpAp,
(43)
provides a compensating quantity through the associated gauge transformation 'A':Au-3'\' The term involving the external current -ftsis effectively gauge invariant if o'J u:0'
which can be deduced directly from the Lagrange function derivative term (39). Associated with the freedom of altering the Lagrange function by the addition of a divergence, are various expressions for generating operators of changesin the field components. Thus, we have the following two simple possibilities for the derivative term and the associatedgenerating operator,
+lh* id,vl, Ghl'):i J
and
I d"{to,6*,
-+lidih,"1'f, _f
Ghlr\: - i I do6,l,tp,{.
since the modification is in the form of a divergence. In the same sense, there is no objection to employing a form of the Lag^rngelunction in which the secondterm of (43) is replacedby (44)
+ { a p F P "A, , l . We write the general variation of ,4" in the form
6 ( Au ) : A 1 , - 1 6u ' r " 1 O , :6A u- l(3 uin,- A,6r,)A |(0,6t,t 3,5r)A,, "which ascribesto ,4, the same transformation properties as the gradient of a scalar, thus preserving the possibility of gauge transfonnations under arbitrary coordinate deformations. In a similar way, 5(F ) : 5Pu,- (dudir;)Fr,- (d,irJF"x.
Evidently G(ry'),for example, in the generator of alterations in the components z(0)ry',with no change in f7,r. The associatedcommutation relations, lt a{, G (,1')f: it ot6,!, lhat,G(0):0, are satisfied in virtue of (40), and, conversely, in conjunction with the analogousstatements for G(l), imply these operator properties of the field components. The connection with the generator in the symmetrical treatment of all field components is given by
(,1,) (,1'), c ({',,r): +G + +G which indicates the origin of the factor (1/2) in the generalEq. (26).
With .regard to the derivation of the electromagnetic field equations from the action principle, it should be noted that general gauge invariance requires that the sources of charged fields depend implicitly upon the vector potential -4u. We expressthis dependenceby ?
6 r ! { . r ' ): ,
Gx \ 6 t G ' \ / 6 A , ( r ) ) d l
"(r).
Since the infinitesimal gauge transformation, 6,4r: -dr6tr, must induce the change6t:-?dtr6f, we learn that a,QtQ')/6Au@)): -i8t@)6(x-r',).
(4s)
One obtains the following field equations on varying
368 THEORY
OF QUA NTIZED
If 6"Iuhas the explicitly divergencelessform
F r" and A, in the complete Lagrange function, F,":3u'4'-0'A,, 3,Fr,: j,!kplJ
p,
FIELDS
(46) (47)
6J':0'6M"'
M":-M'u'
(s3)
where6Mu, vanisheson o1 atrd 02,we find that
where
^61
f
ku ( s ' -) ) , I t d r ' r I l a g t r ' t .6 1 , { r r 1 $ x 1 v ' 1 " lx(r')S(a{(r')/6,a,(r))1,
6tW,,:
| Joz
(d)'\LbMpr,,,
which makes it unnecessary to introduce an external is the contribution to the total current vector associated source that is directly coupled to the field strength tensor Fu,. with charged field sources.We derive from (45) that The special nature of the electromagnetic field3 is a,hr:it(tSEx-xEE0. apparent in the form of the operator (52) generatilig Ilut ttie total current vector is divergencelessin conse- changesin the local electric field components.Since one quenceof the electromagneticfield equations. Therefore of the field equationsis the equation oI constraint
a,ir: -i+(ttb'x-x$49,
d 1 l y F 1 o y 1j lro1r:* k < o t l J < o t ,
(54)
the three variations DFlo;1*;cannot lle arbitrarily aswhich is in agreementwith (33). A{ter removing the terms in d(8"-) that contribute signed; the electromagnetic field and charged fields are not kinematically independent. This is evidently an to the field equations, we are le{t with aspect of the gauge invariance that links the two types : (s"*, A 6PQ"P6A')| 6 u6'J'6x' +{6j,, A } of fields. Alternatively, s'e see from (51) that ,410)is - \ (6 rbx,! 0 (+ A,j - + lFur, F,r) ) not a dynamical variable subject to independent varia"6r,) U,, * ( a u 6 r , ) 1 , A , , ( 4 8 ) tions. But there is no field equation that expresses,4loy in terms of independent dynamical variables, in virtue in which of the arbitrariness associated lvith the existence o{ -i6yl/',E{a, A pl: - i{A $ y}2l,Eby. A , l : +16j,, gauge transformations. Furthermore, a variation of -,1gr in the form o{ a gradient, that is, a gauge transThis term alters the field equations of charged fields, formation, yields a generating operator which, in con2 2 1 , ( A , x -i , E + l A, , 7 \ ) : ( AL K /A ) - $ L sequenceof (54), no longer contains electromagnetic - (auxliL{x, A p}E)2WP: (a,K/a).i- t!3. field dynamical variables. Thus, in either {orm, (51) or (52), there are only two kinematically independent We have anticipated that not all componentsof '4, variations of the electromagnetic field quantities. is as The tensor ?r, now obtained commute with 1. We now apply these generators to deduce commuta( 4 9 ) tion properties for the gauge invariant fieid strength Tp,:...+ilFt^, F,r)- j{jr,, A,Jl-J,A,, comporlents. According to the efiect of a variation where .. . stands for (20), but with "8 the complete 61 1r;,upon the local componentsof F' we have Lagrange function. The action principle supplies the [ F 1 o ) 1 rG y ,e ] : 0 , difierential equation A ot), G tl: i (d c16A 6 - o s16 at, lF tot (50) A pa,J,. auTu,: i(x8a,t*a,tEx)l whence 'I'he divergence term in (48) yields the infinitesimal (55) generator [ i ' 1 q 1 r y ( r )i,' 1 0 ) 1 r ) ( r ' ) ] : 0 , and
cn: -
ao,n,"aa,:-
!
!
(s1) a'a
rvhile the Lagrange function with the derivative term (44) would give Qr:
ff do,oF,,A":
J
dabFo,,r,Aur.
(52)
J
The change in the action integral produced by a variation of the external current _/, is given by
5rw,,:
I', @.r)EJ,A,.
[l'orro (r), Fcorr-r(*')] - 63r<-rdr.r)6,(r - r'). : i (6rp<,,rdro
(56)
In using Gr, we must restrict the electricfield variation according to dllyDFlqlry:0, w h i c h i s i d e n t i c a l l ys a t i s f i e do n w r i t i n g 6 F< o t t o :3 < t t 6 Z < n p 1Z' 6 1 6 : - Z o t 1 o . 3 Papers dealing tith the situatior peculiar 1o the electromaenetic 6eld are" legion. Of the older literature, the closesl irl spiiit to our proceduie is {hat of W. Pauli, Handbuch der Physik (Edwards Brbt}ters, Ann Arbor, 1943), Vol. 24.
369 726
SCHWINGER
JULIAN
Th ic viAl/le f ha fnrm
A changein the external current, of the form (53), yields -
f
Qp: I do)2F6s8266.
: 3pOx6M,x-0"3x6Mux, (59)
fr'r 6rF,,(r): if r,"G), |
[ F 1 1 ; 1 aG, r ] : 0 , Gp]: idFlo)1ly, [F1q)11y,
:
f . .," d o l i \ n 1 nD , F G t1 1- 16 mo , 1 1 , . F 1 0J y. 1 0 , J
The alteration produced in the field components follorvs from the lield Eq. (47), and the {orm of (46) given by
(s8)
Thus, 0e) (6Fo a)- 6Mrorro)= - dordlr 116 | Os16M61pl, d l o y D F q r:; 1d71; 4 D F 1 s-y 1d;1y1 ; 6 F 1 e 1 1 r ) ,
the
on
c,rrfare
6 F 1 e ; 1 4 ]d- 1 r ; 6 r u 1 r r o , 6 F& 1e ) f : A( D 6 q o ; 1-r yd 1 r ; 6 m 1 0 y 1 4 . In the retarded description, these discontinuities are the actual changes in the field components on o. On referring to the general formula (23), we obtain 661tn 61s1: ilFrorrzr,G-], - dlrydm : ilF d14dzz1o;1ry G^].
( d x . , ) ! 6 M 1 , ( r , ) q y e _x , 1 i
|
XIF -,(r),1"^,(r')1, and r11is the discontinuousfunction
for which the associatedgenerator is
which yields the iollorving discontinuities in 6I'
I
fdt
(s7)
gM u,:6m,A(rp),
rrnscino
(dr')l6M^^(jr )!.'^^(r') |
Jot
where the latter is equivalent to (56). An alternative derivation employs an infinitesimal change in the external source,distributed on (the negative side of) d, r(o):0r
dpF,r+A,Fr!+dxF,,:0.
Jot
L
then provides the commutation properties
^ G^ :
d,/bjr J I
where, in the retarded description
The expressionof changesinducedby 61"1n;11y,
(r')]: g, [Frnrro(r), l'1,,;1"1 lFcorr.r(r), F utot@')l : i ( d 1 1 ;d11a- 1- 6 o rr . rd < u ) 6(,r - r ' ) .
o ^'o il r, pe- d u0M J,+
na(r- x'):1, :0,
5 0 2* o r fio
We have a similar expressionlor 6xj,(r). On comparing the coefficientsof 6Mp(x') in (59) (our two treatments employing external sources are thus distinguished by surfaceand volume distributionsof 6M ,,, respectively), 1vefind - 0 f 11 a@- r')ilF, Q), F y"@')f - 0,a1 - *' @ )ilj, (r), ry^ (r:')l * 0q1Q - r' )i,lj" (r), P1-(o')l : ( 6 , 1 d r d * -D , " d u d 1 - 6 * 1 d , d * *Du-a,dr)6(r:-*'). (60) The value oI ilF,"(x), F^.(r')], for equal times, is then obtained from the coemcient of the difierentiated delta function of the time coordinate, $'ith the anticipated result. In the approximation that neglects the dynamicai relation between currents and fields at points in timelike relation, the differential Eq. (60) has the solution q1Q - r' )ilF,,(x), Fr. (s')l : (6,r4!d.- d,.d,3r - 6rrd,d** 6u-A,a x)D."t(r:- r-'), tvhereD""1(r-*/) is the familiar retarded solution oI - di'D.",:61r-t';
(61)
Had we employed the advanceddescription,4+ would be replacedby -4 , where 4 (r-r'):0' :1,
1'n1'o' r:o(ro',
and the advancedsolution of (61) rvould appcar. Subtracting these trvo results, we find ilF,,(x), F^"(r')l : ( 6 , r d t s a _6-, ^ a r d r - d , 1 d , d " f6 g * 3 , d 1 ) D ( . r : - . u , ) , in which D(r.-x') provided by
is the homogeneoussolution of (61) D: D*t- D.a".
370 THEORY
OF
141
FIELDS
QUANTIZED
'I'he kinematical relation between the electromagnetic The resulting commutator lield and charged 6elds, on a given o, is most clearly ilA t?r 61 @), F10)1a (irl)] indicated in a special choice of gauge, the so-called :3ruut6"(x- r') - 0*16si D'(rradiation gauge, : (611y1n6"(r;r'))(r), (62) 3o>Ao>:0'
r')
With this choice, the constraint equation for the electric Iield reads d l r v F l o r t r l :- 6 t r t ' A < o t i:< o t I J < o > ,
is also consistentwith the transversenature of '41t1'The remaining commutation relationsare
so that the scalar potential is completely determined by the charge densitY,
We shall use the device of the external current to derive the commutation relations between the electromagnetic field tensor and the displacement generators P,,-J u". According to (49) and (50),
.,1ro (r) :
-' (r') ), f a"' o " (* *' I Uror(*')*J,0,
(") (r')] : 0' : lA 6 Q),,4 1a(r')] [F.,,0, (r), Florcl(")
Jt
rvhere D,(x-
r') : (l / 4r)l(r1*1- x
f6l
P,(qr)-P"(o)-
|
U*ll" 'lAfi"J^7,
Jcz
does not commute with the components Evidently, -410y fo' of chargid fields. In this gauge, then, the dependence J (dx)[ " *.4r(r,0"- x,6,)Jx u,G)- J,"(o,\: | Joz of the ilectric field upon the charged fields is made +A.J,-A,JF], explicit through the decomposition of the electric field inio transverse and longitudinal parts, in which we have indicated only the terms containing Prortrl: - drol'A t*1- 0 614P1 the external current. We consider an infinitesimal : p,o,1py("){F1oy1o(r). change in the latter possessingthe form (53)' In the description, lhe resulting changes of P, and The inference that the transverse fields are the inde- retarf,ed ate o1 on J u, pendent dynamical variables of the electromagnetic held in this gauge is confirmed on examining the generf"t (dx)tz6Mx"a"Fx,, ators Ge and Gr. Indeed, a"P,(a): - | Jo
ff doF oxtf)6A o, daF s0t16Aut: G^: J J and 5o:
f J
6,t. (" ) : -
^-1*ua"- r "0u)Fx, 1a*174tM
[:"'
d o 6 F < o t t t t A -o . | do6F'o"u's'A'r" J
in view of the transversenature of A61,FQ' (62)' We can now derive the commutation properties of these dynamical variables from lAsl,Gal:i6A6,
*6M x"Fr,-6MxrF,xf. When expressedin terms of the generator
"
fF1ol1*r(t),Ge]:o'
lArrt,Gr]:0, fFqoylo("),Grf:i6Fo>ots), restrictions the into d.ccount on taking : 0, dtll6.4rlr : dr6DF(o)(*)(")
*:
I.',,'
(dx)|6M x,Fx,
the following commutatorsare encounlered, ilFx' P,l:a,Fx,, i.LF*, J u"): (* u6,- x,0)F a^J 6,^F,t -dp-F,r*6grF* -6"xFv*
oroducedby the transversenature of thesequantities' The Lagrange multiplier device permits us to deduce Finally, we remark that the extension of (31) to electromagnetic field, in the radiation include'tie that gauge, is (rr)]: 56rtoD"(r-r')*dro'Ig>' i.lA 111{i:r),F1oy1;t"r The divergencelesscharacter of the transverse electric field supplied the information 6 6t3"(r- r'), 61t,'2)r1*1: whence
I
^
P\o\:
f
J
,,',, , \ , t t t ' (F
- J
371 728
JULIAN
SCHWINGER
and
.'
of the independent fields yiel
P@:
J
dol+l\ 0.114(r), F1*rrrr)-xglrordr*rx].
ln arriving at the expressionfor p1ny,the noncommutivity of lrol with 1 must be taken into consideration, but producesno actual contribution, A variation of each
ar,:
f
a"7arru1111e)0sA1p1-6.111116,1,,1n111110) -6y2[610p1al'
which confirms the consistencyof the translation generator with the various field variation generators.
P o p e r3 0
372
The Connection
Between
Spin and Statisticsl
W. Peur-r Physikaksches Institul, Eidg. Technischen Hochschule, Zilrich, Switzeildnd and Institilte for Adtanced, Staly, Princeton, New Jersey (Received August 19, 1940) In the following paper we conclude for the relativistically invariant wave equation for free particles: From postulate (I), according to which the energy must be positive, the necessity ol Fermi-Dirac statistics for particles with arbitrary half-integral spin; from postulate (II), according to which observables on different space-time points with a spaceJike distance are commutable, the necessity ol Einstein-Bose statistics for particles with arbitrary integral spin. It has been found useful to divide the quantities which are irreducible against Lorentz transformations into four symmetry classeswhich have a commutable multiplication like *l, - 1' * e. - e with e2: 1.
(e.g. Sr:iSo, the digit 4 among the i, k, "' Sr*: lSo*). the requirements of the relativity QINCE Dirac's spinors u rwith p : 1,''', 4have always u theory and the quantum theory are fundaa Greek index running from 1 to 4, and uoa mental for every theory, it is natural to use as means the complex-conjugate of ao in the ordiunits the vacuum velocity of light c, and Planck's nary sense. constant divided by 2zr which we shall simply Wave functions, insofar as they are ordinary denote by Z. This convention means that all vectors or tensors, are denoted in general with quantities are brought to the dimension of the capital letters, [J;, (J;n.... The symmetry charpower of a length by multiplication with powers acter of these tensors must in general be added of. h and,c. The reciprocal length corresponding explicitly: As classical fields the electromagnetic to the rest mass rn is denoted by r:mc/lt'. and the gravipational fields, as well as fields with As time coordinate we use accordingly the rest mass zero, rtake a special place, and are length of the light path. In specific cases,how- therefore denoted with ,the usual letters pi, ever, we do not wish to give up the u.se of f ;p: -fm, arrd gik: gki, respectively. the imaginary time coordinate. Accordingly, a tensor I;r is so deThe energy-momentum tensor index denoted by small Latin letters r, fined, that the endrgy-density trZ and the morefers to the imaginary time coordinate and mentum density G7,are given in natural units by runs from 1 to 4. A dpecial convention for de- W : - T a r a n d ' G * : - i T * a w i t h k : 1 , 2 , 3 .
51. Uxrrs AND NorATroNs
noting the complex conjugate seems desirable. Whereas for quantities with the index 0 an asterisk signifies the complex-conjugate in the ordinary sense(e.g., for the current vector Si the quantity Sox is the complex conjugate of the charge density Sn), in general U*;*...signifies: the complex-conjugateof Ur*... multiplied with (-1)", where z is the number of occurrencesof
TENSoRS. DsrrNrrrox
$2. InnenucralE
oF SPINs
We shall use only a few general properties of those quantities which transform according to irreducible representations of the Lorettz group.2 The proper Lorentz group is that continuous linear group the transformations of which leave the form 1
I This paper is part of a report which was prepared by the
Lx*':x'-xo'
..tr'Jri5iiil'"'"s5il;;'i;;;;" -since "been 1e3eand in whichslight
imorovements have made. In view of lhe uniavorable times, the Congress did not take place, and publication of the reports has. been postponed for an the indehnite lengtl of time. The -retation befreen the pr.esent discussion of the connection between spin and statistics, and the somewhat less general one of Belinfante, based on the concept of charge invariance, has been cleared up by W. Pauli and F. J. Belinfante, Physica 7, L77 (1940).
k:r : rnvariant
and
in
condition
that
they
addition
to
have
the
that
satisfy
determinant
the *
1
lSee B. L. v. d. Waerden, Die gruppentheoreLische Method.einiler Qaantentheorie (Berlin, 1932).
716
373 SPIN
I Ll
AND
and do not reverse the time. A tensor or spinor which transforms irreducibly under this group can be characterized by two integral positive numbers (2, q). (The corresponding "angular momentum quantum numbers" (j, ft) are then given by P:2j+1, S:2k+1, with integral or half-integral j and, k.)* The quantity U(j, fr) characterized bv U, k) has p.q:(2j+l)(2k+l) independentcomponents.Hence to (0,0) corresponds the scalar, to (+, +) the vector, to (1,0) the self-dual skew-symmetricaltensor, to (1, 1) the symmetrical tensor with vanishing spur, etc. Dirac's spinor zo reduces to two irreducible quantities (],0) and (0, !) each of which consists of two components. lf U(j, &) transforms according to the representation
ui:
(2i+1\ (2k+1)
E
L,"u",
l:1
then U*(k, j) transforms according to the complex-conjugate representation rl*, Thus for k:j, .{*:d. This is true only if the components of UQ,k) and U(k, j) are suitably ordered.For an arbitrary choice of the components, a similarity transformation of ,t and A* would have to be added. In view of.$1 we represent generally with U* the quantity the transformation of which is equivalent to A* if the transformation of U is equivalent to A. The most important operation is the reduction of the product of two quantities U{jt
k').Uz(j,, kz)
which, according to the well-known rule of the composition of angular momenta, decomposeinto several I/(j, &) where, independently of each otherj, & run through the values ' ' ', lj'-j,l j:jrIjr, jtljz-r, k:krIkz, kt*kz-1, . . ., lkr-krl. By limiting the transformations to the subgroup of space rotations alone, the distinction between the two numbersj and I disappearsand U(j,k) behaves under this group just like the product of two irreducible quantities U(j)U(k) which in turn reduces into several irreducible *. In the spinor.calculus this is a spinor with 2j-undotted and 2& dotted indices.
STATISTICS
U(l') each having 2llt
components, with
t:j+k,j+k-r, ..., li-kl. Under the space rotations the t/(l) with integral I transform according to single-valued representation, whereas those with half-integral I transform according to double-valued representations.Thus the unreducedquantities T(j , k) with integral (half-integral) j*& are singlevalued (double-valued). If we now want to deterrnine the spin value of the particles which belong to a given field it seemsat first that these are given by l:j+n. Such a definition would, however. not correspond to the physical facts, for there then exists no relation of the spin value with the number of independent plane waves, whieh are possible in the absenceof inteiaction) for given valuesof the components &r in the phase factor exp i(kx). In order to define the spin .in an appropriate fashion.s we want to consider first the case in which the rest mass m of all the particles is different from zero. In this case we make a transformation to the rest system of the particle, where all the .spacecomponents of. k; are zero, and the wave function dependsonly on the time. In this system we reduce the field components, which according to the field equations do not necessarilyvanish, into parts irreducible against spacerotations.To eachsuchpart, with r: 2s* 1 componentsl belong r different eigenfunctions which under space rotations transform among themselves and which belong to a particle with spin s. If the field equations describe particles with only one spin value there then exists in the rest systerri only one such irreducible group of components. From the Lorentz invariance, it follows, for an arbitrary system of reference,that r or lr eigenfunctions always belong to a given arbitrary h. The number of quantities Uff, fr) which enter the theory is, however, in a general coordinate system more complicated, since these quantities together with the vector &i have to satisfy several conditions. In the case of zero rest mass there is a special degeneracybecause,as has been shown by Fierz, this casepermits a gauge transformation oi the sSee M. Fierz, Helv. Phys. Acta 12, 3 (1939); also L. de Broglie, Comptes rendus 208, 1697 (l%9):209, 265 (193e).
374 W.
PAULI
718
We divide the quantities U into two classes: (1) the "*1 class" with j integral, fr integral; (2) the "-1 class" with j half-integral, F halfintegral. The notation is justified because, according to the indicated rules about the reduction of a product into the irreducible constituents under the Lorentz group, the product of two quantities point of view is concerned because the total of the *1 class or two quantities of the -1 angular momentum of the field cannot be divided class contains only quantities of the *1 class, by up into orbital and spin angular momentum whereas the product of a quantity of the i1 measurements. But it is possible to use the class with a quantity of the - 1 class contains following property for a definition of the spin. -1 class. It is important If we consider, in the q number theory, states only quantities of the conjugate I/* for whichj and & complex that the not present, all then where only one particle is are interchanged belong to the same class as [/. of the square of the the eigenvalues j(j*l) angular momentum are possible. But j begins As can be seen easily from the multiplication rule, tensors with even (odd) number of indices reduce with a certain minimum value s and takes then the values s, s*1, " "a This is only the case only to quantities of the *1 class (-1 class). i s n o t p o s s i b l e The propagation vector ftr we consider as befor m:0. For photons, s:1;i:0 Ionging to the -1 class, since it behaves a{ter for one single photon.s For gravitational quanta s : 2 a n d t h e v a l u e s j : Q a n d j : 1 d o n o t o c c u r . multiplication with other quantities like a quantity of the - 1 class. In an arbitrary system of reference and for We consider now a homogeneous and linear arbitrary rest masses, the quantities U all of equation in the quantities [/ which, however, which transform according to double-valued does not necessarily have to be of the Iirst order. (single-valued) representations with half -integral Assuming a plane wave, we may put ft1 for (integral) j*ft describe only particles with half-i0/3xr Solely on account of the invariance integral (integral) spin. A special investigation is the proper Lorentz group it must be of against decide required only when it is necessary to
second kind.* If the field now describes only one kind of particle with the rest mass zero and a certain spin value, then there are for a given value of kr. only trvo states, which cannot be transformed into each other by a gauge transformation. The definition of spin may, in this case, not be determined so far as the physical
whether the theory describes particles with one single spin value or with several spin values' $3. Pnoor oF THE INoBrtNttB Cnan-lcrnn op firB Cnenco rN Clse oF INrEcnel aNo OF THE ENENCY IN CASE OF SpIN Heln-INrrcnel We consider first a theory which contains only I/ with integral j-lk, i.e., which describes particles with integral spins only. It is not assumed that only particles with one single spin value will be described, but all particles shall have
the typical form
(1) LVU-:LU+. Lku+:DU-, This typicalform shallmeanthat theremay be as many different terms of the same type present, as there are quantities [/+ and U-. Furthermore, among the (J+ may occur the [/+ as well as the ([/+)*, whereas other [/ may satisfy reality conditions [/: U*..Finally we have omitted an eaen number of & factors. These may be present in arbitrary number in the term of the sum on the left- or right-hand side of these equations' It is now evident that these equations remain in-
integral spin. variant under the substitution -Ei-*u*"-transformal - t r a n s f o r m a t i o nion o[ the frrstkind" rveunderU- { ct" U+* U*c-i" wirh an s t a n d -a [J++(J+, ht+-k;., [(t/+)*+(I/+)*]; arbitrarvspaceandtime functiona. By "gauge-transformaU-+-U-, l(U-)*--(U-)*1. tion of ihe secondkind" we understanda transformation
(2)
of the tYPe l.da ,pr-et--r6it a s f o r t h o s e o f t h e e l e c t r o m a g n e t i cp o t e n l i a l s a T h e g e n e r a lp r o o f f o r t h i s h a s b e e n g i v e n b y X [ . F i e r z . Helv. Phvs. Acta r3, 45 (1940). 6 See for instance W. Pauli in the article "Wellenmechanik" in the Handbuch der Phvsik, Yol, 24/2, p. 26O'
Let us consider now tensors I of even rank (scalars, skew-symmetrical or symmetrical tensors of the 2nd rank, etc.), which are composed quadratically or bilinearly of the U's. They are then composed solely of quantities with even j
375 719
SPIN
AND
STATISTICS
and even ft and thus are of the typical form
T-LU+U++LU-U-a2Lr+ku-,
(3)
where again a possible even number of & factors is omitted and no distinction between U and U* is made. Under the substitution (2) they remain unchanged, T+I. The situation is different for tensors of odd rank S (vectors, etc.) which consist of quantities with half-integral j and half-integral b. These are of the typical form
s_uu+ku++Lu_ku_+LU_ (4) and hence change the sign under the substitution (2), S+-S. Particularly is this the case for the current vector sd.To the transformation k;+-h; belongs for arbitrary wave packets the transformation xi--xi and it is remarkable that from the invariance of Eq. (1) against the proper Lorentz grolp alone there follows an invariance property for the change of sign of all the coordinates. In particular, the indefinite character of the current density and the total charge for even spin follows, since to every solution of the field equations belongs another solution for which the components of s6 change their sign. The definition of a definite particle density for even spin which transforms like the 4-corfponent of a vector is therefore impossible. We now proceed to a discussion of the somewhat less simple case of half-integral spins. Here we divide the quantities [/, which have half-integral j-tk, in the following fashion: (3) the "*e class" with j integral A half-integral, (4) the " - e class" withj half-integral A integral. The multiplication of the classes (1), . ' ., (4), follows from the rule e2:1 and the commutability of the multiplication. This law remains unchanged if e is replaced by -.. We can summarize the multiplication law between the different classes in the followins multiplication table:
We notice that these classeshave the multiplication law of Klein's "four-group." It is important that here the complex-conjugate quantities for whichj and & are interchanged do not belong to the same class, so that U+', (U-e)* belong to the f e class -e class. U-,, 1U+"1x We shall therefore cite the complex-conjugate quantities explicitly. (One could even choose the [/+' suitably so that oll quantities of the - e class are of the form (U+")*.) Instead of (1) we obtain now as typical form
.)*: I u-.+t( u*r* Lhu+,+Lh(u
Lku,+Lh(u1,)* :lu+,ll1t1-'7*,
(r.)
since a factor k or -i6/0x always changes the expression from one of the classes *e or -e into the other. As above, an even number of & factors have been omitted. Now we consider instead of (2) the substitution ht--k;; u1c+iu+e. (u+.)*+-i(ll+c)*.
(u-,)*+i(u,)*. .. u,--iLI
,t\ \u/
This is in accord with the algebraic requirement of the passing over to the complex conjugate, as well as with the requirement that quantities of the same class as U+,, (tl-e)* transform in the same way. Furthermore, it does not interfere with possible reality condftions of the type U+.:(U-,)* or U-,: ([/+.)*. Equations (5) remain unchanged under the substitution (6). \\'e consider again tensors of even rank (scalars, tensors of 2nd rank, etc.), which are composed bilinearly or quadratically of the I/ and their complex-conjugate. For reasons similar to the above they must be of the form
T -LU+, U+,+LU-, U-"+DU+,kU-,+LU+.( U-r*+ t U,(U+")x a2(U-,)*ku-,
+L(U+,)*ku+,*l(u-,sxp1u+r*+ u-,)*+>(u+r*(u*r*. a) I ({/-.)*(
Furthermore, the tensors of odd rank (vectors, etc.) must be of the form
x1 2 u-,(g-,sx s -, u+,ku+"+ L u-,ku-,+ L u+"u-,+ L u+,k(u-,)*+ L u-,k(u+,1 + >.u+,(u+,)tt + L (u-,)*k(u-,)** L tu*,1*,(u+,)*+ r, (u-, *( u+.)*. (s)
376 W.
PAUL I
720
The result oJ the substihr.tion (6) is now the op|osite oJ the result of the substitution (2): the tensors of even rank change thei,r sign., the tensors oJ od.drank remain unchanged:
is an indication that a satisfactory interpretation of the theory within the limits of the one-body problem is not possible.* In fact, all relativistically invariant theories lead to particles, which in external fields can be emitted and absorbed in T+-T:; (9) S+fS. pairs of opposite charge for electrical particles In case of half-integral spin, therefore, a and singly for neutral particles. The fields must, positive definite energy density, as well as a therefore, undergo a second quantization. For positive definite total energy, is impossible. The this we do not wish to apply here the canonical latter follows from the fact, that, under the above formalism, in which time is unnecessarilysharply substitution, the energy density in every space- distinguished from space, dnd which is only time point changes its sign as a result of which suitable if there are no supplementary conditions between the canonical variables.? Instead, we the total energy changesalso its sign. It may be emphasized that it was not only shall apply here a generalization of this method unnecessary to assume that the wave equation which was applied for the first time by Jordan is of the first order,* but also that the question is and Pauli to the electromagnetic field.8 This left open whether the theory is also invariant method is especially convenient in the absence with respectto spacereflections (x/ : - x, s6l : aq). of interaction, where all fields (/(") satisfy the This scheme covers therefore also Dirac's two wave equation of the second order component wave equations (with rest masszero). nui, - K2u(,r:0, These considerations do not Drove that for integral spins there always exists a definite where energy density and for half-integral spins a lAzA2 definite charge density. In fact, it has been shown _:a__ n : t by Fierz6 that this is not the case for spin )1 b:r atrk2 6xo2 for the densities. Th6re exists, however (in the c number theory), a definite total charge for half- and r is the rest -u., .lf th" particles in rrnitshfc. integral spins and a definite total energy for the An important tool for the secondquantization integral spins. The spin value ] is discriminated is the invariant D function, which satisfies the through the possibility of a definite charge wave equation (9) and is given in a periodicity density, and the spin values 0 and 1 are dis- volume I/ of the eigenfunctions by criminated through the possibility of defining a definite energy density. Nevertheless,the present 1 sin &orn D(x, xJ:--l exp [i(kx)] -----_-i (10) theory permits arbitrary values of the spin vho quantum nrrmbersof elementaryparticlesas well as arbitrary valuesof the rest mass, the electric or in the limit 7+charge, and the magnetic moments of the particles. I r sin A^r^ D(x, xo): _ * | d3&exp [i(br)]- : (11) (2r)BJ rnB oF Frer.os rN TnB Aeho $4. QuewrrzarroN sENcEoF INrBnecrroNs. CoNNpcrroN * The author therefore considers BBrwBsN SprN anp Starrsrrcs as not conclusive the The impossibility of defining in a physically satisfactory way the particle density in the case of integral spin and the energy density in the caseof half-integralspinsin the c-numbertheory * But we exclude operations like (b2+K2)r, which operate at finite distances in the coordinate space, 6 M. Fierz, Helv. Phys. Acta 12,3 (1939).
original argument of.Dirjc, according to which the fleld equatlon must be ol the lirst order. 7 On account of the existence of such conditions the canonical formalism is not applicable for spin )1 and tnerelore tne dtscusslon about the connection between spin and statistics by J. S. de Wet, Phvs. R.". Si. O+6 (19,1O),which is based on that formalism is not general enough. 8 The consistent development of this method leads to the "many-time formalism" of Dirac, which has been given.by..P. A..M..Dirac, Quantun Mechanics (Oxford, second edition. 1935)-
377
72r
SPIN
AND
STATISTICS
Ry &6 we understand the positive root hs:l(k2lx2)\.
In
(r2)
D(x,0):9;
it
follows
114 D{x, xn):- - -FJ.r, xi 4rr 0r
The D function is uniquely determined by the conditions: llD-rczD:0;
general
f o r x o > .r lNoli(t{or-rr)+) F J r, x o): 1 - iH o\I)li K(rz- x o2 )',f f o r r l x o > . - r for -rlxo. llroJ"l*oL r=;+;
(#)^=":'o,(13)
(18)
Here -ly'ostands for Neumann's function and For r:0 we have simply .I1o(Dfor the first Hankel cylinder function. The strongest singularity of D, on the surface of the D ( x , * o ): { 6 ( r - r ) - 6 ( r * x o ) l / 4 r r . ( 1 4 ) light cone is in general determined by (17). We shall, however, expressively postulate in This expressionalso determines the singularity the following that al,l physical, quantities ot f,nite of.D(x, 16) on the light cone for x{0. But in d,istances exterior to the light cone (for l*o'-xo" l the latter caseD is no longer different from zero l lx' -x" l) are commutaDle.* It follows from this in the inner part of the cone. One finds for this that the bracket expressions of all quantities regione which satisfy the force-free wave equation (9) can be expressed by the function D and (a finite 1A -F(r, xo) D(x, xo) number) of derivatives of it rvithout using the 4nr 3r function Dr. This is also true for brackets with with the f sign, since otherwise it would follow that gauge invariant quantities, which are constructed -rr)+) for xolr lJol|(xo, I bilinearly from the UG>, as for example the F(r,ro):l 0 f - o r ) x o > -. r i ( t S ) charge density, are noncommutable in two points l- Jolr(*or-rr1+1 l o r - r > x & l with a space-like distance.lo The jump from * to - of the function F on The justification for our postulate lies in the the light cone corresponds to the d singularity of fact that measurements at two space points rvith D on this cone. For the following it will be of a space-like distance can never disturb each decisive importance that D vanish in the exterior other, since no signals can be transmitted \,vith -r). of the cone (i.e., for rlxo) velocities greater than that of light. Theories The form of the factor d7k/kois determined by which would make use of the Dr function in the fact that dak/ko is invariant on the hypertheir quantization would be very much different boloid (i) of the four-dimensional motrrentum from the known theories in their consequences. space (k, fr). It is for this reason that, apart At once we are able to draw further conclusions from D, there exists just one more function about the number of derivatives of .D function which is invariant and which satisfies the wave which can occur in the bracket expressior.rs,if we equation (9), namely, take into account the invariance of the theories under the transformations of the restricted 7 cos froro f Lorentz group and if we use the results of the Dtk. xo): _ (t6) | d 3 f te x -p [ ; ( k x ) ] - . preceding section on the class division of the (2r)3 J ko tensors. We assume the quantities [/(o to be For r:0 one finds ordered in such a way that each lield component is composed only of quantities of the same class. 11 Dr(x, ro) (17) * For the canonical quantization formalism this postulate 2r2 f2- rs2 s S e e P , A. N{, Dirac. Proc. Camb. Phil Soc. 30, 150 (1934).
is satisfied implicitly. But this postulate is much more general than the canonical {ormalism. r 0 S e eW . P a u l i , A n n . d e l ' I n s t . H . P o i n c a r 66 , 1 3 7 ( 1 9 3 6 ) , esp. $3.
378
w.
PAULI
We consider especially the bracket expression of a field component [/(") with its own comptex conjugate f U r ' > ( x ' , x n ' ), U * G )( x t ' , x n " ) f . We distinguish no'rv the two cases of half-integral and integral spin. In the former case this expression transforms according to (8) under Lorentz transformations as a tensor of odd rank. In the second case, however, it transforms as a tensor of even rank. Hence we have for halfintegral spin IUG) (x' , xst), U*t') (xtt , xt")l :odd number of derivatives of the function (19a) D(x' *x" , xo'- ro") and similarly for integral spin fUr't(xt, xst), U*t)(xtt, xstt)l :even number of derivatives of the function (19b) D(x'-x", xot -rcott). This must be understood in such a way that on the right-hand side there may occur a complicated sum of expressions of the type indicated. We consider now the following expression, which is symmetrical in the two points t, t)l X : l U t >( x ' , x n t ) , U * { , t ( x t x s t
-flUo)(x", xn"), U*{')(x',xn')f. (20)
Since the D function is even in the space coordinates odd in the time coordinate, which can be seen at once from Eqs. (11) or (15), it follows number from the symmetry of X that X:even of space-like times odd numbers of timelike derivatives of D(x' -x" , xyt - K1t'). This is fully consistent with the postulate (19a) for halfintegral spin, but in contradiction with (19b) for integral spin unless X vanishes. We have therefore the result lor inlegral spin t)f ) l(l(,) (x', xn'), U*t (x", xo' l l U t > ( x t t , x o t t ) ,U * k ( x ' , r i ) ] : 0 .
(21)
So far we have not distinguished between the two cases of Bose statistics and the exclusion principle. In the former case, one has the ordinary bracket with the - sign, in the latter case,
722
according to Jordan and Wigner, the bracket
lA, s1:a313a with the * sign. By insertiag the brackets with the ! sign into (20) tae haztean algebraic contrad'iction, since the left-hand side is essentially positive for x':x" and cannot vanish unless both Uc) and, I/x(') vanish.* Hence we come to the result: For integral' spin the quantization according to lhe excl'usion princ'iple is not possible. For lhis result,it'is essential, thatr the use oJ the DlJunction in place of the D Junct'ion be, for general reasons, d'iscarded. On the other hand, it is formally possible to quantize the theory for half-integral spins according to Einstein-Bose-statistics, but according to the general result of the preceding section the energy of the system would not be posiLiae. Since for physical reasons it is necessaiy to postulate this, we must apply the exclusion principle in connection with Dirac's hole theory. For the positive proof that a theory with a positive total energy is possible by quantization according lo Bose-statistics(exclusionprinciple) for integral (half-integral) spins, we must refer to the already mentioned paper by Fierz. In another paper by Fierz and Paulill the case of an external electromagnetic field and also the connection between the special case of spin 2 and the gravitational theory of Einstein has been discussed. In conclusion we wish to state, that according to our opinion the connection between spin and statistics is one of the most important applications of the special relativity
theory.
* This contradiction may be seen also by resolving I/t": into eigenvibrations according to u*c) (x, f,0): v-, 2 r I u +* (k) exp [t J - (kT) +[or,o | ]. * U-(F) exP Li { (kx) - }oro I J I L I c ) ( x , x 0 ): V - t > k l'U + ( h ) e x p [ i l ( k x ) - ] o x o ] l
417-*(llexo[;{ -(kxt+froro}]}.
The equation (21) leads then, among others, to the relation
I I/**([), U+(k)f+ lu -(b), u-t(fr) ] = 0, a relation, which is not possible for brackets with the * sign unless U+(k) and, U+*(E) vanish. lM. Fierz and W. Pauli, Proc. Roy. Sc. Ar73, 211 (1939).
-452
PHYSICS: J. SCHWINGER
PROC. N. A. S.
ON THE GREEN'S FUNCTIONS OF QUANTIZED FIELDS. I By JULIAN SCHWINGER HARVARD UNIVERSITY
Communicated May 22, 1951
The temporal development of quantized fields, in its particle aspect, is described by propagation functions, or Green's functions. The construction of these functions for coupled fields is usually considered from the viewpoint of perturbation theory. Although the latter may be resorted to for detailed calculations, it is desirable to avoid founding the formal theory of the Green's functions on the restricted basis provided by the assumption of expandability in powers of coupling constants. These notes are a preliminary -account of a general theory of Green's functions, in which the defining property is taken to be the representation of the fields of prescribed sources. We employ a quantum dynamical principle for fields which has been described elsewhere.1 This principle is a differential characterization of the function that produces a transformation from eigenvalues of a complete set of commuting operators on one space-like surface to eigenvalues of another set on a different surface,2
(rl', (T1jr2, 02)
i(r1' 711 afUl (dX) -CI 2,p 0'2)
(1) Here £ is the Lagrange function operator of the system. For the example of coupled Dirac and Maxwell fields, with external sources for each field, the Lagrange function may be taken as £ = -..1/4[P, 'Y;(-ip - eA>)P + m+/] + 1/2[4, 'i] + Herm. conj. + 1/4F,P2 - 1/4{Fp,, )A, - 6A} + J,A,X, (2) which implies the equations of motion 'Y;&(-ib - eA,u)# + mi = 71. = J, +ji,, F,, = 6g.A4 - ,A,;, (3) =
where
j;&
e'/2[l, y4]. (4) With regard to commutation relations, we need only note the anticommutativity of the source spinors with the Dirac field components. We shall restrict our attention to changes in the transformation function that arise from variations of the external sources. In terms of the notation =
(r1', 'l Ir2 , '2)
(
= exp iW,
al'I!F(x) |2, '2)/(r1', '711
2,
'2)
=
(F(x)),
(5)
PHYSICS: J. SCHWINGER
VOL. 37, 1951
453
the dynamical principle can then be written bW = j;`°
(dx)(bc(x)),
(6)
where = (k(X))5ii(X) + "(x)(#(x)) + (A,(x))6Jp(x). The effect of a second, independent variation is described by
l(b2(x))
=
i .J '
(7)
(dx') [((5 e(x)6' e(x')) +) - (5(x))(5'.e(x'))], (8)
in which the notation ( )+ indicates temporal ordering of the operators. As examples we have
6v(+(*
))=
02j, (dx') [((#(x)7;(x')5n(x')) +)-(t(x))(;(x')5v(x'))], (9)
and
bj(o(x)) = i J:" (dx')[(4,(x)A,(x'))+)
-
(4/(x))(A;(x'))]5J,(x'). (10)
The latter result can be expressed in the notation
although one may supplement the right side with an arbitrary gradient. This consequence of the charge conservation condition, 6AJ;, = 0, corresponds to the gauge invariance of the theory. A Green's function for the Dirac field, in the absence of an actual spinor souree, is defined by = (dx') G(x, x')56(x'). J,0
(12)
According to (9), and the anticommutativity of 65(x') with 4'(x), we have G(x, x') = i((4,/(x)i(x'))+)E(x, x'), (13) . On combining the differential where E(x, x') = (xo - xo')/ xo equation for (y6(x)) with (11), we obtain the functional differential equation -e(A,.(x)) + ieb/,Js(x)) + m]G(x, x') = 6(x - x'). (14) An accompanying equation for (A,(x)) is obtained by noting that (15) (j,(x)) = ie tr 'y,sG(x, x')x' x, in which the trace refers to the spinor indices, and an average is to be taken of the forms obtained with xo' -- xo h 0. Thus, with the special choice of gauge, b6(Av(x)) = 0, we have -62(A (x)) = J,(x) + ie tr y;,G(x, x). (16) The simultaneous equations (14) and (16) provide a rigorous description of G(x, x') and (A,(x)).
xo'l
PHYSICS: J. SCHWINGER
454
PROC. N. A. S.
A Maxwell field Green's function is defined by &Pv(x, x') = (8/bJ(x'))(Ap(x)) = (515J=(x))(A(x'))
i[((A,(x)A (x'))+)
-
(A;(x))(A ,(x'))]. (17)
The differential equations obtained from (16) and the gauge condition are + ie tr 'y(6/5J,(x'))G(x, x), -b)2S;,(x, x') = (x-x) bA9;v(x, x') = 0 (= 6'x). (18) More complicated Green's functions can be discussed in an analogous manner. The Dirac field Green's function defined by
5,72((jt(XI) t(X2)) +) e (XI X2),v
= 0
=
J91((dxl) ,20/" (dx2')G(xl, X2; XI', X21)5V(XI')5j((X2%) (19) may be called a "two-particle" Green's function, as distinguished from the "one-particle" G(x, x'). It is given explicitly by
G(xi, x2; xI', x2') = ((4(x1)4#(x2){(x1')l(x2'))+) E, e(xI, X2)E(XI', X2')E(Xl, xI')E(xI, x2')e(x2, Xi')e(X2, x2') (20) This function is antisymmetrical with respect to the interchange of xi and X2, and of xi' and x2' (including the suppressed spinor indices). It obeys the differential equation W G(x1, x2; xl', X2') = 6(x - xi')G(x2, X2') - (xl -x2')G(x2, x1'), (21) where 0 is the functional differential operator of (14). More symmetrically written, this equation reads e
=
i 1a2G(xi, x2; xi', x2')
=
5(xi - xl')(x2 - x2')-
6(xi - x2')6(x2- xi'), (22) in which the two differential operators are commutative. The replacement of the Dirac field by a Kemmer field involves alterations beyond those implied by the change in statistics. Not all components of the Kemmer field are dynamically independent. Thus, if 0 refers to some arbitrary time-like direction, we have m(l - #02)4, = (1 - #02)rt - Pkk(-i2) - eAk) #o2#, k= 1,2,3, (23) which is an equation of constraint expressing (1 - #o2)4, in terms of the independent field components 13o2#, and of the external source. Accordingly, in computing 5,(4,(x)) we must take into account the change induced in (1 -,o2), (x), whence G(x, x') = i((4,(x)1(x'))+) + (1/m)(1 - #02)5(x - x'). (24) The temporal ordering is with respect to the arbitrary time-like direction.
PHYSICS: J. SCHWINGER
VOL. 37, 1951
455
The Green's function is independent of this direction, however, and satisfies equations which are of the same form as (14) and (16), save for a sign change in the last term of the latter equation which arises from the different statistics associated with the integral spin field. 1 Schwinger, J., Phys. Rev., June 15, 1951 issue. 2 We employ units in which h = c = 1.
ON THE GREEN'S FUNCTIONS OF QUANTIZED FIELDS. II By JULIAN SCHWINGER HARVARD UNIVERSITY
Communicated May 22, 1951
In all of the work of the preceding note there has been no explicit reference to the particular states on 01 and 01 that enter in the definitions of the Green's functions. This information must be contained in boundary conditions that supplement the differential equations. We shall determine these boundary conditions for the Green's functions associated with vacuum states on both o1 and a2. The vacuum, as the lowest energy state of the system, can be defined only if, in the neighborhood of a1 and U2, the actual external electromagnetic field is constant in some time-like direction (which need not be the same for a1 and a2). In the Dirac one-
particle Green's function, for example, G(x, x') = i(+i(x);(x')), xo > xo', = -i(4(x') A(x)), xo < xo', (25) the temporal variation of +1(x) in the vicinity of o1 can then be represented by
(26) exp [iPo(xo - Xo)]4I(X) exp [-iPo(xo - Xo)], where Po is the energy operator and X is some fixed point. Therefore, x -- or: G(x, x') = i(4/(X) exp [-i(Po - Povac)(xo - Xo)];(x')), (27)
O6(x)
=
in which Povac is the vacuum energy eigenvalue. Now PO -Povac has no negative eigenvalues, and accordingly G(x, x'), as a function of xo in the vicinity of a,, contains only positive frequencies, which are energy values for states of unit positive charge. The statement is true of every timelike direction, if the external field vanishes in this neighborhood. A representation similar to (26) for the vicinity of 01 yields X --
02: G(x, x')
=
-i( (x') exp [i(Po - PoV")(xo - Xo)]1(X)), (28)
456
456 PHYSICS: J. SCHWINGER
PROC. N. A. S.
which contains only negative frequencies. In absolute value, these are the energies of unit negative charge states. We thus encounter Green's functions that obey the temporal analog of the boundary condition characteristic of a source radiating into space.' In keeping with this analogy, such Green's functions can be derived from a retarded proper time Green's function by a Fourier decomposition with respect to the mass. The boundary condition that characterizes the Green's functions associated with vacuum states on a, and a2 involves these -surfaces only to the extent that they must be in the region of outgoing waves. Accordingly, the domain of these functions may conveniently be taken as the entire four-dimensional space. Thus, if the Green's function G+(x, x'), defined by (14), (16), and the outgoing wave boundary condition, is represented by the integro-differential equation, yA(-ib - eA+,u(x))G+(x, x') + (29) f(dx')M(x, x")G+(x', x') = 6(x - x'), the integration is to be extended over all space-time. This equation can be more compactly written as ['y(p - eA +) + M]G+ = 1, (30) by regarding the space-time coordinates as matrix indices. The mass operator M is then symbolically defined by MG+ = mG+ + iery(S/8J)G+. (31) In these formulae, A + and 8/1J are considered to be diagonal matrices, (xl A +,, x') = 6(x - x')A4+4(x). (32) There is some advantage, however, in introducing "photon coordinates" explicitly (while continuing to employ matrix notation for the "particle coordinates"). Thus
-jA + J(d{),yQ)A +(t),
(33)
where -y() is defined by
(x,Yr(;)jx')
=
yp5(x -
)(x - x').
(34)
The differential equation for A +(t) can then be written -
t2A +(t)
=
J(Q)
+ ie Tr [y(t)G+],
(35)
where Tr denotes diagonal summation with respect to spinor indices and particle coordinates. The associated photon Green's function differential equation is
-at2q+(t, {')
=
(- ') + ie Tr [-y()(5/6J(t'))G+]-
(36)
PHYSICS: J. SCHWINGER
VOL. 37, 1951
457
To express the variational derivatives that occur in (31) and (36) we introduce an auxiliary quantity defined by
r(a)
=
-(61beA +Q))G+-I - (5/beA+(Q))M.
(37)
ef (d{')G+F(t')G+S+(t' t),
(38)
a()
=
Thus
(6/5J(Q))G+
=
from which we obtain
M= m+
ielf(d))(dt'),yQ)G+rw)9+Q1, 0,
(39)
and
-aZ29+Q, {')
+
f(d t)P( , `)
+(t` ') = 6( P(t, 0') = -je2 Tr ['(y)G+r(Q')G+] (40)
With the introduction of matrix notation for the photon coordinates, this Green's function equation becomes
(k2 + P)9+
= 1,
[,, k^] =
(41)
i
and the polarization operator P is given by P = -ie2 Tr [yG+rG+].
(42)
In this notation, the mass operator expression reads
M = m + ie2 Tp [yG+rS+],
(43)
where Tp denotes diagonal summation with respect to the photon coordinates, including the vector indices. The two-particle Green's function
G+(xi, x2; xl', x2')
=
(xi, x2| G121 xl', x2'),
(44)
can be represented by the integro-differential equation
[(Ylr + M)1(77r + M)2
= 112, (45) p - eA +, thereby introducing the interaction operator 112. The unit operator 112 is defined by the matrix representation 7r =
(X1, X21 1121 XI 1, X2) = (X1 - X1')6(X2
-
X2')
-
6(xl - X2')5(X2
-
x'). (46)
On comparison with (21) we find that the interaction operator can be characterized symbolically by
458
PROC. N. A. S.
PHYSICS: J. SCHWINGER
12G12 = -ie2 Tp[LYP2S+]GG12 -ie2 Tp[y1G,6/6J]1(112G12) - -ie2 Tp[Y2riFi+]G12 - ie2 Tp[y2G2i5/J] (I12G12), (47) where G1 and G2 are the one-particle Green's functions of the indicated particle coordinates. The various operators that enter in the Green's function equations, the mass operator M, the polarization operator P, the interaction operator 112, can be constructed by successive approximation. Thus, in the first approximation, M(x, x') = mb(x - x') + ie2ey,GG+(x, x'),y,DD+(x, x'), PMV(R, i') = -ie2 tr[-yMG+(Q, i')-yvG+(t', c)], I(xb, x2; X1', X2') = -ie2y,y2,.D+(xl, X2) (X1, x21 1121 Xl', X2'), (48) where
9;AV Q, 0'
=
6,D +(,i)
(49)
and the Green's functions that appear in these formulae refer to the 0th approximation (M = m, P = 0). We also have, in the first approximation,
FJ(t; x, x') = 'yJA(t - x)6(x - x') -ie2y,,G+(x, t),yG+Q, x')'ypD+(x, x')
(50)
Perturbation theory, as applied in this manner, must not be confused with the expansion of the Green's functions in powers of the charge. The latter procedure is restricted to the treatment of scattering problems. The solutions of the homogeneous Green's function equations constitute the wave functions that describe the various states of the system. Thus, we have the one-particle wave equation
(51)
(,rr + M)# = 0, and the two particle wave equation
[Q(y7r + M)y(77r + M)2 - 112h'12
=
0,
(52)
which are applicable equally to the discussion of scattering and to the properties of bound states. In particular, the total energy and momentum eigenfunctions of two particles in isolated interaction are obtained as the solutions of (52) which are eigenfunctions for a common displacement of the two space-time coordinates. It is necessary to recognize, however, that the mass operator, for example, can be largely represented in its effect by an alteration in the mass constant and by a scale change of the Green's function. Similarly, the major effect of the polarization operator is to multiply the photon Green's function by a factor, which everywhere appears associated with the charge. It is only after these renormaliza-
VOL. 37, 1951
ZOOLOG Y: ENGSTROM A ND R UCH
459
tions have been performed that we deal with wave equations that involve the empirical mass and charge, and are thus of immediate physical applicability. The details of this theory will be published elsewhere, in a series of articles entitled "The Theory of Quantized Fields." 1 Green's functions of this variety have been discussed by Stueckelberg, E. C. G., Helv. Phys. Acta, 19, 242 (1946), and by Feynman, R. P., Phys. Rev., 76, 749 (1949).
DISTRIBUTION OF MASS IN SALIVARY GLAND CHROMOSOMES By A. ENGSTROM* AND F. RuCHt DEPARTMENT FOR CELL RESEARCH, KAROLINSKA INSTITUTET STOCKHOLM
Communicated by C. W. Metz, May 15, 1951
The measurement of the absorption of soft x-rays, 8 to 12 A in wavelength, in biological structures makes it possible to determine the total mass (dry weight) per unit area of cytologically defined areas in a biological sample. Knowing the thickness of the sample or structure being analyzed the percentage of dry substance can be estimated. For theoretical and technical details see Engstrom' 1950. Dry substance is an accurate basis upon which to express the results obtained with other cytochemical techniques, e.g., the amount of specifically absorbing, ultra-violet or visible, substances. The present investigation is an attempt to determine the- dry weight (mass) of the different bands in the giant chromosomes from the cells in the larval salivary glands of the fly Chironomus. The structures to be observed, however, are just on the border of the resolving power of the x-ray technique for the determination of mass. The results reported, therefore, must be interpreted with care. The specimen intended for x-ray investigation is mounted on a collodion film circa 0.5 micron thick. This film supports the object in the sample holder, a brass disk with a slit about 6 mm. long and 0.5 mm. wide. In the first experiments salivary glands from Chironomus were isolated on a microscope slide and the chromosomes transferred to the thin carrier membrane on the sample holder. When examining the x-ray picture of these chromosomes no details at all could be seen due to shrinkage effects when the chromosomes were dried. For the x-ray determination of mass the specimens must be dried before they are introduced to the high vacuum of the x-ray tube. The water must also be taken away for another reason: The high absorption of soft
387
P o p e r3 2
Electrodynamic Displacement of Atomic Energy Levels. IIL The Hyperfine Structure of Positronium RoBERT KARPf,us etp Aerlsru Krr:rx H truud Unitersily, Caubrid,ge, .ll ussltchilset!s (Received NIay 13, 1952) integro-difierential equation for the electronA functional nositron Green's function is derived from a consideration of the effect of sources of the Dirac field. This equation contains aD electron-positron interaction operator from which functional deprocedure. The by an iteration rivatives may be eliminated ooerator is evaluated so as to include the efiects of one and t*'o virtual quanta, It contains an interaction resulting from quantum exchange as well as one resulting lrom virtual annihilation of the pair. The wave functions of the electron-positron system are the solutions of the homogeneous equation related to the Green's function equation. The eigenvalues of the total energy of the
I. INTRODUCTION
system may be found b). a four-dimensional perturbatiol teclL nique. The s1'stem bound bv the Coulomb interactjon is here treated as the unperturbed situation. Numerical values for tbe spin-dependent change of the energy from the Coulomb value in the ground state are finally obtained accurate to order a relative to the hyperfine structure d2 Rr'. The result for the singlet-triplet energy difference is LW n:
la2 Ry-17 /3-
(32/9 12 ln2)a/ ol:
2.0337X 105 Nfc/scc.
Theory and experiment are in agreement.
investigation to be describedin this paper I was suggestedby the current theoreticalinterest in the quantum-mechanical two-body probleml-3 and the recent accurate measurement of the ground state hyperfine structure of positronium.a b The system compbsed o{ one electron and one positron in interaction is the simplest accessible to calculation because it is purely electrodynamic in nature. Moreover, the success of quantum electrodynamics in predicting with great accuracy the properties of a singleparticle in an external field indicates the absence of fundamental difficulties from the theory in the range of energies that are significant in positronium. The discussionof the bound states'of the electrolpositron system is based upon a rigorous functional difierential equation {or the Green's function of that system, derived in Sec. II by the method described by Schwinger.I In order to obtain a useful approximate form of this equation (and of the associatedhomogeneous equation) we have iterated the implicitly defined interaction operator) in this way automatically generating to any required order the interaction kernel obtained from scattering considerations by Bethe and Salpeter.3 In the present case we have included all interaction terms involving the emission and absorption of one or two quanta. The latter include self-energyand vacuum polarization corrections to one-photon exchange processes as rvell as trvo-photon exchange terms. The particle-antiparticle relationship of electron and positron is represented by terms describing one- and twophoton virtual annihilation of the pair.FE In contrast
to the caseof scattering,only the irreduciblesinteractions appear explicitly. Our subsequentconcern is llith'the solution of the associatedhomogeneous equation.It should be enphasized at the outset that we shall be silent (out of ignorance)on the questionof the fundamentalinterpretation of a rvave function rvhich refers to individual timesfor eachof the particles.The possibility,nevertheless, of obtaining a solution to our problem entirely s'ithin the framework of the present formalism dependson two conditions.The first of theseis that most of the binding is accountedfor by the instantaneous Coulombinteraction.Salpetere has shownthat when the interaction is instantaneous,the rvaYeequation can be rigorouslyreducedto one involving only equal times for the trvo particles. trforeover, the rvave function for arbitrary individual time coordinates can be expresseci in terms of that for equal times. This last circumstance can alsobe exploitedin the der-elopmentof a perturbation theory x'hich yields the contribution to the energ)' levels of a small non-instantaneousinteraction.eThe relevant resultsof this treatment are siven in Sec.IIL The secondcondit ion is thaI I he freJpa rt icleapproximation for all intermediatestatesshall be an adequate one. Ihe essentialpoint here is that \yhether one derives an expiicit interaction operator by the iteration procedureadoptedin the presentpaper (tantamount to an expansion of the intrinsic nonlinearity in terms of free particle properties)or by a partial summationof a scattering kernel, the propagation rvhicl.r naturally enters in intermediate states is that of free particles. In the treatment of fine-structure effects, the contribu-
I L Schwinger,Proc.Nat. Acad. Sci. US 37,452,455 (1951). ,-lr. cell-Mann and F. Low. Phys. Rev.84,350 (1950. 3 H. A. Betheand E. E. Salpeter,Phys-Rev. 84,1232(1951). {M.DeutschandS.C.Brou'n,Phys.Rev.85,1047(1952). 6 M. Deutsch, latest result reported at the Washington Me€ting oI the American Physical Society, May, 1952. ?hys. Rev. 87, 212(T\ (1952). u j. lir"nn6, Arch. sci. phys. et nat. 28, 233 (1946);29, 121, 2O7,and 265 (1947).
? V. B. Berestetskiand L. D. Landau, Exptl. Theoret.Phys. J. (U.S.S.R.) (1949).Seealso V. d. Be;estetski,J. blxitl. 'fheoret. 19, 673 Phys. (U.S.S.R.)19, 1i30 (1949). 3 R . A . F e r r e l l ,P h y s , R e v . 8 4 , 8 5 8 ( 1 9 5 1 )a n d P h . D . t i r e s i s (Princeton, 1951). Dr. Ferrell kindly sent us a copy of his thesis. 'gE. E. Salpeter, Phys. Rev. 87, 328 (1952). We are indebted to Dr. Salpeter for making available to us a copy of his paper prior to publication. \\re hive found his ideas very helpful i'n iut work.
'fHE
848
388 I C E L E C TRO D Y NA N'1
849
D I SP LA C E M E N T
tion of nonreiativistic intermediate states, lvhere the and satisfy the difierential equations Coulomb binding cannot be ignored, must then be eA1,@)*ieal6J rQ))*ml obtainedin a mannerreminiscentof the first treatments ly r(-i0 uof the Lamb shift.'g This rvill not be necessary in the XG-(x' r'): a1a- ,"'7 (2'aa) present paper since rve shall be concernedwith the and hyperfine (spin-spin)type of interaction to tvhich only i 0,1 eA+ r@) - ie6/ 6l r@))I mf relativistic intermediate states contribute to the re- ly,(quired precision.ro y6+ (t, xt): 6(x- n'), (,2.4b) The practical goal of this v'ork is to obtain the splitting of the singlet-tripletground-statedoublet of posi- with the outgoing wave boundary condition. They are, tronium correctto order a3 Ry. Previouscalculations,6-8 of course,relatedby the matrlr C: accurate to order a2Ry, have included the lowest (2.5) G " B + @ , x ' ) : - ( " " , C t B B , G p , " ' - ( rt )' ,. order contributions of the ordinary spin-spin coupling arisingfrom the Breitlr interaction (the analogof rvhich We shall now introduce matrlr notation for the in hydrogen is responsible for its hyperfrne structure) combinedparticle coordinatesand spinor indices,and and of the one-photonvirtual annihilation force, char- the combined photon coordinatesand vector indices. acteristicof the systemof particle-antiparticle.The ex- Becausethe formulaswill get quite involved, the matrix pressionfor the energyshift given in Sec.III, Eq. (3.6) indices will be erpressedas arguments, by numbers for yieids these again in lowest approximation and contains l h e p a r t i r l e sa n d b y ! . f ' , " ' f o r t h e p h o l o n sa. n d t h e as well the matrix elementsof all interactions which summation convention n'ill be understood. Functions can contribute to the required accuracy. ol one coordinateare to be diagonalmatrices; quanti SectionIV is devotedto the detailedevaluatiorrof all ties alfixed with only one matrix index are to be vectors the matri-r eiementsthat may be looked upon as general- with respect to that index. The arguments of the Dirac ized Breit interactionsbecausethey dependpurely on matriceswill refer only to the vector and spinor indices the exchangeof photons between the two particles. of these quantities; they will be unit matrices in the In Sec. V we consider the annihilation interaction coordinates. Similarly, functions of the coordinates peculiar to the electron-positronsystem. Fiually, the alone must be understood as multiples of the l)irac comparisonlvith experimentis given in Sec.\iI. unit matrix. As an example, Eqs. (2.4) and (2.5) rvill be tranII. THE WAVE EQUATION scribed rvith the symbols I and S+ standing for the A discussionof the one-particleelectronand positron functional differentialoperatorsin Eq. (2.4): Green'sfunction associatedr.viththe vacuum state will a(r,;); serueas an introduction to this section.If the notation Q.a'a') 3-(rz)G-(23): of reference1 is extendedto include the positron field (2.4'b) ; [+(rz)c+(23):6(13) variablesQ'@), {' (") , and their sourcesthat are related to the electron variablest@),0@), and their sources G+(t2): _C(Lt')C L(22,)G_(2,1,). (2.5') by the usual chargeconjugatingmatrix C, II the mass operator M(12) is defined in the usual way, cic-t: -t. c: -4. c'c:1, () 1t (2.6) M+(lDGrQs):1n+Q2)C;t(2j), ,tr':C,l',,tt':C',1r. n':Crt, r'-C )n. operator where !J? is the functional difierential the Green's functions are defined by the vacuum exDlt(r2): m6(12)+ie7c, 12)6/6t (0 , Q.7) pectation values ^ o\
6l/(r))ol-o:
I
d a x ' G( x , r ' ) 6 4 ( r ' )
,l oz
and fo'
( 2 . 2 a ) then the Green'sfunction equations(2.4) can be rvritten in terms of integro-difierential operators F that are obtained from the 3 by the replacementof \)l.by M. A v e r l e v o n e r a t o rf { } - l 2 ) m u s t n o w b e d e f i n e df o r
60,(f'(r))01,,:o:1 dax'G'(x, x')bl(:'), (2.2b) J62
ecch
(lrcpn'c
frrnefinn
r+ (t,t2) : (6/ 6eA {))
where d4 and 0q/are arbitrary variations of the electron and positron sources,respectively.The Green'sfunctions can be qxpressedin terms of expectation values by and ,
G-(*, r'):i((!(x){,(x'))*)0.(r,
r')
(2.sa)
a.nd
6+(s, t'): i((!' (x){'(r'))*)0,(r, *')
(2.3b)
10R. Karplus and A. K)ein, Phys. Rev. 85,972 (1952). 11G. Breit, Phys. Rev. 34, 553 (1929);36, 383 (1930);39, 616 (1932).
(G+(r2) )-1 : (6/ 6eA+(0)F+(r2)
I-(f , 12) : - $ / 6eA aQ)) (G- (12))-l : - $/aeAa({)F-(r2).
(2.8a)
(2.8b)
In the absenceoI an external field thesetrvo quantities becomeequal becausethen the charge occurs always
389 R.
KARPLUS
to an even power only, and the two difier just in the sign of the charge. We now proceed to the two-particle system. The electron-positron Green's function for the vacuum state is defined by the relation d,6,,(Qt(x),1,,(rz) )*)oI o,,:o.rr,, :,
?ot Joc
A.
KLEIN
850
Finally, the equation may be written in the form
lF- (rt')F+(22')- r (12,r' z')fc--+(r'2" 34) :6(13)6(24), (2.16) r.here the interaction operator 1(1234)is definedby
",.r
I (12, l' 2')G-+(l',z',,34)
fol
anri I
AND
dnh,G-(rfiz,rih,)
: - F+(22')lyn-Qt')-
Jaz
M-(r|'))G--+(L'2" 34)
XEq(x1')6n'(x2'). (2.9)
-L + ie^/(8, | 3/) c (s'2)c (44,) (6/ 6J (0)G (4,s),
Evaluation of the variations with the help of Eq. (9), reference1, leadsto the explicit er?ression
: - F-(1 1')[IJt+ (zi',) - M+ (22')]G--+(r' 2" 34)
G+(rp2, *1'12')
- iey 22')C(2'1)C-'(33')(6/6"r( Q, t))G+(3'4). (2.17)
The secondexpressionarisesrvhen $+ and then F- are appliedto the Green'sfunction. Theseerpressionsmust norv be rearrangedso as to yield the interaction operator e x p l i c i t l y a s a n i n t e g r a l o p e r a r o ru p t o t h e d e s i r e d x2'). (2.10) order of accuracy.In other rvords,the functionalderivax((0@)0' @z'))a)ne(x1', As might be expected,this Green'sfunction is related tives may occuronly in terms that contributenegligibly to a charge conjugate of the two-electron Green's to the effect that is being investigated.The subsequent function with arguments interchanged properly, by operationss'ill be directedat finding an expressionthat is suitable for the purposesof this paper. (For other E q s . ( 1 3 ,2 0 ) , r e f e r e n c e1 : effects,such as the Lamb shift in positronium,a difierG"Br6+(xg2, lc1'r2') e n t f o r m o f t h e i n l e r a c t i o no p e r a t o ri s n e c e s s a r ) . ) With the help of the definitiono{ the vertex operator, = -L 6F'L '66uob'as' (Jlr2, f,rf ?, Eq. (2.8),the lorvestorder interactionmay be separated - C pp,C-t ay G p,- (ayx2)G5, (x2'x as follows: 1'). (2.11) ; : (@ (",) *, () 0 (x| ) 0, (r z,)) +)oe ") - ((Un,){' (rr))+)oc(rr rz)
"
'I'he
antisvmmetry of the two-electron Green's function itssures that both direct and exchange processes are contained in the electron-positron Green's Iunction; the second term merely corrects for the fact that the uncoupled electron-positron system cannot undergo an exchange process. In this case, G. y, 6,- (r 6 2', t 1'* r) -4,
( 2 . 12 )
wnence G'y5+(rp2, -
34)
: i e'? t (t, 11')9+(f, €')t' +1.v,22')G' + (I' 2', jl) _ _ ItJn- ( r, ) = M (1r' )lF + (22,) G-+ (1'2" 34) *i.e'7(1, 13')C(3'2)g+(t, t')C-t(2' 4') xt-(t"
r- 7y,ta t' ) G y p,- (x z'x z) G " e, - ( * t - r r ) G r , r ( r r ' r r ' ) ,
I(12,l'2')G+(1'2"
+' l')G (1',3)G+(2'+).
(2.18)
The secondterm in Eq. (2.18)can be simplifiedb-"*the use of Eqs. (2.16)and (2.6), rvhenceit becomes _ i", t (8, 11, (t, t,, )G_ )16/ 6eJ())
r1'r2') - Cp o,C-t t yG.; : G
(r$t')G y a,- (rz' rz) (xp1')G 96+(rrr2'), ";
xI(1"2, s'4')G-+(3'4" 3+). (2.t9) (2.7j)
the proper description for noninteracting particles. The difierential equation for G+ may be obtained rvith the help of that for G--, Eq. (21), reference 1, and of Eq. (2.4'). They yield
3- Qr' )G+ (1'2, 34): 6(13)cf(24) I i e1 ft , t I' )CQ'2')C-| (M') xG+(22')(6/6J(0)G-(4'3) (2.14)
and F+(22')3-01')G+ (1,2,, 31): 6(13) 6(24)
* i et G, Ir' )C(1'2)c-t (44') (6/ 3J(t) )c (4'3). (2.1s)
The last term, finally, is brought into more useful form rvith the help of the identity g*(E, l')C-t (2'4')r- (t', 4' t')c- (r' 3)G+(2'4) : D+(E, E')C-t(2'4')7(1', 1't')6+(1'2' , 34),
(2.20)
rvhich may be verified by iteration of both sides.The interactionoperator thereforeis given by I (r2, 31): ie'zYc,13)9+(€,t',)r+(t',,24) * ie't (t, lr')C (1'2)D+(t, t')c-1(41'h (t', 4'3) - i e \ ( t , n ' ) G ( r ' 1 " ) l ( 6 6 e J( ) ) I ( 1 "2 , s ' a ' ) /
y6-'+(3,4" s"4")lrc- +(3,,+,,,34)l-t. (2.2r)
390 851
ELEC
TRO DYNA
M I C
This, and a correspondingexpressionobtained from the alternative form of Eq. (2.17) correspondto Eq. (47), reference1; the only difierencelies in the secondterm above, rvhich represents the interaction due to the virtual annihilation of the electron-positronpair. The last term contains the effects of higher order electrodynamic processes involving more than one virtual photon, such as multiple photon exchanges and the corrections that symmetrize the first term in the interaction so that it dependson the vertex operatorof both the electronand the positron. We are interested in the effects of one and two virtual quanta, terms of order I in the interaction- For this reason, the functional derivative in Eq. (2.21) needsbe evaluated only to the lowest order, (s'4" 3" 4")l l(6 / 6cJ(t))r (1" 2, 3' 4')G--+ XIG- +(3"4" ,34)l-'=-
I(7"2, s'4')
XG-+(3,4,, j,,4,,)16/6eJ(l)f XIF- (3" 3)F+ (4'' 4)l:
- ie,l^y(E, 7''s')
Xt G', 24')l t (E,7" 2')C(2'2)C-r (4" 2") x7 (8" 2" 3',)fD+(t,E )G- (3'3")G+(4'4")
D I SPLA CE M ENT
interested,the operatorF(12) is a multiple o{ the Dirac operator F(72) that depends on the experimental mass m oI the electron, Ft(t21:(-aB/2r)
tFt\12),
(2.25)
rvith l-*(x, *'): 6(r.- r')17 u(- i0,'teA+ u@))*mf. (2.20) We may now introduce the interaction
Iqrz,s+7:e-aB/r)r(r2,3e,e.27) which enters the equation of the usual form for the wave function,
lF- (l t )F+(22, ) - I (12,r'2,)1,t, Q,2,) : O. (2.28) To find the energy levels of the system, we seek solutions of the form {(rp2):
4rx tu(r) ; X :i@ft
rz), x: rrxz,
(2.29)
that are eigenfunctionsof the total momentum operator with eigenvalue K. This eigenvalue is the goal of the calculation. fn the absenceof an external field, the interaction operator conservesthe total momentum, so that it is possible to write an equation for the function 96(r) of the relative coordinate *,
X D +(t, t' )l- F+ (4" 4) ^tG', 3" 3)
lF x(nc')- I r(x, *')lea@):s,
(2.30)
t F-(s"3)yQ',a"4)1. (2.22) rvhere Whenthis expression is multipliedout, the first of the e ; R X I Fs ( r x ' ) f . B 1 t four termsis conveniently includedin a symmetrical lowestorderinteraction, and the (.t) superscripts can r-71x,X' + - t"x') : I F ix') F'pE\X- ix, X' be droppedin the limit of vanishingexternalfield. ",(X This form of the approximateinteractionoperator, xeiK'x'd4xt,
I (12,3+)=ie'zr(t, 13)9+({,t')r (t', 24)
(2.31)
anrJ,I6(r, r') is similarly related to I(1234). The Dirac indices in Eq. (2.30) are summed in the same way as 7")t (E,I" 3h (E',24') those in Eq. (2.24); ,px stillhas two sets of Dirac indices I (ie')'t G, 11')G(1' even through it has but one four-vector argument. To ({', 4" D+(t, XG(4'4")7 4 {) D*(E,E') avoid complications in the notation, this matrix notation will be continued; where necessary, superscripts I Qe')\ (t, 11')G(l'1")y Q, 1"2')C(2'2)D+(tt') 7 and 2 will distinguish Dirac matrices that operate, 4")7 Q', 4"4) xD+(t, E')lC-1(i3')tG',3'4')G(4' respectively, on the first and second particle index of the wave function gs(r). + C-' (M')t (E',4'3')G(3'3")1 (l', 3"3)1, Q.23). Before we proceed to solve Eq. (2.30),_weshall decan be easilyunderstood in termsof the equivalent composethe first two contributions to 1(1234), Eqs. (2.23) and (2.27). W\th the help of the expressions!2 Fp'nman rlleorcm The wave functions {(12) of the electron-positron I,(t, 13): z"(f, 13)(11 aB / 2r) systemare solutionsof the homogeneous equation,
* ie't G, ll')C (1'2)D+(t, l')C-\ (44')7Q', 4'3)
+ (2,32) +^A(r(1_€' f-3) l F - ( l 1 ,) F ( 2 2), - I ( t 2 , 7 '2 ') ) , 1(,t ' 2 ') : 0 , ( 2 . 2 4 ) andr2,13 related to Eq. (2.16).It is important to realizethat the operators-F(12)alsocontainelectrodynamiccorrections. g+,"(t, t): Ql qA/2r)D+(E, t',)6Thesemay be obtainedfrom the corrcctionsto the one*D+Q(t, E')iF", Q.33) particle Green'sfunction G(12), ol rvhich Ii(12) is the inverse.IzFor the nonrelativisticstatesin which rve are 13Note that r2R. Karplus and N. lI. Kroll, Phys. Rev. 77, 536 (1950).
,r:|i,Dr,,
Da=!iDy, D+@:+iDFo.
391 R.
KARPLIIS
AND
,{.
8.52
KLE]N
they become
then given to a sufficient approximation bye
4ni.a7u(f, l3)D+(t, t'h,(t' , 24) * ie\ r(t, 11')C(r'2)D+(tt')fl (43')y,(l',3'j' X (1- aB/ r) * 4ria1,(9,1i) U (t, t')
tn: _ t. an*an*'p"(r) f X
X t,,tzt12- 1',t' - 4)+ 4r i o^,e,(l - {, E-.i) X D+(t, t') t,G', 24)! 4ria7 uQ,13) D,(D(tt).y,G,, 24), (2.34)
l",tr, {
*/rrat"(x,
(x, x,) r')l I y21(t,x')l I a2po)
t')l
f I d a x ' t d a t ' t ' Ix r ( x , x " )
up to termsinvolvingtwo virtual photons.The experiXfF6c(x", x"'))-'I *,(x"',"1],P"1*1, 1.t.0; mentalvalueof the fine structure--censtant a hasbeen l writtento absorbthe chargerenormalization factorin measured in the reference frame in which the total r'.q.(2.33),12 4ra: e2(ll oA/2r):4o11i7.O,' . . .
spatial momentum vanishes,
(2..35)
III. PERTURBATIONTHEORY Salpetere has discussed a method for finding the eigenvaluesof the total energy of a two-particle system describedby an equation like Eq. (2.30) if the interaction function does not differ greatly from a local instantaneousinteraction of the form 6(r-r')6(l)/(r)
(x,:r, t;
i:1,2,3).
K!:
( 0 ,K o ) .
(3.7)
The function 9g@) is the relativistic Coulomb wave function that is a good approximation to the actual wave function of the state whoseenergy level is sought. It is a solution of lFac(r,r')*Ic(r,x')),pg(x'):0,
(3.8)
whence AE: Ko- Koc.
(.3.1) 'Ihe
(3.e)
expressionEq. (3.6) is accurate to order c relative to the {ine structure contribution 161 and further presupposesthat the intermediate states in the secondorder perturbation term, the last in Eq. (3.6), can be replaced by free particle states. This is the casefor the IC (r, r')l I 6y(r, x'): Ic (r, *') spin-spin interaction under investigation. Before closing this section, we must briefly discuss I I xn(x, r')! I y1a(x,r,), (3.2) the wave function pc(r) that enters into'Eq. (3.6). where As is the case with the electrodynamic corrections to Jc(n, r'): -i.a6(x-r')1nt7nz6(t)/r, (3.3) the magnetic interactions in hydrogen, the contributions to A-E come mostly from the vicinity of the relative the Coulomb interaction,and coordinate origin. The two-photon contributions, therefore, will be at most of the order a,196(0)1,,where I nn:2ia(2r)-36(xr') po(r) is the Pauli wave function for the ground state 'Y0t1o2ko21 of positronium. Since this is the smallest magnitude f I T''T' that is being considered, contributions to these terms \ | d'*ett'1 \-"/ ,1 L ku' hi2ktz l' that are proportional to the relative momentum can be neglected. ft therefore sufices to approximate I a1a: iez(7uC)6(x)6(x')(C-\ ,)(7- aB/ r) / K,z. (3.5) pc(")l lp"@') by the product of I esQ)l': (lam)s/r These include the Breitlt interaction, retardation efiects, and the appropriate spin matrix element, which will be denoted by ( ). In calculating the effect of 161, which and the virtual annihilation exchange interaction. All the contributionsderivablefrom Eqs. (2.23) and,(2.3a) contains contributions of order olpo(0)1, due to the that are not included in Eqs. (3.2-5) depend on the exchangeof one virtual photon, the relative momentum appearanceof two virtual quanta. The two-quantum can no longer be neglected. fndeed, corrections of terms that are includedin Eq. (2.34)will be denotedby relative order a that arise from the larse momentum 162s(r),while those that are explicit in Eq. (2.23) will c o m p o n e n tos f t h e w a v e f u n c t i o nm u s t n o t b e o m i t t e d . be denotedby I urn(r, rt) or I 6s{2)(a, r,) dependingon As Salpeterehas pointed out, an improvementover the Pauli wave functions is obtainecl when the intesral whether they are exchangeor direct interactions. 'l'he change in energy levels produced by the per- equation, turbations 161 and 1yq2acting on the electron-positron vc(x): -io I lF*c(x, *')l-t,pe(r',0)dr'fr', (3.10) system bound by the Coulomb interaction Eq. (3.3) is JS-ucha term can indeed be separatedfrom the centcrof-mass transform of the first two contributions o{ Eq. (na), which may be written
392 853
DISPLACEMENT
ELECTRODYNAMIC
The following observationscau now be made about that part of the energy change which depends on the spin of both particles. Only large contributions of magnitude f a 2 and a-r will be important in the integral. It can be (3.1 y')l-tqs(r')dr' r'. 1) ,ps(r)-- ia | [f'^c(x, / seenthat only small valuesof the momenta k', k"Sam make such large contributions. The important region IV. THE DIRECT INTERACTION of integration, therefore, extends over small values of either or of both these momenta. When both momenta We turn now to the evaluation of the matrix elements are large, k' and k" -m, the integral becomesnegligible for the energy shift that was obtained in the previous for the purposes of the present calculation. A term section. We shall consider first the contributions A.Er proportional to k'2 and k"2 in the spin matrlr element, from interaction, arise direct which of those terms for instance, is negligible becausein its evaluation one namely, those in which an electron-positron pair is may neglect (arz)2 compared to h'2 and *"?, so that the present in each intermediate state. According to Eq. integral in Eq. (a.3) becomes effectively independent (3.6) and the definition precedingthis equation, of a.Ia One may now see that the spin-dependent contribution of the retarded Coulomb interaction involves i , d 4r c dx4' , p c ( r c ) I r r r ( x x, ' ) 9 6 ( x ' ) A[. B: one of the cl.k'c2.k' terms of both F(l) operatorsand I is therefore a negligible large momentum effect. The Breit interaction, of course,is important and contributes t/ -il eo(o)i2 dnilA.v'\lK2Bt2)Q(, r') in conjunction rvith only one factor cr'k/cz'k'. Since J corrections that involve an additional factor fr"2 are too small, one may use an approximateexpression
is used Ior au iteration procedure based on the Pauli wave function,
+
J
dLtc"d4x"'I Ktn(x,x")
F*(r)ry+(1+ a1.k / 2m)(l - ar. k / 2m) i( z+n) I t!) tt Xl(m / E) (e-Nn-n) | ! s-
, r')> XIF vc(x", :c")l-Ll KIBQ"'I r -il p0(0)| rJ daxdar'(I *'17. orutr>,1,
l(e-&a
(4'1)
The one-photonpart of the interaction, AEBL: - i
J
to evaluate AErr. The spin matrix simple,
has now
quite
become
X c r . c r ( l * c , . k , ,/ 2 m ) ( l * a z . k ,/, 2 m ) l +(at . o2k2- o\ .kc, . k)+3(or . cr)[t,
2qr (2r)3J
(4.4)
( 0 | . a t . k '/ 2 r n ) ( 1 - a 2. k ' / 2 m )
d'a x,Ja x' Pc(x)I x rs(*. x' t,p6(*')
:-
element
nttt -p-i(f In)ltl)]
| d4xd4k\oc(x)eik.
''T' '
f T' \'
'to''vo'ko'1 "Yo'll'ko'
;_ 8,," L k,,
* lpc\x), \+.2) k;nk,,
R;'ft"'J
(4.5)
and the since the 0-function implies that k'-k":k, integrand has the necessary spherical symmetry. The &6 integration with the usual treatment of the poles yields
presents the greatest complication becauseit contains the lowest order hyperline structure as leading term. When the approximate Coulomb wave function evaluated in the appendix is inserted here, one obtains a spin matrix element and multiple momentum integral which is multiplied by the explicit factor c3l 96(0)1'?:
e ikotdk|(kr-k02-it)-t-Tik- t.-;tlt (6)0). (4.6)
I
The function of time in Eq. (a.3) is therefore even) so that the time integration may be carried out only over positive values if a factor of two is supplied' I'he integrals encountered are of the form
Sarl ,po(O) l'? r AEBr:
(2n)am2 J nt2
( P 'zj
I
dle
,to't^:t"-ikte
- i(k+ L' + E" +m*m)-t,
nf
-6(k-k,+k,)
|2 ! a 2n 2 ) 2 ( k t + I a , m 2 ) 2
:!r!!!f,.,a>)(43) " (,"eolff
i\n"xmtl
since
the
denominator
never
vanishes.
'fhe
(4.7) energy
)a Detailed examinalion shows that the integral actually is pro portional to loga in this case. This rJependenie, however, is still negligible for our purposes.
393 R.
KARPLUS
z X f ,Jk,/k',1k"6(k-k'ali'11p': FLa2mzl1(E'fm)(E"!m) ' ltnqz/ trzl-'?1 h.
mz-E'E"
k X,
2E'8"
. (E'-m)(E"-nr)
at d l x d o xe' - i K " \ x- x ' ) d X
kaE'1p"
k I -l. kIE'*D"_l2ml
4EE"
854
: - i l,ps(0)l' (4nia)' LE Bz(2)
4E'8"
h*E'lE"-2m
KLE]N
of this treatment lies in thc fact that the sum of the direct interactions is independent of the cutoff; that a cutofi need not have been introducedat all if the terms had beengroupedproperly accordingto the photon momclltum that rnakes the contributiorr rather tltan lccording to the physicrl processthal is represenlecl. 'fhe evaluation of the remainderof Eq. (4.1) is relatively simple.The secondline contributes
A,Epl : *(c1. o2)aa(2r)-3 | pn(O)l, J
I
A.
'l'he justification
change has now been reduced to
X(k"2
AND
(+.8)
X {((r*1G1(X*1x, X' jlr')y"t) X (t
"'G'
(X - ir, X' * L*') t u,))
X D+(X- X' + +(r- r'))D.,(X'- X t i@l x'))
As it stands, the integral in Eq. (4.8) is quite difficult to carry out. We must remember, however, that at f least one of the two variables k', k" must be small comt f (2t) l d1kd4k'ei+tpik'!'f kt2kF'2 pared to m, a fact which permits replacement of the corresponding kinetic energy by the rest energy, Furthermore, the occurrence of a factor (E'-m) im' X((T'. r'- yo'r o'(ko2/k;2))Gt(x*lr, X'llx') plies that the particular term contributes only for XG'(X - lr' X'-\x'11rt . rz- 1otyo,(ho2/ kir))), large kt-m, whence &" must be small, and aice uersa. (4.10) In such a case, the small momentum may also be neglected in the argument of the d-function. The an expression derived from Eqs. (2.23), (2.31), and remaining integrationcan then be carriedout: (3.4). When Fourier transforms are introduced for the Green'sfunctions, the energymay be written LE 31: -(ot . v2)atQ") -"1 po/n)|, 4-2 Afu"r:'. . . . l e . ( o )i ' l , l o k 1 h , ' 7 ' '
"
| X (h'2+ +q2n 2)-2(kt 2+ I d2nt')-2 { ((k - E' ) / 2mE kk'') 6(k-k') (k'' 2+ ia2m2)-2
"\ x(,",!s!:?-' \ F.,-(n-ko)r
* ((h - E" ) / 2mE" kk"'z)6(k+k") (k' 2+ !azrnzl-21
:
2r
a
|
. ( u , . o .i )l t "' l e o ( u ) Il !l -l J
4a r
-
2q r
J
\ZT)'
x I dkdk'dk" | (m,/ ti' E')6(k+k" -k )
"')
G !" - ! -l (T' "! T t - ? u ' ? u 'P u I '1r,"r't E2- (n- ko)2
tnl
l n' 2_k .^-l 1 . ( + . , )
In the 6rst term both k' and k" are of the order am, in the secondk'-m, and in the third k"-m. A cutoff ft,, has here been introduced as a lower limit on the final momentum integration. Its presenceshows that some contributions of order a2| ,pr(0)] , to AErr do arise from small values of momentum, contrary to expectation. It will be seen, lrowever, that the direct interaction I xzxQ)'and the second-orr'lereffect of 1{r, also cotrtain c o n t r i b u t i o n sf r o m s m a l l v a l u e so f t h c m o n t e n t u m : r s represented by the appearance oI ln(m/2k,,). Just as here, these are being treated incorrectly because of the assumption of free intermediate states that is implicit in the derivation of the interaction operator.
t'(iKC lk)-n [r-
( T ' ' ' ( ' - " Y o: '' o ' k o/ 'k : )
@+ W
\ ),
(4'l | )
where, as before, Iiz:k2lm2-ie
(e)0),
};Kna>m
(1.12a,)
it nrl hu't- Pz-Put-1,
(4.121t)
define the treatment of the poles. Explicit display of the spin matrix element and spherical averaging wisely precede the momentum
394 855
ELECTRODYNAMI
C DISPLACtrMENT
integration,
only anticipate the result [see Appendix, Eq. (A.3)] that the latter will contribute a term that renormalizes the charge occurring in the first-order virtual annihilant r' tion from its uncorrected value d to the measured d h o ( k p z ) z h , d h , xf value 4ra [see Eq. (2.35)]; to the order consideredin Jo J-6 this paper, therefore, all quantities depend on a from - ? k')lE2 - (/n - h0)21-2 X { (ko'z nere on, The first one and last two terms in Eq. (5.1) present * ik o'lEz- (m.- k o)zf-tlE'z- (rn+46)'?1-') ; (4.13) some complications since the quantity B is actually a it depends on the identities divergent integral.r2 We expect that other divergent integrals will make the complete result finite, but we : (outo 6iiia;i; (4.14) 2(o'' o'). ni\ 7;7 i: must exercise great care to obtain the correct finite The evaluation of the integrals is straightforward, result. For purposes oI orientation it is instructive to except that the same cutofi fr- for small momentum consider briefly the matrix element in Eq. (5.1) for values must be introduced. The result is noninteracting, nonrelativistic initial and final states, because the high energy divergencesmay be expected (5q 2a n 'l 2r q : - -(c'. c') | es(0)l, - - *aE B2t2) 1n^- l, (4.15) to be the same in this simpler caseas in the positronium I J m' tZT ZR^l T atom. The wave function ,pg(r) then represents the and gives the total efiect independent of ,t- of processes initial state plus a co'rrection due to one Coulomb where all quanta are exchanged between the two scattering, while { lF (r, xt )f-t I x B (r', r" ) s s(0)d4*' d4*" represents the correction to the initial state due to the particles, Breit interaction and retardation effects. The three 2r a 3al I A E a t l L E e z ? ) : - ( o t . " r ) l e o ( O ) l ' { 1 - - f . ( 4 . 1 6 ) terms we are now considering, therefore, comprise the 3m2 | 2rl matrlr element of the virtual annihilation in the initial trEp2t2): (8a2fr) I ,ro(O;| '1",."t;
The perturbatio\ AEB1G)includes efiects of vacuum fluctuations on the exchangeof a single quantum. The spin-dependent corrections to the vertex operator are contained in the anomalous-magnetic moment (a/2r)(e/2m) of each particle while the vacuum polarization has no efiect on the singlet-triplet separation. The added contribution is therefore 2ra AEB1(D: - -(a1.r,)le6(0)1,{a/nl. 3 rnz
t!
Y
Y
lr
ir
|
\7 Y ir
Itr I b
,. Feynman diagrams for virtual annihilation electron positron scattering. "r"-
(4.17)
state plus a correction due to the four-dimensional interaction representedby one quantum exchange.The V. EXCHANGE INTERACTION Feynman diagrams for theseprocesses,Fig. 1, show that In this section we shall evaluate the matrix elements the electrodynamiccorrections,Fig. 1b, 1c, are just of the exchange energy, embracing all processes in the correction to the vertex operator, and therefore which there is an intermediate state with no pairs contain each a contribution (q/2r)B multiplying the To our order of accuracy, basic interaction Fig. 1a.r?,16 present. The energy change,according to Eq. (3.6), is the divergent integrals disappear. f With this understandingwe can attempt to evaluate A E n : - ; I d a x d a xe's ( x ) l y ' ; ( r , r ' ) e 6 ( r ' ) ( 1 - a B / r ) the actual matrlx element in Eq. (5.1). In order to keep track of the inf,nite quantities, it is very conf venient to regulate the interaction brought about by -tl eo(0)l' dardax'(I6ra(x.x')) I photon 11 in Fig. 1 with a heavy photon of mass .4..r8 The integral B can be evaluatedto B^,l'? f -tl eo(O) d4)cd4#td4r"d4r'tl frf l' t J B^:(ir'?\ t I uduldak Jo
X(I rtn(r, r")lF ac(x" , x"')l-11 s1a(x"', r') t I rra(x, *")lF 6c(rt' , x"')f-tI yla(x"' , r') t I ru(r, r" )lF xc (x".,x"' )f-' I x t n(x"', r')).
(.5.1)
Considerationof the virtual two-quantum annihilation I xzr and of the second-ordersingle-quantumannihilation will be postponed to the end of this section. We
J
2 X l(kzf m2u2)2- (k2J m2u2 | L2(l - tr)) - 4m2(1- u - lu2) (h2+ m2u2)-3l :ln(tr/m)!1a*1n(m/2k,,),
(.s.2)
* J.C'.lVr.a,Phys.Rcv.78,182(1950). 16R. P. Feynman, W. PauliandF. Phys.Rev.74,1430(1948); Villars,Revs.ModernPhys.21,434(1949).
395 R.
KARPLUS
AND
where quantities depending inversely on ,4.have been omitted and the low energy cutofi A- has been introduced [see text following Eq. (a.9)]. The structure of the exchange interaction 161a implies that the energy change corresponding to Fig. 1 can be written
A.
Tr[C-17;e1(0)] :ll-
(a/2r)B tl Tr[C-12;i,o(0))(- a/rm,) f*.
LE a1: - v o*-2 o t(O) "aQ iC)"p
o'opt(o)
"'
h2dk{-?n2E-t(4k2/3} 2n2)
X (kz+ +q2rn2)-2t I (E: r - P- t1t, *z P- s -h2(h2+ l\2- L4/4rn )-rl_ (rn2_I.le) t X E (4k,/ 3* 2m2)h-4+ + (E-t - E - 1)
(5.3)
where (C-\)
-1f1. + (tnz- + L2)k-2E
p'=TrlC-tttpt(O)l Here
E _(h2+L2)r,
=lr- (a/zr)B Trll d.k(rn,/ E) ^12"(2")-, I X (h2+ iq2nt2)-2| (l - r. k / Zm) t XC-tt i0* r.k/ 2m)l C-t^yft2/ 4*rl
eo0) l
+ (i/zr)
f
J
(5.5) (5.6)
in the second set of terms, which came from the regulating expression. One can observe that these reduce to the first set when h:O il am there is neglected with respect to fr. The integrations are similar to the ones encountered in connection with Eq. (4.9) but made more complicated by the regulator. If one erpands the result in porvers of (m/L)'and keeps only the leading fprm
nne
nhtqinc
Tr[C-r7;e1(0)]
dak(i ;@- i (],KC* h))
:ll-
XC-ty 1(tn- 7 (! KC- h)h ;- (ks2 / k;2) Xto(m- 7(iK"*k))C-tt;(m-
856
integral, Eq. (5.a) becomes
X I
X(C-\ ) y "',p{o)",0,,
KLEIN
(d/2r)B ^l Tr[C-\2;,po(0)]
x U -f (q/ 2r)Lln(rt/ m)l- +* L- In(m/ 2k-)l
t(iK" - h))tol
: (l- 2a/ r) Tr[C-12;po(0)], (5.7)
(5.2), whence Xlk p'zl-tlE'- (tn+ ko)'f-Lla, - (m- ho)rl-L,po(o) with Br given by Eq. LE a1: - (r a/ mr) (1- 4a/ r)
f
- (i/ 2r) dah{i,@- i G KC* k)) J
XTr[po(0)zF]
XC-t y1(m.- 1 (l KC- k)), Fllh t2+ L'zl-\
Tr[C-r'y;e6(0)],
(5.8)
because7rC is a symmetrical matrlx. When the usual representation of the charge conjugating matrix
- (n- fr0),1,e0(0)1. (s.4) XlEz- (m I ho)zl-'lE2 I In writing the contribution of the regulating term, the last in Eq. (5.a), we have taken advantage of the fact that a very short range potential has no bound state so that the scattering picture describedby Fig. 1 is applicable. The total energy has been approximated by 2m everywhere except in the correction to the Coulomb wave function, rvhich comesfrom Eq. (A.9) evaluated at the origin. Only a spaceJikepair-producing Dirac matrix need be taken in Eq. (5.3). The trace is evaluatedrvith the help of the facts that the Pauli wave function has only largecomponentsan<1that the chargcconjugating matrix C is an odd Dirac matrix. After integrating over fro with the usual treatment of the poles and after spherical averaging of the momentum
C:C-L:
(5.9)
"to"yz
is inserted, the direct product of the Dirac matrices in Eq. (5.8) can be relabeledso that the operatorsrefer to the spins of the individual particles:F8 | Oo(O)" eQ {) e"(C- y ) p'"' p o(0)"' p'--
po*(0) "B
a po,l,po(0)y X [86"",68p,+]o """ "'
:-
Ieo(0)l'(s'),(s.10)
where S is the total spin of the system, S:*(a'f
a'z).
( s .1 )
We then obtain the known effect of the virtual annihilationrs plus a large correctionof relative order a, 6 E " ' : ( r a / m 2 ) \ S ' z ) l , p o @ ) l ' 0a- a / r l .
(5.12)
396 857
ELECTRODYNAMIC
DISPLACEMENT
We norv turn to the contribution of the second-order prevents them from contributing. Since the wave funcsinglequantum annihilation, tions in which the spin matrix elements are evaluated have only large components,Eq. (5.18) can be simpliaE ^2(\, iTzd2m-,TrIpo(0)roC](C-tr) Iied to X[Frc(0, 0)]-'(z,c) Tr[C-I7;e6(0)],
(5.13)
rvith the spin sums as inferred from the derivation of this expression. In the appendix this efrect is interpreted in terms of the polarization of the vacuum by the photon produced in the virtual annihilation. The evaluation given there together with Eq. (5.10) shorvs that the effect on the singlet-triplet splitting is8 L E A 2 ( D : ( r a / n ) ( S r ) |e o ( 0 ) l , | - 8 q / 9 r ) ,
- 3k,'z\Q orsc)(C- lvorr)) : 3}"'!(("vzro)(rzvo)), (5.19) where
ia2 A E d r ( r ) : - l 9 o ( 0 )l t
The momentum integration,
J
? I d[hk!21k"2+L2l-a:ir2/3L2,
ar
xl Jo
x((t rltGKc-k)-m)y"C)
| 9r(0) l,
FL
Nt | "a*7*12-y1'-4(r-y)-ief-r .ro
: - (a' / nf)(2- S'z)l,po$)l'z(2- 2 ln2+ il).
(i KC - k) mft
" - k) - nl1,\,
(S'rs)
u,herr lourier representationsare iutroduced for the Green'sfunctions.The spin matrix elementsof the tso parenthesesis to be taken as trace with the final and initial state n'ave functions, as in Eqs. (5.3) and (5.a). The momentum integration is simplified by the usual procedure of combining the three distinct denominators accordingto the formula
(5.22)
brings the energy perturbation into the form LEp{z): -6oz*-r(2-5r)
I C-\,6GKC
(5.21)
3k,z(z-521.
T.
KC - h) \'z+ m'?l' l- L X Ihe2(K c - k) "21(+
(5'20)
Rearrangement of indices according to Eq. (5.10) finally produces the ordinary spin matrix element of a function of the total spin, Eq. (5.11),
(5.t4)
since the renormalization constant ,4, Eq. (A.3) has been incorporated already. The final item to be discussedin this section is the energy shift associated with tivo-quantum virtual annihilation given by
x (- C-' t,lt
7s:'Yt"t2"l3"fot^ls2=- l.
(s.23)
The real part of this expression corresponds to the energy change of the level while the imaginary part correspondsto the well-knownrTdecay rate of the singlet s t a t eb y t w o - p h o t o na n n i h i l a t i o n , r-1:a3 Ry-:0.804X10ro sec t.
(5.24)
'I'he
total contribution of the virtual annihilation interaction may be collectedfrom Eqs. (5.12), (5.13), ano ().rJr, 6B o: (r a/ m2)l,po(0)l,{(S'!)(1- aa/ r - 8a/9r)
f 2(s'- 2)(1-ln2)).(s.2s)
- h)^t* *,1,l-, , d 4 k ath2 ( K C k ) , r l ( i K "
VI. SUMMARY
:6
'I'he
ftftf
J^
dependenceof the 1tS and 13S stales in positronium on the spin of the system is obtained by the addition of Eqs. (,1.16),(a.17),and (5.25):
t'dx tuJ a,n J^ vn
X llk - i Kc (xy -t 2(1 - *))1'?
tE:
{rm2(xy2-4(1- x)(l- y))- i.el-a. (5.16) T h e d i s p l a c e m e n tk s - h s f n ( y + 2 ( 1 - r ) ) denominator into the form
h2| rm2lx(2*y)'- aQ-il
(2r o/ m'z)l,po(0)l,{t(ar. c')lr-
}a/ rf
+ Xsl[l - (26le| 2 tn2)a/r)1.
brings the
By taking the difierence of the value of these operators in the singlet and triplet states, one arrives at the ( s . 1 7 )hyperfine splitting
and leavcsthe numerator proportional to (- c-''v,'r o'r,* C-'r,r,r u)) (5.18) "t'c) after hyperspherical averaging and the discarding of some terms whose deoendenceon the Dirac matrices
LIV 6: (2raf m2)l *oo(0)1'z{7 /3-(32/9+21n2)s/tl :la'zRy-17
tk x'((t n
-lJ.
/3- (32/9+2 tn2)a/rl :2.0337X106 Mc/sec.
A5h..t".,Ann. N. Y. Acad.Sci. 48,21g(1946).
397 R,
IiARPLUS
AND
The singlet state is the lower one. It can be seenthat most of the rather large negative electrodynamiccorrection comesfrom the virtual annihilation interaction. When the experiment of Deutsch and Browna is interpreted on the basis of a Zeeman efiect that depends on the totar magnetic ^. ;;;"G;i;;;,jfi]"7i'"1'"i each particle, the value of the separationobtained by them isb
A.
The center-of-masstransform of the noninteracting Green'sfunction appearsin tire integralequation (3.10). Its Fourier representation is ,. [j?xc(r'r')]-t:-)
APPENDIX The operator [tr.(13)F+(24)]-1:G-(13)G+(24), the n o n i n t e r a c t i n gl w o - p a r t i c l eG r e e n ' s f u n c t i o n ,a n d i t s F o u r i e r t r a n s { o r m sa p p e a r s o f r e q u e n t l y t l r a t r h i s appendlr will be devoted to a discussionof someof its properties. fn connection with the second-ordereffect of the virtual annihilation, there appears the tensor
_ t
r
dok"ou o'
-. /.{.4\ l m - t t ( i K-c l h ) ) L n' -t-t G K c - h-) f x- ki'?) (+Krc t E2- (LKoctha)zllE'z-
LWh: (2.035+0.003) 105Mc/sec. Theory and experiment are thus in satisfactory agreement. We are grateful to V. F. Weisskopffor calling this p r o b l e mt o o u r a t l e n l i o n .T h e a u t h o r s a r ea l s o lne l : db l , e d t o t h e m e m b e r so f t h e I n s l i t u , t . e , I o r . A d v a n c S ludy Princeton,for an informative discussion'
858
KLEIN
when the center of mass is at rest. The quantity -R is defined by (A'5) E-h'zlnf-ie,. e)0, 5 i n e eF . o ( A 4 ) r e n r e s e n r st h e n o n i n t e r a c t i n gt w o plt'ir"tr.."'='r""Iti"" i " t o u t g o i n gr v a ' e s .T i e i n r e lration over the fourth component of the momentum
i;Ji.IfTJ;.,ilJ.ilIi:,.T]''Jil':,Y; iltimecoordinafts , lFxc(x,r'1f
':i(2r)-3
f dkr;t r'-"r J| X.(hrllqrmr)Filt-t'),
(A.6)
where
(C-1tl)lFxc(0,0)l-'(ry,C) :, t
F o Q ) : L 7 s - o t u _ m ) )+t e _ i ( E + n ) t t t )
f2E d 1x ' e - i K ( x- x ' ) ( c - L ? ; ) " " . G " , s , - ( x ,x ' )
f/
k'l
o,.k\/ o,.k\ |/ Xl I l+l l t * 2* m I 2m / \ [\ /
t h a t a p p e a r si r r t h e v a c u u mp o l a r i z r t i o nt e n s o r . rh2 i s e q u a tl o
(4o;tt"""
otk\
t)l tt I !le-ttz-^> t - ri(E+m) 1
x T r t l p - ; . x , ) t , G - ( x , , x ) 1 ,( A . 1 ) l yeq u r n t i t y b y E q .( 2 . 5 )T. h i si s .h o w e l epr r, e c i s er h
i --l^..lKc\2-K.cK
o'k\/
xlll+-ll1--l+-l 2 m' l 4 m 2 J 2 m/ \ L\
r X G " p + ( X , X ' ) ( t , C t ul d, u n :x ' r - ' n ' * - * ' '
k, /
;,\t*
c\
'
I cr.k-a2.k\l
l't
u
)l
(A'7)
und the total energyo{ the 1,5state has been inserted,
v,(1- +vr)(Kc),I
xlza+["'dv m2lf,(KC)'(1_V2) ].
Koc:2m-ta2m.
(A.2)
In the frame where Knc:6, (KC)2--4m2, the tensor becomes
'#','(;-^-::) (A.3)
(4.8)
The wave function derived from Eq. (3.11) with the help of the operator just obtained is
eg\):
(2a/(2n)2)
f | dketk'n
X (h2!la2m2)-2F{t) eo(O). (A.9)
P o p e r3 3
OF ONTHEMAGNITUDE CONSTANTS THE RENORMALIZATIOI\ EI-.ECTRODYNAMICS IN QUANTUM BY
GU){NAR KII,LI'N \I lith the aid of an exacl forntulation of the renoimalization method in guantum electrodynamics which has been developed earlier,it is shown that not VV all of the renormalization constants can be finite quantities' It must be stressed theory. that this statement is here made rrithout any reference to perturbation
Introduction. In a previous paperl, the author has given a formulation of quantum electrodynamics in terms of the renormalized Heisenberg operators and the experimental mass and charge of the electron. The cor^sistency of the renormalization method was there shown to depend upon the behaviour of certain functions (II (p'), Dr(p') and Xr(p'z)) for large, negative values of the argument p'. If the integrals
(rr (- o\ \)a)a
de,
f3, (- o) \
"
do
(i:
1,2)
(1)
converge, quantum electrodynamics is a completely consistent theory, and the renormalization constants themselves are finite quantities. This would seem to contradict what has appeared to be a well-established fact for more than trventy years, but it rnust be remembered that all calculations of self-energies etc. have been made with the aid of expansions in the coupling constant e. Thus what we knorv is really only that, for example, the selfenergy of the electron, considered as a function of e, is not analytic al the origin. It has even been suggested2 that a different scheme of approximation may drastically alter the results obtained with the aid of a straightforward application of perturbation any theory. It is the aim of the present paper to show-without attempt at extreme mathematical rigour-that this is actually not the .case in present quantum electrodynamics. The best v'e can 1 G. KAr,r-6Nn Helv. Phys. Acta 25, 417 (f952), here quoted as I. ' Cl., e.g., W. TurnnrrC, Z.f. Naturf .6a 462 (1951). N. Hu, Phys. Rev. 80, 1109 (1950). 1*
399 4
Nr. 12
hope for is that the renormalized words, that the integrals
theory is finite or, in other
n'rr(- o). f},(-") da, \.. V
o0,
(2)
Jq',)a"
appearing in the renormalized operators, do converge.No discussion of this point, hov.ever, r'ill be given here.
General Outline of the Method. We start our investigation with the assumption that all the quantitiesK, (l -l)-1
and *
that the integrals (1) converge. This will be shown to lead to a lower bound for I/(p,) u,'hich has a finite limit for - prn *, thus contradicting our assumption. In this way it is proved that not all of the three quantities above can be finite.'Our lower bound for II (pz) is obtained from the formula (cf. I, Eqs. (82) and (32 a)) r/ \- r,
Ir (p') : :+ 1- 1;wt').r; ) , |
(3)
p\'): p
It was sho*'n in I that, in spite of the signs appearing in (3), the sum for II (p') could be written as a sum over only positive terms. Thus we get a lorver bound for II (pz), if we consider the following expression
Ir (p')r --+; )l'l aol,i"lq, q')l'. "r
(4)
ofrp
In Eq. (4), (0 ljrlq, g') denotes a matrix element of the current (defined in I, Bq. (3)) between the vacuum and a state with one electron-positron pair (for no>- co). The energy-momentum vector of the electron is equal to g and of the positron is equal to q'. The sum is to be extended over all states for which q + q' : p. We can note here that, if we develop the function Z (pr) in powers of e2 and consider just the first term in this expansion, only the states included in (a) will give a contribution. For this case, the sum is easily computed, e. g. in the following way:
) r ' l ( 0t i , 1 " , 1: '
Z
( i : r , , r t 1 1 ; ;"';- t ( 0 t L4t l ' )
400 Nr. 12
(5)
'lhe
for large function Z(0) (p'z) has lhe constant limit ,{n, 'of values Oz. This corresponds, of course', to the rvell-knorvn for the first-order charge-renormalization.We shall see, divergence however, that with the assumptions we have made here the Iower bound for the complete If (p'), obtained from (4), is rather similar to IIo (p').
An Exact Expression
for the Matrix
Element
of the Current.
Our next problem is to obtain a formula for (0 lirlq'q'> rvith which we can estimate the matrix element for large values of - (S * q')'. ,For this purpope we first compute
= -"!!Jt 3)lip(*),f (3)ldr"' lir@),rtor(r')l
(6)
- r" rp(B)idar"' . I;,!,i ) vnlirt*), (Cf. I, Eq. (5a).) The last commutator can be computed without difficulty if we,introduce the following formula fot ir(*) ieNz
i r @ ) : =E,slr)
L
0 2A " . @ )
+ r _ Ltaiffi-
rvith €6:
6pit- L6r+6i,*
L6p+lAn(r) (7) (7 a)
and
: (r)). s7@) f;fO<"1,yrp
(7 b)
The expression (7) is rvritten in such a way that the second timederivatives of all the Ar's drop out. With the aid of I' Eqs. (4)(7) we norv get
401 6
Nr. t2
(r), Lir(,rl,vG11*-,-*: ,'?, €utlrt rp(3)l I : -
-i"') -=€r7,TaT1"V(r)d(F
j
(8)
l
It thus follo*'s that /.x
t i r @ ) , t t o r ( r ' ) J: - N ) j . ( 1 3 ) [ . r p ( ' ) ,f ( 3 ) ] d -
pIV
t _ tfulS(t
(e)
t)y;rp(x).
We then proceed bv computing
Q l {fj r(r) , ,ro)(r')), Ttor(r")} | o ) ieAf" (1e (* i_ Lfui.S ) zr S 2) N ) S-(tl) de"" (r), {?(o) x [(0lf,rp (2),/(3)}]lo>-
:
If this expression is considered as an identity in r, an I r nI d fr' :t:' c" ]t will obviouslv give us a formula for (0 lirlU, A, anr dd fo for rilightthl rie Q l i p I q ' ) . ( C f . I , E q s . ( 6 8 ) a n d ( 7 7 ) . ) \ \ / e t r a n s f o r m the iht hand side of (10) in the follo'wing u.ay: l''"' (2),/(B)): r)4
9,,7a 1e;+ Iili l si ( 3 2 )
(11)
and, hence,
( o l l i r @ 1 ,{ 7 ( 0 ) ( 2/ () 3 , ) } ll o l :
# r ^ t ( 3 2 ) ( 0l t i r { O , A i ( 3 ) l 0 )
+ u'Iljf'u(ol.jp(*),{/(e),7<+>>ll 0)s (42).
(12)
The last terrn in (10) can be treated in a similar \vay: f"
oUl
(r}l L i , , @ ) , ? t o r ( 2: ) Jn - ) j 7 , ( r ) , f ( 4 ) l s ( 1 2 ) d r l v 1 , L _r O G ) y t s ( * z ) E ^ p and
t i ! -. < . t 3 ) d x " ' ( o 1 ( F/((' ): ), ) l o 1: - ( 0 1 { o( r ) , t p ({ r, t' ) * 5 " ^ :B" -) ty n , p ( r ) d . r " ' ): l or > s(r ) -*,] [t
I tt*' j
Nr. 12 Collecting (12), (13) and (14) we get ( 0 | {[J& (0), ,p(o) (r')), y{o)(r")) | 0 )
:
i+]+2(N-1)l6r1s(1 r)vf
(r2)
pI
-,
\ s _ 1 r s7; 7 . s ( 3 2d)r " ' ( 0 l l i r { * ) , A i ( 3 ) l | 0 )
(15)
a-h ,.J
-"')jf
FI"'
( / ( B )f, < + l )l lo ) s ( + z ) U f ' " s ( 1 s )< o l l i p ( r ) ,
- N ' ) j j " ' ! j f ' " ( 1 3 ) ( o| { f ( 3 ) ,l iu @ ) , / ( 4 ) l| 0 } )s ( + 2 ) . 'Ihe
second term in (15) can be rewritten lvith the aid of the frrnctions II (p') ana n (p2).
(0 | li
: 0) dr lv r , @) ,A ,.(3| 0 )l) ta o {a +l1 oI h1,@) ,i 7@\l : ap,'ot''u(p)lptpt- oz6rtJ4\P\ -;t\
(16)
i
We are, holvever, more interested in the expression
t''"1'[ L' dPPin,*\w'') e ( r 3 ) l ( o l t T r ( r ) ,A i ( 3 ) ll o l : 5tr+ e,r). )
(1i)
-t ine(p)n (p,)l+ tr * e(r Dlu#::}, | lvhere
@(*) : Obviously, rve have
,, \-'"/ "fa\a l
dpeip' u( p)n q') .
(18a)
@(3r):0
ryP:
( 1 7a )
-in(o)d(t-;"')
(18b)
for rld' : ro. It thus follows
a#+9 e '( * B ) o e u o f r l - T - f r l ' [ e ( r B ) @ ( B r+) ]2 i n ( 0 ) d u * 6 7 . * d ( r 3 )( .t e ) o r:,^r o
403 8
Nr. 12
Using the equation
u u ' ;t ( l 3 ) z l s ( s z ) : o '
weger (ol
-'
: -
(20)
a
- f e(r B )]s(1 s)y7 s(8 2 )(o I t7r ( r ) ,a,( 3) Jlo>dr "' \ ; [1 J_-
I ine(p)rI @')l d;f\o*"'too"o*u'(l')7"s(zz)lnQ'z)
(21
t i6r+f-sQ *)yns(r2). Introclucing(21) into (1b) we obtain ( O| {[7, (r), yt(o) (ct')], 1p@ (x:,,)>| 0> nn t
: re rr"' r - d p i ^ , y e r 3)yps(82)i - rr er) \a ' u \ (iLn'nr'or*",s(l +n@)-ine(p)n(pr)l
-"'
c.r
\
al-
Pr"'
o * " ' d * r v s ( r s () o l tj r { * ) , ( / ( 3 ) ,f f + > >0l l) s ( 4 2 ) \ * a.l- o
(r.r\
- r ' \ 0 r , , , \ d . r r v s ( 1 (30) l { f G ) , [ i r G ) , f f + 1 1 1 , 1 0 ) s ( 4 2 )
'-a'*z"g7fr1s(1
r)y,s(*2). The first ter-m in (22) describes the vacuum polarization and is quite similar to the corresponding expression for a weak external field (cf. I, Appendix). The remaining terms contain the ano_ malous magnetic moment, the main contribution to the Lamb shift etc. Introducing the notation
- N z0(rB ) 0(8 4 )(0 l l i u @),{f (B) ,lf + .r }Ilo> - LN 20(r 3 )g(r 4 )(0 l {/(3 ), l i u @ ) ,/ta>l}I o> _ 2 _ ! ( N _ 1 ) , "L 6 1 , n r n. ,(6r 3 n) d(34) t:
ie
nn
arar'tto'',r'+i,(x4tAp(p', p)
(23)
404 9
Nr. 12
o(*):]tt+u(n)1,
( 2 3 a)
rve obtain from (22)
(ol;r,lq,q,> | _ - ( 0 l 7 ; l' rq , q ' ) - Z < < n- t q ' ) ' ) +_z ( o-)i n l r ( ( q * q ' ) ' ) lt *, i _-l I * ie( 0 l,t@l,t') Ar (- q',q)<01 v(o) | q) . |
-
I
(24)
L)
This is the desired formula for the matrix element of the current.
Analysis of the Function Ap(p',p). We nov' v'ant to investigate the function Ap(p',p) in some detail, especially studying its behaviour for large values of - (q + q')' in (24). For simpliciff, we put p' : k * 4 and study i eA e ( p ' , p ) :
r'r' I (3t'1-ip(r4\ 1 s 2 0{ ( r 3 ) 0 ( . r a ) ( 0 1{ / ( 3 ) , d*" r'ip' ! \d*"' [,ru,
U o @ ) , 1 ( 4) )l o l 1 - 0 ( r 3 ) 0 ( 3 4 ) ( o tl " r r ( " ) , { f ( s f) (, 4 ) i l | 0 ) } '
i
We treat the two terms in (25) separately. The first vacuum expectation value can be transformed to momentum space with the aid of the functions
t \ i ' )@ ' , p ): v , : Z ( 0 1 / zl ') ( z ' l j o l r ) ( r l f l o )
(26)
t \ , - () p ' , p ): v ' ) . < 0 l i l t ' ) < r ' l i * l r ) < r l l 1 0 )
(27)
n l !@ ' , 0->- , ' ) i : (0 l / l z ' )( z ' l i l ' >< z l i k l D
(28)
a ? \Q ' , p ): v ' 2 . ( o l , rzo')l ( z 'l f l r >< z l f l l) .
(2e)
ii:,,=i,
It then follows that ( 0 | {/(3), lt^(r), f(4)l ) | 0 ) :
u1,;( ffi
(3r)+ipr'+}Af) rip' @" p)
.. -'iP'(34) nf)(P" P) :,:f.',:( :!.J^i!,' r';,,.r';;-"
(30)
405
Nr. 12
10
Our discussion started with the assumption that all the renormalization constants and, of course, all the matrix elements of and /(r) are finite. As this'is a condition on the operators jr(r) the behaviour of, for example, the function II (p') for large values of -p2, and as this function is defined as a sum of matrix elements, it is clear tirat we also have a condition on the matrix elcments themselves, i. e. on the functions A and B defined in p ' ) d .T o g e t and -(p(26)-(29) for large values of -p2,-p'2 more detailed information on this point we consider the expressi
(31)
u'ith
Frr(r-r")
: 0( " - * " ) < t l l i p ( * ) , i r ( * " ) ) | " )
(32)
(cf. I, Eq. (A. 8) and the equation of motion fol Au (r)). Supposing, for simplicity, that I z ) does not contain a photon u'ith energy-momentum vector 1c, we have
t
("ljp(")lz, k)
t'-*r*,(
(') | * ) + i \ dt:"Fp,(*-* ,,)(olnf o lAto) {"")l*>.1
\Vriting
Fr1(r- r") : 0(* -.,:"),#
5
rrt (n) dp"io@-*")
(34)
and using the formula e(r-r")
: *"5 41,i,<*^-*"')
(35)
dpein'-t"){FpQ) + inFri.@))
(36)
we get
iFrtt(r- *") :,#
\
.J
with
F6@) : e \ * r a ( p , p o *i .
(37)
(33)
406
Nr. 12
1l
We further note that from (34) it follo'ws that
F r 1 f u :) v : ( z l i t l z ') ( r ' l iu l z ) - I ' f ( r i p l r ' ) ( z ' l i , . l z ) .( 3 4 ) p,- ,:
t)\- t:
I)'-,+ p
p'-t-D
If every expression appearing in our formalism is finite, the integral in (37) must converge. This means thatr)
I::.It(P'Po): Putting p :
A:
o'
( 3e)
fr rve then get frorn (38) and
( 3e)
1i o l " ' ) l ' ( - 1 ) N ! :+r x i : ' r: 0
lim )
|(
lirn )
| ( 1 1 io l " ' > l ' ( - 1 ) N l " ) + N l :" i 0 . -P P(t)
"
(.i0 a)
po:>a Dt:'):p(.)+ p
and Po )
6
Pl''):
(40 b)
If we first consider a state I z ) rvith no scalar or longitudinal photons, it can be shorvn with the aid of the gauge-invariance of the current operator (cf. I, p. 126. Eq. @Z) there can be verified explicitly rvith the aid of (32) and (33) above) that only states I z') rvith transversal photons will give a non-vanishing contribution to (a0 a) and (40 b), and these contributions are all positive. \Ve thus obtain the result
('ljolz')l' : 0 - l i m' t|+ a
(41)
tP\-'-P\o'
if none of the states lz) and lz' ) contains a scalar or a lorrgitudinal photon. Because of Lorenlz invariance which requires that Eq. ( t) is valid in every coordinate system, it follows, horvever, that (41) must be valid for all kinds of states. If lve make a Lotentz transformation, the "transversal" states in the new coordinate system will in general be a mixture of all kinds of states in the old system. If (41) were not valid also for the scalar and longitudinal states in the old system, it could not hold for the transversal states in the nerv system. 1) The case in which the integrals converge without will be discussed in the Appendix.
the functions vanishing
407 Nr.12
L2 From equation (41) rve conclude that
lim Al+)(p,,p) : o
@2 a)
-(p-P')"+a
lim B[+)(p,, p) :
o
(42b)
__pr_>a
rim B[-r (p,,p) : o. t'2 .>
(a2 c)
a
It is, of course, not immediately clear that the sum over all the must vanish because every term vanishes. terms in (26)-(29) What really follows from (40) is, however, that the sum of all must vanish. If the limits in the absolute values of (zlirlz') performed in such a way that p2 and p'2 are A and B are then one of the p2's are kept fixed kept fixed for ,4 and (p-p')'and for the B's, equations (42) will follow. To summarize the argument so far, we have shown that if we write
( 0 I {f(B), lio@), l(4)l} | o I : we have
(tP
rz 116\ \dpap'
tim F1(1,, P ):
p) (43) r'o'Gr)+ip(r4)Fk(p',
o.
(44)
Pa. ,_rr,o) Fu(p',p) : \:; FkQ
(a5 a)
-(p*p')'>
Introducing
t
a
the notations
ar
and
(45 b)
\(p',il:\+r.(p',pter)
(e is a "vector" with the components er : 0 for 1t * 4 and eo : 1) we find from (44) and the assumption that the integrals in (45) converge that lim F* (p' , p) : -(p-p')'>
(cf. the Appendix). now rvrite
*
With
lim Fo(p',p) :
-(p*p')'>
o
(46)
q
the aid of the notations (a5) we can
408
Nr.12
t3
0 ( r c 3 )a G a ) ( 0 J{ / ( 3 ) , l j o @ ) ,l ( 4 ) l ) | 0 > -r :,r1'"\
fi' ip(r4) apap,.'n'(3r)l F - * @ ,p, l \ tl
(47)
al a)
- n' Fo(p', p) * in (Fo@' p) + F,,(l',1))J . , In quite a similar r,vay it can be sholvn that the second term in (25) can be written in a form analogous b @7) with the aid of a function Gn(p',p) rvhich also has the pr-operties (44) and (46). It thus follows
lim tIo(p', p) :
(48)
o.
-(p-p')'->@
It must be stressed that this property of the function Ax(p,, p) is a consequence of (41) and thus essentially rests on the assumption that all the renormalization constants are finite quantities. It is clear from (24) that the function ,4, transforms as the matrix /p under a Lorentz transformation. The explicit verification of this from (23) is somewhat involvcd but can be carried through with the aid of the identity
0 ( r 3 ) 0 @ a{)f ( 3 ) , l i r @ ) , 1 ( 4 -) 0 l )( * 3 ) 0 ( 3 4 )l i r @ ) , { f ( 3 )f,( 4 ) } l | , . . ) (49) : 0@a )0@3 ){ f ( 4 ),l l r@),r(3 )l } 0(r a )0 G3)lj p( ,) , { f( +) ,/( 3) ) l and the canonical commutators. Eq. (4g) can aiso be usecl to prove the formula
- c-l Ar(- q', q) c : t[(-
t, u')
(50)
which is, hot'ever, also evident from (24) and the charge invariance of the formalism. From the Lorentz invariance it follows that we can r,vrite
Ar(P' ,p)
I
* pt,Ga'a Gr n'* ^)a' lypFa'a t p*gt'el Qyp * nr)e (;t)
:f g' : 0,1 p: 0,1
lvhere the functions .F., G and ,FI are uniquely defined and depending only on pr,p'r, (p-p'), and the signs e(p), e(p,) and e(p-p').From (50) it then follows
409
t4
Nr.12 pes'(- p,p) : Fe'p(- p,, p)
(52 a)
Gaa'(- p, p) : Ha'aG- p, , p).
(52 b)
Utilizing (51) and (52) tlv-eget
q ) : (0 l#) lq,q') R( ( q+ qj) ' ) ie( o l P < ol qi ') A r ,(- s',q )(0 | e to rl (53) + , ^ s ( ( q + u )\ (o r- e ') (0 | ? (0I )s' ) ( o Ip' o) | s) 'where, in vierv of (a8), I i m R ( ( q * q ' ) r ) : l i m s ( ( s * q ' ) z ): 0 .
- @ + q ' F+ r
* ( q + q ' ) '+ o
(54)
The equations (53) and (54) are the desired result of this paragraph.
Completion
of the Proof.
We are nor*' nearly at the end of our discussion. From the assumptions made about II (p') (and its consequences for fr (p'), cf. the Appendix), Eqs. (53) and (54), the limit of Eq. (24) reduces to
liry.( {lIi,l q,q') -: ( oITtor I q,q')jI + n( 0)+ 2'I-r r r -Ll I I
-(q-q')'+q
(55)
:(olffrlq'q')E Oul inequalitl' (a) nov' gives
n Q\)+
>'i
q+q : p
' .'*' ZI
: n(D( pz) (#) Except e2 and
for the possibility mt\ ql
Lt- l
-
"r"
(56)
(T=-;f
of l/ being exactly of | (irra"p"naent 'we have then proved that, if all the renormaliza-
4to Nr. 12
t5
1 I and are finite, the function (1 _ Z) N oo. This is an obvious II(p') cannot approach zero for -p,contradiction and the only remaining possibility is that at least one (and probably all) of the renormalization constants is inIinite. ,1 The case ,A/: ; is rather too special to be considered seriously. tion
constants K,
We can note, hoiever, that N must approach 1 fbr e -> 0 and that one of the integrals in I Eq. (75) will diverge at the lower limit for p --."0, independent of the value of e. The constant N could thus at the utmost be equal to
1
i
for some special combina-
tion (or combinations). p zof e2 and 4.
O, pr is an arbitrarily
small quantity it is hardly possible to ascribe any physical significance to such a solution, even if it does exist. The proof presented here makes no pretence at being satisfactory from a rigorous, mathematical point of vierv. It contains, for example, a large number of interchanges of orders of integrations, limiting processes and so on. From a strictly logical point of view we cannot exclude the possibility that a more singular solution exists lvhere such formal operations are not allowed. It would, hov'ever, be rather hard to understand how the excellent agreement betrveen experimental results and lowest order perturbation theory calculations could be explained on the basis of such a solution.
Appendix. It has been stated and used above that: if
f("): where /(r) and fulfills
,\:H-
(/(0) :
0)
(A.1)
is bounded and continuous for all finite values of c
lf (*+u)-
f (r)l(Mlc I
for all c
(^.2)
4il
l6
Nr.l2
anrl if the integral converges,both /(r) and l(r) u.ill vanish for large values of the argument. This is not strictly true, and in this appendix we uill study that point in.some detail. We begin by proving that if the integral in (A. 1) converges absolutely and if
,
lim logrl/(r)l:0
(A.3)
I->A
it follows that
lim l(c) : 0.
(A.4)
c->t€
o" (Note that the integral \ =+ is nof convergent ancl that the n-tog_ e J vanishing of f (r) is already implicit in (A. B).) To get an upper bound for f(c) v'hen c ) 0 we rvrite
:"5,9,, : (J:"ffn1y,.,, (A r) r(*) ('n
",..,
/f
t'
(? r'
(The limit o is simpler and need not be discussed explicitly.) The absolute value of the first term in (A. b) is obviously less than , ('l'
, f"l'
i\trrolldv(const 1\,ffr.*0. uo ll
(A.6)
Uo
The last term can be treated in a similar way and vields the result
It'--,.
I o?-..'
I Ji,n
I uetp
l\#qoul=\l/(Yrlds-0. "l:l rru_r UIB
(A.7)
,\P,.,1: c)-.r(r-r,,1 \"*,,"*
The remaining term can be written
.Fl'"*y)-,I.5"f r(, l,o.,, |.!:+1,,,-,, (A. {t)
412
Nr.12
t7
In view of (A. 2) and (A. 3), the three terms in (A. 8) vanish separately for large values of c. It thus follows lim l(c) :
g
t>4
q.e.d.
As the functi
f(') :
ds;f Q) : o for yS o
+q -'-
"\. ) 0 "
(A.e)
will follow
(A.10)
r(r):-#"\Pr' where both /(c) Note that
and f(c)
are finite.
r "fl---er)(z-c)
n2'
:
#\
-,o-, I
t
)!z--
utuz a',"* * '' tz' "-iw'lx-iw11t
l-t*rl
(A.1t)
.P :
;\
dwrsi"(u-'):
d(y-c).
It then follows ,r1", ,n" integral
Dt'a,- [!14(')J a, fT-qtu.r< llclc
AU
is divergent, because the second term is convergent in view of (A. 10). This is everything that is needed for the proof. Drn.Xrt.Fyr.f,edd.27,N.l2,
2
413 18
Nr. 12
It is, of course, possible to construct functions f(r) where (A. 10) does not follow from (A. 9). In that case we are not allowed to interchange the order of the integrations in (A. 11); but we have already excluded such cases from our discussion. For simplicity, the statement that the functions ..vanish,' for large values of the variables has been used in the text. If a more careful argument is wanted the phrase "the functions have the property that the integral (.*^ , .
t /(r.) ' \'"'
,U
converges" should be substituted for the word "vanish" places.
in many
The author wishes to express his gratitude to professor Nrrr,s BoHn, Professor C. Morr,rn, and professor T. Gusra.soN for their kind interest. He is also indebted to professor M. Rrusz for an interesting discussion of the problems treated in the appendix. CERN (European Council lor Nuclear Research\ Theoretical Studg Group at the Institute lor Theoretical Phgsics, uniuersitg ol Copenhagen, and D_ep.artm.entol Mechqnics and. Mathematical phgsics, Uniuersitg ol Lund..
Indleveret I.'erdig
til selskabet den 28. februar 1953. fra t['kkeriet den 27. maj lgb3.
414
P o p e r3 4
On the Self-Energy of a Bound Electron* Nonu,llr M. Knoll** AND WrLLrs E. Leuo, Jn. Columbia University, Nm York, New York (Received October 7, 1948) The electromagnetic shift of the energy levels of a bound electron has been calculafed on the basis of the usual formulation of relativistic quantum electrodynamics and positron theory. The theory gives a linite result of 105,1megacycles per second for the shift 2'S1-2'P1 in hydrogen, in close agreement with the non-relativistic calculation by Bethe.
I. INTRODUCTION
1927-1934 formulation of quantum electrodynamics due to Dirac, Heisenberg, Pauli, and PETHET has recently discussedthe anomalous L) fine structure2in hydrogen on the basis of Weisskopf. It will appear from this that the relativistic invariance of the present non-relativistic quantum electrodynamics. His formal theory is to some degree illusory in that all selfresult for the 2254-22Pqdisplacementwas energies diverge logarithmically, so that the difference of two energies such as W(2'S) and AW:W(2'S) -W(z'P\) : (aiRy/3r) log(K /z), (1) W(z'P), although finite, is not necessarily unique. The method we have used has a certain the fine structure con- simplicity in its motivation, however, and the where a:e2/hc-l/137 stant, Ry the Rydberg energy a2mc2f2, and e results are surprisingly plausiblein their mathean average excitation energy of the atom, calcu- matical appearance.In any case,the calculations lated to be 17.8Ry. As Bethe's calculation di- may serve as an illustration of the extent to verged logarithmically, it was necessaryfor him which physical results may be derived from a to introduce a cut-off energy K for the light divergent field theory. quanta which could be emitted and reabsorbed The calculation is incomplete in several well by the atom. On the basis of speculations as to defined respects. It is only made to order a in the improved convergenceof a relativistic calcu- the coupling between the electron and the electrolation which included positron theoretic effects, magnetic field, and to fourth order in the ratio Bethe took K equal to mc2.This led to a value of of the velocity of the atomic electron to the LW/h:1040 megacyclesper second,which was velocity of light. It is expected that these dein very good agreement with the then available ficiencieswill be made up elsewhere.We will observationsof 1000 Mc/sec. make no effort to improve on the low frequency The purpose of this paper is to show that a part of the calculation as done by Bethe, for this relativistic calculation of AIl does,in fact, give is essentially a non-relativistic problem. a convergent answer, and to present the results II. DERIVATIONOF EQUATIONSFOR and some details of a calculation based on the SELF-ENERGY * Work suooorted bv the Sisnal Corps. +* Now Niiional RisearchFellow ;t The Institute foi Advanced Studv. I H. A. Bethe, Phys. Rev. 72,339 (1947). It may be of sme interest to obserue that if the non-relativistic theory is taken seriouslv to such an extent that retardation and tJre recoil enercy- ir the enerqy denominators are retained, the dynamic dli-energy diverges only logarithmically, and tlre S-Pr level shift conilerges,and, in fact, with K determined to be K -2mc2, The iesulting shift of 1134 Mc is in disagreement with the ob*rations. z W. E. Lamb, Jr. and R. C. Retherford, Phys. Rev. 72,
We start from the Hamiltonian for a system of iy' electrons moving in an external static electric potential energy field Tand interacting with the radiation field. After elimination of the longitudinal and scalar photons in the usual vr'ay, we obtain the Hamiltonian
24r(re47).
. A lat6r tentative value reported at the April 1948 Society meeting was 1065+20 Washington Physiel Mc/ec.
388
H:H,^alH*"r*I1i'r,
(2)
u"^:Iax'LNtxhclkl,
(3)
415 389
SELF_ENERGY OF A BOUND l1-*-I
lcs.;.pti0;mc2+V(rc)1,
e)
1.:1
n/
Hn": -L
nr'nr'
e s ,A . (r)ti
t L
"r/,o;.
(5)
Here e is the (negative) charge on the electron; s, A are the'Dirac matrices in the form
":(:
,:(;
;)
-:)
(6)
where c are the usual two-component Pauli matrices. The vector potential of the radiation field is expanded in plane ivaves normalized in the continuous spectrum as 2
A(r):- rtP*1fat | tr:l
(hc/k)rbw.eux X exp(lk.r) f conj., (7)
where bp1+, b11 are the creation and destruction operators for a light quantum of wave vector k and po,larization type I:1, 2. In positron theory, because of the indefiniteness of the number of electrons, it is convenient to use second quantization for the electrons as well as for the light quanta. Then II-u,+
| a x r t r * 1 x ;{ c a . p l B n c 2 l
lz(x) lrl:(x),
(8)
f
H inr+ - | dxq+1x)eo.A(x)t(x) ff
* | | dxdx',1+6),1,9)(e'/lx-x'l) JJ Xt+(x')t(x') :Hr*Hc,
ELECTRON
in state a plus the vacuum electrons" minus "self-energy of the vacuum electrons alone." The highly divergent interaction of the extra electron with the infinite charge density of the vacuum electrons must still be removed. This is done by the process of symmetrization 6 in which the calculation is also made using the equally justified picture that all the electrons in existence are positively charged, so that the observance of a negatively charged electron in state a corresponds to a vacancy in the sea of negative energy states othenvise filled with positively charged particles. Then the results of the two methods of calculation are averaged. Since in the first picture there is one particle present in addition to the vacuum particles, and in the second picture one partiile fewer, the self-term i:j in the electrostatic energy cancels out as does the direct Coulomb interaction between the bound electron and the vacuum electrons, insofar as the latter are not polarized by an external electric field. The result is an avoidance of all singularities worse than logarithmic, and these may be plausibly discarded by renormalization of charge6 and mass. The self-energy to order a consists of the first-order Coulomb self-energy Wc and the second-order electrodynamic self-energy Wo. The former will split naturally into a direct or vacuum polarization term Wp and a static exchange term Ws. The static and dynamic terms trZs and Wo were first calculated by Weisskopf6 in 1934 for the case of a free electron. To calculate Wc we need the expectation values of the operator
(9)
I cr
where {+(r) and r[(r) are, respectively, creation H":- | | dxdx'r!+(x)r!(x) 2J J and destruction operators for an electron. We will expand r!(r) in terms of the eigenfunctions X(e'/ lx-x' l)q+(x/){(x/) (11) u"(x) of the potential field Iz
:t
t(x):I
a"u"(x),
(10)
where the coefficients a, are operators corresponding to the destruction of an electron in state n, etc. We are concerned with the self-energy of a "single" electron bound in some stationary state u"(r) in the potential field Z. In positron theory, this is taken to mean "self-energy of one electron
I ZEA"uda"+opafo'6,
aFt6
where 1rr A.tut:- | | dxdx'u"*(x)u6@) 2J J
X (e'/ I,x-x'l)ur*
(x')u6(r'),
'W. Hei*nberg, Zeits.f. Physik 90,209 (1934). 6V. F. Weisskopf,Zeits. f. Physik 90, 817 (1934). I P. A, M. Dimc, Solvay Congress,1933.
(12)
416 N. for the states represented functionals
M.
KROLL
AND
by the Schrtidinger
O ( 0 1 p ; 0 1 " ; 1 1 ' ,i1D) (, 0 ( p ) l c ) ) , s e c o n d p i c t u r e .
E.
390
JR.
L A o ' >< , ' > a ' ) G+' ) I
D A
(r) (r';
(13a)
A<,,>ato't<,'t,
*D (r')
(13b)
Here r denotes any positive energy state, while a prime indicates the exclusion of the state a occupied by the bound electron. The letters p, o denote any negative energy state, while the indices z, d, P, y, and 6 are to be used for a complete set of states of any energy whatever. In the alternate picture, a positive energy state of a positive particle is represented by (p) and a negative energy state bV (r), (s), The state whose vacancy constitutes our electron is denoted by (a). Consider the expectation value a " + a B a r + a 5 A . p rO , (1"0,,1,).
a8+6
The vacuum term is I
(r) (p)
and the difference , 4 r o l r , t c , t r-" r L A o a t t o > < " > - 2 L A t a > < " t < o < , t
I
(')
G)
* Ort,"r,")(')- 2
: *
F
A <"tt"t<,tt,t,
represents the self-energy of the electron in state a as calculated on the basis of the positive particle picture. The wave functions un and uh, are identical for physical reasons, so that we may now drop the parentheses. Averaging the two results. we obtain
L +A""""+L +A"""":Ws*Wp. The static term
Aoo,o+L A""oo
I A c>
E lot,rr"rro *E
G) (r)
Using the matrix elementsbfor the destruction and creation operators, we obtain L 4",,"+L
LAMB,
and obtain L
o(1.0",1r),6(0,1,), firstpicture,
O*(1"0,,1,) |
w.
(14)
/v^9rs
rP
r I\ -L \ - ,L .rl 'd d o p
- ILLF \ -L ./L .p .f . f . o .
Ws:L
P r'
rf
=+ ? *; J dxdx'u"*(x)u"(x) X (e,/ lx-x' l)u**(x')u"(x'),(15)
Subtracting ,tr" *ru.unrri,"r-. \L -L .\--d /d p p t L L / rL p\ f-r \r -t / poPf
the self-energy of the electron in state a on the basis of the negative particle picture is \L l-t a/f r a
r
r |r \! - L/
lrdapp
|
-\-/ L
*o"* nF
I
by
"*2EA""oo,
use of a Fourier
representation
for
l/lx-x'l _: lx-x'l
where the upper or lower sign is to be taken for a positive or negative energy state, respectively. The first term represents an exchange term and diverges only logarithmically. The last term is the direct Coulomb energy of the electron in state a interacting with the sea of negative energy electrons and diverges quadratically. As mentioned above, the worst part of this diy'ergence is removed by the process of symmetrization. On the basis of the alternate picture, we therefore calculate the expectation value O*(010"1,,)
and
tf zLapta
' :L
+4""""
(1/zr'z) t dk exp(zk.(x-x'))/k2, r
(16)
may be written as
w s: e,/ao,) f
d!
T + [ an.*rxl
X exp(ik'x)2,(x) d'x'u^*(x') f XexP(- ik' x')u"(x'). (17) The polarization
tetm WP(a)
w,(o):T-o*-:;
a61+a61a61+a1rl.4t.lrplr"lcrl
(a) G) (z) (;)
(D(0aoy01"111",y),
xl
r
r'
ezr
J
axlu"lx)lz
dx'
_I+lz"(x')1'?, Ix-xln
(18)
417 OF
SELF-ENERGY
39T may be written
A
as
BOUND
ELECTRON
gres ls ?
Wptu): I dxlu"(x)l'ett(x),
(1e)
- l df 2k r
wherethe functionl,*, ,. ,n. potentialdue to a
- E")) ( | (/k IH, Io) l' / (8,1-hck
[t
1 l( o kl H , l o )l ' / ( E " * h c k * E " ) )
-5-
charge densityT-e
p ( x ) : ( e / 2 )L + l u " ( x ) | " ' z ,
*2 I ((aIH' I ak)(pkiEtlp)/tuk)f,
(20) which
induced in the vacuum by the external electrostatic 6eld. The energy trZp vanishes for a free electron. The second-order electrodynamic self-energy Wn@) of the electron in state o, according to the electron picture, is given by the difference of the energy Wo(1"0,,1r) for the electron in state o plus the vacuum electrons and the energy Wo(0,1,) of the vacuum electrons alone. The vacuum energy Wo(0,1r) is given by second-order perturbation theory, and involves the virtual emission and re-absorption of a light quantum of wave vector k and polarization type tr. There are two types of terms, representedby the following transition schemes: ,/o+r*k1 I landl \rlk-p/
may
be written
as
fz
- | dk E tE (+ | (nkllJtla)l'/ J
r-r
( l E " l + h c k + E " ) ) + 2Z , ( ( a l I { r l a k ) x(pkllJrld/hck)1. affects only
Symmetrization and gives
the second term,
f2
wo@):-ld.kLL J
x:t"
v (+ | (zkIr{r | l' / (lE"l *hek+ E")) ") f2
/p-p+k\ \ofk+o./
1'
+fdktE Jr-tn
X (+ (alHr lak)(nkllJrlnt/hch). In the caseof the energy WDQ"0,'1), there are added which the transitions additional some The last term can be written as electron can make, and some of the previously presence prevented by the are allowed transitions f2 (dk/h,)u L of the atomic electron in state o. One has then | J r:rn the following types of transitions: ,zo+r'tk1
I
\r, |-k-p/
t,|
1a+r*k\ \r1-k-a/
L
/o+o*k1 /n*n-|k1 I t,| L \a*k-a/ \o*k*o,/
r1a+ai-k\ l \Pfk+P,'/ The difference of the two corresponding ener? See reference a, Eq. (40). 8 E. A. Uehlins, Phys. Rev' 48, 55 (1935), W. Pauli and M. Rose. Phvs. Rev. 49,462 (1936), and V' F. Weisskopf, Ksl. Danske Vid. Sels. Math.-Fys' Medd 14, No. 6 (1936). i R. Serber, Phys. Rev. 48, 49 (1935).
+
ff
J J
(21)
dxdx'u"*(r)o'errz.(x)
X exp(ik' (x - x'))2"*(x')a'erran(x'), 'rvhichwill be zero if the polarization currentT-e j(x'):el
+u"*(r')uu"(x')
(22)
is zero, In the absenceof an external vector potential, this current is in fact zero' so that the last term in Eq. (21) will henceforthbe dropped. It should be noted that two physically different k-spacesare involved in the expressionsEq. (17) and Eq. (21) for Ws and Wo'
418 N,
M.
KROLL
AN D W.
III. COMPUTATION OF THE SELF-ENERGIES We turn now to the evaluation of the expressions trZs, Wo, Wp for the self-energy. We shall pay particular attention to the static and dynamic terms Ws and Wo, as the polarization (Uehling) term Wp is directly related to the polarization charge density which has been cdmputed by others.T-e In the calculations to follow, relativistic units will be used throughout, in which h, m, and c are taken equal to unity. Our main interest, of course, is in the case of an electron moving in a Coulomb field for which V : - e2/r. The integrals like
E.
LAMB,
392
JR.
The sums over n can be performed, at least formally, by making use of the completeness of the solutions of the Dirac equation. Thus I,I/s can be written in the form ff W s: (e2/4r2) | (dk/h\ J
| dxu"* (x)
| NJ
xexp(ik x)(H/ | u |) u"1x7 dx'u (x') "* J Xexp(-
ik.x')u"(x'),
(23)
where H:a.p*AlV
Q4)
is the Hamiltonian of the unperturbed electronic motion, and I 111 the absolute value of the Hamil(nklr.ll a) : - (ie/ zrntl d.xu^*(x)a. ey1 tonian, by which we mean an operator having I the same eigenstates and spectrum as the HamilXexp(-zt.x)2"(x) tonian, except that its eigenvalues are taken to occur in the theory of the relativistic photo- be positive. It can most conveniently be comelectric effect and have been studied extensively puted by representing it as +(11,);. The equivaby Hall.10Becauseof their complexity, it seems lence of Eqs. (17) and (23) follows from the fact is f 1 when operating on a positive hardly likely that we could perform the neces- that H/lHl energy state and - 1 when operating on a negasary further operations on them to evaluate such tive energy state. Using the completeness of the expressionsas Wp. Even more must such a direct u"(x), we now find attack be ruled out for the case of an electron
moving in a general potential field I/(x), for rf which the relativistic eigenfunctionsu^(x) are W 5 : (e2/ 4r2) t| , (dk,/ hr) | dxu"* (x) not known. The only remaining method of apxexp(i.k. x) (H / | -r1l) exp( -lk . x)u"(x), (25) proach seems to be to make an expansion of some kind. We observethat if the electron is free, so that the problem of computing IZs is reduced the evaluation of the sums is a comparatively to that of finding the expectation value of the simple matter. Thus, if u"(x) is a plane wave of operator momentum p, then
r.
I dxu"*u' e exp( ik. x)zr,
(e2/4r2)
| (dk/k'z)
Xexp(-ik.x)(H/ lHl) exp(f zt.x) is different from zero only if the momentum of the state a is kfp, and there are only four such for the state o. We first note that for anv.oolvstates.In the caseof a "weakly" bound electron, nomial function f(p, V) i.e.,for l@, V) exp(fik.x)2,(x)
h>>(al lpl la),
one might expect that the matrix element above would have an appreciable value only when lE" I is of the order Ep:*(1+p)1. We take advantage of this fact in the method of calculation used. r0 H. Hall, Rev. Mod. Phys. 8, 358 (1936).
: exp(f ik. x)/(k-fp, V )u"(x). This theorem may then be usedfor any function J6, V) such as H/IHI, for which a seriesexpansion in p and tr/ is valid. We therefore write (H/|Hl)
exp(tu.x)u"(x) : exp(tu. x) (H / | H | )y+pu "(x),
419 393
SELF_ENERGY
OF
A
where the notation
BOUND
ELECTRON
where
(
Eh: (r+k')\' no: lHl -E*,
)r_n
means that the operator p is to be replaced by kfp everywhereit appearswithin the brackets. One then has
Do+:BrSP-t, w":E"-1.
(31) (s2) (33) (34)
is, of course, also necessaryto evaluate A* ws: @'/a,,\("ll #r, t"rl,*1,).
t lr
wo: -Q,/4r')lal I tav.rDE o.eo^ ^:r \ lr'
r/ H x{l-*1
\
\ : IH I pay- E1,: (Ey, | 6y)r- E* :i6t-t(6v" / E) { }6(6v3 / E*') +,'' " 'lvhere 6r : 2k. p-|p'* 2 V (c.kl- s. p -f 9) le'aVlV2.
/ l/(lHl+h-E')
(3s)
(36)
t\lr1l //
All expansions indicated are to be carried to sufficiently high order so as to include all terms which are effectively of the fourth or lower order in vfc. The operator p is obviously of 6rst order The evaluation of the terms trZs and JZo thus in zt/c, V is of second order becauseof the virial theorem, while k, 0 are of zeroth order. Since hinges on the expressionof the operators e.,2:sr2:su2:1, a must be regardedas a zeto' (r/lijl)*p and (r/ (lHl lk+E"))v+p order quantity until the expansionhas beenfully out. worked whose expectation values operators of in terms The expansionof the operatorsin the manner can be readily obtained. indicated and the summation over the polatizaTurning now to this task, we write tion direction I:1, 2 is a straightforward but ( lfll)r+r: ((/1'zf)rap: ((("'p*0* 7)')i)t+o lengthy matter. This being completed, one is : ((1*p'*"'p V I Va'Pl2tsV* tr/'z),)11o left with a sum of expectationvalues of various : (l + k' +2k -p *p'z-l 2 V(a' k|_o' P+B) operators (28) la'rV*V2)\, 1 , 0 , a . p , 0 o ' p ,p ' , A p ' ,V , A V , where rr denotes an operator p which operates t' nV Xp, a' nV, 8s' *'V, d2V, Vp2,p2V' nV'p' only on the quantity immediately following it gn,Bpn,V2,FV2, Va.p, tsVc'9, ( e . g . c, ' p 7 : V t ' 9 * e ' t V ) . a . p p zw , : E - 1: c . p * F * V - 1 , w 2 ,B w , It is clear that the ratio of (1111)r+pto combination of some fifty Q+n'1t approachesunity as & becomeslarge, each multiplied by a which correspondsto'our previous statement elementary integrals over k. The result of this regarding the relationship between the magni- calculation is given in Eq. (73) below. Before tude of the matrix elements and the energy of coming to it, we shall first discuss briefly the validity of the expansion used and the form in the state z. Thus we exPand which the sdlf-energyis expressed' (r/l/Jl)t+n and (t/ lHl lk+E")t+p Assuming for the moment that p and tr/ can be regarded as numbers less than unity (in as follows: relativistic units), then the expansionsof
r,r .(#,-') / tH|+k+E") |"*;."-^1,)
(r/ lHl)*r:r/(E**^)
- - L / E * - A * / E n ' * L f/ E r x - ' ' ' , ( 2 9 )
(r/(lHl +ft+8,))r+p :l/(Dk++Lk+w"):1/Do* -(a*+w")/(D.+)',*' ' ',
(r/lEl)t+o
and (r/(lHl+k+E"))k+,'
(if carried far enough) are valid for all values of ft, sinceEr approachesunity and D,'- approaches (30) two as & goes to zero. On the other hand, Dl+
420 N.
M.
KROLL
AND
approaches zero in this limit, so that one should examine the low ft behavior for this case. It turns out that that part of A*-w"' which does not approach zero as & approaches zero is of the order (u/c)2.Therefore, in the term involving Df we shall carry our integrals down wave number &r, only to some intermediate which, for convenience, v/e take to be of order aB. This term must then be given a separate treatment for the low & region O
(32)
".(:),
V ) ) " . p * Z * 1- E ) 4 :
@ : ( r/ ( 1 + E - V ) ) " . p 0 ,
E.
LAMB,
394
JR.
ord,er (a/c)2 is just a non-relativistic two-component Pauli-Schrcidinger wave function. One can'then see, for example, that
\ a l p '- 0 p ' l a :) 2 | d x \ (1/ ( 1+ E - V )) I
Xo' p6)ap'z(r / (1+-E - V) )". pQ" tf
__ | dx6"*p,e" 2J
-i@lp"la),
(40)
since l-E and V are of order (v/c)2, and f dx+"pn6" and (a lpa lo) differ only by a quantity of order (a/c)6. One can therefore simplify the final result Eq. (73) by expressing all operators in terms of certain arbitrarily we have taken to be
chosen ones which
0 , u ' p , V , d 2 V ,B a ' n V , Vp2,pa, and V2. Our reduction is obtained by using the following relations betrveen expectation values
1+Bf |c.p-$pa{f,pa.dV,
(41\
p'-,r.p**pn*ttsa.zrV,
( a?\
. p p z+ c . p f
|Pc.nI/,
(43)
BV+V-lVp'z-i]r.nV,
(44\
0c'P-0,
//.c\
p2V+Vp2,
(46)
dV'p+-|22V,
(47)
s.rV+0,
148)
'n.-VXp-!a."1't!r2V, \/s.pnVp2{,ils.*V, B V a . p +- l B a . n I ; , a'PP'+P',
(49)
(s0) (s1) (s2)
0pn*pn,
where{ and o are two-componentv,/avefunctions satisfying ( a ' p ( r /( r - l E -
rv.
s,
133;
(3e)
so that for a positive energy state o is of order v/c with respect to @, rvhich apart from terms of
rq4)
BV2+V2, w : H - t_+tc.p+ 7++p'-+Ba. rV,
(55)
- i Vp' - *9a. xV, Bza+lu. p I V - L"pn
(56)
ze,r-lpr* Vp"+Vr.
(57)
From these relations one finds that the total
421 SELF-ENERGY
395
OF
A
BOUN D
ELECTRON
contribution of the static and dynamic terms of the self-energy, apart from the low & contribution of the term involving D1+, is
is the induced charge density calculated by various authors. To the order required, this is found to be
(ws* wo)': ("/d (.1(:f,- o,ru;++)s
pqx):(e/6n,)v,Vl
/f\ i+ | G,dk/Ek') | \.-rt l(e/60r')vaV,
* * c . p * 3 c . p f t ; ! f , P an. V - ( $ l o g ( 1 / E )
(63)
from which one finds
- ! tog2 | 1r/ 72)",v1"). (s8)
lf\ - (2e'z/3r)l l*
Wp(a): In order to compute the low i contribution of the term involving Dr+, it is convenient to take advantage of the essentially non-relativistic nature of this region and to make use of the previously discussed large and small component reduction Eq. (37). One then readily finds that the resultant expression has, to the order required, just the form of the non-relativistic self-energy, so that Bethe'st calculaticlr may be used up to the frequency k;. The contribution is
| &'dh/Ef)
|
\J/
x (al v' la) - 1e/ ts"11alv,v la).
(64)
The prime appearing on tr/' is used to indicate that the gaugeof Z'has beendeterminedby the fact that it arises from an expressionof the form F
-1/4n I dx'vzV(x')/lx-x'1. I
(65)
We shouldlike to point out that the expression for p(x') can be readily and neatly calculatedby - g(losk;/z)n'zV : (a/r)(al - frp'&; la). (59) methods very similar to those used above in the caseof the static and dynamic terms in the selfAdding the t\4'o, and observing that (alp'?lo) is energy. To show this we first evaluate (au.a)*
the same as (alc.plo) to the order required, we find for the total contribution of Ws and Wo
p(x',x") : - (e/Z) | : * (e/2)
us++)a wstwo: @/d ("1(;{,- row *(".p/6)*i,6a.rV-
u"*(x')(Hi lHt)u"(x")
|
44
:-(e/2) I t t @/lgl),,
/l | * log\E
- ! tog2 rrl /7r)"r1"),
*u"* (x')u"(x")
n
r:l
Y:l
\,u^r*(x')u*,(x"),
(66)
where the uny, F: l, 2, 3, 4 are the components of are to be taken and all operators in H/lHl with.respect to x". Making use of the completeness relation
(60) un,
and we note that the result is independentof the joining frequencyfr;. As previously mentioned,the direct Coulomb energy term can be expressed in terms of a polarization charge as follows:
E u"u*?')u",(x"):6u,6(r'-x"),
(67)
we obtain
p(r',x"): - (e/2)L g/ lHl)F6(x'-*")
Wp(a):e I dxla"(x) 'z
p
,
: - ( e / 2 ) ( S p u t H / IH I ) " , , 6 ( x ' - x " ) .
(68)
X I d x ' p ( x ' ) / l x - x ' 1 , ( 6 1 ) To evaluate this, we Fourier-analyze the deltafunction
where p(x'): (e/2) t + l u " ( x ' ) P
I
( 6 2 ) 6(x'-r" 1: Q /8r3) | dk exp(ik. (r" - x')), (69)
422 N.
M
KROLL
AND
TesI-p I.
2tsv,
Ouantity
VY iPs.VV
-1t)
State 2tPVt
22P./,
0 r/6
0 -r/12
and find
p(x',x"):(l/8r3)
f
J
dk exp(-lk'x')
X (Spurfl/ lfll) exp(ik.x"), f
: (t /8r3) dk exp(lk.(r"-x,)) J X (Spur//l1/l )r+n.1(x"), (70) when1(r") is a constantequalto one.Since
p(x'):p(x,,x,),
E.
LAMB,
396
JR
where we have made use of the fact that a: -iY . We obserlre first of all that the worst divergence is logarithmic, and that the expression is invariant to the gauge of I/. We next see that the difference of the selfenergies of the states 225; and 2,P1 of the hydrogen atom does converge, since the expectation values of B, V', (and, therefore, c.p) are, respectively, equal for the two states. (These statements follow from the observation that neither a small change of charge nor mass of the electron will remove the degeneracy.) In order to calculate the numerical difference of W(a) for the two states, we need the values of the expectation values of the remaining operators in Eq. (73). These are given in Table Itr in units of a2Ry. The energy difference is then A,W : (a'Ry /3r)Uog(r
(7r)
/e) -log2
+(23/24)-+7'(74)
which, using Bethe's revised valuesrz for the con-
we obtain, finally,
Yf,,E9#/i;],ii:i'l#ii;:;,i,".-,',J
p(x,):_ (e/,6rs) X f akg ' p' u r 1 l l . 1 7 l ) g 1 r . 1 ( r , ) . J
W.
(72)
The expression can now be readily reduced to the form Eq. (63) bV expanding
and thus differs from the original guess by only a small amount. It should be admitted, however, that one cannot regard this energy difference as uniquely determined, since one is taking the difference of two infinite quantities. With respect to the determination of the abso-
(Spurf/|f1|)tlolutevalueoftheself-energyforastate,itis convenient to attemDt a phvsical interoretation in the manner used for the evaluation of the ofthetermsinvolvei. In thiscontext,ltshould static and dynamic terms. be observed that even if the coefficients of the Z'
rv.rNrERpRErArroN oFRESUr.rs il1'tt'JTff:*T;?.'i-":f::l'::$::#1T: fact that
The total expression for the self-energy is
w(a)
=@/")("1(: I aurcsnt)u -(? awnta,,r++)2, [ * @ . p / 6 ) - ( i / 4 ) 0 s . vv
+ (* rosfr/O- ] log2
rsl v + ( r t/ 72 )-rrU )v, l a ),
( / J,
they would manifest
themselves as a
u It should be mentioned here that the exoected value of V, tr/really diverges for the S stares of the Coulomb field, since it then is equal to the square of the absolute value of the wave functioi at the origiri. Since, however, our evaluation is being carried only to order (u/c)n, one should be able to us the spatial dependence of the Schr
423 397
SELF-ENERGY
OF A BOUND ELECTRON
modification in the real charge and mass of the our self-energy expressionmust be found to give electron, and thus be included in the observed a covariant expressionfor the free electron. We charge and mass. We shall assume that these proceed by subtracting some free electron operterms have been so included in the observed ator from the operators contained in our selfcharge and mass and drop them from the self- energy expressionsuch that the self-energy of a energy expression.The term in dpeu'vlzis just free electron is zero, thereby regarding the total of the form of the interaction of a Pauli-type self-energy as contained in the observed mass. Such a procedureis, of course,not unique: we intrinsic magnetic moment with a static potenmake the simplest subtraction, examine the shall as implying interpreted be thus tial I/ and can an additional electronic magnetic moment of resultant expression, and then investigate the a/2r-Bohr magentons, while the term V27 im- nature of the lack of uniqueness.Thus if one plies a correction to the external potential, or, simply drops the (a/&r)o'p term from the selfmore specifically, an additional short-range inter- energy,one obtains action between the electron and a point charge. 2e) : The term in a 'p is not subject to a direct physical W @) (d/ r) (al iA"' v lry4 + (+ log(1/ interpretation, and, in fact, must be regardedas (7s) + ( 1 1/ 7 2 -t ( r / r s ) having no physical significance. Thus if one ap) v v l a ) ' plies the self-energy expression (73) (with the This expression can be interpreted as arising B- and V' terms omitted as explained above) to in the magnetic moment of the a free electron of momentum p, only the term from an increase 2r'Bohr magnetons and an addiaf of. electron yielding for self-energy the in c.p contributes, given by (a/.6tr)lp'z/(l-lp')il. Now if the electron is to be tional interaction potential / \ relativistic connection particle, the regarded as a -(r/1s) tvv. (76) 6v.ff: - I + los(l/2€)+(1r/72) between the momentum and energy of a particle / \must be retained, so that the self-energy should contribute 68 and 983 Mc/sec., respechave the momentum dependence appropriate These to the level shift. tively, -l/(l*p')' isrs that to a mass correction, with our subtraction prescripIn accordance corresponding to the term in p already subhowever, add any linear combinawe could, tion non-covariant term of the tracted. The presence operators of order up to (a/6tr)lp'/(l-lnz;+], which is reminiscentof the tion of free electron (u/c)a whose expectation value is zero for the free be can self-energy, stress terms in the classical electron. There are seven such operators,\6 uiz., traced to the fact that the total self-energy is l, P, p', 9p', a'p, p', Bpn. The condition that a of the in case the infinite. and can be avoided linear combination gives zero to order constitutes free electron by paying proper attention to the constraints, so that there should be four three k-spaces.ra in the various domains of integration independent combinations giving zero linearly That is, in order to keep the total self-energy free electron. A possible choice for these for the over a finite integrate finite it is necessary to is the following: region of the light quantum space and the elec(76a) a integrates over If one st":l - 9-tp'-c'P*6P'*8Pa, tron momentum space. region which would be spherical for an electron (76b) %:pr-Bp,-*Pa, at rest, a covariant result is obtained. One can(76c) sl":pa-Fpa, not, however, apply this prescription to a bound electron, so that some other means of modifying -0P2. (76d) Oa: a.p 13The enersv of a particle of mass z and morentum p is (mzlpz)t. If z,is irodified by a quantity 62, then the energy to nrst oroer rn ou rs (m2! P2)r! 6m / (mz! Pz)|, as is appropriate for and the correction term with n:1, the electron, is of the form given. nA. p"is. Verh. d. K. Ned. Akad. v. Wet, Section 1, 19, No. 1 (1947).
The expectation values of phe above combinations are all zero for the lfree electron. Their effect upon the self-etergy of a bound electron depends upon their expectation values for a -lfooFtor"
of odd order in t/c have been ignored, as there ire all rero for the bound electron'
424 N. M. KROLL.{ND
\\:. E.
L.{MB,
lR.
398
bound electron. Those for go, Oa,and O" are zero, so that their subtraction would have no physical On the other hand, consequences.
electron has been made by Kusch and Foley,l0 who obtain a value in good agreement with the value a/2zr-Bohr magnetons theoretically computed by Schwinger.lT If we adopt this experi(77) @laala):-i(al\s'vVla)/2, mental and theoretical result, the 2254-22P1 which is precisely the form of interaction of a separation Liecomes uniquely determined to be magnetic moment with a static potential Z. just the value 1051 Mc/sec. obtained above by a Thus, the lack of uniquenessof the subtraction direct subtraction (74) of thd self-energies for prescription is just such as to make the magnetic the two states.
moment correction indeterminate, while-the correction to the potential is left uniquely determined. Now a purely magnetic measurement of the correction to the magnetic moment of the
16P. Kusch and H. M. Foley, Phys. Rev. 74, 250 (1948), also J. E. Nafe and E. B. Nelson, Phys. Rev. 73,718 (1948). ItJ. Schwinger, Phys. Rev. 73,416 (1948) and Pocono Conference. 1948.