Field 2006

Home Search Collections Journals About Contact us My IOPscience
Derivation of the Lorentz force law, the magnetic field concept and the Faraday–Lenz and
magnetic Gauss laws using an invariant formulation of the Lorentz transformation
This content has been downloaded from IOPscience. Please scroll down to see the full text.
2006 Phys. Scr. 73 639
(http://iopscience.iop.org/1402-4896/73/6/018)
View the table of contents for this issue, or go to the journal homepage for more
Download details:
IP Address: 216.47.136.20
This content was downloaded on 16/12/2014 at 18:41
Please note that terms and conditions apply.

INSTITUTE OF PHYSICS PUBLISHING PHYSICA SCRIPTA
Phys. Scr. 73 (2006) 639–647 doi:10.1088/0031-8949/73/6/018
Derivation of the Lorentz force law,

the magnetic field concept and
the Faraday–Lenz and magnetic Gauss
laws using an invariant formulation of
the Lorentz transformation
J H Field
Département de Physique Nucléaire et Corpusculaire Université de Genève, 24,
quai Ernest-Ansermet, CH-1211 Genève 4, Switzerland
E-mail: john.field@cern.ch
Received 28 November 2005

Accepted for publication 2 March 2006
Published 15 May 2006
Online at stacks.iop.org/PhysScr/73/639
Abstract
It is demonstrated how the right-hand sides of the Lorentz transformation equations may be
written, in a Lorentz-invariant manner, as 4-vector scalar products. This implies the existence
of invariant length intervals analogous to invariant proper time intervals. An important
distinction between the physical meanings of the space–time and energy–momentum 4-vectors
is pointed out. The formalism is shown to provide a short derivation of the Lorentz force law
of classical electrodynamics, and the conventional definition of the magnetic field, in terms
of spatial derivatives of the 4-vector potential, as well as the Faraday–Lenz law and the
Gauss law for magnetic fields. The connection between the Gauss law for the electric field
and the electrodynamic Ampère law, due to the 4-vector character of the electromagnetic
potential, is also pointed out.
PACS numbers: 03.30+p, 03.50.De
1. Introduction performed by the present author on the basis of ‘space–time

exchange (STE) symmetry’ [3]. These derivations, for the
Numerous examples exist in the literature of the derivation of case of the non-homogeneous Maxwell equations, are briefly
electrodynamic equations from simpler physical hypotheses. recalled in the last section of the present paper. Frisch and
In Einstein’s original paper on special relativity [1], the Wilets [4] discussed the derivation of Maxwell’s equations
Lorentz force law was derived by performing a Lorentz and the Lorentz force law by application of relativistic
transformation (LT) of the electromagnetic fields and the transforms to the electrostatic Gauss law. Dyson [5] published
space–time coordinates from the rest frame of an electron a proof, due originally to Feynman, of the Lorentz force law,
(where only electrostatic forces act) to the laboratory system the Faraday–Lenz law of induction and the Gauss law for
where the electron is in motion and so also subjected to magnetic fields, based on Newton’s second law in classical
magnetic forces. A similar demonstration was given by mechanics and the quantum commutation relations of position
Schwartz [2] who also showed how the electrodynamic and momentum, that excited considerable interest and a flurry
and magnetodynamic Maxwell equations can be derived of comments and publications [6–11] about a decade ago.
from the Gauss laws of electrostatics and magnetostatics Landau and Lifshitz [12] presented a derivation of Ampère’s
by exploiting the 4-vector character of the electromagnetic law from the electrodynamic Lagrangian, using the principle
current and the symmetry properties of the electromagnetic of least action. By relativistic transformation of the Coulomb
field tensor. The same type of derivation of the electrodynamic force from the rest frame of a charge to another inertial
and magnetodynamic Maxwell equations has recently been system in relative motion, Lorrain et al [13] derived both the
0031-8949/06/060639+09$30.00 © 2006 The Royal Swedish Academy of Sciences Printed in the UK 639
J H Field
Biot–Savart law, for the magnetic field generated by a moving provided by the derivation of the Lorentz force law presented
charge, and the Lorentz force law. in the present paper. The discussion of electrodynamics in
In many textbooks on classical electrodynamics, the [15] is couched entirely in terms of the electromagnetic
question of what are the fundamental physical hypotheses field tensor, F µν , and the electric and magnetic fields which,
underlying the subject, as distinct from purely mathematical like the Lorentz force law and Maxwell’s equations, are
developments of these hypotheses, used to derive predictions, ‘parachuted’ into the exposition without any proof or any
is not discussed in any detail. Indeed, it may even be stated discussion of their interrelatedness. The 4-vector potential
that it is futile to address the question at all. For example, is introduced only in the next-but-last exercise at the end
Jackson [14] states: of the chapter. After the derivation of the Lorentz force law
in section 3 below, a comparison will be made with the
At present it is popular in undergraduate texts and
treatment of the law in [2, 14, 15].
elsewhere to attempt to derive magnetic fields and
The present paper introduces, in the following section,
even Maxwell equations from Coulomb’s law of
the idea of an ‘invariant formulation’ of the LT2 . It will
electrostatics and the theory of special relativity.
be shown that the RHS of the LT equations of space and
It should be immediately obvious that, without
time can be written as 4-vector scalar products, so that the
additional assumptions, this is impossible.
transformed 4-vector components are themselves Lorentz-
This is, perhaps, a true statement. However, if the invariant quantities. Consideration of particular length and
additional assumptions are weak ones, the derivation may still time interval measurements demonstrates that this is a
be a worthwhile exercise. In fact, in the case of Maxwell’s physically meaningful concept. It is pointed out that,
equations, as shown in [2, 3], the ‘additional assumptions’ whereas space and time intervals are, in general, physically
are merely the formal definitions of the electric and magnetic independent physical quantities, this is not the case for
fields in terms of the space–time derivatives of the 4-vector the space and time components of the energy–momentum
potential1 . In the case of the derivation of the Lorentz force 4-vector. In section 3, a derivation of the Lorentz force law,
equation given below, not even the latter assumption is and the associated magnetic field concept, is given, based on
required, as the magnetic field definition appears naturally in the invariant formulation of the LT. The derivation is very
the course of the derivation. short, the only initial hypotheses being the usual definition of
In the chapter on ‘The Electromagnetic Field’ in Misner the electric field in terms of the 4-vector potential (which, in
et al’s book Gravitation [15] can be found the following fact, is also uniquely specified by requiring the definition to
statement: be a covariant one) as well as the definition of the physical
meaning of the electric field, E, E as EE ≡ F/q,
E where FE is
Here and elsewhere in science, as stressed not least the force on a point charge, q, assumed to be a Lorentz-
by Henri Poincaré, that view is out of date which invariant quantity. The Faraday–Lenz law and the Gauss
used to say, “Define your terms before you proceed”. law for magnetic fields are shown to follow, as 3-vector
All the laws and theories of physics, including identities, from covariant definitions of electric and magnetic
the Lorentz force law, have this deep and subtle fields in terms of the 4-vector potential. In section 4, the
character, that they both define the concepts they use time component of Newton’s second law in electrodynamics,
(here BE and E) E and make statements about these
obtained by applying STE symmetry [3] to the Lorentz force
concepts. Contrariwise, the absence of some body of law, is discussed. In section 5, Ampère’s electrodynamic law
theory, law and principle deprives one of the means is derived from the Gauss law of electrostatics by special
properly to define or even use concepts. Any forward relativistic covariance.
step in human knowledge is truly creative in this Throughout this paper it is assumed that the electro-
sense: that theory concept, law, and measurement— magnetic field constitutes, together with the moving charge,
forever inseparable—are born into the world in a conservative system; i.e., effects of radiation, due to the
union. acceleration of the charge, are neglected.
I do not agree that the electric and magnetic fields are
the fundamental concepts of electromagnetism, or that the 2. Invariant formulation of the LT
Lorentz force law cannot be derived from simpler and more
fundamental concepts, but must be ‘swallowed whole’, as The space–time LT equations between two inertial frames S
this passage suggests. As demonstrated in [2, 3] where the and S 0 , written in a space–time symmetric manner, are:
electrodynamic and magnetodynamic Maxwell equations are
derived from those of electrostatics and magnetostatics, a x 0 = γ (x − βx0 ), (2.1)
more economical description of classical electromagnetism is
provided by the 4-vector potential. Another example of this is y 0 = y, (2.2)
1 Actually, a careful examination of the derivation of Ampère’s from the z 0 = z, (2.3)
Gauss law of electrostatics in [3] shows that, although equation (3.4) of
the present paper is a necessary initial assumption, the definition of the x00 = γ (x0 − βx). (2.4)
magnetic field in terms of the spatial derivatives of the 4-vector potential
occurs naturally in the course of the derivation (see equations (5.16) and
(5.17) of [3]) so it is not necessary to assume, at the outset, the expression 2 This should not be confused with a manifestly covariant expression for
for the spatial components of the electromagnetic field tensor as given by the LT, where it is written as a linear 4-vector relation with Lorentz-invariant
equation (5.1) of [3]. coefficients, as in [16].
640
Derivation of the Lorentz force law
The frame S0 moves with velocity, v, relative to S, along the 2

sx,0 ≡ (1x)2 − (1x0 )2 = (1x 0 )2 − (1x00 )2 , (2.20)
common x-axis of S and S0 . β and γ are the usual relativistic
parameters: pT2 ≡ p 2y + pz2 = ( p 0y )2 + ( pz0 )2 , (2.21)
v
β≡ , (2.5)
c m 2T c2 ≡ p02 − px2 = ( p00 )2 − ( px0 )2 . (2.22)
1
γ≡p , (2.6) The intervals s y,z and sx,0 may be combined to obtain the usual
1 − β2 invariant interval, s:
where c is the speed of light, and s 2 = s y,z
2 2
+ sx,0 = (1x)2 + (1y)2 + (1z)2 − (1x0 )2 . (2.23)
x0 ≡ ct, (2.7) Similarly, the transverse momentum, pT , and the ‘transverse
mass’ m T may be combined to obtain the mass of the
where t is the time recorded by an observer at rest in S. Clocks object O:
in S and S0 are synchronized, i.e., t = t 0 = 0, when the origins
of the spatial coordinates of S and S0 coincide. m 2 c2 = m 2T c2 − pT2 = p02 − px2 − p 2y − pz2 . (2.24)
Equations (2.1)–(2.4) give the relation between space
and time intervals 1Er = rE2 − rE1 , 1x0 = c(t2 − t1 ), where The physical interpretations of the LT for 1X (2.8)–(2.11)
(Er1 , t1 ) and (Er2 , t2 ) are space–time events, and rE ≡ (x, y, z), and P (2.15)–(2.18) will now be discussed.
as observed in the two frames: 2
The value of the invariant interval sx,0 specifies four
rectangular hyperbolae on the Minkowski plot of 1x versus
1x 0 = γ (1x − β1x0 ), (2.8) 1x0 (figure 1). They are labelled by their intercepts sx+ and sx−
on the 1x-axis, and s0+ and sx− on the 1x0 -axis. The equations
1y 0 = 1y, (2.9) of the hyperbolae are:
q q
1z 0 = 1z, (2.10) sx+ : 1x0 = ± (1x)2 − sx,0 2
, 1x > sx,0 2
, (2.25)
1x00 = γ (1x0 − β1x).

q q
(2.11) sx− : 1x0 = ± (1x)2 − sx,0
2
, 1x 6 − sx,0
2
, (2.26)
Suppose now that a physical object, O, of Newtonian
mass, m, is at rest in the frame S0 ; then t 0 = τ is the proper
q q
s0+ : 1x = ± (1x)2 + sx,0
2
, 1x0 > 2
−sx,0 , (2.27)
time of the object. The following 4-vectors are now defined
(capital letters are used consistently to denote 4-vectors): q q
s0− : 1x = ± (1x)2 + sx,0
2
, 1x0 6 2
−sx,0 , (2.28)
X ≡ (x0 ; x, y, z), (2.12)
where all intercepts have the same length:
dX
V≡ = (γ c; γ vx , γ v y , γ vz ), (2.13) q
dτ sx+ = sx− = s0+ = s0− = sx,0
2
.
P ≡ mV = ( p0 ; px , p y , pz ). (2.14)
The intervals 1x and 1x0 corresponding to any pair of
X, V and P are the space–time, velocity and energy– space–time points lie on one of these hyperbolae. If the points
momentum 4-vectors of the object O respectively. It follows have a space-like separation (sx,0 2
> 0), the corresponding
from equations (2.8)–(2.11) and the definition of P in (2.14) intervals lie on (2.25) or (2.26); if they have a time-like
that it has the following LT between the frames S and S0 : 2
separation (sx,0 < 0), they lie on (2.27) or (2.28). The different
points on each hyperbola are the intervals of the same pair of
px0 = γ ( px − βp0 ), (2.15) space–time events as recorded by different inertial observers.
p
As shown in figure 1, the magnitude |sx,0 2 | of the invariant
p 0y = p y , (2.16) interval is equal to the distance of closest approach of each
hyperbola to the origin in the Minkowski plot.
pz0 = pz , (2.17) Now it is interesting to note that, for any pair of
space–time points, either a space-like interval, 1x, or a time-
like interval, 1x0 , may be identified as an invariant interval
p00 = γ ( p0 − βpx ). (2.18)
in a particular reference frame. Consider the hyperbolae with
Inspection of (2.8)–(2.11) and (2.15)–(2.18) shows that the 1x intercepts sx+ or sx− in figure 1. In the inertial frame in
which 1x0 = 0 (the intersection of these hyperbolae with the
LT equations for 1X and P are identical. However, as will be
1x-axis), it follows from (2.25) and (2.26) that:
now discussed, there is an important difference in the physical
interpretation of the two sets of transformation equations.
q
sx+ = 1x = sx,02
(1x > 0), (2.29)
Two independent Lorentz-invariant quantities may be
associated with each LT (2.8)–(2.11) and (2.15)–(2.18): q
sx− = −1x = 2
sx,0 (1x < 0). (2.30)
2
s y,z ≡ (1y)2 + (1z)2 = (1y 0 )2 + (1z 0 )2 , (2.19)
641
J H Field
a different position in S, after a suitable correction for light

propagation time delay. Each such pair of clocks would yield
the same value, 1x0 , for the time difference between two
events in S, but with different values of the invariant interval
defined by (2.20).
The invariant quantities s0+ = s0− are better known as c1τ,
where 1τ is the proper time interval in the frame S. Less well
known however is that, as a consequence of the space–time
symmetry manifest on the Minkowski plot, sx+ = sx− may
be also identified with an invariant space interval1λ in the
frame S. This is the same as the length along the x-axis of
any physical object at some particular instant in S. Both 1τ
and 1λ may be defined by measurements corresponding to
particular space–time projections in the frame S. As discussed
above, 1τ corresponds to a 1x = 0 projection and 1λ to a
1x0 = 0 one. The role of such projections in the generation of
the various apparent distortions of space–time in special rela-
tivity has been discussed in [17]. In every-day language, these
projections correspond to observations of the time recorded
by a clock at a fixed position, or of the dimension of an object
at rest, or in motion, at some particular instant, i.e. the usual
Figure 1. Space-time Minkowski plot of 1x versus 1x0 . The way in which time and space measurements are made.
intervals corresponding to every pair of time-like separated events The intervals 1x and 1x0 refer, in general, to space
are seen, by different observers, to lie on the hyperbolae with and time differences between different events. The latter
1x0 -axis intercepts s0+ and s0− . Those for space-like separated events may be, but are not necessarily, related to properties of the
lie on the hyperbolae with 1x-axis intercepts sx+ and sx− . All
intercepts are defined as positive quantities. The dashed lines show same physical object. As discussed above, 1x may be, for
the asymptotes of the hyperbolae, which are the projection of the example, identified with the physical length, `, of a rod, but
light cone in the 1x0 − 1x plane. the LT equations (2.8)–(2.11) are valid for any pair whatsoever
of space–time events. Thus, in equation (2.20), 1x and
The measurement in this frame consists of taking the 1x0 may be freely and independently chosen, each different
difference, 1x, between the spatial coordinates of events at pair describing a possible, but different event configuration
2
some fixed time (1x0 = 0). Such a frame may be defined corresponding to the same, or different, values of sx,0 .
for any pair of space-like separated events as a consequence The situation is quite different for the quantities p0 and
of the geometry of the Minkowski plot. Notice that 1x is px as a consequence of the existence of the Lorentz scalar, m,
not necessarily defined in terms of such a measurement. If, the Newtonian mass, which is a fixed property of any physical
following Einstein [1], the interval 1x is associated with the object O. Because of the relation (2.24) it follows that for
length, `, of a measuring rod at rest in S and lying parallel to fixed pT , as required by the LT equations (2.16) and (2.17),
the x-axis, measurements of the ends of the rod can be made at the value of p0 is determined by that of px , and vice versa.
arbitrarily different times in S. The same result ` = 1x will be Therefore only one of these quantities is independent for any
found for the length of the rod, but the corresponding invariant physical object.
2
interval, sx,0 , as defined by equation (2.20), will be different in It has been shown above that space and time intervals 1x
each case. Such measurements, with 1x0 6= 0, are associated and 1x0 between arbitrary space–time events can be iden-
with all points of the hyperbolae with 1x-axis intercepts sx+ tified with certain Lorentz-invariant quantities Sx ≡ sx± and
and sx− , except their intersections with the 1x-axis. S0 ≡ s0± by noting that the latter correspond to measurements
Similarly, 1x0 may be identified with the time-like of the metric in equation (2.20) in frames of reference where
invariant interval corresponding to successive observations of 1x0 = 0 or 1x = 0, respectively. A related correspondence
a clock at a fixed position (i.e. 1x = 0) in S. In this case, between arbitrary space and time intervals and Lorentz scalar
(2.27) and (2.28) give: quantities is given by writing the LT equations (2.8) and (2.11)
in the following invariant form:
q
s0+ = 1x0 = −sx,0 2
, (1x0 > 0), (2.31) Sx0 = −Ū (β) · S, (2.33)
q S00 = U (β) · S, (2.34)
s0− = −1x0 = 2
−sx,0 , (1x0 < 0) (2.32)
where the following 4-vectors have been introduced:
This corresponds, in the Minkowski plot, to the inertial frame
for which the hyperbolae with 1x0 -axis intercepts s0+ , s0− S ≡ (S0 ; Sx , 0, 0) = (1x0 ; 1x, 0, 0), (2.35)
intersect this axis. Such a frame exists for every pair of
time-like separated events. The interval 1x0 could also be U (β) ≡ (γ ; γβ, 0, 0), (2.36)
measured by observing the difference of the times recorded
by a local clock and another, synchronized, one located at Ū (β) ≡ (γβ; γ , 0, 0). (2.37)
642
The time-like 4-vector, U , is equal to V /c, where V is the 3. Derivation of the Lorentz force law, the magnetic
4-vector velocity of S0 relative to S, whereas the space-like field and the homogeneous Maxwell equations
4-vector, Ū , is ‘orthogonal to U in four dimensions’3 :
E is customarily written
In electrostatics, the electric field, E,
U (β) · Ū (β) = 0. (2.38) in terms of the electrostatic potential, φ, according to the
equation EE = −∇φ. E The potential at a distance, r , from
Since the RHS of (2.33) and (2.34) are 4-vector scalar a point charge, Q, is given by Coulomb’s law φ(r ) =
products, Sx0 and S00 are manifestly Lorentz-invariant Q/r . This, together with the equation FE = q E, E defining
quantities. These 4-vector components may be defined, in E
the force, F, exerted on a charge, q, by an electric
terms of specific space–time measurements, by equations field, completes the specification of the dynamical basis
similar to (2.29)–(2.32) in the frame S0 . Note that the 4- of classical electromagnetism. By this it is meant that
vectors S and S 0 are ‘doubly covariant’ in the sense that S · S these postulates, together with the requirement of special
and S 0 · S 0 are ‘doubly invariant’ quantities whose spatial and relativistic covariance, enable all the equations of classical
temporal terms are, individually, Lorentz invariant: electrodynamics (the Lorentz force law and the four Maxwell
equations) to be derived. The Lorentz force law and the
S · S = S02 − Sx2 = S 0 · S 0 = (S00 )2 − (Sx0 )2 . (2.39) homogeneous Maxwell equations are derived in the present
section; the electrodynamic Ampère law is derived from the
Every term in this equation remains invariant if the spatial and electrostatic Coulomb law in section 5 below.
temporal intervals described above are observed from a third It remains to generalize the above equation, relating
inertial frame S00 moving along the x-axis relative to both S the electric field to the electrostatic potential, in a manner
and S0 . This follows from the manifest Lorentz invariance of consistent with special relativity. In relativistic notation4 , the
the RHS of equations (2.33) and (2.34) and their inverses: electrostatic field is related to the potential by the equation:
E i = ∂ i A0 , where φ is identified with the time component,
Sx = −Ū (−β) · S 0 , (2.40) E In order
A0 , of the 4-vector electromagnetic potential (A0 ; A).
to respect special relativity, the electric field must be defined
S0 = U (−β) · S 0 . (2.41) in a covariant manner, i.e. in the same way in all inertial
frames. The electrostatic law may be generalized in two ways:
Since the LT equations (2.1) and (2.4) are valid for any
4-vector, W , it follows that:
E i → E±
i
≡ ∂ i A 0 ± ∂ 0 Ai . (3.1)
Wx0 = −Ū (β) · W, (2.42) This equation shows the only possibilities to define the electric
field in a way that respects the symmetry with respect to
W00 = U (β) · W. (2.43) the exchange of space and time coordinates that is a general
property of all special relativistic laws [3]. Choosing i = 1 in
Again, Wx0 and W00 are manifestly Lorentz invariant. The LT
equation (3.1) and transforming all quantities on the RHS into
equation in invariant form (2.43), for the electromagnetic
the S0 frame, by use of the inverses of equations (2.1) and
4-vector potential, A, plays a crucial role in the derivation of
(2.4), leads to the following expressions for the 1-component
the Lorentz force law presented below.
of the electric field in S, in terms of quantities defined in S0 :
An interesting special case is the energy–momentum
4-vector P. Choosing the x-axis parallel to pE and β to E ±1 = γ 2 (1 ± β 2 )∂ 01 A00 + γ 2 (β 2 ± 1)∂ 00
correspond to the object’s velocity, so that S0 is the object’s
proper frame, and since P ≡ mcU (β), equations (2.33) and × A01 + γ 2 β(1 ± 1)(∂ 00 A00 + ∂ 01 A01 ). (3.2)
(2.34) yield, for this special case:
Only the choice E 1 ≡ E −1 yields a covariant definition of the
Px0 = −mcŪ (β) · U (β) = 0, (2.44) electric field. In this case, using equations (2.5) and (2.6),
equation (3.2) simplifies to:
P00 = mcU (β) · U (β) = mc. (2.45)
E 1 = ∂ 01 A00 − ∂ 00 A01 = E 01 , (3.3)
Since the 4-vector U (β) is determined by the single
parameter, β, then it follows from the relation P ≡ mcU (β) which expresses the well-known invariance of the longitudinal
that, unlike in the case of the space and time intervals component of the electric field under the LT.
in equation (2.20), the spatial and temporal components of 4 A time-like metric is used for 4-vector products with the components of a
the energy–momentum 4-vector P are, as already discussed 4-vector, W , defined as:
above, not independent. Thus, although the LT equations
for the space–time and energy–momentum 4-vectors are Wt = W 0 = W0 , Wx,y,z = W 1,2,3 = −W1,2,3
mathematically identical, the physical interpretation of the
and an implied summation over repeated contravariant (upper) and covariant
transformed quantities is quite different in the two cases. (lower) indices. Repeated Greek indices are summed from 0 to 3, repeated
Roman ones from 1 to 3. Also
3 The 4-vector Ū has previously been used in the covariant formulation
∂ ∂ ∂ ∂

of advanced and retarded potentials in classical electrodynamics. See, for
∂µ ≡ ; − 1 , − 2 , − 3 = (∂ 0 ; −∇). E
example [18]. ∂x 0 ∂x ∂x ∂x
643
J H Field
Thus, from rotational invariance, the general covariant Introducing now the magnetic field according to the
definition of the electric field is: definition5 :
E i = ∂ i A 0 − ∂ 0 Ai . (3.4) B k ≡ −i jk (∂ i A j − ∂ j Ai ) = (∇ Ek
E × A) (3.12)
This is the ‘additional assumption’, mentioned by Jackson in enables equation (3.11) to be written in the compact form:
the passage quoted above, that is necessary, in the present
dP i
case, to derive the Lorentz force law. Note, however, that = q(E i + β j B k − βk B j ) = q[E i + (βE × B)
E i ], (3.13)
Coulomb’s law is not assumed; the only postulate is the dt
existence of a 4-vector potential. The electric field is defined so that, in 3-vector notation, the Lorentz force law is:
by equation (3.4) but the magnetic field concept has not
yet been introduced. A further a posteriori justification of d pE d(γ β)
E
= mc = q( EE + βE × B).
E (3.14)
equation (3.4) will be given after derivation of the Lorentz dt dt
force law. Here it is simply noted that, if the spatial part of the Writing equation (3.4) in 3-vector notation and perform-
4-vector potential is time-independent, equation (3.4) reduces ing vector multiplication of both sides by the differential
to the usual electrostatic definition of the electric field. E gives:
operator ∇
The force FE0 on an electric charge q at rest in the frame
E = −1 ∂B ,
0 E
S is given by the definition of the electrostatic field, and
E × EE = (∇
∇ E 0 − ∂ 0 (∇
E × ∇)A E × A) (3.15)
equation (3.4) as: c ∂t
F 0i = q(∂ 0i A00 − ∂ 00 A0i ). (3.5) since W E ×W E ≡ 0 for any 3-vector W E , and equation (3.12)
has been used. Equation (3.15) is just the Faraday–Lenz
Equations analogous to (2.43) above may be written relating induction law, i.e. the magnetodynamic Maxwell equation.
A0 and ∂ 0 to the corresponding quantities in the frame S This is only apparent, however, once the ‘magnetic field’
moving along the x 0 -axis with velocity −v relative to S0 : concept of equation (3.12) has been introduced. Thus the
initial hypothesis, equation (3.4), is equivalent to a Maxwell
∂ 00 = U (β) · ∂, (3.6) equation once the magnetic field has been introduced. This
is the a posteriori justification, mentioned above, for this
A00 = U (β) · A. (3.7) covariant definition of the electric field.
Actually, since (3.12) follows from (3.4), using only the
Substituting (3.6) and (3.7) in (3.5) gives: relativistic covariance of the latter, the Faraday–Lenz law has
been derived above with the covariant definition of the electric
F 0i = q[∂ 0i (U (β) · A) − (U (β) · ∂)A0i ]. (3.8) field and special relativity (i.e. the LT) as the only initial
postulates.
This equation expresses a linear relationship between F 0 i ,
Another immediate consequence of the magnetic field
∂ 0 i and A0 i . Since the coefficients of the relation are Lorentz definition (3.12) is the Gauss law for magnetic fields:
invariant, the same formula is valid in any inertial frame (that
is to say that the relation is a covariant one), in particular, in ∇ E · (∇
E · BE = ∇ E = 0.
E × A) (3.16)
the frame S. Hence:
Thus the magnetic Gauss law follows from the 3-vector
F i = q[∂ i (U (β) · A) − (U (β) · ∂)Ai ]. (3.9) identity WE 1 · (WE1×W E 1 ) ≡ 0, for any 3-vectors W
E 1 and W
E 2,
and the definition of the magnetic field and the Faraday–Lenz
This equation gives, in 4-vector notation, a spatial component law follow from the covariant definitions (3.4) and (3.12) of
of the Lorentz force on the charge q in the frame S, and so the electric and magnetic fields respectively and the 3-vector
completes the derivation. identity WE ×W E ≡0.
To express the Lorentz force formula in the more familiar It is common in discussions of electromagnetism to
3-vector notation, it is convenient to introduce the relativistic introduce the second rank electromagnetic field tensor, F µν
generalization of Newton’s second law [19]: according to the definition:
dP F µν ≡ ∂ µ Aν − ∂ ν Aµ (3.17)
= F, (3.10)
dτ
in terms of which, the electric and magnetic fields are defined
where F is the 4-vector force and τ = t 0 is the proper time as:
(in S0 ) that is related to the time t in S by the relativistic time E i ≡ F i0 , (3.18)
dilatation formula: dt = γ dτ . This gives, with equations (3.9)
B k ≡ −i jk F i j . (3.19)
and (3.10):
From the point of view adopted in the present paper, both
dP i dP i the electromagnetic field tensor and the electric and magnetic
=γ = q(∂ i Aα − ∂ α Ai )U (β)α
dτ dt fields themselves are auxiliary quantities introduced only for
= γ q[∂ i A0 − ∂ 0 Ai − β j (∂ i A j − ∂ j Ai ) 5 The alternating tensor, i jk , equals 1(−1) for even (odd) permutations
− βk (∂ i Ak − ∂ k Ai )]. (3.11) of i jk.
644
mathematical convenience, in order to write the equations of that Feynman’s results were just those required ‘for the
electromagnetism in a more compact way. Since all these existence of vector and scalar potentials’. In the present paper,
quantities are completely defined by the 4-vector potential, the existence of a 4-vector potential is assumed and just the
it is the latter quantity that encodes all the relevant physical same results are derived.
information on any electrodynamic problem6 . This position In [2], equations (3.4), (3.12) and (3.17) were all
is contrary to that commonly taken in the literature and introduced as a priori initial definitions of the ‘electric
textbooks where it is often claimed that only the electric and field’, ‘magnetic field’ and ‘electromagnetic field tensor’
magnetic fields have physical significance, while the 4-vector without further justification. In fact, Schwartz ([2], chapter 3,
potential is only a convenient mathematical tool. For example, p 127) gave the following explanation for his introduction of
Röhrlich ([18], p 65) makes the statement: equation (3.17):
E known as potentials have
These functions (φ and A) So far everything we have done has been entirely
no physical meaning and are introduced solely for deductive, making use only of Coulomb’s law,
the purpose of mathematical simplification of the conservation of charge under LT and Lorentz
equations. invariance for our physical laws. We have now come
to the end of this deductive path. At this point when
In fact, as shown above (compare equations (3.11) the laws were being written, God had to make a
and (3.13)), it is the introduction of the electric and decision. In general there are 16 components of a
magnetic fields that enables the Lorentz force equation to be second-rank tensor in four dimensions. However,
written in a simple manner! In other cases (e.g. Maxwell’s in analogy to three dimensions we can make a
equations), simpler expressions may be written in terms major simplification by choosing the completely
of the 4-vector potential. The quantum theory, quantum antisymmetric tensor to represent our field quantities.
electrodynamics, that underlies classical electromagnetism, Then we would have only 6 independent components
requires the introduction the 4-vector photon field Aµ in instead of the possible 16. Under LT the tensor would
order to specify the minimal interaction that provides the remain antisymmetric and we would never have
dynamical basis of the theory. Similarly, the introduction of need for more than six independent components.
Aµ is necessary for the Lagrangian formulation of classical Appreciating this, and having a deep aversion
electromagnetism. It makes no sense, therefore, to argue that to useless complication, God naturally chose the
a physical concept of such fundamental importance has ‘no antisymmetric tensor as His medium of expression.
physical meaning’.
The initial postulate used here to derive the Lorentz Actually it is possible that God may have previously
force law is equation (3.4), which contains, explicitly, the invented the 4-vector potential and special relativity, which
electrostatic force law and, implicitly, the Faraday–Lenz lead, as shown above, to equation (3.4) as the only possible
induction law. The actual form of the electrostatic force law covariant definition of the electric field. As also shown in
(Coulomb’s inverse square law) is not invoked, suggesting that the present paper, the existence of the remaining elements of
the Lorentz force law may be of greater generality. On the the antisymmetric field tensor, containing the magnetic field,
assumption of equation (3.4) (which has been demonstrated to then follow from special relativity alone. Schwartz derived the
be the only possible covariant definition of the electric field), Lorentz force law, as in Einstein’s original special relativity
the existence of the ‘magnetic field’, the ‘electromagnetic field paper [1], by LT of the electric field, from the rest frame of the
tensor’, and finally the Lorentz force law itself have all been test charge, to one in which it is in motion. This requires that
derived, without further assumptions, by use of the invariant the magnetic field concept has previously been introduced as
formulation of the LT. well as knowledge of the LT laws of the electric and magnetic
It is instructive to compare the derivation of the Lorentz fields.
force law given above with that of [13] based on the relativistic In the chapter devoted to special relativity in Jackson’s
transformation properties of the Coulomb force 3-vector. book ([14], section 11.9, p 547) the Lorentz force law
Until now, Coulomb’s law has not been invoked in the present is simply stated, without any derivation, as are also the
paper. On the other hand, reference [13] makes no use of defining equations of the electric and magnetic fields
the 4-vector potential concept, which is essential for the and the electromagnetic field tensor just mentioned. No
derivation presented here. This demonstrates an interesting emphasis is therefore placed on the fundamental importance
redundancy among the fundamental physical concepts of of the 4-vector potential in the relativistic description of
classical electromagnetism. electromagnetism.
It is interesting to note that exactly the results obtained In order to treat, in a similar manner, the electromagnetic
in the present section (the Lorentz force equation and and gravitational fields, the discussion in Misner et al’s
the homogeneous Maxwell equations) were also obtained book [15] is largely centred on the properties of the tensor
by Feynman’s proof [5] on the basis (see section 1) of F µν . Again the Lorentz force equation is introduced, in the
completely different initial postulates. It was remarked [21] spirit of the passage quoted above, without any derivation
or discussion of its meaning. The defining equations of
6 The explicit form of Aµ , as derived from Coulomb’s law, is given in the electric and magnetic fields and F µν , in terms of Aµ ,
standard textbooks on classical electrodynamics. For example, in [13], it is to
be found in equations (17)–(51) and (17)–(52). Aµ is actually proportional,
appear only in the eighteenth exercise of the relevant chapter.
in relativistic theory, to the 4-vector velocity, V , of the charged particle that The main contents of the chapter on the electromagnetic
is the source of the electromagnetic field. See [20]. field are an extended discussion of purely mathematical
645
J H Field
tensor manipulations that obscure the essential simplicity of motion. Of course, equation (4.5) may also be derived directly
electromagnetism when formulated in terms of the 4-vector from the Lorentz force law, so that the time component of the
potential. relativistic generalization of Newton’s second law, equation
In contrast to [2, 15] and [14] (see 11.9, p 547), in the (4.1), contains no physical information not already contained
derivation of the Lorentz force law and the magnetic field in the spatial components. This is related to the fact that, as
presented here, the only initial assumption, apart from the demonstrated above, the spatial and temporal components of
validity of special relativity and the definition of the physical the energy–momentum 4-vector are not independent physical
meaning of an electric field in terms of the force on a test quantities.
charge, is the chosen definition, equation (3.4) (which is the
only covariant one) of the electric field in terms of the 4-vector
5. The electrostatic and electrodynamic Maxwell
potential Aµ . Thus, a more fundamental description of some equations and the 4-vector potential
aspects of electromagnetism than that provided by the electric
and magnetic field concepts is indeed possible, contrary to the It has previously been shown [2, 3] that the 4-vector character
opinion expressed in the passage from Misner et al quoted of the electromagnetic potentials may be used to derive
above. the Ampère law of electrodynamics from the Gauss law of
electrostatics, which is equivalent ([12], chapter 5, paragraph
4. The time component of Newton’s second law 36, p 100) to Coulomb’s inverse square law of electrostatic
in electrodynamics force. The essential features of these derivations are here
briefly recalled. The first step is to write the Gauss law of
STE symmetry [3] states that physical laws in flat space electrostatics:
are invariant with respect to the exchange of the space and
E · EE = 4πρ
∇ (5.1)
time components of 4-vectors. For example, the LT of time,
equation (2.4), is obtained from that for space, equation (2.1), in the manifestly covariant manner:
by applying the STE operations: x0 ↔ x, x00 ↔ x 0 . In the
present case, application of the STE operation to the spatial (∂ · ∂)A0 − ∂ 0 (∂ · A) = 4π j 0 . (5.2)
component of the Lorentz force equation in the second line of
equation (3.11) leads to the relation: This is done by substituting the covariant definition (3.4) of
the electric field in (5.1), associating the charge density ρ with
dP 0 dP 0 the temporal component of an electric current, j 0 ≡ cρ, and
=γ = q(∂ 0 Aα − ∂ α A0 )U (β)α
dτ dt adding to (5.1) the identity:
EE · vE
= − q E i U (β)i = γ q , (4.1) ∂ 0 (∂ 0 A − ∂ 0 A) ≡ 0. (5.3)
c
where equations (2.5) and (3.4) and the following properties Equation (5.2) corresponds to the rest frame of the charge
of the STE operation [3] have been used: density ρ, where (5.1) is valid. Since, however, the coefficients
of the temporal 4-vector components are Lorentz invariant
∂ 0 ↔ −∂ i , (4.2) (compare with equation (3.8) above), (5.2) is also valid in any
inertial frame in which the spatial component E of the 4-vector
A0 ↔ −Ai , (4.3) current  is non-vanishing. In the analogous discussion of the
Lorentz force in section 3, the rest frame of the charge density
C · D ↔ −C · D. (4.4) was S0 , the frame in which it is in uniform motion S. The
Equation (4.1) yields an expression for the time derivative of 4-vector character of  and A, and relativistic covariance of
the relativistic energy, E = c P 0 : equation (5.2) then implies that [2]:
dE dEx (∂ · ∂) AE − ∂(∂
E · A) = 4π E. (5.4)
= q EE · vE = q EE · . (4.5)
dt dt
Rewriting (5.4) in 3-vector notation [3] gives:
Integration of equation (4.5) gives the equation of energy
E × BE − ∂ E = 4π E.
conservation for a particle moving from an initial position, xEI , E
to a final position, xEF , under the influence of electromagnetic ∇ (5.5)
∂t
forces: Z EF Z xEF
This is the electrodynamic Ampère law that includes
dE = q EE · dEx . (4.6) E
EI xEI
Maxwell’s ‘displacement current’ −∂ E/∂t. Instead of
invoking covariance, (5.4) may also be derived from (5.2) by
Thus work is done on the moving charge only by the electric applying the operation of ‘STE symmetry’ [3].
field. This is also evident from the Lorentz force equation, The material presented in the present section is
(3.14), since the magnetic force ' βE × BE is perpendicular complementary to that presented in sections 3 and 4, where
to the velocity vector, so that no work is performed by the the dynamical basis of classical magnetism (the inverse square
magnetic field. A corollary is that the relativistic energy (and law of force) was not considered. In the present section, the
hence the magnitude of the velocity) of a charged particle inverse square law is embodied in (5.1) and the Ampère law
moving in a constant magnetic field is a constant of the (5.5) is directly derived from it from relativistic covariance [2]
646
or STE symmetry invariance’ [3]. Collecting together the References

results of section 3, section 4 and the present one, all the
important equations of classical electrodynamics have been [1] Einstein A 1905 Ann. Phys. 17 891
[2] Schwartz M 1972 Principles of Electrodynamics (New York:
derived simply by formulating them in terms of the 4-vector
McGraw-Hill) chapter 3
potential A and imposing special relativistic covariance. The [3] Field J H 2001 Am. J. Phys. 69 569
only dynamical input is Coulomb’s law implicit in (5.1) [4] Frisch D H and Wilets L 1956 Am. J. Phys. 24 574
and (5.2). In conclusion, the 4-vector potential constitutes a [5] Dyson F J 1990 Am. J. Phys. 58 209
simple physical concept on which to base a unified description [6] Dombey N 1991 Am. J. Phys. 59 85
[7] Breheme R W 1991 Am. J. Phys. 59 85
of classical electrodynamics with a minimal number of
[8] Anderson J L 1991 Am. J. Phys. 59 86
initial postulates. In particular, the existence of magnetic [9] Farquhar I E 1991 Am. J. Phys. 59 87
fields, the Lorentz force law and Maxwell’s ‘displacement [10] Tanimura S 1992 Ann. Phys. (N.Y.) 220 229
current’ have been shown to be pure consequences of the [11] Vaidya A and Farina C 1991 Phys. Lett. A 153 265
special relativistic covariance of all the equations of classical [12] Landau L D and Lifshitz E M 1975 The Classical Theory of
Fields (Oxford: Pergamon) section 30, p 93
electrodynamics.
[13] Lorrain P, Corson D R and Lorrain F 1988 Electromagnetic
Fields and Waves 3rd edn (New York: Freeman)
section 16.5, p 291
Acknowledgments [14] Jackson J D 1975 Classical Electrodynamics (New York:
Wiley) section 12.2, p 578
I thank O L de Lange for asking the question whose answer, [15] Misner C W, Thorne K S and Wheeler J A 1973 Gravitation
presented in section 4, was the original motivation for the (San Francisco: Freeman) chapter 3, p 71
writing of this paper, and an anonymous referee of an earlier [16] Fahnline D E 1982 Am. J. Phys. 50 818
[17] Field J H 2000 Am. J. Phys. 68 367–74
version, submitted to another journal, for pointing out to
[18] Röhrlich F 1990 Classical Charged Particles
me related material, in the books of Jackson and Misner (Reading, MA: Addison-Wesley) section 4.7,
et al, which is discussed in some detail in this version. I also equation (4.89)
thank a second anonymous referee of the other journal for [19] Goldstein H 1959 Classical Mechanics
critical comments that have enabled me to much improve the (Reading, MA: Addison-Wesley) p 200, equation (6)–(30)
[20] Field J H 2005 Classical Electromagnetism as a Consequence
presentation of the material in section 2. Finally, I thank the
of Coulomb’s Law, Special Relativity and Hamilton’s
Physica Scripta referee for encouragement, several important Principle and its Relationship to Quantum Electrodynamics
error corrections, and the suggestion to include the material of http://xxx.lanl.gov/abs/physics/0501130
section 5. [21] Hughes R J 1992 Am. J. Phys. 60 301
647

Field 2006

Enviado por

Dados do documento

Direitos autorais

Formatos disponíveis

Compartilhar este documento

Compartilhar ou incorporar documento

Opções de compartilhamento

Você considera este documento útil?

Este conteúdo é inapropriado?

Direitos autorais:

Formatos disponíveis

Field 2006

Enviado por

Direitos autorais:

Formatos disponíveis

Home Search Collections Journals About Contact us My IOPscience

magnetic Gauss laws using an invariant formulation of the Lorentz transformation

2006 Phys. Scr. 73 639

Please note that terms and conditions apply.

Derivation of the Lorentz force law,

Received 28 November 2005

PACS numbers: 03.30+p, 03.50.De

1. Introduction performed by the present author on the basis of ‘space–time

The frame S0 moves with velocity, v, relative to S, along the 2

1x00 = γ (1x0 − β1x).

a different position in S, after a suitable correction for light

or STE symmetry invariance’ [3]. Collecting together the References

Você também pode gostar