
1 Topic 2: Canonical Transformations and the Hamilton-Jacobi Equation

Reading: Hand & Finch Chapter 6 (required), Goldstein 391-396 (supplemental, but I will
cover this material in the notes).
The first part of this topic includes the material in Chapter 6 with some supplementary
reading from Goldstein. At the end, we will cover in a bit more detail the relationship to
wave mechanics.
1.1 Canonical Transformations

We have considered coordinate transformations between sets of space coordinates

Q_k = Q_k(q(t), t)    (1)

where q has N components in general. These are called point transformations. Obvious
examples of this are going from Cartesian to cylindrical or spherical coordinates.
One advantage of the Lagrange formulation of mechanics is that it easily lets us choose
any invertible function of the Cartesian coordinates as generalized coordinates and then
write down Lagrange's equation directly:

d/dt (∂L/∂q̇_i) − ∂L/∂q_i = 0,   i = 1, ..., N    (2)

so we get the equations of motion with no contortions.


Is there a similar situation for the coordinates of phase space and Hamilton's formulation?
Not in general: you can easily invent transformations from (q_i, p_i) to a new set
(Q_i(q_i, p_i), P_i(q_i, p_i)) (such a transformation containing both q and p is called a contact
transformation) so that the equations of motion for (Q_i, P_i) do not follow from a Hamiltonian.
A simple example:

H = p²/2m,   ṗ = 0,   q̇ = p/m    (3)

Transform

P = pt   so   Ṗ = ṗt + p = p = P/t    (4)

Q = qt   so   Q̇ = q̇t + q = (p/m)t + q = P/m + Q/t    (5)
Does a function K(P, Q, t) exist such that

Ṗ = −∂K/∂Q    (6)

Q̇ = ∂K/∂P ?    (7)

If so, then

∂Ṗ/∂P = −∂²K/∂P∂Q = −∂Q̇/∂Q    (8)

But

∂Ṗ/∂P = 1/t    (9)

∂Q̇/∂Q = 1/t    (10)
So this transformation does not lead to Hamilton's equations, and area in (P, Q) space is not
preserved under the motion. A transformation like this is said to be non-canonical. This has
nothing to do with finding the motion in the new variables. In particular, if

dP/dt = P/t  →  dP/P = dt/t  →  P = kt    (11)

dQ/dt = P/m + Q/t = kt/m + Q/t    (12)

or

(1/t) dQ/dt − Q/t² = k/m    (13)

d/dt (Q/t) = k/m    (14)

so

Q/t = (k/m)t + a

Q = (k/m)t² + at  →  q = (k/m)t + a

However, any result that follows from Hamilton's equations does not apply in the non-canonical
space of (P, Q).
We can, however, find a class of transformations that are said to be canonical, in that
they do follow from a new Hamiltonian. These transformations (involving q_i, p_i) are called
contact (as opposed to point) transformations. The phase space of the new variables has
the same properties as the old (e.g. Liouville's theorem holds). Two examples for the free
particle:
H = p²/2m    (15)

and apply the transformation

P = ap + bq
Q = cp + dq

and the inverse

p = (1/Δ)(dP − bQ)
q = (1/Δ)(−cP + aQ)

where we assume the determinant of the coefficients Δ = ad − bc ≠ 0.
So

Ṗ = b p/m = (b/(Δm))(dP − bQ)

Q̇ = d p/m = (d/(Δm))(dP − bQ)
so we want to know if there is a K such that Hamilton's equations hold. We saw above that
this requires ∂Ṗ/∂P = −∂Q̇/∂Q. In fact

∂Ṗ/∂P = bd/(Δm)

∂Q̇/∂Q = −db/(Δm)

So there is a new Hamiltonian, K:

K(P, Q) = Δ H(p(P, Q), q(P, Q)) = (1/(2mΔ))(dP − bQ)²    (16)

so

Ṗ = −∂K/∂Q = (b/(Δm))(dP − bQ)

Q̇ = ∂K/∂P = (d/(Δm))(dP − bQ)

as above. Except for the factor Δ, this looks just like what you might expect from Lagrangian
mechanics: just write the Hamiltonian in the new coordinates. If Δ = 1, that's just what
happens; however, in the general case "just evaluate the old Hamiltonian in the new variables"
doesn't work.
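A hedged sympy sketch (my addition) of this claim: K = ΔH reproduces the transformed equations of motion for the linear contact transformation.

```python
# Check symbolically that K = Delta * H(p(P,Q), q(P,Q)) generates the
# correct equations of motion for the free particle.
import sympy as sp

a, b, c, d, m = sp.symbols('a b c d m', positive=True)
P, Q = sp.symbols('P Q')
Delta = a*d - b*c

p = (d*P - b*Q) / Delta            # inverse of P = ap + bq, Q = cp + dq
K = Delta * p**2 / (2*m)           # K = Delta * H

Pdot = -sp.diff(K, Q)              # Hamilton's equations in (P, Q)
Qdot = sp.diff(K, P)

print(sp.simplify(Pdot - b*p/m))   # 0, matches Pdot = b p / m
print(sp.simplify(Qdot - d*p/m))   # 0, matches Qdot = d p / m
```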
To show that the contact transformations are really very different from the point trans-
formations of Lagrangian mechanics, consider this example, again the free particle:

H = p²/2m    (17)

and the transform

P = p cos q
Q = p sin q

so p² = P² + Q² and

Ṗ = ṗ cos q − p q̇ sin q = −(p²/m) sin q = −√(P² + Q²) Q/m

Q̇ = ṗ sin q + p q̇ cos q = (p²/m) cos q = √(P² + Q²) P/m
so

∂Ṗ/∂P = −PQ / (m√(P² + Q²))

∂Q̇/∂Q = +QP / (m√(P² + Q²))

so, like in the previous example, K does exist and

K = (1/3m)(P² + Q²)^{3/2}    (18)
Note this is not even close to H(p(P, Q), q(P, Q))! This transformation is "canonical" for
the free particle, but it is not canonical for other Hamiltonians. For example

H = p²/2m + mgq    (19)

ṗ = −mg,   q̇ = p/m    (20)

Ṗ = −mg P/√(P² + Q²) − √(P² + Q²) Q/m

Q̇ = −mg Q/√(P² + Q²) + √(P² + Q²) P/m

and

∂Ṗ/∂P ≠ −∂Q̇/∂Q    (21)

So, this is only a "canonical-like" transformation.
For a contact transformation to be a general canonical transformation, it must yield
Hamilton's equations for any Hamiltonian.
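To make this concrete, here is a small sympy check (mine, not from the text) that P = p cos q, Q = p sin q passes the existence condition for the free particle but fails it once gravity is added:

```python
import sympy as sp

m, g = sp.symbols('m g', positive=True)
P, Q = sp.symbols('P Q', positive=True)
pmag = sp.sqrt(P**2 + Q**2)

def k_exists(Pdot, Qdot):
    # K(P, Q) can exist only if dPdot/dP + dQdot/dQ = 0
    return sp.simplify(sp.diff(Pdot, P) + sp.diff(Qdot, Q)) == 0

# Free particle: pdot = 0, qdot = p/m
print(k_exists(-pmag*Q/m, pmag*P/m))                    # True

# With H = p^2/2m + m g q: pdot = -mg, qdot = p/m
print(k_exists(-m*g*P/pmag - pmag*Q/m,
               -m*g*Q/pmag + pmag*P/m))                 # False
```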
Now that we're clear on what canonical transformations are, let's see how to construct them.
As your book does, I'm going to stick to 2-D phase space (1-D configuration space) for the
following discussions. I'll also note that I will, like your book, take the generating function
approach to constructing canonical transformations. There is another, seemingly unrelated
approach that can be derived in terms of the matrix, or symplectic, formalism of Hamilton's
equations. We'll cover this later (see also appendix A of chapter 6 and Goldstein pp. 391 ff.).

1.2 Generating Function Approach to Canonical Transformations

We saw last term (proven in a presentation problem) that two Lagrangians differing by a
time derivative of the form dF(q, t)/dt are both valid descriptions of the same physical system.
Note that F does not depend on q̇. Consider two different descriptions of the same system,
L′(Q, Q̇, t) and L(q, q̇, t). These refer to the same physical system if

L′(Q, Q̇, t) = L(q, q̇, t) − dF(q, Q, t)/dt    (22)

Note here F can be a function of q, Q but not their time derivatives.
We want the Euler-Lagrange equations to hold in terms of the new variables. Integrating
both sides we have

∫_{t₁}^{t₂} L′ dt = ∫_{t₁}^{t₂} L dt + F(q(t₁), Q(t₁), t₁) − F(q(t₂), Q(t₂), t₂)    (23)

We can see that Hamilton's principle will hold in the new system if it holds in the old if we
take the variation of the above equation and assume that arbitrary variations in δq imply
arbitrary variations in δQ (assuming δF vanishes at the end points). So, F can be used to
generate a new Lagrangian in terms of new variables for which Hamilton's principle holds.
Now we have to figure out how to construct the new canonical momentum and the
new Hamiltonian to get the new configuration space (in which everything we derive from
Hamilton's formalism applies). Note we have specified a generating function, F, not a set of
transformation equations. We have to get the specific form of the transformation equations
from F.
To do the above, take the total time derivative of F(q, Q, t):

dF/dt = (∂F/∂q) q̇ + (∂F/∂Q) Q̇ + ∂F/∂t    (24)

Now L′ = L′(Q, Q̇, t) does not depend on q̇, so

∂L′/∂q̇ = ∂/∂q̇ [L − dF/dt] = ∂L/∂q̇ − ∂F/∂q = 0    (25)

and

p = ∂F/∂q    (26)

By definition the momentum canonical to Q is

P = ∂L′/∂Q̇ = −∂F/∂Q    (27)

To get the transformation explicitly, we solve p = ∂F/∂q for Q = Q(q, p, t), then we solve
P = −∂F/∂Q for P = P(q, p, t) (substituting our Q(q, p, t) in for Q).

To find the new Hamiltonian, K(Q, P), we construct it from our Q, P and L′:

K(Q, P, t) ≡ P Q̇ − L′
          = P Q̇ − L + (∂F/∂q) q̇ + (∂F/∂Q) Q̇ + ∂F/∂t
          = P Q̇ − L + p q̇ − P Q̇ + ∂F/∂t
          = p q̇ − L + ∂F/∂t

and finally

K(Q, P, t) = H(q(Q, P), p(Q, P), t) + ∂F(q(Q, P), Q, t)/∂t    (28)

Now can we use any F(Q, q, t) we want? Without proof, it is also necessary and sufficient
that ∂²F/∂q∂Q ≠ 0. If this second derivative vanishes, the transformation will not be invertible.
Suppose we are given a set of transformation equations; how do we know if they are canonical?
First, express p, P as functions of q, Q, t. Then solve for F using P = −∂F/∂Q, p = ∂F/∂q, and
see if our conditions apply (F = F(q, Q, t) and ∂²F/∂q∂Q ≠ 0). This may or may not be possible
to solve, and we saw that not all contact transformations are canonical.
1.2.1 Types of Generating Function

We have considered generating functions of the form F(q, Q, t), where we assume δF = 0 at
t = t₁, t₂, and F satisfies the condition on the double partial derivative above. Now we can
perform a Legendre transformation on either q or Q to replace them with either p or P, so
that we can express the same canonical transformation by any of four generating functions,
F = F1(q, Q, t), F2(q, P, t), F3(p, Q, t) or F4(p, P, t). Why this is useful will be clear later.
We therefore get any generating function from any other one by a series of transforma-
tions. For example, to get F3(p, Q, t), transform F1:

F3(p, Q, t) = F1(q, Q, t) − qp    (29)

and from

∂F3/∂p = ∂F1/∂p − q = 0 − q    (30)

so

q = −∂F3/∂p    (31)

and

P = −∂F1/∂Q = −∂F3/∂Q    (32)

Note that we have assumed that q, p, Q, P are all independent variables. This is fine for the
purposes of the derivation, as we know they form an independent set dynamically. To get
the transformation equations which give us the functional dependence, we use q = −∂F3/∂p
and P = −∂F3/∂Q.
We can go through the same exercise to get F2 and F4, but we have to change the sign on
the transformation (because of the asymmetry of the minus sign in Hamilton's equations):

F4(p, P, t) = F3(p, Q, t) + PQ
F2(q, P, t) = F1(q, Q, t) + QP

and (without going through the straightforward steps) we get the following transformation
equations:

F1(q, Q, t):   p = ∂F1/∂q,    P = −∂F1/∂Q
F2(q, P, t):   p = ∂F2/∂q,    Q = ∂F2/∂P
F3(p, Q, t):   q = −∂F3/∂p,   P = −∂F3/∂Q
F4(p, P, t):   q = −∂F4/∂p,   Q = ∂F4/∂P
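As a concrete illustration (my own toy example, not from the notes), take the simplest F1, the "exchange" generating function F1 = qQ:

```python
# F1(q, Q) = q Q gives p = dF1/dq = Q and P = -dF1/dQ = -q: coordinates
# and momenta trade places (up to a sign), a perfectly good canonical map.
import sympy as sp

q, Q = sp.symbols('q Q')
F1 = q * Q

p = sp.diff(F1, q)            #  Q
P = -sp.diff(F1, Q)           # -q
print(p, P)                   # Q  -q

# The invertibility condition d2F1/(dq dQ) != 0 holds:
print(sp.diff(F1, q, Q))      # 1
```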

1.3 Poisson Brackets

The Poisson bracket of two (arbitrary) functions F, G with respect to a canonically conjugate
pair is defined by

[F, G]_{q,p} ≡ Σ_{k=1}^{N} ( ∂F/∂q_k ∂G/∂p_k − ∂F/∂p_k ∂G/∂q_k )    (33)

The value of the Poisson bracket is independent of which set of conjugate variables we use
to evaluate the partials, so long as they are related by a canonical transformation:

[F, G]_{q,p} = [F′, G′]_{Q,P}    (34)

where F′, G′ are the transformed functions.
If we let F = Q, G = P, then

[Q, P]_{Q,P} = [Q(q, p), P(q, p)]_{q,p} = 1    (35)
The significance of this is that without knowing the form of the generating function for the
canonical transformation q, p → Q, P we can test whether a given relationship is canonical
or not. If this holds, then the transformation must be canonical (it is a sufficient and
necessary condition). You can demonstrate this directly using the formulae of the canonical
transformation.
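Here is a minimal sketch (my own) of the bracket test in code, applied to two of the transformations above:

```python
import sympy as sp

q, p = sp.symbols('q p')

def poisson(F, G):
    # 1-D Poisson bracket with respect to the conjugate pair (q, p)
    return sp.simplify(sp.diff(F, q)*sp.diff(G, p)
                       - sp.diff(F, p)*sp.diff(G, q))

# Exchange transformation Q = p, P = -q: canonical
print(poisson(p, -q))                          # 1

# Q = p sin q, P = p cos q: bracket is p, not 1, so not canonical
print(poisson(p*sp.sin(q), p*sp.cos(q)))       # p
```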
If we consider an arbitrary number of dimensions, [F, G]_{q,p} = [F′, G′]_{Q,P}, and ∂q_l/∂q_k = δ_lk,
∂p_l/∂p_k = δ_lk, ..., then we have

[Q_i, Q_k]_{q,p} = 0,   [P_i, P_k]_{q,p} = 0,   [Q_i, P_k]_{q,p} = δ_ik    (36)


1.4 The Symplectic Property of General Canonical Transformations

(See Appendix A of your book or Goldstein ch. 9)


We can work out a sufficient condition for a general canonical transformation easily using
a separate approach. First introduce a systematic notation. Let

η = (p₁, p₂, ..., p_N, q₁, q₂, ..., q_N)ᵀ = (p̃, q̃)ᵀ    (37)

where η is a 2N-dimensional vector and p̃, q̃ are N-dimensional vectors, and we define

J = ( 0  −1 )
    ( 1   0 )    (38)

where 0, 1 are N × N matrices and J is a 2N × 2N matrix. Hamilton's equations using this
notation are

η̇_i = J_ij ∂H/∂η_j    (39)

Now a few properties of the matrix J: J̃ = −J, J⁻¹ = −J, and J² = −1, so

(det J)² = det(J²) = det(−1) = (−1)^{2N} = +1    (40)

so

det J = ±1    (41)

Its value is actually +1, as can be seen by row interchange (I won't go through it, but you
can write it out for yourself); anyway you can see it for the simple example

det ( 0  −1 ) = 1    (42)
    ( 1   0 )
So in summary, J is antisymmetric, has −1 as its square, and has unit determinant.
Now let ζ_i = ζ_i(η_j, t) be invertible, so that η_i = η_i(ζ_j, t) (this implies det(∂ζ_i/∂η_j) is
not identically zero). Then

H(η_j, t) = H(η_j(ζ_i, t), t) ≡ H′(ζ_i, t)    (43)

∂H′/∂ζ_k = (∂H/∂η_i)(∂η_i/∂ζ_k)    (44)
and

ζ̇_i = (∂ζ_i/∂η_j) η̇_j + ∂ζ_i/∂t = (∂ζ_i/∂η_j) J_jk (∂H/∂η_k) + ∂ζ_i/∂t    (45)

so

ζ̇_i = (∂ζ_i/∂η_j) J_jk (∂H′/∂ζ_l)(∂ζ_l/∂η_k) + ∂ζ_i/∂t

If it is the case that

(1)   (∂ζ_i/∂η_j) J_jk (∂ζ_l/∂η_k) = k J_il   for constant k ≠ 0    (46)

and that there is a function F(ζ_i, t) such that

(2)   ∂ζ_i/∂t |_η = J_il ∂F/∂ζ_l    (47)

then

∂F/∂ζ_l |_t = −J_li ∂ζ_i/∂t |_η    (48)
Then we have

ζ̇_i = J_il ∂(kH′ + F)/∂ζ_l    (49)

and the new Hamiltonian is

K(ζ_j, t) = k H′(ζ_j, t) + F(ζ_j, t)    (50)

If the constant k is 1, then we have an ordinary canonical transformation, and otherwise what
some people call a "general canonical transformation".
In matrix notation, if we let

(X)_ij = ∂ζ_i/∂η_j    (51)

then we have

(X)_ij (J)_jk (X)_lk = k (J)_il    (52)

or

X J X̃ = kJ   (general canonical)
X J X̃ = J    (canonical, k = 1)

Example:

P = p + mv
Q = q + vt

(a Galilean transformation; here ζ = (P, Q) and η = (p, q)). Then

X = ( 1  0 )
    ( 0  1 )    (53)
so this is an ordinary canonical transformation (but time dependent) with k = 1, if we can
find an F such that (using eq. (48) with ∂P/∂t = 0, ∂Q/∂t = v)

∂F/∂P = v,   ∂F/∂Q = 0    (54)

so F = Pv is sufficient. We then construct

K(P, Q, t) = H_{p,q}(P − mv, Q − vt, t) + Pv + const    (55)

For a free particle

K = (P − mv)²/2m + Pv + const = P²/2m    (56)

(absorbing −mv²/2 into the constant; a symmetry). For the SHO

K = P²/2m + (1/2) mω² (Q − vt)²    (57)

and the solution is

Q = a cos(ωt + φ) + vt    (58)
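A quick numeric check (my addition, assuming the (p, q) ordering of η and the J of eq. (38)) of the symplectic condition for the boost:

```python
import numpy as np

J = np.array([[0., -1.],
              [1.,  0.]])

# X_ij = d(zeta_i)/d(eta_j) for P = p + m v, Q = q + v t is the identity:
X = np.eye(2)

print(np.allclose(X @ J @ X.T, J))   # True -> ordinary canonical, k = 1
```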
We supplement the above with two facts that I will not prove:
(1) The sufficient conditions given above can be shown to be necessary also.
(2) Any general canonical transformation can always be compounded of a linear transfor-
mation

ζ_i = ω_ij η_j    (59)

where ω_ij is a suitably chosen symmetric matrix, followed by an ordinary canonical transfor-
mation. Thus, all the interesting transformations are the ordinary ones. In summary:

ζ_i = ζ_i(η_j, t)    (60)

X J X̃ = J    (61)

(X)_ij = ∂ζ_i/∂η_j    (62)

K = H′ + F    (63)

∂F/∂ζ_j |_{t, all other ζ's} = −J_ji ∂ζ_i/∂t |_η   (be careful to note what is held fixed)    (64)

ζ̇_i = J_ij ∂K/∂ζ_j    (65)
Note, if the transformation does not depend on time, take F = 0, so

K(ζ_i, t) = H(η_j(ζ_i), t)    (66)

in other words, just the analog of what happens in Lagrangian theory.
1.5 The Hamilton-Jacobi Equation

The Hamilton-Jacobi equation is an important example of how new information about
mechanics can come out of the action by considering various kinds of variation of the tra-
jectories.
From the definition of the action, it is easy to see that it can be considered to be an
ordinary function of the variables q_i^{(2)}, t^{(2)}, q_i^{(1)}, t^{(1)} by using actual motions
connecting these points in time-configuration space:

S(q_i^{(2)}, t^{(2)}; q_i^{(1)}, t^{(1)}) = ∫_{t^{(1)}}^{t^{(2)}} L(q_i, q̇_i, t) dt    (67)

where q_i(t), q̇_i(t) are the actual motions satisfying Lagrange's equations with the boundary
conditions q_i^{(1)} = q_i(t^{(1)}), q_i^{(2)} = q_i(t^{(2)}).
As an example:

L = (1/2) m ẋ²,   H = p²/2m,   p = mẋ

then

x(t) = x^{(1)} + [(x^{(2)} − x^{(1)})/(t^{(2)} − t^{(1)})] (t − t^{(1)})    (68)

So

S = ∫_{t^{(1)}}^{t^{(2)}} (1/2) m [(x^{(2)} − x^{(1)})/(t^{(2)} − t^{(1)})]² dt = (1/2) m (x^{(2)} − x^{(1)})² / (t^{(2)} − t^{(1)})

Now it is interesting to notice that

∂S/∂x^{(2)} = m (x^{(2)} − x^{(1)})/(t^{(2)} − t^{(1)}) = p^{(2)}

∂S/∂x^{(1)} = −m (x^{(2)} − x^{(1)})/(t^{(2)} − t^{(1)}) = −p^{(1)}

and also

∂S/∂t^{(2)} = −(1/2) m [(x^{(2)} − x^{(1)})/(t^{(2)} − t^{(1)})]² = −H^{(2)}

∂S/∂t^{(1)} = +(1/2) m [(x^{(2)} − x^{(1)})/(t^{(2)} − t^{(1)})]² = H^{(1)}

(note that in the above the H's could have been L's in this simple, ambiguous example).
Let's work out the general case of a change in S when we wiggle each of the 2N + 2
variables q_i^{(1)}, q_i^{(2)}, t^{(1)}, t^{(2)}. Consider the following picture (figure not reproduced
in this extract) of the varied endpoints.
Then

dS = ∫_{t^{(1)}+dt^{(1)}}^{t^{(2)}+dt^{(2)}} L(q_i(t) + dq_i(t), q̇_i + dq̇_i, t) dt − ∫_{t^{(1)}}^{t^{(2)}} L dt

   = L^{(2)} dt^{(2)} − L^{(1)} dt^{(1)} + p_i^{(2)} dq_i(t^{(2)}) − p_i^{(1)} dq_i(t^{(1)})

The term p_i^{(2)} dq_i(t^{(2)}) − p_i^{(1)} dq_i(t^{(1)}) comes from the case when the ends t^{(1,2)} are fixed and
so the dq's are the changes at these times.
But dq_i(t^{(1)}) ≠ dq_i^{(1)} (see the picture). In fact,

dq_i^{(1)} = dq_i(t^{(1)}) + q̇_i(t^{(1)}) dt^{(1)}    (69)

and similarly for (2). So

dS = L^{(2)} dt^{(2)} + p_i^{(2)} [dq_i^{(2)} − q̇_i(t^{(2)}) dt^{(2)}] − L^{(1)} dt^{(1)} − p_i^{(1)} [dq_i^{(1)} − q̇_i^{(1)} dt^{(1)}]

   = (p_i dq_i − H dt) |_{t^{(1)}}^{t^{(2)}}

So we have, with a slight change of notation in which you can think of (q_i, t) as the free
variables and (q_i^{(0)}, t^{(0)}) as initial constants,

dS = p_i dq_i − H dt − p_i^{(0)} dq_i^{(0)} + H^{(0)} dt^{(0)}    (70)

We can get the results

∂S/∂q_i = p_i,   ∂S/∂t = −H    (71)

and

∂S/∂q_i^{(0)} = −p_i^{(0)},   ∂S/∂t^{(0)} = H^{(0)}    (72)
very easily. Think of S as an indefinite integral (i.e. t as a variable):

S = ∫^t L dt = ∫^t (−H + p_i q̇_i) dt    (73)

so

dS/dt = −H + p_i q̇_i    (74)

But S = S(q_i, t), so

dS/dt = (∂S/∂q_i) q̇_i + ∂S/∂t = −H + p_i q̇_i    (75)

so

∂S/∂t = −H,   ∂S/∂q_i = p_i    (76)

and we also have ∂S/∂q_i^{(0)} = −p_i^{(0)}, ∂S/∂t^{(0)} = H^{(0)}.
Hamilton was intrigued by these equations, and he noticed the following. Remember
that

H = H(p_i, q_i, t)    (77)

and so

∂S/∂t + H(∂S/∂q_i, q_i, t) = 0    (78)

which is a non-linear partial differential equation for the function S(q_i, t) (since it's non-
linear, sums of solutions are not necessarily solutions). Let's see how this works for the
simplest possible case:

H = p²/2m    (79)
so

∂S/∂t + (1/2m)(∂S/∂q)² = 0    (80)

S for the free particle is S = (1/2) m (x − x_o)²/(t − t_o), so

∂S/∂t = −(1/2) m [(x − x_o)/(t − t_o)]²

∂S/∂x = m (x − x_o)/(t − t_o)

so combining

(1/2m)(∂S/∂q)² = (1/2) m [(x − x_o)/(t − t_o)]²    (81)

and Hamilton's partial differential equation works.
But what about the converse? Can you use Hamilton's PDE to calculate an S and use it
to solve mechanics problems? The answer is yes, if you happen to find the "right" solution.
But PDEs have an infinity of solutions. For example, the above solution to the PDE can
be found by a familiar separation of variables procedure. Assume

S = X(x) T(t)    (82)

so Hamilton's PDE becomes

X T′ + (1/2m)(X′ T)² = 0    (83)

where prime means differentiation wrt the function's argument. Then divide by X T² (as-
sumed non-zero) to get

T′/T² + (1/2m) X′²/X = 0    (84)
Since x, t are independent and we have f(t) + g(x)/2m = 0, we must have

f(t) = T′/T² = −k/2m = const

g(x) = X′²/X = +k

so

T′/T² = −k/2m,   dT/T² = −(k/2m) dt    (85)

so

−1/T = −(k/2m)t + a

T = 1/[(k/2m)(t − t_o)] = 2m/[k(t − t_o)]

X′ = √(kX)

dX/√X = √k dx

2√X = √k x + b

or

X = (1/4) k (x − x_o)²    (86)

and

S = XT = (1/4) k (x − x_o)² · 2m/[k(t − t_o)] = (1/2) m (x − x_o)²/(t − t_o)    (87)

which is just the same as we got by direct integration.
But what if we had assumed

S = T + X    (88)

so

T′ + (1/2m)(X′)² = 0

(1/2m)(X′)² = k  →  X = √(2mk) x + a    (89)

so

T′ = −k,   T = −kt + b    (90)

so

S = √(2mk) x − kt + const    (91)

which is a whole lot different than the previous solution! Note that this function is in fact
an indefinite integral of the Lagrangian of a free particle with motion

x = vt + x_o    (92)

so

ẋ = v    (93)

S = ∫ (1/2) m ẋ² dt = (1/2) m v² t + const    (94)

which is equal to √(2mk)x − kt = √(2mk)vt − kt + √(2mk)x_o = (1/2)mv²t + const if
(1/2)mv² = √(2mk)v − k (i.e. if k = (1/2)mv²) and const = √(2mk) x_o.
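Both solutions can be verified mechanically; a sympy sketch (mine, not in the notes):

```python
# Verify that S1 = m (x - xo)^2 / (2 (t - to)) and S2 = sqrt(2mk) x - kt
# each solve the free-particle H-J equation dS/dt + (dS/dx)^2/(2m) = 0.
import sympy as sp

x, t, m, k, xo, to = sp.symbols('x t m k x_o t_o', positive=True)

def hj_residual(S):
    return sp.simplify(sp.diff(S, t) + sp.diff(S, x)**2 / (2*m))

S1 = m*(x - xo)**2 / (2*(t - to))
S2 = sp.sqrt(2*m*k)*x - k*t
print(hj_residual(S1), hj_residual(S2))   # 0 0
```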
This simple example raises the question: how do we figure out which solutions of Hamil-
ton's PDE are associated with actual motion, and how do we extract that motion from the
function S?
Hamilton did not manage to solve the problem, but Jacobi did, and so the PDE is now
known as the Hamilton-Jacobi equation. We can understand what Jacobi produced if we
remember that if we write

S(q_i, t; q_i^{(0)}, t^{(0)})    (95)

(think of q_i^{(0)}, t^{(0)} as constants) we also get

∂S/∂t^{(0)} = H^{(0)}(p_i^{(0)}, q_i^{(0)}, t^{(0)})

∂S/∂q_i^{(0)} = −p_i^{(0)}

or, when we differentiate S with respect to the constants t^{(0)}, q_i^{(0)}, we get other constants,
p_i^{(0)}, H^{(0)}.
So Jacobi's rule for a system with N degrees of freedom:
(1) Find any solution S of the HJ equation which depends on N + 1 arbitrary algebraically
independent (see definition below) constants,

S = S(q_i, t; a_i) + A,   i = 1, ..., N    (96)
(one of the constants is always additive and ignorable).
(2) Obtain the EOM by setting

∂S/∂a_i = b_i    (97)

where the b_i are N more constants, and solve for the

q_i = q_i(t; a_j, b_k),   i, j, k = 1, ..., N    (98)
By algebraic independence we mean that the N × N matrix

∂²S/∂q_i ∂a_j    (99)

has a determinant that is not identically zero, so that the matrix "usually" has an inverse
(there may be singular points where there is no inverse, i.e. the determinant is zero). So
if you get a solution with a_k, k = 1, ..., N−1, you cannot fill out the set of constants by, for
example, splitting one of these into the sum of two new ones, so

a_new = (a_1, a_2, ..., a′_{N−1}, a″_{N−1})    (100)

where a′_{N−1} + a″_{N−1} = a^{old}_{N−1}. Clearly in this case two rows (if j determines rows) of
∂²S/∂q_i ∂a_j^{new} will be identical and the determinant is identically zero.
Check the theorem in two simple cases: L = (1/2) m q̇², so

S = (1/2) m (x − x_o)²/(t − t_o)    (101)

Take x_o as a_1: then

∂S/∂a_1 = ∂S/∂x_o = −m (x − x_o)/(t − t_o) = b_1    (102)

which gives a solution.
If we take t_o as a_1: then

∂S/∂a_1 = ∂S/∂t_o = (1/2) m [(x − x_o)/(t − t_o)]² = b_1    (103)

this also gives a solution.


S = √(2mk) x − kt + const    (104)

Take k as a_1, so

∂S/∂a_1 = ∂S/∂k = (1/2) √(2m/k) x − t = b_1    (105)

or

x = √(2k/m) t + const    (106)

which is a solution.
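In code, Jacobi's rule is just "differentiate with respect to the constant and solve"; a sketch (mine) for this last case:

```python
# From S = sqrt(2mk) x - kt, set dS/dk = b1 and solve for the trajectory.
import sympy as sp

x, t, m, k, b1 = sp.symbols('x t m k b_1', positive=True)
S = sp.sqrt(2*m*k)*x - k*t

x_of_t = sp.solve(sp.Eq(sp.diff(S, k), b1), x)[0]
print(sp.simplify(x_of_t))   # sqrt(2k/m)*(b_1 + t): uniform motion with
                             # speed v = sqrt(2k/m), i.e. k = m v^2 / 2
```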
We can easily prove Jacobi's theorem:

d/dt (∂S/∂a_i) = 0 = (∂²S/∂a_i∂q_j) q̇_j + ∂²S/∂a_i∂t    (107)

But

∂S/∂t = −H(∂S/∂q_i, q_i, t)    (108)

and only p_i = ∂S/∂q_i depends on the a's, so

∂²S/∂a_i∂t = −(∂H/∂p_j)(∂²S/∂q_j∂a_i)    (109)

so we get

(∂²S/∂a_i∂q_j)(q̇_j − ∂H/∂p_j) = 0    (110)

If the matrix ∂²S/∂a_i∂q_j is invertible, then the only solution to these equations is q̇_j = ∂H/∂p_j,
half of Hamilton's equations.
Next, from

p_i = ∂S/∂q_i    (111)

we get

ṗ_i = d/dt (∂S/∂q_i) = (∂²S/∂q_i∂q_j) q̇_j + ∂²S/∂q_i∂t
    = ∂/∂q_i [(∂S/∂q_j) q̇_j + ∂S/∂t]
    = ∂/∂q_i (p_j q̇_j − H)

and

ṗ_i = ∂L/∂q_i    (112)

which is just Lagrange's equation.
A very important special case is that in which the Hamiltonian is conserved (i.e. inde-
pendent of t); since it is often the energy, call the conserved value of H the constant E,
and then S must be linear in t. Then

S = −Et + W(q_i)    (113)

satisfies the H-J equation if we have

−E + H(∂W/∂q_i, q_i) = 0    (114)

where E is a constant. You can consider this another proof of H = const if ∂H/∂t = 0.
1.5.1 The Relationship of the H-J Equation to Quantum Mechanics

The H-J equation is an important step toward formulating a wave equation for particles.
Write a solution of the H-J equation for a free particle in 3-D as

S = p · x − Et    (115)

where the three components of p are the three constants. By the H-J equation

∂S/∂t + H(∂S/∂x_i, x_i) = −E + p·p/2m = 0    (116)

we get the expected E = p²/2m. We get the EOM by

∂S/∂p_i = x_i − (∂E/∂p_i) t = x_{i0} (constant)    (117)

or

x_i = (p_i/m) t + x_{i0}    (118)
However, if you look at S you see that at a fixed moment in time, a fixed value of S defines
a plane, and the vector p is normal to it. At a later time t + Δt a plane of constant S will
still have p as its normal, and it will be displaced along p by an amount (E/|p|) Δt. If this is
not obvious, adopt the coordinate system so p = p ê_x and we have

S = px − (p²/2m) t = p(x + Δx) − (p²/2m)(t + Δt)    (119)

so Δx/Δt is the velocity at which the plane moves, = p/2m. So we see that the particle
trajectories are normal to surfaces of constant S (although the planes of constant S travel at
half the speed of the particle). This may seem disappointing at first, until you notice that the
group velocity of the wave is p/m. The particle trajectories have the same geometric relation
to the planes of constant S as the rays of optics do to planes of constant wave phase.
This suggests considering S as a wave phase factor

ψ = ψ_o e^{iS/ℏ}    (120)

where ℏ is a scale factor with dimensions of S = (energy × time) or (momentum × distance).
No prejudice about its value follows from classical mechanics, except that if in fact classical
physics is the ray approximation to some underlying wave mechanics, then ℏ must be very
tiny, since it took almost 100 years from the time that Hamilton did the above for optics
before any wave character was experimentally detected for a material particle.
From

ψ = ψ_o e^{iS/ℏ}   and   S = p · x − Et    (121)

what equation does ψ satisfy? We get

∇ψ = (i/ℏ) ψ ∇S    (122)
Take the divergence of both sides:

∇·(∇ψ) = ∇²ψ = (i/ℏ) ∇ψ · ∇S + (i/ℏ) ψ ∇²S    (123)

For the free particle (i/ℏ) ψ ∇²S = 0, and

∇²ψ = (i/ℏ) ∇ψ · ∇S = (i/ℏ)² ψ (∇S)²    (124)

and

∂ψ/∂t = (i/ℏ) ψ ∂S/∂t    (125)

Using the H-J equation with H = p²/2m,

∂ψ/∂t = −(i/ℏ)(1/2m)(∇S)² ψ = −(i/ℏ)(1/2m)(−ℏ²) ∇²ψ    (126)

or

iℏ ∂ψ/∂t = −(ℏ²/2m) ∇²ψ    (127)
A familiar equation!
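A direct sympy confirmation (my addition) that the phase ansatz with the free-particle action solves this equation:

```python
# psi = exp(i S / hbar) with S = p x - p^2 t / (2m) satisfies
# i hbar dpsi/dt = -(hbar^2/2m) d2psi/dx2 exactly for the free particle.
import sympy as sp

x, t, m, p, hbar = sp.symbols('x t m p hbar', positive=True)
S = p*x - p**2*t/(2*m)
psi = sp.exp(sp.I*S/hbar)

lhs = sp.I*hbar*sp.diff(psi, t)
rhs = -hbar**2/(2*m)*sp.diff(psi, x, 2)
print(sp.simplify(lhs - rhs))   # 0
```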
In fact the relationship between the wave equation and the H-J equation was well known
to Hamilton, his contemporaries and his successors. In optics, the analog of the H-J equation
is called the eikonal equation. The S function is the icon or representative of the wave
function. The function e^{iS/ℏ} for suitable "ℏ" in the optics case gives an approximation to
the wave equation in certain cases. It is the phase of a wave that carries most of the really
wave-like character of a wave (think of interference) and so it is usually the most interesting
physically.
It is interesting to inquire under what conditions the solution S to the H-J equation
makes a good approximation to the phase of the solution of the Schroedinger equation.
Without proof, if

−(ℏ²/2m) ∇²ψ + Vψ = iℏ ∂ψ/∂t    (128)

and we take

ψ = A(r) e^{iS(r,t)/ℏ} = A(r) e^{(i/ℏ)(W(r) − Et)}    (129)

then

(1/2m)(∇S)² + V − E = (ℏ/2m) [ ℏ ∇²A/A + 2i ∇W·∇A/A + i ∇²W ]    (130)

The r.h.s. can be approximated by zero when ℏ is very small compared to other quantities
with the same dimensions in the physical problem. Thus for macroscopic bodies this is the
case, since

ℏ ~ (mass of an electron) × (cm/s) × (cm)    (131)
It is useful to associate a wavelength with W(x), which in 1-D:

(1/ℏ) W(x) = (1/ℏ) W(x_o) + (1/ℏ)(x − x_o) dW/dx|_{x_o} + ...
           = (1/ℏ) W(x_o) + (x − x_o)/λ_o

where 1/λ_o = (1/ℏ) dW/dx, so

e^{(i/ℏ)(W(x) − Et)} = e^{i[(x − x_o)/λ_o − ωt]}   (up to a constant phase)    (132)

where ω = E/ℏ. Then the r.h.s. can be shown to be well approximated by zero if

λ_o |dV/dx| ≪ kinetic energy    (133)

i.e. the potential is slowly varying in space. In this case, the classical action S yields a
reasonable approximation to the phase function of the quantum mechanical wave function.
This corresponds to the WKB limit where the potential is essentially constant over many de
Broglie wavelengths.
1.6 Adiabatic Invariants

This section follows the derivation of Landau and Lifshitz (see pp. 154 ff.).
Adiabatic invariants are quantities that remain essentially constant in a system that is
not closed, but where some parameter varies slowly. We're going to consider systems that are
strictly periodic and conservative when they are "closed" (i.e. when we keep all the system
parameters constant), and consider what we can say about the system as we slowly vary one
parameter. Specific examples of such systems would be a pendulum where we slowly change
the length of the string (by slow we mean slow compared to the natural frequency), or a
mass on a spring with a slowly changing spring constant. When you change the length of
the string in a pendulum you know ω increases, and E changes, but can we construct some
combination of parameters that stays essentially constant?
Call the parameter we vary λ, and let it vary slowly (adiabatically) with time as the
result of some external action. If the period is T, we require

T dλ/dt ≪ λ    (134)
The energy E changes slowly (if we average over the rapid oscillations of the system) with
time as λ changes, and dE/dt is then some function of λ. This dependence can be expressed
as the constancy of some combination of E and λ (called an adiabatic invariant) which
remains constant during the motion of a system with slowly varying parameters.
We can write the Hamiltonian as H(q, p; λ), and the rate of change of the energy is

dE/dt = ∂H/∂t = (∂H/∂λ)(dλ/dt)    (135)
Now (∂H/∂λ)(dλ/dt) depends on the rapidly varying q, p as well as on the slowly varying λ.
We want to average over the rapid (periodic) variations resulting from the oscillatory motion
to isolate the slow variation in λ:

⟨dE/dt⟩ = (dλ/dt) ⟨∂H/∂λ⟩    (136)

where we average over the rapidly varying (oscillating) H, but during our averaging time,
λ remains essentially constant, so we can pull the dλ/dt out of the averaging. Furthermore,
in averaging H we consider q, p to vary and λ to be constant. We are essentially averaging
over the motion of the closed system (what would happen with λ constant).
The average is

⟨∂H/∂λ⟩ = (1/T) ∫₀^T (∂H/∂λ) dt    (137)

From Hamilton's equations, q̇ = ∂H/∂p, and we can change the integral over time to one over
q: dt = dq (∂H/∂p)⁻¹, so

T = ∫₀^T dt = ∮ (∂H/∂p)⁻¹ dq    (138)

where the integral over q is taken over the complete range of variation of the coordinate
during one cycle.
So

⟨dE/dt⟩ = (dλ/dt) ⟨∂H/∂λ⟩ = (dλ/dt) [ ∮ (∂H/∂λ)(∂H/∂p)⁻¹ dq ] / [ ∮ (∂H/∂p)⁻¹ dq ]    (139)

Since we are taking the averages for constant λ, H = E = const also over the integral, and
p is a defined function of q, E, λ: p = p(q, E, λ). So

H(q, p; λ) = E    (140)

and

dH/dλ = 0 = ∂H/∂λ + (∂H/∂p)(∂p/∂λ)    (141)

(∂H/∂λ)/(∂H/∂p) = −∂p/∂λ    (142)

If we substitute this into the expression for ⟨dE/dt⟩, and use (∂H/∂p)⁻¹ = ∂p/∂E (at fixed q, λ),

⟨dE/dt⟩ = −(dλ/dt) [ ∮ (∂p/∂λ) dq ] / [ ∮ (∂p/∂E) dq ]    (143)

so

∮ [ (∂p/∂E) dE/dt + (∂p/∂λ) dλ/dt ] dq = 0    (144)
or, exchanging the order of integration and differentiation,

(d/dt) ∮ p dq = 0    (145)

If we define

I ≡ (1/2π) ∮ p dq    (146)

then

dI/dt = 0    (147)
Remember the integral is over a period with constant E, λ. So in this approximation I
remains constant when λ varies.
I is the adiabatic invariant we have been looking for, and is a function of E, λ. If we
look at the partial derivative with respect to E,

2π ∂I/∂E = ∮ (∂p/∂E) dq = T = 2π/ω    (148)

we see that the partial is related to the period.
The geometrical significance of I is that it is related to the area of phase space enclosed
by the curve:

I = (1/2π) ∮ p dq = (1/2π) ∬ dp dq    (149)
As a simple example, compute I for a simple harmonic oscillator with natural frequency ω:

H = (1/2m) p² + (1/2) mω² q²    (150)

Since H = E = const, the phase path is an ellipse with semi-axes √(2mE) on the p axis and
√(2E/(mω²)) on the q axis, and the area divided by 2π is

I = E/ω    (151)

The significance of this is that when the parameters of the oscillator vary slowly, the energy
is proportional to the frequency.
Suppose the SHO starts with maximum amplitude q_o and mass m_o at t = 0. Assume its
mass is gradually ("adiabatically") increased and its spring constant k is held fixed. What
is the amplitude q when the mass is m?

E_o = (1/2) k q_o²,   ω_o = √(k/m_o)

so

I = (1/2) k q_o² √(m_o/k) = (1/2) k q² √(m/k)    (152)

or

q² = q_o² √(m_o/m)

q = q_o (m_o/m)^{1/4}

In fact

q(t) = q_o [m_o/m(t)]^{1/4} cos( ∫₀^t √(k/m(t′)) dt′ + φ_o )    (153)
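A numerical sketch (mine; the growth rate and step size are arbitrary choices) showing I = E/ω holding while the mass slowly triples:

```python
# Integrate the SHO with H = p^2/(2 m(t)) + k x^2/2, m(t) growing slowly,
# and check that E/omega stays near its initial value 0.5 even though E
# and omega individually change.
import numpy as np

k, m0, eps = 1.0, 1.0, 1e-2
m = lambda t: m0 * (1.0 + eps * t)

def deriv(t, y):
    x, p = y                          # p = m xdot
    return np.array([p / m(t), -k * x])

y, t, dt = np.array([1.0, 0.0]), 0.0, 1e-3
for _ in range(200_000):              # t = 0 .. 200, mass goes 1 -> 3
    k1 = deriv(t, y); k2 = deriv(t + dt/2, y + dt/2*k1)
    k3 = deriv(t + dt/2, y + dt/2*k2); k4 = deriv(t + dt, y + dt*k3)
    y = y + dt/6*(k1 + 2*k2 + 2*k3 + k4); t += dt

E = y[1]**2/(2*m(t)) + 0.5*k*y[0]**2
print(E / np.sqrt(k/m(t)))            # ~0.5 = E0/omega0, the invariant
```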
Historically, the adiabatic invariance of ∮ p dq was taken to be of great significance during
the early development of quantum mechanics (the "old" quantum mechanics). Remember
Planck's famous hypothesis that for an SHO

E_n = nℏω    (154)

(where n is an integer and ℏ a constant) are the only allowed energies. As we see above,

2π E/ω = ∮ p dq = nh   (by postulate)    (155)

is an adiabatic invariant and so remains constant under any kind of variation slow compared
to the period of the oscillation. Think of T = 2π/ω for optical light frequencies:

λ = cT    (156)

T = λ/c = (500 × 10⁻⁹ m)/(3 × 10⁸ m/s) ≈ 200 × 10⁻¹⁷ s ~ 10⁻¹⁵ s    (157)

so fast macroscopic changes might occur in 10⁻⁹ s and still be adiabatic.
Ehrenfest adopted the ad hoc principle that quantum conditions should be applied to
adiabatic invariants, which Sommerfeld generalized to an "action variable", ∮ p dq.
1.6.1 Adiabatic Invariant for a Charged Particle in a Magnetic Field

There is a very important adiabatic invariant used in many fields involving charged particles
moving in magnetic fields. It can be expressed in many ways, one of which is that the
magnetic moment of a charged particle circulating in a magnetic field that changes slowly in
time is invariant. This also applies if a particle drifts along, circulating around magnetic field
lines in an inhomogeneous magnetic field, so that the average motion is through a changing
magnetic field.
Let's derive this using Hamilton's methods and the definition of an adiabatic invariant.
For constant B we need A. Although it is not unique,

A = (1/2) B × r    (158)
works. Take B = B ê_z and use cylindrical coordinates:

v = (dz/dt) ê_z + ρ̇ ê_ρ + ρ φ̇ ê_φ    (159)

So

A = (1/2) B ê_z × (ρ ê_ρ + z ê_z) = (1/2) B ρ ê_φ    (160)

so A circulates around the z axis and increases linearly in ρ.
The Lagrangian (refer to our previous discussion of L for a charged particle in a field) is

L = (1/2) m (ρ̇² + ρ² φ̇² + ż²) + (e/2c) B ρ² φ̇    (161)

The canonical momenta are

p_ρ = m ρ̇

p_φ = m ρ² φ̇ + (e/2c) B ρ²

p_z = m dz/dt

z is cyclic, φ is cyclic, and ∂L/∂t = 0, so p_z = const, p_φ = const, H = E = const. The
Hamiltonian is

H = p_ρ²/2m + p_z²/2m + (1/2mρ²)[p_φ − (e/2c) B ρ²]² = E    (162)
Thus the motion, treated as a trajectory in phase space, moves on "the energy shell" defined
by

E − p_z²/2m ≡ E_⊥ = p_ρ²/2m + (1/2m)[p_φ/ρ − (e/2c) B ρ]² = const    (163)
This algebraic equation gives a closed curve in the (ρ, p_ρ) plane that is quite simple in just
one case: the special case in which p_φ = 0. It is the only case in which ρ = 0 is allowed, i.e.
the orbit passes through the z-axis.
For p_φ = 0,

E_⊥ = p_ρ²/2m + (1/2m)[(e/2c) B ρ]²   (ρ ≥ 0)    (164)

This is the equation for half an ellipse, so the geometry is simple: a half-ellipse in the
(ρ, p_ρ) phase plane, and in configuration space a circular orbit through the z-axis (figures
not reproduced here).
Also,

p_φ = 0  →  φ̇ = eB/2mc    (165)

and

2φ̇ = ω_c = eB/mc    (166)
In this simple case we get out the cyclotron frequency, and we also get the adiabatic invariant:

2πI = (1/2) π ρ_max p_ρ,max = (1/2) π √(2mE_⊥) · [2/(m ω_c)] · √(2mE_⊥) = 2π E_⊥/ω_c

so I = E_⊥/ω_c.

Since it is always possible to choose the origin of the coordinates so that the orbit
of the particle passes through the z-axis, the above result is in fact the general case for the
adiabatic invariant of a charged particle in a B field. We can express it in various ways
(see HW):
a) the radius of a particle's orbit changes inversely as the square root of the magnetic
field;
b) the amount of magnetic flux linking the particle's orbit is a constant;
c) the magnetic moment of the circulating charged particle is a constant.
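A numeric sketch (entirely my own; it assumes the symmetric-gauge induced electric field E = −(1/2c)(dB/dt) ẑ × r that accompanies a uniform B(t), with e = c = m = 1) showing the magnetic moment staying put while B doubles:

```python
# Charge gyrating in B(t) ez, including the induced E field; track the
# invariant mu = v_perp^2 / (2 B) (the magnetic moment up to constants).
import numpy as np

eps, B0 = 1e-2, 1.0
B = lambda t: B0 * (1.0 + eps * t)
Bdot = eps * B0

def deriv(t, s):
    x, y, vx, vy = s
    ax = 0.5*Bdot*y + B(t)*vy        # e(E + v x B), with e = c = m = 1
    ay = -0.5*Bdot*x - B(t)*vx
    return np.array([vx, vy, ax, ay])

s, t, dt = np.array([0.0, 1.0, 1.0, 0.0]), 0.0, 1e-3  # gyro-orbit about origin
for _ in range(100_000):             # t = 0 .. 100, B goes 1 -> 2
    k1 = deriv(t, s); k2 = deriv(t + dt/2, s + dt/2*k1)
    k3 = deriv(t + dt/2, s + dt/2*k2); k4 = deriv(t + dt, s + dt*k3)
    s = s + dt/6*(k1 + 2*k2 + 2*k3 + k4); t += dt

mu = 0.5*(s[2]**2 + s[3]**2) / B(t)
print(mu)                            # stays near the initial value 0.5
```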
1.7 Action-Angle Variables

We'll now consider periodic systems where λ is constant, so that the system is closed. We
want to perform a canonical transformation from the old q, p to a conjugate pair where I is
the new "momentum". The generating function that will get us there is

F = W(q, E; λ) = ∫ p(q, E; λ) dq    (167)

taken for a given constant E and λ. For a closed system, we can replace E with I, since it
is a function of the energy, and we can write W = W(q, I; λ) and

∂W/∂q |_E = ∂W/∂q |_I    (168)

and so

p = ∂W(q, I; λ)/∂q    (169)
(from the formulae for canonical transformations). We can use the other relation for canon-
ical transformations to get the "coordinate" variable

θ = ∂W(q, I; λ)/∂I    (170)

So I and θ are canonical variables, where I is called the action variable, and θ the angle
variable.
We are considering a conservative system with a time-independent Hamiltonian, and we
have therefore used a generating function not explicitly dependent on time. The new H′ is
therefore just H expressed in terms of the new variables: H′ is just E(I) expressed as a
function of the action variable. Hamilton's equations in the action-angle variables are

dI/dt = 0,   θ̇ = dE(I)/dI    (171)

The first shows that I is constant (as we knew). The second equation shows that θ is
linearly increasing with time:

θ = (dE/dI) t + const = ω(I) t + const    (172)

and we equate it with the phase of the oscillations.
W(q, I) is a many-valued function of the coordinates which increases each period by

ΔW = 2πI    (173)

We see this from the definition W = ∫ p dq and I = (1/2π) ∮ p dq.
If we express q, p in terms of the action-angle variables, these must remain unchanged
when θ → θ + 2π (with I const). So q, p are periodic functions of θ with period 2π.
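A small quadrature check (my addition) of I(E) = E/ω for the SHO:

```python
# Compute I = (1/2pi) * closed-integral of p dq for the SHO numerically
# and compare with E/omega.
import numpy as np

m, omega, E = 1.0, 2.0, 3.0
q_max = np.sqrt(2*E/(m*omega**2))
q = np.linspace(-q_max, q_max, 200001)
p = np.sqrt(np.maximum(2*m*E - (m*omega*q)**2, 0.0))

I = np.trapz(p, q) / np.pi      # upper half of the ellipse, x2, then /2pi
print(I, E/omega)               # both ~1.5
```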
The action-angle variables may seem to be of very limited utility given the restrictions,
but they are important historically and also in numerous astronomical contexts. These
action-angle variables can also be used to formulate the EOM when the system is not closed,
and λ = λ(t). Then we still have p = ∂W(q, I; λ)/∂q and θ = ∂W(q, I; λ)/∂I, and

W(q, E; λ) = ∫ p(q, E; λ) dq    (174)

In the same approximation we used to get the adiabatic invariant, we calculate W(q, E; λ) =
∫ p(q, E; λ) dq and I = (1/2π) ∮ p dq taking λ to have a fixed value, so that W(q, E; λ) is the
same function it was before, but we then allow λ to be λ(t).
The generating function is now an explicit function of time, so we get H′ from

H′ = E(I; λ) + ∂W/∂t = E(I; λ) + (∂W/∂λ)|_{q,I} λ̇

where we express (∂W/∂λ)|_{q,I} in terms of I and θ after differentiating with respect to λ.
Hamilton's equations are then

dI/dt = −∂H′/∂θ = −(∂/∂θ)[(∂W/∂λ)|_{q,I}] λ̇    (175)

θ̇ = ∂H′/∂I = ω(I; λ) + (∂/∂I)[(∂W/∂λ)|_{q,I}]|_{θ,λ} λ̇    (176)

where ω = (∂E/∂I)|_λ is the oscillation frequency calculated as if λ were constant.
