Escolar Documentos
Profissional Documentos
Cultura Documentos
J. Maes
As time goes on, it becomes increasingly evident that the rules which the
mathematician finds interesting are the same as those which Nature has
chosen.
Paul Dirac at age 36, ([13])
Abstract
This master thesis is not the work of just one person, but the result of the
dedication of many people. First of all, I would like to thank my supervisors.
Dr. Gleb Arutyunov from the Institute for Theoretical Physics, who had the
brilliant ability to give strong intuition for abstract notions in physics and
mathematics, without doing deficit to the theory itself, and Dr. Andre Hen-
riques from the Department of Mathematics, who amazed me with his deep,
fundamental understanding of the concepts we were discussing. He succeeded
in significantly improving my skills in abstract mathematics the past year,
by going back to the very fundamentals. Both of my supervisors were always
willing to help me. Some of our appointments even took more then two hours.
I am sure I forget to mention people here which do owe credit to this thesis.
This is my final thanks to all of those people.
Contents
1 Conventions and notation 8
2 Introduction 10
3 Vocabulary 14
3.1 The language of manifolds . . . . . . . . . . . . . . . . . . . . 14
3.2 Structure-preserving maps (morphisms) . . . . . . . . . . . . . 18
3.3 Lie groups and Lie algebras . . . . . . . . . . . . . . . . . . . 20
3.4 Actions and representations . . . . . . . . . . . . . . . . . . . 23
4 Coadjoint orbits 26
4.1 The coadjoint representation . . . . . . . . . . . . . . . . . . . 26
4.2 The coadjoint orbits of SU(2) . . . . . . . . . . . . . . . . . . 31
4.2.1 Deriving the coadjoint orbits by finding conjugation-
invariant functions on su(2)∗ . . . . . . . . . . . . . . . 31
4.2.2 Deriving the coadjoint orbits by comparing with SO(3) 35
6 Geometric quantization 55
6.1 The mathematical framework of classical
mechanics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
6.1.1 A crash course in symplectic geometry . . . . . . . . . 56
6.1.2 Symplectic geometry and classical mechanics . . . . . . 62
6.2 From classical to quantum mechanics . . . . . . . . . . . . . . 66
6.3 A heuristic discussion of prequantization . . . . . . . . . . . . 70
6.4 Formalizing the discussion . . . . . . . . . . . . . . . . . . . . 74
6.4.1 Putting more structure on vector bundles . . . . . . . . 74
6.4.2 Čech cohomology and the de Rham isomorphism . . . 81
6.5 The integrality condition . . . . . . . . . . . . . . . . . . . . . 86
6.6 The prequantum Hilbert space . . . . . . . . . . . . . . . . . . 95
6.7 Polarizations . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98
6.7.1 Polarizations in general . . . . . . . . . . . . . . . . . . 98
6.7.2 Kähler polarizations . . . . . . . . . . . . . . . . . . . 101
6.8 The Hilbert space . . . . . . . . . . . . . . . . . . . . . . . . . 104
References 130
Index 133
1 Conventions and notation
-Today’s shocks are tomorrow’s conventions.-
Carolyn Heilbrun ([15])
Notation:
K: the field of either the real or complex numbers.
U ≡ {Ui }i∈A : an open cover of a manifold. Here A denotes some index set.
S 2 : the two-dimensional sphere.
(M, ω): a symplectic manifold M with symplectic 2-form ω.
(M, J, ω): a Kähler manifold M with symplectic 2-form ω and complex struc-
ture J.
Tm M : the tangent space of a manifold M at the point m ∈ M .
T M : the tangent bundle of a manifold M .
T ∗ M : the cotangent bundle of a manifold M .
Ωk (M ): the set of k-forms on a manifold M .
Vect(M ): the set of vector fields on a manifold M .
Vect(M ; P ): the set of vector fiels X such that X|m ∈ Pm for every m ∈ M .
Here P denotes a polarization.
End(X): the set of endomorphisms of a mathematical object X.
Aut(X): the set of automorphisms of a mathematical object X.
Mat(n, K): the group of invertible n × n-matrices with entries in K.
GL(V ): the group of invertible, linear transformations on a vector space V .
C ∞ (M, K): the space of either real or complex-valued functions on a mani-
fold M .
C ∞ (M, K; P ): the functions in C ∞ (M, K) whose flow leaves the polarization
invariant.
L2 (M, C): the space of square-integrable, smooth, complex-valued functions
on a manifold M .
P (M ): the space of polynomials on an manifold M .
Pn (M ): the space of homogeneous polynomials of degree n on an manifold
M.
ΓX (M ): the set of smooth sections over a real manifold M of a line bundle
X.
h∗ : the push-forward corresponding to the diffeomorphism h : M → N with
M , N manifolds.
8
h∗ : the pull-back corresponding to the smooth map h : M → N with M , N
manifolds.
exp: the exponential map.
g: the Lie algebra of a Lie group G.
Id: the Identity map.
π = (π, V ): a representation of a Lie group G in a vector space V .
π ∗ = (π ∗ , V ∗ ): the representation dual to π in the dual space V ∗ .
V C : the complexification of a vector space V . a
T M C : the complexified tangent bundle. T M C ≡ (Tm M )C .
m∈M
V ∗ : the dual of a vector space V .
Ad: the adjoint representation.
Ad∗ : the coadjoint representation.
ad: ad ≡ Te Ad.
ad∗ : ad∗ (X) ≡ (−ad(X))∗ for X ∈ g.
Xf : a Hamiltonian vector field corresponding to a real scalar function f .
Lv : the Lie derivative along a vector field v.
∇: the connection.
[, ]: the Lie bracket.
{, }: the Poisson bracket.
h, i: evaluated on a linear functional f in the first slot and a vector v in the
second slot it denotes the value of the linear functional f on the vector v.
Otherwise, it denotes the inner product.
C p (U, R): the set of p-cochains corresponding to real, locally constant func-
tions. Here U denotes an open covering of a manifold M .
Z p (U, R): the set of p-cocycles corresponding to real, locally constant func-
tions. Here U denotes an open covering of a manifold M .
H p (U, R): the pth Čech cohomology group with coefficients in R. Here U
denotes an open covering of a manifold M .
H p (M, R): the pth de Rham cohomology group of a manifold M with coef-
ficients in R.
Conventions:
Skew symmetrization: T [ab...c] = p!1 σ sign(σ)T σ(a)σ(b)...σ(c) ,
P
where T is a p-index tensor and the summations are over all permutations
of a, b, ..., c.
Interior product: i(X)ω = kω(X, ...) for a vector field X and ω ∈ Ωk (M ),
where M denotes a manifold.
9
2 Introduction
- When one looks back over the development of physics, one sees that it can
be pictured as a rather steady development with many small steps and su-
perposed on that a number of big jumps. These big jumps usually consist in
overcoming a prejudice... And then a physicist has to replace this prejudice
by something more precise, and leading to some entirely new conception of
nature.-
Paul Dirac on Quantum Theory ([13])
At first sight, classical physics could thus provide us with a very satisfactory
description of the world we live in. However, it fails to give an explanation
for a number of phenomena observed at the microscopic level and, moreover,
is in plain contradiction with experimental evidence.
As an illustration of what these phenomena are and what they may be try-
ing to tell us about a theory which will have to replace classical physics, we
mention two examples. The first is regarding the stability of atoms. From
scattering and other experiments it had been deduced that atoms consist of a
tiny positively charged nucleus orbited at quite some distance by negatively
charged point-like particles, the electrons. Classical physics would predict
such structures to collapse within fractions of a second, in contradiction with
the stability of the world around us: orbital motion is accelerated motion,
and according to Maxwell theory accelerated charges emit radiation; conse-
quently, the electron would radiate away energy and spiral into the nucleus
of the atom. It was also observed that simple atoms (like hydrogen) were
able to adsorb and emit energy only in certain discrete quantities. Com-
bining these two observations it thus appeared to be necsesary to postulate
the existence of stable orbits for electrons at certain discrete radii (energy
levels). This suggests that at the microscopic level, as opposed to in classical
physics, nature allows for a discrete (or quantized) structure.
10
The second example is the set of experiments related to the nature of light.
On the one hand, there is the famous double-slit experiment which inves-
tigated the diffraction and interference patterns of beams of particles like
electrons. The experiment is depicted in figure 1 (from [11]). It indicated
Figure 1: The double-slit experiment. A single photon (being a probability wave) departs
from slit a in plate S1. When the wavefront reaches slits b and c in plate S2 it creates two
new probability waves, one emerging from slit b and one from slit c. The single photon
will show up on the detector screen F (at say d) according to the net probability values
resulting from the co-incidence of the probability waves coming by way of the two slits b
and c.
that under certain circumstances particles (like electrons) can show inter-
ference patterns and thus wave-like nature. On the other hand, Einstein’s
interpretations of the photo-electric effect made clear that light showed be-
haviour characteristic of particles and not of waves. In the words of the great
Dirac (see [12, p.3]),
‘We have here a very striking and general example of the breakdown of clas-
sical mechanics - not merely an inaccuracy of its laws of motion, but an
inadequacy of its concepts to supply us with a description of atomic events.’
11
account. In particular, it implied that one cannot make any observations on
a suitably ‘small’ system without perturbing the system itself (expressed in
the uncertainty principle).
However, at the conceptual level the situation was not satisfactory. In partic-
ular, it was not clear how general the proposed models were, which features
were to be regarded as fundamental to any quantum version of a classical the-
ory and which were to be attributed to particular properties of the systems
considered so far. To gain some insight into this question, it was in particular
Dirac who emphasized the formal similarities between classical and quantum
mechanics and the necessity of properly understanding these. Abstracting
from the analogy found between classical mechanics and Schrödinger and
Heisenberg quantum mechanics, Dirac formulated a general quantum condi-
tion, a guideline for passing from a given classical system to the correspond-
ing quantum theory. This process in general is known as quantization .
Roughly speaking, quantization consists of replacing the classical algebra of
observables (smooth functions on phase space) by an algebra of operators act-
ing on some Hilbert space, the quantum condition relating the commutator
of two operators to the Poisson bracket of their classical counterparts:
12
Furthermore, there is no reason to believe that such a construction would
be unique as there could well be (and, in fact, are) lots of different quantum
theories which have the same classical limit. Unfortunately, however, it is
conceptually very difficult to describe a quantum theory from scratch, with-
out the help of a reference classical theory. Moreover, there is enough to the
analogy between classical and quantum mechanics to make quantization a
worthwile approach. Perhaps, ultimately, the study of quantization will tell
us enough about quantum theory itself to do without the very concept of
quantization.
13
3 Vocabulary
-The golden age of mathematics, that was not the age of Euclid, it is ours.-
C.J. Keyser ([16])
We call the manifold M smooth when all the maps φα,β are infinitely differ-
entiable.
• the functions xiα = xi ◦ φα are called local coordinates. Here {xi }1≤i≤n
are the standard coordinates in Rn ;
14
0
Definition 3.1.2. Two atlases {Uα }α∈A and {Uβ }β∈B are called equivalent
if the transition functions from any chart of the first atlas to any chart of the
second one are k-smooth (whenever defined). The structure of a smooth
manifold on M is an equivalence class of atlases.
Definition 3.1.4. A vector bundle over the manifold M (called the base
manifold) is a real manifold X (called the total space) together with a
smooth map pr : X → M (called the projection) such that
15
at m ∈ M is a smooth map γ from an open neighbourhood of 0 in R to M
with γ(0) = m. On the collection of curves at m we define an equivalence
relation: γ1 ≡ γ2 if for a chart U at m and the corresponding diffeomorphism
α, α ◦ γ1 and α ◦ γ2 have the same derivative in 0. The equivalence class of
γ relative to this equivalence relation will be called the derivative of γ at
0 and will be denoted by γ̇(0). This relation is independent of the choice of
U . This leads us to the following definition:
The union T M forms a vector bundle over M with the projection pr that
maps all elements of Tm M to m: Indeed, let (U, φ), with U ⊂ M , be a local
chart with coordinates (x1 , ..., xn ). First of all, Tm M is a vector space. How
is addition and scalar multiplication defined on this vector space? If we take
two curves γ(t) and γ̃(t) trough m with γ(0) = γ̃(0) = m, then adding them
or multiplying them by a scalar need not produce a curve in M . However,
we can add these curves, using the chart φ, as follows:
(1) Addition: (γ + γ̃)(t) = φ−1 ((φ(γ(t))) + φ(γ̃(t)))
(2) Scalar multiplication: (λγ)(t) = φ−1 (λφ(γ(t))) (λ ∈ R)
Taking the derivatives of γ + γ̃ and λγ at the point 0 ∈ R will, by the chain
rule, produce the sum and scalar multiples of the corresponding tangent vec-
tors (see also [30]). Furthermore, any tangent vector a ∈ Tm M can be written
as a = ak ∂k where ak = ak (x) are numerical coefficients and ∂k ≡ ∂/∂xk de-
notes the partial derivative. So the set pr−1 (U ) is identified with U × Rn . It
is by definition a local chart on T M with coordinates (x1 , ..., xn ; a1 , ..., an ).
This proves the claim.
16
is dual to Vect(M ). Locally, we can write a differential 1-form as the expres-
sion ω = ωi dxi for some chart U , where {dxi }1≤i≤n forms the basis of Ω1 (U )
dual to the basis {∂i }1≤i≤n of Vect(U ).
X
ω= ωi1 ,...,ik (x)dxi1 ∧ ... ∧ dxik , (1)
i1 <...<ik
(2) Flatness: d ◦ d = 0.
(3) Leibniz rule: d(α ∧ β) =dα ∧ β + (−1)p (α∧ dβ) where α is a differential
p-form.
We will end this subsection with a very important object in differential geom-
etry, which is the Lie derivative L. Let M be a manifold. Three particular
cases of the Lie derivative are:
The Lie derivative acting on differential k-forms (case (c)) has some particular
properties. In order to quote them we first need to define an auxiliary notion.
17
Definition 3.1.7. A flow on a manifold M is a smooth map H : R×M → M
with the property that if Ht : M → M is defined by Ht (p) = H(t, p), then
H0 is the identity map and Hs Ht = Hs+t for all s, t ∈ R.
You may think of a flow in the definition above as a fluid in motion which
does not change in time (the map in the definition above then describes how
a given fluid particle moves in time). The orbit of a point p ∈ M is the image
of the map γp : R → M γp (t) ≡ H(t, p). The orbit map γp defines a tangent
vector γ̇p (0) ∈ Tp M . In the physical situation of a fluid in motion this can
be interpreted as the velocity of the particle at p. By letting p vary, we get
a vector field on M . We call this vector field the infinitesimal generator
∂H
of the flow and denote it by ∂t t=0 .
We now quote without proof three properties satisfied by the Lie derivative
acting on differential k-forms.
Theorem 3.1.1. The Lie derivative L obeys the following three properties.
(1) commutation with d: dLV = LV d,
18
subsection is therefore an enumeration of the concrete types of morphisms
used in this thesis and their definitions.
We start with:
We say that h is compatible with the group structure. In general, the def-
inition of homomorphism depends on the type of algebraic structure under
consideration. The common theme is that a homomorphism is a function
between two algebraic objects that respects the algebraic structure. Let us
now define:
The set of all automorphisms is a subset of End(X), which denotes the set of
all endomorphisms of a mathematical object X, with a group structure, called
19
the automorphism group of X and denoted by Aut(X). The definitions
and their relations are summarized in the diagram below:
automorphism −−−→ isomorphism
y y
endomorphism −−−→ (homo)morphism.
Here an arrow denotes implication.
We finally define:
Definition 3.2.5. (Diffeomorphism)
Given two manifolds M and N , a bijective map f from M to N is called a
diffeomorphism if both f : M → N and its inverse f −1 : N → M are differ-
entiable (if these functions are r times continuously differentiable f is called
a C r -diffeomorphism). A diffeomorphism is an isomorphism in the category
of smooth manifolds. Two manifolds M and N are (C r -)diffeomorphic if
there is an (r times) continuously differentiable bijective map f from M to
N with an (r times) continuously differentiable inverse.
Two useful notions related to morphisms are the push-forward and pull-
back .
Push-forward: given a diffeomorphism h : M → N , then objects on M (such
as a function f : M → R or a vector field V : M → T M ) can be transfered
to N to give a corresponding object on N . This is called push-forward and
the transfering map is often denoted by h∗ . For instance, h∗ f ≡ f ◦ h−1 and
h∗ V ≡ Dh ◦ V ◦ h−1 : N → T N .
Pull-back: similarly, the pull-back refers to transfering objects on N via h to
objects on M and is denoted by h∗ . This is in fact the push-forward of h−1 .
The difference is however that a pull-back often makes sense when h only is
a C ∞ -map (without insisting that it is a diffeomorphism). For instance, the
pull-back of a function g on N is simply h∗ g ≡ g ◦ h.
20
Next we come to the concept of isomorphic Lie groups. Let G and H be Lie
groups.
Before we define the exponential map we first need to define what left in-
variant vector fields are. A vector field v ∈ Vect(G) is called left invariant if
(lx )∗ v = v for all x ∈ G, where lx is the translation map lx : G → G y 7→ xy,
which is a diffeomorphism. Equivalently, v is left invariant, when
From the above equation with y = e we see that a left invariant vector field
is completely determined by its value v(e) ∈ Te G. Now
d
αX (t) = vX (αX (t)). (3)
dt
The following lemma, of which the proof can be found in for example
[3, p.16], states some properties of the exponential map and at the same
times justifies its name.
21
We now come to a very important application of the exponential map. The
proof of the lemma can be found in [3, p.17].
Definition 3.3.3. A real Lie algebra is a real linear space a equipped with
a bilinear map [·, ·] : a × a → a such that for all X, Y, Z ∈ a we have
(2) [X, [Y, Z]] + [Y, [Z, X]] + [Z, [X, Y ]] = 0 (Jacobi-identity)
Note that:
for all X, Y ∈ a.
22
3.4 Actions and representations
To understand what a representation is we first need some theory about
group actions.
(b) α(e, m) = m (m ∈ M ).
23
Sets of the form mG (m ∈ M ) are called orbits of the action α. Note that
for two orbits m1 G, m2 G either m1 G = m2 G or m1 G ∩ m2 G = ∅. Thus,
the orbits constitute a partition of M . The set of all orbits, called the or-
bit space , is denoted by M/G. Let us denote the canonical projection
M → M/G, m 7→ mG by π.
The orbit space X = M/G is equipped with the quotient topology. This
is the finest topology for which the map π : M → M/G is continuous. Thus,
a subset O of X is open if and only if its preimage π −1 (O) is open in M .
We quote without proof (for the proof the reader is referred to [3, p.69]):
Besides, the notion of a representation of a Lie group there also exists the
notion of a representation of a Lie algebra.
24
Definition 3.4.6. Let g be a Lie algebra. A representation of g in a
complex linear space V is a bilinear map ι × V → V, (X, v) 7→ Xv, such
that
[X, Y ]v = XY v − Y Xv
Earlier in this section we already defined what it means for two atlases to be
equivalent. We end this section by stating what it means for two represen-
tations to be equivalent.
If the above representations are finite dimensional, then one does not need to
require T to be continuous, since every linear map V1 → V2 has this property.
25
4 Coadjoint orbits
-Isaac Newton encrypted his discoveries in analysis in the form of an anagram
that deciphers to the sentence, ‘It is worthwhile to solve differential equa-
tions’. Accordingly, one can express the main idea behind the orbit method
by saying ‘It is worthwhile to study coadjoint orbits’.-
A.A. Kirillov ([1])
The notion of coadjoint orbits is the main ingredient of the orbit method. It
is also the most important new mathematical object that has been brought
into consideration in connection with the orbit method.
Because here G is a matrix group the Lie algebra g is a subspace of Mat(n, R).
Hence the adjoint representation is simply matrix conjugation:
Ad(g)X = g · X · g −1 , X ∈ g, g ∈ G. (5)
26
any linear representation (π, V ) of a group G one can define a dual represen-
tation (π ∗ , V ∗ ) in the dual space V ∗ :
where the asterisk in the right-hand side means the dual operator in V ∗
defined by:
Now the groups under consideration are matrix groups. What specific form
does the coadjoint representation take for matrix groups? The space Mat(n, R)
has a bilinear form
The bilinear form S gives an identification between Mat(n, R) and Mat(n, R)∗
via the map β : A → S(A, −) ∈Mat(n, R)∗ defined by β(A)(B) = S(A, B).
27
Now we want to show that the space g∗ , dual to te subspace g ⊂Mat(n, R)
can be identified with the quotient space Mat(n, R)/g⊥ , where the sign ⊥
means the orthogonal complement with respect to the the form S:
To start with the following table comparing properties of a map and its
dual is good to have in mind:
f f∗
isomorphism isomorphism
zero image zero image
injective surjective
surjective injective
Now consider the following short exact sequence of vector spaces (π ◦ ι = 0):
ι π
A ,→ B → C
28
g(b + ι(a)) = g(b) + g(ι(a)) = g(b), because g ∈ A⊥ . It is left to check
that g = π ∗ (f ), because then g ∈ π ∗ (C ∗ ). Indeed g(b) = f ([b]) =
f (π(b)) = (f ◦ π)(b) ≡ π ∗ (f )(b).
From π ∗ (C ∗ ) ⊂ A⊥ and A⊥ ⊂ π ∗ (C ∗ ) follows π ∗ (C ∗ ) = A⊥ .
Now take A = g, B = Mat(n, R), ι the inclusion map from g to Mat(n, R) and
π the projection map from Mat(n, R) to Mat(n, R)/ι(g). Indeed π ◦ ι = 0
for this choice, by definition of the quotient space Mat(n, R)/ι(g). It fol-
lows by the previous argument that g∗ ∼ = Mat(n, R)∗ /g⊥ . Here is used
S(A, ι(B)) = β(A)(ι(B)) and β(A) ∈ Mat(n, R)∗ . Finally, because the
bilinear form S is non-degenerate, the map β is an isomorphism. Hence
Mat(n, R) ∼ =Mat(n, R)∗ /g⊥ ∼
=Mat(n, R)∗ . We arrive at g∗ ∼ =Mat(n, R)/g⊥ .
In practice this quotient space Mat(n, R)/g⊥ is often identified with a sub-
space V ⊂Mat(n, R) that is transversal to g⊥ and has complementary di-
mension. So we can write Mat(n, R) = V ⊕ g⊥ . Let pV be the projection
of Mat(n, R) onto V parallel to g⊥ . Hence for matrix groups the coadjoint
representation takes the form:
Ad∗ (g) : g∗ ∼
= V → g∗ ∼
= V F 7→ pV (gF g −1 ), g ∈ G = Mat(n, R). (11)
29
We will again derive what specific form the Lie bracket takes for matrix
groups. First of all, note that by the chain rule
d
ad(X) = Ad(exp tX), t ∈ R (12)
dt t=0
Ad(g)X = g · X · g −1 , X ∈ g, g∈G
• is a bilinear map,
It follows that Te G together with the Lie bracket is a Lie algebra. This jus-
tifies the identification of Te G with g, the Lie algebra of G, already used
earlier in this subsection. The Lie bracket is a Lie algebra representation
(also called the infitesimal representation associated with the representa-
tion of the Lie group) of g in End(g), which follows from the definition
of a Lie algebra representation given in section 3. Namely, for Z ∈ g,
[X, Y ]ad(Z) = (XY − Y X)ad(Z) = XY ad(Z) − Y Xad(Z).
We are now ready to define the infinitesimal version of the coadjoint rep-
resentation:
30
Definition 4.1.6. Let X, Y ∈ g, F ∈ g∗ . Then ad∗ : g × g∗ → g∗ is
defined by ad∗ (X) : g∗ → g∗ ad∗ (X) ≡ (−ad(X))∗ . So by definition,
h(−ad(X))∗ F, Y i = hF, −ad(X)Y i = hF, −[X, Y ]i = hF, [Y, X]i.
For matrix groups it takes the form
From the conditions on x contained in the definition we can derive the form
of a matrix in SU(2). Let us first take a general matrix in Mat(2, C). It has
the form
α β
γ δ
31
The next step is to calculate the Lie algebra of SU(2), which we will denote
by su(2). We can write x = expX for X ∈ g, ∈ R small and where exp is
the exponential map defined in subsection 3.3. Taylor expanding this expres-
sion around and keeping terms up to first order in we have x = I + X,
so the condition x† x = I translates into
exptr(X) ∼
= I + trX = I ⇔ trX = 0.
Hence,
su(2) = {X ∈ Mat(2, C)|X = −X † , trX = 0}. (17)
Again, from the conditions on X we can derive the form of a matrix in su(2).
Take the same general matrix in Mat(2, C) as before. At the level of matrices
the condition X = −X † becomes
∗
−α −γ ∗
α β
=
γ δ −β ∗ −δ ∗ .
Hence it follows that
(i) α∗ = −α, (ii) δ ∗ = −δ, (iii) − β = γ ∗ and (iv) − β ∗ = γ.
If we write α = a + ib, β = c + id, γ = e + if, δ = g + ih and note that the
conditions (iii) and (iv) are equivalent we deduce that
a = 0, g = 0, d = f, c = −e.
Finally tr(X) = 0 implies that b = −h. We conclude that at the level of
matrices
ib c + id
su(2) = b, c, d ∈ R (18)
−c + id −ib .
The Lie algebra su(2) is semisimple. To show this, it suffices to show that
su(2) has an invariant inner product, since SU (2) is a compact Lie group
(see [31]). An inner product h, i is invariant under the action of su(2) if, for
X, Y, Z ∈ su(2) ([32]),
h[X, Y ], Zi = −hY, [X, Z]i, (19)
32
where [, ] denotes the Lie bracket. Since su(2) is a matrix group, the Lie
bracket [, ] is just a commutator bracket. Consequently, an invariant inner
product is given by
hX, Y i = tr(XY ), (20)
for X, Y ∈ su(2). Indeed,
for X, Y, Z ∈ su(2).
By remark 4.1.1. we can now drop the projection pV in equation (7). Further-
more, we know by the previous subsection that on matrix groups Mat(n, R)
there exists a bilinear form S and the corresponding earlier defined isomor-
phism β. This isomorphism β tells us that for SU(2) we have g∗ ∼ = g. Take
∗ ∼ ∼ 3
F ∈ g = g = R . Now by definition 4.1.3. in order to find the coadjoint
orbits of SU(2) we look for the points F 0 ∈ g∗ ∼
=g∼
= R3 for which
F 0 = gF g −1 for a certain g ∈ SU(2). (21)
Let Q be a funtion Q: su(2) → C which is conjugation invariant. Then (21)
implies that Q(F 0 ) = Q(gF g −1 ) = Q(F ) with F 0 ∈ Ω, a coadjoint orbit of
SU(2). Hence Q(Ω) = constant ∈ C. That is, different coadjoint orbits are
characterized by different constants in the equation above. We have to check
all conjugation invariant functions on su(2) to find all coadjoint orbits of
SU(2).
33
−b2 − c2 − d2 ...
tr = -2(b2 + c2 + d2 ) = constant.
... −b − c2 − d2
2
b2 + c2 + d2 = r2 . (22)
The reader can convince himself/herself that all conjugation invariant func-
tions on su(2) are similar to the ones described above and hence lead to
the same coadjoint orbits: two-dimensional spheres or points. The formal,
rigourous proof of the fact that the two-dimensional spheres and a point
exhaust the coadjoint orbits of SU(2) will not be given in this thesis.
34
4.2.2 Deriving the coadjoint orbits by comparing with SO(3)
Another way to see that the coadjoint orbits of SU(2) are two-dimensional
spheres or a point is to establish an isomorphism between the Lie algebras
su(2) and so(3) and then prove from this that the coadjoint orbits of both Lie
groups are equivalent. Because SO(3) is the group of rotations in R3 it is then
clear from visual point of view that its coadjoint orbits are two-dimensional
spheres or a point.
G×g Ad / g PPP
PPP
PPPφ̃
IdG ×φ̃ PPP
PPP
PP'
G×h / H ×h / h
φ×Idg Ad
with IdG and Idh the identity mappings on G and g respectively. The fact
that these diagrams are equivalent is obvious from the fact that
φ × φ̃ = (φ × Id) ◦ (Id × φ̃).
35
So in the particular case considered above we have V = g, W = h, φ = φ̃,
α1 = Ad and α2 = Ad ◦ (φ × Id). Indeed α2 is a continuous representation
in this case, because the composition of a continuous representation with a
homomorphism is again a continuous representation. It follows that φ̃ is a ho-
momorphism of G-representations and G-orbits in g are sent to G-orbits in h.
However, we are interested in the coadjoint orbits, not in the adjoint ones.
If φ̃ : g → h is a homomorphism, then φ̃∗ : h∗ → g∗ is a homomorphism. The
following commutative diagram is induced:
Ad∗ / g∗ hQ
G ×O g∗ QQQ
QQQ φ̃∗
QQQ
IdG ×φ̃∗ QQQ
QQQ
G × h∗ / H × h∗ / h∗
φ∗ ×Idh∗ Ad∗
Now take G = SU(2) and H = SO(3). We will now show that su(2) ∼ = so(3).
Because su(2) is the algebra of X ∈ Mat(2, C) with X † = −X and tr(X) = 0
one can see from this that as a real linear space su(2) is generated by the
elements
i 0 0 1 0 i
r1 = , r2 = , r3 = .
0 −i −1 0 i 0
Note that rj = iσj , where σ1 , σ2 , σ3 are the famous Pauli spin matrices.
One readily verifies that r12 = r22 = r32 = −I and r1 r2 = −r2 r1‘ = r3 and
r2 r3 = −r3 r2 = r1 .
It follows from the above product rules that the commutator brackets are
given by
36
From this it follows that the endomorphisms ad(rj ) ∈ End(su(2)) have the
following matrices with respect to the basis r1 , r2 , r3 :
0 0 0 0 0 2 0 −2 0
mat(ad r1 ) = 0 0 −2 , mat(ad r2 ) = 0 0 0 , mat(ad r3 ) = 2 0 0 .
0 2 0 −2 0 0 0 0 0
Write Rj = Rej for j = 1, 2, 3. Then by the above formulas for mat ad (rj )
we have
37
From this one sees that x ∈ {−I, I}. Hence ker(φ) = {−I, I}. So
SU(2)/{±I} ∼= SO(3).
Moreover, we can now show that the SU(2)-orbits in so(3)∗ ∼ = su(2)∗ are
equivalent to the SO(3)-orbits in so(3)∗ . Two points x, y ∈ so(3)∗ lie both
on a SU(2)-orbit if g · x = y, where g ∈ SU(2) and ‘·’ means ‘acting’. We
denote this by the equivalence relation x ∼SU(2) y. Because φ is a homomor-
phism g · x = y ⇒ φ(g) · x = y. The fact that φ(g) ∈ SO(3) thus gives
x ∼SU(2) y ⇒ x ∼SO(3) y. Now suppose that the points x, y ∈ so(3)∗ lie both
on a SO(3)-orbit, that is, h · x = y for h ∈ SO(3). As proved earlier, φ is
surjective, meaning ∃ g such that φ(g) = h. This implies that g · x = y.
Hence x ∼SO(3) y ⇒ x ∼SU(2) y and we are done.
In other words, the coadjoint orbits of SO(3) and SU(2) are equivalent.
38
5 Symplectic structure on coadjoint orbits
-One feature of coadjoint orbits is eye-catching when you consider a few ex-
amples: they always have an even dimension. This is not accidental, but has
a deep geometric reason.-
A.A. Kirillov ([1, p.4])
From the definition it follows that symplectic manifolds should have even
dimension. Indeed,
The important and beautiful fact that the coadjoint orbits of Lie groups (in
our case matrix groups) are symplectic manifolds is encoded in the following
theorem:
39
Remark 5.1.1. Notice that although this theorem applies only to matrix
groups, the result can also be proven for general Lie groups. The proof of this
more general theorem can be found in for example [2, p.227-230]. Because
in applications in (mathematical) physics the Lie groups are almost always
matrix groups the above theorem suffices for the purposes of this thesis.
Proof. In this proof we will proceed as follows. We will first prove the ex-
istence of this differential form. Then we will prove that the differential
form is nondegenerate, G-invariant and closed, respectively. This together
establishes the claim of the theorem.
Now take for M the orbit space Ω, for the curve the specific realisation
t 7→ exptX with t ∈ R and for the action the coadjoint representation.
Then for F ∈ Ω
d
ξX (F ) = Ad∗ etX F =
dt t=0
d
pV etX F e−tX = pV ([X, F ]).
(23)
dt t=0
40
Te G ≡ g → Te Ω ≡ g/Lie(StabG (F )) and the fact that these choices for
a and b do not depend on the choices for X, Y ∈ Lie(Ω). Namely, for
Z ∈ Lie(StabG (F )) we have
We are now in the positon to define B(a, b) ≡ BΩ (F )(X, Y ) ≡ hF, [X, Y ]i.
Indeed this definition does not depend on our choice for X, Y ∈ Lie(Ω):
take Z ∈ Lie(StabG (F )). Then
so it follows that,
41
• G-invariance: BΩ (F )(X, Y ) being G-invariant means that
lg∗ BΩ (F )(X, Y ) = BΩ (F )(X, Y ) where lg∗ is the pull-back of lg as defined
in section 3. In general, a mathematical object α has a natural group
action defined on it and we call the mathematical object G-invariant
if lg∗ α = α. If α is a function f we get lg∗ (f ) : M → R m 7→ f (g · M )
and lg∗ f = f ⇔ f (g · m) = f (m). Here · means ‘acting on’. This gives,
using BΩ (F )(X, Y ) ∈ C ∞ (Ω), that,
Furthermore,
Hence,
42
We claim that dB(X, Y, Z) = X · B(Y, Z) − B([X, Y ], Z) + c.p.. Here
is used that for a function f and and a vector field v the expres-
sion v · f denotes the directional derivative of f in the direction v,
∂f
that is v · f ≡ v i ∂x i. Let us check this claim by first calculating
X · B(Y, Z) + c.p.. We have
X · B(Y, Z) + c.p. =
X ∂ X
Xk k
Bij (Y i Z j − Y j Z i ) + c.p. =
k
∂X i<j
XX
{X k (∂k Bij )Y i Z j + X k Bij ∂k Y i Z j + X k Bij Y i ∂k Z j }
i<j k
XX
− {X k (∂k Bij )Y j Z i + X k Bij ∂k Y j Z i + X k Bij Y j ∂k Z i } + c.p. (24)
i<j k
It follows that the terms containing ∂k Bij match with the expression
for dB(X, Y, Z) found at the beginning. So it is only left to check that
the terms not containing ∂k Bij cancel against −B([X, Y ], Z)+c.p.. We
have
−B([X, Y ], Z) + c.p. =
− i<j Bij ([X, Y ]i Z j − [X, Y ]j Z i ) + c.p.
P
+X k ∂k Y j Z i − Y k ∂k X j Z i ) + c.p. (25)
Hence the terms not containing ∂k Bij in (24) indeed cancel against the
terms in (25). This verifies the claim.
X · B(Y, Z) − B([X, Y ], Z) =
(LX B)(Y, Z) + B(LX Y, Z) + B(Y, LX Z) − B([X, Y ], Z) =
B([X, Y ], Z) + B(Y, [X, Z]) − B([X, Y ], Z) = B(Y, [X, Z]).
43
Here is used that LX B = 0: earlier in this proof we showed that B
is G-invariant. Write g = exptA for t ∈ R, A ∈ g, g ∈ G. Then B
being G-invariant means lg∗ (B) = B with lg the left translation map.
∗ ∗
So lexp(tA) (B) = B ⇔ dtd t=0 lexp(tA) (B) ≡ LXA (B) = 0 and we are done.
One can apply similar mathematics on the cyclic permutations of the
expression X · B(Y, Z) − B([X, Y ], Z). We end up with the three terms
B(Y, [X, Z]), B(X, [Z, Y ]), B(Z, [Y, X]). Using the anti-symmetry of B
and the Lie bracket, rewriting of these three terms gives:
where the anti-symmetry of the Lie bracket is used in the last two
identities. The last expression is indeed equal to zero by the Jacobi
identity. Hence we have established that BΩ is closed.
It follows that the coadjoint orbits of matrix groups are always even-dimensional.
44
Definition 5.2.1. A Poisson manifold is a differentiable mainifold M such
that the the algebra C ∞ (M ) of smooth functions over M is equipped with a
bilinear map {, } : C ∞ (M ) × C ∞ (M ) → C ∞ (M ) such that {, }
(2) obeys the Jacobi identity: {f, {g, h}} + {g, {h, f }} + {h, {f, g}} = 0,
{f g, h} = − 21 ∂(f
∂xµ
g) µν
ω (x) ∂x∂h
ν =
1 ∂g µν ∂h 1 ∂f ∂h
−f 2 ∂xµ ω (x) ∂xν − g 2 ∂xµ ω µν (x) ∂x ν =
Our procedure to calculate the explicit ‘local coordinate’ shape of the Kir-
illov form on the two-dimensional spheres will be to calculate the matrix ω µν
in (26) locally and show that it is invertible. Hence in the SU(2)-case we
45
can calculate the inverse matrix ωµν and hence the corresponding symplectic
form ω. By showing that ωµν is closed and SU(2)-invariant we thus find the
Kirillov form on the two-dimensional spheres, the physically relevant coad-
joint orbits of SU(2). We will do the calculation in two often used coordinate
systems: the spherical coordinate system and the stereographic coordinate
system.
We can cover the sphere by two charts. One chart being the whole sphere
minus the northpole (θ = 0), which we denote by U+ , the other chart being
the whole sphere minus the southpole (θ = π), which we denote by U− . Be-
cause we calculate the Kirillov form on the two-dimensional sphere in local
coordinates, we have to perform the calculation on both charts U+ and U− .
Let us first consider U− .
Indeed,
As a first step towards finding the Kirillov form on U− in the spherical coor-
dinate system the Poisson brackets {θ, φ}, {φ, θ}, {θ, θ} and {φ, φ} need to
be calculated. By formula (26) and in particular the antisymmetry of ω θθ ,
ω θφ and ω φφ it immediately follows that
{θ, θ} = 0
{φ, φ} = 0
{θ, φ} = −{φ, θ}.
46
So it is left to calculate {φ,
n θ}. oThe most convenient way to do this is to
x1
first calculate the bracket x2 , x3 in terms of the parameters θ, φ and sub-
sequently, to express this bracket in terms of {φ, θ}.
{xi , xj } = cijk xk ,
which easily follows from equation (26) and the chain rule. To check the
Jacobi identity for general functions in C ∞ (R3 ) it is by (28) sufficient
to check it for the coordinates {x1 , x2 , x3 } on R3 . Indeed,
{x1 , {x2 , x3 }} + {x2 , {x3 , x1 }} + {x3 , {x1 , x2 }} =
{x1 , cx1 } + {x2 , cx2 } + {x3 , cx3 } = 0.
This turns R3 into a Poisson manifold. Moreover, for the Poisson bracket
induced by the Kirillov form the constant c in the bracket relations above
equals c = 2, since in that case the structure constants of the Poisson bracket
relations and the Lie bracket relations between the generators of su(2) are
the same (without proof). Consequently, for these induced Poisson brackets
it holds that,
47
Consequently,
n o
x1
x2
, x3 = x12 {x1 , x3 } − xx12 {x2 , x3 } = x12 (−2x2 ) − x1
x22
(2x1 ) =
2
x21
2 −1 − x2 = −2(1 + tan2 (φ)) = cos−2 2 (φ) .
2
∂f ∂g
Again, from (26) and the chain rule follows {f (φ), g(θ)} = ∂φ ∂θ
{φ, θ}. So,
on the other hand,
n o
x1 −1
x2
, x3 = {tan(φ), R cos(θ)} = cos2 (φ)
R sin(θ){φ, θ}.
− 12 R sin(θ)
0
ωµν = 1
2
R sin(θ) 0 .
48
that SU(2)/{±I} ∼ = SO(3), proving S0(3)-invariance is equivalent to prov-
ing SU(2)-invariance of ω. We can of course let SO(3), the rotation group
in 3 dimensions, act on the vectors corresponding to points on the sphere.
Since our choice of axes is arbitrary it suffices to prove rotation invariance
by choosing a specific rotation, being the rotation around the x3 −axis by a
constant angle φ̃. The matrix representing this rotation is given by:
cos(φ̃) sin(φ̃) 0
− sin(φ̃) cos(φ̃) 0
0 0 1 .
R sin(θ) sin(φ) cos(φ̃) + R sin(θ) cos(φ) sin(φ̃)
−R sin(θ) sin(φ) sin(φ̃) + R sin(θ) cos(φ) cos(φ̃) =
R cos(θ)
R sin(θ) sin(φ + φ̃)
R sin(θ) cos(φ + φ̃)
R cos(θ) .
showing SU(2)-invariance of ω.
49
that projects the two-dimensional sphere to the complex plane. This projec-
tion is depicted in figure 2.
Figure 2: The stereographic projection. For any point P on the sphere (minus the
southpole), which we denoted by U− , there is a unique line trough the northpole and P ,
and this line intersects the complex plane (which runs trough the southpole of the sphere)
in exactly one point z.
The plane x3 = 0 runs trough the southpole of the sphere, which coincides
with the origin of R3 . Let M be the sphere minus the southpole (U− ). For
any point P on M , there is a unique line trough the northpole and P , and
this line intersects the complex plane x3 = 0 in exactly one point z. The
stereographic projection of P is defined to be this point z. We denote the
distance from the southpole of the sphere to the point z by ρ. The stereo-
graphic coordinates z can thus be expressed in terms of the variables φ, ρ.
We already know that the points on the sphere can be expressed in terms of
the variables θ,φ. So if we find an expression for ρ in terms of θ (and the
50
constant radius of the sphere R), this enables us to write z as a function of θ
and φ (and R). As we shall see, this will be helpful in calculating the Poisson
brackets between the independent coordinates, Re(z) and Im(z), that span
the complex plane.
Figure 3: The triangle A lifted from figure 1 and the small rectangular triangle B at the
‘top’ of A.
51
Consequently, Re(z) = 2R cot(θ/2) cos(φ) and Im(z) = 2R cot(θ/2) sin(φ).
Using Euler’s formula, cos(φ) + i sin(φ) = eiφ , we conclude that
z = 2R cot(θ/2)eiφ , (32)
The next step is to calculate the Poisson brackets {Re(z), Re(z)}, {Re(z), Im(z)},
{Im(z), Re(z)} and {Im(z), Im(z)}. By formula (26) and in particular the
antisymmetry of ω {Re(z),Re(z)} , ω {Im(z),Im(z)} and ω {Re(z),Im(z)} it immediately
follows that
{Re(z), Re(z)} = 0
{Im(z), Im(z)} = 0
{Re(z), Im(z)} = −{Im(z), Re(z)}.
To find the expression for {z̄, z}, we in particular use the fact that the Pois-
son bracket is a derivation in its first argument, giving,
• {cot(θ/2), cot(θ/2)} = 0
52
• {e−iφ , eiφ } = −ie−iφ {φ, eiφ } = ie−iφ {eiφ , φ} = −e−iφ eiφ {φ, φ} = 0
(1+cot2 (θ/2))ie−iφ
• {e−iφ , cot(θ/2)} = 12 (1 + cot2 (θ/2))ie−iφ {φ, θ} = R sin(θ)
.
{z̄, z} =
2 (θ/2))ieiφ 2 −iφ
0 + 4R2 cot(θ/2)e−iφ ( (1+cot
R sin(θ)
) + 4R 2
cot(θ/2)eiφ (1+cot (θ/2))ie
( R sin(θ)
)+0 =
2
8iR cot(θ/2)
sin(θ)
(1 + cot2 (θ/2)) = 4iR (1 + cot2 (θ/2)) .
z z̄ z z̄ 2
Using cot2 (θ/2) = 4R2
it follows that {z̄, z} = 4iR 1 + 4R 2 . Hence,
z z̄ 2
{Re(z), Im(z)} = 2R 1 + . (33)
4R2
Hence, the matrix ω µν in (26) is given by
!
z z̄ 2
µν 0 2R 1 + 4R 2
ω = z z̄ 2
−2R 1 + 4R2
0
.
1
!
z z̄ −2
0 − 2R 1 + 4R2
ωµν = 1 z z̄ −2
2R
1 + 4R2 0
.
53
Remark 5.2.2.1. Although sometimes z and z̄ are treated as independent,
they are in fact not. The variables Re(z) and Im(z) are independent. This
is why we calculated the matrix ω µν with respect to the basis {Re(z), Im(z)}
of the complex plane and only at the very end substituted z and z̄ in the
expression for ω. Notice, however, that the complexifications of z and z̄ are
independent.
Indeed, ω is closed:
1 ∂ z z̄ −2 1 ∂ z z̄ −2
dω = i 2R ∂z
1+ 4R2
dz ∧ dz ∧ dz̄ + i 2R ∂ z̄
1+ 4R2
dz̄ ∧ dz ∧ dz̄ = 0.
54
6 Geometric quantization
-The correspondence between a classical theory and a quantum theory should
be based not so much on a coincidence between their predictions in the limit
~ → 0, as on an analogy between their mathematical structures: the primary
role of the classical theory is not in approximating the quantum theory, but
in providing a framework for its interpretation.-
P.A.M. Dirac ([14])
55
formalism with that of classical Hamiltonian mechanics.
Furthermore, this implies that symplectic manifolds always have even dimen-
sion (see section 5.1).
ω = dpa ∧ dq a . (35)
θ = pa dq a . (36)
From this, one sees that ω = dθ, so it is globally exact and hence closed
since d ◦ d = 0 by definition of the exterior derivative.
From this we can now show that expression (35) is invariant under
change of coordinates. Denote the transformed coordinates by {p̃a , q̃ b }.
56
Then the transformed and untransformed coordinates are related, using
the Einstein summation convention, via, ([20, p.181])
∂ q̃ b
pa = p̃b . (37)
∂q a
It follows that
a
ω̃ = dθ̃ = d(p̃a dq̃ a ) = d(p̃a ∂∂qq̃ b dq b ) =
d(pb dq b ) = dθ = ω.
57
• ωm is a homomorphism. Indeed,
ωm (X i + Y i ) =
(X i + Y i )ωik = X i ωik + Y i ωik = ωm (X i ) + ωm (Y i ).
where i(X) denotes the contraction of a differential form with the vector
field X, as in (38), i.e. the insertion of X into the first ‘slot’ of a differential
form. The conventions of section 2 are used in applying this contraction. It
is sometimes helpful to think of ω as an antisymmetric metric on M ; then
(38) corresponds to ‘lowering the index’ on X.
Xf ∼
= i(Xf )ω = 2ω(Xf , .) = −df. (39)
In order to justify this statement we have to prove that for every function
f ∈ C ∞ (M, R) we can establish the identity 2ω(Xf , .) = −df . This means
that if we write Xf = ξ k ∂x∂ k we can find an explicit expression for the coeffi-
cients ξ k . This is the case, as follows from
where in the last identity the invertibility of ωkj is used. We call a vec-
tor field Xf satisfying (39) a Hamiltonian vector field of f .
58
L(Xf )ω ≡ d(i(Xf )ω) + i(Xf )dω = −ddf = 0,
where the definition of the Hamiltonian vector field (39), the closedness of
the symplectic form ω and the identity d ◦ d = 0 are used respectively. This
implies by property (3) of the Lie derivative (see section 3.1) that
∂
∂t t=0
(φf )∗t ω = L(Xf )ω =0⇔
(φf )∗t ω = constant,
Here is used the expression for the Hamiltonian vector field in local coor-
dinates (A.1), which is derived in the appendix. As promised, we will now
prove that {, } in (27) defines a Poisson bracket:
59
1
∂ f ω σπ ∂π ∂µ gω µν ∂ν h + 14 ∂σ f ω σπ ∂µ g∂π ω µν ∂ν h + 14 ∂σ f ω σπ ∂µ gω µν ∂π ∂ν h.
4 σ
Consequently,
where in the second identity is used that the terms containing sec-
ond order derivatives of f , g or h cancel each other out by applying
suitable interchanges of indices (which is allowed since all indices are
summed over). In the third identity a similar interchange of indices is
applied. In the last identity the Jacobi identity for Poisson forms is
used, (A.3), proved in the appendix.
We have now established that the pair (C ∞ (M, R), {, }) forms a Lie algebra,
since the space of smooth real-valued functions is a linear space and {, } is a
bilinear map satisfying anti-symmetry and the Jacobi identity.
[X, Y ]b = X a ∂a Y b − Y a ∂a X b , (42)
for X, Y ∈ Vect(M ). In the appendix identity (41) is proved and from this
the fact is proved that the space of Hamiltonian vector fields together with
commutator bracket forms a Lie algebra.
60
Definition 6.1.1.1. A symplectic vector space is a pair (V, ω) in which
V is a 2n (n ∈ Z) dimensional real vector space and ω : V × V → C is a
bilinear form on V satisfying
• ω(X, .) = 0 ⇔ X = 0 X ∈ V (non-degeneracy).
• dim(H) = 12 dim(V ) = n,
• ω(X, Y ) = 0 X, Y ∈ H.
61
Furthermore, for fixed q ∈ Q, the fibres Tq∗ Q are Lagrangian submanifolds of
T ∗ Q. In this case v2,a = 0 in the above form for v and Tq∗ Q ∼
= Rn gives the
n-dimensionality of Tq∗ Q.
{fk , fl } = 0 ∀ k, l. (43)
holds forTall k, l this implies that the Hamiltonian vector fields Xfk are tan-
gent to l {Fl = 0}, so that they locally span the tangent bundle T N . Then,
noticing that (43) holds if and only if ω(Xfk , Xfl ) = 0, this implies ω|T N = 0.
62
summation convention used) the kinetic energy and V = V (q k ) the poten-
tial energy, minus whose gradient describes the forces acting on the particles.
If H does not depend on time explicitly, the equations of motion (44) imply
that H is conserved along every trajectory in phase space. Indeed,
d ∂H ∂H
H = k q̇ k + ṗk
dt ∂q ∂pk
∂H ∂H ∂H ∂H
= k − = 0, (46)
∂q ∂pk ∂pk ∂q k
where again the Einstein summation convention is used. The evolution of
any other smooth real-valued function on phase space, called an observable,
is given by
d ∂f ∂f
f = k q̇ k + ṗk
dt ∂q ∂pk
∂f ∂H ∂f ∂H
= k − (47)
∂q ∂pk ∂pk ∂q k .
In the example of the harmonic oscillator in one dimension, the conserva-
tion of the Hamiltonian along any trajectory in phase space implies that
d d 1 2
dt
H = dt 2 (p + q ) = 0 ⇔ p2 + q 2 = Constant ∈ R. Since Constant ∈ R,
2
we can write Constant = r2 for some constant r ∈ R. Hence the phase space
trajectories of the one-dimensional harmonic oscillator are circles of radius
63
r. This is in agreement with the explicit solution of the equations of motion
(45), since
q(t)2 + p(t)2 = (q(0) cos(t) + p(0) sin(t))2 + (p(0) cos(t) − q(0) sin(t))2 =
q(0)2 cos2 (t) + p(0)2 sin2 (t) + 2q(0)p(0) cos(t) sin(t)
−2q(0)p(0) cos(t) sin(t) + p(0)2 cos2 (t) + q(0)2 sin2 (t) =
q(0)2 (cos2 (t) + sin2 (t)) + p(0)2 (cos2 (t) + sin2 (t)) =
q(0)2 + p(0)2 = Constant.
1 ∂f ∂
Xf = ω µν µ ν
2 ∂x ∂x
∂f ∂ ∂f ∂ ∂f ∂ ∂f ∂
=0 · k
− k + k
+0· k
∂pk ∂p ∂q ∂pk ∂pk ∂q ∂q ∂qk
∂f ∂ ∂f ∂
= k
− k (48)
∂pk ∂q ∂q ∂pk
. Therefore the Poisson bracket equals
∂f ∂g ∂f ∂g
{f, g} = Xf (g) = k
− k (49)
∂pk ∂q ∂q ∂pk
. It follows, that the Poisson brackets between the coordinates and momenta
(called the canonical Poisson brackets) are
64
∂q m ∂q l m l
{q m , q l } = ∂pk ∂q k
− ∂q ∂q
∂q k ∂pk
=0−0=0
∂pm ∂pl ∂pm ∂pl
{pm , pl } = ∂pk ∂q k
− ∂qk ∂pk = 0 − 0 = 0
∂pm ∂q l ∂pm ∂q l
{pm , q l } = ∂pk ∂q k
− ∂q k ∂pk
k l
= δm l
δk − 0 = δm .
d
(47) ⇔ f = {H, f } = XH (f ), (52)
dt
so time evolution in Hamiltonian mechanics is determined by the Hamilto-
nian vector field XH of the Hamiltonian H.
A first advantage of this new formulation, is that it makes manifest the form
invariance of the equations of classical mechanics under symplectomorphisms
(diffeomorphisms leaving the symplectic form invariant), since the Poisson
bracket is defined in terms of the symplectic form. Because canonical trans-
formations are a particular type of symplectomorphisms, the form-invariance
of the equations of classical mechanics under canonical transformations (in
particular coordinate transformations) is also manifest.
65
or compact symplectic manifold. The necessity to consider such more exotic
systems in physics has arisen in recent years in a number of different con-
texts, e.g. for the description of internal degrees of freedom and in conformal
field theory.
66
Definition 6.2.3. A complete metric space is a metric space M , i.e. a
non-empty set equipped with a distance function (called a metric), that is
complete, which means that every Cauchy sequence of points in M has a
limit that is also an element of M .
Since it is now clear what a Hilbert space is, we can say what rays in a Hilbert
space are, which were the quantum states. Each point (vector) in the Hilbert
space (apart from the origin) corresponds by definition to a so-called pure
quantum state . Since multiplying this vectors by non-zero complex scalars
leaves the Schrödinger equation invariant, two vectors that differ only by a
non-zero complex scalar are considered to correspond to the same state. In
this sense, each pure state is a ray in the Hilbert space.
Finally,
Definition 6.2.4. A Hermitian operator on a Hilbert space H is a linear
map Q̃ : H → H for which hQ̃x, yi = hx, Q̃yi, for x, y ∈ H and h·, ·i the
inner product on H.
The first and simplest problem of quantization concerns the kinematic rela-
tionship between the classical and quantum domains. The kinematic problem
is: given M and ω, is it possible to construct H and Q̃? In the rest of section
6, we will shed more light onto this problem. A more subtle problem concerns
the dynamical relationship between the classical and quantum domains. This
problem will not be treated in this thesis.
It is this property which has to be sacrificed when moving from the classical
to the quantum theory. As already discussed in the introduction, the non-
commutative nature of observables is at the heart of phenomena in quantum
mechnanics.
67
of operators Q(f ) on some Hilbert space to classical observables f . Ab-
stracting from the analogy between classical mechanics and Schrödinger and
Heisenberg quantum mechanics (for example, by using the observations from
the experiments described in the introduction) the assignment Q has to sat-
isfy some requirements. Firstly,
(Q1) R-linearity:
Secondly,
(Q2)
Here [·, ·] denotes the Lie bracket of vector fields and ~ is Planck’s constant, a
constant of nature characteristic of quantum effects. It is a very small num-
ber (~ ≈ 1.05457162853 × 10−34 Joule · second), and for most macroscopic
considerations the fact that it is not zero can be neglected. Treating ~ as a
parameter, and taking the limit ~ → 0 (which is considered to be the limit
corresponding with classical mechanics), one recovers from (Q2) the commu-
tative structure of classical mechanics. At the microscopic level, however,
effects of order ~ can no longer be neglected and here classical mechanics
needs to be replaced by quantum mechanics.
hQ̃x, yi = hx, Q̃yi ⇔ hqx, yi = hx, qyi ⇔ q̄hx, yi = hqy, xi = qhy, xi = qhx, yi
⇔ q̄ = q ⇔ q ∈ R.
So thirdly,
(Q3)
68
The three requirements above can be summarized in one single statement,
using definition 3.4.6, with H the complex vector space and (S, {·, ·}) the Lie
algebra, where {·, ·} denotes the Poisson bracket. That is, the operators in
Q form a representation of a subalgebra S of the classical observables. This
statement is Dirac’s quantum condition (see [14]) and Dirac formulated it
by saying that the Poisson bracket is the classical analogue of the quantum
commutator.
However, the quantum condition does not, in itself, uniquely determine the
underlying quantum system. Furthermore, not every Hilbert space and op-
erators that satisfy it represent a ‘physically reasonable’ quantization. There
are some guiding principles about when a quantization is ‘physically reason-
able’. First of all, S must contain the constant functions in C ∞ (M, R) and
these must be represented in Q by the corresponding multiples of the identity
operator. This is equivalent to:
(Q4)
Q(1) = I, (58)
Secondly, requiring only Dirac’s quantum condition and (Q4), the Hilbert
space is still ‘too big’ and we need some irreducibility condition to make
it smaller, which appears to be a natural requirement. In general, we can-
not have a one-to-one correspondence between Q̃ and the whole of C ∞ (M )
without making H ‘too large’ and the selection of a subalgebra S for which
(Q1), (Q2), (Q3) and (Q4) should hold involves picking out some additional
structure in M .
One way to phrase this irreducibility condition makes use of the concept of
a complete set of observables introduced in subsubsection 6.1.2. In complete
analogy, a complete set of operators is defined such that the only operators
which commute with all the operators from that set are multiples of the
identity operator. The condition is then formulated as:
69
However, GQ uses another approach to make the Hilbert space irreducible,
using so-called polarizations. Let us for a moment put aside the problem
of the ‘size’ of H and look at a construction for H and Q̃ where only the
symplectic structure on M is used and where there is an operator in Q̃ for
every classical observable in C ∞ (M, R) (so not only for observables in S). So
Q works on the whole of C ∞ (M, R) and satisfies an extension of (Q1)-(Q4)
to all f ∈ C ∞ (M, R). Such an assignment is called a prequantization and
shall be treated next.
70
1 1 n 1 1
ω ≡ (−1) 2 n(n−1) ω ≡ (−1) 2 n(n−1) ω ∧ ... ∧ ω , (59)
n! n! | {z }
n times
for n ∈ Z and ω = dp ∧ dq, the symplectic form on T ∗ Q with Q a
1-dimensional manifold. ω is called the Liouville form .
Let the Hilbert space associated with M be the space L2 (M, C) of square-
integrable, smooth, complex-valued functions on M , with the inner product
n Z
1
hφ, ψi = φψ̄ω ; φ, ψ ∈ L2 (M, C). (60)
2π~ M
Here k·k is the norm on the Hilbert space L2 (M, C) induced by the inner
product h·, ·i.
Let us try to find the assignment Q in (54). Using definition 3.4.1 (and the
remark beneath), each classical observable f ∈ C ∞ (M, R) acts on the set
L2 (M, C) according to
To see this, note that C ∞ (M, R) is a group under pointwise addition and
that the map Q : f 7→ −i~Xf is a group homomorphism. Indeed, using
(A.1), it follows that Q(f + g) = −i~Xf +g = −i~Xf − i~Xg = Q(f ) + Q(g).
Furthermore, since the map in (61) is clearly linear, it follows that Q is a
representation of C ∞ (M, R) (with pointwise addition as group operation) in
L2 (M, C).
71
In this case, we indeed have Q(1) = −i~X1 + 1I = 1I = I. But unfortu-
nately, in this case (Q2) is not satisfied, since
In the third identity, we used the identity (41) and the fact that for φ ∈
L2 (M, C) the identity [Xf , g](φ) = Xf (gφ) − gXf (φ) = Xf (g)φ + gXf (φ) −
gXf (φ) = Xf (g)φ holds. In the fourth identity we used equation (49) and
the commutatitivity of real-valued smooth scalar function under pointwise
multiplication.
72
j
{f, g} + θj X{f,g} = {f, g} + θ(X{f,g} ).
• (Q3): Using the expression for the inner product, (60), on L2 (M, C),
for f ∈ C ∞ (M, R), φ, ψ ∈ L2 (M, C), we have
1 n
R
hQ(f)φ, ψi = 2π~ M
(−i~Xf (φ) + f φ − θ(Xf )φ)ψ̄ω =
1 n i ∂
R
2π~
(φi~Xf ( ∂xi ψ) + f φψ̄ − θ(Xf )φψ̄)ω =
1 n
RM
2π~ RM
(φ−i~Xfi ( ∂x∂ i ψ) + φf ψ − φθ(Xf )ψ)ω =
1 n
2π~ M
φQ(f )ψω = hφ, Q(f )ψi.
1 n
R
hexp(iu/~)φ, exp(iu/~)ψi = 2π~ M
exp(iu/~)φ exp(−iū/~)ψ̄ω =
1 n
R
2π~ M
exp(iu/~)φ exp(−iu/~)ψ̄ω = hφ, ψi.
73
−i~Xf φ0 − θ0 (Xf )φ0 + f φ0 =
−i~Xf (exp(iu/~)φ) − (θ + du)(Xf ) exp(iu/~)φ + f exp(iu/~)φ =
−i~ exp(iu/~)Xf (φ) − i~(i/~) exp(iu/~)Xf (u)φ − θ(Xf ) exp(iu/~)φ−
Xf (u) exp(iu/~)φ + f exp(iu/~)φ =
−i~ exp(iu/~)Xf (φ) − θ(Xf ) exp(iu/~)φ + f exp(iu/~)φ =
exp(iu/~)(−i~Xf (φ) − θ(Xf )φ + f φ).
Let X be a line bundle over the real manifold M . Recall (see section 3) that
a section over U of a line bundle X is a map s : U ⊂ M → X satisfying
pr(s(m)) = m ∀m ∈ U . We denote the set of all smooth sections over U
of a line bundle X with ΓX (U ). It is a vector space, where the scalars are
C-valued functions on U .
74
Denote by Vect(M, C) the space of smooth complex vector fields (so multi-
plication by complex numbers is included in the scalar multiplication) on M
and by C ∞ (M, C) the set of smooth complex-valued funtions on M . Now
define,
Definition 6.4.1.2. A connection ∇ on X is a map ∇ : Vect(M, C) →
End(ΓX (M )) that assigns to each X ∈ Vect(M, C) an operator ∇X on
ΓX (M ) such that
Here X denotes the formal complex conjugate of the complex vector field
X. The formal complex conjugate of X satisfies by definition the following
rules,
75
Definition 6.4.1.4. The curvature 2-form of ∇ is the complex differential
2-form Ω on M determined by
i
Ω(X, Y )s = ([∇X , ∇Y ] − ∇[X,Y ] )s; X, Y ∈ Vect(M ), s ∈ ΓX (M ). (65)
2
Here [∇X , ∇Y ] = ∇X ∇Y −∇Y ∇X is a commutator bracket. Indeed Ω defines
a differential 2-form:
And,
Ω(X, Y )s =
i
(f ∇
2 1 [X1 ,Y ]
s−Y (f1 )∇X1 s+∇[X2 ,Y ] s−f1 ∇[X1 ,Y ] s+Y (f1 )∇X1 s−∇[X2 ,Y ] s) =
f1 Ω(X1 , Y ) + Ω(X2 , Y ).
Furthermore, Ω is also linear in s over C ∞ (M, C). Firstly, using property (c)
of the connection, for f1 ∈ C ∞ (M, C), s1 , s2 ∈ ΓX (M ),
76
[∇X , ∇Y ](f1 s1 + s2 ) = ∇X (∇Y (f1 s1 )) − ∇Y (∇X (f1 s1 )) + [∇X , ∇Y ]s2 =
∇X (Y (f1 )s1 + f1 ∇Y s1 ) − ∇Y (X(f1 )s1 + f1 ∇X s1 + [∇X , ∇Y ]s2 =
X(Y (f1 ))s1 + Y (f1 )∇X s1 + X(f1 )∇Y s1 + f1 ∇X (∇Y s1 ) − Y (X(f1 ))s1
− X(f1 )∇Y s1 − Y (f1 )∇X s1 − f1 ∇Y (∇X s1 ) + [∇X , ∇Y ]s2 =
f1 [∇X , ∇Y ]s1 + [∇X , ∇Y ]s2 + [X, Y ](f1 )s1 .
And,
It follows that,
Ω(X, Y )(f1 s1 + s2 ) =
i
(f [∇X , ∇Y ]s1 + [∇X , ∇Y ]s2 + [X, Y ](f1 )s1
2 1
− [X, Y ](f1 )s1 − f1 ∇[X,Y ] s1 − ∇[X,Y ] s2 ) =
f1 Ω(X, Y )s1 + Ω(X, Y )s2 .
Notice that one can always define such a 1-form β in this way, since, first
of all, both sides of the equation are linear in X. On the one hand, β is a
1-form and so by definition linear in X. On the other hand, the connection
is by definition linear in X by property (a) of the connection. Furthermore,
every function can be written as the composition of a 1-form and a vector field
(by definition) and every section can be written as a complex-valued function
times some other section (since ΓX (M ) is a vector space with complex-valued
functions as scalars). β is called the potential 1-form for ∇. We claim that
a general section s on U ⊂ M can be written as a complex-valued function
times the unit section. Indeed a general section s on U ⊂ M can be written
as
77
where f ∈ C ∞ (U, C) is a general smooth complex-valued function. Since
1(m) = 1 ∈ C ∀m ∈ U is a complex basis vector for the one-dimensional vec-
tor space C, it follows that 1̂ is a basis vector for the complex one-dimensional
vector space ΓX (M ). The claim follows. It follows that we can write the con-
nection of a general section s as,
∇X s = ∇X f 1̂ = X(f )1̂ + f ∇X 1̂ =
X(f )1̂ + f −iβ(X)1̂ = X(f )1̂ − iβ(X)s. (69)
With expression (69) we are able to show that the curvature 2-form of ∇
is a closed form. Writing out the right-hand side of definition (65) for
X, Y ∈ Vect(U ), s ∈ ΓX (U ), using (69) and (67), gives
i
2
([∇X , ∇Y ] − ∇[X,Y ] )s = 2i (∇X ∇Y − ∇Y ∇X − ∇[X,Y ] )s =
i
∇ (Y (f )1̂−iβ(Y )s)− 2i ∇Y (X(f )1̂−iβ(X)s)− 2i ([X, Y ](f )1̂−iβ([X, Y ])s) =
2 X
i
2
X(Y (f ))1̂ + 2i Y (f )∇X 1̂ + 2i (−iX(β(Y ))s) + 2i (−iβ(Y )∇X s) − 2i Y (X(f ))1̂
− 2i X(f )∇Y 1̂ − 2i (−iY (β(X))s) − 2i (−iβ(X)∇Y s)
− 2i ([X, Y ](f )1̂ − iβ([X, Y ])s) =
i
2
Y (f )∇X 1̂ + 2i (−iX(β(Y ))s) + 2i (−iβ(Y )∇X s) − 2i X(f )∇Y 1̂
− 2i (−iY (β(X))s) − 2i (−iβ(X)∇Y s) − 2i (−iβ([X, Y ])s) =
1
2
X(β(Y ))s− 12 Y (β(X))s− 21 β([X, Y ])s+ 2i Y (f )(−iβ(X)1̂)− 2i X(f )(−iβ(Y )1̂)
+ 12 β(Y )X(f )1̂ − 12 β(X)Y (f )1̂ − 2i β(Y )β(X)s + 2i β(X)β(Y )s =
1
2
X(β(Y ))s − 12 Y (β(X))s − 21 β([X, Y ])s.
In the fourth equality is used that X(Y (f )) − Y (X(f )) = [X, Y ](f ) for
X, Y ∈ Vect(U ), f ∈ C ∞ (U, C). As a next step to establish the closedness of
the curvature 2-form Ω, we prove that
the expression with which the derivation above ended. Denote the set of
local coordinates on the n-dimensional manifold B by {xi }1≤i≤n . Writing
X = X i ∂x∂ i for X ∈ Vect(U ) and β = βj dxj for the potential 1-form β, it
indeed follows that
78
It follows that on every neighbourhood U ⊂ M we have Ω = dβ. Hence on
every U ⊂ M the identity dΩ = (d ◦ d)β = 0 holds, hence Ω is closed.
Another ‘tool’ we will need in the next section are so-called transition func-
tions. Let (U1 , ψ1 ) and (U2 , ψ2 ) be two local trivializations of M and take
the intersection U1 ∩ U2 non-empty.
ψ2 |(U1 ∩U2 )×C (m, z) = c12 (m)ψ1 |(U1 ∩U2 )×C (m, z). (70)
Notice that this function is well defined, since ψ1 (m, z) and ψ2 (m, z) both
take values in the fibre over m. Indeed, by definition
ψ1,2 |m : {m} × C → Xm , so that ψ1,2 (m, z) = ψ1,2 |m (z) are values in Xm .
Since the Xm ∼ = C are by definition vector spaces over C, it follows that its
elements can be multiplied by complex numbers, such as c12 (m).
where cji is the transition function between (Uj , ψj ) and (Ui , ψi ). Indeed, cji
being a transition function means by definition that it satisfies the equation
ψi |Uji ×C (m, z) = cji (m)ψj |Uji ×C (m, z). It follows that
1̂i |Uji = ψi |Uji ×C (m, 1) = cji (m)ψj |Uji ×C (m, 1) = cji (m)1̂j |Uji (m),
as stated by (71).
79
Notice that (cji )−1 denotes the inverse of (cji ) with respect to pointwise multi-
plication. First we prove property (72). Note that cji satisfies ψi |Uji ×C (m, z) =
cji (m)ψj |Uji ×C (m, z) and cij satisfies ψj |Uij ×C (m, z) = cij (m)ψi |Uij ×C (m, z).
Consequently, cij = (cji )−1 is smooth (since ψi , ψj are smooth) and
ψi |Uji ×C (m, z)ψj |Uij ×C (m, z) = cji (m)cij (m)ψi |Uij ×C (m, z)ψj |Uji ×C (m, z) ⇔
cji (m)cij (m) = 1 ⇔ cij = (cji )−1 .
Here is used that Uij = Uji . To prove relation (73) we first construct func-
tions c˜ji : Uji ×C → Uji ×C from the original complex-valued scalar functions
cji : Uji → C. This can be done by means of the map
which is well-defined, since m ∈ Uji and cji (m)z ∈ C. Notice that this map
is a diffeomorphism,
• c˜ji is smooth: This follows directly from the definition, since cji is
smooth.
• The inverse c˜ji −1 exists with respect to composition and is smooth: the
inverse map is given by
c˜ji −1 : Uji × C → Uji × C (m, z) 7→ (m, c−1ji (m)z). Indeed,
• c˜ji is bijective: it is surjective, since the expression cji (m)z spans C. For
injectivity take c˜ji (m0 , z 0 ) = c˜ji (m, z) ⇔ (m0 , cji (m0 )z 0 ) = (m, cji (m)z).
This implies that m = m0 and so cji (m)z 0 = cji (m)z ⇔ z = z 0 . Injec-
tivity follows.
• c˜ji |{m}×C is linear: all linear maps from C to C are classified by maps
of the form f : C → C z 7→ cz where c ∈ C. Indeed cji (m) ∈ C, so
linearity follows.
• c˜ji |{m}×C is bijective: This follows directly from the discussion above.
80
• c˜ji |{m}×C is a homomorphism under addition, since
c˜ji |{m}×C (z1 + z2 ) = cji (z1 + z2 ) = cji z1 + cji z2 =
c˜ji |{m}×C (z1 ) + c˜ji |{m}×C (z2 ) for z1 , z2 ∈ C.
(c˜kj ◦ c˜ji ◦ c˜ik )|Uijk ×C = (ψj−1 ◦ψk )|Uijk ×C ◦(ψi−1 ◦ψj )|Uijk ×C ◦(ψk−1 ◦ψi )|Uijk ×C =
(ψk−1 ◦ ψi ◦ ψi−1 ◦ ψj ◦ ψj−1 ◦ ψk )|Uijk ×C = 1˜ij |Uijk ×C ,
where 1˜ij : Uij × C → Uij × C (m, z) 7→ (m, z). Using (74) we derive
(c˜kj ◦c˜ji ◦c˜ik )|Uijk ×C (m, z) = (m, ckj |Uijk (m)cji |Uijk (m)cik |Uijk (m)z) = (m, z) ⇔
ckj |Uijk (m)cji |Uijk (m)cik |Uijk (m) = 1,
(CO2) f contains just one function fi1 i2 ...ip+1 for each ordered set of p + 1
indices for which Ui1 ∩ Ui2 ∩ ... ∩ Uip+1 is nonempty,
1
P
(CO3) fi1 i2 ...ip+1 = f[i1 i2 ...ip+1 ] ≡ (p+1)! σ fσ(i1 )σ(i2 )...σ(ip+1 ) . Here σ denotes a
permutation of the set of indices {i1 i2 ...ip+1 }.
Furthermore,
81
Definition 6.4.2.2. Let Ui0 ∈ U and let Ui0 ∩ Ui1 ∩ ... ∩ Uip+1 be nonempty.
Then the coboundary operator Coboundary operator δ is defined on
cochains by
Here iˆl denotes omitting the l-th index of fi0 ...il ...ip+1 .
The coboundary operator δ takes p-cochains into (p + 1)-cochains. Indeed,
• (δf )i0 i1 ...ip+1 is defined on Ui0 ∩ Ui1 ∩ ... ∩ Uip+1 : follows directly from
the definition (75),
• δf contains just one function (δf )i0 i1 ...ip+1 for each ordered set of p + 2
indices for which Ui0 ∩Ui1 ∩...∩Uip+1 is nonempty: follows from definition
(75) and the fact that f contains just one function fi1 i2 ...ip+1 whenever
Ui1 ∩ Ui2 ∩ ... ∩ Uip+1 is nonempty,
(δ(δf ))i0 i1 i2 i3 i4 ≡
5((δf )i1 i2 i3 i4 − (δf )i0 i2 i3 i4 + (δf )i0 i1 i3 i4 − (δf )i0 i1 i2 i4 + (δf )i0 i1 i2 i3 )|Ui0 i1 ...i4 =
5{4(fi2 i3 i4 − fi1 i3 i4 + fi1 i2 i4 − fi1 i2 i3 ) − 4(fi2 i3 i4 − fi0 i3 i4 + fi0 i2 i4 − fi0 i2 i3 ) +
4(fi1 i3 i4 − fi0 i3 i4 + fi0 i1 i4 − fi0 i1 i3 ) − 4(fi1 i2 i4 − fi0 i2 i4 + fi0 i1 i4 − fi0 i1 i2 ) +
4(fi1 i2 i3 − fi0 i2 i3 + fi0 i1 i3 − fi0 i1 i2 )}|Ui0 i1 ...i4 = 0.
Consequently, δ 2 = 0 when p = 2.
82
In the rest of this subsubsection let us consider a specified type of functions
which is referred to in the definition for p-cocycles and that will also be used
later on in this thesis. These are the locally constant functions, functions
which are constant in a neighbourhood of each point and take values in R.
For these functions we denote the set of p-cochains by C p (U, R) and the set
of p-cocycles by Z p (U, R). We can now define,
• f ∼ g, g ∼ h ⇒ f ∼ h: if f ∼ g ⇔ f = g + δi and
g ∼ h ⇔ g = h + δj then
f = h + δj + δi = h + δ(i + j) ⇒ f ∼ h.
(PU1) 0 ≤ hi (m) ≤ 1,
(PU2) supp(hi ) ⊂ Ui ,
P
(PU3) i hi (m) = 1.
We quote without proof that for every manifold it is always possible to find a
locally finite contractible cover U of M and a partition of unity subordinate
to U (for the proof, see for example [21]).
83
For f = {fi1 i2 ...ip+1 } ∈ C p (U, R), define f˜i1 i2 ...ip+1 : M → R by
fi1 i2 ...ip+1 x ∈ Ui1 i2 ...ip+1
f˜i1 i2 ...ip+1 ≡
0 x∈/ Ui1 i2 ...ip+1
Again, since the only important case we consider later on in this thesis will
be whenP p = 2,˜ we prove (77) for this case only. In this case,
αf = i1 ,i2 ,i3 fi1 i2 i3 hi1 dhi2 ∧ dhi3 . The left-hand side of (77) gives
P
dαf = d ˜i i i hi dhi ∧ dhi =
f
i1 ,i2 ,i3 1 2 3 1 2 3
In the third identity is used that hi0 dhi1 ∧ dhi2 ∧ dhi3 equals zero outside
Pi0 i1 i2 i3 ≡ UiP
U 0 ∩ Ui1 ∩ Ui2P∩ Ui3 . In the
P fourth identity is used that
i0 hi0 = 1, i1 dhi1 = i2 dhi2 = i3 dhi3 = 0. Identity (77) now follows.
The map (76) and the identity (77) tell us that associated with any
[f ] ∈ H p (U, R) we have a set of closed p-forms, any two of which differ by
an exact p-form. Indeed, take [f ] ∈ H p (U, R). Then, using (76) and (77)
respectively, it follows that δf = 0 ⇒ αδf = dαf = 0.
So αf is indeed a closed p-form. The fact that any two of these differ by
84
an exact p-form follows from the following argumentation. In the deduction
above the only freedom we have is in choosing the cocycle f satisfying δf = 0.
We could also have chosen the cocycle f +δh, where δh is a coboundary, since
δ(f + δh) = δf + δ 2 h = δf . So another closed p-form can be given by
αf +δh = αf + αδh = αf + dαh ,
which indeed differs from the original αf by an exact p-form dαh .
Now define,
Definition 6.4.2.6. The pth de Rham cohomology group of M , denoted
by H̃(M, R), is the space of equivalence classes of p-forms, αf , with two forms
regarded as equivalent if they differ by an exact p-form.
Indeed, the equivalence described in the definition above defines an equiva-
lence relation ∼, since,
• αf ∼ αf : follows from αf = αf + dαh for αh = 0. Indeed when αh = 0
we have that dαh = 0 is an exact p-form.
• αf ∼ αg ⇒ αg ∼ αf : since αf ∼ αg ⇔ αf = αg + dαh for some exact
form dαh , it follows that αg = αf − dαh = αf + dα−h ⇒ αg ∼ αf .
• αf ∼ αg , αg ∼ αh ⇒ αf ∼ αh : if αf ∼ αg ⇔ αf = αg + dαi and
αg ∼ αh ⇔ αg = αh + dαj then
αf = αh + dαj + dαi = αh + dαi+j ⇒ αf ∼ αh .
In the rest of this subsection we prove that the map α in (76) determines an
isomorphism between the pth Čech cohomology group and the pth de Rham
cohomology group. This isomorphism is called the de Rham isomorphism.
. First define ι : H p (U, R) → Z p (U, R) [f ] 7→ f and
π : {closed p-forms on M } → H̃ p (U, R) αf 7→ hαf i. Subsequently, define
α̃ = π ◦ α|Z p (U,R) ◦ ι :
ι α|Z p (U,R) π
H p (U, R) → Z p (U, R) −−−−−→ {closed p-forms on M } → H̃ p (U, R).
To prove the claim we have to show that α̃ is an isomorphism. Indeed,
(1) α̃ is a homomorphism: Indeed,
α̃([f ] + [g]) = (π ◦ α|Z p (U,R) ◦ ι)([f ] + [g])
= (π ◦ α|Z p (U,R) ◦ ι)([f + g]) = (π ◦ αZ p (U,R) )(f + g) = π ◦ αZ p (U,R) (f + g)
= π ◦ (αZ p (U,R) (f ) + αZ p (U,R) (g)) = hαZ p (U,R) (f ) + αZ p (U,R) (g)i =
hαZ p (U,R) (f )i + hαZ p (U,R) (g)i = α̃([f ]) + α̃([g]).
(2) α̃ is bijective: clearly ι and π are bijective. Furthermore, α|Z p (U,R) is
bijective. Surjectivity of α|Z p (U,R) follows directly from (77), whereas
injectivity follows directly from (76). This implies that the composition
α̃ = π ◦ α|Z p (U,R) ◦ ι is bijective too.
85
6.5 The integrality condition
To make the prequantization construction considered in subsection 6.3 rigourous
one needs a Hermitian line bundle-with-connection over the classical phase
space (the symplectic manifold M ). Existence of such a line bundle-with-
connection is however not guaranteed. For such a line bundle-with-connection
to exist a certain condition should be satisfied by the curvature 2-form Ω de-
fined on it. This is the so-called integrality condition (see [22],[19]). Let us
formulate it:
In the sequel of this section I shall prove that this integrality condition is
both a necessary and sufficient condition for a Hermitian line bundle-with-
connection to exist. The construction presented below is called the Čech
construction .
Ω = dβj . (78)
df˜ij = βi − βj . (79)
86
triple intersection Ui ∩ Uj ∩ Uk there exists a locally constant smooth
real function ãijk defined by
87
βi , fij and aijk .
1
Next, define zijk ≡ 2π (ãijk + dij + djk + dki ) on nonempty Ui ∩ Uj ∩ Uk .
Then it follows from the argument given above that z ≡ {zijk } is an
1
element of the class defined by 2π Ω in H 2 (U, R). Let m ∈ Uijk . By
(IC), when Ui ∩ Uj is nonempty, we can define the constant dij such
that all zijk (m) are integers when Ui ∩ Uj ∩ Uk is nonempty. Notice that
we wouldn’t be able to choose the dij such that the zijk (m) are integers
without (IC), since in general the ãijk are only locally constant and the
intersections Ui ∩ Uj ∩ Uk do not need to be connected.
We are now ready to define on each nonempty Ui ∩ Uj ,
88
– (j, m, z) ∼ (k, m̃, z̃) ⇒ (k, m̃, z̃) ∼ (j, m, z): suppose
(j, m, z) ∼ (k, m̃, z̃), then m = m̃ and z = cjk (m)z̃. This implies
m̃ = m and z̃ = ckj (m)z using (72), i.e. (k, m̃, z̃) ∼ (j, m, z).
– (j, m, z) ∼ (k, m̃, z̃), (k, m̃, z̃) ∼ (l, m̂, ẑ) ⇒ (j, m, z) ∼ (l, m̂, ẑ):
suppose (j, m, z) ∼ (k, m̃, z̃) and (k, m̃, z̃) ∼ (l, m̂, ẑ), that is,
m = m̃ = m̂ and z = cjk z̃ = cjk ckl ẑ. This implies m = m̂ and
z = cjl ẑ using (73) and (72) respectively (cjk ckl clj = 1 ⇒ cjk ckl =
cjl ). It thus follows that (j, m, z) ∼ (l, m̂, ẑ).
We define addition
a and scalar multiplication on the fibers (m ∈ M
fixed) of X ≡ Ui ×C/ ∼ by [(j, m, z1 )]+[(j, m, z2 )] = [(j, m, z1 +z2 )]
i
and λ[(j, m, z1 )] = [(j, m, λz1 )] (λ ∈ C) respectively. Let us now prove
the claim:
89
follows from,
Ψk (m, z) = Ψk (m̃, z̃) ⇔ [(k, m, z)] = [(k, m̃, z̃)]
⇔ (k, m, z) ∼ (k, m̃, z̃) ⇔ m = m̃, z = ckk (m)z̃ = z̃.
It follows that (m, z) = (m̃, z̃). Furthermore Ψk is smooth: it
sends (m, z) to [(k, m, z)] where (k, m, z) ∼ (k, m̃, z̃) if m = m̃
and z = ckk (m)z̃ = z̃, so it sends m to m = m̃ in a smooth fashion
(‘identity map in m’) and it sends z to z = z̃ in a smooth fashion
(‘identity map in z). It follows that Ψk is smooth. The smooth-
ness of Ψk on Uk ∩ Uj should not depend on the choice k. Indeed
it does not, since (k, m, z) ∼ (j, m̃, z̃) if m = m̃, z = ckj (m)z̃ and
ckj is smooth. For the inverse map, define
(Ψk )−1 : pr−1 (U ) → U × C [(k, m, z)] 7→ (m, z). Indeed it is
an inverse, since (Ψ−1 −1
k ◦ Ψk )(m, z) = Ψk ([(k, m, z)]) = (m, z).
That the inverse map is well-defined follows from ckk (m) = 1. Its
smoothness is proved in the same way the smoothness of Ψk is
proved.
The fact that the restriction map,
Ψk |m : {m}×C → pr−1 (m) (m, z) 7→ [(k, m, z)] is an isomorphism
follows directly from {m} × C ∼ = C and the fact that αk is an
isomorphism. Linearity follows from
Ψk |m (λz1 + z2 ) = [(k, m, λz1 + z2 )] = [(k, m, λz1 )] + [(k, m, z2 )] =
λ[(k, m, z1 )] + [(k, m, z2 )] = λΨk |m (z1 ) + Ψk |m (z2 ).
(3) The cij are the transition functions of the line bundle X: in order
to verify this, we have to check if the cij s satisfy (70). Indeed, on
(Ui ∩ Uj ) × C, we have
cjk (m)Ψj (m, z) = cjk (m)[(j, m, z)] = [(j, m, cjk (m)z)] = [(k, m, z)] =
Ψk (m, z).
90
as should be the case. Furthermore, for m ∈ Ui the map
H : X → R (m, z) 7→ h[(i, m, z)], [(i, m, z)]i = z̄z is smooth and on
Ui ∩ Uj (where (i, m, z) ∼ (j, m̃, z̃) if m = m̃ and z = cij z̃) the smooth-
ness does not depend on the choice i, since cij is smooth. Hence h·, ·i
defines a Hermitian structure on X.
The line bundle X is also equipped with a connection. From the real
βi we can construct complex β̃i ≡ iβi . Pick a collection {(Ui , Ψi )} of
local trivializations of X, a unit section 1̂i on Ui and a vector field
X ∈ Vect(Ui ). From the complex β̃i we can define a connection ∇X via
91
On the other hand,
h∇X̄ si , s0i i + hsi , ∇X s0i i =
hX̄(f )1̂i − iβi (X̄)si , s0i i + hs, X(g)1̂i − iβi (X)s0i i =
X(f¯)h1̂i , s0i i + iβi (X)hsi , s0i i + X(g)hsi , 1̂i i − iβi (X)hsi , s0i i =
X(f¯)h1̂i , s0i i + X(g)hsi , 1̂i i = X(f¯)gh1̂i , 1̂i i + X(g)f¯h1̂i , 1̂i i.
This establishes the compatability condition (64) on some Ui . The
fact that the compatibility condition continues to hold on the overlaps
Ui ∩ Uj in a well-defined manner follows from the well-definedness of
the inner product on the overlaps (since |cij |2 = 1) and from
∇X sj = ∇X (cij si ) = cij ∇X si , so we have proved the ‘sufficient’-part of
the theorem (since the open cover of M in (IC) can always be chosen
such that it is contractible for a collection {Ui , Ψi } of local trivializa-
tions of X).
92
Since Ψ̃i |m is linear it follows that this map does define a linear iso-
morphism compatible with the inner product. Hence we can always
construct such a map, because the Ψi |m we started with was arbitrary.
Since we have a well known inner product on C, namely hz1 , z2 i ≡ z¯1 z2
we now have an inner product hΨi |m (z1 ), Ψi |m (z2 )i = hz1 , z2 i ≡ z¯1 z2 on
pr−1 (m) for m ∈ Ui too. Subsequently, consider the functions
ψj |Uij ×C ψi−1 |pr−1 (U )
−1 −1 ij
c˜ji = (ψi ◦ ψj )|Uij ×C : Uij × C −−−−−→ pr (Uij ) −−−−−−−−→ Uij × C
introduced in subsection 6.4.1 for m ∈ Uij . Since Ψj |m is compatible
with the inner product, so is Ψ−1 i |m×C and so is
−1
c˜ij |m×C ≡ Ψj |m ◦ Ψi |m×C . Using (74) this implies for the norm of the
corresponding transition functions cij that
hc˜ij |m×C (z), c˜ij |m×C (z)i = hz, zi ⇔ hcij (m)z, cij (m)zi = hz, zi
⇔ |cij (m)|2 hz, zi ⇔ |cij (m)| = 1.
From this it follows that the inner product h·, ·i on pr−1 (m) is well-
defined for m ∈ Ui ∩ Uj as well. Indeed, take z1 , z2 ∈ pr−1 (m) ∼ = C.
We know that the sections si ∈ ΓX (Ui ), sj ∈ ΓX (Uj ) are related by
sj = cij si on Uij . Hence, writing z1 ≡ s̃i (m), z2 ≡ ŝi (m), on Uij we
have
hs̃j , ŝj i(m) = hcij s̃i , cij ŝi i(m) = |cij (m)|2 hs̃i , ŝi i(m) = hz1 , z2 i,
as should be the case. The line bundle X is now provided globally with
a Hermitian structure, since for m ∈ Ui the map
H : X → R (m, z) 7→ hz, zi = z̄z is smooth and this smoothness does
not depend on the choice i, since cij is smooth.
The existence of the connection ∇ allows us to define a complex 1-
form βi on Ui via 1̂i βi (X) = ∇X (1̂i ) for X ∈ Vect(Ui ), and for general
sections si = f 1̂i on Ui (f ∈ C ∞ (Ui , C)) via the relation ∇X si =
X(f )1̂i − iβi (X)si . The compatibility of the connection ∇ and the
Hermitian structure h·, ·i follows now by the same arguments as in the
‘sufficient’-part, since |cij (m)|2 = 1 for m ∈ Uij in this case too. Hence
we have a Hermitian line bundle-with-connection.
Next, define fij ≡ −1 i
log(cij ) on Uij where log is the complex logarithm
whose values are only determined upto integer multiples n ∈ Z of 2πi.
And from this define, zijk ≡ fij + fjk + fki on Uijk . In subsubsection
6.4.1 we proved that the transition functions cij satisfy
cij (m)cjk (m)cki (m) = 1 on Uijk . It follows that
zijk (m) ≡ fij (m) + fjk (m) + fki (m) =
−1
i
log(cij (m)) + −1i
log(cjk (m)) + −1 i
log(cjk (m)) =
−1 − log(1) n 2πi
i
log(cij (m)cjk (m)cki (m)) = i
+ ijki = 2πnijk ∈ 2πZ
for nijk : Uijk → Z a constant integer. Hence the zijk s are constants.
93
The fact that these zijk form a cochain follows from the same arguments
as for the aijk in the ‘sufficient’-part. In particular, the zijk determine
a cocycle z ≡ {zijk }. Indeed,
(δz̃)ijkl = z̃jkl − z̃ikl + z̃ijl − z̃ijk =
f˜jk + f˜kl + f˜lj − f˜ik − f˜kl − f˜li + f˜ij + f˜jl + f˜li − f˜ij − f˜jk − f˜ki = 0.
Here is used the anti-symmetry of the fij . Indeed, since we proved in
subsubsection 6.4.1 that the transition functions cij satisfy
cij (m) = (cji (m))−1 , it follows that
fij = −1 i
log(cij ) = −1i
log((cji )−1 ) = 1i log(cji ) = −fji .
Our independently constructed βi and fij satisfy the relation
dfij = βi − βj . Indeed, since sections transform as sj = cij si on Ui ∩ Uj
we know that ∇X sj for X ∈ Vect(Uj ) satisfies ∇X sj = cij ∇X si . It
follows, writing si = f 1̂i for f ∈ C ∞ (Ui , C), that
∇X (cij si ) = cij ∇X si ⇔
X(cij f )1̂i − iβj (X)cij si = cij (X(f )1̂i − iβi (X)si ) ⇔
X(cij )f 1̂i + cij X(f )1̂i − iβj (X)cij si = cij (X(f )1̂i − iβi (X)si ) ⇔
X(cij )si − icij si (βj (X) − βi (X)) = 0 ⇔ X(cij ) = icij (βj − βi )(X) ⇔
dc
dcij (X) = icij (βj − βi )(X) ⇔ cijij = i(βj − βi ) ⇔
−1
i
d log(cij ) = βi − βj ⇔ dfij = βi − βj .
Subsequently, define Ω = dβi on Ui . Since for general sections
si = f 1̂i (f ∈ C ∞ (Ui , C)) the βi are defined via (83) it follows by
the argument given in subsubsection 6.4.1 that Ω equals the curvature
2-form corresponding to X. Hence, by remark 6.5.1 it equals the sym-
plectic 2-form on M (since M is the base manifold of the (Hermitian)
line bundle-with-connection X).
Let [z] ∈ H 2 (U, 2πZ) be the equivalence class determined by the cocycle
z ∈ Z 2 (U, 2πZ). This equivalence class depends only on Ω and not on
the choices made for the βi , fij (in particlar not on the ambiguity
in defining them) and zijk s. The argument is fully equivalent to the
one presented in the ‘sufficient’-part, except that in this case the fact
that the values of fij (m) (m ∈ Uij ) are only determined up to integer
multiples of 2πi has to be taken into account. This ambiguity is absent
for the equivalence classes [z] ∈ H 2 (U, 2πZ). Indeed, take
f˜ij = fij − 2πinij on Uij for nij : Ui ∩ Uj → Z an integer. These new
f˜ij give rise to the same 2-form Ω, since the nij are constant functions.
Furthermore,
z̃ijk ≡ f˜ij + f˜jk + f˜ki = fij − 2πinij + fjk − 2πinjk + fki − 2πinki =
zijk − 2πi(nij + njk + nki ) = zijk − 2π(δn)ijk
on Uijk . So the z differ at most by a coboundary and hence give rise
to the same equivalence class [z].
94
As a consequence we have found an open cover U of M such that the
class [z] defined by Ω in H 2 (U, R) contains a cocycle z in which all the
zijk are 2π multiples of integers. The integrality condition is clearly an
equivalent statement and the proof of the ‘necessary’-part is complete.
Suppose that the symplectic manifold (M, ω) is quantizable in the sense that
1
~
ω satisfies the integrality condition (IC). Then we know that it is possible to
construct a Hermitian line bundle-with-connection X with curvature 2-form
1
~
ω, called the prequantum line bundle Prequantum line bundle. Now let
m ∈ M and s1 , s2 ∈ ΓX (M ). Define
Z
1 n
(s1 , s2 )(m) ≡ ( ) hs1 , s2 i(m)ω . (85)
2π~ M
Here ω denotes the Liouville form as defined in (59) and h·, ·i denotes the
Hermitian structure on X as defined in the previous subsection by
hs1 , s2 i(m) ≡ hs1 (m), s2 (m)i ≡ hz1 , z2 i ≡ z¯1 z2 using the notation
z1 ≡ s1 (m) ∈ C, z2 ≡ s1 (m) ∈ C. We claim that (·, ·) defines an inner
product. Note that the space ΓX (M ) is a vector space with complex-valued
functions as scalars. Let us now verify the claim,
1 n
) M hs¯1 , s2 i(m)ω ,
R
( 2π~
where in the last identity ¯ω = ω and the fact that integration is a
linear operation are used.
95
• (·, ·) is antilinear in the first argument: Let f be a complex-valued
function on M . To begin with, h·, ·i is antilinear in its first argument,
since
hf s1 , s2 i(m) = hf (m)s1 (m), s2 (m)i = f (m)z ¯ 1 z2 =
¯ ¯
f (m)z¯1 z2 = f (m)hs1 , s2 i(m)
and for s3 ∈ ΓX (M ) we have
hs1 + s2 , s3 i(m) = z1 +¯ z2 z3 = z¯1 z3 + z¯2 z3 = hs1 , s3 i(m) + hs2 , s3 i(m),
denoting s3 (m) ≡ z3 .
Antilinearity of (·, ·) in its first argument now follows directly from
integration being a linear operation.
s 7→ −i~∇Xf s + f s, (86)
Our goal is now to show that this assigment satisfies the conditions (Q1)-(Q4)
introduced in subsection 6.2 and hence is a suitable prequantum assignment
of operators on the prequantum Hilbert space (actually, some subset of it,
which is argued below) to classical observables f . Indeed Q satisfies these
conditions,
96
(1) Q satisfies (Q1): let f, g ∈ S ⊂ C ∞ (M, R) and λ ∈ R. Then,
Q(λf + g) = −i~∇Xλf +g + (λf + g) = −i~∇λXf +Xg + λf + g =
− i~λ∇Xf − i~∇Xg + λf + g = λQ(f ) + Q(g),
which establishes the claim.
(2) Q satisfies (Q2): let f, g ∈ S ⊂ C ∞ (M, R) and let ~1 ω be the symplectic
form on M (as mentioned before). Then
[Q(f ), Q(g)] = [−i~∇Xf + f, −i~∇Xg + g] =
− i~[∇Xf , g] − i~[f, ∇Xg ] − ~2 [∇
Xf , ∇Xg ] =
2ω(Xf ,Xg )
− i~[∇Xf , g] − i~[f, ∇Xg ] − ~2 ∇[Xf ,Xg ] + =
i~
− i~[∇Xf ,g − i~[f, ∇Xg ] − ~2 ∇X{f,g} + {f,g} i~
=
− i~[∇Xf , g] − i~[f, ∇Xg ] − ~2 ∇X{f,g} + i~{f, g}.
In the second identity the commutativity of scalar functions under
pointwise multiplication is used. In the third identity we used the
fact that the symplectic form ~1 ω (which equals the curvature 2-form
as a base manifold of X) satisfies by definition the equality
ω(Xf ,Xg ) i
~
≡ 2
[∇X f
, ∇ Xg ] − ∇ [Xf ,Xg ] . In the fourth identity the defini-
tion of the Poisson bracket (40) and the fundamental identity (41) are
used. Furthermore,taking s ∈ H we have
[∇Xf , g]s = ∇Xf (gs) − g∇Xf s = g∇Xf s + Xf (g)s − g∇Xf s =
Xf (g)s = {f, g}s,
and hence [∇Xf , g] = {f, g}. And
[f, ∇Xg ] = −[∇Xg , f ] = −{g, f } = {f, g}.
It thus follows that,
[Q(f ), Q(g)] = −2i~{f, g} + i~{f, g} − ~2 ∇X{f,g} =
− i~({f, g} − i~∇X{f,g} = −i~Q({f, g}),
as was to be shown.
(3) Q satisfies (Q3): let s1 , s2 ∈ H and f ∈ S ⊂ C ∞ (M, R). Then
(Q(f )s1 , s2 ) = (−i~∇Xf s1 + f s1 , s2 )R=
1 n
i~(∇Xf s1 , sR2 ) + f (s1 , s2 ) = i~( 2π~ ) MRh∇Xf s1 , s2 iω + f (s1 , s2 ) =
1 n 1 n
− i~( 2π~ ) RM hs1 , ∇Xf s2 iω + i~( 2π~ ) M (Xf (s1 , s2 ))ω + f (s1 , s2 ) =
1 n
− i~( 2π~ ) M hs1 , ∇Xf s2 iω + (s1 , f s2 ) = (s1 , Q(f )s2 ).
In the fourth identity the compatibility of the Hermitian structure h·, ·i
on X and the connection ∇ on X is used. In the fifth identity we
used that on the appropriate subsetR of the prequantum Hilbert space
1 n
we are considering the term ( 2π~ ) M (Xf (s1 , s2 ))ω vanishes. For the
technical proof of this fact we refer the reader to ([7, p.129]).
(4) Q satisfies (Q4): let 1 denote the map 1 : M → R m 7→ 1 and 0 the
zero vector field. Then
97
Q(1) = −i~∇X1 + 1 = −i~∇0 + 1 = 1,
where linearity of ∇ is used in the sense that ∇0 = 0̃ with 0̃ the zero
operator on ΓX (M ) and so we are done.
6.7 Polarizations
As mentioned in the introduction of this section the prequantum Hilbert
space is considered ‘too large’, in the sense that it cannot represent the
phase space of a physically reasonable quantum system. One way to reduce
it is by putting an additional geometric structure on the classical phase space
(as a base manifold of the prequantum line bundle), called a polarization.
As metioned by Woodhouse (see [7, p.133]), it should be stressed that the
justification is not based on general mathematical results (such as the irre-
ducibility of representations) but on the examination of particular examples:
the construction unifies and generalizes a number of familiar techniques that,
in the past, have not appeared to have had any obvious connection with each
other and that have sometimes seemed to be overspecialized with applica-
tions only to particular physical systems.
In this subsection we will first treat polarizations in general and then look
at a specific kind of polarization, the so-called Kähler polarization.
Xk (f ) = 0. (88)
98
Indeed, this gives n restrictions on M , reducing its dimension by n. Now
define,
∇X s = 0, (90)
2i
∇[X,Y ] s − ω(X, Y )s = 0 ∀X, Y ∈ V ect(M, R; P ). (92)
~
Condition (92) is automatically satisfied, provided that
99
Definition 6.7.1.2. Let P be a distribution on a manifold M . If
[X, Y ] ∈ V ect(M, R; P ) ∀X, Y ∈ Vect(M, R; P ), the distribution is called
involutive .
In other words, condition (i) translates to the statement that P on M must
be an involutive distribution. To continue,
Definition 6.7.1.3. Let M be a manifold. If for every point on M , a coor-
dinate chart U can be found such that a distribution P on M is spanned by
just the derivatives with respect to the coordinates on U , the distribution is
called completely integrable .
The following important theorem, known as Frobenius theorem, is quoted
without proof. For the proof, the interested reader is referred to ([23, p.170]).
Theorem 6.7.1.1. For any distribution P , P is completely integrable if and
only if P is involutive.
It follows that condition (i) is equivalent to saying that P on M must be
completely integrable. Furthermore,
Definition 6.7.1.4. Let M be a manifold and P a distribution on M . If
N ⊂ M is an n-dimensional submanifold of M with Tm N ⊂ Pm ∀m ∈ M ,
N is called an integral manifold of the distribution P .
We state without proof, that given a rank r completely integrable distribu-
tion P on M , there exists an integral submanifold N ⊂ M of dimension n = r
for all m ∈ M such that Tm N = Pm (for the proof, see ([23])). Consequently,
from condition (i) it follows that for all m ∈ M there exist integral manifolds
N ⊂ M through Pm = Tm N of dimension n, since the rank of P on the
symplectic manifold M is n(= dim(P )).
Now looking at condition (ii) and definition 6.1.1.3, we see that condition
(ii) translates to the statement that the integral manifolds N ⊂ M are La-
grangian submanifolds of M . So, in first instance, one would define a polar-
ization of (M, ω) to be a maximally integrable distribution P of T M such
that Pm is a Lagrangian subspace of Tm M for every m ∈ M . However, this
definition is far too restrictive. For instance, on a two-dimensional surface a
polarization corresponds to a nowhere-vanishing vector field. But the two-
dimensional sphere, S 2 , does not have a globally defined nowhere-vanishing
vector field and hence a polarization in the above sense does not exist for
S 2 . The solution to this problem is to complexify
a the tangent a bundle of M .
That is, T M will be replaced by T M ≡ C
Tm M ≡
C
(Tm M ⊗ C).
m∈M m∈M
100
Maximally integrable distributions P of T M C , such that Pm = (Tm N )C for
N ⊂ M a Lagrangian submanifold of M , are more likely to exist. We thus
arrive at
Definition 6.7.1.5. Let (M, ω) be a symplectic manifold. A polarization
P of (M, ω) is a maximally integrable distribution of T M C such that
Pm is a Lagrangian subspace of Tm M C for all m ∈ M .
To conclude this subsubsection, we define the corresponding polarized sec-
tions as follows,
Definition 6.7.1.6. A polarized section of the prequantum line bundle
X is a section s over M satisfying
101
We are now ready to define what a Kähler manifold is.
Definition 6.7.2.4. A Kähler manifold (M, J, ω) is a complex manifold
(M, J) which at the same time is a symplectic manifold (M, ω) such that the
symplectic form ω and complex structure J are compatible, that is,
ω(JX, JY ) = ω(X, Y ) ∀X, Y ∈ Vect(M, R).
Now suppose we are given a Kähler manifold (M, J, ω). Let m ∈ M be ar-
bitrary. Consider the complexified tangent space (Tm M )C ≡ Tm M ⊗ C. We
can diagonalize the linear endomorphisms Jm : Tm M → Tm M in (Tm M )C .
To this end, let ṽ be a vector in (Tm M )C and set Jm ṽ = λṽ with λ ∈ C
an eigenvalue of Jm . Since we know that Jm ◦ Jm = −Im , it follows for the
eigenvalues λ of Jm that
(Jm ◦ Jm )(ṽ) = −ṽ ⇔ Jm (λṽ) = λ2 ṽ = −ṽ ⇔ λ2 = −1 ⇔ λ = ±i.
The eigenspace (Tm M )(1,0) corresponding to the eigenvalue i of Jm is spanned
by vectors of the form v −iJP m v for v ∈ Tm M . Indeed, denote such an element
in Span(v − PiJm v) by ṽ ≡ i cP i (vi − iJm vi ) for
P vi ∈ Tm M and ci ∈ C. Then
J
Pm ṽ = Jm c (v
i i Pi − iJ v
m iP) = c (J
i i m iv ) − i i(ci Jm (Jm vi ) =
i ci (Jm vi ) + i i ci vi = i i ci (vi − iJm vi ) = iṽ.
Similarly, the eigenspace (Tm M )(0,1) corresponding to the eigenvalue −i of
Jm is spanned by vectors of the form v + iJm v for v ∈ Tm M .
102
ωm (v, w) = 0 ∀v, w ∈ (Tm M )(0,1) ,
where ωm : (Tm M )C × (Tm M )C → C is the alternating multilinear map de-
fined by the symplectic 2-form ω for m ∈ M .
a
Now construct from this the distribution T M (0,1) ≡ (Tm M )(1,0) on T M C .
m
The distribution T M (0,1) is maximally integrable. To see this, first note that
this is equivalent to showing that the distribution is involutive. Secondly,
because M is a complex manifold the complex structure J defined on it is
integrable. It follows that for any X, Y ∈ Vect(M, R) it holds that
N (X, Y ) ≡ [X, Y ] + J[JX, Y ] + J[X, JY ] − [JX, JY ] = 0 ⇔
i[JX, Y ] + i[X, JY ] = iJ[X, PY ] − iJ[JX, JY ]. (0,1)
Take two vector fiels X̃ ≡ i ci (Xi − iJXi ) ∈ T M and
(0,1)
P
Ỹ ≡ j dj (Yj − iJYj ) ∈ T M for Xi , Yj ∈ Vect(M, R) and ci , dj ∈ C.
(0,1)
hThat
P TM is involutive
P now follows i from P
c i (X i − iJX i ), d j (Y j − iJY j ) = i,j ci dj [Xi − iJXi , Yj − iJYj ] =
P i j
The proof of this will not be provided in this thesis and we refer the reader
103
to ([6]) for a much more detailed derivation. The fact that the Kähler form
is closed has important consequences. Indeed,
dω = 0 ⇔ 2id(ωij̄ (z, z̄)dz i ∧ dz̄ j̄ ) = 0 ⇔
∂ω j̄ (z,z̄) ∂ω (z,z̄)
2i i∂z k dz k ∧ dz i ∧ dz̄ j̄ + 2i i∂j̄z̄k̄ dz̄ k̄ ∧ dz i ∧ dz̄ j̄ = 0.
Both parts of the expression above must separately vanish. Denoting ∂i = ∂z∂ i
and ∂ī = ∂∂z̄ī , this implies that
• ∂k ωij̄ dz k ∧ dz i ∧ dz̄ j̄ = 0 ⇔
∂k ωij̄ dz k ∧ dz i ∧ dz̄ j̄ + ∂i ωkj̄ dz i ∧ dz k ∧ dz̄ j̄ = 0 ⇔
∂k ωij̄ − ∂i ωkj̄ = 0,
¯ a,
ω = 2i∂i ∂j̄ Ka (z, z̄)dz i ∧ dz̄ j̄ = 2i∂ ∂K (95)
= −2i ∂∂z̄i ∂z∂ j Ka (z, z̄)dz̄ i ∧ dz j = −2i ∂∂z̄i ∂z∂ j Ka (z, z̄)dz̄ i ∧ dz j = ω.
104
Kähler polarization and discuss the operators acting on it.
Let us denote the space of polarized sections of the prequantum line bundle
X over the Kähler manifold (M, J, ω) with respect to a Kähler polarization
P by ΓX (M ; P ). As previously, denote the prequantum Hilbert space by H.
Since a Kähler polarization just picks out a particular subspace of the pre-
quantum Hilbert space, the Hilbert space is well-defined, since the intersec-
tion H ∩ ΓX (M ; P ) is nonempty. The technical proof that HP does indeed
define a Hilbert space (so in particular has a natural inner product defined
on it) will not be provided in this thesis. For a detailed proof we refer the
reader to ([7, p.136-137]).
105
This condition (96) tells us that a classical observable f on M defines an
operator on HP via the assignment Q(f ) provided that its flow leaves the
polarization invariant. We denote the functions in C ∞ (M, R) whose flow
leaves the polarization invariant by C ∞ (M, R; P ).
We will conclude this section by showing that for the harmonic oscillator,
a physically relevant and easy application, the quantization construction de-
scribed above already fails to give correct results. The required modification
which is needed to give the correct results is called metaplectic quantization.
We will not discuss metaplectic quantization in this thesis, but will show that
it gives the correct results for the harmonic oscillator. The Hamiltonian of
the harmonic oscillator in 1 dimension is given by
1
H(q, p) = (p2 + q 2 ) = z z̄, (99)
2
where q, p are the position and momentum coordinate on the phase space of
the harmonic oscillator, which is T ∗ R2 , and z ≡ √12 (p + iq). Note that T ∗ R2
is a 2-dimensional Kähler manifold, with complex structure defined by
∂ ∂ ∂ ∂
J( ∂q ) = − ∂p , J( ∂p ) = ∂q
and symplectic form ω = idz ∧ dz̄. Hence H(q, p) is an observable which
preserves the polarization P = T M (0,1) , since it is a specific case of (97). In
106
terms of z and z̄ the symplectic form ω equals
ω = dp ∧ dq = −i 2
(dz + dz̄) ∧ (dz − dz̄) = 2i (dz ∧ dz̄) − 2i (dz̄ ∧ dz).
Hence the Hamiltonian vector field associated with the observable H equals,
using (A.1),
XH = i ∂H ∂
∂ z̄ ∂z
− i ∂H ∂
∂z ∂ z̄
= iz ∂z∂
− iz̄ ∂∂z̄ .
Furthermore, the polarized sections with respect to P = T M (0,1) are holo-
morphic scalar functions (scalar functions depending on the z coordinate
only). Indeed, the connection along directions in P takes the form of an
‘ordinary’ directional derivative, since the potential 1-form β corresponding
to ∇ vanishes on P . This implies, for s ∈ ΓX (M ; P ), X ∈ Vect(M, C; P )
and ψ the scalar function associated with s, that
d
∇X̄ s = 0 ⇔ dz̄ ψ(z, z̄) = 0 ⇔ ψ(z, z̄) = ψ(z).
As a consequence, the assignment Q(H) acting on holomorphic functions,
equals,
Q(H)ψ(z) = −i~(∇XH − iβ(XH ))ψ(z) + Hψ(z) =
− i~XH (ψ(z)) − i~(−iβ(X H )) + Hψ(z) =
∂
~z ∂z ψ(z) − i~(−i) i zdz̄ ~
iz ∂
∂z
− iz̄ ∂
∂ z̄
ψ(z) + z z̄ψ(z) =
∂ ∂
~z ∂z ψ(z) − z z̄ + z z̄ = ~z ∂z ψ(z).
∂
Hence the quantized Hamiltonian corresponds with the operator ~z ∂z . The
n
eigenfunctions of this operator are the monomials z (n ≥ 0) with eigenval-
ues n~. Indeed,
∂ n
~z ∂z z = ~znz n−1 = n~z n .
But as the reader might know from introductory quantum mechanics this
spectrum of the Hamiltonian is incorrect and should be shifted by the ground
state energy 21 ~. The problem arises from the fact that viewed as an operator
the Hamiltonian Ĥ = ẑ z̄ˆ is not Hermitian, whereas the symmetrized Hamil-
tonian Ĥ = 12 (ẑ z̄ˆ + z̄ˆẑ) is. This symmetrization process can be incorporated
by a quantization procedure called metaplectic quantization.
A general rule of thumb for including the metaplectic correction (in this
case the term 21 ~) to the operator corresponding to a polarization preserving
observable is the following (see [24]). Let the polarization P be spanned by
the n complex vector fields Xk on a Kähler manifold M , k ∈ {1, ..., n}. If f
preserves the polarization P , there is a matrix a(f ) ≡ (akl (f )) of functions
on M satisfying
[Xf , Xk ] = alk (f )Xl , (100)
as follows from (96). In terms of this matrix, the corrected quantum operator
is (see for example [9]),
1
Q̃(f ) = Q(f ) − i~ tr(a(f )). (101)
2
107
Furthermore,
[XH , ∂∂z̄ ] = −i ∂∂z̄ z ∂z
∂
+ i ∂∂z̄ z̄ ∂∂z̄ = i ∂∂z̄ .
108
7 The orbit method and the representations
of SU(2)
-It is the harmony of the diverse parts, their symmetry, their happy balance;
in a word it is all that introduces order, all that gives unity, that permits us
to see clearly and to comprehend at once both the ensemble and the details.-
Henri Poincare about symmetry ([25])
109
(I2 ) The integral of ω over S 2 is an integral multiple of 2π.
Theorem 7.1.1. The formulations (I1 ) and (I2 ) of the Integrality Condition
for S 2 are equivalent.
Figure 4: The pyramid cover of the sphere. The sphere is covered by the four patches
U1 (blue), U2 (violet), U3 (green) and U4 (red) as indicated in the figure. Considering the
intersections U123 , U124 , U134 and U234 and picking a point in the center of each of them,
drawing all the geodesics between these points gives the edges of a regular tetrahedron.
110
existence of locally constant smooth real functions aijk ≡ fij + fjk + fki
on Uijk . When considering the pyramid cover of the sphere this gives
the globally constant functions (since the Uijk are connected)
a123 ≡ f12 + f23 + f31 on U123
a124 ≡ f12 + f24 + f41 on U124
a134 ≡ f13 + f34 + f41 on U134
a234 ≡ f23 + f34 + f42 on U234 .
We can define new f˜ij from the old fij by
f˜12 ≡ f12 − a124 on U124
f˜13 ≡ f13 − a134 on U134
f˜23 ≡ f23 − a234 on U234
and leaving the other fij unchanged, since daijk = 0 and so ω remains
the same. In terms of the new f˜ij the new ãijk ≡ f˜ij + f˜jk + f˜ki become
ã124 ≡ f˜12 + f˜24 + f˜41 = f12 − a124 + f24 + f41 =
f12 − (f12 + f24 + f41 ) + f24 + f41 = 0
ã134 ≡ f˜13 + f˜34 + f˜41 = f13 − a134 + f34 + f41 =
f13 − (f13 + f34 + f41 ) + f34 + f41 = 0
ã234 ≡ f˜23 + f˜34 + f˜42 = f23 − a234 + f34 + f42 =
f23 − (f23 + f34 + f42 ) + f34 + f42 = 0
ã123 6= 0. R
Hence ã124 , ã134 , ã234 ∈ 2πZ. If we can prove that S 2 ω = −ã123 it fol-
lows that ã123 ∈ 2πZ and consequently, using the results of subsection
6.5, we have found an open cover U of S 2 such that the class defined
by ω in H 2 (U, R) contains a cocycle ã in which all the ãijk are integral
multiples of 2π. Indeed, for a consistent choice of orientations on the
Rboundaries R of the R Ui , R R
2 ω = ω + ω + ω + ω=
RS UR1 U2 R U3 R U4
dβ + dβ + dβ + dβ =
R U1 1 R U2 2 R U3 3 R U4 4
β + ∂U2 β2 + ∂U3 β3 + ∂U4 β4 =
∂UR1 1 R R R R R
− U12 β1 + U13 β1 − U14 β1 + U21 β2 − U23 β2 + U24 β2 −
R R R R R R
β 3 + β 3 − β 3 + β 4 − β 4 + β =
U43 4
RU31 U32 R U34 U41 R U42
(−β1 + β2 ) + U13 (β1 − β3 ) + U14 (−β1 + β4 ) +
RU12 R R
U
(−β 2 + β 3 ) + U
(β 2 − β 4 ) + (−β3 + β4 ) =
R 23 24 R RU34
˜21 + ˜13 + ˜41 + ˜32 + ˜24 + df˜ =
R R R
d f d f df d f d f
RU12 R U13 R U14 R U23 R U24 R U34 43
f˜ + ∂U13 f˜13 + ∂U14 f˜41 + ∂U23 f˜32 + ∂U24 f˜24 + ∂U34 f˜43 =
∂U12 21
(f˜21 (u123 ) − f˜21 (u124 )) + (f˜13 (u123 ) − f˜13 (u134 )) + (f˜41 (u124 ) − f˜41 (u134 )) +
(f˜32 (u123 )− f˜32 (u234 ))+(f˜24 (u124 )− f˜24 (u234 ))+(f˜43 (u134 )− f˜43 (u234 )) =
−ã123 (u123 )+ã124 (u124 )+ã134 (u134 )+ã234 (u234 ) = −ã123 (u123 ) = −ã123 (·).
In the second identity we used Stokes’ theorem. In the third iden-
111
tity we used that ∂U1 = U12 ∪ U13 ∪ U14 , et cetera. In the sixth
identity again Stokes’ theorem is used. In the seventh identity the
u123 , u124 , u234 , u134 denote points in U123 , U124 , U234 , U134 respectively
and is used that ∂V12 = V123 ∪ V124 , et cetera. In the last identity is
used that ã123 is constant.
• (I1 ) ⇒ (I2 ): this implication is now easy to prove. Suppose we are
given the pyramid cover considered above such that the class defined
by ω in H 2 (U, R) contains a cocycle ã in which all the ãijk s are integral
multiples of 2π. Such ãijk always exist, since the intersections Uijk are
all connected for the pyramid cover, from which it follows that the ãijk
are globally constant. We thus know that ã123 , ã124 , ã134 and ã124 are
integral multiples of 2π. It follows that R
−ã123 (u123 ) + ã124
R (u124 ) + ã134 (u134 ) + ã234 (u234 ) = S 2 ω
and hence that S 2 ω ∈ 2πZ, since ã123 , ã124 , ã134 and ã124 are integral
multiples of 2π. The implication follows.
112
• (i, z) ∼ (i, z): we have (i, z) ∼ (i, z) ⇔ z = z for both i = 1 and i = 2,
which is indeed true.
• (i, z1 ) ∼ (j, z2 ) ⇒ (j, z2 ) ∼ (i, z1 ): we will prove this relation for the
case i = 1, j = 2. Suppose (1, z1 ) ∼ (2, z2 ), which means z2 = z11 and
z1 6= 0. Then z1 = z12 and z2 6= 0. Hence (2, z2 ) ∼ (1, z1 ).
On the overlap of the first copy of C and the second copy of C in the def-
inition for S 2 points are indeed identified by ∼ if one views the first copy
of C (minus the point (1, 0), but the equivalence relation does act trivially
on this point anyway) as the image under the map α : C/{0} → C z 7→ z
and the second copy of C (minus the point (2, 0)) as the image under the
map β : C/{0} → C z 7→ z1 . The fact that one can let the north pole of S 2
correspond to the point (1, 0) and the southpole to the point (2, 0) follows
from the fact that (1, ) ∼ (2, ˜) when ˜ = 1 for 6= 0 infinitesimally small.
113
As a next step we show that S 2 is a Kähler manifold. We know from sub-
section 5.2.2 that the symplectic form ω on S 2 equals
i z z̄
−2 ¯
ω = 2R 1 + 4R 2 dz ∧ dz̄. We claim that ω can be written as ω = 2i∂ ∂K
2 2
for K = R log(1 + z z̄/4R ), which shows that S is a Kähler manifold. Here
∂ ≡ dz ∧ ∂z ∂
and ∂¯ ≡ dz̄ ∧ ∂∂z̄ . Indeed,
¯ = ∂ K ∧ dz̄ = ∂ (R log(1 + z z̄/4R2 )) ∧ dz̄ = z/4R 2 ∧ dz̄.
∂K ∂ z̄ ∂ z̄ 1+z z̄/4R
Subsequently,
¯ = i∂ z/2R 2 ∧ dz̄ = i(dz ∧ ∂ )
2i∂ ∂K z/2R
∧ dz̄ =
1+z z̄/4R ∂z 1+z z̄/4R2
z̄z/4R2 z z̄ −2
i 1+z1/2R i
z̄/4R2
− (1+z̄z/4R 2 )2 dz ∧ dz̄ = 2R 1 + 4R 2 dz ∧ dz̄,
which proves the claim.
114
We will describe this quantization assignment both in spherical and stereo-
graphic coordinates.
1 ∂f ∂g ∂f ∂g ∂f
Q(f ) = −i~ 2
− + i cot(θ)g
R sin(θ) ∂θ ∂φ ∂φ ∂θ ∂θ
∞ 2
+f for f ∈ C (S , R; P ). (102)
115
case and the point θ = 0 in the second case. One can get around this prob-
lem in case the stereographic coordinates are used as local coordinates on
the sphere. We know from subsubsection 5.2.2 that in projective coordinates
i z z̄ −2
the symplectic form on {U− } equals ω = 2R 1 + 4R2 dz ∧ dz̄. Viewing
the south pole of the sphere as the point at ∞, this symplectic form only
has a singular point at ∞. One can get rid of this singularity by applying
the change of coordinates z = ω1 for ω ∈ C. As a consequence, when z → ∞
then ω → 0. Applying this change of coordinates one gets, taking 2R = 1
for notational convenience,
1 1
ω = (1+|z| 2 )2 dz ∧ dz̄ =
(1+1/ω 2 )2
d( ω1 ) ∧ d( ω̄1 ) =
|ω|4 1 1
∧ dω̄ = (1+|ω|
(1+|ω|2 )2 ω 2 ω̄ 2
dω 2 )2 dz ∧ dω̄.
Consequently,
−i (z̄/2R)dz z z̄ 2 ∂f ∂ ∂f ∂
β(Xf ) = ~ 1+zz̄/4R2 (R) 1 + 4R 2 ∂ z̄ ∂z
− ∂z ∂ z̄
=
z̄/2R 2 ∂f ∂f
( −i z z̄
= −iz̄
~ 1+z z̄/4R2
)(R) 1 + 4R 2 ∂ z̄ 2~
(1 + z z̄/4R2 ) ∂ z̄ .
Moreover,
z z̄ 2 ∂f ∂g
− ∂f ∂g z̄
(1 + z z̄/4R2 ) ∂f
∇Xf s = R 1 + 4R 2 ∂ z̄ ∂z ∂z ∂ z̄
1̂ − 2~ ∂ z̄
g 1̂.
Finally, the quantization assingment in stereographic coordinates becomes
116
z z̄ 2 ∂f ∂ ∂f ∂ z̄ 2 ∂f
Q(f ) = −i~ R 1 + − − 1 + z z̄/4R
4R2 ∂ z̄ ∂z ∂z ∂ z̄ 2~ ∂ z̄
∞ 2
+f for f ∈ C (S , R; P ). (103)
and
117
the Lie algebra of general 2 by 2 complex valued traceless matrices. Since
this Lie algebra has complex dimension 3 it spanned by the set of matrices:
0 1 0 0 1 0
X= , Y = , E= .
0 0 1 0 0 −1
Now consider the base elements X, Y and E of sl(2, C). Substituting them
for ξ in (108) gives
∂ ∂ ∂ ∂
Xp = −z2 p, Y p = −z1 p, Ep = −z1 + z2 p. (109)
∂z1 ∂z2 ∂z1 ∂z2
118
It follows that the base elements X, Y and E can be associated with the
operators
∂ ∂ ∂ ∂
X = −z2 , Y = −z1 , E = −z1 + z2 . (110)
∂z1 ∂z2 ∂z1 ∂z2
One can indeed check that these operators satisfy the relations (105) and
hence form a Lie algebra representation of su(2) ⊗ C ∼ = sl(2, C) in Pn (C2 ).
The idea of the infinitesimal method is to reduce the consideration of Lie
groups to consideration of the associated (complexified) Lie algebras. From
the fact that the operators in (105) form a Lie algebra representation of
su(2) ⊗ C it indeed follows that every irreducible continuous representation
of SU (2) is equivalent to πn for some n ∈ N. For the full rigourous proof see
for example ([3]) or ([26]).
Here z13 , z12 z2 , z1 z22 and z23 denote the basis elements of P3 (C2 ). The arrows
represent the action of the operators X, Y and E on the basis elements.
The numbers above the arrows denote the coefficients in front of the basis
elements after the action has taken place. As a check, we apply the operator
119
X to the basis elements of P3 (C2 ). We indeed calculate,
−z2 ∂z∂ 1 (z13 ) = −3z12 z2 ,
− z2 ∂z∂ 1 (z12 z2 ) = −2z1 z22 ,
− z2 ∂z∂ 1 (z1 z22 ) = −z23 ,
− z2 ∂z∂ 1 (z23 ) = 0.
120
R(z− +z− ) −iR(z− −z− ) 2Rz−
x+ = x1 + ix2 = 1+|z− |2
+i 1+|z− |2
= 1+|z− |2
,
R(z− +z− ) −iR(z− −z− ) 2Rz−
x− = x1 − ix2 = 1+|z− |2
−i 1+|z− |2
= 1+|z− |2
,
R(|z− |2 −1)
x3 = 1+|z− |2
.
Note that here is used that Q(f ) acts on holomorphic sections, hence the
∂
∂z−
-terms can be dropped in the calculation. The determined operators
Q(x+ ), Q(x− ), Q(x3 ) satisfy the same characteristic equations
121
We end this section by giving an alternative method to determine that the
Hilbert space for spheres of half-integer radius k ∈ Z2 consists of polynomials
of degree ≤ k compared the method described in subsection 7.2.
On U− it holds that
(x1 + ix2 )
z− = . (114)
R − x3
Indeed,
D E . D E
x1 +ix2 R(z− +z− ) −iR(z− −z− ) R(|z− |2 −1)
= 2 + i 2 R− 1+|z− |2
=
D1−x3
E . D1+|z− | E 1+|z− |
2R 2R
1+|z− |2 1+|z− |2
= z− .
(x1 − ix2 )
z+ = . (115)
R + x3
On the overlap of the patches U± we have z− z+ = 1, which follows from
(x1 −ix2 ) x2 +x2 x2 +x2
z− z+ = (xR−x
1 +ix2 )
3 R+x3
= R12 −x22 = x12 +x22 = 1.
3 1 2
As the patches U± are topologically trivial, any line bundle is trivial when
restricted to one of these patches. Thus, all we have to do is to give a pre-
scription for gluing these trivial bundles together over, say, the equator. If
one does this with the transition function (z− )−k = (z+ )k one obtains the
line bundle Lk over a sphere of radius k ∈ Z2 . To determine the global
holomorphic sections of Lk in the trivialization U± , we have to check which
local holomorphic functions on U± can be patched together via the transi-
tion functions.
P∞ Since holomorphic functions f (z) can be written as a series
f (z) = i=0 ai z i with ai ∈ C, it follows that a basis for holomorphic func-
tions on U+ is given by the monomials (z+ )l and on U− by (z− )m for l and
m non-negative integers. We thus have to find the solution to the equation
122
8 Conclusions and outlook
-Life is infinitely stranger than anything which the mind of man could invent.
We would not dare to conceive the things which are really merely common-
places of existence.-
Sir Arthur Conan Doyle ([28])
In the first subsection of this section we will try to gain insight in the overall
structure of this thesis and recall the important results derived previously.
Finally, in the last subsection, we will consider were future research efforts
should be directed.
The parts (1) and (2) are the basic ingredients used in the Orbit Method, the
method to determine all irreducible unitary representations of a Lie group.
Part (3) is an application of the Orbit Method to the particular Lie group
SU (2).
(1):
In part (1) we introduced the notion of coadjoint orbit, the most important
new mathematical object that has been brought into consideration in connec-
tion with the Orbit Method. As an application, we showed in two alternative
ways that the coadjoint orbits of SU (2) are spheres. Subsequently, we proved
the beautiful non-trivial theorem that the coadjoint orbits of a matrix group
posses symplectic structure (and mentioned that this result is still valid for
general Lie groups). In particular, it followed that the coadjoint orbits of
a matrix group are always even-dimensional. Then we derived the explicit
shape of the Kirillov 2-form for the coadjoint orbits of SU(2) (the spheres)
in both spherical and stereographic coordinates. The symplectic structure
of the coadjoint orbits provided an important link between coadjoint orbits
and geometric quantization, since quantization was as assignment applied to
smooth real functions on symplectic manifolds (classical observables).
123
(2):
This brought us to part (2) of the thesis. Here we showed that Geometric
Quantization, formulating a relationship between classical- and quantum me-
chanics in a geometric language, is a rigorous quantization scheme applicable
to general curved manifolds. In particular, it is showed that the Hilbert space
over a curved manifold (M, ω) does not consist of square integrable scalar
functions on (M, ω), but instead of polarized, square integrable sections of
a Hermitian line bundle-with-connection over (M, ω) with curvature Ω = ω~ .
Important roles in the construction of the Hilbert space are played by the
Integrality Condition and the notion of Polarization. We proved the Integral-
ity Condition to be both a sufficient and necessary condition for a Hermitian
line bundle-with-connection over (M, ω) with curvature Ω = ω~ to exist. As
an important specific type of polarization (a way of reducing the ‘too big’
prequantum Hilbert space) we considered the Kähler polarization, naturally
defined on kähler manifolds. Finally, as an important explicit realization of
geometric quantization, we showed the assignment (87) acting on sections in
the Hilbert space HP , to satisfy all axioms (Q1)-(Q5), imposed on a quanti-
zation assignment.
(3):
In the third and last part we applied our geometric quantization construc-
tion to the spheres, the coadjoint orbits of SU (2). We find that in that
case the integrality condition translates into the spheres being of half-integer
radius in order for smooth real functions defined on it to be quantizable.
Furthermore, we showed that the Hilbert space on these spheres of half-
integer radius consists of polynomials of degree n ≤ k, for a sphere of radius
k ∈ Z2 . Subsequently, we determined the irreducible unitary representations
of SU (2) by the Infinitesimal Method. Finally, as an application of the Orbit
Method we brought coadjoint orbits and geometric quantization together by
determing all irreducible unitary representations of SU (2) by means of the
Orbit Method.
8.2 Outlook
Geometric Quantization is a quantization construction applicable to symplec-
tic manifolds. Looking at the classical phase space as a symplectic manifold
presupposes to take point particles as the fundamental building blocks of na-
ture. In field theory the fundamental building blocks of nature are considered
to be fields instead of point particles and therefore it is interesting to extend
Geometric Quantization to fields, a first step towards making Quantum Field
Theory mathematically rigorous. An introduction to such an approach can
124
be found in, for example, ([7]). In string theory the fundamental build-
ing blocks of nature are one-dimensional extensions of point particles, being
strings. Finding a rigorous quantization scheme corresponding to such a the-
ory is another very interesting prospect.
125
A An identity at the heart of quantization
-Let us try to introduce a quantum Poisson Bracket which shall be the ana-
logue of the classical one.-
P.A.M. Dirac ([12, p.86])
First of all, we will derive the form of the Hamiltonian vector fields in local
coordinates. Take Y to be a vectorfield on M . Because Xf and Y are vector
fields, in local coordinates they can be written as: Xf = ξ i ∂x∂ i , Y = η i ∂x∂ i .
Consequently, in order to find the form of Xf in local coordinates we need
to determine the coefficients ξ i .
that ω is a symplectic form gives that ωji is invertible, that is, ω kj ωji = δik .
∂f i kj ∂f
It follows that ∂x j = 2ωji ξ ⇔ ω ∂xj
= 2ω kj ωji ξ i = 2ξ k . Hence, in local
coordinates,
1 ∂ 1 ∂f ∂
Xf = ξ k k = ω kj j k . (A.1)
2 ∂x 2 ∂x ∂x
126
The identity (41) relates the Lie algebras of Hamiltonina vector fields and
functions on M . We already proved that (C ∞ (M, R), {., .}) has the structure
of a Lie algebra. We can use identities (41) and (A.1) to show that Hamilto-
nian vector fields together with the Lie bracket for vector fields also form a
Lie algebra. First of all, notice that the space of Hamiltonian vector fields is
a linear space, where smooth real-valued functions play the role of ‘scalars’.
Furthermore, the Lie bracket for vector fields satisfies:
• Jacobi identity: for h ∈ C ∞ (M, R), [Xf , [Xg , Xh ]] + [Xg , [Xh , Xf ]]+
[Xh , [Xf , Xg ]] = X{f,{g,h}} + X{g,{h,f }} + X{h,{f,g}} =
X{f,{g,h}}+{g,{h,f }}+{h,{f,g}} = X0 = 0.
To prove identity (41) we will write out both sides of this equality in local
coordinates and check that the corresponding expressions match. The left
hand side equals
By switching the indices i and k in the second term of the last expression
(over both indices is summed) one easily sees that the second and fourth
term in this expression cancel. The remaining expression can be written out
further:
1 ij
4
ω ∂i f ∂j (ω kl ∂k g)∂l − 14 ω ij ∂i g∂j (ω kl ∂k f )∂l =
1 ij
4
ω ∂i f ω kl ∂j ∂k g∂l − 41 ω ij ∂i gω kl ∂j ∂k f ∂l + 14 ω ij ∂i f ∂j ω kl ∂k g∂l − 14 ω ij ∂i g∂j ω kl ∂k f ∂l .
Writing out the right-hand side of identity (41) gives, using the definition
of the Poisson bracket in local coordinates, that
127
X{f,g} = − 12 ω ij ∂i {f, g}∂j = 41 ω ij ∂i (ω kl ∂k f ∂l g)∂j =
1 ij
4
ω ∂i ω kl ∂k f ∂l g∂j + 14 ω ij ω kl ∂i ∂k f ∂l g∂j + 14 ω ij ω kl ∂k f ∂i ∂l g∂j .
−ω ij ∂i ω kl + ω kr ∂r ω lj − ω lr ∂r ω kj = 0.
The last identity is known as the Jacobi identity for symplectic forms . We
will derive the Jacobi identity for symplectic forms from the Jacobi identity
for Poisson forms, using that ω is a symplectic form, hence invertible. Let
us start by multiplying the Jacobi identity for Poisson forms by ωsl , using
ωsl ω lk = δsk ,
ω kl ∂k ω mn + ω kn ∂k ω lm + ω km ∂k ω nl = 0 ⇔
ωsl ω kl ∂k ω mn + ωsl ω kn ∂k ω lm + ωsl ω km ∂k ω nl = 0 ⇔
128
−∂s ω mn + ωsl ω kn ∂k ω lm + ωsl ω km ∂k ω nl = 0
Using differentiation by parts and the fact that the boundary terms,
∂k (ωsl ω lm ) = ∂k δsm , vanish, we get
We can write the first term of this expression as ω mr ∂s ωrl ω ln . This follows
from ∂s (ω mn ωnl ) = 0 ⇔ ∂s ω mn ωnl + ω mn ∂s ωnl = 0 ⇔ ∂s ω mn + ω mr ∂s ωrl ω ln =
0 ⇔ −∂s ω mn = ω mr ∂s ωrl ω ln . Multiplying the resulting expression by ωpm
and ωnq respectively, gives
which is indeed the Jacobi identity for symplectic forms! Identity (41) is
proved.
129
References
[1] A.A. Kirillov, Lectures on the Orbit Method, American Math-
ematical Society, volume 64, 2000.
[3] E.P. van den Ban, Lie groups, Lecture notes, Spring 2005.
[14] P.A.M. Dirac, Proc. Roy. Soc. London ser. A, 109, 642-53,
1926.
130
[16] G.E. Bredon, Topology and geometry, Springer-Verlag New
York, 1993.
131
[31] Topics in Representation Theory: The Killing Form, Reflec-
tions and Classification of Root Systems, http://www.math.
columbia.edu/~woit/notes9.pdf.
132
Index
Čech construction, 86 Distribution, 99
Čech cohomology group, 83 completely integrable, 100
s0(3), 37 involutive, 100
su(2), 32
generators of, 36 Endomorphism, 19
pth de Rham cohomology group, 85 Exponential map, 21
1-form, 16 Exterior derivative, 17
Action Flow, 18
left, 23 Formal complex conjugate, 75
right, 23 Gauge-transformations, 73
transitive, 24 Geometric Quantization, 13, 55, 123
ad, 29
ad∗ , 31 Hamilton’s equations, 62
Adapted potential, 105 Hamiltonian, 62
Adjoint representation, 26 Hamiltonian vector field, 58
Automorphism, 19 Hermitian line bundle with connec-
group, 20 tion, 75
Hermitian operator, 67
Canonical coordinates, 62 Hermitian structure, 75
Cartan formula, 17 compatible, 75
Category theory, 18 Hilbert space, 66, 105
Coadjoint orbit, 29, 123 for coadjoint orbits SU(2), 112
Coadjoint representation, 27 Homomorphism, 19
Coboundary, 82 of G-representations, 35
Cocycle, 82
Complete metric space, 67 Infinitesimal generator, 18
Complex inner product space, 66 Infinitesimal method, 117
Complex structure, 101 Inner product, 66
Connection, 75 Integrable, 101
compatible, 75 Integrality condition, 86
Cotangent bundle, 16 for spheres, 109
Cotangent space, 16 Invariant subspace, 24
Curvature 2-form, 76 Isomorphism, 19
Curve, 16
Jacobi identity, 22
De Rham isomorphism, 85 for Poisson forms, 128
Diffeomorphism, 20 for symplectic forms, 39, 128
Distance function, 66
133
k-form, 17 Prequantization, 70
Kähler potential, 104 Prequantum Hilbert space, 96
Kirillov form, 39 Pull-back, 20
Pure quantum state, 67
Lagrangian submanifold, 61 Push-forward, 20
Lagrangian subspace, 61 Pyramid cover, 110
Lie algebra, 22
homomorphism, 22 Quantization, 12, 67
Lie bracket, 17, 29 for coadjoint orbits SU(2), 114
Lie derivative, 17 Quantum condition, 69
Lie group, 20 Quantum mechanics, 12
automorphism, 21
homomomorphism, 21 Ray, 67
isomorphism, 21 Representation, 24
Line bundle, 74 equivalent, 25
Liouville form, 71 irreducible, 24
Lie algebra, 25
Manifold, 14
almost complex, 101 Section, 15, 74
complex, 101 polarized, 101
integral, 100 Stereographic projection, 50
Kähler, 102 Structure of a smooth manifold, 15
Poisson, 45 SU(2), 31
symplectic, 39, 56 Submanifold, 15
Metaplectic correction, 107 Symplectic potential, 57
Morphism, 18 Symplectic vector space, 61
Partition of unity, 83
Phase space, 62
Poisson bracket, 45
Polarization, 101
Kähler, 103
134