Escolar Documentos
Profissional Documentos
Cultura Documentos
A
B
Life Sciences
Physics
Mathematical and
Physical Sciences
Behavioral and
Social Sciences
Applied Sciences
Computational Techniques
in Quantum Chemistry
and Molecular Physics
Proceedings of the NATO Advanced Study Institute
held at Ramsau, Germany, 4-21 September, 1974
edited by
G. H. F. DIERCKSEN,
B. T. SUTCLIFFE,
A. VEILLARD,
Munich
York
Strasbourg
ISBN-13: 978-94-010-1817-3
e-ISBN-13: 978-94-010-1815-9
001: 10.1007/978-94-010-1815-9
CONTENTS
PREFACE
VII
B. T. Sutcliffe
MOLECULAR PROPERTIES
107
201
251
299
V. R. Saunders
347
N. C. Handy
CORRELATED WAVEFUNCTIONS
425
M. A. Robb
MOLECULES IN ASTROPHYSICS
435
505
529
PREFACE
PREFACE
VIII
B. T. Sutcliffe
Dept. of Chemistry, University of York, England
1.
E'l'
(1.1)
Hamiltonian Operator
H(l,2--N)
Hwe
L: h (i)
1=1
shall take to be
+
L: Ie /
1J = 1
The
41TE
r ..
(1. 2a)
~J
(1. 2b)
h(i)
41TE
h 2 /me 2 .
Diercksen et aL (eds.), Computational Techniques in Quantum Chemistry and Molecular Physics, 1-105.
All Rights Reserved Copyright 1975 by D. Reidel Publishing Company, Dordrecht-Holland.
B. T.SUTCLIFFE
To a lesser extent we shall also be concerned with describing practicable methods of calculating expectation values of
operators, between the calculated state functions.
We shall not have much occasion in future to refer explicitly
to why we want to know how to solve these problems, so it seems
appropriate to describe the context at this stage, in the hope of
forestalling possible puzzlement or exasperation later.
We hope to be able to produce numbers from our calculations
which, at very least, can be compared with experimental numbers.
Better than this we hope to be able to anticipate numerically
the outcome of as yet unperformed experiments, and best of all,
we hope to have methods which will yield numbers so reliable as
to be useful alternatives to experimental measurement.
When we reflect on the fact that most experiments are done
on huge assemblies of molecules, (a tube of gas in a spectroscope,
a flask of liquid on a heating mantle and so on), and that these
assemblies are open, (if only by virtue of the intervention of
the measuring apparatus) and therefore developing in time, we may
wonder what possible relevance our simple isolated molecule, time
independant Hamiltonian, can have to an experimental situation.
The answer is, I think, that so far as we know, the solution of
the problem as we have stated it is a sine
qu~non
of any
where
system.
H can
ll(n)
g et S(n)
(1. 4)
nrmn
B. T. SUTCLIFFE
However in many
~,
a long way from being directly usable to achieve our desired end.
The solution if (1.1) in our case, though difficult enough, is
simply the start of a hard road to observation.
Now what I should like to do in this series of lectures is
the following.
We are interested
Then in the
In this
useful perspective
in
in it, but many of them are there, if you can wade through the
rather abstract maths.
is unfortunately rather badly proof read and not always too clear)
by Kato (Supp. Prog. Theoret. Phys. 40, 2 (1967
much of the progress in the field up to 1966.
which summarises
2.
B. T. SUTCLIFFE
H(1,2)
anI
(2.1)
2m
E'I'
(2.2)
which we must specify in detail.
Now
x. + a
1
t(a )e kx
x
k(x+a )
x
a k kx
e x e
(2.3)
is an important and quite general point (it holds for all atoms
and molecules) unless by some means or another we get rid of the
centre of mass motion of a quantum mechanical problem, we cannot
proceed any further with the solution of the problem in differential equation form.
2m
47rE: r
o
To do this let us
with M
N
(
N
n
I:
I:
n=l
m
n=l n
x.
2,3
1,2 --- N
. '1 ar express10ns
.
f or Y, Z, y.,
" z.
+ Nm, an d S1m1
J
J
=1
B. T. SUTCLIFFE
Indeed it is a very
....
1: a .. ~ .
r.
j =1 1J
a ..
j=l 1J
0,
2,3 -- N
(2.5)
_t2 1,72(R)
-1/
2M
_'h2
1,72
_ t2 1,72
2m
2ml
- Ze
41TE: r
0
(2.6)
1,72(R) _t?1,72 - Ze 2
2M
2]J
41T r
0
' (R,r)
T (R) '
(~).
(2.7)
.
1
where
(2.8)
ze 2 ) ' (r)
.' (r)
(2.3b)
41T o r
Formal functions which satisfy (2.8a) are, of course:
~
T(R)
N(t)eik,R
(2.9a)
(2.9b)
where 0
.::.1 kl 2.::.
00
Unfortunately
Let
Ze 2
HI (1,2,--N)
4'TTE: r.
o 1
2
E'
e
i ,j=l
4'TTE: r ..
(2.10a)
o 1J
- t2 2m
a'
-= 2
r:
lj=l
E'
lj=l
e
4'TTE:--r:-:o 1J
V(i). V(j)
(2.10b)
It'2
Now it is very easy to show that HI and H'2 differ only by a change
of scale. This can be shown by replacing the variab les r in HI
A
by r =
a
(2.11a)
and
a
(2.11b)
10
B. T. SUTCLIFFE
~
L
o i=l
(-!
L' _1_
N
lj=l rij
2
Z
V (i) - /r!) + ~
HI
(2.12)
being
(2.13a)
(2.l3b)
Hi
m/~
~/m,
Such a
11
(2.l4a)
and
(2.l4b)
The unit b
bohr radius.
Sometimes b
of course is the
"
fill
- 1
N
L
2ml ij=l
'Y(i).
(2.15 )
'Y(j)
12
B. T. SUTCLIFFE
and (2.10b) you will see that they are both invariant to any
orthogonal co-ordinate transformation of the form that maps
vectors in the co-ordinate space:
t.
-+"t.'
1
where
i.'
= R~.1
1
i = l,2--N
for
y.1
R has
(2.16)
the 3 by 3
such that if
(i
(2.17)
k)
Z.
with f j and
r.
r.' =Rr.,
-.1
_...J.
r.
,..
(2.18)
e r.
-1
(2.19)
and
where
matrix transpose.
,..
Typically we think of the operators R as rotations,
reflections and the inversion, and we think of this invariance as
reflected in the fact that we can classify atomic states according
to their L and
However if
because the nuclei are fixed and define the co-ordinate frame,
while the Hamiltonian in the centre-of-mass-removed system is
invariant under the orthogonal mapping of all electronic and
nuclear variables.
13
fix~d
~,
J.e.p.
41, 121,
14
B. T. SUTCLIFFE
However in the
try to separate off rotation the problem gets even worse and one
finds that one simply cannot avoid permutations of the variables
which re-define the orientation of the rotating co-ordinate
system.
15
n=l
zn / It.1 - t n I
(2.20)
16
B. T. SUTCLIFFE
Nevertheless we do in certain
Even if we do this
--
~(r,
In this context it is
~(I'
!).
It should of
17
~ell
11,
433, 1970
It will not
18
B. T. SUTCLIFFE
Academic
li,
479, 1968).
19
R = l:mARA/M
(2.22)
(the sum going over all electrons and nuclei) and where
~
= -m
l: r.
Mil
(2.23)
The trans-
=0
l:mA (r A + 8)
A
=0
(2.24)
n n
(2.25)
The conditions
(2.24) and (2.25) define the electron position vectors~. and the
. .
nuclear position
vectors ~
rn comp 1 ete 1 y.
.
~.
Here olioV is
the centre of mass ve 1 OClty
va is du"I dt and ~.
vA is the
Now
.olio
20
B. T. SUTCLIFFE
~
(2.27)
Now
as being expressed
n n x n
n
. h t he
toget h er w~t
(2.28)
'
requ~rement
, expresse d'~n t h e
th
at, '~ f ~o
rn ~s
In this case of
~n
the
rotat~ng
~o
If the molecule is
co-ord~nate
system we
21
~o
+ L:
(2.29)
viiin nq q
~pq
~o)
x vn
..
(2.30)
L: ~pqQp VQq
pq
..
E 1
1
np x nq
n
(2.31)
is
V Qq
....
V Qq
The
1S
the
22
B. T. SUTCLIFFE
...:.
frame.
...N
..:.
J - 1: (~.
p.)
1
1:
pq
spq
Q P
P q
(2.33)
-'"
The tensor
tensor
= x,
y, z) given by
(2.34)
where
if any
23
Phys. Rev.
~,
of the Euler angles for the rotating frame and momenta conjugate
to these Euler angles, we can proceed with the Podolsky transformation much in the way done in section 11.4 of Wilson, Decius
and Cross.
Ii = 1
2M
1
~2 + 1 r~4~ ~
"2
as
_I
~2~
as
p4
7; P ,,-~ P A~
2
+ l~
qll
qll" + lmEp. +
"2 i
"2q
where
ais
! (r .P-)
2M
n'Uc.
2 + V
P and
(2.35)
is there-
here that reference frame does matter for ~, in that its total
angular momentum component J has to be defined in terms of the
Euler angles referred to the molecule-fixed rotating co-ordinate
system.
24
I ---- J
B. T. SUTCLIFFE
1P
* 1P
sin eded!pdx'lf dQ
q q
'l1'
1.
dr.
1.
=1
(2.36)
where e, !p and X are the Euler 'angles of the problem and where
the dQq and dr i are the "volume" elements for the vibration coordinates and electronic co-ordinates respectively, in the
rotating co-ordinate system.
Now until quite recently that would be where our discussion
would have ended, but Watson in a really important paper (op.cit.)
was able to show (and Moss was able to generalise his discussions
to the case we are considering) that you could "commute" out the
factors and (neglecting the translational portion) write the
Hamil tonian as:
It' = 1 L ~ ~ SRS + 1 L P 2 + 1 LP ,2
2 a, S a a
2 q
2m
1.
(2.37)
In other words to show that the total effect of "~ was to add a
rather small mass dependent term to the potential.
He was also
able to show that "~aS was a function only of the elements of the
inertia tensor at the reference configuration and the vibrational
co-ordinates, and thus commuted with
of factors in (2.37) does not matter.
Na
and
NS
(ella s/'aQp ) 0
where the
(2.38)
In terms of ~ and
25
configuration) we obtain
-1 _
-0
1:
-u
-1 a 1+1 + 3
- -
I -1
-0
~I -1 ~I -1+ .
-"'0
(2.39)
where
a = kE a.~
(2.40)
fO
is a positive
problematic.
And now we are on the home stretch and the only thing we
need consider further is the spin co-ordinate.
consider electron spins for the moment.
Let us just
(2.37) does not explicitly contain any spin terms (except perhaps
in V) its solutions will be functions of electron spin and it will
be natural to refer electron spin to the rotating axis system
(that is Hund's case (a unless there is a strong external
magnetic field, or very strong spin-molecular-rotation interaction {Hund's case (b. Now if one wants to use spin functions
referring to the rotating co-ordinate system then one must find
the transformation operator,
~rot
~ ~fixed
0 R 0- 1
ro t. This transformation is
of course just the rotation matrix for spin functions, and as
~
26
B. T. SUTCLIFFE
1
N.
changing it from ~ to
N-S
Uoperator
Hinvolving
spin operators, but will not affect the usual electrostatic terms
in V so we will not consider its effects in this context in detail.
(I must say I always find this result
ing one since there are no "bare"
N~ N- S a
S terms
very surpris-
in the original
But of
1 involves
only the
27
see that this kind of operator is really highly singular and that
it is possible to show that a Hamiltonian containing such highly
singular terms is not generally a proper object in quantum
mechanics.
that
such
configurat~
....0
Now
Hdo
not diverge.
It is therefore
only if this second step can (at least in principle) be made that
(2.37) makes any sense as a Hamiltonian operator.
(In passing it
~,
113, 1935, where they did attempt to get at a result like this by
direct quantum mechanical discussion without specifying a
reference configuration so directly.
trouble develops there, and it is manifested in ghastly singularities in the operators and in dependence among the variables).
Supposing that we can attempt,say,a perturbation theoretic
approach of the full problem, then it only remains to determine
if the answers so obtained correspond to the situation one thinks
one is describing, and hopefully by a good choice of the reference
configuration,one can always do this.
Thus we conclude that (2.37) can at best make sense for a
strongly bound system and we are not sure whether it does for all
such systems (for example it is not clear how far one could go
with this approach in systems that one would think of usually as
being single bonds about which "free" rotation was possible).
Setting aside these problems for the moment however, we see
that if V is just the ordinary electrostatic potential and hence
28
B. T. SUTCLIFFE
not a function of the Euler angles and we quantise the spin along
the space fixed axes, then the solution to the problem specified
...
"-1
can
always be written as
(2.41)
where
0.
and
noticed that $
will be very tricky to make properly antien
symmetric with respect to fermion nuclear interchange and
symmetric with respect to boson nuclear interchange because the
nuclear space co-ordinates are all mixed up in Q and in any case
not all need necessarily be there).
contain the ~~ as parameters.
~,
MS and MI
Now, in a sense, we
part of 'H
" in (2.37) depends on them in our approximation, we can
just write, if we neglect symmetry/anti-symmetry considerations
where the spins are now referred to the rotating frame, and we
see that the component of the energy associated $
with respect to nuclear spin.
is degenerate
en
Now to cope with getting the right
29
Hen )
and
en
But of course in practice
~
(~
ell
~e
as
(2.42)
30
B. T. SUTCLIFFE
~an
Now if E, as a function
1,
....0
and then the Corio lis coupling constants and we can thus solve
the nuclear problem as accurately as we like.
Furthermore we can
31
and solve the problem for the nuclei on the electronic surface
you get (which is in practice one expressed in terms of 3 Nn -3
co-ordinates so that uniform translation, but not relative
translation, is taken out, but rotation is left in) and this is
now a growing art form.
32
B. T. SUTCLIFFE
IE'
(2.44)
= 8
X (2.45)
mx
nm for all ,...
....
<~ I~ >
= !1jJ
n n
(x)$ (x, X)
(2.46)
33
first term as
(2.48)
and the second term we can write as
Nn
E ~ p(q) .p(q)
(2.49)
q=l m
q
and our second term then becomes, using the product rule,
(2.50)
If we multiply from the left by
E 1jJ (X)
m-
where
cmn (X)
,"
E 1 (Aq + Bq )
mq
mn
mn
q=l
(2.52)
"
Since p is
(2.53)
where
(X)
m-
(2.54)
34
B. T. SUTCLIFFE
"A,
T~
and
T~
!')
NNnl , . . "
L
p(i) .p(q)
Ct
~,q
q
(2.55)
the C
and the whole idea of a potential falls to the ground
nm
again.
35
Thus if one
decides to study the formation of NH4CI from NH3 and HCI and so
constructs ones fixed nucleus electronic problem that one is
always essentially describing NH3 and HCI as separate entities,
the free co-ordinates being their separation and their almostrigid rotations, then almost certainly one is not in trouble.
You can think of other examples.
What I wanted to say was simply that it is premature, even
wrong, to suppose that you can always give the significance of a
"potential" to the clamped nucleus eigenvalues as functions of
the internal co-ordinates.
or not.
Certainly we
Un(~)
36
B. T. SUTCLIFFE
37
3.
that I did not make explicit in the previous section was of course
the well-known one that the differential equation determined
formally by the operator
eigen-states whatso-
38
B. T. SUTCLIFFE
perfectly possible to
"H(1,2)
- !
2
L
i=l
(i (i)
- Z )
r:-
(3.1)
1.
We "know"
that it has a discrete spectrum for Z=2 because we have seen the
line spectrum of the helium atom, and we know it has one for Z>2
on similar arguments, but has it got a discrete spectrum say for
Z=l?
39
without the electric field, say, and include the electric field
later by perturbation theory, to describe the Stark effect and
similarly for the Zeeman effect".
Of course
I agree with you when you assert that in 99 cases out of 100 in
systems of chemical interest we are not going to run into any
troub Ie on this kind of score, but I think that we are moving into
an era in computational quantum chemistry where we are going to be
asked to look at pretty bizzare molecules and ions for people,
such as those postulated as reaction intermediates and it will not
always be so clear as it seems now whether there are any discrete
solutions to the specified problem.
What then is known of these matters?
has been an enormous amount of work done and the name associated
with much of it Kato and certainly he provides a good review of
the field in his 1967 article, in the Supplement of the Progress
of Theoretical Physics 40, 3, 1967.
40
B. T. SUTCLIFFE
Press, 1972).
~,
It can
~,
~as
(~,
0).
Of
(~,
0).
It is not
41
(~*~)2
<k.
From this
in the discrete
spectrum.
These last results may strike one as a little odd,
particularly the result on the boundedness of
~,
because it
Thus if we
-ar
- Nax e- ar
etc.
(3.2)
(3.3)
B. T. SUTCLIFFE
42
= IJ'(O'~2
---
!N) (l-Z
.lo
r l ) + rla l (!.2'!.3 -+
O(r l 2 )
.!N)
(3.4a)
~
...l.
+ O(r 12 )
(3.4b)
where
~
= .:.r l
+ r2
r l2
F.
and ,..
r denotes the variable
43
are often called the integrated form of the Kato cusp conditions.
Now it might be thought at this stage that I am now going
to say that "It can further be shown that the eigen functions of
II
Technically the
Hsay,
In
~usually
44
B. T. SUTCLIFFE
We shall return to
get
B S (i)
-z
(3.5)
2m
~(')
2 ~s (.).::.
1 .r .. P J
81fm2
r ..
lJX
lJ
...l....l.
s(j). r .. p(j)
lJX
(3.6)
45
We
A s imil ar
This in its
46
4.
B. T. SUTCLIFFE
That is if
~
~.
can write.
~
= Lc.~.(xl'-~~'
.~~-..-z.
---
(4.1)
~-)
-N
We shall
The
m
L
i=l
(4.2)
c.~.
~ ~
(4.4)
JfJ2=!f*fdX
the integral being over all configuration space.
The requirement
47
(4.5)
the integral again being over all configuration space.
The convergence expressed in (4.3) is sometimes called
convergence in the mean.
(4.6)
x in range
~.
is a
is an eigenfunction of the
However a
discrete set of one particle functions is the set of threedimensional simple harmonic oscillatOr functions, though there
are plenty of other single particle complete sets that do not
arise directly from physical problems.
Now if we assume that we are working in a complete set of
functions, the immediate problem is how to determine the c., and
1
48
B. T. SUTCLIFFE
Kato, Trans. Am. Math. Soc. lQ, 195, 1951) that we can replace
the eigenvalue problem by a variational problem to determine the
c .
~
theorem.
Here:
<~.Igi ~.> _ H .
(4.8a)
<~.I~.>
~
J
(4.8b)
~J
0 .
~J
(4.2).
funct~onal
<~
m+
11 H1~m+1>
A
t h en
m+l
the roots of this problem E. "split" the roots of the m-th
~
49
every eigenfunction of
R,
exact energy from above and the overlap between our approximate
and the exact wave function tends to unity.
If there is a degeneracy
{~.}
1
is
{~}
in practice.
In general
{~.}
1
50
B. T. SUTCLIFFE
-1'
{~m(R)} ------2
'
and so on.
functions
~m
= E
i=l
c.~.
~ ~
and
-m
~
c.~.
1=1 ~ ~
~.
Generally of
+0,
51
52
5.
B. T. SUTCLIFFE
L:
cf
P P
q,
(5.1)
depends effectively in
~N
q, (r
q .. l
-- r ) q, "( r~ --- r )
..N
p,;c
-N
(5.2)
...
H.
space only, for the removal of spin from the integral is always
essentially trivial.
If we suppose for a moment that we can get numerical values
m
We would be able to
We have
some methods which will handle certain kinds of three and six
dimensional integration, but these are as yet only in their early
stages of development as techniques.
53
~p
It is this limitation
in (5.1) as an anti-symmetrical
HXl -- x..) = A
1'1
II
'l~~
(5.3)
~~
~=
orb~tal,
antisymmetrising operator.
With functions like (5.3) it is easy to show that our
integral (5.2) reduces to:
ij
~J
ijkl
~J
1/
r12Icpk1>
(5.4)
Now there
In
54
B.T.SUTCLIFFE
2l,
2l,
189 (1973),
other workers (see e.g. Daudey and Diner, Int. J. Quant. Chem. VI,
575 (1972). It is an area that could do with very much more work
since the requirement of analytical reducibility to one dimension
is a very strong one indeed and effectively rules out some interesting possibilities for orbital choices, for example many centre
orbitals or orbitals composed of numerical Hartree-Fock atomic
orbitals.
At the present time it seems as if lack of technique even
further confines us to orbitals of the form:
m
= ~~i
E crin r
r=l
(5.5)
' m~N
this view, but I simply wish to point out that in spite of the
successes of computational work in this field in the past few years,
there still remains an enormous problem of integral evaluation
that has to be tackled if we are to use functions that can do
better point by point than can gaussians.
For completeness it should perhaps at this stage be pointed
out that in the transcorrelated method one is not in fact forced
back to simple orbital product form, but to orbital product form
55
go~ng
to use
orb~tals
of the kind
~.
~ i $ j /11r12./ ~k ~ I>
(5.6)
The details of how this may best be done form another part of the
course, but it is clear that the optimum algorithm for such a
transformation is of order m5 operations for a complete transformation.
Indeed until
~J
56
B. T. SUTCLIFFE
then you can cut down the number of basic integrals required, for
because of the orthogonality restrictions,many vanish.
The very
worst case that can arise here is the so-called diagonal case
where the matrix element can be written as
<~IHI~> =
<Cp . cp 1
1
N
L
1/
N
L
i=l
'"
<cplhlcp> +!
1
L <cp.cp.1
i=p+l j=l 1 J
1/
p
I:
ij =1
N
L
1=1
j=p+l
r12Icp.cp.>
1
{<cpcp1
<cp. cp 1
1
1/
r12lcpcp>J
1/
(5.7)
57
We shall therefore
58
6.
B. T. SUTCLIFFE
The practice
We shall
x..) =
"N
{1r
(6.1)
that the sets {a} and {S} are the minimizing orbitals, but are
not orthonormal sets, though they are linearly independent sets
(if they were not
would vanish).
-f"
'" -+ '"
-'"
""'--
'" V'"
'"
= f"
-,-.
<:i:_al
r_a>
'I'
'I'
(6.2)
59
~.
1
If the
G in
~ .
1
the group
The
~.'
1
However if the
c.~.
.11
l=n
(6.3)
which is itself a
~.
60
B. T. SUTCLIFFE
If there is such
However it is not so
'l'p+~
= I
"
--- NS
(6.4)
= NS '
It
61
Jl.
_Ta
1xm m N
(6.5)
Cl
Ta
(6.6)
1xm DixN f3
where each column of T is a set of orbital coefficients.
Straight-
+ a term
Cl+B
where
(6.8)
(6.9)
(~ (!2)rs
~ (~\s = r Rut<nrntl
1/
tu
\zlnuns>
(6.10)
elements of hare h
= <n Ihln
" s >.
rs
r
The
---
---
(6.11)
where -n
I
is a unit matrix of dimension n by nand Srs
= <n r In s >.
a'
1.
-'
= cpa Ua
uB
(6.12)
1.
--
n Ta Ua
a'
=n T
--
(6.13)
62
B. T. SUTCLIFFE
(6.14)
.-
of, to show that the vectors forming the columns of T which have
'"
the required minimum properties can be obtained by iterative
(6.15)
where
(6.16)
with a similar problem for the ~.
The MO's obtained by solution of this problem are (in the
absence of degeneracy in the ~cx) uniquely defined to within a
phase factor (in the case of real orbitals to within a sign
factor), and constitute the canonical molecular orbitals of the
problem.
- -
space of the problem and hence any other solution vector may be
expanded in terms of them.
This invariance of the energy under a unitary transformation
also enables one to show, that
symmetry orbitals then
hF
i~
63
= - G ,
G
~
where
(6.17)
invariant as before.
,...
,...
nV.
--
y that reduces
0
.-
tv--
8'
.,.2
MJ.
- - --' -Vt 8 V
8'
S'
~ ~ ~ ~
=~
(6.18)
8'
-3
eigen-value equation as
(where ,....
hF may be either
or
hF
-a
hF V C'
,...
~ ~ Y..~
or
hF)
-13
(6.19)
and hence
or
-Vt -hF
h
F' C'
;{. ~'
(6.20)
64
B. T. SUTCLIFFE
--
FI
split the problem into symmetry blocks for solution at this stage.
The invariance assures us that this way we obtain exactly
the same results as we would have obtained if we had transformed
the integrals from the basis !l.. to the basis
.!l.!
and obtained
such that
..
:G
F'
C'
,...,
C' e: C'
,.....,,.,,..',,-
-1
= ,.,..,
Z
(6.21)
and La diagonal
matrix of eigen-values) and we can thus convert the pseudo eigenvalue problem into a proper problem very easily.
It should also be noticed that if you have a matrix c(i)
.......
""'"'
If the matrix
iterative converge.
forced into eigen-value form, and hence very easily solved using
a computer and also that as a consequence of the solution of this
problem, you obtain a set of orbitals which are naturally
orthonormal.
65
(the
unoccupied orbitals) and which may be used (though many would say
they should not be used) in improving the given function by
constructing extra configurations and performing a many configuration calculation.
However canonical form is not without its disadvantages,
since there are no theorems to show that an iterative process in
canonical form must converge and indeed it is easy (as we shall
see later) to construct examples where such a process definitely
does not converge.
If we do construct more determinants from an original
function, by replacing the occupied orbitals, one, two three and
so on at a time by orbitals from the unoccupied set (for any
given spin), then we know from Slater's rules that only determinants differing in two or fewer orbitals will have matrix elements
with the original determinant.
credited
52
though it is of
5z,
and
66
1.
B. T. SUTCLIFFE
i.e.
H!l' x --~)
;:z
....
--- ~
Nex
2.
Ms =
(N ex
N 8)
oS
3.
TI
'"
'" - Sf)
(S"'2 - S')/(S
Sf,S
Project out the required state from the general function and
then optimise the resulting projected function.
If the first procedure is adopted this leads to the open-
shell SCF problem, about which Dr. Vei11ard has spoken, except in
the case where S : Ms
By a happy accident
the optimisation.
67
It is sufficient here to
68
7.
B. T. SUTCLIFFE
To do this
For a
~')
= Nf
~*~i'
.!2 --
~) ~
(!.l'
---
is defined as
~)d~2
d~
(7.1)
(x
--
-1' -2
~.)dx
~
-t'l
= N (N -
---
1)
f ~* (!1'
!.21 --
~) ~
(7.2)
d~.
-i'l
Pl(~l'
Ell) and
P2 (:1' E2' ll" E21) are derived from these equations by integrating out the spin variables.
For a single determinent function it is very easy to show
that
+
PI (~ r') = PI (!J
where
PI
--
(.!J rl )= r
PI (!.!
t)
i=l
:.:)
+ PI
- (E.!
rl)
(7.3)
<p. (J)<P.*(!J
~
(7.4)
i=p+l
PI + (PI)
is the probability density (distribution) function for
69
The interpretation
+-
,P z
'P 2
-+
and P2
the first term describing the joint probability density for two
particles both with spin up the second term that for particle
one with spin up and two with spin down and so on.
Analysis of the single determinant wave function reveals
that we can write
(7.6)
(7.7)
++
70
B. T. SUTCLIFFE
wave function.
any really good wave function, instead of a form like (7.7) rather
a form like
(7.8)
where the "coulomb hole" function f+- described the details of
the correlation between the particle motions.
It is often useful
+-
properly.
+-
is
....r,
where L
= ! (!.l
+ 2) ~2
= ! (~ - 4)
~12'
~12'
(7.9)
This result
No one
knows quite what should happen when more than one pair come
simultaneously close together or when one pair come together near
a nucleus and so on.
~l
tends to
along
71
(7.10)
=0
+-
(El' Ez) to
1 detl a --- a
S
1
p p+l
IN'
NS
~J
-~
(7.11)
r.)
....J
~J
~,
3145 (1963)) is
(7.1Z)
This form is of course not the only one, nor can it be a
completely proper form, since it depends only on r lZ and not on
~
.lo,
r l - r Z'
72
B. T. SUTCLIFFE
From what we have said before (7.11) is clearly a nonstarter from a computational point of view, if we are going to
use the variation theorem and (7.12) directly.
(Boys in his
It is easy to see
possible practical
73
p'
(7.13)
~_)
(7.14)
~)
corresponding to
so it follows that
is closest to
if it is a product of
Anti-
74
B. T. SUTCLIFFE
= Pl -)
are "
1.
1.
The
Krauss was
It is
75
8.
~,
and
~
~
where~,
to our
as
X, and
(8.4)
A little simple re-arrangement of (8.3) gives
(8.5)
(8.6)
76
B. T. SUTCLIFFE
a bit of fiddling
(8.7a)
=1
=4
.7846
.279
0.028
.19
0.0.02
0.0002
=7
2.8 x 10- 3
2 x 10- 6
.99 but the term mUltiplying Bx does not in any real sense
become small until the 8 > 0.999999.
>
~)but
will depend on the operator and on the wave functions used in the
variation approach.
you put little pimples in the wave function far out from the
centres of interest (for example, by putting a gaussian with a
77
large exponent on a
cha~geless
Bas
~,
perturbation theory
It obviously follows at
Again,
78
B. T. SUTCLIFFE
Now we can expect these results only for the exact HartreeFock function, so that we have (hopefully) a complete set of
functions in which to describe X, and that we have also the
largest value possible for S.
Of course a
~.
is not optimal
Of course here
79
However we have a
Hellman-Feynmann theorem see e.g. Deb. Rev. Mod. Phys. 45, 22,
1973).
In passing, it should
80
B. T. SUTCLIFFE
one and Weinhold has shown (J. Chem. Phys. 46, 2448 1967) that a
better estimate is given by
(8.9)
where
Sl
~
but not
<1;1'1'>, S12
yet~.
= <~II;>,
81
0.89965.
exponents we get S
get S
0.993.
0.99.
0.95 but
~,
subject is a pretty venerable one and you will even find some
discussion of it in Pauling and Wilson's classic text (p. 189).
Let us begin with the older work on energy bounds, there is
a result by MacDonald (Phys. Rev. 46, 828, 1934) which asserts
I'
ex
[01 -
= a constant
ex)r~]*(<ii-ex)m-r~1
d,;
integer m.
~ r ~ m
(8.10)
82
B. T. SUTCLIFFE
will take the same value for all r, and is equivalent to the eigenvalue problem
'\
m
[ (H
- a)
(8.11)
andSI~12dr
if 01' = 0
m
"
spectrum of H.
(8.12)
where E
- a)m
- a)m.
< a <
Ii
= fC(H
- aH)* C(H -
a)~Jd~
= 2,
(8.13)
En+ 1)
=1
we get
(8.13)
and we see at once that
(8.14)
or
-(I')!
2
....< (En - a)
<
...
(I')~
2
(8.15)
83
,..
~(m)
~(l) = ~ ~
Hm~).
~,but
Thus if we
a1
It is easy to see the kind of trouble you can get into simply
by recognising that one of the terms in evaluating the expectation
value of HZ is going to be of the form
~
dr
(8.17)
and that
then of course such delta function terms do not arise, but if the
manifold is suitable then the two approaches must yield identical
results and from this you can see that it is not at all certain
that they do so.
84
B. T. SUTCLIFFE
are simply ignored and the latter formula is used to evaluate the
matrix elements.
22,
If we choose a
En ~ EW
= <H> - ~
n
where
(8.l9b)
~
from (8.15) that as expected the result is exact when the variance
vanishes.
Of course in this case our upper bound from (8.l5b) is
artificially high because we know that <H> alone is an upper bound,
and this fact might lead one to suspect that the lower bound is
also correspondingly slack, and indeed in cases where it can be
worked out, it turns out to be quite awful, for many, many
functions which give very good upper bounds.
and
85
<H> -
6.
(8.20)
(En+ l-<H
must know En + l , but one usually does not, so the formula is, to
all intents and purposes, quite useless. It could be useful in
theory since it holds for any lower bound to En +l , but of course
the problem is to get this bound and we are thus in a circular
situation.
Thus the situation with respect to putting bounds on the
calculated energy is at present pretty hopeless.
Admittedly they
Well you
86
B. T. SUTCLIFFE
good variational result but our reasons for hope are not at all
clear.
1\
H~/~,
~,
Also, if
87
9.
get out of the way some remarks which may easily sound to you as
being at once platidunious and rather offensive simultaneously.
It must always be remembered that for the most part so-called
experimental results are not really experimental results, but the
results of processing a set of primitive observations (meter
readings, time measurements and so on) according to a particular
theoretical model.
in the language of the way we talk about the particular result, and
there is always a difficulty in deciding whether the theory with
aid of which the results are derived is in fact the most appropriate
one to use from a numerical point of view.
Let me try and clarify this point by reference to a couple
of examples.
and bond angles of simple molecules what one gets often are
results that are obtained by processing observations of the microwave spectrum or from the rotation-vibration-spectrum in the infrared or perhaps even rotational fine structure in the electronic
spectrum.
rigid molecule approach outlined earlier and is for the most part
also based on the assumption of an almost-quadratic potential
function.
bond distances etc., that agree with experiment he has two courses
of action open to him.
in tune with the model and produce results in line with the
usually quoted experimental ones or he can build his own model,
perform his calculations in his own way and see if he can get
results in accord with the primitive observation.
of many similar examples like this where one has a hard and
developed normal model, atomic spectroscopy, many kinds of
magnetic experiments and so on.
well
88
B. T. SUTCLIFFE
Naturally
However
experimental result?
An even more acute situation like this arises when a
theoretician is asked to decide by calculation, some point which
is apparently an experimental point, but arises merely in the case
of developing a chemical explanation in terms of a model with
quantum theoretical words in it, but with no contact any longer
with quantum theory.
89
The experi-
point I feel.
But anyway let us reserve this last situation for another
time.
However to do so involves
90
B. T. SUTCLIFFE
I want to confine
About the
0.002 GHz.
The correction
or at most MHz.)
Obviously and very sensibly the first move here would be to
see if anything earth bound shows up in this kind of region.
One
91
McGraw Hill, 1955) or some other suitable book with a great table
of stuff in the back and hope for the best.
In these kind of
To
cut a long story short, what one finds there is that for example
H12CN has a line at about 88.63lGHz and H13CN has one at about
86.340GHz.
Anyway shortly
in fact the line was due to HCO+, Herzberg in fact suggested HNC
and somewhat later Barshun suggested CCH.
Letters 12,169, 1972).
(Astrophysical
89.l89GHz.
h 2 J(J + 1)
2r
J~
= hv = 2B(J
+ 1)
liE
(9.1)
BJ(J + 1)
J + 1
(9.2)
E r .. m.m. /Em.
~J
~ J i ~
1> j
92
B. T. SUTCLIFFE
course known.
It therefore seems sensible to see if we might be able at
least as a first go to try to calculate non-empirically the bond
lengths of the contenders and try for an assignment on the same
model.
we would have to abandon the rigid rotor model and assign on the
basis of an assymetric top model, but let us ignore that possibility for a moment.
~dth
c-o
Can we do this?
(For a very
251,
553, 1959 though its early interpretations by Pop1e and LonguetHiggins Mol. Phys.l, 372, 1958, and on through later work by
~,
~,
93
458, 1973).
I,
47, 1971).
In this kind of
(Again
there are questions here of what exactly is the bond length from
experiment, but of course what really matters to us at this stage
is how the simple rigid molecule model does against experiment in
a region where much experimental information is known and very,
very careful analysis possible).
94
B. T. SUTCLIFFE
but you will now be jumping up and down and shouting "but what
about LiH or NaH, or NaF?"
not fit my rule.
you can easily see if you look at the calculation of Cade and Huo
on the first row hydrides (J.C.P.
~,
However it is perhaps
triatomic HCN is a fantastically well studied species experimentally, and there are some extremely careful HF calculations on
it by McLean and Yoshimine (Int. J. Quant. Chem. 15, 313, 1967).
They obtain for the CH distance .1024 nm and for the CN distance
0.1116 nm.
It is quite interesting to
= 0.1063
nm,
95
c.r.
simply not enough experience to say how well one does in general
with say all single and double excitation
c.r.
from an HF start-
cr
c.r.
in the case of HCN the same basis used by Roos et al for the SCF
calculation, yielded, on all single and double excitation
cr
We must
would in fact have reduced the bond lengths by more than that
error, but perhaps in discussion here Dr. Roos can help us more.
At any rate the point here is that this is the best that we can
currently do and we must, if we wish to manage this kind of
experiment, use it.
Roos et al in fact performed calculations on HNC as well as
HCN and were able to get bond lengths for HNC of C-N= 0.1169 nm
and N-H of 0.0995 nm.
c-o
96
B. T. SUTCLIFFE
<lJ!A! r Z. e
i=l 1
m.
1
.....
(Other
action of the field with spin and other more complicated operators.
For example, spin induced transitions are observed in the microwave
region, but generally only in the presence of static magnetic
fields as in ESR spectroscopy).
97
~A
to
~B
how to go on from our earlier discussions if we are in the nearlyrigid-molecule region or if we are dealing with a diatomic molecule.
If we have explored (as we have) the energy surface for the
molecule in question, then we shall know whether the rigid
molecule model is a starter or not.
sure that for the range of interest then we can develop whole
molecule solutions in the form of the Born Adiabatic expansion
~
= !:
n
X (R) ~ (r, R)
n....
n ...
In fact very
little has been done at this level except for diatomic molecules
and then mostly at the HF level for the electronic parts.
Essentially the most has been done in diatomics is to expand the
calculated potential about the equilibrium internuclear distance
98
B. T. SUTCLIFFE
~.
cit.)
vJ
= we (v
= Be
Bv
- a (v +
e
!)
+ v (v + !)2
e
likely that one could, if one took the trouble, with CI wave
functions come to within a few percent for a.
e
The important
one could make the Born-Oppenheimer approximation, do a Dunhamtype analysis at the same level as for diatomics, but here of
course the whole situation is much more complicated, and even in
99
account of the relationship see Nielsen Rev. Mol. Phys. 23, 90,
1951, esp. pp 111-113).
work has been done on this problem and we cannot say how likely
we are to get agreement with experiment.
~,
To illustrate
= Be
- a /2 - a - a /2 - D
I
2
3
In this case
al
-0.582 GHz.
100
B.T. SUTCLIFFE
I could have
As you
will see from that review, there has been surprisingly little
straight CI work on this kind of thing.
If I had chosen to talk about the vibrational-rotational
structure of electronic transitions or indeed even of their gross
structure, I can only say my talk would have been much shorter.
The problem of dealing with excited electronic states at a
reasonable level of accuracy is simply a largely unsolved one at
the minute.
It seems to be
101
2,
99, 1970).
~,
I am
As I
In this
!, ..'
102
B.T. SUTCLIFFE
Jones one may use the method but only if one has very very good
wave functions and they cite as examples Pekeris' helium function,
and Sims' and Hagstrom's beryllium (Phys. Rev. 4!, 968, 1971) and
both of these functions are replete with r 12 's. What kind of
corrections might however we expect for the energy in atoms?
Walker in a careful correction of Clementi and Hartman's results
(J. Phys.
what these results mean is in dispute, but from them we can perhaps
get a feeling for what kind of shift we might expect in energy
levels with relativistic corrections.
In helium
(a
= e l /2hoc)
103
terms are about equally important, but then helium is a very light
atom.
104
B. T. SUTCLIFFE
understand these phenomena with what we have and that we can also
do well at what might be called the semi-quantitative level, in
fixed nucleus calculations, but the field really needs a very
brave soul to look at these effects in high accuracy calculations
on simple molecules.
I trust that this does not sound a too depressing ending to
this lecture course.
105
1.
INTRODUCTION
The present series of lectures is aimed to introduce scientists into the fundamentals of computer hardand software, in particular in relation to quantum chemical calculations.
In the early days of "electronic computers" it
has been quite normal, and even absolutely necessary,
for every user to have quite a detailed knowledge of
the computer he was working on, of its machine language,
the tricks, the possibilities offered, and the limitations imposed. With the advent of modern large, high
speed electronic computers, and the development of
powerful high level programming languages, both becoming available to a wide variety of users rather than
to a small group of specialists, the question came up
of how much knowledge of computer hard- and software
should be necessary and should be expected of a user
in order to write a program and to run it successfully.
The answer largely depends on the problem the user is
working on:
It seems to be well agreed in this context that
computers, and in particular the system software,
should be designed in such a way as to allow any user to
solve "simple" problems with a minimum knowledge of the
computer and its hard- and software features. The corre*)This contribution is based on the lectures presented
by one of us (GHFD).
Diercksen et al. (eds.), Computational Techniques in Quantum Chemistry and Molecular Physics, 107-199.
All Rights Reserved. Copyright 1975 by D. Reidel Publishing Company, Dordrecht-Holland.
108
109
110
III
TIME
COMPUTER
I.
2.
3.
f---~~=:J----m~---{:::::J--~~
B i---
r----<D.I~c::::Jr=--~=~:::J------<ZII{~=:Jm--_CJZll{:=:::p--_c:J_____I
1-----fJ!~I__---__!_~_~---_lWLl
~WA_____I
A f---~==:::J------~~~----{~==J-____I
B~~~~~----~~~~--~~~
- - WORK MODE
---- WORK MODE IOVERHEAD)
-c::r SELECT MODE
~ DELAY MODE
Fig.
2.1
112
113
114
115
With these problems we reach already the discussion about the configuration of a computer installation,
namely the determination of the populations of the resources to be attached to a processor in order to
achieve the necessary performance and reliability.
To decide on those questions during the stage of designing a computer installation extensive and careful
studies have to be performed, analysing the expected
delays due to resource contention and the influence
on the productivity of the computer system. The same
applies to the design of a computer itself.
For our further discussion we redefine our present
model of the computer and we assume now that it consists
of a number of specialized units: the processor performing all the program logic and arithmetic operations,
the memory for storing the informations, the channels
and control units performing and controlling the input and output operations, and the input/output devices
as there are for example card reader, line printer,
card punch, and magnetic tape units. We are then concerned with providing a sufficient number of the various
resources to the processor in order to meet the expected needs of the system performing its work load and
with the distribution of the resources among different
processors if there is more than one in our computer
system. But even only one processor may compete with
itself for access to a resource: Assume that two tape
drives are attached to the processor via one control
unit and one channel. Then it is impossible for the
processor to gain access to both tape units simultaneously because control unit and channel can only serve one
device at a time. This contention can only be released,
if we provide a control unit and a channel to each tape
drive.
In what is normally understood to be a generalized
multiprocessor system there is a very high degree of
resource sharing, including the sharing of all primary~
directly addressable memory. Each processor has full
addressability to all locations. In the sharing of
memory the concept of private ownership is entirely
abandonned, and the assignement of space to a processor is normally dynamic from a pool of available locations. The situation is slightly different in case of
the input/output channels, and each processor has separate channels assigned to it. But in various systems
the possibility exists that one processor can request
116
117
118
su
STORAGE UNIT(SU)
CENTRAL PROCESSOR UNIT (CPU)
CHANNEL (CH )
CONTROL UN IT (CU)
DEVICE (0)
Fig.
2.2
119
Units down the tree tend to be highly specialized, capable of performing reasonably sophisticated functions
in their area with high speed. But they are useless
outside this area. The intelligence of each unit is
limited by its special function it has to perform within the total system. In a hierarchical system there is
a pyramid-like distribution of general capability.
Each parent unit does an interpretation of functions to determine to which unit of the tree it relates and passes it on to that unit for further interpretation or for execution. For example, the control
processor interprets an order to perform input/output
operations on a given channel and determines to what
channel to send the information necessary to perform
the operation. The channel receives relevant control
data, independently accesses any additionally required
data, and sends whatever is needed to a control unit
to which it initiates contact. The control unit addresses the device.
3. THE FUNCTIONS OF A PROCESSOR.
After having outlined the basic principles concerning the design of a computer of the present generation, we have to discuss now in more detail the
functions of the processor itself.
The first step to increase the efficiency of the
processor is to separate the performance of the arithmetic operations from all input/output operations and
to have the input/output operations controlled by
special functional units. This possibility has already been recognized very early because of the different execution times of logical and arithmetic operations compared to input/output operations. But this
separation has no real advantage for a number of jobs
with massive data manipulations and relatively little
input/output operations, as for example for calculations on atomic and molecular structures. To improve
the performance in this situation one has to introduce more parallel capability in the processor itself.
Parallelism is obtained here on the instruction level
or below, and the user of the system remains partly
unaware of the asynchronity and the overlap achieved
by the different components of the processor. To take
proper advantage of it is normally left to the hardware design and to the compiler, translating the source statements of a higher level language into a machine
120
121
122
Jsu
SIR
IU
~
: REGISTER
EU
Fig.
3.1
123
Table 3.2
Characteristic timings for the different steps (tasks)
that are necessary to perform an 'ADD'-operation.
time
step
1.
50 ns
2.
375 ns
3.
50 ns
225 ns
4./5.
6.
375 ns
7.
50 ns
8.
200 ns
9.
50 ns
'I-cycle'
'E-cycle'
update IC
fetch instruction
send OP-code to
register
decode and send
operand address to
address register
receive signal
fetch operand
send operand to
adder
ADD
send result to
register
I
Total time for I-cycle: 700 ns
Total time for E-cycle: 900 ns
124
racteristically on the question whether the instruction requires a memory fetch as well as on the complexity of the instruction itself.
For our further discussion it is convenient to
characterize the performance of a processor in terms
of millions of instructions per second (MIPS). Since
the time it takes to execute an instruction is variable,
MIPS-rates cannot be accurate criteria for the performance. We are interested in the MIPS-rates of the
I- and E-units. From our example (of table 3.2) the
following MIPS-characteristica can be obtained:
1.4 MIPS can pass through the I-unit and 1.1 MIPS
through the E-unit, assuming for simplicity the average
execution time to be~ual to that of the ADD operation,
and assuming that approximately 2.000.000 words of 36
bit length can be delivered from the S-unit (equal
number of reads and writes).
As a consequence of our initial assumption that
the I- and E-units have separate cycles following each
other it happens that the I-unit is idling for 675 ns
while the E-unit executes the instruction. This reduces the delivery rate of the I-unit from the highest
possible rate of 1.4 MIPS to approximately 0.727 MIPS,
i.e. to a usage of only 51 % of the total time. This
has of course the further implication that the E-unit
receives much less instructions than has been assumed
for its optimum performance rate and that therefore
the E-unit itself is also idling for about 35 % of
the total time.
The maximum throughput of this processor is obviously equal to the throughput of its slowest resource, the E-unit, which is able to deliver 1.1 MIPS.
To approach this limit one could try to arrange the
I- and E-units in an asynchroneous and overlapping
set up so that the I-cycle of an instruction is overlapping with the E-cycle of the preceeding one. Unfortunately this set up gives, however, raise to some
difficulties: (1) The time an instruction spends in
the E-unit depends on its complexity. Different instructions spend a different time in the E-unit. If the processor enters a string of more complex operations, the
delivery will be delayed for a longer time, and less
instructions per second can be performed. This relates immediately to one of the difficulties in evaluating the performance of a machine. The performance
strongly depends on the "mixture" of operations to be
executed, in particular on their types, the number of
125
126
simultaneously.
There are two possibilities to arrange consecutive storage locations into the different units, sometimes called banks in the CDC terminology: high bit
overlapping and low bit overlapping, both approaching
each other in the limit of an infinite number of units.
In high bit overlapping those storage locations are collected in the same unit, which have the same high bit
pattern of the address. For example, if there are two
units, each with 128k bytes, then byte 0 to (128k-1)
would be positioned in unit 1, and bytes 128k to
(256-1) in unit 2. In low bit overlapping those storage locations are collected in the same unit, which
have the same low bit address pattern. Assuming again
2 boxes, all storage locations with a 0 in the last
address bit would be put into unit 1, and all with a
1 in the last address bit into unit 2. Thus consecutive
storage locations would be put in different units. This
form of arranging storage locations in different units
is usually called interleaving. In some present day
machines (for example UNIVAC 1108, and IBM 360/75 between them), both techniques are provided. Which of the
two techniques has more advantages for avoiding contention depends on many factors, as for example average
program size, levels of multiprograming, and arrangement of data.
IU
EU
Fig.
3.2
: REGISTER
BUFFER
127
In the machine developed up to now, which is displayed in fig 3.2, the 1- and E-units are roughly balaced. The rate with which this machine can process
instructions is limited by the rate of delivering of
instructions by the I-unit. To speed up the machine
without speeding up the electronic circuity, we will
analyse in more detail the functions of the I-unit.
Among the 700 ns of I-cycle time, 375 ns are spent in
the interface between the 1- and S-unit. This process
of instruction fetching consists of: (1) the delivery
of an address by the I-unit to the SIR, which we can
assume to take 50 ns; (2) the fetching of the "word"
from the S-unit into the SIR, estimated to take 275 ns;
and (3) the delivery of the "word" from the SIR to the
I-unit, assumed to take another 50 ns. While during
this time intervall the S-unit is fully busy, the
I-unit is idling for the time needed to deliver the
instruction word from the S-unit to the SIR. That
means, that the I-unit is actually busy only for 100 ns
within the time intervall of 375 ns. The question is
now which modifications have to be introduced to avoid
this delay. At the beginning of this paragraph we have
stated that the performance of the machine is limited
by the instruction delivery rate of the I-unit. But
now we see that the limiting factor is not the I-unit
itself, but the dependence on the delivery rate of
information from the S-unit. If the storage would respond instantaneously, that means if it would have a
zero delivery time, then the processing time of each
instruction in the I-unit would be reduced to 425 ns
and its basic delivery rate would be increased to
2.35 MIPS. This situation is typical for large-scale
machines which are indeed memory bound/rather than
processor bound.
The other dominating step among the five tasks
performed in the I-cycle is the one in which the instructions are decoded with an acutal decoding time
of 225 ns per instruction (see table 3.2). Apart from
the wait-for-delivery time the rest of the tasks takes
only about 200 ns. This part of the I-cycle could possibly be speeded up if the remaining tasks could be
arranged such that they are performed in parallel to
the decode operation. But before we are going to consider this possibility we turn back to the instruction
fetching step and we try to reorganize the relation
between the 1- and the S-unit in order to speed up
the corresponding interface.
128
129
130
131
132
133
branches) operations. Therefore we can assign the execution of these instructions to the I-unit. This will
relieve the E-unit of work, but it does not yet allow
parallel processing of the I- and E-unit: The same
op-code register is still used by both units, and
while the I-unit is processing an instruction, the
E-unit has to wait and vice versa. This situation
can be improved if we supply a second op-code register:
one is used for instructions executed in the I-unit,
and the other one for instructions to be executed in
the E-unit. This set up would now allow parallel processing within some limits, but no look ahead techniques. Once the I-unit is working on one instruction,
and finds another to be executed in the I-unit as well,
no instruction can be forwarded to the E-unit, which
might be free for processing some instructions independent of those executed in or waiting for the I-unit.
To get around this difficulty we have to introduce an
instruction stack for operations to be executed by the
I-unit. This allows the I-unit to advance, to locate
E-type instructions, and to forward them to the E-unit
for processing if possible. Now, the I-unit, is
able to evaluate addresses of operands needed in E-type
instructions, and to initiate the necessary LOAD instructions for the operands into local storage of the E-units.
We transfer this task to the I-unit completely which
will allow us to use the local storage of the E-units
sensibly, and fill it by a look ahead technique.
At this point it is easy to extend the machine
concept further and to include the possibility of
address modification (indexing). The necessary capability and the necessary registers haveonly to be included into the I-unit, because this is the only place
where addresses need to be evaluated.
We have now reduced the work load on the E-unit
in two ways: We have reduced the number of instructions to be processed (its population) and we have
provided a local buffer, sensibly filled by the I-unit
using some look ahead technique operating on a 50 ns
cycle. The average instruction time of the E-unit is
thus 325 ns faster than in our previous set up. This
has been achieved without reducing the delivery rate
of the E-unit, but by introducing some additional hardware resources.
134
I I
I Isu
SIR
BSU
Jrn
---
~
XR
IU
,~----'--T----:-~(
L
LSMTU
1=
I=='=l
I r-------t--'------.=ErrU I '----'--'.E'""UI
Fig.
3.3
EU2
135
This set up includes most present day design characteristics of actual machines. Each of the two processors
has extended local store for various buffering purposes
and is actually a highly specialized subprocessor by
its own. The machine design allows to include any number of subprocessors for special purposes by defining
appropriate instruction classes. Traditionally these
have been specialized units for arithmetic operations.
This machine has an extensive capability for look ahead
and parallel processing, but special care has to be
taken to preserve the principle of the logical sequence
of instruction processing.
To study the performance of instructions in this
final set up of our pseudomachine we consider as a
simple example the sequence of instructions listed
below:
LOAD
MPLY
STRE
AR1 ,MEM1
AR1,MEM2,AR2
AR2,MEM3
Following the concept used in IBM machines it is assumed that each instruction set consists only of two
types of instructions: register-register and registermemory instructions.
Initially the I-unit brings the LOAD instruction
to the predecoder, discovers it to be of the class
which is executed by the LSMT-unit, sends it to the
LSMT unit and places it into its instruction stack.
Simultaneously with the processing (decoding) of the
LOAD by the LSMT unit, the I-unit brings the MPLY to
the decoder, and discovers that it is to be executed
by the E-unit. However, it contains a memory reference
that has to be processed by the LSMT unit.
There is now a fundamental difference between these
two instructions to be handled in the LSMT-unit: In the
LOAD instruction the source and destination is explicitly specified. The operand will be sent directly
from memory to the arithmetic register AR1. To process
the MPLY instruction we have to remember that the E-unit
expects all operands either to be in an addressable register, or in an operand buffer (local storage). Therefore, the predecoder must decompose the MPLY instruction into two instructions, one for the LSMT-unit and
one for the E-unit. The instruction sent to the LSMT
unit is to load an operand from the memory into the
136
AR1,MEM1
EBUF,MEM2
AR1,EBUF,AR2
AR2,MEM3
137
tiny ship, there is an enormous advantage of cost reduction in building identical parts. Therefore one
might expect that the design of future machines will
tend to introduce more identical units, rather than
functionally specialized units.
To conclude this chapter we will finally scetch
the basic design concepts of two high performance computer systems which are currently in use, the CDC 6600,
and the IBM 360/91.
The CDC 6600 has a control unit and 10 specialized
functional units: a branch unit, a Boolean unit, a
shift unit, a floating point add/subtract unit, two
floating point- and fixed multiply units, a divide unit,
two short-word fixed-increment units (used also for
indexing), and a fixed-point add/subtract unit. The
general structure of the machine is displayed in fig.
3.4.
STORAGE
CONTROL UNIT
BRANCH
UNIT
ENCREMENT
UNITS
BOOLEAN
UNIT
DEVIDE
UNIT
SHIFT
UNIT
FLOATINGP FIXP
ADD/SUBTRACT
UNIT
FLOATINGP/FIXP
MULTIPLY UNIT
Fig.
3.4
138
139
1Ie rl
------
---
INSTRUCTION
UNIT
INSTRUCTION
S - CONFLICT
STORE ADDR.
STORE DATA
8 (8)
8 (4)
8.(3)
8.(3)
:
.:t
----
----
-(16)
FIXED POINT
OPERATION
UNIT
I
-----
STORAGE UNIT
----
--
FLOATING POINT
OPERATION
UNIT
Fig.
3.5
140
141
its actual configuration. We have seen that the processor of most present day computers has some flexibility to schedule logically independent instructions
out of sequence or to avoid resource contention and
execution delay.
To some extent, there is a further agent which
is at least partly responsible for instruction scheduling: the high level language programmer. He has
obviously much less influence on the instruction scheduling than the assembly language programmer (who has
practically full control over it), in particular if
the compiler has an effective optimization phase. But
nevertheless, the programmer can write his high level
language statements in a way to support the task of
the optimization phase because there are decissions
which a compiler logically cannot make and which thus
have to be made by the programmer himself, e.g., to
decide if a variable has to be stored in the S-unit,
or if it is just an intermediate result used only once
or a few times within a limited sequence of statements,
only. In the latter case it would be sufficient and
preferable to keep the variable temporarily in a register or buffer. There are also optimization decissions
which classically are not thought to be tasks of the
optimization phase of a compiler. It is, for example,
well known that matrix arithmetic can be speeded up
by using such a code that matrices are written in
vector form avoiding time consuming index manipulations.
All the agents have the advantage that they "know"
a good deal about the design concept of the machine on
which the program is running, and thus can produce a
schedule of the instructions within the limits of logical independency that makes optimum use of the machine's hardware and organisational facilities. The
information most important for instruction scheduling
on all levels are: the instruction interdependencies
(program logic), the type and number of resources
available, and the instruction (task) execution timings.
Unfortunately the execution of the S-unit references
is largely unknown thus entering an un.predictability
into the instruction scheduling.
As an illustration we define a pseudomachine and
analyse how it executes codings with different instruction sequences of the same arithmetic sample equations:
142
It would be too complicated to analyse the execution of the sample codings on a pseudomachine which
has all the advanced features characteristic for the
contemporary large scale computers as they have been
discussed previously, in particular the look ahead
and bypass techniques, which in some respect are just
introduced to smoothen out the effect of non-optimum
instruction scheduling. Therefore we restrict the
present discussion to a much simpler pseudomachine
that has the basic features of present machines just
to give a realistic picture of the impact of instruction scheduling on the code execution.
The pseudomachine we will use is defined by the
following resources and characteristic features:
Storage unit, storage interface register, instruction
unit, two fetch/store units, add/subtract unit,
multiply unit, de vide unit, and seven registers.
This set up is shown in fig. 4.1.
Fig.
4.1
143
Table 4.1
INSTRUCTION
Pseudomachine instruction
execution- and unit busy times
FUNCTIONAL
UNIT
INSTRUCTION
EXECUTION TIME
(CYCLES)
UNIT BUSY
TIME
(CYCLES)
FETCH/STORE
FSU
SIR
4
5
ADD/SUBTRACT
ASU
MULTIPLY
MU
10
11
DEVIDE
DU
29
30
144
145
This rather simple pseudomachine includes a number of features of present day computers (especially
of the CDC 6600) most important for the understanding
of the performance of such machines. Other features,
designed to decrease the influence of resource contention (look ahead and bypass techniques) as well as
storage interleaving (instruction and operand fetching)
have not been included into this model. Although vital
for the performance of an individual machine, they are
not important for discussing optimum instruction scheduling.
We will now consider a simple mathematical operation, and see how it is executed in the machine depending on the actual coding:
x = A-B/C
Y
(4.1 a)
(4.1 b)
X+D-E+F*G
(4.10)
Y-X+D-E+F*G (4.1b)
Fig.
4.2
14
15
13
1
2
3
4
5
6
7
8
9
10
11
12
FETCH
FETCH
FETCH
DIVIDE
SUBTRACT
STORE
FETCH
ADD
FETCH
SUBTRACT
FETCH
FETCH
MULTIPLY
ADD
STORE
OPERATION
TABLE 4.2
R1 ,
R2,
R3,
R2,
R1 ,
R6,
R2,
R6,
R2,
R1,
R2,
R3,
R2,
R1 ,
R6,
B
C
R3,
R2,
X
D
R2,
E
R2,
F
G
R3,
R2,
Y
OPERANDS
B/C
A-B/C
X == A-B/C
X+D
X+D-E
F*G
X+D-E+F*G
Y == X+D-E+F*G
R2
R6
R1
R1
R2
R6
COMMENT(S)
9
9
9
30
5
9
9
5
9
5
9
9
11
5
9
INSTR.TIME
(CYCLES)
9
18
27
57
62
71
80
85
94
99
108
117
128
133
142
TOTAL TIME
(CYCLES)
'"
:><:
.,'"
:-0
'Tj
t""
:><:
en
'"
.,zz
"::E
.,z
"t;:;
:-rJ
"';:r:"
0
m
m
0\
.j::.
147
FUNCTIONAL UNITS
10
20
FETCH UNIT I
C
F
B
E
,
3
AOO/SUBTRACT UNIT
MULTIPLY UNIT
OEVIOE UNIT
30
60 T(CYCLES)
50
'0
G
I
I
F*G
B/c
Fig.
4.3
I'
1 A-B/c
2 1-0
3 2-E
3-F*G
148
___
r-------a
l1li
K
K
j'
f'
K
Fig.
~f~
~~---;
--E3
>------E3
r---E3
610
CD
~/r
K,
,5,
K
K
,K,
K{S,S,
t------El
-.
K/S
CD
IIII------E3
!II
7p
K,
f-----E3 >------E3
t------El
r-------a
.......
l1li
rm
>------E3
4.4
K!S
r-------a
-.
l1li
------.a
f--<
t------El
>---- ~
,K/S ,5,
>------E3
--E3
510
t------El:
t------El
..
l1li
8---B
:@>---- f
01
. . (~)
--.e-------<
_ _=~----1I---____1~p
,5,
,5,
I------E3
D
9? T(CYCLES)
f-------l
>--E
8p
+>
'D
tTl
'"
...,
~
o."
en
;J>
'IT'"
';J>"
::r:
tTl
c::
...,
."
o;,::
(j
.."
ten
;;:
tTl
;,::
o;J>
c::
."
150
x
y
(4.2a)
(4.2b)
B/C - A
F*G + D-E + X
+
-X-B/C-A
(4.20)
y= F*G+O-E+X (4.2b)
-B
-A
Fig.
4.5
ASU
R6,R7,R7
R7,Y
ADD
STRE
FUI
ASU
ASU
FUI
ASU
FU2
R7,R5,R7
ADD
FU2
R2 ,R6 ,R6
Rl,E
FETCH
MU
R6,X
R3 ,Rl. ,R7
MPLY
FUI
STRE
R5,D
FETCH
DU
SUB
Rl ,R2 ,R6
DIV
FU2
R2,A
RI.,G
FETCH
FUI
R7,Rl,R7
R3,F
FETCH
FU2
SUB
R2,C
FETCH
FUI
FETCH
Rl,B
FUNCTIONAL
UNIT
FETCH
INSTRUCTION
I!!I
I!!I
E3
i!!I
I!!I
II!!
E3
E3
II!!I
II!!I
II!!!I
E3
10
E3
Fig.
4.6
I!!I
i!!I
~---E3
20
I!!I
E3
30
--I
II!!I
40
50
T(CYCLES)
'Tj
Vl
tTl
;>:l
'Tj
en
"o
)'
;..
;>:l
;..
::I:
;>:l
t;j
'"c::
s:
(")
o'Tj
t-<
en
s:tTl
~
~
152
153
154
Table 5.1
INFIX NOTATION:
X=A-B+C-D+E-F
( 5.1)
R1AB-=
R2R1C+=
R3R2D-=
R4R3E+=
R5R4F- =
155
* *'
It is easily realized that this rule decodes the current infix expression correctly: Operations with operators of higher priority are performed prior to operations with operators of lower priority, and operations
with operators of equal priority are performed from
left to right.
XA
+
=
XAB-C
&
&
+
=
&
&
&
XAB-
XAB
XA
-1
0
-1
1
0
-1
-1
-1
-1
&
-1
&
-1
X
X
&
-D+E-F
C-D+E-F
+C-D+E-F
B+C-D+E-F
-B+C-D+E-F
X=A-B+C-D+E-F
=A-B+C-D+E-F
A-B+C-D+E-F
INPUT STRING
(INFIX FORM)
OUTPUT STRING
(POSTFIX FORM)
Table 5.2
(5.1)
s:t7j
t7j
;..
;;<:
:-0
C'l
;..
C'l
t""'
."
:E
tl
;..
'"ztTl
;;<:
tTl
S2
p::
tl
tTl
tTl
C'l
0\
Ul
&
XAB-C+D-E+F-=
&
&
&
XAB-C+D-E+F-
XAB-C+D-E+F
XAB-C+D-E+
&
+
=
XAB-C+D-E
&
+
=
&
&
XAB-C+D-
XAB-C+D
XAB-C+
-1
-1
-1
-1
-F
E-F
+E-F
D+E-F
"'l
-....l
Vl
'm"
...,
~
'""'l
I:)
::c
>
'"
Ii'
'>"
...,c::
is::
(")
"'l
'"0
>
t"'"
...,Z
>
is::
zI:)
c::
158
x.
B-C+D-E+F-=
A
X
&
SUB A,B,Rl
ADD Rl,C,R2
X
&
R2
X
&
Rl
SUB A,B,Rl
X
&
Rl
SUB A,B,Rl
B
A
X
&
D-E+F-=
+D-E+F-=
C+D-E+F-=
-C+D-E+F-=
XAB-C+D-E+F-=
AB-C+D-E+F-=
&
X
&
INPUT STRING
(POSTFIX FORM)
OUTPUT LIST
(GENERATED CODE)
Table 5.3
\0
Vl
co
;0
[/)
u
o
9
:>
z
;<l
:>
:c
;<l
co
::::
c""
>-l
()
[/)
I""'
:>
z>-l
::::
co
:>
SUB A,B,R1
ADD R1,C,R2
SUB R2,D,R3
ADD R3,E,R4
SUB R4,F,R5
SUB A,B,R1
ADD R1,C,R2
SUB R2,D,R3
ADD R3,E,R4
SUB A,B,R1
ADD R1,C,R2
SUB R2,D,R3
ADD R3,E,R4
R5
X
&
X
&
--
R4
F-=
+F-=
E+F-=
-E+F-=
R4
X
&
X
&
R3
SUB A,B,R1
ADD R1,C,R2
SUB R2,D,R3
X
&
R3
X
&
D
R2
SUB A,B,R1
ADD R1,C,R2
SUB R2,D,R3
SUB A,B,R1
ADD R1,C,R2
>
m
'"
:::
m
'"
:<'
C'l
>
C'l
'T1
t""
::;::
t:l
>
z
CIl
'"
t:l
t:l
;t
'"
m
m
C'l
0\
SUB A,B,R1
ADD R1,C,R2
SUB R2,D,R3
ADD R3,E,R4
SUB R4,R,R5
STffi R5,X
0\
tTl
;<'
o
'"'"r]
zt:l
:>-
I?
;<'
:>-
:I:
;<'
tTl
'"~
s:::
(')
;;;l
t;
o'"r]
tTl
s:::
:>-
t:l
'"r]
162
-AB
1
2
3
1
2
3
1
2
3
1
2
3
D
2
-AB
1
2
X
&
-AB
+1C
-2D
+3E
X
&
+1C
-2D
-AB
X
&
E
3
-AB
+1C
-2D
X
&
X
&
+1C
-2D
+1C
X
&
+1C
-AB
1
2
X
&
C
1
-AB
&
=
&
&
&
-1
-1
-1
1
0
-1
-1
-1
-1
&
&
&
-F
E-F
+E-F
D+E-F
-D+E-F
)0
>
ttl
s=
ttl
)0
:-=
C'l
>
z
C'l
"':I
t-<
t;)
>
z
'"zttl
)0
!2
ttl
;t
)0
t;)
ttl
ttl
C'l
0\
11)
r:QOQII.II.
~,-INr-<\.=::r
165
166
x
X-A-B+C-D+E-F (5.1)
Fig.
5.1
x
X =((A -B)+(C -O))+(E -F) (5.10)
Fig.
5.2
167
((A-B) + (C-D
+ (E-F)
(5.1a)
which is equivalent to expression (5.1) from the mathematical point of view and can be expected to be also
equivalent with respect to the numerical accuracy
(rounding errors!) on most computers. In this expression
brackets have been used to define the order of execution.
There are several techniques for compiling such expressions. As an illustration a simple method is chosen here
which consists in taking care of the bracket structure
by increasing the hierarchy of the operator by a multiple of the number of left (open) brackets to its left
hand side. The multiple must be qual to the highest
hierarchy value assigned to an operator.
The generation of an intermediate language list
for the expression (5.1a) according to this procedure
is demonstrated in table 5.5. The resulting instruction sequence given at the end of the table can be represented by the tree diagram of fi~ 5.2. This shows
that bracket structures are a possibility to enforce
an improved instruction scheduling. Unfortunately, there
is a tendency in designing modern compilers to ignore
bracket structures wherever possible.
We are therefore interested in compilers and their
optimizing phases, which directly generate instruction
sequences from any given high level language expression,
that can be represented by low and broad tree-diagrams,
to make best use of the parallel capabilities of the
computer system. An analyses of the process of code
generation for the present example, expression (5.1),
shows, that this can be achieved in two ways (or by a
combination of them): (a) to delay the unstacking of
every second operator of equal hierarchy, or (b) to
delay the unstacking of equal hierarchy operators, if
the operands are results of previous operations. This
ensures a minimum dependency of successive instructions
on each other.
First we implement restriction (a) to our basic
unstacking rule, and apply the method then to the direct code generation from the sample expression (5.1).
Re 1: A switch (SWITCH1) is introduced to control unstacking. Initially the switch is set OFF. It is
turned ON, if an operator unstacks an operator
of equal hierarchy, and it is turned OFF if an
-AB
&
A
X
&
1
X
&
+
=
&
B
A
X
&
&
&
A
X
&
&
&
&
&
&
&
-1
-1
-1
3
0
-1
-1
-1
-1
(C-D+(E-F)
+(C-D+(E-F)
B)+(C-D+(E-F)
-B)+(C-D+(E-F)
((A-B)+(C-D+(E-F)
=((A-B)+(C-D+(E-F)
X=((A-B)+(C-D+(E-F)
INPUT LIST
OPERATOR HOLD STACK
OPERATOR HIERARCHY (INFIX EXPRESSION IN
HIGH LEVEL LANGUAGE)
VALUE
OUTPUT LIST
OPERAND HOLD STACK
(INSTRUCTION TRIPLES)
Table 5.5
'"
is:
m
.,m'"
co
c:l
c:l
'TI
t"'"
.,z
en
(')
~
'"
.,zz
"@
:-n
0
:I:
'"
m
m
c:l
0\
00
-AB
-AB
1
2
3
1
2
3
1
2
3
-AB
-CD
+12
-AB
-CD
+12
-AB
-CD
+12
-AB
E
3
&
&
=
&
&
&
&
&
&
&
&
D
C
1
&
C
1
&
C
1
-1
2
1
-1
-1
-1
3
2
-1
3
2
-1
F)
-F)
(E-F)
+(E-F)
D+(E-F)
-D+(E-F)
>-xl
\0
0\
CT1
>-xl
...,
~:>::l
en
0
;.-
:>::l
;.-
::c
:>::l
ttj
""...,c::
s::
("l
>-xl
;.ten
0
s::ttj
z
...,
;.-
c::
1
2
3
4
1
2
3
&
+34
-EF
+12
-CD
-AB
X
&
F
E
+12
-CD
-AB
&
&
-1
0
-1
2
1
i"i
i"i
;:e
i"i
a::
i"i
;.-
;>::
;:e
Cl
z
:-c
:>
Cl
>Tj
~
t'""'
:>
i"i
CI'l
;>::
;:e
t;;
;r:
;:e
Cl
-J
171
(5.1 b)
-AB
&
X
=
&
A
X
t"r1
t"r1
;t:
tl
::a
C'l
&
1
X
&
B
A
X
&
+
=
&
&
&
-1
-1
ON
ON
OFF OFF
0
-1
1
OFF OFF
C-D+E-F
+C-D+E-F
::a
t"r1
;;::
::a
>
t"r1
;0
C'l
>
z
C'l
'"rj
t"'"
tl
>
z
t"r1
til
tl
til
::a
n
B+C-D+E-F
-B+C-D+E-F
A-B+C-D+E-F
X=A-B+C-D+E-F
=A-B+C-D+E-F
INPUT LIST
(INFIX EXPRESSION IN
HIGH LEVEL LANGUAGE)
:-n
OFF OFF
OFF OFF
OFF OFF
OFF OFF
S2
A
X
-1
-1
-1
-1
Sl
&
&
&
&
&
&
OPERAND
HOLD STACK
OUTPUT LIST
(INSTRUCTION TRIPLES)
Table 5.6
-.)
t-)
1
2
1
2
-CD
+12
-AB
-CD
+12
&
&
-AB
-AB
1
2
&
&
=
&
&
&
ON
>-l
-1
1
1
-1
1
1
-1
-1
OFF
ON
ON
-F
E-F
-.J
tTl
::0
:E
;..
>-r1
>-l
Ul
;..
'9
;..
::0
::0
>-l
tTl
c:::
""
=::
>-r1
Ul
t-'
;..
::r:
+E-F
D+E-F
-D+E-F
OFF ON
OFF ON
ON
=::
tTl
-1
1
1
-1
&
D
C
1
&
C
1
&
-AB
-AB
&
&
C
1
-CD
+12
-AB
>-r1
c:::
z
u
;..
-AB
-AB
-AB
-AB
1
2
1
2
1
2
-CD
-CD
-CD
+34
-EF
+12
-CD
-AB
+12
-CD
1
2
3
4
1
2
3
&
&
-1
1
1
0
-1
+
+
=
&
&
+
+
2
1
X
&
2
1
X
1
1
1
0
-1
1
1
0
-1
&
&
3
X
F
E
ON
ON
ON
OFF
-F
E-F
'"
t'j
t'j
;>
'is:"
'"
;c
z
C'l
;>
C'l
t""
'Tl
;>
z
z
I:j
~
Vl
t'j
("')
t'j
s:
;:t
I:j
'"
t'j
t'j
C'l
"""
-...I
1++lIo/j
o/j
.j..l
s::
0
0
\0
lJ'")
OJ
CQQ
C:X:o
1
CQQ~t<\.::r
C:X:O~N..--t
1++
r-I
..Q
E-I
..--tN
..--t N
t<\.::r LC\
175
176
hierarchy at a time.
So far we have been dealing only with a very
simple sample expression, including only operators
of the same hierarchy. The simple unstacking rule and
its restriction works however equally well for more
complicated expressions like
(5.2)
The generation of instruction triples for this example
is demonstrated in table 5.7. The operators "*", and
"I" have been assigned a hierarchy value of +2, and +3,
respectively. The generation of the code is demonstrated both with and without using the restriction Re 2
to the unstacking rule. The two tree diagrams corresponding to the generated code are displayed in fig. 5.3
and fig. 5.4, respectively. As expected, the formation
of instruction tripples with "result" operands is delayed longest under control of the unstacking restriction Re 2.
In our discussion about the generation of optimized intermediate language lists for expressions given
in a higher level language we have not yet considered
the problem of local optimization. In this context we
are particularly interested in the elimination of common
factors or of common subexpressions in larger, more
complex expressions, and its influence on the execution
time. These questions are of particular interest for
the application programmer.
From the point of view of supporting parallel activities in highly parallel computer systems it may not
always be economical to eliminate common factors and
subexpressions completely. Sometimes this could make
the majority of operations in a sequence dependent on
the result of previous operations and the total task
would be put into an unefficient sequence.
As an illustration we consider the expression
(5.3)
which has first been discussed by Hellermann. A completely factorized form of this expression is given
by the equation
= A + Z
* (B
+ Z
* (C
z'"
D) ) .
(5.3a)
177
x
X=A+B*C/O*E-F
(5.2)
Fig.
5.3
X
X=A+B*C/O*E-F
X=(A+(((B *(C/O)) *E)-F
Fig.
5.4
(5.2)
A
X
C/D*E-F
&
&
:0
-1
i'l
t<"l
~
t<"l
;..
i'l
zC'l
;..
B
A
X
'Tl
t"'
;..
'"zt<"l
i'l
:n
p:
i'l
0
t<"l
t<"l
C'l
C'l
OFF
*C/D~E-F
B*C/D*E-F
+B~C/D*E-F
A+B1tC/DJtE-F
X=A+B C/D~E-F
=A+B C/D*E-F
INPUT LIST
(INFIX EXPRESSION IN
HIGH LEVEL LANGUAGE)
&
&
*+
-1
+
=
B
A
X
-1
1
OFF
OFF
OFF
OFF
OFF
OFF
S2
&
+
=
-1
-1
-1
-1
&
A
X
&
&
&
X
&
&
X
&
&
&
OPERAND
HOLD STACK
OUTPUT LIST
(INSTRUCTION TRIPLES)
Table 5.7
-.-J
00
ICD
)(B1
*B1
ICD
3
1
+
=
&
l<
-1
A
X
&
&
&
-1
+
=
-1
&
-)(
C
B
A
X
&
-1
-1
&
*+
&
+
=
C
B
A
X
&
C
B
A
X
&
OFF
OFF
OFF
-F
E-F
*E-F
D~E-F
ID*E-F
"l"l
\0
--l
'"
"l"l
..,
0
en
0
:>
:I:
'"
'it'"
:>
..,c""m
:;:::
(')
"l"l
en
0
t"""'
:>
..,zm
:>
:;:::
leD
+A3
-4F
leD
leD
)t2E
*B1
+A3
leD
*2E
~B1
+A3
leD
~2E
~B1
3
4
5
3
4
3
4
Table 5. ,7 cont.
X
&
1
B
X
&
1
B
&
&
&
&
*+
&
2
2
-1
2
2
-1
1
0
-1
1
0
-1
&
X
&
X
&
ON
ON
-F
E-F
t'l
t'l
:;.;
t'l
is::
>
t'l
i>':
:;.;
:.,;
C'l
>
z
C'l
'TI
t""
>
z
t'l
en
i>':
@
:;.;
;C
:;.;
0
C'l
00
leD
1
2
*12
3
4
leD
1
2
+34
-AF
*BE
~d2
*BE
*12
:tBE
leD
1
2
&
A
&
A
X
&
&
&
&
-1
1
1
0
-1
-1
1
1
ON
ON
F
00
t1:I
::0
>-l
"lj
'"0
>
z
Ii'
>
::0
::t:
::0
>-l
t1:I
c:::
'<I
s::
(')
"lj
'"0
>
t""
>-l
>
s::
t1:I
"lj
182
Z2
X
Z::I'Z
A + B*Z + C*Z2 + D*Z*Z2.
(S.3b)
x
X-A+B*Z**2+C*Z*3 (53)
INSTRUCTION TRIPLES.
1 * BZ
2 ** Z(2)
3 * C2
4 ** Z(3)
5 * D4
6 + 35
7 + 26
8 + 17
Fig.
5.5
183
X-A+Z.(8+Z.(C+Z.O)) (5.3a)
INSTRUCTION TRIPLES
1
ZO
2 + Cl
3 * Z2
4 + 83
5 * Z4
6 + AS
Fig. 5.6
Z2 Q2*Z
X -A+8*Z+C*Z2+0*Z*Z2(5.3b)
INSTRUCTION TRIPLES
1
2
3
*
*
*
*
*
+
+
+
Fig. 5.7
ZZ
8Z
Cl
OZ
41
35
26
A7
184
The minimum execution time for the three instruction lists depends of course strongly on the resources
which are available to the processor. Therefore the execution times of the three instruction sequences are determined in table 5.8 for two different configurations
of resources: for a system with 4 multiply units and
1 add unit, and for a system with 2 multiply units and
1 add unit. Only the execution times of the multiply
and add instructions have been taken into account, i.e.
instruction fetching," operand fetching and storing, and
storage conflicts have not been considered. In addition,
the actual processing time has been calculated and
listed, which means the time that the different resources actually work on the exectuion of the instruction sequences.
We notice that the expression (5.3) has the shortest minimum execution time on the machine configuration with 4 multiply units, and that the expression
(5.3b) has the shortes minimum execution time on the
machine with 2 multiply units, which can be considered
as a realistic configuration. It is surprising, that
the expression (5.3) is executed only slightly slower
than the completely factorized expression (5.3a) on a
machine with 2 multiply units, although the expression
(5.3) has twice as many multiplications as the expression (5.3a). For the expression (5.3) and (5.3b) the
internal processing time is larger than those for the
expression (5.3a), which is to be expected, because
the expression (5.3a) has the smalles number of multiplications. - Obviously, the completely factorized expression (5.3a) - which has the minimum number of multiplications - will be executed fastest on a computer
system with only 1 multiplication unit. Here we observe
the" fact, that although more actual work is done by the
resources, the minimum execution time is much shorter
than the internal processing time, because much work
(that is instruction execution) can be done in parallel.
This leads to the conclusion that the elimination
of common factors and subexpressions leads not always
to a decrease in the execution time. Whether such an
elimination has some advantage or not depends strongly
on the independent fuctional units available in the computer system and must be considered carefully by the
programmer. It must be stressed, that so far no compiler
is available, that transfers the three expressions
(5.3), (5.3a) and (5.3b) into each in order to generate
a code optimally suited for execution on a computer
system with a given set of functional units.
185
INSTRUCTIONS
FETCH
FETCH
MPLY
FETCH
FETCH
MPLY
FETCH
FETCH
SUB
ADD
ADD
STRE
Rl,A
R2,B
Rl,R2,R6
R3,C
R4,D
R3,R4,R7
R5,E
Rl,F
R5,Rl,R2
R2,R7,R7
R7,R6,R6
R6,X
10
20
30
40 TICYCLESI
E3
I
I
E3
I
I
I
I
E3
E3
E3
E3
E3
>------E3
>------E3
>------E3
----.:3
f----------i
Fig. 5.8
-45.---_~
.---_~-29
-36L--=..-----'
L-....;.....---'-24
-34,---_~
-25'--~---'
- 25 ,-----""----,
INSTRUCTION LIST:
- 14 '--""""""---'
FETCH
FETCH
FETCH
FETCH
MPLY
FETCH
FETCH
MPLY
SUB
ADD
ADD
STRE
Fig. 5.9
Rl,C
R2,D
R3,A
R4,B
Rl,R2,R6
R5,E
Rl,F
R3,R4,R7
Rl,R5,R2
R2,R7,R7
R6,R7,R6
R6,X
186
Table 5.8
( 5 . 3)
(5.3a)
MRT
APT
(cycles)
11
11
4
4
44
26
4
4
30 715
C5.3b)
MRT
APT
(cycles)
11
11
33
26
4
4
3C)
4
4
Dr
( 5 . 3)
11
11
11
11
22
26
15
11
----zrn- 715
11
11
11
11
11
4
4
lf5lf5
(5.3b)
MRT
APT
(cycles)
11
11
11
11
(5.3a)
MRT
APT
(cycles)
MRT
APT
(cycles)
22
26
15
37 Dr
MRT
APT
(cycles)
11
22
11
11
11
11
4
4
lf5~
187
So far we have discussed optimization on the conceptual level, that is the rearrangement of individual
operations in such a way as to ensure minimal interdependence between the instructions. Optimization on this
level leads to an intermediate language list which allows
a maximum rearrangement of subtasks for parallel processing.
We will turn now to a discussion of the optimization on the machine specific level, that is the rearrangement of independent subtasks in such a way as
to make most economic use of the available resources
(independent functional units) and to ensure parallel
processing whenever possible.
First we describe briefly the assignment of registers to instructions. Register optimization is a
very subtle taks, and quite some literature exists
which is concerned with that problem. Again we restrict
ourselves here to a very simple approach which allows
to demonstrate the main features. The pseudomachine is
defined to have 7 registers: 5 registers (R1-R5) for
operand fetching and 2 registers (R6, R7) to store results. Each set of registers is used cyclicly. If no
result registers are available we use one of the available operand registers (R1-R5), to store intermediate
results. It is expected that this cyclic use of registers increases in the average the time before a register is re-used, and thus minimizes register conflicts.
Each register can be used again, if it has once been referenced as source of an operand. In general, the appropriate rule will be more complex, because more than one
instruction can refer to the same operand. Then a counter has to be used to keep track of the number of source references to the register, and it will become available to be re-used, if the counter is zero. We will
assume further that the registers available will be
sufficient to execute the given instruction sequence
without backing up register contents temporarily in
storage locations. In actual compilers, of course, care
has to be taken for such a backing up of registers.
The register assignment, as well as the optimization on the machine specific level to be described,
will be demonstrated using the expression
(5.4)
The intermediate language list (instruction triples)
188
189
-Table 5.9
( 5.4 )
X=A .. B+C~D+E-F
Instruction triples
"Instruction list"
*AB
FETCH A
FETCH B
IMPLY A,B,R1
FETCH C
FETCH D
MPLY C,D,R2
FETCH E
FETCH F
SUB E,F,R3
ADD R2,R3,R4
ADD R1,R4,R5
STRE R5,X
ltCD
-EF
+23
+14
Table 5.10
1
2
3
4
5
6
7
8
9
10
11
12
FETCH
FETCH
MPLY
FETCH
FETCH
MPLY
FETCH
FETCH
SUB
ADD
ADD
STRE
R1,A
R2,B
R1,R2,R6
R3,C
R4,D
R3,R4,R7
R5,E
R1,F
R5,R1,R2
R2,R7,R7
R7,R6,R6
R6,x
R2
K
K
K
K/S K/S
K
K/S K
K/S
R3
R4
K
K K
K/S K/S
R5
R6
K
K
K
K
K
K
K K
K/S K
K
K/S
R7
K
K
K
K
K
K/S
190
191
10
FETCH RI,A
FETCH R2,B
lIB
FETCH R3,C
FETCH R4,D
MPLY
lIB
MPLY
R3,R4,R2
ADD
R6,R7,R6
ADD
R2,R6,R7
STRE
R7,X
I!!I
: I
lIB I
I!!I
--j
lIB
f--------E3
lIB
0
I!!I
FETCH RI,D
SUB
ADD
R5,RI,R7
MPLY
R6,R7,R6
R3,R4,R7
ADD
STRE
R6,R7,R6
R6,X
f--------E3
-----.e
INSTRUCTIONS
FETCH R4,F
MPLY RI,R2,R6
FETCH R5,C
f--------E3
I!!I
Fig.
FETCH RI,A
FETCH R2,B
FETCH R3,E
40 TICYCLESI
~0
I!!!I
FETCH R5,E
FETCH RI,F
R5,RI,R7
30
RI,R2,R6
SUB
20
10
I--------l
5.10
20
30
I!!I
I!!!I
lIB
..
: I
.1
lIB
...
f---------D
f---------D
lIB
lIB
E3
Fig.
5. 11
40 TlCYCLESI
192
193
ticular, it was responsible for collecting the different data from the positions in core, and finally
to initialize the functional unit. The complete buffer area was then transferred to the external storage
under control of the I/O functional unit. Thus, the
control processor was still strongly involved in I/O
operations. There were a number of variations in this
approach as for example, the number of buffers supplied
or the technique of filling the buffer (scattered read
and write). But the fixed buffer length put quite some
restrictions on the data format on the external devices.
Present day I/O functional units are much more
advanced, that is much more control functions have
been transferred from the CPU to the I/O functional
units. But still today, the control logic differs considerably from manufacturer to manufacturer. The two
ends of the development are: highly general I/O processors that can serve many different devices, and
highly special I/O processors which are oriented to
serve one or a group of logically similar I/O devices.
We will discuss I/O processing, using the IBM facilities as example of a rather general and flexible system.
The input/output system basically consists of
channels, control units, and devices. The channel
provides paths for data and control signals from the
processor storage to the control unit and vice versa.
The channels vary by the following characteristic
features: (1) the speed, that is how many units of
data can be transferred in a time unit, (2) transfer
width, that is how many units of data can be transferred simultaneously, and (3) in their capability of
data transfer: simplex channels transfer data in one
direction, half-duplex in both directions sequentially,
and duplex in both directions simultaneously.
The use of a channel with a given speed characteristic is limited to such devices, which have equal
or smaller data speeds. Otherwise, the channel receives data from the control unit with a higher speed,
than it can transfer them to core, and data would be
overlayed and lost. - Specialized channels have been
developed for devices with a typically slow data rate,
like card readers, card punches, and line printers.
The characteristic of these multiplexor channels is,
that they serve a number of devices simultaneously at
194
195
Information isstored on magnetic tape as continuous areas of data and control information, called
physical records, separated by areas containing no information, called record gaps. To write and read information, the tape is transported over the write/
read head with constant speed. The time necessary
to write or read data is called the write/read time,
or data transfer time. The time necessary to accelerate
the tape to constant speed and to stop its movement
after the I/O operation, are called start and stop time.
It is actually because start and stop times are finite
that record gaps are necessary. The characteristic
feature of data stored on magnetic tape is that they
can be accessed only sequentially. The tape has to be
moved from its present position to the position of the
record to be accessed by passing over all records physically stored on the tape between the two positions.
Typical sizes of records and record gaps and of timings
for read and write operations are given in table 6.1.
A magnetic disk (pack) is built of a number of
disks, mounted on the same axes. In operation, the
disk pack is spinning permanently with constant speed.
Above each disk surface, one (or a small number) of
write/read heads are positioned. Each write/read head
can access a spherical trace on the spining surface,
called a track, the collection of all tracks on different surfaces above and below each other is called
a cylinder. All write/read heads are mounted on the
same arm in such a way, that data on the same cylinder
can be accessed without moving the arm. To access data
on different cylinders usually the disk arm has to perform a radial movement.
Data are stored on tracks, data exceeding one
track in length are stored on cylinders. The data are
stored on the disk in such a way that each track contains the same number of data. The characteristic
feature of data stored on disk is that each physical
record can be accessed directly. The write/read head
can be moved directly, according to control information,
into a position to access the specified record. This
mayor may not (but usually will) involve a radial
movement of the disk arm. The average time needed for
this movement is called the average radial delay time.
The average time passing until the record to be accessed passes over the write/read head is called the
average rotational delay time. Typical data transfer
and aver. delay times are listed in fig. 6.1.
MAGN.DRUM
IBM 2301
MAGN.DISK
IBM 2314 A1
IBM 3330
4.6
0.6
RECORD
RECORD
(7200 BYTES)
GAP
LENGTH
LENGTH
(inch)
(inch)
60
30
(ms)
(ms)
5.3
AV.ACCESS
TIME
START/STOP
TIME
8.6
12.5
8.4
(ms)
AV.ROTAT.
DELAY TIME
6.0
23.1
5.8
40.8
14.6
95.6
44.2
46.1
DATA
AV.WRITE/
TRANSM.
READ
TIME
TIME
(ms)
(ms)
MAGN.TAPE
IBM 2403/6
9 TRACK,1600 BPI
DEVICE
(MODEL)
Table 6.1
m
m
;<l
;;;::
m
m
;..
;<l
:e
Z
C'l
;..
C'l
""I
::E
0
r
zm
;..
z0
en
('l
;<l
;t
;<l
C'l
\0
0\
197
198
are transferred between the two areas on completion of both independent tasks. Normally, the completion of both tasks is controlled by the compute
task, but this is not a necessary condition. In
exchange buffering the assignment of the areas is
switched on completion of the indepen~t tasks: the
buffer area becomes compute data area and vice versa.
Obviously, dynamic buffering can only be realized
within the scheme of exchange buffering technique.
A special form of exchange buffering is the cyclic
buffering where the compute data and buffer areas
are switched cyclicly. The number of buffers to be
used and the buffering technique to be applied is
completely determined by the bounding characteristics
of the total elapsed time. We can decide between the
following cases:
(1) Compute bound problems, and
Problems with balanced compute and I/O times:
Here a second subdivision is necessary.
(1a) The compute time of processing a data record
is always larger then the I/O time required
for the record: In such cases single buffering
is necessary and sufficient to achieve complete overlap of the operations. The assignment of more than one buffer is a waste of
processor storage and leads to no decrease
in total elapsed time.
(1b) The compute time of processing a record
varies and may be smaller than the I/O time
required for a record: In such cases multiple
buffering has to be used to achieve completely overlapped operations. The number of
buffers necessary to achieve complete overlap
is hard to predict exactly and depends on the
variations in compute time per record. In
general, the number of buffers has to be encreased, if the variations in compute processing times of different records encrease. The
minimum number of buffers is two. The encrease
of buffer size may be useful within certain
limits, but it becomes less effective if the
variations in compute time per record increase.
If input and output operations are necessary
for each processed record, then dynamical,
cyclic buffering is ideally suited and advised.
199
H. Lorin,
A. Veillard
C.N.R.S., Strasbourg (France)
INTRODUCTION
Since the Schrodinger equation cannot be solved exactly for
polyelectronic systems, one has to look for approximate solutions
of any desired accuracy. One way is the use of the variation methodD]. The Eckart theorem shows that any trial wavefunction ~
(which is normalizable) leads to a value of the energy E which is
never lower than the true ground state energy E of the system
HI/!
E
EI/!
( 1)
<~IHI~>
(2)
E)- E
<~I~>
(3)
This lS the key to the power of the variation method of approximating solutions to the Schrodinger equation. One can choose
the "best" wavefunction from several alternatives on the basis of
the criterion of lowest energy (however one should remember that
the energy can be an insensitive criterion with respect to a "best"
wavefunction for other physical properties).
SCF procedures have arisen from the use of a variety of approximate wavefunctions given as linear combinations of Slater
determinants (this insures that the wavefunction is antisymmetric
with respect to exchange of any pair of electrons) :
Diercksen et al. (eds.), Computational Techniques in Quantum Chemistry and Molecular Physics, 201-250.
All Rights Reserved. Copyright 1975 by D. Reidel Publishing Company, Dordrecht-Holland.
202
A. VEIL LARD
'I' =
or
cI> 1 ( 1 ) ...
cI>2 ( 1 ) ...
cI>1 (n)
cI>2(n)
cI> (n)
n
'1'( 1 ,2 . n) =/lcI>1 ( 1 )
(4)
cI>n(n)\l (5)
<Pl(2n)
( 6)
'I' =
the use of the variational method to,find ~he best possible orbitals <P leads to the Hartree-Fock SCF equations. If each M.O.<p is
expressed 'as a linear combination of basis functions (usually atomic orbitals hence the name of LCAO approximation), the HartreeFock (HF) equations turn to ~he Roothaan SCF equations [2].
Open-shell systems give rise to a number of difficulties.
For instance, in the case of two degenerate configurations as
tt-t-
or
4--tt
'I' =
203
or FC
= s
(8 )
= xC
Similar problems are encountered when trying to solve the various SCF equations in the most efficient way. Let us mention:
1) The problem of building in the most efficient way the F matrix
or matrices from the two-electron integrals, which can number up
to several millions and are necessarily held in a slow access store
(magnetic tape or disks) ;
204
A. VEILLARD
THE
P~RTREE-FOCK
AND
2:
~<v
2
2
32
-3- + 3 + 3z2
3y2
3x 2
v
v
v
-- +
r
~v
ZMZN
2:
M<N RMN
= kinetic
operator
"d 1)
..fiN!
( 11)
( 12)
( 10)
(PI ( 2 ) . . ..
l ( 2) . . ..
4> d 2N )
"i (2N)
(13 )
Alp AP
( 14)
THE LOGIC
or SCF PROCEDURES
2N
2N
1. +
i=l
205
i ,j= 1
i<j
J ..
lJ
(15 )
Z' K ..
i ,j lJ
*'
1. =
'*
H l (v)H(v).(v)dT = !.(v)H(v).(v)dv
l
v
l
l
V
(16)
( 17)
= dv
do
( 18)
*' *"
J ..
Kij
!:(jJ)j(V)H'(jJ,v)i (v)j(jJ)dvjJdv v
lJ
=:
1. +
i=l
2Z
~
i,j = 1
'Y
Z Z
i=l pq
C~.
lp
C~.
lq
H~
pq
+2 Z
~,jJ
Z
Z C~. C~. C.. C .
i,j=l pqrs lp lq ~Jr jJJs
~pq
r";"\
2Z
i
c,S\pq, ~rs
(20)
Summation lS over
(
(2J iJ'-\J') the space orbitals "'l' 21)
~~~::~~~
~pq ~pq
.l2
~,jJ
pqrs
(22)
D1'
C~.
lp C~.lq
~
~
1
<pqlrs> = Ix (1)x (2)-X (l)X (2)dv 12
p
r
r 12 q
s
(23)
(24)
(25)
p,qE:~
r,sEjJ
(26)
206
A. VEILLARD
*l
,+
2: CA S A CA =o .. =C.SC.
lp pq Jq lJ l J
.()J)dv =0 .. (27)
J
)J lJ
!~. ()J)~
(28)
pq
><:-
F~.=
2: ~.e .. (30)
. 1 J Jl
J=
2: FA CA =2: SA CA EJ i
pq lq jq pq Jq
q
Apq
=H
+2: 2:
Apq)J rs
fi?Apq,)Jrs D)Jrs
(33)
1
J. (v)f(v)= (J~.* ()l)~.
()J)dv ) f(v)
l
l
r
l
)J
(Coulomb operator)
1
K. (v)f(v)=(f~.* ()J)-f()J)dv
H. (v)
l
l
r
)J l
)JV
)JV
(3~)
F~.=e.~.
l
(39)
or
FC=ESC
(~O)
(~1
(~2)
2: CA' CA HA
pq
lp lq pq
E.
l
N
1.+ 2:
l
J=
12: D
(2J . . -K .. )
lJ
(~3)
lJ
+12: 2:
Apq Apq 2
Apq )Jrs Apq,)Jrs
pq.
)Jpqrs
(~4)
207
CHAPTER II. THE LOGIC OF THE SCF CALCULATION FOR THE CLOSED-SHELL
CASE
The SCF calculation proceeds in an iterative way. Each iteration includes essentially three steps :
- building the F matrix ;
- solving the pseudoeigenvalue equation FC = ESC ;
- recognizing whether the self-consistency has been reached ; if
not. one has to define a new set of trial vectors C to be used as
the input for the next iteration.
Each of these steps require some auxiliary calculations like :
the calculation of the density matrix D from a set of given vectors ;
- the calculation of the total energy E at each iteration simultaneously with the building of the Fock matrix;
- an orthogonalization process to insure that the set of trial
vectors is orthonormalized.
He shall go through each of these three steps.
1 - Building the F matrix
This requires the density matrix D and either the ~ supermatrix or the list of two electron integrals <pqlrs>. Some molecular
programs generate a list of two-electron integrals (POLYATOM [101
MUNICH [111. ASTERIX [12}) while some others (1I\OLECULE [131) create
directly the supermatrix (i.e. a list of
elements) without storing separately the two-electron intg~r~fs. He shall consider both cases.
gJ
.
.
A. From the D matrlx
and the.G)S supermatrlx.
He assume that
the basis set used is a basis of symmetry adapted functions belonging to the irreducible representations A of the molecular
point group. theu the Fock matrix F is symmetry blocked*:
A=l
A=2
A=3
Apq
=H
Apq
~ ~ Apq.~rs D~rs
(45)
~rs
*If the basis set used is not a basis of symmetry orbitals but rather one of pure basis functions (for instance atomic functions.
either Slater functions or contracted Gaussian functions. centered
on each atom) then A=l.
208
A. VEILLARD
(46)
(since
~APq,~rS= ~qp,~rs
and HApq=HAqp)
The index A refers to the irreducible representations of the molecular point group. The index i refers to the occupied molecular
orbitals. The indices p and q refer to the basis functions. Clearly DA =D A
and we need to store in computer only the lower half
pq
qp
.
.
()
of the square matrlx for lnstance p~.
For example, in the case of two irreducible representations
with respectively three and two symmetry orbitals, we need to consider the following elements of the D matrix
Aq
AP
11
12
13
24
25
11
12
13
111
121
131
122
132
133
24
25
244
254
255
To use the computer storage in the most efficient way, these matrix
elements are stored not as a matrix but as a one-dimensional array
which is sometimes called a supervector. They will be stored in the
order 111 121 122 131 ...
The total number of matrix elements is
L
mA(~+1)
c~n
be written as
Loop over A
Loop over p=l,m A
Loop over q=l,p
DApq=O.
Loop over i
D
=D
+2C. C .
Apq Apq
Alp Alq
The C vectors to be used are either the trial vectors at the first
iteration, or the results of the previous iteration or the results
of some extrapolation procedure (see below).
209
.
The next step is the contraction of the,) supermatrlx with
the density matrix (this is sometimes said to produce the P matrix) .
(\~
Apq
=H
Apq
+P
Apq
=H
Apq
~
D
, Apq, ].Irs ].Irs
+ Z Z
lJ rs
(48)
'.~)
j
Apq,lJ rS
[I
I]
(49)
r:'
j
Apq,lJrS
_0::,'
_;~
r--:\
-,\
_J
='S;'
Aqp,lJrS Apq,lJSr lJrS,Apq
(50)
l.:'
So the supermatrix \.~ lS symmetrical with respect to the exchanges
p
Apq
lJrs
(,
lJrs
111
121
111 / 11 1
121/111
122/111
122
121/121
122/121
131
133
244
254
255
122/122
if lJ=A
Loop over s= 1, r
if r<p
s=l,q
if r=p
CfApq , lJrs
132
if lJ<A
$' supermatrix
may be written as
210
A. VEILLARD
Cf
AN
P
Apq
z ~
].Jrs
Apq, ].Jrs
].Jrs
1
j
as above
Loop over s
Read
~Apq, ].Jrs
=P
].Jrs
P Apq
(),
+'S'
].Jrs
Apq,].Jrs
P Apq
+0
~ Apq,].Jrs
Apq
].Jrs
In order to have the correct contribution, it should be realized that each element ~A
in the calculation of PI..
should
also account for the term pq, ].Jrs
D . Since D pq= D
,
.
I\pq, ].Jsr .J.lsr.
].Jsr
].Jrs
the easlest way to have the correct contrlbutlon when only nonredundant elements of the supermatrix are stored is to store the
off-diagonal elements of the density matrix after multiplication
by a factor of 2.
9,
However, depending on the organization of the integral package, it is not always possible to generate the ~supermatrix in the
canonical order as defined by the above algorithm. Although this
COuld be done in a separate reordering step, then it is probably
more economical to have the j)supermatrix elements written in an
arbitrary order. Then each supermatrix element must be assigned
two indices IJ and KL :
211
I(I-1)/2+J
K(K-1) /2+L
IJ
KL
hr
the~
P ( IJ ) = P ( IJ ) +
supermatrix elements
5l (IJ , KL)
~ D( KL )
+~(IJ,KL) ~ D(IJ)
P(KL) = P(KL)
However, one has now to read from the slow store not only the
supermatrix elements but also the corresponding indices IJ and
KL. This may result in a significant increase of the 10 time whenever the number of
supermatrix elements is large.
5'
R>S
P>Q
1
PQ = -P(P-1 )+Q
2
and
PQ
RS
1
RS = -R(R-l
)+8
(54)
(55)
212
A. VEILLARD
<pqlrs>-~<prlqs>
(56)
instead of
S?pq,rs
(57)
pq
rs
ps
sp
rq
qr =
G
G
pr
rp
qs
sq
<pqlrs> D
+ <pql sr> D
rs
sr
<rslpq> D
+ <rslqp> D
qp
1
pq
--<pql sr> D
21
qr
- -<srlpq> D
12
rq
--<rslqp> D
21
sp
- -<qplrs> D
12
ps
--<pqlrs> D
21
qs
- -<rslpq> D
12
sq
--<qplsr> D
21
pr
- -<srlqp> D
2
rp
(58)
For the most general two-electron integral <pqlrs> with no COlncidence among the four indices, these contributions reduce to
2<pqlrs> D
rs
G
2<pqlrs> D
rs
1
pq
G
- -<pqlrs> D
ps
12
qr
G
= --<pqlrs> D
qr
21
ps
- -<pqlrs> D
G
pr
12
qs
G
--<pqlrs> D
qs
pr
2
G
pq
(59)
213
X2 = X + X
X=-05"X
G(PQ)
G(PQ)
G(RS)
G(RS)
G(PS)
G(PS)
G(QR)
G(QR)
G(PR)
G(PR)
G(QS)
G(QS)
+ X2 ~ D(RS)
+ X2 ~ D(PQ)
- X " D(QR)
- X " D(PS)
- X " D(QS)
- X " D(PR)
( 60)
(with P
Q)
(61 )
214
A. VEILLARD
DIMENSION A(5,5),B(5,5)
DATA I,J /4,2/
A(I,J)=O.
B(I,J)=O.
END
IMJ
+
X11,NINTR$
0000,0
AO,J
LA
MSI,XU AO,5
AO,I
AA
X1,AO.
LX
A-6,X1
SZ
SZ
B-6,X1
LMJ
X11,NSTOP$
+
(0050505050505)
Table 1. Fortran and machine code (on 8 Univac 1108) for adressing a matrix element A(I,J)(right) or an element of a linear
array A(IJ)(left).
C
IJKL I>J>K>L
TWOINT=TWOINO+TWOINO
IJ=ITABL(I)+J
IL=ITABL(I)+L
IK=ITABL(I)+K
JK=ITABL(J)+K
JL=ITABL(J)+L
KL=ITABL(K)+L
F(KL)=F(KL)+P(IJ)*TWOINT
F(IJ)=F(IJ)+P(KL)*TWOINT
F(IL)=F(IL)+P(JK)*TWOIN1
F(JK)=F(JK)+P(IL)*TWOIN1
F(IK)=F(IK)+P(JL)*TWOIN1
F(JL)=F(JL)+P(IK)*TWOIN1
LA
FA
SA
LX
LX
LX
LA
AA
SA
LX
LA
AA
SA
LX
LA
AA
SA
LX
LA
AO,TWOINO
AO,TWOINO
AO,TWOINT
X2,I
X6,J
X4,K
A2, ITABL-1 ,X2
A2,J
A2,IJ
X5,IJ
A4, ITABL-1 ,X2
A4,L
A4,IL
X1,IL
A6, ITABL-1 ,X2
A6,K
A6,IK
X3,IK
A8,ITABL-1,x6
215
AA
SA
LX
LA
AA
SA
LX
LA
AA
SA
LX
FM
FA
SA
LA
FM
FA
SA
LA
FM
FA
SA
LA
FM
FA
SA
LA
FM
FA
SA
LA
FM
FA
SA
A8,K
A8,JK
X2,JK
A1O,ITABL-l ,x6
Al0,L
Al0,JL
X6,JL
A12,ITABL-l,x4
A12,L
A12,KL
X4,KL
AO,P-l,X5
AO,F-l,x4
AO,F-l,x4
A2,P-l,x4
A2,TWOINT
A2,F-l,X5
A2,F-l,X5
AO,P-l,X2
AO,TWOINl
AO,F-l,Xl
AO,F-l,Xl
A2,P-l,Xl
A2,TWOINl
A2 ,F-l ,X2
A2 ,F-l ,X2
AO,P-1,x6
AO,TWOIN1
AO,F-1,X3
AO,F-1,X3
A2,P-l,X3
A2,TWOIN1
A2,F-l,x6
A2 ,F-1 ,x6
Table 2 (End). Algorithm for the computing of the G matrix for the
case P > Q > R > S : the Fortran algorithm and its transcription into
machine language (as given by the compiler).'
Note : in the above algorithm, the notations F and P stand for the
matrices G and D of the text. A relatively large number of instructions may be cut down by writing directly this fraction of the program in machine language.
There is an appreciable saving of computer time (of the order of
10 %) in the last case.
The above method requires that different formulae be used
according to the nature of equalities between the four indices
P,Q,R and S associated with a two-electron integral. If only the
half-matrices D and G are to be used, one has to consider fourteen
216
A. VEILLARD
SUBROUTINE AUX
COMMON A
DIMENSION A(3)
1=3
A(I)=O.
RETURN
LA,U
M,XU
SA
LA ,XU
SA
LA,XU
SA
LX
AX
SZ
AO,*O,Xl1
AO,-l
AO,NTEMP~
AO,3
AO,I
Xl,I
Xl,NTEMP$
O,Xl
END
LX
SZ
AO,3
AO,I
Xl,I
A-l,Xl
Table 3. Fortran and machine code (on a Univac 1108) for adressing
a matrix element which is passed either as a subroutine argument
(left) or through a common (right).
NiN2
Al(H 2 0)4
43
93582
1 '27"
l ' 21"
1'16"
65
1276823
4'27"
3'52"
217
I
J
K=L
I =K> J =L
I =J > K=L
I =J =K > L
I =J > K > L
I > J =K= L
I > J > K L
I > K L > J
I > J = K> L
I = K> J > L
I > K> J = L
I > J > K> L
I > K> J > L
I > K> L > J
IIII
IJIJ
IIKK
IIIL
IIKL
IJJJ
IJKK
IJKK
IJJL
IJIL
IJKJ
IJKL
IJKL
IJKL
Table 5. The fourteen classes of integral labels (the corresponding contributions to the Fock matrix may be found in
Ref.GO]and [1~
GO TO (14,12,11,10,4,13,7,8,6,5,9,1,2,3),IC
LA
TLE,U
JP
SLJ
J
J
J
J
J
J
J
J
J
J
J
J
J
J
+
A2,IC
A2,15
,A2
NERR2$
14L
12L
11L
10L
4L
13L
7L
8L
6L
5L
9L
1L
2L
3L
0157,(W.B.)
A2,~+1
218
A. VEILLARD
RefDY)
219
K7 L
IJ rKL
IJ(IJ-l)/2 + KL
= KL(KL-l)/2 + IJ
IJ 7 KL
KL > IJ
( 62)
220
A. VEILLARD
IJ = ITAB (I) + J
KL = ITAB (K) + L
IJKL = ITAB (IJ) + KL
(IJKL was computed in the first step
by storing it with the integral) ;
- the integral is stored in the sorting area according to the value of IJKL.
Integral
file on tape
(IJKL from
1 to 10 6 )
Integral
buffer
(in core)
20 buffers
(in core) for
each category
20 files on
disk (size of
each file
5.104 )
t y~
~
----.
BUF1
FILE 1
1~IJKL~5. 104
} I/o
Integral
sorting
BUF2
} Core
process
~}
'\
I/O
FILE 20
9.5 105<IJKL~106
FILE 2
5.104<IJK~105
"J
221
of this stage.
Some comparative timing for the SCF iteration either from
the integral file or from the ~ file has been given by Raffenetti
[n}. The benefit of preprocessing the integral file is probably
not decisive when only one electronic state is considered. It may
be worthwhile when several electronic states have to be computed.
Since most excited states have open-shell configurations, the SCF
treatment will then require two supermatrices ~ and Q.. or ~ and
(cf. below). Preprocessing of the integral file to form the ~Q.
or ~~ file is a straightforward extension of the above procedure
and has also been described by Raffenetti [17}.
X.
EC'
( 64)
(65 )
with s a diagonal matrix (s and t are eigenvalues and eigenvectors o~ the overlap matrix S~.
222
A. VEILLARD
Then the diagonal matrix s -1/2 1S set up and one constructs the matrix
a
.( 66)
We rewrite (64) as
VFV+(V+)-lC
(67)
or
F'C' = EC'
(68)
with
F'
VFV+
( 69)
C'
(V+)-l C
(70)
VSV+
slnce
(71)
Given the overlap matrix S, one has to solve for its eigenvectors
t and eigenvalues s . From these two matrices one computes the V
matrix and the F' m~trix. One finds C' by solving the eigenvalue
problem and one finds C as
= Z Y
X
p rp p
1S defined
or
S' =
00
+ +
= yXX Y
= ySy
223
u +8 ' u = g
-1/2 + 8 + -1/2
uyyug
11
Then one sets
~F~+
= F'
v
v
= V+W = g-1/2y+uW
(80)
cos
o
o
pq
-sin
column
tp
0 ..... 0 sin
1 ..... 0
0 ..... 1
row p
column q
(81)
224
A. VEILLARD
1S
B = T- 1 A T = TAT
pq
pq
pq
pq
One will choose
such that b
pq
= b
qp
= 0
(84)
i,j
i#j
C~.
1J
< E
(85)
FC = ESC
225
C+SC
o
(87)
(88)
C+SoC = 0
o
(89)
E =C FC
000
OE
(90)
If we sUbstitute C and E into Eq. (85), and multiply the resulting
equation from the left by C:. we obtain readily
oE=C+FoC=C+(F-E S)oC
o
(9')
g =(F -E
s)c 0
(94)
G oC =
o
where G
o
~s
226
A. VEILLARD
= oCI_(C+SOCI)C
o
(99)
227
- trial vectors for the borazane molecule BH3NH3 will start with
the SCF vectors for the NH3 and BH3 molecules.
In the POLYATOM program, trial vectors are taken as the
eigenvectors of the H matrix, id est the one-electron part of the
hamiltonian [10]. If the basis set used is a minimal basis set,
trial vectors for the valence orbitals may be obtained as the solutions from one of the many semi-empirical methods for the valence electrons (Extended Huckel Theory [22], CNDO method [23]).
If the basis set used is a double-zeta set, one may use the vectors solution of a semi-empirical method by splitting the corresponding coefficients. A scheme has been proposed for the construction of a trial density matrix when the basis set is made of localized orbitals (more precisely for a basis set of floating gaussian functions) based on the idea of combining molecular fragments
[24J.
Trial
vector
C.
i+l
C.+l
Final
vector
/~
Ci +1
Ci +2
228
A. VEILLARD
( 100)
/:J.C. = C.-C
1
= m/:J.C.1
C.
- C
1+1,p
p
m (C. -C )
p 1p P
( 102)
C.
- C
1+2,p
p
m (C.
-C)
p 1+1,p p
( 103)
( 104)
-1
(aCi-2SCi+1+yCi+2)
(105)
= (C i +2-C i + 1 ) (C i +2-C i + 1 )
= (C i +2-C i + 1 ) + (C i + 1-C i )
( 106)
229
C = aC i +(l-a)C i + l
( 107)
XC
(108)
~r ~l
(I:6+)(!_})(~d~2)(-f-)
~2
(I: 6 +) (-!~
6
+
= 1+6 6
(111 )
230
A. VEILLARD
The corresponding variations (to first order) In the density matrix are given by
D+oD =
+
rIo
o~+
) = (0:6) + (~lo-) + higher terms
~~)(r:~
with* D=CC+
(112)
(113)
pq
= L 20D
pq
pq
(H
pq
pq
+ L DC\))
rs
Fmo
pq
rsUpq rs
'
( 114)
Fmo now stands for the matrix elements of the Fock operator in the
bR~is set of the trial molecular orbitals. Then by using equatlon
(112) for the expression of oD one obtains
occ. virtual
oE = 4 L
L
~ki F~~ + higher terms ( 11 5)
i
k
One notices that if
i) all ~ki are chosen of opposite sign to corresponding F~~ ;
ii) all ~ . are sufficiently small to make the higher terms of
equation t~15) smaller in absolute value than the first order
term ;
then the energy will go down (or remain stationary if all F~~ are
zero, id est at self-consistency). Then one would have a metnod
guaranteeing convergence to a stationary point on the energy surface (although not necessarily the ground state).
Let us perform an iteration using as basis set the trial molecular orbitals of this iteration. This is easily achieved by
constructing the Fock matrix F in the basis functions representation and by transforming it to the molecular orbital basis through
a linear transformation
( 116)
231
\ Fmo
ki
1\
(117 )
We have assumed that the trial molecular orbitals obey the aufbau
principle (all Ffkmo greater than all F~~). Then we see that the
A '
II
.
H
U ki ~re lndeed 0
Opposlte slgn to ~he correspondln? Fki . ~w:ver
one lS unable to guarantee ~he magnltudes of 6 ki .belng sufflc:ently small that convergence lS assured. If the ~rlal wavefunctlon
is far from convergence, then the analysis is less clear since
the application of first order perturbation theory may be invalid,
and energy reordering of trial molecular orbitals may invalidate
equation (117).
However, convergence may be insured by adding a positive
constant A to the diagonal elements F~ before diagonalization.
One then performs the diagonalization of this "level-shifted"
hamiltonian and orders the resulting eigenvectors according to
the aufbau principle based on the resulting eigenvalues. First
order perturbation theory now yields
6 . = Fm~ / (F~~ - Fmo - A)
kl
kl
II
kk
(1
Hll
232
A. VEILLARD
C' = CV
C being the trial set of eigenvectors and V the eigenvectors resulting from the diagonalization of the level shifted Hamiltonian.
Then Ck ' the eigenvectors defining the molecular orbitals at the
kth iteration, are given by
mRy
233
234
A. VEILLARD
the basis set) and the time needed to carry out the diagonalization goes as m?, whereas the built-up of the Fock matrix goes as
the size.of the integral list, namely m4. This last process is
optimally carried out with integrals over symmetry adapted functions, since the number of two-electron integrals is reduced appreciably with the symmetry. However the transformation time for
computing symmetry adapted integrals from integrals over basis.
functions ~oes at b~st as the fif~h power of the basi~ set [33}
and the prlce of thlS transformatl0n becomesoverwhelmlng when
increasing the size of the basis set. Clearly one needs some way
to take advantage of the symmetry in the construction of the Fock
matrix without going through the integral transformation.
Such an approach has been proposed by Pitzer and collaborators [34-36J and is called the equal contribution method. It makes
use of the theorem that symmetry related integrals over basis
functions make equal contributions to symmetry adapted integrals
with totally symmetric integrands. A simple instructive example
given by Pitzer [35] corresponds to the calculation of overlap
integrals between the symmetry adapted functions obtained from
the three hydrogen s orbitals (denoted s1 s2 s3) in NH3 (C 3v symmetry)
a 1 = s 1 + s2 + s3
e = 2s 1 - s - s3
x
2
e = V3(s2 - s3)
y
( 121)
(123 )
which does have a totally symmetric integrand. As one may see from
the contributions of the integrals over basis functions to the
symmetry adapted integrals as given in Table 7, the optimum procedure is to compute only two integrals over basis functions, such
235
-4
-4
(e Ie)
x x
(e Ie)
y y
-6
(e I e)
-2
-2
-2
Table 7. Contributions of the overlap integrals over basis functions to the symmetry adapted overlap integrals for the s(H) functions of NH3
The corresponding algorithm to utilize this theorem In an SCF
program has been described in Ref.[34]. It requires that the integral routine first generates a list of all the integrals over basis functions, grouping together those which are required by symmetry to be equal in magnitude (more complicated symmetry determined relationships are not considered). Since each integral in a
group will make the same contribution to a given symmetry adapted
integral, only the first integral of a group is retained after
being multiplied by the number of integrals originally in the group.
Then the Fock matrix is constructed from this reduced list of integrals according to the method of Paragraph lB without modification for molecules whose symmetry groups possess one dimensional
irreducible representations only. The use of the theorem is based
on the fact that the Fock operator is a totally symmetric operator
and xhus its integrals between symmetry adapted orbitals of the
same symmetry have a totally symmetric integrand. In the case of
molecules whose symmetry groups possess multidimensional irreducible representations, the integrals in the diagonal blocks of the
symmetry blocked Fock matrix do not in general have a totally symmetric integrand and correct matrix elements can only be obtained
by averaging these integrals [34,36].
236
A. VEILLARD
Molecule
Point group
Mn 4
29
Td
39
94830
304590
46848
288486
Nb. of
26310
15132
uni~ue
non-zero integrals
8.8 sec.
62 sec.
4.6 sec.
4.5 sec.
uni~ue
Table 8. SCF timings (on an IBM 370/165) with and without symmetry
(from Ref. (34])
A somewhat different approach has been proposed by Dacre to
reduce the integral list [37] and was latter extended by Elder
[38J. However it leads to a more complicated algorithm for processing the shortened list of integrals since all symmetry related
contributions to the Fock matrix are generated from the one formed
from the reduced list by applying symmetry operations.
4 - Se~uence of operations and flow-chart for a SCF closed-shell
program.
A simplified se~uence of operations for a SCF closed-shell
program would be :
1. Read input tlata (including trial vectors).
2. Set the SCF threshold at 10-6
3. Read Sand H from tape.
4. Schmidt orthonormalize the trial vectors if needed.
5. Write input data.
6. Compute the closed-shell density matrix.
7. Read the
<p~lrs> integrals from tape and compute the P matrix.
8. Compute the one and two-electron energies and the F matrix.
9. Compute the total energy.
10. Diagonalize the overlap matrix if needed (first iteration if
Lowdin orthogonalization is used).
237
11. Compute the F' matrix (Lowdin orthogonalization or tranformation in the basis of molecular orbitals).
12. Solve the eigenvalue equation (Jacobi diagonalization).
13. Back transformation for the eigenvectors.
14. Order eigenvectors and eigenvalues.
15. Test for convergence.
16. If self-consistency is achieved, print and punch the final resuIts.
17. I f not, test for the number of iterations; Multiply the SCF
threshold if needed and extrapolate.
18. Test for interrupt. If no, go back to 6 (calculation of the
density matrix).
The corresponding flow-chart is represented in Fig.2. Some illustrativetiming for a relatively large calculation is given in
Table 9. One will notice that eighty per cent of the CPU time goes
into the computation of the P matrix from the list of two-electron
integrals and that the Schmidt orthonormalization is a relatively
expensive process. We have considered both the diagonalization of
the F matrix after a Lowdin orthogonalization (step 4a in Table 9)
according to the method of Paragraph 2A and the diagonalization of
the F matrix expressed in the basis of trial molecular orbitals
(step 4b in Table 9) according to the method of Paragraph 3C.
Clearly this latter procedure is more economical, with respect to
both the matrix manipulation and the matrix diagonalization.
1.
4.9
.1
10.3
2.3
1 .0
1.1
.4
.6
12.1
238
A. VEILLARD
threshold=10- 6 1
I<pq Irs> I
I
z;jJ
].Irs
=.1
2 Apq
Apq
Apq Apq
H D
Apq Apq
Apq
F
== H
Apq
+P
Apq
E=E +E +E
1 2 nuc.rep.
I
---J
rExtrapolatio~
lEnd
~-~--
. .
~
239
~0 )
(124)
c
and is assumed to be orthonormal, so that the two sets ~ and ~
are orthonormal and mutually orthogonal. In referring toCthe ingividual orbitals, we use the indices k and I for the closed-shell
orbitals, m and n for the open-shell orbitals, i and j for orbitals of either set. It is assumed that there is at most one openshell per irreducible representation (denoted A and ~).
~
(~
+.1 E E
2
A~
ENC
C
~
NC
C
Ak Akp Akq Apq,~rs ~l ~lr ~ls
kl pqrs
(126)
(closed-shell
closed-shell)
+1 E E
2
(closed-shell
open-shell)
+.1 E E
2
9Apq,~rs NC
C
~l ~lr ~ls
(open-shell
closed-shell)
+-1 E E
A~
A~
1
2
A~
E
A~
E NAkC
C
Akp Akq
kn pqrs
E NC
C
Am Amp Amq
1m pqrs
ENC
C
mn pqrs Am Amp Amq
Apq,~rs
NC
~n ~nr ~ns
E
ENC
C
<?
NC
C
mn pqrs Am Amp Amq Apq,~rs ~n ~nr ~ns
(open-shell
open-shell)
240
A. VEILLARD
Am
= N
~n
~,
-(\,
~pq,~rs '{~pq,~rs
Q. defined through
= a,
~~
<pqlrs>
-.....6.l:!...4 prlqs>+<pslqr
( 121 )
yielding
QApq,~rs
( 128)
with
( 129)
i) for a free radical with one unpaired electron, the total wavefunction of the system will be a single Slater determinant
1P =
(2n-1)~
L J:
E=II.+ I J .. - I'K ..
i l i<j lJ i<j lJ
(131 )
a=b=O
a=B=1
f=l2
( 132)
N =1
Am
241
aa SS as Sa
'\
( 133)'
m(Il)~n (v)U
1\ ..............1"m(11)<1> n (v)l\
1/11 1/1 2 1/1
~d
o~ly
1/11
an~
1/1 2 are
e:genf~ctlons of S2. From 1/1 3 and 1/14 we can bUllt two elgenfunc-
tlons of S2
'31/1 =
(1/13+1/14)
( 134)
( 135)
= E(31/1)+2K
mn
1/11, 1/12 and 31/1 are the three components of the triplet state. 11/1 is
the wavefunction associated with the singlet state. Since it is
assumed that the two open-shells belong to different representations (this is required to insure the orthogonality condition between the open-shell molecular orbitals) expressions (135)
of the energy may be fitted into the form of equation (125) with
the following choices :
f_1 a =a =a =1
2 U
All 1111
b,,=b
f=..! a =a =b =b =0
2 AA 1111 AA 1111
It It
All
=b
1111
=2
b
iii) The case where the open-shell set includes degenerate orbitals
will be handled in a similar way. For instance a system of three
electrons in two degenerate orbitals will be described by the wavefunction
242
A. VEILLARD
( 136)
lS
easily found
( 137)
or
Am
= ~
8 In
. order to have f2 x 2 x 2 aJ 12
a 12 = b 12 = 9
(the summation includes two terms mn and nm)
8 .
9 In
order to have f
x 2 x aJ 11
2J 12
= J 11
E(H
with P
Apq
Apq
+P
Apq
C,Apq
Apq
)C
=E S
Amq I
C
8
IN +E SApqCAnq 8 Amn IN Am
Apq Alq AIm Am
nq
( 139)
(140)
L:
(141 )
]Jrs
]Jrs
-Q
(142)
243
D
D
O,Apq
= L: N
AmCAmp CAmq
C,Apq
T,Apq
+D
(144)
O,Apq
=<1>
(145)
U
C C
where Uc and Uo are two unitary matrices which eliminate the offdiagonal multipliers 8 lk (closed-shell closed-shell term) and 8
.
.nm
( open-shell open-shell term.
) Such a transformatlon
cannot ellminate the off-diagonal multipliers 8 nk and 8 1 which coup~e the
closed and open-shells. However, the open-sheTI SCF equatlons are
still reducible to pseudo-eigenvalue problems and this is achieved
by reexpressing the closed-shell open-shell coupling terms in such
a way that those terms can be absorbed into the left-hand sides of
the equations. This is done by introducing two new square symmetrical matrices
(146)
R
O,Apq
L: (S
NCA-N OA uw
+Q
S)
( 147)
FCC
ESC
(148)
C
=E
L: S
C
or
Apq Amq
OApq Amq Am
q
FOC
ESC
( 149)
L: FCA
L: F
where~
,Apq
O,Apq
+P
+R
H
Apq Apq 0 Apq
( 150)
H
+P
-Q
+R
Apq Apq Apq C,Apq
(151 )
( 152)
However it has been shown that this single eigenvalue method can
give an energy that is not invariant to a unitary transformation
of the orbitals, hence leading to possible discontinuities in the
potential energy curve when different symmetries are investigated
244
(for instance C3v and D3h )
A. VEILLARD
[41 j .
245
<
246
A. VEILLARD
.J2
B. The organization of an open-shell SCF calculation. As mentionned above,the following organization was implemented with
enough fast core to accomodate three square matrices (with the possibility of storing two half-matrices considered as linear arrays
in the place of a square matrix). The corresponding sequence of
operations for a SCF open-shell program is
[12] :
6.
7.
8.
10.
11.
12.
13.
14.
15.
16.
17.
18-21.
247
.5
2.4
1
9.3
2.(1
.8
5
.6
17.4
CONCLUSION
In the above pages, we have emphasized both the theoretical
methods and the computational techniques which have been largely
in use, during the last ten years, for large-scale molecular calculations (for molecules with up to 30 atoms and basis sets in
the range of 50 to 150). However, a number of other methods and
techniques have been proposed in the litterature, which have been
less currently in use and for which less experience has been gained so far. We feel worthwhile to mention them briefly, since some
may gain more favor in a near future (or possibly have already
been favored in some specific cases).
The SCF equations as given by Roothaan, both in the closed
and open-shell cases, are solved by the pseudoeigenvalue method,
a technique which is at best linearly convergent. Numerical methods have been proposed for solving the SCF equations which are
based on the Newton Raphson iterative method (namely expanding
the SCF equations in terms of first order changes oC and then
solving the resulting equations for the optimum new oC) subject to
the constraints imposed by a Schmidt orthonormalization, the method being quadratically convergent [42J. In the steepest descent
method, one abandons the determination of the individual orbitals
and the total energy is minimized subject to the idempotency condition of the density matrix [43}. Direct minimization of the
energy functional has been based on unconstrained optimization
techniques through the incorporation of the orbital orthonormality
restrictions into the expresslons for the energy gradient [44-46J.
248
A. VEILLARD
[11
See for instance: a) R. McWeeny and B.T. Sutcliffe, "Methods of Molecular Quantum Mechanics", Academic Press, 1969
b) F. L. Pilar, "Elementary Quantum Chemistry", McGraw Hill,
New-York, 1968.
C.C.J. Roothaan, Rev. Mod. Phys., 23,69(1951).
C.C.J. Roothaan, Rev. Mod. Phys., 32,179(1960).
J.A. Pople and R.K. Nesbet, J. Che~ Phys.,22,571( 1954).
G. Berthier, J. Chim. Phys., 51,363(1954). -G. Das and A.C. Wahl, J. Chem-.-Phys.,44,b7(1966).
A. Veillard and E. Clementi, Theoret.-Chim. Acta,7,133(1967).
D.R. Hartree, Proc. Cambridge Phll. Soc.,24,89 and 111(1928).
v. Fock, z. Physik,,l, 126( 1930).
-D.B. Neumann, H. Basch, R.L. Kornegay, L.C. Snyder,
J. W. Moskowitz, C. Hornback and S. P. Lei bmann, "The Polyatom
(version 2) System of Programs", QCPE 199, Indiana Unlversity,
Bloomington, Indiana 47401, USA.
G.H.F. Diercksen and W.P. Kraemer, "MUNICH, Molecular Program
System'; Reference Manual, Special Technical Report, Max-Plank
Institut fur Physik und Astrophysik, Munich, Germany.
A. Dedieu, J. Demuynck, A. Strich and A. Veillard, "Asterix :
a system of programs for the Univac 1108", unpublished work.
249
250
A. VEILLARD
Bjorn Roos
1.
Introducti on
For many such reactions" the HF method has proven to give results in good
agreement with experiment ("good" here means that the error is of the order
of 1 kcal/mole). Example are: energies connected with conformational changes
(inversion and rotation barriers, cis-trans isomerism, etc.); hydrogen bond
energies; proton affinities; hydration energies, etc. Common to all cases
for which the HF method is applicable is that no drastic electron rearrangements take place while the system changes from state A to S: no electron
pairs are broken or formed, and the molecular orbitals are only slightly
modified. If these conditions are fulfilled, the error connected with the HF
approximation usually remains almost constant during the structural changes of
the system.
Many interesting problems, however, do not belong to this category. It
Diercksen et al. (eds.), Computational Techniques in Quantum Chemistry and Molecular Physics, 251-297.
All Rights Reserved. Copyright 1975 by D. Reidel Publishing Company, Dordrecht-Holland.
252
BJORN ROOS
suffices to mention examples such as: ionization and excitation energies, and
especially chemical reactions involving the formation (or rupture) of
chemical bonds. Furthermore the HF method will be inappropriate for most
problems involving very accurate studies where the error is required to be
smaller than 1 kcal/mole.
Thus, even if the HF method gives more than 99 % of the non-relativistic
energy, the remaining error is often of chemical significance. Let us take
an example. The HF energy for the N2 molecule has been calculated to be
-108.99 a.u., which is 99.5 % of the total non-relativistic energy
(E NRL = -109.54 a.u.). A nitrogen atom has a HF energy of -54.40 a.u.
(E NRL = -54.59 a.u.). Using these energies we can calculate the binding
energy for the N2 molecule. In the HF approximation a value of 0.19 a.u.
(5.17 eV) is obtained. The experimental value is 9.9 eV. Thus while the
errors in the total energies are only around 0.5 %, the calculated binding
energy is in error with as much as 48 %.
What is the cause of this error? In the independent particle model, which
forms the basis for the HF approximation, one writes the total wave function
as a product of spin-orbitals. Thfs is equivalent to the assumption that the
motion of one specific electron is independent of the instantaneous positions
of the other electrons. The electron motion is said to be uncorrelated.
Energetically this is expressed by the form of the Coulomb operators in the
Hartree-Fock operator. An electron interacts only with the average field of
the other electrons. This approximation is good as long as the electrons are
far apart from each other. Suppose, however, that we take two electrons,
which originally are far apart, and pair them in the same molecular orbital
(thus forming a chemical bond). In the beginning they do not interact at all.
After the reaction, they occupy the same space and are in close contact with
each other. I n the HF mode 1 the ca 1cul a ted repu 1s i on energy i s no\~ too 1a rge
since the model does not take into account the instantaneous interaction and
therefore overestimates the probability of finding the two electrons close
together. Thus, we can explain the error in the HF binding energy for N2.
Three new electron pairs are formed corresponding to an increase of the
error of the HF energy of 4.8 eV (or 1.6 eV per electron pair).
Since a determinental wave function obeys the Pauli principle, two
electrons with parallel spin cannot occupy the same molecular orbital. The
motion of two such electrons is therefore correlated already in the HartreeFock model; and this correlation is expressed through the appaerence of the
exchange integrals in the energy expression. This effect is usually called
Fermicorrelation. The fact, that the probability of finding two electrons
with parallel spins in the same space point is zero, has an important implication. The probability of finding three electrons simultaneously close together
253
will be small since at least two of them must have parallel spins. The
electron motions will therefore correlate pairwise, which can, as will be
seen later, be used to simplify the treatment of the correlation effects.
The above discussion has referred to the so called dynamical correlation
between the electrons. In certain cases the Hartree-Fock model .breaks down
due to degeneracies or near degeneracies between different Slater determinants.
Such effects often occur when one tries to calculate energy surfaces. In the
above mentioned example, we compared the HF energy of a N2 molecule to the
HF energy for two separate nitrogen atoms. If we had instead made an attempt
to calculate the full potential curve for N2 by varying the NN distance, the
error in the binding energy would have been much larger. The reason for this
increase is that a single closed shell Slater determinant is not a correct
wave function for two nitrogen atoms at a large distance, but corresponds to
a mixture of a number of ionized and excited states. In order to obtain a
correct description of two N( 4S) atoms coupled to a 1Lg+ state, a linear combination of a number of Slater determinants must be used. This state of
affairs is sometimes referred to as the symmetry dilemma of the restricted HF
scheme. At large distances the bonding and antibonding molecular orbitals
become degenerate and should have equal occupation numbers. In the single
determinant wave function for N2 , the bonding orbitals 3ag and 11Tu are
completely occupied, while the corresponding antibonding orbitals3au and
iff are empty. The correct wave function, at large distances, has to include
g
determinants where these orbitals are occupied.
Similar effects occur when two orbitals are close in energy but not
degenerate, and when only one is occupied in the HF scheme. In the Be atom,
for example, the 2p-orbital is very close in energy to the 2s-orbital.
The configuration (ls)2(2P)2 has as result of this near degeneracy a large
coefficient in the wave function for the ground state of the beryllium atom.
Strong correlation effects therefore occur for the 2s electron pair. This is
usually called near degeneracy correlation.
Many different methods have been proposed for a quantita.tive description
of correlation effects in molecular systems. In the following chapters we
shall discuss one such method, the method of superposition of configurations,
or simpler, the Configuration Interaction (CI) method.
254
BJORN ROOS
2.
2.1
General Method
The Hartree-Fock energy is an upper bound the the exact eigenvalue of the
lowest eigenstate of the Hamiltonian. The correlation energy is the
difference between these two energies:
The method for calculating wave functions which we are concerned with in
these lectures are those which take into account correlation effects and
thus enable calculating also the correlation energy.
In the Hartree-Fock method for an n-electron problem, the wave function
is built from the n first spin orbitals of an, in principle, complete set:
------1P1'
1P2 '
IPn;
occupi ed
orbitals
------------------
(2)
IPn+1 '
virtual
orbita 1s
where II = (11 1 ,112 , .... ,lin) with "1 <112< .. <lin' and where A is the
antisymmetrizer:
A = (n!)-1/2
L: (-1)P
(5)
Although not necessary we assume that the basis set (2) is orthonorma1ized.
The set of Slater determinants is then also orthonormal, and the normalization
condition for the wave function (3) takes the form
255
10/
IJ.
IJ.
(6)
Hif>.,df
(8)
SIJ.Io/=Jif>IJ.*if>.,df
The equations (6) and (7) give an exact solution to the Schrodinger
equation only if the basis set (2) is complete. In practice, however, we
always have to work with a finite number of spin-orbitals. Further, the
number of determinants which can be generated from this orbital basis is
usually untractable. Consider as an example the water molecule. This is a tenelectron system. In a so called 'double-zeta' calculation, we would use 14 basis
functions and thus obtain 28 molecular spin orbitals. From these spin-orbitals
we can construct (~~) determinants. It is of course impossible to solve secular
problems of this size ( (i~)~ 1.3.10 7). There are however ways by which
the number of configurations can be significantly reduced. First of all the
number of non zero coefficients can be reduced by symmetry. For example,
in the lAl ground state of the water molecule, only those Slater determinants which belong to the irreducible representation Al of the point
spin-orbitals,
group C2v and which have an equal number of a and ~
contribute to the expansion (3). Furthermore, determinants whicn only differ
in the spin part of the orbitals, and thus belong to the same space configuration, are coupled together to form singlet states. This further reduces the number of independent coefficients Cv ' Methods to obtain such pure
spin states will be discussed later. But even after this symmetry reduction,
the number of terms in (3) is, in our specific example, (and in general), too
large to be manageable. A further reduction has to be based on ideas concerning the importance of different configurations. Also the choice of the
one-electron functions (2) is important. When canonical HF orbitals are
used, the expansion (3) is known to be slowly convergent. The search for
a basis set with optimum convergence properties therefore plays an important role in the development of efficient Cl schemes. We will deal with
this problem in more detail in connection with the discussion of the different
CI methods. One optimum set of orbitals for a Cl expansion of a given
256
BJORN ROOS
The integration is performed over the coordinates for all electrons except the
first. If the wave function is given by the CI expansion (3) the density
matrix can be written as
Pi(xi; Xi)
= L L C * Cv
j,Lv
j,L
Pi (j,L II'
IXI; xi)
(10)
and is the density matrix connected with the Slater determinant ~j.L The
non-diagonal terms are zero if the two determinants differ in more than one
spin-orbital. If they differ in just one spin-orbital, the transition
density matrix is given by (say that ~ is different from ~k )
( 13)
where q is the parity of the permutation which puts equal orbitals into
equal positions in the two determinants.
The density matrix (9) can thus be written in the form
(14)
257
Xt)
(15)
where the summation is taken over all determinants which contain the spinorbital I/J i .
Using relation (14) and the definition (9), we arrive at the following
important properties of the density matrix:
(16 )
Trp
=n
( 17)
(18 )
The density matrix is thus Hermitian; every diagonal element lies between
zero and one; and the sum of the diagonal elements equals the number of
electrons.
Since p is Hermitian, it is possible to find an unitary matrix U
which brings this matrix to diagonal form
n
where n
obta i n
= ut p u
(19)
or
(20)
(21)
258
BJORN ROOS
o<
n. < 1
1
and
L: n. = n
i 1
(22)
When the CI-expansion (3) is reduced to one term (Hartree-Fock approximation), the occupation numbers are either
or 0, depending on whether the
corresponding orbital is present is present is the HF wave function or not.
Also in this case, the HF orbitals are themselves the natural spin-orbitals.
In a general CI expansion the HF determinant is often the dominant
term with a coefficient>0.9. In such cases, the n first occupation numbers
are only slightly lower than one, and the corresponding natural spin-orbitals
to a large extent resemble the HF orbitals. The rest of the natural spinorbitals have small occupation numbers and describe correlation effects.
We can form a spin-less density matrix by integrating equation (21) over
the spin variable. The orbitals for which this density matrix is diagonal are
called the natural orbitals. They have occupation numbers between 0 and 2.
Example:
The HZ molecule.
Hagstrom and Shull studied the natural orbitals for the ground state of this
molecule (Rev.Mod.Phys.~, 624 (1963)). They used a 33 configuration wave
function and obtained the following natural orbitals and occupation numbers
(only the first six are given here):
10'
10'
11T
2ag
11Tg
3ag
1.9638
0.OZ04
0.0086
0.0060
0.0004
0.0002
259
( L f (i) )
i=i
=i
Ln. (A.I
ll
f 1A . )
(23)
L
aiJ' fi (xi) gJ' (,.)
i,j=1
(24)
where T is a common designation for the variables x 2 ,x3 '. ,xn ' and fi
a set of r orthonormal one-particle functions.
Our task is now to find the set of functions f i (x 1 ) which minimizes
the least square error
is
(25)
where 'It is the exact eigenfunction of our Hamiltonian. In order to do this,
we introduce the functions
(26)
260
BJORN ROOS
(27)
Introducing (26) into (24), we can write the matrix element (25) in the
following form
'>=NI'l!)
r
r
*
r
2
-!: c('l!If.h.)
-L: c. (f.hI'l!)
+!: Ic1 . (28)
i=i 1
1 1 x 1 ,r i=i 1
1 1
xi,r
i=i 1
xi,r
(29)
Xi
where integration now takes place only over the first coordinate xi'
After some rearrangement (29) takes the form
r
'> = NIIJi)
xi' r
(30)
SJ [S 'l! * (xi',r) IJi (Xi' r)] fi (x1') fi* (Xi) dXi dx1'
=.!:
1=1x i xi
The term within the bracket is the exact first order reduced density matrix,
and we finally obtain:
r
JS
261
(32)
This sum takes its largest value when the functions fi are chosen as
the r natural spin-orbitals with the highest occupation numbers. This
choice thus minimizes the least square error (25), and the 'best' finite
expansion of the type (24) is obtained as
r
CP(Xi,T)=L:
a .. A.(xi)g(T)
i,j=1 1J 1
J
(33)
The rest of the proof follows directly from the antisymmetry property of
the wave function. Suppose that Ak (x 1 ) is a natural spin-orbital orthogonal
to all orbitals present in (33) (i .e. k> r). This gives
CP,
or
(34)
It follows that each integral in this sum must. be zero. The functions
can be expanded in natural orbitals as follows:
h.
(X)
The same procedure can be repeated for all the coordinates, and thus we
finally obtain the following 'optimum' expansion
262
BJORN ROOS
10
15
20
0.1
J.2
a.u.
Fig.
263
the 20 virtual orbitals, was then used to construct natural orbitals. These
orbitals were ordered after decreasing occupation numbers, and a new set of
CI calculations was performed with one natural orbital added at a time.
The result is shown in curve b. As can be seen from the curve, there is a
clear correlation between the occupation number and the energy gain. Actually
the ten first natural orbitals yields 93 % of the total correlation energy
while the ten first virtual orbitals yields only 50 %. It is obvious that the
use of natural orbitals as basis functions considerably improves the convergence of the CI expansion.
Usually the natural orbitals are not accessible from the start, so one
might think that the optimum properties of these orbitals has no practical
implications. There exists however methods by which we can obtain
approximate natural orbitals which can be used as basis functions. We will
come back to a discussion of some of these methods later.
One approach, which utilizes the benefits of the natural orbitals, is the
Iterative Natural Orbital (INO) methods introduced by Bender and Davidsson
(1966). As the name indicates, the natural orbitals are in this method obtained by an iterative procedure according to the following scheme:
1. The basis functions used are the canonical HF orbitals. A set of configurations is selected and a CI calculation is carried out.
2. The density matrix is then calculated using the CI wave function obtained
in the preceeding step. This density matrix is diagonalized to obtain
the natural orbitals.
3. These orbitals are used to construct a new set of configurations and a
new CI calculation is carried out.
4. Steps 2 and 3 are repeated until an energy minimum has been reached.
A difficulty with this scheme is that one has to carry out several CI
calculations (usually from three to five) before an energy minimum is
reached. Since each calculation is, as we shall see later, rather time consuming, the INO method easily becomes very expensive. On the other hand,
significant energy improvements can be obtained with rather short expansions.
The method is similar in philosophy to the MC-SCF method. Actually, if
only the HF determinant plus all singly excited configurations are included
in the expansion, the INO method becomes an efficient tool for determining
the HF orbitals.
264
BJORN ROOS
l I e
}.
(36)
The orbitals are assumed to be symmetry adapted, i.e. to form a basis for
irreducible representations of the spatial symmetry group of the system.
The orbital set is divided into two subsets: an internal orbital set
(CPi j i=1, ni and an external set {CPi j i = ni + 1, ni+n e}.
This division
of the orbitals is closely related to the classification of the different
terms in the CI-expansion. Orbital sets can be obtained in many different
ways, Usually the CI calculation is preceded by a HF calculation, and the
orbitals can then be chosen as the canonical HF orbitals or some suitable
linear transformation of them (for example: localized orbitals or approximate natural orbital s). If the HF orbitals form the basis set, the internal
set must at least include the occupied orbitals, and the virtual orbitals
(or a part of them) then form the external set.
265
Of these, in the HF approximation, the five first are occupied; and the
remaining two, empty. Thus the internal set can be chosen to consist of the
fi ve fi rs t members of [cp}.
In other cases the internal set is extended beyond the occupied HF space.
If we, for example, want to make a calculation of the full potential curve
for the F2 molecule (HF configuration (1a)2(1a )2(2a )2(2a )2(11T )4(11T )4(3a )2),
g
u
g
u
u
g
g
we would include at least also the orbital 3au into the internal set, even
if this orbital is not occupied in the single determinant HF wave function
for the ground state. The reason is that at large FF distances, the orbitals
3a
and 3au become degenerate, implying that for a correct description
g
of the wave function at the dissociation limit, we must include a determinant in which 3ag has been replaced with 3au
If natural orbitals obtained from a previous study form the orbital set,
the internal orbitals include natural orbitals with large occupation numbers.
The spin-orbital basis is constructed from the orbital set by multiplication with either an a
or a ~ spin function. This set thus includes
2(n i +n e ) one-electron functions.
A configuration is now defined as a set of occupation numbers for the
orbita 1s:
(38)
Each of the occupation numbers can take the values 0, 1, or 2; and the
sum of the occupation numbers must be equal to the number of electrons.
(39)
4a l is
(40)
3a l lb 2 -4a 12b 2 , by
(2,2,1,1,2,1,1)
etc.
(41 )
266
BJORN ROOS
~ (n.)
1
Slater determinants by different choices of the spin functions. For the configurations (39)-(41) we can form 1,4 and 16 Slater determinants respectively.
The Slater determinants are in general not spin eigenfunctions. In order
to obtain functions which are eigenfunctions of the spin operators, we have
to make a linear transformation of the original set. The new set of functions
are called Configuration State Functions (CSF). The CSF's will thus in
general be linear combinations of Slater determinants.
= 3a 1
and CP6
= 4a 1)
(42)
where we have for simplicity omitted the doubly occupied orbitals. There
are many ways in which we can obtain spin eigenstates from the set (42). One
way, which is used in some CI programs, would be to simply diagonalize the 82
matrix for each value of ~. One could also use the projection operator
Os
A
82 -
S' (S' + 1)
(43)
267
~1
that
is the third triplet state function and has Ms=O. The Orthogonal state
function
In the general case a branching diagram can be used for calculating the
number of spin eigenstates which can be constructed for a system with a
given number of unpaired electrons (cf. fig. 2).
Total spin S
5/2
2
3/2
1
1/2
0
~
__r-__ __+-__1-__
~
Number of
-r__-+________________~unpaired
~__
electrons
Fig. 2 The branching diagram. The number-of spin eigenstates with eigenvalue S for n unpaired electrons is obtained as the sum of the
number of states with S'=S+1/2 and S'=S-1/2 for n-1 unpaired
electrons.
268
BJORN ROOS
... = CO~O
+ '"
'" a
Ld.. C,
i a 1
a",,,,
+ {., {., Cooab
i , j a, b 1J
~.
1
ab +
~ ..
1J
(44)
Example:
For the ground state of H20 a C1 expansion could take the form
(45)
is here the HF ground state CSF. The orbitals are numbered according
to their order in the orbital set (37). For simplicity only excitations from
the orbitals 3-5 are included in (45). Notice that only those CSF's which
belong to the irreducible representation Al of the point group C2v contribute to the wave function. Thus only two single replacement CSF's contribute.
Notice also that there are two singlet CSF's for the configuration 43 ~76
3a 1)(1b 2) ~ (4a 1) (2b 2)) .
These are labelled with 1 or 2 as a left upper
index. All the other double replacements correspond to closed shell configurations.
~o
269
(46)
Il
where each state Ill) corresponds to one of the CSF's 110) to ~o) .
All the CSF's are eigenfunctions to HO with eigenvalues <IlIHI/l')'
In the following discussion we will assume that the CSF's are orthogonal to
each other and to ~O. The perturbation part of the Hamiltonian is defined
"
"
as the difference between Hand
HO
= r
I /l)
C(n) Ill)
(48)
1110 p.
. h
..
.
Using n'th order Ray1elg
-Schrodlnger
perturbatlon
t heory,
ta i ne'd from the equa t ion
,T,(n)
~
is ob-
(49)
By inserting the expansion (48) into (49), we obtain for the coeffic,ients
C(n)
Il
n-2
r E k c(k) - r
k=1
nIl
v
IJ.I Hi I v)
- Ei 0
IlV
c~n-1)
(51)
270
BJORN ROOS
En = E C{n-1) ( 0
J.L
Il
1H 1J.L >
(52)
For the first order coefficients and the second order energy, we obtain
the usual expressions
(53)
(54)
Notice that diagonal elements of Hand Ho are equal, as are also nondiagonal elements of
and
To first order in perturbation theory only those CSF's which have a
non-zero matrix element with at least one of the CSF's in the reference state
thus have a non-vanishing coefficient in the CI expansion. The part of the
configuration space which fulfills this condition is called the first order
interacting space.The Hamiltonian
has the form
Hi.
An,..
.E
1=1
where
h(i)
h(i) +
"2
. '"
(55)
.E. g(i,j)
1, J
g(i,j)
a two-particle operator.
271
even this truncated CI expansion can become very long. We shall later discuss
methods which can handle expansions with a length having an order of 50 000
terms; but even this number is easily exceeded for larger systems (as an
example: with ni = 10 and ne = 56 a total of 56 268 singly and doubly
excited CSF's was obtained in a study of the water dimer). We must therefore
look for methods to further reduce the length of the expansion. We have
already mentioned some possible ways to reduce the number of orbitals in the
external set by the use of approximate natural orbitals. One way to obtain
such orbitals is to calculate the density matrix by means of first order
perturbation theory. The CI coefficients are obtained by the simple formula
(53); and the density matrix elements are obtained by inserting these coefficient into equation (10).
Another frequently used method to reduce the number of terms in the CI
expansion is to include only those configurations which contribute to the
second order energy (54) with an amount larger than a given threshold. An
extension of this method is the CIPCI method (Huron et~. 1973). The
procedure used in this method can be briefly outlined in the following way:
1. The first order coefficients c~l) are calculated
2. A subspace s(l) is defined which includes the CSF's with c (1) 1arger
J.L
than a given threshold; and the corresponding CI matrix is diagona1ized.
3. The wave function w(l) obtained in this way is taken to be the new reference
state. This wave function is again perturbed under the influence of the
CSF's not included in S(l). The most important of these new CSF's are
added to s(l), giving a larger subspace S(2).
4. Step 2 and 3 are repeated until the quantity
I <wen) I If I w(n
+ E~n) - (w(n-i)
I II I w(n-i
- E2 (n-i)
is small enough. Here E~n) and Er- i ) are the second order energy
contributions from CSF's not included in w(n) and w(n-1), respectively.
In this method the most important CSF's are treated variationally, while
contributions from the others are obtained by means of perturbation theory.
Another limited type of CI wave function is the first order function (not
to be confused with the first order interacting space), developed by
Schaefer et~. (1969). The internal orbital set includes in this case all
valence shell orbitals and the reference state consists of all CSF's (of a
given symmetry) which can be generated from the internal set. In this way all
degeneracy and near degeneracy effects are automatically included into the
reference state. In addition to the reference state CSF's, all single
272
BJORN ROOS
Example:
Zag' 20'u' 30'g' 11Tu ' 11Tu ' 11Tg , 11T g , 30'u
(In general the valence shell orbitals are those which can be constructed
from a minimal atomic orbital basis set.)
In the HF approximation only the seven first orbitals are occupied. If we, for
simplicity, include only excitations out of 30'g
and 11Tu '
we obtain the
following reference state configurations:
2
(11T )2
g
(11Tu)2 -
(111 )2
g
(30'g)2
(30'U) 2
( 11Tu)2 -
(30'u) 2
(30'g)
(30' ) (111 )
g
U
(30:) (111 )
U
g
2
2
(30' ) (11T)
g
u
(3au )2(11Tg)2
(30'g) 2 (11Tu) 2
(11T ) 4
g
(3ag> (11Tu)3 -
(3au ) (11Tg) 3
(11T )4 _
U
(3au ) 2 (11T g) 2
(11T )4 _
u
(11Tg) 4
it;
273
First order wave functions have been used to calculate potential energy
surfaces and also to study atomic hyperfine structure. The single replacements in the latter case account for the polarization of inner shell orbitals
in open shell systems which play an important role in the determination of
the hfs parameters.
2.3
(56)
ab
where <Pp
is the state function corresponding to an excitation of the
electron pair P into the virtual orbitals CPa and ~ . There are three
distinct types of pairs P. If the two electrons belong to the same molecular orbital CPi' they couple to a singlet; but if they belong to different
orbitals CPi and Wj , both a singlet and a triplet pair can be formed.
The following types are obtained:
Intraorbi ta 1 singlet pairs,
Interorbita 1 singlet pairs,
Interorbita 1 triplet pairs.
These electron pairs are called spin irreducible since they have the
correct spin symmetry. The orbitals CPa and ~ now couple to the hole
state p to form a singlet state <p~b
For CPi different from CPj we can
then form two doubly excited CSF's.
274
BJORN ROOS
where
(60)
where we have put Co equal to unity. HOO is the HF energy so the correlation energy is obtained as
E
carr
= E
(62)
since only the double replacement states will contribute when ~o is taken
as the HF wave function. However, even if the equations (62) and (57) look
identical, there is an important difference. The coefficients in (62) have
been obtained from the secular equation (6) with use of the full expansion
(44). These coefficients then implicitly include higher order terms, thus
taking into account orbital effects (single replacement states), pair-pair
interactions, etc.).
As was mentioned earlier the calculation of the correlation energy with
the CI method gives rise to difficult computational problems due to the size
275
i,j
1)
L:
i,j,k
(63)
U" k +
1)
where Ui' Uij , etc. are functions in which one, two, etc. spin-orbitals are
replaced with arbitrary one-, two-, etc., electron functions, for example
(64)
This expansion, the so called cluster expansion of the wave function, was
introduced by Sinanoglu (1963); and in a somewhat different form, by Nesbet
(1958). Considering only the pair correlation functions uij ' Sinanoglu, by
applying the variation principle, derived an effective two-electron
Schrodinger equation for U1)..
(65)
The total energy is then obtained as the HF energy plus the sum of all
pair correlation energies Bij The many-electron problem is thus reduced
to in(n-l) two-electron problem, that is, each pair correlation is treated
separately.
Equation (63) becomes identical to a CI expansion of the wave function if
the cluster functions are expanded in terms of configurations. Nesbet used
this result to formulate the many-electron problem as a hierarchy of BetheGoldstone equations. In the first step each pair of electrons is considered
separately with a trial function of the following type
a
b b
'l-'.. =<Po+ZC.<t.+zc.<t.+Z
1)
b))
a, b
ab ab
C .. <t ..
1)
1)
(66)
Thus >ij
includes only excitation out of the pair (i ,j). The difference
between the energy obtained with this trial function and the HF energy is
defined as the pair correlation energy. The total energy is again obtained
as a sum of the pair energies. In the next step, if carried out, an expansion
of the form (66) is considered for each triplet of electrons, and so on.
The final equation is, of course, identical to a full CI expansion. The
power of the method lies in the fact that higher order contributions very
quickly become small and can be neglected.
A similar method called the ~seudo ~atural Qrbital - Independent Ilectron
~air ~pproximation (PNO-IEPA) has been developed by Kutzelnigg and co-workers
276
BJORN ROOS
(Ahlrichs and Kutzelnigg, 1968). The pair correlation energies are here obin terms of pseudonatural
tained by an expansion of the pair function >Itij
orbitals. The prws are the orbitals which diagonalize the first order density
matrix for each pair function. The C1 expansion in terms of these orbitals
takes an especially simple form (Lowdin and Shull, 1956):
(67
Thus for singlet pairs (S), only diagonal terms enter the expansion while
for triplet pairs the natural orbitals occur in degenerate pairs (b,b').
The length of the expansion is now reduced from ne (n e+I)/2 terms (ne being
the size of the external space) to ne terms. Obviously the problem of
determining the C1 coefficients for each pair function has been made much
easier. The PNO's are obtained by applying the variational principle to the
expression for the total energy. This leads, after some further approximations,
to a pseudo eigenvalue equation which determines the PNO's.
The independent electron pair approximation, characterized as a method in
which the correlation energy is obtained as a sum of pair correlation energies,
has however certain features which restrict its practicability. Most important,
an upper bound to the exact energy is not obtained. Thus, it might happen
that more than 100 % of the correlation energy is obtained in an accurate
calculation. Recent experience also seems to indicate that the 1EPA overestimates the variation of the correlation energy with structural changes of
the system (e.g. bond distances become too long and force constants too
sma 11).
An interesting extension of the 1EPA-PNO method, called the PNO-C1 method,
has recently been proposed (Meyer, 1973). He writes the total wave function
in the following form (for Simplicity we only consider the closed shell case):
>It= .po+ ~ L: c~.a S.p~.a +
1 a..
11
11
II
I<J
ij
IJ
IJ
IJ
IJ"
(68)
whwre the sums over a .. , a .. and extend over the PtlO's for the intraorbital
II
IJ
pair (i,i) and the interorbital singlet and triplet pairs (i,j) (for
simplicity the indices
and ij have been left out where a and b appear
as upper indices). There are important differences between this type of
C1 expansion and the one we have discussed before. Since pseudonatural orbitals
form the orbital set only diagonal substitutions occur. However, this is
only possible with non-orthogonal orbitals. There is one set of PNO's for
each pair (i,j). Each set is orthonormalized:
277
(69)
The correlation orbitals for different pairs (i,j) and (k,l) are in general
not orthogonal, however. The orthogonality relations (69) are sufficient to
guarantee the orthogonality between the different configurations occurring
in (68). The calculation of the matrix elements is therefore only slightly
more complicated than in the case with a completely orthonormalized basis set.
The diagonal form of the C1 expansion is exact only if the PNO's are obtained from an over-all C1 calculation, but such a procedure would make the
whole method meaningless. Meyer has however shown that the PNO's used in
IEPA are good approximations to the exact PNO's. The computational procedure
in the PNO-CI method can be summarized as follows:
1. The coefficients c~b are calculated for each pair P by a perturbational
procedure. The pair functions Wp are set up and the corresponding density
matrix is diagonalized to obtain the PNO's.
2. A CI expansion of the type (68) is formed using the PNO's with occupation
numbers larger than a given threshold.
3. The CI secular equation is solved to obtain the correlation energy and the
Cl coefficients.
There are several computational advantages with this method. The CI
expansion becomes orders of magnitudes shorter when only diagonal substitutions
occur. The number of two-electron integrals necessary for the calculation of
the matrix elements is also drastically reduced, which simplifies the transformation considerably. This simplification however partly counterbalanced
by the somewhat more complicated forms of the matrix elements, which is due
to the use of non-orthogonal orbitals. The main drawback of the method seems
to be the difficulty to extend it to general spin symmetries.
The PNO-C1 method has been tested in calculation on a number of hydrides.
For CH 4 83 per cent of the correlation energy was obtained. Studies on a
number of diatomic hydrides yielded bond distances, force constants and
binding energies in very good agreement with experiments, the errors being
less than 0.005 A, 100 cm- l and 0.3 eV, respectively.
278
BJORN ROOS
3.
Computational Procedures
3.1
where X", are the basis functions, and the coefficients ci ", are to be
determined. This part of the program usually contains an SCF or an MC-SCF
package. The SCF (or MC-CSF) orbitals are often used themselves as basis
orbitals in the CI calculation. Sometimes unitary transformatiQns are performed in order to obtain, for example, localized orbitals. As has already
been pointed out, CI expansions based on canonical SCF orbitals are usually
slowly convergent. Step 2 therefore often includes routines for calculating
approximate natural orbitals, for example, by using first order perturbation
theory.
Once the orbital set has been chosen, all one- and two-electron integrals
have to be transformed to this new basis set. This transformation is computationally a very complicated problem which only recently has been solved
in an efficient way (Ycshimine, 1973; Diercksen, 1974) with present days
computer facilities, the transformation problem sets a limit to the size of
the orbital set (ni+n e ) at around 100 functions. The details of the transformation process are discussed in detail later.
In step 4 a selection of configurations is made. Not all programs include
this part. In some cases all configurations of a given type are included;
in other cases, the selection is made by hand, and a list of configurations
is given as input. The principles behind the selection of the most important
configurations has been discussed earlier and will not be treated further here.
l. Calculation of integrals
3. Integral transformation
program
4. Configuration selector
5. Symbolic matrix
element generator
6. Calculation of numerical
matrix elements
7. Matrix diagonalization
routines
Fig. 3
279
280
BJORN ROOS
281
where the orbitals ware defined according to (70). The integrals (72)
are obtained through a four index transformation of the integrals (71),
(73)
A direct transformation according to (73) implies that on the order of
mS multiplications and additions would have to be performed, where
m is
the basis set size. Consider, however, a stepwise transformation according
to
(Illli xl)
= I:
(74)
(Illli kl)
= I:
ck
(Illli xl)
(75)
(Ilj I kl)
= I:
c j II (Illli kl)
(76)
(ij I kl)
= I:
(Iljlkl)
c.
III
(77)
II
'"
MIN
+1
(7S)
IJII
nlJlI> K
IlI1
282
BJORN ROOS
Integra 1s with pa i r i ndi ces /J v in the range MIN/J v - MAX/J v can then be
stored simultaneously on the direct access device. In a first cycle it is
then possible to obtain the half-transformed integrals (/Jv I kl) for /JV
in this range.
3. Supposing that N words are available in the core storage, the core
storage is divided into blocks, each with a size of m(m+l)/2, where m
is the size of the basis X/J(/J=1.2 ... m). Each such block can then
store all basic integrals with a given value of /Jv. There will be M
such blocks available in core, where
M
= [ 2N /
m (m+1) ]
4. The integrals are then grouped together so that each group contains M pair
i nd ices /J v . There wi 11 be
Ng = [(MAX/JV - MIN/JV) / MJ + 1
such groups:
group 2
/J v = MIN - (MIN + M - 1)
/JV
/JV
/JV = (MIN + M) - (MIN + 2M -1)
/JV
/JV
group Ng
group 1
MAX
/JV
Indices
NSIZE words
LSIZE
1 word
IADR
1 word
283
6. The integrals can now be read into core again, group by group. The reading
process has been named back-chaining since, starting with one of the
IADR values kept in core, each record read contains the address to the
next record of integrals which belong to the same group. The reading
process for a given group is terminated when IADR = -1.
7. The transformation steps (74) and (75) are performed in core for each
group of integrals, and the resulting integrals (1J.IlIkl)are written onto
peripheral storage sequentially.
8. MINIJ.II is set equal to MA)LII + 1, and a new value for MAXIJ.II is computed
using equation (78). The whole process is repeated until MAXIJ.II~ m(m+l)/2
(the total number of pairs IJ.II).
9. Starting from the 1ist of half-transformed integrals (1J.IlIkl), the whole
procedure is repeated for the transformation steps (76) and (77).
The process requires a minimum amount of I/O time. If the total number
of non-zero integrals is NINT, the integral tape has to be read
[NINT/K] + 2 times. For most computers this number can be kept in the
range 1-10. The process is also not very sensitive to the size of the
core storage. A decrease in storage only means that the number of pair
indices IJ.II within each group of integrals becomes smaller. Thus Ng
increases, and the size of each record is smaller due to the increased
number of accesses, this procedure increases the I/O time somewhat but
does not affect the CPU time.
284
BJORN ROOS
(80a)
(80b)
Example:
follo~ling
Using the rules given above we can construct two singlet BF's for this configuration
(8la)
and
(8lb)
285
Example: For the functions (81a) and (81b). we have n=lO. p=5 and x=3.
Thus each of them can be written as a sum of 22=4 determinant. Inserting
the definitions (80). we obtain for (81a)
(82)
IlI1
IlV
The coefficients a ij
and bijkl
are called Projective Reduction
Coefficients (PRC). They depend only on the form in which the two bonded
functions <P/l and <Pv depend on the orbitals. Formulas for calculating the
PRC for general bonded functions using a diagram technique have been given
by Sutcliffe. The principles are as follows (the reader is referred to the
paper by Sutcliffe for more details):
1. The orbitals QJJ.i
286
BJORN ROOS
Example: The diagram connecting the bonded function (81a) with the HF
determinant
(84)
is, with
---
+1
,
2
-1
+1
,1
+1
-1
2
-,- ,
,
4
+1
-1
6
,
4
-1
'"
+1
,
(85)
-1
5
,~
q,1I =
....
I
,
2,
6,
3,
I I
.....
_-- }
,5
,-
(86)
287
- ....
I I
I I
.1
,2
,
3
[<P7
,
...
3
,5
4
\
7
~--
.. ,
(87)
4. From the diagram we can now generate the matrix element. Each vertical line
is given a parity Pi starting with plus one for the first line, minus one
for the second, and so on, as indicated in the diagram (85), The matrix
elements are obtained accordingly to the following formulas:
(a)
(88a)
(88b)
(b)
(88c)
(88d)
(c)
otherwise,
(88e)
1"1
"'J
1 Vj
IJ
J1.i" J
Yv '> J,
(88f)
288
BJORN ROOS
k
otherwise
1:
if c,o/J
(-1)
,; c,oll
k
otherwise
1:
r
c,o/J ,; c,oll
o +0
/J II
for k'; i
1 [(n-h) /2-mJ
(-"2)
V2')
(89)
where: 0/J and 011 are the signatures of the permutations which bring the
unpaired orbitals back to their original order in 'P/J and 'PII ' respectively;
n is the number of electrons; h the number of chains (ev~n and odd); m
the number of cycles; J the number of pairs for whichc,oH=c,o/Jj , but
C,lJvi'; C,lJvj
(or vise versa). The coefficients qij are given in table 1. Here
Pij is the product of Pi and Pj; a letter D indicates that i and j occur
in different patterns, and a letter S indicates that they occur in the same
pattern.
Table 1.
cycle
cycle
odd chcdn
odd chain
even chain
cycle
odd chain
cycle
odd chain
even chain
Pij
-1
+1
-1
+1
-1
+1
-1
+1
-1
+1
Pattern
qij
-1/2
-1/2
+1
-2
-1
+1
-2
S
S
D
D
-1
+1
289
Example:
Case (c) applies since all patterns are cycles. The overlap integral is zero
since not all orbitals in q,J.L and q,1I are identical (Q i in (88e) is always
zero). There is no one-electron contribution since there are two different
orbitals (Q i in (88f) is always zero).
Calculation of r :
0=
10
m =4
J=2.
= (-1)
0+0
J.L
II
= 0
1 [{10-0)/2-4J
(-'2)
(00
unpaired orbitals)
2
(t/2')
=-1
There is only one term in the last sum of (88f) for which Qij # 0, corresponding to the positions 5 and 8 in the diagram. Thus Pij = -1. Since the two
positions belong to the same cycle (pattern = S), qij is obtained from the
third row in table 1. This yields qij = -1 and
290
BJORN ROOS
processes, which are necessary for obtaining numerical matrix elements from
the list of symbolic matrix elements, is very similar to the one used to
reorder the two-electron integrals in the transformation program.
~(l) = A x(O),
C" =
,.
I:
II ;i P.
(H
IL 1/
- ES
IL II
) C
II
(E S
IL IL
- H
IL IL
(91)
By inserting on the right hand side the coefficients obtained in the (n-l)'th
iteration, equation (91) gives the coefficients of the n'th iteration. The
equation is usually written in a differential form. If we first define the
quantity
I: (H
II
IL II
- ES
IL II
) C(n-i)
II
(92)
we obtain
(93)
One component of the coefficient vector is fixed at unity throughout the
calculation and the others are incremented according to (93).
The normalization constant D and the energy E have to be updated in
each iteration. The computational procedure is the following:
1. A
291
are computed.
2. This step is performed for IJ= 1,2, ... ,N, where N is the dimension of
the problem, but the iteration is repeated from IJ = 1 if \llC(n) \
becomes 1a rger than a given thres ho 1d II
u(n) and II c(n) ~re
max
~
~
calculated according to (92) and (93). The increments to the normalization constant and the energy are computed as
llD(n) = llC(n) [2 L: c(n-1) S
+ llC(n) ]
IJ
v
v
IJv
IJ
(96)
(97)
IJ
-<C
IJ
+a
(98)
292
BJORN ROOS
(99)
where
a = (q H
- f S )
~ Illl
Il Illl
b=H
Illl
c=f
with
Il
= l:
II
IlII
( 100)
Il~
Il
and
II
f" = l: H
,..
II
which minimizes
= -2c
-ES
-q
The value of a
(b -
IlII
E
I b 2 - 4ac\
II
is
(101 )
n
l:
k=O
w(k)
(102)
293
z.:: H
II
j.lll
C (n-i)
II
( 103)
294
BJORN ROOS
with a reference state which is a closed shell determinant and which includes
all single and double replacements:
c.a 1'.a + L:
1
(104 )
i>j a>b
There are in this case six different types of configuration state functions:
( 105)
(one should notice that there are two independent CSF's for the configuration
ij -ab. when i j and alb). Using these CSF's, we can construct 21 different
types of matrix elements. All these elements can be written as simple linear
combinations of one- and two-electron integrals, which can then be directly
inserted into equation (103). The sum over matrix elements is then replaced
with a sum over two-electron integrals. For example, the two-electron contribution to 6a~ can, in the case where both ~ and v refer to doubly
excited configurations, be written as
L: [A/.LV (ikljl) +
k,l
+
A~v
(illkj) }
<~:~b
L: [A
(aclbd)+A'
(adlbc)} C~~-i)d
L: [B
(ailck) + B'
(aclki)} C~~-i)b
J-c
(bjlck)+B'
(bclkj)}C~~-i)
c,d
~v
k,c
L: [B
k,c
L: [C
k,c
L: {C
k,c
~v
~v
/.Lv
~v
~v
~v
~v
IJ-C
I'-ac
Ik-cb
(106 )
295
four orbitals from which it is formed. In the example used here, one has to
define the following different integral types
(ij Ik1)
(ailjk)
(ablij)
(ai I bj)
(ablci)
(ailbc)
(ablcd)
296
Table 2.
BJORN RODS
CPU time d
(minutes/ieration)
35
3304
0.8
-0.2306
35
10010
3.0
-0.2120
39
10101
3.6
-0.2402
F-(H 2O)
46
36204
10.5
-0.4148
(H 2O)2
58
56268
22.1
':0.4097
Molecule
n.1a
ne
H2O
H 0+
3
Li+(H 2O)
a
b
c
d
number of
number of
number of
on an IBM
Ecorr (a.u.)
internal orbitals
external orbitals
single and double replacement state functions
360/91 computer
REFERENCES
R. Ahlrichs and W. Kutzelnigg, J. Chern. Phys. 48, 1819 (1968).
C.F. Bender and E.R. Davidson, J. Phys. Chem. ZQ, 2675 (1968).
S.F. Boys, C.M. Reeves and I. Shavitt, Nature ~, 1207 (1956).
E. Brandas and 0. Goscinski, Phys. Rev. Al, 552 (1970).
A.J. Coleman, Rev. Mod. Phys. 35, 668 (1963).
G. Diercksen, Theoret. Chirn. Acta (Berl.) ~, 1 (1974).
G. Diercksen and B.T. Sutcliffe, to be published.
B. Huron, J.P. Malrieux and P. Rancurel, J. Chem. Phys. 58, 5745 (1973).
A.D. McLean and B. Liu, IBM Corp. Technical Report (1973).
P.O. Lowdin, Phys. Rev. 97, 1474 (1955).
P.O. Lowdin and Harrison Shull, Phys. Rev, lQl, 1730 (1956).
P.O. Lowdin, J. Math. Phys. &.' 1341 (1965).
W. Meyer, J. Chem. Phys. 58. 1017 (1973).
R.K. Nesbet. Phys. Rev. 109. 1632 (1958); see also: Advan. Chem. Phys. ~,
321 (1965).
R.K. Nesbet. J. Chem. Phys. 43, 311 (1965).
C.M. Reeves, Commun. ACM~. 276 (1966).
B. Roos, Chem. Phys. Letters ~, 153 (1972).
297
H.F. Schaefer III, R.A. Klemm and F.E. Harris, Phys. Rev. lQl, 137 (1969).
I. Shavitt, C.F. Bender, A. Pipano and R.P. Hosteny, J. Compo Phys. ll, 90
(1973).
O. Sinanoglu, J. Chern. Phys. 36, 3198 (1963).
B.T. Sutcliffe, J. Chern. Phys. 45, 235 (1966).
M. Yoshimine, IBM Corp. Technical Report RJ555, 1973.
MOLECULAR PROPERTIES
Peter
Swanstr~m
Contents:
1. The Hamiltonian of a molecule exposed to an external
electromagnetic field.
2. Reduction of this operator to the A-subspace.
The concept of molecular properties.
3. Explicit algebraic formulas of some first and second
order one-electron properties.
4. Computation of properties using perturbation theory.
5. Gauge problems of magnetic properties.
This series of lectures is intended to provide a background for understanding the concept of molecular properties.
We shall in fact interprete properties as functions which define the kind and magnitude of the various phenomena that actually can be observed under experimental circumstances, using
external fields as perturbations. We believe that this approach
is both fundamental and easy to understand; for this reason it
will be given a rather general formulation which pretends both
to survey the elementary literature, and to place each individual
property in its proper context.
It will always be subject to discussion whether a molecular
theory which involves electromagnetic fields should be presented
using field quantization methods, or not. In order to ensure a
maximum of overlap with the existing literature we have chosen
to consider the fields as not-quantized.
Some emphasis is given to the development of a theory which
Diercksen et aL (eds.), Computational Techniques in Quantum Chemistry and Molecular Physics, 299-345.
All Rights Reserved. Copyright 1975 by D. Reidel Publishing Company, DordrechtHolland.
300
(1.1)
that is, as a sum of three operators representing pure electronic, pure nuclear, and mixed electronic-nuclear interactions.
TI~
n
H
L{mc Z
1 _
+ -~.
4mc
r-
lTI.X
~
~,J
e2
r ..
(1. 2)
~J
301
MOLECULAR PROPERTIES
1 aA
-Vet> - cat
(1. 3)
fixA
(1. 4)
curl A
n.1=1
P.
+ ~
A. = -ifiV.
Cl
+ ~
C
is defined as
TI.
1
A(r.t)
(1. 5)
= -gl1BS
(llB = eh/2mc)
(1. 6)
TI2
N
a
2
E{m c + Z eet>Cr: t) +
a
a
2m - lla
a
a
a
where
na
TI
P - 2:....
. H(fa t)} +
N Z Z e2
E' a b
2r ab
a,b
(1. 7)
Z e
c
Aa
-ih V
a
Z e
-
a
c
--
A(r t)
(1. 8)
302
ellipsoid creating local electric and magnetic fields at the nucleus. Thus we cannot say whether the nucleus has a spin because
of its magnetic moment, or vice versa. Another consequence of the
finite size and structure of the nucleus is that it may possess
an electric quadrupole (but not a dipole) moment; we shall return
to this point later. For these reasons the nuclear g-factors will
be treated as phenomenological values that can be determined by
experiment.
We can give a physical interpretation of the terms of the
nuclear operator (1.7): (a) the nuclear rest energ~ (b) interaction with external electric potential, (c) nuclear kinetic energy, (d) Zeeman-term, (e) Coulomb repulsion.
The operator covering the mixed electron-nucleus effects is
Z e2
n N
a
L: L: {r
a
i
ai
en
(0 1. jj a )r2.
al
e
2
al
'"Qa
r5.
al
Z e
a
2mc
-
iJ.1
. (ral.
r
3(jJ .
1
n.1 )
3.
al
r al.)(jJa
e )la
mc
+ -
. (r .
al
x iT. )
1
3.
al
r al.)
ai }
(1.10)
r .
e
2
-r
al
r.-r .
'"
Qa r al
r5.
al
1
-6
"''-' as
as
(1.11)
303
MOLECULAR PROPERTIES
ru
as
=e
E (3a S
- 0
pa
ap ap
as r2)
ap
(1.12)
The sum extends over the coordinates of the protons p within the
nucleus a; this expression is, of course, tajher fictious since
it is based on classical e~ectrostatics.
are the components
of the second rank tensor V which we deno~e the internal electric
field gradient operator:
a
V:
(1.13)
or, using a tensor algebra formalism:
ru
V (i)
3 r .r . - r2. ~
-e
a~
a~
a~
ru
(1.14 )
ru
(1.15 )
H
= constant
(1.16)
304
(i't)
(r)
and
(1.17)
A(rt)
V
II
.H
II x
II x H
div
4np
div H
curl
ext
(1.18)
(1.19)
(1.20)
curl H
(1.21)
A to
be a simple
(1.22)
(1. 23a)
(1.24)
l ~ =
c Clt
(1.23b)
MOLECULAR PROPERTIES
305
2m
p2
22 e
a
c
A p +
a
a
2m
(1.25)
that is, the effect of the external fields on the kinetic energy
of a nucleus is neglected. In this case the nuclear kinetic energy, when transformed to a molecule-fixed system, yields three
operators that represent translation, rotation, and vibration of
the nuclear framework:
N p2
N p2
L:
a
2m
+ L:
2m
(1.26)
We shall ignore the translation operator because it does not affect the stationary states of the molecule. In the rotational
operator, N denotes the angular momentum of the nuclear framework
and I (a ~ x,y,z) are the three moments of inertia (choosing
princ~pal axes); since we have assumed that the molecule rotates
rigidly the moments of inertia are constants. The summation in
the vibrational operator extends over the moments conjugate to
the molecule-fixed coordinates. The removal of the 6 (for linear
molecules 5) redundant operators must be accomplished by means of
the Eckart conditions.
We shall discuss for a moment the rotation operator of (1.26).
This operator represents the rotational energy of the nuclei only,
but we shall attempt to express it in terms of the total molecular
306
J = N+ L
where
n
L: L.
~
i
n
L: (f. x
~
i
p.)
(1.27)
N2
a
na
L:
(J -L ) 2
a a
21
J2
L: {~-
21
J L
a a
I
a
L2
+~}
21
(1. 28)
Q'(O)o + ~ 0 Q"(O)o +
Q(O) +
N
Q(O) +
L:(~).or
a '1or
(1.29)
N n Z e2
c
L: L: r . (0)
c i
c~
N n Z eZ
L: L: _c_ +
. r .
c ~ c~
+ .....
(1.30)
307
MOLECULAR PROPERTIES
II
c,d
Z Z e2
c d
2r cd (o)
Z Z e2
C d
2rcd
II
c,d
I or . (II
a
Z Z e 2r
ac
a c
r3
ac
N Z Z e 2 (3r -r2
1 N
ac ac
I or ( I I a c
2
a
5
r
a
c
ac
+ -
or
a
'V
Z Z e 2 (r2 I - 3i"
a b
ab
ab rab)
(
or
) orb
a
r5
ab
1
+ 2
l)
II
a,b
(1.31)
I {mc 2
e
e. + --2 (P. + - A.~ )2
m
- 1\
II
(1.32)
i,j
P~
-lJ~
~ {-ei
e2
+ 2m + 2mc H Li + 8mc 2
H+
__
1_
2mc
E. (jl.
~
'V
H(r~I - r.r.)H
~
II
x P.)} +
~
i,j
(1.33)
Z e
1
(P - ~A )2
I {m c 2 + Z e + --a
c a
- lJ a
2m
a
a a
a
a
N
H
n
II
a,b
and after some manipulations:
(1. 34)
308
2: {Z ecp - lJa
a a
a
J2
+ 2: {
J L
a a
2~ ---I
a
H}
Z Z e2
a b
2r ab (6)
2:'
a,b
p2
a
} + Z
2m
a
a
N
(1.35)
lJ.
Cr- .______
x (P. + ~ A.)]
a ______
____
___
2mc
r3.
n N
Z e2
Z 2: {- /.(6)
i a
a~
en
e
+-
lJ
Z e
(ii.
ila )r2.
a~
~a~~
[r . x (P. + ~
a~
~
c
mc
A. ) ]
c~_~
a~
a~
r .) (ila
- 3 (ii. 0
~
r .)
a~
a~
r5.
a~
Q(a)
V~~)(i)}
2:
as as
(1.36)
en
Z Z
i a
{-
e2
Z e2
e jJ. L .
~
a~
a
_ _a_
2mc r 3.
4mc 2
Z
r . (6)
a~
e
mc
+ -
-lJaoL
r
a~
a~
a~
e2
a~
'V
(r .0r.)I - r .r.
a~
+ --H
a~
2mc L
3.
a~
ra2 ~~ il~~
a~
r3.
r . (jJ. r .)
a~
a~
8n
-3
s: (-
r a~. ) lJOlJ
~
a
(1. 37)
309
MOLECULAR PROPERTIES
interaction operator. L .
. h respect to nuc 1 eus a~a.
a~
w~t
(1. 38)
n p~
N Z e2
I {2. - I _a_}
2m
a rai
I'
i,j
e2
2r ..
~J
(1.39)
I'
a,b
In (1.38), H
describes the rigid rotation of the molecule,
H. the nucIgr vibration. H.
represents the internal perv~b.
h
d 1 Ham~l~nt.
.
h
b
f external
turDat~ons to t e mo e
ton~an ~n t e a sence 0
fields. HE and HH cover the interactions between the molecule
and the external electric and magnetic fields. By inspection of
(1.33), (1.35), and (1.38) we find:
H
(1.40)
rot
N p2
Hv~'b
I ~
2m
(1.41)
+ V(8)
I I{-8r
a i
Z e 2r .
a~
a
r3.
a~
Z e 2 (3 r- . r
a
a~ ai
a
2 rS.
8r
a~
r2.
a~
1'>
8r }
a
310
Hora o(Z'
a,b
(Z'
c
r a b r a b) )
I - 3
ora
Z'
Z Z e 2 (3 r r -r2
a c
ac ac ac
ora
"v
"'- +
ur
b
(1.42)
Z Z {-
H.
~nt
a i
+ ]Ja
e
a
2mc
ii.oL .
a~
r 3.
Z Q(a)V(a)(i)
6 as as
as
ai +
mc r 3.
-]J.r 2 . - 3 r(ii.or-.)
~
a~
a~
a~
a~
o[~
J a (L.)
8IT
a~
5.
o(r.)jJ.]} (1.43)
a~
a~
hfs(a)
il.r2. -
Z {~~
i mc r3.
a~
r . (jj.o:r.)
a~
8IT
a~
r5.
<5
Cr . )ii.} (1.44)
a~
a~
a~
H.
represents various intrinsic molecular effects - mainly
tfig~e involving spin - which cannot be classified as pure vibrational or rotational. The individual terms of (1.43) describe: (a) electron spin-orbit coupling, (b) nuclear quadrupole
interaction, (c) coupling between total and pure electronic angular moments, (d) nuclear spin-electron orbit coupling, (e) magnetic dipole interactions, (f) Fermi contact between electrons
and nuclei. The latter three effects are entirely covered by the
hyperfine structure operator (1.44).
We proceed to the field-dependent operators:
n
Z {-e(r.) +
HE
1
2mc
E(r. )
N
0
(ii. x P.)} + Z Z e (r )
~
(1.45 )
H'o {_e_ L.
2mc
~ - Wi -
NZe 2
a 4mc 2
ii.(r. r.)-r.(r.
o
a~
a~
r3.
a~
jJ.)
~}
311
MOLECULAR PROPERTIES
n _
z::
N
z::
e2
2. ""
}_
H { - - (r. I - r.r.) H
8mc 2
H{___e_2.
2mc 2
~ ~
""
- ""}I ]1
(1.46)
r3.
a~
The operator which is linear in H describes the interaction between the external magnetic field and the molecular magnetic moment, composed of electron orbit, electron spin and mixed orbitspin contributions. The operator which is quadratic in H describes
the interaction between the external field and the induced magnetic moment, i.e. it defines the ability of the molecule to be affected by a field ('susceptibility'). The last operator covers
the interaction between the nuclear spin and the field, often denoted the nuclear Zeeman effect.
The mu1tipo1e expansion.
We observe that the electric operator (1.45) contains the
scalar potential ~ which is related to the external electric
field through equation (1.3):
(1. 47)
~.
...........
R'=R+r/
/
/
/
,r
,
....
Molecule-fixed system
/
/
/
312
"(R')
-- R'
q q
't'
- IR+rl
(1.48)
We expand cP in powers of r:
t r(VVcp(Rr
(1.49)
r E(O) -
21_'V-_
r F(O)r
(1.50)
+ ...
'V
F (0)
-ilcp(o) = q a/R 3
(1.51)
-VVCP(O) = q
(1.52)
(3RR-R2~)/R5
(1.53)
313
MOLECULAR PROPERTIES
1
L {er. E + - - E
HE
i
N
+ L
2mc
I.
{-z er
a
E -
1:.
. (jJ.I.
3 as
x P. )
I.
- 1:.
3 as
GaS(i) FaS}
(1.55 )
A.I.nt
A
ext
A.
(p,
a
or a ,J, ii,
Qa la
a
- ru (E, F, H)
1,2,oo.,N)
(Z.l)
(Z. Z)
(Z.3)
where H' is independent of ~ whereas HJ and HZ contain the Aoperato~s to first and second power. Tfius
314
H = ~ H(l)
H
2
and
=.!.
2
~ it'(2)~
(2.4)
.
.
where H(l) is a vector and '\,(2)
H
~s a matr~x, both conta~n~ng
pure electronic operators. In this way we have obviously expanded
the Hamiltonian in terms of ~ as follows:
= HI
A H(l)
+ .!.~ it'(2)~
(2.5)
Since HI is independent of A it will consist of the model Hamiltonian ~1.39) defining the unperturbed system, and of the electron
spin-orbit coupling operator appearing in (1.43), i.e.
n
HI
P:
z:; {-.2
2m
n N
+ z:; z:; {-
i a
N Z e2
n
N
2
z:; _a_} + z:;1 -e+ z:; I
r
2r ..
a
ai
i,j
a,b
~J
Z e
a
2mc
Z Z e2
a b
2r ab
iJ.
La~}
.
~
r
(2.6)
3.
a~
In this way we have in fact shifted the electron spin-orbit interaction operator from the internal perturbation operator H.
(1.43) into H . The apparent typographical confusion between~nt
the new HI (2?6) and the old H (1.39) disappears when we enter
actual ca~cu1ations in which tRe spin-orbit interaction conventionally again is considered as an independent perturbation to
H In order to keep the subsequent derivations physically rig8urous as far as possible we shall, however, maintain the new
definition (2.6).
H(l) and it'(2) can be written down immediately if one collects
the terms of the Hamiltonian (1.38) which are linear and quadratic, respectively, in the elements of I. Since this is completely
trivial we shall take the resulting operator (2.5) as ~t1~tands
~t2~out stating exp1icite1y the form of the operators Hand
H
The form of the Hamiltonian, (2.3) and (2.5), suggests the use
of perturbation theory, considering H1 and H2 as first and second
order perturbations to the model system described by HI. We introduce the Hamiltonian
0
(2.8)
315
MOLECULAR PROPERTIES
ek
+ ]12':1'(2) +
':I' = ':1'(0) + ~':I'(l)
k
k
k
k
.....
.....
(2.9)
(2.10)
(0)
= e k ':1'(0)
k
(2.11)
(2.13)
316
-(1)
e k = E(o) - A Ek
k
1 - '\,(2)-
2" A Ek A +
(2.14)
where
E (0)
k
(2.15)
E (1)
k
(2.16)
'\,(2)
Ek
- ZI
(2.17)
Ek
'" (2)
Ek
(2.18)
lim e k
A~o
(le k
lim ( - - )
A~o
elA
(l2e
k
lim (- -::-::-)
A~o
dAdA
(2.19)
(2.20)
317
MOLECULAR PROPERTIES
''
total
= '' (0)
k
(2.22)
318
P
ora
Lambda Doubling
lJ a
-E
Figure I. The first order property matrix i(l). The dotted line
separates the internal from the external properties.
The null-property arises from the fact that the Hamiltonian does not contain any operators which are linear
in P .
a
319
MOLECULAR PROPERTIES
We could now easill yyite do~~ formulas of all first and second order properties E(
and ~ ), using the operators and the
principles outlined in the preceeding sections. This would, of
course, be a rather comprehensive work; in this section we shall
select some properties of common interest in order to illustrate
the method.
In order to simplify the notation in the following we shall
omit the subscript that refers to the electronic fine structure
level. It is thus understood that we consider the stationary properties of a molecule in a single (but unspecified) electronic
state.
Hyperfine coupling constants (first order).
As seen from (1.44), hyperfine splitting is due to the coupling between electronic and nuclear spin magnetic moments. The
hyperfine coupling constant is defined as
(3.1)
whence
3H.
~nt
/3)1
hfs(a)
(3.2)
320
p
vibrational
G-matrix
quadratic
force
constants
ora
reciprocal
inertial
tensor
spin
rotation
constants
nuclear
spin-spin
coupling
constants
rotational
magnetic
moments
nuclear
magnetic
shielding
tensors
molecular
dipole
moment
derivative
ora
Figure II. The second order property matrix E(2). The matrix is
null-properties or which are unknown to the authors.
321
MOLECULAR PROPERTIES
nuclear
quadrupolequadrupole
coupling
constants
molecular
dipole
po larizabil i ty
tensor
nuclear
quadrupole
shielding
tensors
induced
electric
moments
molecular
quadrupole
polarizability
tensors
molecular
magnetic
susceptibili ty
tensor
'V
322
<'1'
L.
( ) n
0
IZ{~ ~ +
. mc
r.
al
al
(3.4)
that is, the hyperfine coupling constant is simply the expectation value of the hyperfine structure operator, hfs. The first of
the operators in (3.4) merely causes a shift of all the hyperfine levels, but not a splitting; for this reason it is usually
neglected in this context although it is not negligible in magnitude. The second operator in (3.4) is the dipole interaction term
which leads to anisotropic hyperfine splitting, and the last operator is the Fermi contact term which gives the isotropic hyperfine splitting. The hyperfine coupling constant vanishes for molecular states with no unpaired electrons. The anisotropic coupling averages to zero for gases and liquids. The contact term
vanishes unless there is a finite electron density at the nucleus
a.
It is still doubtful whether (3.4) in fact accounts for the
hyperfine coupling between electronic and nuclear spins. The
reason for this is that the hfs'operator is based on a multipole
expansion of the internal vector potential, and this expansion
cannot be proved to be convergent. The same matter of dispute can
be raised for the nuclear spin-spin coupling and shielding.
The hyperfine splitting is field-independent; it is observable in electron spin resonance spectroscopy where transitions
are induced between the Zeeman components of the fine structure
level, whose degeneracy has been removed by a strong magnetic
field.
Electric dipole and quadrupole moments (first order).
The molecular dipole and quadrupole moments define the first
order response of the molecule to an external electric field and
field gradient, respectively. Thus
)l
(3.5)
(3.6)
323
MOLECULAR PROPERTIES
11
11. x P.
~
~}
2mc
IIJI
()
0
> + L: Z er
a a
a
(3.7)
'Q"
N
e
L: Z (31' l' - r2
a a
a
2
a
a
+ -
1')
(3.8)
.
..
f ragTh e f ~rst
two d'~agona 1 blocks 0 f "'(2)
E
def~ne Jo~ntly a
ment of the reduced energy operator which is responsible for the
oscillations of the nuclei within the molecule. We can write this
operator fragment on the compact form
12
f-~- -i- ~ -]
1 N
1 N
- L: G p2 + L:
2
a a
2
a,b
a
or a
(3.9)
'"
324
}lab
all
+ 2 2:'
1
(3.10)
<'fl(o)!2:
a~
2 2:'
1
u ab
'"",
(3.11)
325
MOLECULAR PROPERTIES
:tab
h.c.
(3.12)
E(o)_E(o)
1
326
+ h.c.
(3.13)
E(o)_E(o)
1
- '\, - .r.)
(r- .or.)I-r
e2
2mc 2
L{
a1
N Zbe
~{~L.-jj.}-~
i 2mc
dH/ dil
a1 1
}_
r3.
a1
2-
b 4mc 2
-J,l.(rb.or.)-rb.(r.ojl.)
11
111
3
r bi
hf s (a)
Restricting ourselves to molecular states with no unpaired electrons the electron spin dependent operators will give vanishing
contributions to (3.13) which then reduces to the Ramsey formula
[3]for the nuclear shielding.
,..,
'\,
a(a)
1---
2mc 2
e2
+ h.c.
---~'
2m 2 c 2 1
(3.14)
The second and the third terms are usually denoted the diamagnetic and the paramagnetic contributions, respectively, although
this distinction is closely connected to the choice of gauge
origin of the electromagnetic potentials and therefore rather
arbitrary. This problem will be further discussed in Section S.
The nuclear shielding is often denoted as the chemical shift
though strictly speaking these two properties are not identical.
We can introduce the chemical shift in the following way: The
external magnetic field interacts with the nuclear spin magnetic
moments both directly and via the electrons. The chemical shift
tensor is defined to account for the electron-coupled interaction,
and the over-all energy shift can be written as
(3.15)
Comparing this expression to the definition of the shielding
327
MOLECULAR PROPERTIES
(3.13) we obtain
'V
0shift (a)
'V
'V
= I - o(a)
(3.16)
=-
ila .
eff
(3.17)
(a)
whence
(3.18)
corresponding to the fact that the electrons shield the magnetic
moment of the nucleus by decreasing the applied field strength.
The static electric polarizability (second order).
When a molecule is exposed to some external electric field
there will be a first order interaction between the field and
the molecular dipole moment. But in addition to this effect, the
field induces a molecular dipole moment which then interacts
with the applied field. This second order 'feed-back' effect is
known as polarization and it h~~2tts analogues in the diagonal
blocks of the property matrix E
. By inspection of Figure II
we see that the polarizability tensor is defined by
'V
ex
=-
d 2 e/dEdE
(3.19)
dH/dE
L: {er. +
l
i
jJ.xP.
1
2mc
l}
N
_
L: Z er
a a
a
(3.20)
Using the short hand notation d for this operator, the polarisability tensor becomes simply
- 2
(3.21)
328
E +
aE
(3.22)
~(E) = ~o
E+
t E~ E
'\,
(3.23)
+
'\,
= NV a E = K E
(3.24)
.
(3.25)
41fp
41f(p -div
o
P)
whence
div(E + 41fP)
= 41fp o
(3.26)
329
MOLECULAR PROPERTIES
=E
+ 4TIP
(1+4TIK)E
EE
(3.27)
The factor E which connects the effective and the applied field
strengths is the dielectric constant:
E
=1
+ 4TIN v a
(3.28)
or
Il+4TIN va
(3.29)
It is well-known that the refractivity index is frequency dependent. This frequency dependence appears when the applied
fields are time-dependent, i.e. when we deal with electromagnetic
radiation. In this case we must employ time-dependent perturbation theory to determine the frequency dependent or dynamic polarizability.
4. COMPUTATION OF PROPERTIES USING PERTURBATION THEORY.
In Section 2 we introduced the concept of molecular properties on the basis of perturbati~lyhe~t~J This gave rise to properties of various orders named E
,E
and so on. In tht~ secti~~2re shall present methods to compute the elements of E ) and
E
330
(4.1)
where h is a genuine one-electron operator while g covers the
mutual electronic interactions. When we deal with one-electron
perturbations as in the preceding sections h depends explicitely
on A,and g implicitely via the molecular orbitals. We partition
f into three terms according to powers of
"fo
X:
..
"
(4.2)
+ fl + f2
= f
where f
"
(0)
-(1)
f
1 - "'(2)2 A f
A
(4.3)
(4.4)
where
is an ordering parameter.
331
MOLECULAR PROPERTIES
(4.7)
FAO (~)
(4.8)
X(~) C.
(~)
(4.9)
For computational reasons we shall transform the general SCFequations (4.5) and (4(6) to a coordinate system in which the
unperturbed operator f ~(
diagonal. This can be accomplished
by me~~8)of the matrix C 0 formed by the unperturbed eigenvectors C.
as columns:
tS
c(o)}
(4.10)
We obtain
rv
F(~)u. (~)
= s.
u.
(~)
+rv
rv
(~)O(~)u. (~)
~
0 ..
O(~)u. (~)
~J
(4.11)
(4.12)
where
rv(o)+rv ()rv(o)
C
F AO ~ C
(4.13)
2;(0)+'S'
(4.14 )
AO
c. (~)
~
(~)2;(0)
2;(o)u. (~)
~
(4.15 )
332
""(0) .
. )
(The inverse of ""(0).
C
ex~sts because C
~s an orthogonal matr~x .
Inserting (4.15) into (4.9) we obtain the relation between the
molecular orbitals and the new eigenvectors u.:
~
(4.16)
where 8 is the transformed basis. 8 resembles the unperturbed
molecular orbitals because
(4.17)
is a unit matrix
""F .. (0)
6 .. s. (0)
(4.18)
""o .. (0)
6 ..
(4.19)
~J
~J
~J
~J
""
(4.20)
Collecting together corresponding orders in
equations
(0) - (0)
u.
J
J
= s.
( ""F(O)
- s.(0) "")-(1)
I u.
J
we get the
(4.21)
(4.22)
and so on. Here we are mainly intereste1l}n the first order equation because we want to determine the U.
which in turn define
the first order correction to the wave function. The zeroth order
equation (4.21) is assumed to be exactly solved, i.e. we assume
that the ordinary unperturbed Hartree-Fock solution is already
established.
In order to simplify the following derivations we shall let
the basis
be independent of the perturbation, i.e. X(~)=X(O),
333
MOLECULAR PROPERTIES
(4.23 )
(4.24 )
.
f rom left b y u-(0)+ we get
. l'~cat~on
Af ter mu 1 t~p
i
(E~O) - E~o))U~~)
J
~J
F~~)
(4.25)
~J
where F~~) = H~~) + G~~) is the first order matrix representation of~the Fo~~ oper~tor. By definition
H~~)
~J
G~~)
~J
[2(0)+~~~)2(0)]ij
r'V ( 0 ) +'V ( 1 ) 'V ( 0) ]
C
GAO C
ij
~C(~)*C(~)<X 1'h(1) Ix
>
(4.26 )
(4.27)
pq
p~
qJ
(4.28)
"
1\
334
(4.29)
1: C k(].l)C k(].l)<X
rs r
1
I-Ix
>X
q s
r r 12
(4.30 )
whence
(4.31)
(4.32)
=~
a).l
M _+
rv_
1: Uk(].l)M uk(].l)
k
(4.33)
(4.34)
335
MOLECULAR PROPERTIES
~ {-(l)+ ~ -(0)
uk
t...
kl
M~
-Co)+ ~ -(I)}
MUk
+~
M C (1)* + UkCll + ~
~ {uCl)*M +u Cl ) ~ }
lk u lk
k T=M+l Tk
Tk Tk -KT
(4.35)
]J
-uC.l) + ... )+ ~
I (-Co)
u.
J
-Cl) + .. )
+]J U.
0 ..
~J
whence
-Cl)+ -(0)
-Co)+ -(1)
u.
u.
+ u.
u.
~
o ,
(1)
..
u .. + u (1)*
~J
J~
(4.36)
G~:)
~J
M w
(1)
{u (1)*
E
E
MTk + u Tk ~T}
Tk
k T=M+l
(4.37)
Now, if the uC~) elements are real, corresponding to real oneetrytron pert~rbations (e.g. electric field perturbations)
G.. becomes
~J
M w
E
E
k T=M+l
uC~)
C4[Tklij]-[Tj lik]-[kj liT])
Tk
C4.38)
M w
E
E
k T=M+l
u~~) ([Tjlikl-[kjliT])
(4.39 )
336
(1)
Uk
L:
w
L:
k T=M+l
+0
OT
O'k ( S ( 0
J
)_
( 0 ) )}
s,
J
( 1 ) = -H (1. )
u k
T
oJ
(4.40)
- exchange term)
(4.42)
337
MOLECULAR PROPERTIES
(1)
~~ {2 u.
-(o)+~(l)-(o)
-(l)+~(o)-(o)
H u.
+ u.
H
u.
~
~
~
~
(o)+~(o)-(l)}
+ u.
u.
+.~
(4.44 )
~,k
M w
M
u(l)*{H(o) + 2:(2[T k lii]-[ Ti l'ik])} + c.c.
+ 2:
2:
Tk
Tk
k T=M+l
i
(4.45 )
H~~)
(2[Tkiii]-[Tilik])
(4.46 )
(1)
A -(1)
E
(4.47)
338
e (1) (]J) = 2 L:
u. (]J)
(]J)
(4.49)
(2)
M
~{2
6
-u(.o)+ ~H(2)u-(.o) + u.
-(l)+~(l)-(o) -(o)+~(l)-(l)}
H
u. +u.
H
u.
~
(4.50)
(2)
1 -
~(2)-
= - 2 AE
(4.52)
where ~(2) is the second order property matrix (2.17). The general expression for a second order property is then
M
M
- 2 L: H~~V) - 4 Re L:
i
~~
L:
i T=M+1
(4.53)
339
MOLECULAR PROPERTIES
(5.1)
+
1:.
c
Clg(rt)
Clt
(5.2)
for any decent function g(rt). This follows readily from application of (1.3) and (1.4):
- 1 ClA' n:t)
c
Clt
(5.3)
H' (rt)
vx
A'(rt)
vx
A(rt)
(5.4)
1 ClE
~3t
4 'IT
C
(5.5)
V x (V x A')
4'IT
c
(5.7)
340
1 3,j, I
c 3t
V AI + - --~-
=0
(5.8)
(5.9)
(5.10)
A(r) =
21
(H x r)
(5.11)
where
R is
341
MOLECULAR PROPERTIES
cp'(r)=CPCr)
(5.14 )
Any potential
i(H
x r) +
V(i(H
x r) R)
i(H
x r) -
V(i r
(H x R))
"21 -H x (r-R)
(5.15)
A'
342
(5.17)
n!~
~e
exp ~
-
~'
G)'IT.
(5.19)
and
(n!
~
n!)~'
(5.20)
ie
exp (nc G) ~
(5.22)
343
MOLECULAR PROPERTIES
Since
enters the Hamiltonian only via the linear momentum operator n, and only as a multiplication operator we conclude that
H'
ie
= exp(fic G)H
~'
(5.23)
<~ H ~>
(5.24 )
and
<~
IH I~
I
>
I I
= Ek(0) -
. -(1)
Ek
1. '\
2
~(2) A +
...
with
A
= (Pa , ora'
'\,
J, lJ a , Qa ; E, F<,
H)
About the energy (operator) e k we know that it is gauge invariant whether it is approximate or exact (5.24). It is also clear
that the perturbation operators Xare gauge independent. The
question to be answered is: are the electron properties gauge
invariant as well?
Seen from the physical viewpoint the answer is trivial: since
the properties (directly or indirectly) represent observable
quantities they must be independent of the choice of gauge.
Pty
.
However~ from the computa~io~a1
of v~t~)the situ~tion
1S not so s1mple. The reason 1S that Ek
and Ek
do not 1nvolve
the gauge invariant momentum operators n because we dissolved this
operator into its components already in Section 1. Since a time
independent gauge transformation results in the addition of an
arbitrary constant to the vector potential, as seen from (5.15),
344
345
MOLECULAR PROPERTIES
BIBLIOGRAPHY
[1]
[2]
[3]
[4]
D.W. Davies: The Theory of the Electric and Magnetic Properties of Molecules. (J. Wiley, 1967).
[5]
[6]
[7]
[8]
[9]
[10]
[11]
~,
397 (1937).
~,
~,
735 (1973).
107 (1967).
[12]
[13J
[14]
281 (1962).
B.J. Howard and R.E. Moss, Mol.Phys. ~, 433 (1970).
A. Dalgarno, Adv. Phys.
Resonance~,
l!,
137 (1966).
V.R. Saunders
Atlas Computer Laboratory, Chilton, Didcot,
Oxfordshire OX!! OQY, England
!.
INTRODUCTION
variables are not the concern of the present work, and we proceed
under the assumption that the spin integrations of the energy
expectation value have been completed.
~,
{~}.
It is
v.
348
R. SAUNDERS
Overlap Integral
s ..
!.(IH.(I) dTI
~
J
~J
(b)
T ..
~J
( c)
(la)
(I b)
!i(I)j(2)rl~dTI
V~.
~J
(I c)
(I d)
349
where
Xc
point C.
is the
required.
We will not take up the evaluation of such 'property' integrals
in any great detail, since we prefer to indicate how methods to
be described for the evaluation of the energy integrals may be
generalized so as to be applicable to as wide a range of
integrals as possible.
2.
BASIS FUNCTIONS
2.1
G(a,A)
exp(-ar A)
(2)
Thus
A = x-Ax
y-A
z-A
350
v. R. SAUNDERS
(3)
X(a,A,R-)Y(a,A,m)Z(a,A,n)
(4a)
where
X(a,A,t)
x! exp(-axi)
(4b)
If rA,e and
r A sin
(3
cos
YA
r A sin
(3
cos
zA
r A cos e
where
YR-m(e,~)
~I,
143),~
denotes an
If the spherical
harmonics are specified in real form, then the first few members
351
Is
2s
(a s /3Tf)2 rA exp(-ar A)
2p
3s
3px
3d
xy
xz'
3d
yz
E(a,A,ll,~,m,n)
(5)
352
v. R. SAUNDERS
closely approximated by a Coulombic central field eg in the
immediate neighbourhood of a nucleus.
(b)
Clearly
Clearly the
~s
Thus we find:
353
(a)
~s
comparatively
whilst it would
(c)
For example,
Steiner et al. [3J have suggested the use of the so-called 'cusp'
functions to supplement a GTF basis, so as to improve the quality
of the basis in the neighbourhood of nuclear-electron cusps.
Many other special-purpose sets of basis function are known
(particularly for use in atomic and diatomic cases), and simple
variants of the basic GTF or ETF/STF forms have also been used.
None of these will be discussed.
3.
rem)
f exp(-x)xm-I dx
OO
(6)
3.2
r (m)
[-x
m-I
exp(-x)]
00
oof
m-2
(m-I) exp(-x)x
dx
(7)
rem)
3.3
(m-I)r(m-I) (m>l)
(8)
r (I)
00
fexp(-x)dx
[-exp(-x)]
(n-I) !
(n integral and
I)
(9)
355
3.4
= ax 2 ,
fexp(-ax )x dx
2
2a
-(t+I)/2O
(10)
Hence, by symmetry
t
00
exp(-ax ) x dx
(t odd integer)
-00
N-(HI)/2 r (t+l)
(t even integer)
(I I)
00
-00-00
-00
r cose
r sine
ay/ar \. =
ay/ae
Hence
[rO)]
00
21T
ff
2
rexp(-r )dedr
00
21Tfrexp(-r )dr
o
356
V. R. SAUNDERS
r(D
7Tjexp(-u)du
o
r2, we find
7Tr
(I)
7T
Therefore
r(n+i)
(n integral and
0)
( 12)
2n
I.
= x-S/2a,
and (12)
jexp(-ax 2+Sx)dx
-co
( 13)
An alternative proof of equation (13) may be accomplished by
expanding exp(Sx) in power series, integrating term by term
(using equation (11, and noting that the resulting infinite
series may be written as an exponential in S2/4a with a factor
(7T/a)!
This latter proof is also valid for integrals of the form
jexp(-ax 2+iSx)dx
(14 )
-co
where i
= 1-1,
the result.
357
in
JxnexP(-ax2+iSx)dx
-00
(15 )
Noting the following definition of the Hermite polynomial, see
reference [5J page 150,
H (z)
n
(_I)n exp(z2) an
exp(-z2)
az n
we find
(-Iii)
n
(TI/a)
(~)
i (TI/a)2 (~)
2a 2
2a2
exp(-z2)H (z)
exp(-g2/~a)H
(g/2a 2 )
( 16)
H (z)
n
n!
[n/2]
I
k=O
(_I)k
(2z)n-2k
( 17)
-'--'-'---->-=.L-..,---
k!
(n-2k)!
~s
Equation
v.
358
4.
R. SAUNDERS
4.1
That is:
K G(y,P)
(18)
iA
i-A ' B
i
i-B
(19)
also, noting -+rp
KG(y,P)
""t-
.1:'
-+-+
-+-+
-+
(20)
t)
y
(21 )
359
(22)
(23)
where we define (AB)2
The notation AB
G(a2,B'~2,m2,n2).
Thus
iA
(~-P) + (P-A)
= xp
+ PAx and
xB
= xp
r; + (P-t), hence xA
(~-A)
v.
360
R. SAUNDERS
J/,z
L
j=O
and the product
J/,l +J/,Z
=
_
k
fk(J/,l,Jl.Z,PA ,PB )Xp
k=o
x
x
where
i+j=k
Jl.l Jl.z
(24)
4.2
361
fG~(a,A,t,m,n)dT
Hence N
_1
2,
where
I I
x Yz
where
I
(2a)-(2t+I)/2 r ( (2t+I)/2)
Hence
I
v.
362
4.3
R. SAUNDERS
(25a)
where I
(25b)
i=o
-- --
-(2i+I)/2
f . (R.l,R.2,PA ,PB )y
2~
r(
(2i+I)/2)
(2i-I)!!
(26)
(2y)i
z
<a/a>
We first define
<a/a>
(27)
363
We will also
The symbol
If the
KE
G(a2,B,22,m2,n2)dT
Clearly, the integral can be written as a sum of three components,
KE
+ I
+ I
(28)
where
2
and I .
z
(29)
obtained
(30)
2
3 13x G(a2,B,22,ffi2,n2)
_2
2 2
(22(22-I)xB-2a2(222+1)+4a2xB )G(a2,B,2 2 ,m2,n2)
364
v.
Whence I
I
R. SAUNDERS
may be written
(31 )
and I .
z
Equations (28) and (31) comprise our first scheme, the unsymmetric
scheme, for the evaluation of kinetic energy integrals.
The
~n
equation (31) are of the form where the quantum numbers of the
GTF centred on B are altered, whilst those of the GTF on A are
not.
4.5
00
-! J J
00
J~i a
lax
~j dx dy dz
-00-00-00
dv
Hence
whence du
00
~.,
00
JJ
00
Ja~./a
~
x
-00-00-00
a~j lax
~./ax
dx,
dx dy dZ]
(32)
[~.a~./axJ
~
-00
~.~O
~
and
a~.ldX~
~
as
x~.
Such basis
functions are the only allowable ones for bound state problems.
Therefore:
00
00
00
H J J a~. la x
~
a~./a
dx dy dz
(33)
365
(34)
There
may be written
x-C
(x-A ) + (A -C )
x
x x
and we find
D
<+1/0>
+ AC <0/0>
x
xA + AC
v. R. SAUNDERS
366
5.
5.1
(W)
Fm (W)
2 2m
J exp(-Wt)t
dt
(35)
0, and m is an integer
o.
The most
5.2
(2m+l)
F (0) is
m
-I
(36)
Recurrence relation
Let u
= exp(-Wt
) and dv
2m
dt, and integrate equation (35)
by parts,
F (W)
m
exp(-Wt
(2m+ 1 )
-I
2m+! ] I
)~2m+l)
2W
+ (2m+!)
(37)
367
recursion upwards.
Thus
Fm+ 1 (W)
(38)
as
W~O.
F (W)
m
exp(-W)
(Zm+ 1)
ZW
(Zm+l)
exp(-W)
[(Z!+l)
Fm+l (W)
ZW
]
(Zm+l) (Zm+3)
ZW
[(Z!+l) + (2m+l) (2m+3)
exp(-W)
2
(2W)
]
(Zm+l) (Zm+3) (Zm+5)
(ZW)
(ZW) 2F
Z (W)
m+
(Zm+l) (Zm+3)
F 3(W)
m+
(Zm+l) (Zm+3) (Zm+5)
3
More generally,
Fm (W)
exp(-W)
I
1=0
(2W) i
(Zm+ 1) (Zm+3) ...... (2m+Zi+ 1 )
(39)
v.
368
F (W), as explained in section 5. I.
m
R. SAUNDERS
~n
The appropriate
_9
range 0-+12.
8 + m/2 + 2W
Thus noting
r(m+i+3/2)
TI~(2m+2i+I)! !/2(m+i+l)
we find
F
(W)
r(m+~)
exp(-W)
I
~=o
Wi /r(m+i+3/2)
(40)
369
5.4
we find
aF (W)
J dt
aw
t 2m aexp(-Wt )
aw
- J dt
2m+2
exp(-Wt)
(41 )
- Fm+ 1 (W)
l.
F (W +W)
m
(_W)i F
. (W )/i!
m+~
i=o
(42)
equation (36),
Fm (W)
i=o
0, using
(-W)i/(i! (2m+2i+I
(43)
370
v.
R. SAUNDERS
00
r (a, W)
exp(-u)u
a-I
du
Wa exp(-W)
[w!
2-a
1+
I-a
W+
1+
W+
3-a
3
W+
1+
.. J
(44)
bS
~n
(45 )
equation (44)
~s
convergent
00
exp(-Wt ) t
2m
dt
(46)
F (W)
m
00
exp(-Wt)t
2m
dt - m(W)
(47)
371
= Wt
OO
cP
(W)
exp(-u)um-!2du
(48)
Therefore
F (W)
m
(49)
bn
A
B
(50)
Evaluation P
(b)
2' compare its value with Pn+ I' and if the difference
n+
is less than the threshold accept the value of Pn +2 . If the
372
V.R.SAUNDERS
A
+ a A
n-I
n n-2
where A_I
o , B- I
and B
I.
Thus the
The student
may care to confirm that the n'th partial result of the repeated
fraction
~n
approximant.
373
5.7
(W)
--
= tZmdt,
~
= exp(-wt
),.
to yield
(exp(-W) +
(W)
(Zm-1)~m_1
(51 )
(W) )/ZW
if
(5Z)
m~O
(W)
exp(-W)
[Z~
(Zm-I)
(ZW)
(Zm-l) (Zm-3)
(ZW)
'" .+
(53)
IR I
<
i~m
and we have a
i~m,
374
V. R. SAUNDERS
Point of
Absolute
minimum
Value of
Term
Term Number
There is no guarantee that for a given m and W value the point
of minimum error will give a sufficiently small error.
In
In fact it is
5.8
Tabular interpolation
The
lation.
comparatively large high order derivatives, a high order interpolation scheme and/or a closely spaced table will be required.
Consider
Thus
F (W)
m
(W) W-(m+D
375
6.
Nonetheless we
-I
1T -
ff"
2_1
exp(-sr )s 2ds
(54)
and
~s
available,
r- A
[r(A/2)J- I
exp(-sr 2 )s(A/2-I)ds
which
lS
Tabulations of
exp(-ts) g(s) ds
376
v.
as f (t)
6.2
-1
g(s)
R. SAUNDERS
-1
NAI
where y
= al+a2'
_2
= exp(-ala2(AB)
product theorem
2
exp(-yr p ) exp(-sr C)
_
D= (yP+sC)/(y+s),
so that
NAI
K;2
co
js-2
377
K7f
fOOs -!
(y+S)
-3/2
- 2
exp(rys(PC) I (y+s))ds
s/(y+s), we find
and
NAI
(55a)
The integral on the RHS of equation (55a) is of the form
discussed in section 5, ie
6.3
2 2
exp ( -y(PC) t ) d t
F (y(PC) )
o
(55b)
ERI
ff
Thus
2
2
-I
2
2
exp(-alrl A-a2 r lB) rl2exp(-a3r2C-a4r2D) dTl dT2
378
V. R. SAUNDERS
where Kl
-)
ERI
I s-!ds
x Y z
I I
(56)
where
I
and I .
z
Let u
(u-v + P -Q )
x x
= XI-P x
and v
_
(u-v + PQ )
x
Therefore
I
exp(-s(PQ )
x
) * f
-00
* f
2
du exp(-(Yl+s)u -2PQ su)
x
2
dv exp(-(Y2+s)v +2(PQ +u)sv)
x
1
X
11 2
* -cof
CO
exp (- s (PQ ) ) (y 2+ s)
X
_1
2
379
(Y2+ S )-!
_1
_1
2
exp(-S(PQ )s/(S+s
x
K1K271
5/2
(Y1+Y2)
Yl Y2(Yl+Y2) 2
and I )
s/(S+s)
-3/2 00
_ 2
fs -12(S+S) -3/2 exp(-S(PQ)s/(S+sds
2 2
exp(-S(PQ) t ) dt
o
(57)
6.4
380
V.R.SAUNDERS
OAx
The equation
2aG(a,A,t+l,m,n)
(a,A,t,m,n)
tG(a,A,t-l,m,n)
G(a,A,t+l,m,n)
aA
+ tG(a,A,t-l,m,n)/2a
Consider,
OAx
[exp(-ala2(AB)2/ y)
F (y(PC)2)]
0
__ 2
2n
exp(-ala2(AB) /y)
Y
a2
(C -P ) F 1 (y(PC)2) x x
Y
(A -B ) F (y(PC)2)]
x x
0
(58)
381
7.
(Beware of
F(x)
(2n)-Z
00
exp(ixk) f(k) dk
(59)
-00
i=(-1)2.
-+
00
-+
382
V. R. SAUNDERS
where
k
and r
k .+k +k
x y z
k:.k:
++
x +y +z
r.r
vector.
k x ,ky ,k z )
k,8 k ,!f>k
we find
00
I=2- l rr- 2f
00
00
-00-00-00
-I _2o rr2rr
rr
fff
000
irr~ediate,
001
r-Irr-Iff dk'du exp(ik'u)
0-1
(61 )
383
exp(-Ek )
2Ek
2 1
-3
exp(-Ew
-2 2
k )dw
(62)
-a
-ax
dx by letting
(63)
where K
The summations 1n equation (63) run from
m
=0
to (ml+m2) and n
Noting that
tc
=0
to (nl+n2).
= ~-c = (~-p) +
-1
Hence
NAI
(P-c)
= ~p
may be written
PC,
where
PC
-+-+
P-C,
384
V. R. SAUNDERS
(64)
where
1 ett~ng
= (4y)-I.
IX
1/2
"~'cr)1/2
i/2 exp(-k 2 ) H(
k)
X
x
y
'(7T)1/2
~
i/2 exp(-k 2 )2!
x
y
q=o
q!
(2-2q)!
(65)
Remember that [aJ denotes the largest integer less than or equal
to a. Similar results for I Y and I Z can be obtained.
m
n
Subtituting the result of equation (65) into (64) we obtain
NAI
K2
-1
_1
7T 2y
-3/2
(66)
where L = 2-2q, M = m-2r, N = n-2s.
~s
385
Thus
2E f dw
o
exp(-Ew
f dk
00
W-3
-00
-2
kL exp(-Ew-2 k 2 +iPC
k) f dk kM
x x
X
X X _~ Y Y
00
cof
k 2+iPC k) dk k exp(-Ew
Y
Y Y -co z z
-2
2 -
k +iPC k )
z
z z
[L/2]
t=o
[M/2]
u=o
[N/2]
v=o
fdw
(-I)t(PC )L-2t
x
t! (L-t)!
-I
E2
(~)
L-2t
(_I)u(PC )M-2u
y
(+)M-2u
u! (M-u)!
Eli
(_I)V(PC )N-2v
z
V!
(N-v)!
(+)N-2v
E2
(67)
Note that the integral on the RHS of equation (67) is of the form
of the function discussed in section 5,
I
Jdw
L-t+M-u+N-v y
(PC) 2)
(!I'1 +2)
I
=0
21TK/y
tl2
_
[tl2] 1
(-1) E !f(I,2,PA ,PB) I --,-x
x q=o q.
386
V. R.
SA~JNDERS
[L/2J
t!
t=o
(ml +m2)
\
L
(L-2t)!
m m/2
(-1) E
m=o
_
_
[m/2J 1
m!f (ml,m2,PA ,PB)
\_m
y
y
L r!
r=o
[M/2J
u!
u=o
(nl +n2)
\
L
(M-2u)!
n n/2
(-I) E
n=o
[NIl]
\'
v=o
*F
v!
[n/2] 1
\_L
s!
s=o
(-)-)N-2u
(N-2v)!
E2
L-t+M-u+N-v ( Y(PC) 2)
(68)
9.,12
that E
9., L-2t
(-I) (PC)
x
Not~ng
NAI
(L-2t) 12
q+t
IE
i=o
j=o
k=o
2)
X,Y,Z k F . . k(Y(PC)
~ J
~+J +
(69)
where
x.
(70)
387
The notation
L-t=i
I II
q t
X.1
t~
t~
(L-~t)!
(71)
condition L-t=i.
In the
X.
HX(i,A) (CP ) A
A=max(O,2i-1- i 2
(73)
The notation A=max(O,2i-1-2) means that the start point for the
A summation is chosen as the algebraically larger of zero or
(2i-1-2), and
L-2t=A
L-t=i GX(-E) t
I I
L t
t! (L-2t)!.
(74)
v.
388
R. SAUNDERS
G~(L=O to (1+2'
q=O to
L~/2J),
evaluate
L=~-2q,
Loop over
X. (i=O to (1+2
~
~s
then
389
7.4
The
We do not recommend
1S
defined by
where the overlap distributions on the LHS and RHS of r~~ are
functions of the coordinates of electrons 1 and 2 respectively.
Now apply the Gaussian product theorem (equations (21) to (24
twice, to yield
(75)
where
Yl
390
V. R. SAUNDERS
r~~
y y
z z
(76)
Jx~P exp(-YIX~p+ikxxIP)dxIP
-00
iinl/Z(Yl)-1/2(Ell/2)i i! exp(-E1k;)
[t/2] (-I)q(2Ell/2)Lk L
I
x
L!
q=o q !
07a)
and similarly
00
fX;Q
Ix'
exp(-Y2X~Q-ikxx2Q)dx2Q
-00
[i'~/ZJ(-I)q, (2E21/2)L\~'
q
=0
q'
L'!
(77b)
where El
I , I "
y
Substitution of the
391
= I)'IT
_2 1
f dw
-3
(78)
Yl +Y2) , an d
(___
YlY2
_I
1
1 '
! ' !
'IT 1o( -10)' ( Yl Y2 ) 2(q2) (22)
1 L'
[/2J ['/2J (-I)q+q' (21 ~)L
(222)
L q'=o
L q! q' ! L! L' !
q=o
* f dk x
L+L'
-2
exp(-I)w k 2 + iPQ k )
x
x
x x
2 3/2
_'IT__
(Yl +Y2)
'
(-I)
!
2
!'!
[/2J (l)-q
q=o
q!L!
. t=o
-2(L+L'-t
exp(-PQ )2/(41))w 2 )w.w
t! (L+L'-2t)! I)L+L'-t
(79)
m,m' ,r,r' ,u
FL+L'-t+M+M'-u+N+N'-V
( (PQ) 2/ (4 I) )
(80)
392
V. R. SAUNDERS
where
q!
q'! L'!
L!
(81 )
q=O to
q'=O to
[~/2J,
t=O to [(L+L')/2]
[~'/2J
[(t-2q+~'-2q')/2J
ERI
i j k
x,Y,Z k F . . kPQ)2/(4/))
1 J
1+J+
(82)
where
X.
(83)
393
x.
L+L'-t=ix
L,L' ,t
HL lIL ,
(84)
t! (L+L'_2t)!oL+L'-t
t-q
t
q
t!
(-I) f (tl,t2,PA ,PB )--;---::-.-_
L!
t
x
x
q!
t,q
(85a)
I\'
(85b)
I\
(L=O to (tl+t2
and
I\t
(L'=O to (t3+t4'
Loop over
I\'
Similarly, evaluate
y y
z z
factors ~1'HM' 'liN 'liN , .
(b)
HL.x
x.
x..
~
Evaluate the F
(d)
PQ)2/(4o)
for m=O to
394
v.
8.
R. SAUNDERS
K (W)
v
It
~s
f dy
2v-1
W 2 -2
exp(- Z(y +y
(87)
v
W
f dy
2v-1
W 2 -2
exp(- Z(y +y
(88)
Kv (W)
nl/2(W/2)v oof
V-I
r(v+I/2) I exp(-Wt) (tLl) :1
We define
(89)
I (A, B)
v
395
Bv
2 (A)
(A,B)
""f
dy y
2 -2
exp(-AB(y +y
2. (2A 2 )-v k
8.3
2v-1
(2AB)
(90)
W 2 -2
Integrate equation (88) by parts, letting u = exp(- I(y +y
2v-1
dv = y
dy
v 2v
W 2 -2
""
W [y
exp(- I(y +y /2vJ o
k (W)
v
Wv+1
oof
Iv
(2v+1 2v-3)
W 2 -2
y
-y
exp(- I(y +y
k (W)
v
(9Ib)
8.4
k (W)
v
v
W
fdz
00
-(2v+l)
-I
W 2 -2
exp(- I(z +z
396
V. R. SAUNDERS
(92)
Therefore
(k l / 2 + Wk_ I / 2 (W/2
WI / 200
-2
W 2 -2
2
fdy (I+y ) exp(- 2(y +y
o
2 -2
Noting that y +y
kl /2 (W)
Let z
y-y
-I
k l / 2 (W)
WI / 2
00
-2
W
-I 2
exp(-W) fdy (I+y ) exp(- 2(Y-y
o
exp(-W)
(rr/2)1/2 exp(-W)
(93a)
397
usually used (see, for example, [17J pages 375 and 378):
(a)
Denoting ~=4v2 ,
*(1
(94)
If
v~O,
K (W)
n
+ ~ (-W /2)
!(W/2)-n
k=o
n
(~(k+l)+ ~(n+k+l (W2 /4)k
L ....!.:-~~~,...>-:.:;~..:;....!...O<-...,;"-'-~
k=o
k! (n+k)!
00
provided that
~(k)
n~O,
-Y +
'"
k=o
(W 2 /4)k
k!
(n+k)!
(95)
and
k-I
j-I
j=1
W~8.
v.
398
9.
R. SAUNDERS
,
(
'8
R--~>
A,
and
are given by
R
-(A~-I)
399
Therefore
r
00,
-1 to 1 and 0 to 2n for A, )1
respectively.
00
(!) . f
1 2n
ff
1 -1 0
and
An (F)
B (Q)
n
00
{ exp(-FA)A
1
f
-1
n dA
exp(-Q)1) )1
d)1
(96a)
(96b)
v. R. SAUNDERS
400
E(a,A,~,,m,n)
(~-l)
E(a,A,~-2,+I,m,n)
E(a,A,~,-I,m,n)
E(a,A,~-I,+I,m,n)
Note that the integrals evaluated above have been of the form where
the x, y and z axes of have been defined rather conveniently with
respect to the AB bond.
hence an
A (P)
exp(-P)/P
401
n+ I(P)
9.3
exp(-P))/P
Bn+1(Q)
exp(-Q)
(_I)n exp(Q))/Q
(exp(Q) - exp(-Q))/Q
Ie we evaluate B
(Q) from a power series In Q, and
max
n max
then recur downwards, noting that
If IQI~n
B (Q)
n
= -2
I
(Q)2i
i =0 (2 i)! (n +2 i + 1)
(Q)2i-l
i=l (2i-l)! (n+2i)
(n even)
(n odd)
Gaussian integration
402
v.
b
jw(x) f (X) dx
a
'II
i=1
R. SAUNDERS
A.f(x.)
~
(97)
are defined.
w(x) x
dx
called the points (or nodes) of the quadrature, and the A. are
called coefficients (or weights).
403
This literature
bl
al
am
n
'VLA.f(x' j . . . . . x.)
i=1 ~
~,
~m
where the ~.=(x. I ..... x. ) are points in the space, and the A.
~
~,
~m
Normally,
we set our sights somewhat lower, and apply the ordinary one
dimensional Gaussian procedures m times.
dimensional cases:
b2
bl
J J
a2
al
b2 bl
f f
a2
wI (xI)W2 (x2)
aJ
nZ nl
'V
L L
j=li=1
A. 1 AJ. 2
~
404
9.5
v.
R. SAUNDERS
-ar
-2:-
ZIf I/Z
s-3/Z exp(-r 2 s -
(~Z)2/S)
ds
(98)
Thus
s -3/Z exp(-r 2s -
(a)2
2 /s)ds
I_I/Z(r,a/Z)
3/Z
Z
r k_I/Z(ar)
Zlf l / Z
exp(-ar)
a
The three
dT
405
f J
00
al a 2
41T
The
Therefore,
NAI
Therefore
z (I-u)
(SI,S2)
u , z
to I, z
to
00.
v. R. SAUNDERS
406
Hence
NAI
00
* Fo (z(CP)2)
(99)
where
ut..
+ (l-u)B
(IOOa)
Hence
C-p
(IOOb)
To integrate
2u-1
In
u.
A.
(U!+I)/2
1
A!/2
1
(note that du
du ' /2)
407
In
NAI
i=1
-3/2
A. u.
~ ~
(I -u)
-3/2
dz z
-3
We deal first
(this can only be so if the point C is on the line AB, and even
then is not necessarily so).
I, and the
where
2 1/2
(AB)]
,
[ u. ( 1-u .)
~
~s finite.
NA1
I
i=l
(101)
where
OJ
M(.Q;.in,g,h)
dw
wi - I exp(-gw-h/w)F (w)
m
and g
(102)
408
V. R. SAUNDERS
The evaluation of
M(~,m,g,h).
M(~,m,g,h)
M(~,m,g,h),
we now turn.
(a)
!r (m+ 1/2) I
M(,m,g,h)
II((g+I)1/2,h I/2 )
(g 1/ 2 h 1/2)
~-m-1/2'
~-
-I-
22
.
. g+l)
-1
1/2
,h
1/2
409
Expand F (w)
m
(the
max~mum
F (w)
m
~n
(-I) i (w-w ) ~ F
0
.
po~nt
-_(h/g)I/2,
. (w )
0
m+~
. I
~=o
. (w ) (-I) i
. I
L
m+~
. I
~=o
L (-1 )j
j=o
~.
j=o
i
(w )
m+~
~=o
a b out t h e
L
L
.
ser~es
a Taylor
(~) (w )i-jw j
J
M(i,m,g,h)
~=o
(c)
. I
j=o
~r(m+D
i=o
i+~
+1)1/2 hl/2)
,
M(t,m,g,h)
00
ff
dw dt wt - I t 2m exp(-(g+t 2 )w-h/w)
410
V. R. SAUNDERS
M(Q"m,g,h)
dt t 2m IQ,(g+t2)1/2,hl/2)
ERI
(l1(l2(l3(l4
16rr2
ffII
0000
*ff
We notice that the integration over the spatial coordinates is of
the form of an electron repulsion integral over GTF, see sections
6.3 and 7.4, so that after using equation (57), we obtain.
ERI
*
*
(103)
411
ERI
ffffdudvdtdzz
-11/2-3-3
(I-t)
0000
[u(l-u)v(l-v)]
-3/2
exp(-gz-h/z) F (zd)
o
(104)
where
g
h
d
u(l-u)t
[~
ut
AB2
+ v(l-v) (I-t)
(l~u)t
t (i-t) PQ2
+~
v(l-t)
CD2
<:(4 2
(i-v) (I-t)
/4
412
V.R.SAUNDERS
Au+B(I-u)
Cv+D(I-v)
The procedure is now to integrate over u, v and t numerically,
the integration over z being similar to that discussed for the
nuclear attraction integral in sections 9.5 and 9.6.
9.8
(a)
This
function method.
413
aE
aa
(a,A,~,l,m,n)
E(a,A,~+I,l,m,n)
and
aa
(1a
aE
aA
(a,A,~,l,m,n)
lE(a,A,~,1-I,m,n)/a2
+ lE(a,A,~+1 ,1-1,m,n)/a
-E(a,A,~,l+l,m,n)
414
10.
V. R. SAUNDERS
Thus the
A
B
415
Consider a
like' basis function SB in the new system, the overall effect being
as if we had rotated the system by 90 0 in the clockwise direction,
whilst leaving the coordinate systems unaltered.
(SASB/SCSD)'
We now propose an algorithm for the use of such symmetry transformation operators, and present a formal analysis thereof.
Let an integral (ij/kt) be tagged with a unique index or label,
L.
L'
(b)
T.L
1
(i = 1 to m) .
(c)
~L
v.
416
(d)
<
R. SAUNDERS
Suppose T)Lo
= L),
and LL o '
because
L
where K
>
I and J.
Clearly we
417
-I
Hence
~.
1.
1.
into
A non-zero entry, k,
~k'
V.R.SAUNDERS
418
Transformation
Basis
-)
-2
func-
tion
-2
3
-2
1- 2
1-5
-4
-3
-)
-)
In the case
signs to positive.
419
(a)
(b)
ABC D
Such a procedure
NAj
B means that under transformation Tj , the basis functions
A
in G transform into the basis functions of GB. That is, there is
. f unction
.
. GA , an d
a one to one correspondence between a basis
in
one in GB. The procedure is to use the group transformation table
Such an event is
420
v.
R. SAUNDERS
J..y
~I_ _ _~A
Permutational redundancy.
It is clear from inspection that
(ij/k)
(U/ij)
(ji/kt)
(ij/k)
(U/ji)
(k/ij)
(ji/k)
(k/ji).
421
(b)
E,
E.
Because a two
Clearly a rather
-I
k~
be over C and D.
l);~~
'i
Ce.t1-t,.t. ~
~i
'J)ipol~
422
V. R. SAUNDERS
-I
~F
1.
If ~F is large compared with RAB and ReD' the asymptotic approximation is extremely accurate, and is surprisingly accurate in many
less favourable situations, and as such will often provide a useful
basis for forming an approximation to an integral.
A method based
1.
k 1
423
REFERENCES
[IJ
[2J
[3J
[4J
[5J
[6J
[7J
[8J
[9J
~,
I.
424
V.R.SAUNDERS
CORRELATED WAVEFUNCTIONS
N.C. Handy
University Chemical Laboratory, Lensfield Road,
Cambridge, England, CB2 lEW.
where
+
')_1
1)1
r;,
(f..)
I.:.J
where
'1l}
Diercksen et aL reds.), Computational Techniques in Quantum Chemistry and Molecular Physics, 425-433.
All Rights Reserved. Copyright 1975 by D. Reidel Publishing Company, Dordrecht-Holland.
426
c. HANDY
N.
'"
~:~,~
J
Ci
wi 'ts>Cs
(5)
; 0
:s
tn denotes the number of expansion functions '/>.. If thew
eigenvalues of equations (5) are denoted W~1fI) J W!;:,) ... W,..
where
5
then these eigenvalues can be shown to obey
Wr> wt':>,
w.(m+,) ~ W.tM)
...
(6)
j..
b)
""e
(Z~ .. ~r I ~-wl'l-s>~o
III
r ,
I
"'.J'z)A-= "'i.n <f(~ -Z<:sl/-s .f'(e.x.,d;} -L:.c..;'V's;>
~
t_
< (e-XAet)
"'.
-2.cl.S~
3
I i: (eXAct)
."
(10)
2:.rA.s
'3'!>?
~
427
CORRELATED WA VEFUNCTIONS
CI
t- /" \. 't
t I
+/ t 'f
10
/I
.. ,.
"
+/l'f+.?1~
(It.)
W'''') 'IfII
The above are perturbation-type exapnsions, with 1'
(II,
etc being terms in the expansions. These expressions
11
'1'01
.f
I l.t>
SPr ,
j:=
(l.0-0.100828s
-0.031779u 2 )
0.355808u
0.033124s
0.128521t 2
(13)
with W
-2.90324a.u. ,
and s ~ "f\ .... r,.)t: ('. -(').., .... ::....;,...
He demonstrated therefore that
this six parameter variationally determined wave function gave an
energy within O.00048a.u. of the exact value.
Being an atom, all
the integrals were trivial.
Using ten parameters an energy error
of 0.00012a.u. is obtained. More recently Pekeris [5J extended
the expansion set to obtain an eigenvalue which is within
experimental error of the exact eigenvalue.
It appears that the
inclusion of terms linear in~... are vital to obtain such accuracy.
(Although the cusp condition at ~~ =0 demands a linear dependence
it is in fact a linear
in the wavefunction near ('12.=-O
on 1"'11.
428
N. C. HANDY
Gl=O
0.80483 ( ~~ ... ?~
-0.5599
??,.
(J'i-)
0s)
is anti symmetric in 11
In (15), <fC!",!~,!\)
and.~
and is given by
<p :: 2'i-<'Y.P-
""1
)
b f'l,-c:.f',3
e,)(p(-o..l',- .
lit)
429
CORRELATED WAVEFUNCTIONS
In (19), cp... ) <P" I <PG. and J.L are linear combinations of Slater
type basis functions, (ij) denotes any pair of electrons and v
is any positive integer. The integrals for this expansion set
are no more difficult than those met in the lithium calculations.
Szasz and Byrne C1D were the first workers to perform such a
calculation, but more recently Sims and Hagstrom [12J have used
107 such expansion functions in a calculation. They obtained
W
-14.66654a.u., which compares with an estimated exact value
of -14.6667a.u., and the best configuration interaction value of
-14.664l9a.u. [l3J .
Beryllium represents the largest atom and H2 the largest
molecule for which variationally determined correlated wavefunctions are available.
It is probable that in the future we
shall see further correlated wave functions for small atoms,
but the difficulty of calculating the integrals <~~ 1~'~1
where 'f are correlated functions, for molecules with more than
three electrons, makes it improbable that correlated wavefunctions
for such molecules will also appear.
Because of this difficulty with integral evaluation, one may
consider the application of the method of moments to the
determination of molecular wavefunctions. The transcorrelated
method [14) represents one such application.
In this method the
wave function has the form
where
CI
c;: TI.J f (
f .. i ) (i
"
fi)
a
and ~ = 5+(cp, q>~" . 'PI))
is a Slater determinant.
wave function the Schrodinger equation (1) is
.. ;>
or
oFt c.:~> Wc g
(20)
For this
l7-I)
-1
430
N. C. HANDY
')
\7,. f
"'
-' -I.
2. "..- ~
'-J
\7., f''''
1j 1.. 1q (23)
_ .....
<i'J' ~ l, bj~.:ticl:f)4'(Y)
lei: I
where
forms
~(1)
~)(P L-\Q.'f,:J
and
bik
the following
=0
~., 1) 2. ...
I( j
=0)
11:: 1)2. .. G
>
<: ~C:I~) I PI- - w Ic.~ >
c,D.}<C) L~-'~) \ 1=1- ~V I C~
1\
(;.7)
dhJk
(2'&)
i,
(Z'f )
431
CORRELATED WAVEFUNCTIONS
J p is therefore
(30)
Thus equations (24), (27), (28) and (30) are sufficient to find
all the parameters and the energy for the transcorrelated
wavefunction C;jf. The connection of the transcorrelated method
and the method of moments is apparent when equations (27) and
(28) are compared with equations (9A).
For the particular basis set (26'), the integrals in the method
can be rather trivially evaluated, being dependent on the formula
rtJ~ -CIZ.r,,~
-ejl
'3~)vHl,dV1-JV3
(31)
D= flj-r -+lft'V-t-f)(Ca.!>t:!.1
+1-'1"" t ' l l
(32)
t'l""(CYl.'-+C2.~)
.,. C/ 2)
.J
-O.Obi?l
was obtained.
e- /2 O 'f'i.j
+-!J
(~f'(!i) rcrpCtJ ~J
r r
(?.3)
C.i
432
N.C. HANDY
>.&
if
(34)
4~ ~=o
References
1.
2.
3,
4,
Hylleraas E.A.
5,
6,
7.
8,
Herzberg G.
9~
(1929)
z.
l,
825
~,
1081
433
CORRELATED WAVEFUNCTIONS
10.
Larsson S.
11.
12.
13,
Bunge C.F.
14.
Boys S.F. and Handy N.C. (1969) Proc. Roy. Soc. (London)
A309, 209;
(1969) ibid A310, 43. Handy N.C~ (1972)
Mol. Phys. ~, 1. Handy N.C. (1969) J. Chern. Phys. 51, 3205.
M. A. Robb
Department of Chemistry, Queen Elizabeth College,
Campden Hill Road, Kensington, London W8 7AR
INTRODUCTION
The purpose of this review is to examine the quantum mechanical basis
436
M.A. ROBB
Lowdin (5). We shall see, however, that the language of MBPT is very powerful, in that it not only exposes the nature of the types of approximations
inherent in many of the more traditional quantum mechanical theories, but
also suggests some new and useful approaches.
The scope of this review must be rather limited. We shall be concerned
primarily with the calculation of ground state correlation energies. However,
it must be pointed out that the techniques we shall describe become even
more powerful when applied to the calculation of energies for electronic
excited states or second order one-electron properties. A general review of
many-body theories of electronic structure has recently been given by
Freed(6). Also, we must omit any detailed discussion of those methods which
have their origin in the valence bond method. Most of these methods may be
related to the pair function methods which we shall discuss if full orbital
optimization is allowed.
II
which all pair function theories are related to the general CI expansion of
the wavefunction. We assume that a good zeroth order wavefunction for our
quantum state of interest takes the form of a single antisymmetrized product,
10), of orthonormal spin orbitalS,{ <Po:} ,0:=1 , ... n. Further, we assume that
we have available the orthogonal complement to the set {<po: } a second set
{<Pk},k= n+1 , .. n+m such that the two sets together form a complete m_ co
set. We shall refer to the first set, <Po: ' <P fJ ' q; 'Y . as occupied orbitals or
"hole" states, and to the set <Pk ' <P I ' <I> m .... as virtual orbitals or
"particle" states.
437
The CI expansion of the best wave function for the considered state
can then be written (7, 8)
[ 1)
'11
= 10)
2:
1< I
0)/3 0/3
kl )
0/3
k)1
kim
2:
a/3'Y
a)f3)'Y
k) I)m
1:
o)P)y)o
C
opyo
k 1mn
k) I )m)n
Cok pI....
..
kim n>
o p yo
+ .....
~p:
0,
p . ..
<OIHIO)= Eo
{<0IHI~)}=t!O,1'
{<ol HI~b)}= t!O,2 , ...... {<~IHI~)}= t!1,1 '
{ <~~ 1H Iyg )}= th.2
C 1 = {Ca"} , ~2
Further, since
[4)
t!O,3
= {
Co Pk I}
, ......
438
M.A. ROBB
!::!0,1 =
and
[6]
C
-1
0-
[7]
{ !::! 0,3
!::! 04,
....
H
-O,n
{ H 2 ,3
H
-2,4
.... }
H 2 ,n
H
-3,3
!::!.4,3
H n,n
and
[8] {
C3
'
C4
, ....
.......... }
Cn
then the application of the variation method to equation [1] gives a scalar
and two matrix equations (9):
= E- E
[9]
HO,2
[ 10]
H
-2,0
[ 11]
H n ,2
C
+ (H
- E) C
-2
- n,n
-n
-2
(t:b,2-E)~2+
tb,n
=
~n=Q
Cn =
-(
!::!n,n -
-1
E)
i::!.n,2
n so that
Q2
where
Thus from equations [9] and [13] we have the important result that in
order to determine the energy one needs only the coefficients of the double
substitutions, C 2 . However, these coefficients depend upon the higher order
....
H 2,2
439
one may then compute the energy directly from pair functions as we shall now
discuss.
If we write out equation [9] in full, we have
[IS]
C kl H kl
ap
O,a p
E = Eo + ~
a>p
k>1
~e
[16]
A =
a"
so that ~e a
k>1
a"
kl
HO,aklp
__1_[, (1),
Irs ) ff r
(2) - ,
5
(1) ,
(2)J
Iua"A)=
IUa p) as
k>1
(U I.!. lap> .
a p= a
p r12
knowledge of the U a p gives
~e
Thus a
IU a p )
440
M.A. ROBB
One could imagine solving the equation system [20J or [13J by any of the
standard methods. However, this would only be feasible for the smallest of
systems. Thus one is forced to seek approximations. The form of equations
[12J and [13 J suggests the expansions of perturbation theory (5) and it is to
this method that we shall devote the remainder of our discussion. First,
however, one must comment briefly on the calculation of the energy with
approximate U a ~
Let us suppose we have a set ofU a ~which are approximations to the true
U a ~ The sum of l:1e a ~ calculated from equation [19J will not give an upper bound
to the energy. On improving one'su a ~ one does not approach the truel:1ea ~ from
above. The upper bound to the energy must be computed from
[22J
E=
<"'IHI"'> / <",I",>
where '" is the complete expansion in equation [I J. Thus one needs all the
expansion coefficients
(.l
I'
kim ...
y...
Of course,
one could define a trial function which was truncated at double substitutions,
[23J
=Io)+);~
the exact
Ua~
C k1lkl>
'"
a~ a~
os
k> I
and calculate an upper bound from equation [22J. However, as one approaches
equation [19J becomes exact, whereas the use of equation [23J
Ua ~
the l:1ea ~
computed from
441
illustrate the essential ideas. Suppose our two isolated systems A and B have
wavefunctions
[24]
"'A (1,2)
[25]
"'B
A2
A2
(3,4)
[cj>1(1)cj>2(2)]
[cj>3(3)cj>4(4)]
U 12 (1,2)
U 34 (3,4)
In the limit where there is no interaction between the two sub-systems, the
exact wavefunction for the combined system is just the antisymmetrized
product of
[26]
"'A
and'" B
"'AB ( 1,2,3,4)
=A4
+cj>1(1) cj>2(2)
u 3 J3,4)
+ U 12(1,2)
(3,4)}
34
Because of the appearance of the term involving the product of two pair
correlation functions, the trial function '" os given by equation [23] is
clearly not appropriate.
The correct form of trial wavefunction for use in pair function theories
has been discussed by Cizek (17) and also by Primas(lO). It takes the form
of a "linked pair expansion"
=L
2 a>~
and
[29]
Ta A 10)=
t'
k>1
k)1 kl)
Capap
However, we shall not pursue this point since such considerations arise
naturally in the formalism of MBPT.
442
III
M. A. ROBB
III -
..
-1
C2 = -(H2,2-E)
H2,o
+ ..... .
and choosing suitable definitions of A and B , one may expand the inverse
matrices in equations [32] and [14] and so derive (5) most of the standard
"sum over states" perturbation methods. However, as soon as one introduces
explicit expressions for the matrix elements in terms of spin-orbitals,
considerable simplifications and cancellations occur. MBPT is merely a reduction
of these perturbation expansions to elementary bits, where each bit is a
matrix element with respect to one-and two-electron operators over the
original spin-orbital basis set. All cancellations are taken care of from the
outset. Each bit is then enumerated using diagram techniques. The power of
the method arises form the fact that one may then re-sum selected terms in
the resulting expansion to yield approximate equations for the U
In
443
and
a perturbed part V
so that
[34]
H = Ho
where -,. .
+ -,...v
with energies
[36]
.!..
2:
Il=FO
1<1
Il
Il
The states,
<1>
Il
I.
operator although this does not imply that we need use HF orbitals. We shall
make the same choice as Amos and Musher(l8) and define
[39]
where
Ho =
2:i h (i)
so that
444
M.A. ROBB
HF
The operators t and V are the usual kinetic and nuclear-electron potential
operators,
and
VHF
is defined as
(1)
[42]
where
[43] rs
~It
+ V+ VHF
Is)
Thus, provided our orbitals are HF orbitals or some unitary transformation of the HF orbitals, we will always have
[44]
h'(1,J~ =al~
and hence
[45]
E:
I
y
Further, since
[46]
V = H-Ho
= I
i>j
th~
Ii (1)
445
from equation [1] into equation [38] and evaluate the resulting matrix
elements, we observe certain cancellations between the first term in each
order and the remaining "renormalization" terms at the same order. This is
easily illustrated at third order. Let us consider a third order term involving a
diagonal intermediate interaction Vfl fl' Combining both terms we have
[49]
IVol-'12 (VI-'fl-VOO)
E (3)
I-'fl
(Eg _ E~)2
where Voo comes from the renormalization term in).,3. If we set I-'=I~p) we have
1501 V
~If.~ ~ 51 V ~
V~Q,~ Q-IVHFlv) ]
+[:=ta.~
- ~IVHFI]
+ ~ilk~
and
[51]
Voo
=[I
y*a.~
"2
Voo
VI-' I-'
V - Vo
1-'1-'
VHF
. Thus we have
(kllk!> -<fklak)-cflla!>
and the renormalization terms which involve the first order energy
(olvlo).
446
M.A. ROBB
E~IV
r
\l=I~P
and v
=I~B)
and evaluate
(E~- EO)(Eg-Ee>
we obtain the algebraic expression for one of the resulting terms which is
shown on the right of Figure 1. The corresponding Feynman-like diagram is
shown to the left. All we now require is a set of rules for going from left
to right. Before giving the general rules, let us note the two essential
topological features of the diagram in Figure
which are labelled by hole or particle states, and the horizontal "interaction"
lines which intersect the oriented lines at nodes. The dictionary given in
Figure 2 then gives the rules for translating the diagrams into matrix element
expressions. The downward lines are hole or occupied orbital lines and the
upward lines are particle or virtual orbital lines. To keep relative signs
correct, we add a factor of
~1
factor for
"<-!. ,......
.;w
9A
(4)
I-
:;::
0...
co
UJ:JI!
.....,)
40
>-
UJr4.
>---
ttl
~
Ol
ttl
Cl
Q)
.::,t.
,-~,
.I
c:
c:
lL-
>
Ul
JP-
~
A
e
~
rQ.
ttl
>.
.JI::
Q)
Q)
-0
-0
~
-'=
l-
.ttl
~,<.
I
I
447
'oX
,,
I
I
~1J
>
~
le:(
u
.,...
0>.
Q)
~
~
Ol
lL-
448
M.A. ROBB
DIAGRAM ElEMENT
fa
t
k
o.V~
___lyf3
MEANING
HOLE-LINE SEGMENT
(OCCUPIED ORBITAL)
-1
PARTICLE- LINE SEGMENT
C. VIRTUAL ORBITA L)
-I
E----(9
1"12 ~
(r - P12) r
-I
12
>
*~----@
~
~~r::lt~~
-1
J-
ENERGY DENOMINATORS
1
- ~ e
[ 1:
't 'I k k
449
Also, we have 3 hole-line segments and two closed loops giving an overall
sign factor of -1. For the denominators, each m!ir of interaction lines
contributes a factor equal to the sum of hole-state energies minus particlestate energies. This rule is illustrated in Figure 3 and the application to
Figure I follows directly.
At this point it is convenient to introduce a simplification. For each
diagram with N two-electron interaction lines there will be 2N -I corresponding diagrams resulting from exchange. We shall include all 2N diagrams in one
diagram by using antisymmetrized interaction lines. (It is easily verified that
the sign rule for the complete set of diagrams still holds). For example, we
can obtain all the terms resulting from equation [53] by replacing the dashes
by the antisymmetrized interactions (wavy lines).
We now have all the formalism necessary to write the expansion [38]
1D
the lowest order terms. Since only the double substitutions contribute to the
energy at second order we have
[54]
E02
o>P
k>1
o>P
k>1
<olvl~~>2
o
EO
Eo - 1~lp>
(0 Plk I)
EO+
(k 110
P>
Ep-Ek-EI
The last summation is represented by the diagram, Figure 4a. We may now
generate a large class of related diagrams by inserting intermediate interaction
lines, in all possible ways, into this second order Diagram 4a. All the diagrams
in Figure 4 are derived in this manner, subject to the constraint that no
450
M.A. ROBB
.,.-....,
UJ
I
...
I~
+~
VI
'-""
+->
<0
!::
E
0
!::
Q)
Cl
~
Q)
>
e)az
C!l
~
I
I
I
I
I
I~ I
I
I
~,
('
!::
40
!::
+->
<0
::l
<0
>
(V")
Q)
~
::l
en
I.J...
ex
IX
IX
(e)
(c)
Figure 4.
(0.)
'P
+IX
ex
ex
(b)
(d)
f3
+
(b' )
/3
+ ...
~ 1+ ...
ex
+
p
+ ....
.j:>
U1
-<
::<I
g;
~
..,
::l
>
t>O
::<I
c::
"0
Pi
~
..,~
::<I
S2
~
I:)
'"
>
Z
~z
'"!j
::<I
4S2
M. A. ROBB
VHF
Ho
C(1)~~
of
I~A
correct to the first order is shown in Figure 6. One now has a new diagram
ell
Ofol
'j.
(e)
,b
0(.
J, ,t.k
. \ IX
+
~
()(.
(b)
(f)
.---@
Y
'I
(d)
--~
O~vvvvvTI
11'
(c)
:~
=;>
,..,.,.,~
(ll)
Figure 5,
j."
', ...
fJ
..,
.l>v.
'-<"
::c
t'1
to
..,'"
13z
..,
t'1
'c..,"
"C
';;::;;::"
Cl
to
);:
zto
13
zen
("J
..,
"Ij
'cz"
"C
454
M.A.ROBB
-JctL
~~
--~
1\ w
...,)
loX
'+-
vJ
s-.
? uJ~
Ul~
to
Ol
a
J::
0
.j..J
J::
::s
'+<lJ
>
to
;:;:
~
I
.:It.
sto
II
....J
"co
:=J
I.
I
s<lJ
"0
s-
.j..J
til
.....s-
l1..
I.D
<lJ
s-
::s
.....
Ol
l1..
455
element; namely, "free" particle and hole lines. The free lines label the
states in
I~~
>
substitution.
III - 2
term involving the second order energy. (We have already dealt with the
renormalization corrections involving the first order energy.) At the top of
Figure 7 we give the first term in ). 4
I~~
>
=1~~yg),K=I~b>, K=IYC).
The resulting diagrams for these two possibilities are given on the left of
figure 7. The arrows indicate the evaluation of the last two factors in the
L
:J
--- -
---
U----
Figure 7.
y (x + y)
+ __, -
+ e - E - e + E + e - f mn
- e )
= (YS) ~ (mn)
,B
xy
(Er+fb-Em-c n )
(. + e - E - E. + E + E - E - E. )
oc.,Bkl)"6/T1n
X
+
y
0(
CXPktYS
X
+
y
(6 + E - e - E )
(E
+ y)
.,.---:I~
(x
O.-k---:D~l\;;; --~Ob
1= (cxf3) -+ (kl.)
<DlvII> <I/vl:r><:r/vIK><K/V/O>
~Q:~~~~V:Q~~~~06
ex
UK
""'VI
0:;
o
0:;
::0
?=
0\
457
I = I~ g)
and
K = I ~~)
algebraic expression on the right of the bottom line of Figure 8. This term
exactly cancels the sum of the two disconnected diagrams of Figure 7
(equation [55] ).
Because of the factorization, we can write the sum of the two diagrams
in Figure 7 as the single diagram shown on the left side of Figure 8. The
Pk
a) that we have two factors in the donominator of the left hand part
of the diagram
and:
b) that the energy denominator for the left hand part is evaluated as
appropriate for the left hand part alone and similarly for the right
hand part.
In fact, as we shall presently discuss, whenever we have a set of diagrams
Eo - Er
t:
)~- -
---\Is
e.mn
- E. )
2.
( IX+ fj - Kl
- E.)2 (E.Y + E (1r. -
()(/31kL) C<Y5/mn
~(mn)
K = (()(,f3) ~ (k l)
I= (Y5)
----
Figure 8.
ex.
-Q~ -~D[~m---nJ&\
Eo - EK
)(I:[<OIVIK>]2)
K
_(l:[<OIVII>J Z
t>O
::c
otl:l
;s::
;>-
00
Ul
459
from the renormalization part that are not cancelled. However, these terms
all correspond to linked diagrams.
III - 3
and
and 6 and
interchanged. For each of these exchanges there are also terms arising for
k I+mn. k-+m etc. The algebraic expression for each diagram is given on the
are
j~ J~----:~.
Figure 9.
A +8
=
k'
+E
-e
<v6Imn>
rom"
(e + e.~--E -f-)
>
mn"/J
klmn
rxp Yh
CS
-E -e)
Ktmn>
I()(.~~S
6mn
-e)(~ +E -E -f +E ... e
t&~kL-r
(E
<"'I3/kL><)'S/kl;>
V6mn,Bkl
(L~
(E +E - E -~)
_ <t&fJ I kl>
;4:-----~~p ~----J~
A_k____
KLmn '\.
?'"
r;::
0\
.j::.
461
right. Note that the diagrams are not unlinked since both parts contain free
lines.
[56] C(2)klmn
o ~ yO
C(1)kl
o ~ yO
o~
C(1) mn
y 6
, 6 ~ etc., we have
C(1)kl
oy
C(1)kl
00
C(1)mn
~o
ETC.
We may now add any number of intermediate interaction lines (with all
possible relative orderings) in either part of each diagram without altering
this result so that one may immediately generalize equation [57] to infinite
order. However, if we add an intermediate interaction line that joins both
disjoint parts of each diagram in Figure 9, this sort of factorization is not
possible. Fortunately, the factorization becomes possible when we come to
evaluate the contribution to the correlation energy.
The fourth order contribution to the correlation energy from diagram 9
is obtained by taking the four pairs of free lines, in the upper and lower
diagrams, and joining them together, two pairs at a time, by two final
interaction lines. In fact, one obtains only five diffe!ent types of diagram. We
have already dealt with one type, Figure 7, which was unlinked. The
462
M.A. ROBB
>
One example is shown in Figure 10. The upper and lower diagrams
correspond to final states I~~
and 16~n>
(a~lk
I)
Ea+E~-Ek-EI
[<YOlmn>(~Olmn)
Ey + Eo - Em-En
0y Ik I)
Ea-+Ey-E k- EI
index, then we may treat the right hand side of the diagram as though it
were inserted in the left as shown in Figure 11. The inserted part has
energy denominators independent of the part into which it is inserted, so that
[59]
G1 =
~[
2
olmiV(~olmn
(y
>
E+E-E-E
~
y
m
n
brackets in equation [58] above. The insertion above can also be written in
terms of the coefficients of first order pairs so that
[60]
G1
=; [ (yolmn)
and similarly for the first and last factors in [58]. In summary we have
thus established two important results:
a) Diagram 10 factorizes so that one part may be thought of as
0(
Figure 10.
y(x+y)
xy
X
+
+- (f.IJ + E6 - Em- En )
y
+-(eOL +E~kl
-g -+E +E -E. -E)
'(.s mn
+- ( E(X + f y- E k - El)
x (x +y)
(b)
(.1)
c5
'"
0\
W
.j::.
'><:"
t'rl
5!
::I
;.-
0:>
'"
>-l
'C"
t'rl
"d
::I
(")
;.-
a:::
;.-
'a:::"
C'l
;;
\j
\j
~
'";.-
"!j
;:
Figure 11.
~--G
IX
I-{~+
'~
'2:
6J
.j>..
t:D
t:D
?>
::<l
o
.j::..
0'1
465
one~electron
operator.
~ =
y in Figure 10.
I~~~o">
which
would violate the exclusion principle. Remarkably, such a diagram does arise
from uncancelled terms in the renormalization expression
I
I
where I = I~~
0 I V I
>
1)2) (I
X
EO - EO
and
K = I~~
>.
466
M.A.ROBB
Dc.
(a)
IX
(b)
Figure 12.
Particle-Hole Rearrangement
Di agram
467
(a)
(b)
Figure 13.
Particle-Line Rearrangement
Di agram
468
M. A. ROBB
fJ
(a)
(b)
Figure 14.
Particle-Particle Rearrangement
Diagrams
469
Table I. One may again identify the coefficients in the expansion of the
first order pairs as was done for Diagram 10. The results are again
generalizable to infinite order by adding intermediate interaction lines joining
k
I.
Q~. mn ......
but not
Im.Qy ..... .
Finally, we must now treat the case where the two double substitutions
of Figure 9 are coupled by some intermediate interaction. For example, one
might add an intermediate interaction of the form
(Imll' m') .
The result-
ing third order expression for the coefficient of the quadruple substitution
I: ~ ~~~'>
I>
)x<!mll'm'>x [<:y 61m n>
( <'Q ja+ Ik/>.+--
Q I'" k - I
y6mn
>]
(y 61 m'n
y+ 6--1
m
n
We may again identify the expansion coefficients of the first order pairs and
subsequently generalize to infinite order. Some other examples are shown in
Figures ISb and ISc. One must also include diagrams where the same type of
interaction is introduced into Figures 10, 12, 13, and 14.
To summarize, the central result of this subsection is the factorization
of the contributions of the quadruple substitutions. Provided we can neglect
14
13
12
10
DIAGRAM
TABLE I
<"B/k~>
<"elk!.>
E,,+a-Ek-ER,
Ea+Ea-Ek-~
<"e/k!.>
Ea+e-Ek-E~
<aa/k~>
_1
_1
~Y6/mn.
x fr6/mn>
x fY6/mn>
+
<a6/~n>
E/Eo-k-E~
<yo/k~>
+ <y6/kR,>
<y6/kn>
+ <y6/kn>
Eyho-Ek-en
e-6-~-n
<B6/tn>
<e6/mn>
X [Y5Imn> c +6-m- En + <e6/mn>
e
VALUE
1
J
<y6/mn>
Ey+6-m- En
<y6/mn>
e: y+e: 6-DI-& n
<y6/mn>
+E -& -&
y 6 m n
<y6/mn>
E + -E -&
y 6 m n
<aa/mn>
E +e -e -E
" a m n
<aB/!.m>
E +e: -& -&
a a ~ '11
<ay/km>
Ea+<y-ck-En
E,,+Ey-Ek-EI.
<"y/k~>
.j:>.
-...l
tzj
otzj
::tl
!>
(c)
Figure 15.
471
472
M.A. ROBB
the single and triple substitutions, then the linked pair expansion (equation
[27]) should thus be an excellent trial function.
III - 4
1 +a 2 +
a 3 + ....
(1-a)
Ial<1
fJ
cx..,
fl
!fo
----
.t
DC
'"
at
----
fl
fl
fl
1____1
==
~
n
1-
r+
t tJ J-1 - ~t-~iJ
tpX:*--i; ]
Figure 16.
[+~
ottpt
oj::.
-...J
Vol
~
~
:<l
-<
::l
)-
:<l
c:::
co
trl
"C
III
:<l
C'l
;;;
I:)
I:)
fJ
)-
CIl
IX
fl
0(
(X
~::l
'Tl
:<l
474
M.A. ROBB
series
Diagram 4a + Diagram 4e + Diagram 4e' + ---------The intermediate interaction ~[61 Q~>
from Figure 4.
to give
In a similar manner,
Diagram 4a + Diagram 4b (m~k, 1_ n) + Diagram 4b'(m=k,l=n,o=k, p=l)
contributes a denominator shift of-(k Ilk I> while the analogous series
containing Diagram 4d contributes a shift of
< la >
Q
I<
all the diagrams that result from the partial cancellation of the renormalization
effects which invoive
Figure 17.
ex.
"
ex
m n
+{
P + -----
~o~+ B
a8D"
~([D~ [I +{ffi3Dt 0 }
t
}:--
~
~
Vl
-Po
--l
'"
:r:
tTl
o
-<
...,Z
:>
...,
(3
to
'"
...,
'"
tTl
"'"
::l
n
:>
:>
'"
;;
u
u
:>
Z
en
(3
...,
~
n
'"'1
;;;
476
M. A. ROBB
A e(2)
_
-
~ a~
I<
<a[3lkl )2
a+p.-k- 1
>I
and defining
[65] Q '2
[66]
H O(1,2)
Ikl)
k>1
h' (1)
II
<k
h' (2)
where
[67]
8e~~ _ <a~I_'
Q '2
<a~I_1r 12 Iu ap(1
where the U ~~
[69]
+,21
a~
{Ho (1,2) -
Ea - Ef3} Q ,2 1
which may be solved variationally. The third order energy expression which
includes Diagrams 4b, 4c, and 4d is given in the same manner as
[71]
(3)
where
[72]
~i3l-'-
ea (3 - ~
~~'2)
1
= r,2
Q '2
(1,2)
,..,
,..,
- (Ja(2)
Q '2
_,
Ea+ Ej!>- HO(1,2) r'2
Ka(2))-(J~(2)- Kf3(2)
+<a~la(3).
Ia~/~
'"
h (1)
477
(A- B)-1= A- 1
(Xl
+ A- 1 I
n=1
(BA- 1)"
with
E~_ Ho (1,2)}
012 {
Ea+
where
and thus
o
[78]
llea~ = ~~
\:12
IUa~ >
The equations [77] are in fact related to the Bethe-Goldstone equations used
478
M. A. ROBB
I) Non-Variational Methods
With the techniques of MBPT now at hand our task of examining the
origin of the various methods used for the computation of pair functions is
readily accomplished. From the general form of equation [13] or equation
[20] and from the discussion of the last subsection, it is clear that the PDE
for the pair functions must have the form
[79]
012{ Ho (1,2)-
a - (3
+ y,o
L
,.a~
+ Va~
t1,2)
0 12 }
La~;~o
IU yo
r
>+
IUa~>
0121<;l~
r
12
>
O.
Thus the various approximate methods used to calculate pair functions must
differ only in the choice of
Va~(1,2)
(1,2)
and L a~;yo
. Since these
479
extensively reviewed by Freed (31, 32), Cizek (17) and is implicit in earlier
work by Kelly (23). Indeed, the very early work of Brueckner (20) is
fundamental to the present approach.
Much of the derivation of the IEPA equations has been discussed in the
derivation of equation [77); however, the final steps are particularly instructive and we shall now sketch the details.
The IEPA corresponds to the infinite order summation of
a) all diagrams of Figure 4 yielding equation [77) with
Va~\1,2)
f3
ue
m) n
f3 and 6 = a
G1)
is just
t 2)
a~(1,2)
l1e a~
of-l1ea~
to
Vaf-l (1,2)
(/I}luai>
+ Q12Ia~>
l1e a f?>
Uar
480
M.A. ROBB
The non-linearity due to fl. eafj arises (32) from the summation of the EPV
rearrangement diagrams with only two hole states a and
1- fl.eay - fl.eo~
tI -
y>o
fl.e"
au
Va~ (1,2)
given by
fl.eyp.
r
There is a partial cancellation between the EPV terms for Diagrams 12, 13,
and 14 so that, for example, 13 and 14 sum to zero if mn = k 1
The sum of the EPV terms resulting from Diagrams 12, 13 and 14 give a
further contribution to
[82)
Va~(1,2)
of the form
Diagram 12 + 13 + 14
1<>1
where
[83)
fl. A~k
0,-
1)=
p-
m>n"" k,l
mn
yO
Thus when Kelly's second order equations are generalized, one obtains
equations similar to equation [80) with the additional terms arising from
481
equation [81] and [82] included. Thus Kelly's second order method
represents an improvement over the IEPA in that it represents an attempt to
include all the contributions of the quadruple substitutions correctly. The
main defect of both the IEPA and the second order method of Kelly is that
the pairs are assumed to be uncoupled (except for the rearrangement diagrams).
Topologically, we have included no diagrams where the hole-line label is
changed by the intermediate interaction. Thus we have no contributions to
the coupling operator LQ~)J'2)
by Cizek (17) on the other hand, include all coupling terms except those
arising from Figure 15 and we shall now comment briefly on this work.
The diagrams that couple the pairs begin to occur at third order and
the "direct" diagrams are summarized in Figure 18. In the same manner as
.
dISCUSSIon
.
. contnb
our prevIOUS
t hey sum to gIve
utIOn to
L Q~j
(1,2)
y6
f t he ..lorm
In addition, there are non-EPV terms arising from Diagram 10 and Diagram 12
. that contribute to
LQ~1j'~~
13 and 14 contribute to
VQ~(1,2)
be derived from Table I. The resulting equations give the coupled pair manyelectron theory CPMET of Cizek (17). These equations would be exact except
for the neglect of diagrams of the type shown in Figure 15.
There are two other methods which have recently been proposed which
lie in between the IEPA and CPMET. The coupled electron pair CEPA of
482
M. A. ROBB
--
Cb)
(e)
Figure 18.
483
Meyer (33, 34)) includes the coupling effects due to the diagrams in Figure
18 to infinite order but uses the same approximation as Kelly in the
treatment of the rearrangement diagrams. On the other hand, the Independent
Pair Potential (lPP) and the Coupled Independent Pair Potential (CIPP) used by
Van der Velde (36) include only the terms from diagrams 18a and b. The
relationship between the various methods is summarized in Table II.
Finally, we should mention that one may solve the IEPA equations
[80] in any manner one chooses. In particular, if one is using variational
methods, different basis sets may be used for each pair. Kutzelnigg and
Ahlriches (37, 38) have derived equations for optimum orbitals based on a
different pseudo-natural orbital expansion of each pair. Meyer (34) has
extended this work so that the coupling terms
(LQ~1;y2~ )
may be
Variational Methods
none*
none*
18a, 18b
18c
all
all
all
none
all.EPV
except
13 and 14
all
none
yo = as
EPV with
all
CEPA(b)
none*
all
none
yo = as
EPV with
all
CIPP
all
a11
none
all
all
CPMET
none
none
none
all EPV
all
using geometric
approximation
* When symmetry adapted pairs are used some contribution from these diagrams is included but
not to infinite order.
none
15
yo = as
EPV with
all
a11
CEPA(a)
IEPA
DIAGRAM
TABLE II
et!
oet!
::<l
?::
i'>
.j:>.
00
.j:>.
485
model proposed by Hurley, Lennard-Jones and Pople (39). The SEP model
corresponds to a wavefunction which is a single antisymmetrized product of
strongly orthogonal geminals APSG:
[85]
\.IJAPSG=
A{ A1~,2).... "
AR
[86]
{<I>R j }
AR
may write
[87]
AR (1,2) =IR1
IU R. R)
R1)+
[88]
Iu RR)=1
R >R
k
Rk RI Rk
R1 R1
R1
[89]
+1
R>R
k
A {111 1
1)
....
.,>
R1
>" .. }
IR1 R 1
486
M. A. ROBB
+ ...... .
the
AR
Although the SEP model is not a very good one, its analysis into a
diagrammatic expression is particularly instructive. Referring to Figure 6, the
only coefficients to be determined are those for which a-R"p= R,ik- Rk , I = Rr
Thus in Figure 4, it is easily verified that we need only those diagrams
where
a) all orbitals in any fermion loop belong to the same group
b) all orbitals at any interaction line must belong to the same group.
Because of constraint b) all the diagrams of Figure 18 vanish. Further, the
only non-vanishing contributions from the rearrangement diagrams 10, 12, 13
and 14 are the same EPV terms we encountered in the IEPA. The APSG
pairs are thus coupled only by diagrams of the type shown in Figure 15.
Robb and Csizmadia (43) have demonstrated numerically that the coupling
from the diagrams of Figure 15 is completely negligible. Thus for the SEP
constraint the IEPA gives essentially a variational result.
Wavefunctions computed with the SEP model also provide a numerical
487
89
is
and many diagrams of the type shown in Figure 10, 12, 13, and 14. Thus
the IEPA, with a SEP type partitioned basis might be very close to a
variation calculation. However, we are aware of no actual calculations.
As soon as the strong orthogonality constraint is relaxed, the resulting
variational pair equations only remain computationally tractable for closed
orbital pairs. Even then the pair expansions must be restricted to natural
orbital form which implies that one must use some multi-configuration SCF
procedure (49).
III - 6
488
M.A. ROBB
that the pair functions in the IEPA be so defined that the total wavefunctions correspond to an eigenfunction of
discussed by McWeeny and Steiner (50), we may first couple the pair
functions so that they are eigenfunctions of a two-electron spin operator:
[91] ,I,.
[92] ... (
K, S, m 11,2)=
flY
~
L
flY
The summation fl Y must involve a pair of orbitals with the same spatial
parts but different spin and thus
S2
value for
the state of interest. Provided one is using the full variational coupled
equations there is no ambiguity and one can greatly reduce the number of
equations to be solved. It is only when one attempts to decouple the resulting equations to give a spin adapted IEPA that conceptual problems begin to
arise. Thus for a closed shell case one could imagine four possible schemes
1) no spin coupling (equivalent to the original IEPA)
489
spin operator
3) couple pairs with same
(52, 34)
490
M.A.ROBB
III - 7
We can then generalize this result by adding all possible interactions of the
type shown in Figure 4 in both the top and bottom half of figure 18c. Thus
the first order correction to the IEPA correlation energy from Diagram 18c is
/ , IEPAI
~u a~
<a(311
0/.~
IEPA>
U yo
We have illustrated the scheme for the coupling of the IEPA pairs but it
could equally well be applied to the rearrangement diagrams or for correcting
for the single excitations.
At this point, one should also comment briefly on Nesbet's (28)
method of higher order Bethe-Goldstone increments for the correction of the
IEPA. Nesbet's technique is equivalent to solving a set of equations for all
491
diagrams with 2 holes, a set for all diagrams with 3 holes and so on. This
scheme does not take advantage of the factorization which is possible in
Diagrams 10, 12, 13, 14, and 15 and thus is unnecessarily cumbersome.
However, some modification of his scheme, applied to Cizek's (17) equations
might be very successful.
Mehler (47) has also recently discussed a method for the inclusion of
the terms which couple the pairs in the IEPA approximation. He proposes
using the IEPA pairs as the starting point for a full variational calculation.
The zeroth iteration would give the first order corrections just discussed.
However, beyond the zeroth iterate, the solution of these equations would
be exceedingly difficult because all the rearrangement effects (which occur
in his equations as overlap matrix elements between pairfunctions) are included
to infinite order.
IV
IEPA, are now being routinely applied to problems of chemical interest (see,
for example, references 54, 55, and 56). In addition, applications have even
been made within the framework of semi-empirical theories (57). However,
we shall limit ourselves to a brief discussion of some selected recent work
which illustrates some of the theoretical problems we have just been
discussing.
The most accurate calculations have been made on atoms. Recently,
two papers by Sasaki and Yoshimine (58, 59), report the results of extensive
CI calculations (up to "i" type orbitals, including up to quadruple substitutions)
492
M.A.ROBB
493
Ne(1 S)
result of Lee, Dutta and Das (LDD), calculated using Kelly's method is
-0.3584. LDD then evaluated the Diagrams 18a and 18c to give a correction
due to pair-pair repulsions of 0.0224 which, in turn, yields a final result of
-0.3362. This result does not compare well with the SY result of -0.3052.
The LDD result for Diagram 4a is -0.26582 which is considerably lower than
the result -0.18693 computed by King (62) by solving the second order
equations [70] using Gaussian geminals. Obviously, these discrepancies require
further study. However, the IEPA results of Nesbet, Barr, and Davidson (63)
are in better agreement with those of SY if one adds the estimate of LDD
for the Diagrams 18a and 18c. The Nesbet, Barr and Davidson IEPA result is
-0.3454 and when corrected for pair-pair repulsions is -0.3221. The difference
between the IEPA and Kelly's second order method results from the EPV
diagrams of Figure 10, 12, 13 and 14 which LDD calculate will reduce the
IEP A result by 12%. This correction would bring the Nesbet, Barr and Davidson
494
M.A. ROBB
subsection III - 6) and -0.3020 using the CI method. The good agreement
using space orbital pairs is encouraging, but it must be remembered that the
higher order EPV diagrams of Figure 10, 12, 13 and 14 which have been
included by Kelly are ignored in both these types of IEPA calculations. When
these terms were included, the IEPA calculation with space orbital pairs might
even come above the CI result.
In connection with the preceding discussion, one should also mention
the calculations of Mehler using the IPP method (35). He computes pairlike wavefunctions using the wavefunction form
[94]
1<5> +y~o
k>1
C or k
495
'lk 'Y')
0
In his calculations on BH, as the basis was extended the IPP correlation
energy actually became worse than the IEPA energy. At first this may seem
surprising as by this coupling procedure one has included the intermediate
interactions to be found in Figure 18a. However, in doing this one has
changed completely the form of the higher order pair-pair repulsions. When
one uses some sort of symmetry adapted pair functions one is doing a similar
sort of coupling on a smaller scale. One should not be too surprised if one
encounters examples where the symmetry adapted pairs give a worse result
than the IEPA model using simple spin-orbital pairs. The treatment of the
rearrangement terms is particularly ambiguous in the symmetry adapted
schemes.
The most sophisticated pair function calculation on molecules is
undoubtedly that of Paldus, Cizek and Shavitt (64) on the BH3 molecule.
They have used the complete coupled pair many-electron theory (CPMET)
equations of Cizek (17). This set of equations, as we have discussed in subsection III - 5 (i), includes the diagrams of Figure 4 and 18 as well as all
the rearrangement effects of Figure 10, 12, 13 and 14. Further, the theory
was extended to include the single and triple substitutions. Thus their
results will differ from a full variational calculation only by neglect of
terms which arise from diagrams of the type found in Figure 15. While they
have used only a double zeta basis in order that a comparison with a
complete CI can be made, the agreement of the variational and variation-
496
M.A. ROBB
497
included p and d orbitals at the centroids of the bonds and lone pairs). The
diagonal pair-pair repulsions are positive and the off-diagonal ones are
negative. Note that the pair-pair repulsions are extremely sensitive to the
inclusion of the EPV rearrangement diagram as observed by Kelly (60.)
TABLE III
Pair
Pai r Corr.En.
CH 1-CH 1
/Pr.-Pr. repulsion
Pair
/non-diagOna1 pair-pair repulsion
- 0.0186
CH 1-CH 2
- 0.0076 / 0.0010
CH 1-F up
- 0.0053 / 0.0007
CH 1-CF 1 / 0.0001
CH 1-CF 2 / 0.0001
CH 1-F up '
- 0.0003 / + 0.0000
CH 1-CF 1
- 0.0008 / 0.0001
F2tp - FUp
- 0.022B
F2tP-FHp
- 0.0035 / 0.0005
F2ip-F2ip'
- 0.0178 / 0.0024
CF 2 - CF 2 / 0.0003
F2ip,-Fup"/ 0.0003
F2tp-F21p'/ - 0.0003
F2tp,-F2tp'/ - 0.0004
FUp,-F2ip"/ - 0.0007
- 0.0003 / + 0.0000
FUp-CF2
- 0.0228 / 0.0035
F2tp :CF1
- 0.0008 / + 0.0000
F2tp,-F2iP'
- 0.0092
CF 2-CF2 / - 0.0004
CF 2-CF 2 / 0.0001
F2ip,-FUp"/ - 0.0001
- 0.0000 / + 0.0000
F2ip ,-CF 2
- 0.0123 / 0.0013
F2ip' -CF 1
- 0.0001 / 0.0000
- 0.0092 / 0.0009
CF 2-CF 2 / 0.0002
F2ip,-FUp" / - 0.0006
CF 2-CF 2
CF 2-CF l
- 0.0104
- 0.0003 I 0.0000
Sum of Pai rs
Diagram 18a
Diagram 18b
Diagram 18c
- 0.3669
0.0180
- 0.0058
0.0058
F2tp,-F2ip'/ - 0.0001
F2ip,-F2ip,1 0.0001
499
l M0
- 0.Z6S7
- 0.Z564
- 0.Z601
- 0.Z498
Pair-pair repulsions
0.0031
O.OOlZ
0.0006
0.00Z7
DIAGRAM lSa B= i y= ;
0.016S
0.0154
- 0.0143
- 0.0063
- 0.0043
- 0.0011
B=
or i
DIAGRAM lSb
- 0.0069
0.0029
0.0010
0.0001
0.0010
CONCLUSION
It would appear that completely variational pair function calculations
500
M. A. ROBB
References
1) R.D. Mattuck, A Guide to Feynman Diagrams in the Many Body
Problem. (New York, Academic Press)
2) N.H. March, W.H. Young and S. Sampanthar, The Many Body Problem
. 315 (1964)
18) A.T. Amos and 1.I. Musher, 1. Chern. Phys. 54, 2380 (1971)
19) K.A. Brueckner, Phys. Rev. 100, 36 (1955)
501
A1450 (1964)
~,
ill,
307 (1960)
11,
33 (1970)
1 (1969)
Chem.~,
Chem.Phys.~,
341 (1971)
1017 (1973)
E,
437 (1973)
36) G.A. Van der Velde, PhD. Thesis, University of Groningen (1974)
37) R. Ahlrichs and W. Kutzelnigg, 1. Chern. Phys. 48, 1819 (1968)
38) M. 1 ungen and R. Ahlrichs, Theoret. Chim. Acta
339 (1970)
39) A.C. Hurley, 1. Lennard-lones and 1.A. Pople, Proc. Roy. Soc.
A220, 446 (1953)
40) R. McWeeny, Rev. Mod. Phys. 32, 355 (1960)
41) 1.M. Parks and R.G. Parr, 1. Chern. Phys. 28, 335 (1960)
502
M. A. ROBB
185 (1960)
43) M.A. Robb and l.G. Csizmadia, J. Chern. Phys. 54, 3646 (1971)
44) V.R. Saunders and M.F. Guest, in Quantum Chemistry - the State of
the Art ed. V.R. Saunders and J. Brown (Proceedings of SRC Atlas
Symposium H4, 1974)
45) E. Kapuy, Theoret. Chim. Acta 6, 281 (1966)
46) E. Kapuy, Acta Phys. Acad. Sci. Hung. 12, 351 (1960)
47) E.L. Mehler, J. Chern. Phys. 59, 3485 (1973)
48) M.A. Robb and I.G. Csizmadia, Int. J. Quantum Chern. 4, 36 (1970)
49) A. Veillard and E. Clementi, Theoret. Chim. Acta 7, 133 (1967)
50) R. McWeeny and E. Steiner, Advan. Quantum Chern. 2, 93 (1965)
51) A.W. Weiss, Phys. Rev. A3, 126 (1971)
52) J.W. Viers, F.E. Harris, and H.F. Schaffer, Phys. Rev. AI, 24 (1970)
53) E.R. Davidson and C.F. Bender, J. Chern. Phys. 56, 4334 (1970)
54) B. Zurawski, R. Ah1richs, and W. Kutze1nigg, Chern. Phys.
Letters
309 (1973)
390 (1967)
63) R.K. Nesbet, T.L. Barr and E.R. Davidson, Chern. Phys.
Letters 4, 203 (1969)
64) 1. PaId us, 1. Cizek, and I. Shavitt, Phys. Rev. AS, 50 (1972)
65) B. Roos, Chern. Phys. Letters 15, 153 (1973)
Acknowledgements
The author is indebted to his two research students, Miss S.P. Prime
and Mr. S. Chin for carrying out the numerical calculations on
H20 and CH 2F 2 .
503
R. McWeeny
Department of Chemistry,
The University, Sheffield S3 7HF, U.K.
1.
= vle l
In two
+ v 2 e 2 and the
Evidently
Diercksen et al. (eds.), Computational Techniques in Quantum Chemistry and Molecular Physics, 505-528.
All Rights Reserved Copyright 1975 by D. Reidel Publishing Company, Dordrecht-Holland.
506
R. McWEENY
We write
containing apples,
Suppose we take
3a + 2b + 5c
3a
selecting
o
We are constructing an algebraic representation of the operations,
introducing the rules as we go on;
3 apples, Oa
=0
The
(1.1)
507
(1.2)
where the null operator 0 produces from any set the null set.
If we introduce the identity operator 1, which leaves any set
unchanged, we discover another important property.
P f + Pbf + P f
3a + 2b + 5c
For
(distributive law)
f
this implies, since the result is true for any f, the operator
identity
1
(1. 3)
It simply
means that if we select all the subsets and then put them
together again we do not really change the set in any way!
Equations (1.1)-(1.3) summarize the general properties of
projection operators.
= Pa
+ P b is the
P abPc
1
2.
MATRIX REPRESENTATIONS
Let us now consider an m-dimensional Hermitian vector space,
508
R. McWEENY
Pe )
m
mm
(2.1)
The ith column thus contains the components of the new vector
e~
1.
We say IP is
ev
v'
IS
IP
IS
v'
(2.2)
= vt v =
1 and clearly
we obtain
vv
(2.3)
509
. t
= v ( vt v)v
= vv t
(2.4)
The matrices
~.
(2.6)
jp~
~
IP.
lP.iP .
~
IP 'IP'
= (() (ilj)
i=l
IP.
(2.7)
matrix~.
projection operators:
(2.8)
jpl
510
R.McWEENY
lP ,
lP,2
lP'
iPlP'=JP'IP=),
1P,+lP'
:0:
(2.9)
Only two subsets are now distinguished but the projection operators
which partition the space still have the basic properties (1.1),
(1. 2) and ( 1. 3)
Clearly
it is the trace
(2.10)
~1'2'
In such examples
of AO coefficients;
3.
511
2 tr
+ tr R G
(3.2)
(R
in (2.8), describing
The matrix
in (3.2)
is
defined in terms of R by
~(2R)
J(2R) - K(R)
(3.3)
Rk<XiXklgIXXj>
k,
K(R) ..
1J
k,
(3.4)
512
R.McWEENY
}-R
~2'
~l
(3.7)
R.AR. = R.BR.
1
(i,j
1,2)
(3.8)
R oR R
2
2
(l-R)o~(]-R)
=0
while the 1,2 and 2,1 components are undetermined and may be nonThe most general variation oR preserving idempotency to
zero.
first-order is thus
= R~(i-R)
+ (h.c.)
(3.9)
513
oE
2 tr oR (Ih+G)
and K matrices
(3.10)
tr 18 G(A)
2 tr oR Ih
The
oE
(l-R)1h R
IR ih (1L-1R)
(3.12)
arbitrary~,
leads to
= to
It/.
~)
~A'~B"")
514
R. McWEENY
~Tt.
using R
or, multiplying from the right by T and noting that the orthonormality conditions require rtT = 1,
(3.14)
where
Since
suitable choice of V;
of generality.
515
If the
We
4.
516
R. McWEENY
An example will
In dealing
12
{lp+I (r)
12
+ Ipo(r)
12
+ Ip_I (r)
12}
In
Another
m~
and ms
517
KKK
. $
~1,$2,
I [l
E
av
2~
HK
n K (nK-l)
mK (2mK-l)
Gal
II
nKn L GKL
mKm
L
K,L
~
(4.1)
where
HK
L <$~JhJ$~>
l.
l.
(4.2)
.
l.
GKL
i,j
K LJ g J $.$.>
K L - t<$.$.
K LJ g J$.$.>J
L K
[<$.$.
l.J
l.J
l.J
Jl.
(4.3)
518
R, McWEENY
orthonormal.
On defining a G matrix
av
in
the form
E
(4.4)
av
where
G(v'R ) +
KK
(4.5)
v'
(4.6)
= 0, under variations
av
+ aRK' is now accompanied by constraints which
~ +
(3.7
In terms
519
(all K,L)
(4.7)
~Z
(4.8)
~.
(all K)
~L
-x
KL
(4.11)
v tr o~ ~
KKK
(4.12)
520
R. McWEENY
to avoid
2:
av
tr IR \had
KKK
12: v K
K
tr ~(h + lli K)
(4.14)
OE
2:
2:
K(occ) L(fK)
v [tr
K
J(
IR Ih
KL L K
+ tr X
KL
~h ~l
(4.16)
each
~KL.
lRlhlR
K K Z
=1()
(4.17)
(all K)
(all K,L)
(4.18)
521
shells.
As in the closed-shell case, it is convenient to obtain the
solution of these equations by factorizing each R matrix in the
t
form ~ = ~K~K and showing that the columns of ~K can be chosen
as eigenvectors of a certain effective Hamiltonian;
this
We therefore seek a
(3.13
(all K)
(4.19)
rh.
~+~.
The eigenvectors
A matrix
corresponding to mixtures
IKKK
R zb R
KZ
(4.20)
~K
522
R. McWEENY
(4.21)
in (4.20)
is merely a
In
There is thus no
This
(e.g. the
(all K)
KL
1
(v -v )
(all K,L;
K<L)
(4.22)
From an
(4.20);
lh
is set up according to
During
523
RK~
if
~~
~,
then
belongs to
As in
Examples
have been given by Roothaan (1960), all for the case of a single
open shell, the best known being that of n l doubly-occupied
orbitals and n 2 singly occupied with all spins parallel.
Such states yield energy expressions of the form (4.4), apart
from a modification of the weights of the
the open-shell
5.
and K matrices in
matrix.
~TU
in which a pair of
~U.
~T
to an
524
R.McWEENY
q,
TU TU
(5.1)
a a
TU
<q,
IH I q, TU >
(5.2)
expression as
(5.3)
where T runs over the n orbitals in q, 0' and U over the "weak ly
occupied" orbitals in the q,TU (in general a subset of the
remaining m-n linearly independent functions).
~T
and
~u'
respectively;
The matrices
~~d
and
~T
and
with expansion
~U"
fRU
(5.4)
h~d
525
(5. S)
la TU
I2
~T
to
is p = a~ +
L
a2"
whilst that of a pair in
T 2
T
T' (;;tT),U T U . .
.
is PU = ~ a TU ; since
normal~zat~on requ~res
orbital
2
aO +
T,U
2
a
.TU
PT
[1 -
L aTul
Pu
L a TU
(5.6)
(5.7)
(1973) .
one pair in T and one in T', or with one in T and one in U, are
easily seen to be
1 -
(5.8)
WTT '
(5.9)
~U
526
R. McWEENY
\'
L V K tr
fR/tKad '
!had
K
Ih + .LQ;
2
(5.10)
L p [tr
K K
h + tr.R IS (IR ) 1
K-K
K
[R
(5.ll)
oE
L v tr h
KKK
olR
(5.13)
527
(5.14)
IhK = !h + (bK
The difference between IhK and
h~d
expansion (5.1).
same lines:
In:terms of the
(using
(5.15)
IH a
ell
(5.16)
528
R.McWEENY
iteration procedure.
have been striking (see, for example, Das & Wahl, 1970;
Wahl &
Das, 1970).
2.,
133
(in press)
~,
179
2'
261,
MOLECULES IN ASTROPHYSICS
G. Winnewisser
Max-Planck-Institut fur Radioastronomie,
D-53 Bonn, Germany
33
One solar mass 1 M
2 x ~Q g. In comparison
the mass of the earth i~ 6 x 1027g.
Diercksen et al. (eds.), Computational Techniques in Quantum Chemistry and Molecular Physics, 529-568.
All Rights Reserved. Copyright 1975 by D. Reidel Publishing Company, DordrechtHolland.
530
G. WINNEWISSER
MOLECULES IN ASTROPHYSICS
531
(2) Observations of different interstellar molecules and various transitions of anyone molecule provide an excellent tool for studying the physical state,
dynamics, radiation fields and chemistry within individual clouds. Many of these clouds appear to be likely sites for star formation.
(3) Maser emission spectra of OH, H2 0 and SiO allow the study of the early and late stages in the evolution of stars. In particular, this applies to the
study of red giant variable stars in the SiO lines,
whereas the OH, H2 0 maser appear to be primarily associated with the early phases of star formation.
(4) Optical and radio observatioBs provide an indirect method of measuring the intensity and spectrum
of the microwave background radiation.
(5) Molecules are now being increasingly used to
study galactic structure, in particular the galactic
nucleus. Thus, complementing and extending the information gained froM both the A 21 cm line of atomic hydrogen ann the recombination lines.
(6) The presently known composition of molecules
indicates that simple bio-molecules may exist in interstellar clouds. They may be detectable once their rest
frequencies are precisely known.
(7) The discovery of simple free radicals and
molecular ions in interstellar clouds allows the study
of their spectra and derivation of their structure,
which has not been possible+by laboratory techniques.
Examples are CH,CN,C 2 H, N2H .
In Table I we provide a complete list of all interstellar molecules detected at optical and radio frequencies up to September 1974. Interstellar transitions in the radio frequency portion of the electromaanetic spectrum are displayed in Fig. 1., together
with the laboratory spectra of three molecules, HCN,
H2 CO and I!2NCII. Detailed comparisons between detected
interstellar spectra and the rich rotational spectrum
of most molecules show that only a few transitions of
each individual molecule have been detected in interstellar space. Interstellar transitions occur in
emission or absorption. All molecular lines in the
millimeter wave region are seen in emission, whereas
at centimeter wavelenaths they can be seen both in
532
G. WINNEWISSER
a) Optical or UV detection
12 CH ,
12 cH +,
13 cn +, CN, CO,
13 CO , II 2 , IJD.
b) In Radioastronomical Techniaues
x-ogen
(HNC?)
(U
(U
(89190)
(90665)
...J
a
~
~
II:
>-
...
tii
II:
...J
...J
..c{ii:
II:~
HCN
2n-2'2
312-3,3
4'3-4'4
50
J=1-0
'~-112.10
5,,-:S,5
S,4-51S~1~000
J=1-0
III 1i
H'lc'4N \
1-Pc~l.i2C~
6,5':616
I
100
I
150
O'2C'4N
110-101
H'2C'~
J=3-2
I
200
;! I I~I I i
I 13~;-2~~':.>,j/~,-2203'2-2'1 I
4,,-4,.
~~f]'~r
322~221
,
I
GHz ]"=3-2
,----'-----,
Ii r Ii r
D12C'~ D'lc~
J=2-1
::c
::<l
'">-i
W
W
Vl
110-1"
H 2 CNH (==~:)
H2 CO
I---
I O'2C'4N
o'lc'''N
O'2C1SN
J=2-1
---"
>
~
~
'"z
534
G. WINNEWISSER
535
MOLECULES IN ASTROPHYSICS
=h
(Eu - E l )
r O ist the mo~ent of inertia perpendicular to the internuclear axis passing through the center of mass. J is
the rotational quantum number which gives the angular
momI~t~~ in unit~ ~~ h. Fig. 2 shows the energy levels
of
C Sand
C S toaether with the rotational
spectrum
~= ~(EJ+l
- EJ ) = 2
8,]"l?r o
-(J+l) = 2B o (J+l)
which is proportional to the reciprocal moment of inertia rO. The proportion~lity factor is given by
B x I = (5.053763)) x 10 MHz AMU $..2. B values of presently detected interstellar molecules extend over a
wide range of values. For example, light molecules such
as CH and OH have B values of 417.7 GHz and 555 GHz,
respectively. Theirofirst rotational transitions lie
therefore in the far infrared region. Heavy molecules
like cyanoacetylene, HCCCN, and carbonylsulfide, OC5,
have small values of B = 4.549 GHz and 6.081 GHz respectively. RotationalOtransitions of these molecules
occur at nearly regular intervals throughout the entire
microwave region.
The precise determination of the rotational constant from the microwave and millimeter wave spectra
allows a precise evaluation of the internuclear dis-
536
G. WINNEWISSER
cm- 1
J
4
15
195954
10
192818
3
146969
144617
2
97981
96413
1
48207
48990
~~~~~~~E ~I~+I
__I~+I__I~I-~~~I-+1~1,~~li~I,o
__
100
200
100
FREQUENCY(GHz)------
200
MOLECULES IN ASTROPHYSICS
537
r ro2
v=
538
G. WINNEWISSER
(3) The r
structure. The substitution bond length
is derived froffi isotopic substitution method. The effects of zero-point vibration are partially cancelled
but not actually removed.
(4) The(r> structure. The average bond distances
are evaluated from a partial correction of vibrational
effects involving the mean-square harmonic vibrational
amplitude.
In Table III results obtained from the various
methods of calculating the HCN bond distances are summarized and compared along with the dispersion in the
values.
COMPARISON OF DIFFERENT TYPES OF INTERATOMIC DISTANCES' IN HCN
r(C-H)
r,
ro
r, (with a
and y)c
r, (Rank et al.)e
r, (from 100,020,000'
1.06316
1.064
1.06549
1.06593
1.06573
~
~
~
~
r(C==N)
0.00010 b
O.Ollb
0.00024 d
1.15512
1.1564
1.15321
0.00010
0.00080 d
1.15313
1.15317
~
~
r(N-H)
0.00015 b
0.OO24b
0.OOOO5 d
2.21828
2.2205
2.21870
0.00002
0.00025 d
2.21906
2.21876
0.00012 b
0.OO90 b
O.OOO20 d
0.00008
O.OOO60 d
All calculations used I = 505376/B(MHz) and unified mass units given by L. A. KONIG,
J. H. E. MATTAUCH, and A. H. WAPSTRA, Nucl. Phys. 31. 18 (1962).
b The r, and ro distances are averages of the values obtained for all possible permutations of Bo values. The uncertainty is due to the dispersion of the calculated values and
does not allow for measurement errors.
c B,(HCN) = 44512.36:ol-: 0.23 MHz; B,eDCN) = 36329.49 ~ 1.53 MHz.
d The uncertainty is the estimated error limit from the Be values.
e See D. H. Rank, G. SKORINKO, D. P. EASTMAN, and T. A. WIGGINS, J. Opt. Soc. Amer.
60. 421 (1960).
f Calculated using Be = 272 Bo - 72 B100 - 72 B020 - 72 B001 for HCN, DCN, and DI3CN.
L.
= B
539
MOLECULES IN ASTROPHYSICS
Methinophosphide
Hydrogen cyanide
Cyanoacetylene
Fluoroacetylene
Fulminic acid
Nitrous oxide
1.0667(5)
H
H
H
H
1.063 16(10!
1.058
1.053
1.026~4!
C
C
C
C
C
N
1.5421(5)
1.155 12(15)
1.205
1.198
1.1679(4)
1.128 ~2)
N
C
C
N
N
1.378
1.279
F
0
0
1.1994(4)
1.187~3!
1.159
Note for example that the r (CH) of HCNO is anomalously short. The shrinkaqe by 8.0356 ~ of the CH bond
lenqth cannot be explained in terms of changing hybridization of the carbon atoms since the carbon atom in
both HCN and HeNO forms predominantly sp bonds, and
any other hybridization must lead to a longer CH distance. A reinterpretation of the rs structure parameters of fulminic acid in the light of the quasilinear
model leads to an explanation of the short CH bond
distance of 1.027 1\ as the projection of a cn bond
length of 1.060 (5) K upon the heavy atom axis (lVinnewisser et al. 1974).
b) Symmetric and asymmetric rotor molecules: For nonlinear molecules, the rotational energies and the resulting spectra are more complex. The inertial properties of the molecule have to be described by the three
moments of inertia
10 =
~
8~
1,
10 =
h 1
8".](2 B
10 =
8:Jf2 Co
0
0
with the definition Ia ~ Ib= I c Ao,B o ' Co are the three
rotational constants.
If two of the three moments of inertia are equal
Ib< Ie' the molecule is a symmetric top, such as
Cft 3 CN, CH CCH, NH3 and others. The three rotational
degrees of freedom are quantum mechanically described
by a complete set of commuting operators: the Hamiltonian operator H, the square of the total angular
momentrum operator p2 and its projection on the figure
axis P z . The rotational energy can therefore be expressed by the quantum number J of the total angular
momentrum and K its projection along the symmetry axis:
I
with J'9 K
540
G. WINNEWISSER
I
is the moment of inertia about the a-axis (symmetry
a~is) and Ib that about the b-axis which is perpendicular to the a-axis. Note that there exists a degeneracy in the K levels, due to the values of -K and +K
yielding the same energy. The selection rules are 6J =
+ 1 and in addition on account of symmetryAK = o.
Therefore each line of the rotational spectrum consists
of J+l components which coincide for the rigid molecule,
but a+e separated if centrifugal distortion is present.
An example of such a spectrum is presented in Fig. 3,
where the laboratory spectrum of CH 3CN is compared
with the astrophysical spectrum (Solomon et ale 1971,
1973)
With all three moments of inertia different,
I
Ib~I , the molecule is classified as an asymmetric
t8p. ThenCthe Hamiltonian is
541
MOLECULES IN ASTROPHYSICS
y (GHz)
110.390
0.370
0.350
0.330
59' B
V = 63 km/sec
0.80
Interstellar
space
0.60
0.40
J=6--S
0.20
0.00
- 0.20
_ _ _ _U - - L_ _
____
____
_ _ _ _ _ _ _ _- J
Laboratory
J=6-5
Fig. 3: Comparison of the interstellar emission spectrum of CH 3 CN with the laboratory absorption spectrum
of the J = 6 - 5 transition. The interstellar emission
spectrum is seen in the direction of Sagittarius B2,
and shows the various values of K (Solomon et al.,
1971). The line shapes of the laboratory spectra resemble
first. -. ---'-derivative.
---_. a ._.,-_.
..._--------------------
__
__
_._-_
56 312.9(190)
H C 2 HNH 2
e hylam1ne
1)
a)
b)
c)
d)
e)
f)
g)
62 927.723(9)
62 036.112(19)
6 134.260 6(8)
6 125 305 6(5)
11 071.010 27(62)
10 034.756(58)
5 584.753 9(8)
5 569.643 7(5)
10 910.577 48(64)
8 564.930(60)
4 868.940 21(25)
g
g
4 499.510 7(69)
4 513.828 9(8)
Ref.
9 915.239 6(62)
.j::..
:s
:s
Figures in paranfhesis are standard errors in units of the last significant figures
Johns,J.W.C.,Stone,J.M.R.and Winnewisser,G,J.Mo1ec.Spectrosc.42,523,1972.
(;')
Winnewisser, G., J.Molec.Spectrosc.46,16,1973.
-Gerry,M.C.L., and l'Jinnewisser,G., 48,1,1973.(J .Molec.Spectrosc.)
z
Yamada,K., and h'innewisser, M. (private communication) 1974.
zt"l
Lovas,F., Clark, F.O. and Tiemann,E. (private communication) 1974.
on
Hocking,W.H. ,Gerry,M.C.L., and Winnewisser,G.,Cag.J.Phys.(to be published) 1975.
on
t"l
~
Hocking,l'l1.H., and ~Hnnewisser,G., J.C.S. Chem.Comm. (in press) 1975.
cis
trans
mpnothioformic acid
HCOSH
HNCO
isocyanic acid
912 712.288(136)
51 479.457 8(30)
H C 2 HNC
vfnyl-iso-cyanide
4 971.214 1(8)
49 85-0. 700 ( 13 )
H2 C 2 HCN
vl.ny1 cyanide
5 386.648 56(25)
4 826.301 4(73)
10 293.972 2(59)
68 035.299 4(429)
HCCCHO
propyna1
H2 C2 O
ketene
Molecule
Vl
tv
543
MOLECULES IN ASTROPHYSICS
Parameter
Centrifuqal Distortion
Constants (MHz)
AJ
AJK
AK
SJ
OK
HJ
EJK
HKJ
-3
(2.2448 + 0.0021) x 10
(-8.5442 + 0.0018) x 10 -2
2.7183 + 0.0021
-4
(4.5716 + 0.0037) x 10
(2.4575 + 0.0097) x 10 -2
(4.73
x 10 -9
+
- 2.0
(2.30
x 10 -7
+
- 1.7
(-9.03
x 10 -6
+ 0.60
-4
HK
4.51
+ 0.80
x 10
hJ
3.38
+. 0.83
x 10- 9
hJK
(-3.25
+ 3.25
x 10
hK
+ 2.2
-5
x 10
7.49
-7
544
C. WINNEWISSER
Paral"1.eter
Value
Nuclear 0uadrupo1e
Coun1inq Constants
-3.74
0.127
Dipole t10ment
.a (D)
f-b ( D)
f'L
(D)
3.68
1.25
3.89 + 0.08
545
MOLECULES IN ASTROPHYSICS
I
0
iii
l<:
I
Ii)
Iii
Iii
iii
l<:
Iii
Ii)
l<:
Iii
Iii
1""1'"
l<:
Ii)
.
~
Ii)
I iii
iii
l<:
Ii)
J Iii
Iii
Ii)
0
0
CD
I'lilt!
N
>:
?i
"
/"
0
0
"
"
t./[i''.
en
co
I
en
S2
J-,
J-,
"
u
z
:r
0
0
;0
. i~r-'
",
.
,
-1~\\r
\
\ \
0
>:
~
:r
>:
/"
"
Z
0
co
I
en
"
J-,
:r
>:
N
\l{l/1
0
0
"It,
~'lIC''iIl'-III
-x "--x..
"--M---X
0
0
co
I
en
::E
;:;
"
J-,
z
u
:r
u
U
:r
~
0
"
co
I
z
u
:r
u
u
:r
0
0
CD
I
Fig. 4: Comparison between some a-type R-branch transitions of molecules with potential astrophysical interest. For each K-component the calculated rigid rotor
positions (x) and the laboratory measured (.) line positions taking centrifugal distortion into account.
The length and direction of the arrOvlS indicate the
effect of centrifugal distortion.
546
G. WINNEWISSER
1.0
..
\
.
1.209 A
~a=1.57D
40.5t
-0.5
+0.5
~ 1.166A
1.0
Q(Al
Fig. 5: Molcecular structure and positions of the atoMs in the principal axis system
of isocyanic acid. The direction of the total electric dipole moment is almost coincident with the NH bond.
1282 1 /
0986~~
""
~=2.07D
~b=1.35D
b(Al
.j:>.
--..l
Ul
en
;=;
::c
-<
en
;d
:>en
en
?ttl
(j
b;
548
G. WlNNEWlSSER
549
MOLECULES IN ASTROPHYSICS
260]
'i'
ATOMIC HYDROGEN
200 ti
~z 2.0
TOTAL MASS
IOu
>-
is
IS!
-'
~
100
1.5
rt
~ 1.0
0.5
0.1
20-(IJ Ikpc( 15
10
10
'R
.::L
......
!!
10
15 (iZ/kpc(-20
~ GLOBULAR CLUSTERS
.'
1
0
- 1
.....:.::: :::::;:;tl*;;;::,;;;s.;;
STARS
..
- 5
-10
2Cl-{t;l/kpc]15
10
10
15
20
550
G. WINNEWISSER
stars form a spherical dense region, known as the nuclear bulge. The oldest objects in the Galaxy, the
globular clusters show an approximate spherical distribution, interpreted as the original shape of the giant
cloud out of which the Galaxy formed. Globular clusters
are deficient in heavy elements.
a) Interstellar Clouds
Nearly all information about ISM and in particular the cold gas component has been gained from observations in our own Galaxy by means of spectroscopic methods. In the past ISM has generally been considered to consist mainly of atoms and ions, with an
average density of about 1 atom per em 3 . The gases are
ionized by the UV radiation from stars producing electrons and ions spanning a temperature range Tk =
500 to 10 000 K. There exist, however, huge amounts of
neutral atomic hydrogen II and large quantities of molecular hydrogen, H2 . Thus, in addition to the hot and
frequently ionized and te~ous gas, the neutral component is found to prevail in clouds of higher density
(n H > 10 cm 3 ). This gas is at considerably lower tempera~ures (T N 400 0 K). The gas can essentially exist
in two sta~le conditions. The coexistence of the two
phases (two component model) can be explained by the
fact that once the gas density increases, collisions
allow it to cool more efficiently. Once this process
was triggered it accelerates itself by very efficient
cooling and simultaneously intercepting less heating
energy from the outside. Thus a low density cloud starts
to contract pvobably into the high density clouds. It
is in these high density clouds where the complex molecules are being detected and where star formation
is expected to take place. Simultaneously the dust becomes dense enough to make the cloud appear "dark" or
"black" by screening the light from background stars.
This is the primary reason why it is not possible to
investigate them by optical astronomy.
However, with the discovery of polyatomic molecules these dark clouds have received in the last 5 years
considerable attention. The microwave and infrared radiation which emanate from these clouds can penetrate
them with ease, be~ause the size of the dust particles
of radii a ~ 0.15A is small compared to the wavelength
of this radiation. Light and some of the microwave radiation come through the earth's atmosphere, whereas
in case of infrared much of the radiation is blocked
by water vapor and carbon dioxide in the atmosphere.
MOLECULES IN ASTROPHYSICS
551
b) Molecular Clouds
Although there exists no standard interstellar
cloud it is possible to devide them roughly into two
types of clouds. Observations show that the majority
of the normal dust clouds or dark clouds (with a temperature fV 3K L.. Tk :E:. 50 K) , density 10fYl H ~ 103cm -3)
show the spectra of relatively simple
2 molecules
such as CO, OIl, and the centimeter transitions of H2 CO.
The observed lines are generally fairly narrow their
line width corresponding to the thermal motion of the
gas. Molecular lines which need higher excitation such
as the millimeter wave transitions of HCN, H2CO and
others are generally not seen.
The wide distribution of OH,CO,HCN,H2CO,NH3 and
others allows some of their molecular transitions
to be used towards extensive surveys. The first resuIts show that the distribution of molecules is similar to atomic hydrogen. Molecules are concentrated
towards the galactic plane in a fairly thin layer of
less than N 300 pc. ~Hthin this flat disk their distribution is fairly common with an apparent concentration
towards the galactic nucleus. The more complex molecules are restricted to a smaller number of fairly
dense and cool condensations. These qre the massive
molecular clouds which exhibit the very rich and
often very intense molecular spectra. Particularly
rich molecular sources are Sgr B2, Sgr A, the Orion
complex, ~\f51,T'J3,DR21 IRC+l0216. Their seizes and
masses as determined from their molecular lines dwarf
the giant HII regions often associated with them. The
galactic positions of both nIl regions and molecular
sources are shown in Fig. 7.
An HII region is an almost fully ionized cloud of
gas and dust. In most cases they are ionized by the
UV radiation of one or more hot stars embedded in the
cloud. A very readable account of recent radio observations of galactic HII regions has been given by
Churchwell (1974),
Systematic surveys of molecular clouds are being
conducted by use of various molecules. In this context CO is of special importance, due its high abundance and the ease with which it can be excited at
relatively low gas densities. Liszt et al. (1974)find
that th~ CO line intensities decrease smoothly with
increasing distance away from the dense central region, where usually also other molecules with much
552
G. WINNEWISSER
100
ORION 16
8
80
10
60
SGRA 11
SGR B2 20
3400
20
553
MOLECULES IN ASTROPHYSICS
tures have to be consinered: the excitation temperature Tex of a molecule, defined by the relative population of at least two levels, the kinetic temperature,
Tk,correspondin~ to the Max1.<Tellian velocity distribution of the gas particles, the radiation temperature,
Tr , . assuMing a black body radiation distribution and
finally the temperature of the dust particles Tdust.
Thermodynamic equilibrium is the exception in interstellar space and therefore all four temperatures will
usually be different. The non-equilibrium conditions
are likely to be caused in part by the delicate balance between various mechanisms of excitation and deexcitation of molecular energy levels. These include
collisions with other particles, radiative transitions
and in part the molecular formation process itself.
a) Rotational excitation temperature and kinetic temperature.
The excitation temperature Tex scans a very wide
range of values from temperatures just above the background radiation to more than 50 K in HII reaions.
Since the excitation temperature is a measure of the
popUlation difference between the two levels associated with the transition one obtains,
a--u /01
. ,. exp(-hVu e/kT ex )
where n is the number of molecules in the energy levels and g their statistical weigths. The subscripts
refer to the upper and lower levels. Statistical equilibrium requires that the number of transitions u-+~
equals the number of
inverse transitions
nu (A ue + ButUv,+C u ) = /)t. (Btu u" + Cluj
where A ann B are the Einstein coefficients of induced
and spontaneous emission as well as absorption. u'if is
the energy density of the radiation field, corresponding to Uv = (4 Jt"/c) Bv TJ;' where in the general ISM,
T = 2.7 K. The C coefficlents describe the collisional
iftduced transitions:
Cut = n <Out. v)
554
G. WINNEWISSER
STvoLv
o
=_C_
.eA.(L.e.N.t(1-<tf'Y1.-.-/~.(,I-I)'t-.e.)
81C kv Cf .e
555
MOLECULES IN ASTROPHYSICS
(r
In the small central cores associated with infrared objects the kinetic temperature may reach 150 K
as has been deduced from short lived, highly excited
states in polyatomic molecules like NH 3 , CH 3CN and
HNCO. On the average, however, one conludes that for
the Orion cloud Tk ~ 50 K, for Sgr B2 Tk~30 K and for
black clouds Tk~ 30K.
d) Densities, sizes and masses
The kinetic temperature Tk , the total mass and
the number density of a cloud are the most important
quantities for understanding cloud fragmentation and
the formation of star clusters. The total mass of a
cloud is calculated from the hydrogen density, which
in interstellar clouds ranqe between 10 3 and 10 7 molecules per em 3 . These values are determined by using
one of two methods: (1) the observed molecular column
densities are correlated with hydrogen densities,
(2) the measured excitation of rotational transitions
can be used to determine hydrogen densities. Both
methods are in fair agreement.
For an optically thin cloud the molecular abundance can be determined. t'i'i th
1, one obtains
for the integrated observed briqhtness temperature
of an emission line
dO
Evaluation of the riqht hand side in terms of the absorption coefficient yields
J
dO
3tt
.-L T c{\)
.e.. = 811~c:a-~ f-L1.. \12. 0 L
vlhere Nt represents the number of molecules in the
lower enerqy level contained in a column of cross
section 1 cm 3 along the line of sight. NL can be related to the total column density N, by evaluating
the rotational partition function. For a diatomic or
linear molecule, this can be carried out in closed form,
and one obtains:
556
G. WINNEWISSER
MOLECULES IN ASTROPHYSICS
557
11"
Il:!
g;
Oi
o
Z
~
5
z
()
'"r
"-<=<
'"
J!
\;'
=I~
:r
.!'"
.!'J
!1l
<f9
'f
fl: ellS
II
=~
:r
'h
, ..,'"
()
0
()
V>
",I
....,I
if
I~
c.
:rl~~
z,<
. .::r.
"'"I"s.,'~
o :r
(-,.0
!!! o
~ I~,
:r
):
w""
:r
51
lS
..!J
o~
"o
"
~,
~,
o>.J,
....
=J,~
, 8
"
:r
0"
o~ ~
z :r
:r z
Ig
~~~
C;
8 m
"....
'?)
:r
z
wr.~g~
lf9~~l?l
~ ~
01
~~ ()~~=i
:-1
~ : ~!! ~
1,5
I\J
3
VJ..
.., ':l
~
0>
'"
I~
'll
'"
<n.
....
'"
co
W
N
I~
IV
(ll
G>
::0
(fl
00
Ul
Ul
Fiq. 8: Continuum radiation and molecular distribution in declination for Sgr B2.
~
Th~ top part shows the brigthness distribution at 10.6 GHz and 5 GlIz in different
~
spatial resolution obtained with the Effelsberg loo-m telescope (D.Downes) and the
~
Cambridge One Mile Telescope (Martin and Downes, 1972). The molecular distribution
~
shows that transitions from short lived energy levels are excited only in the central ~
core of the source, whereas other transitions are seen over a wide range.
'"
'"
0-
"' ....
iD",
g:
c'"
zO>
'"co
iz
ill
kj
'"
'"~
:r
559
MOLECULES IN ASTROPHYSICS
E
cm- 1
413
4572 MHz
3
3
404
QS.Hz
~
N
-200
i- -200kHz
1371.829MHz
(a)
W
0:
::>
....
e{
0:
W
Il.
~
W
....
e{
""
In
1-----
w
....
z
e{
(b)
""o
In
1
13720
-
13715
FREQUENCY (MHz)
Size
6' -10'
~4
(>12?)
n
10- 1
2"_41
(>10 12 )
20
60
;.5
10 4
",2
<:5
<0.001"
0.7 <R<1.2
10- 5<R<4xl0- 1
2xl0- 6
0.5'- 3'
2~n
ex
<0
1700
>60
<0
500
0.3-60
( K)
700
T rad
o
500
105
~106
10 2 -10 6
10 2
(M
Mass
The numbers of these tables are taken from various authors. Particularly I would
like to mention
Martin,
A.H.M. and Downes, D., Astrophvs.
Lett.ll,2l9,1972.
1
.
" Lelles, C., Ann.Rev. Astron. Astrophvs. 9, 293, 1971.
Mezger, P.G., Proc. 2nd Adv. Course in Astr.and Astrophys.
Interstellar Matter: An. Observers View, Geneva Observ. 1972.
ionized regions
source
maser point
core region
genera 1
a) Molecular cloud
3) Or ion
0.1 < 10
211_41
ionized regions
50
(>12?)
20-80
~8
50
30
<10
"'3
- 5
Tkin
20<R<40
"'3
(n H ) (cm- 3 )
2
Density in powers of 10
1.5 <R< 6
<5xlO- 3
2'
<0.1"
0.5' -
25<R<50
maser point
source
6'
core region
- 14'
- IS'
gene ra 1
b) Molecular cloud
a) Dust cloud
8'
10<R<30
2)~
0.03<R<10
( pc)
1 i nea r
Black clouds
angular
(' or II)
1) Dust clouds
Object
tn
o
;s
0\
VI
MOLECULES IN ASTROPHYSICS
561
562
G. WINNEWISSER
in Laboratory
in Interstellar Space
Gas Phase Reactions
I-~IBIRTHI
I
I
I
---,
--;;=lD~
-
I ,..,10- 2 -104 hr
P
U
M
I
I
P
I
N
G
4-
1.......- -......
!~-IDEATHI
Photo-Dissociation
I
I
I
~-A-~-----------;
D
Steady State
10
-3 sec
(Boltzmann
Distribution)
Chemical Reaction
MOLECULES IN ASTROPHYSICS
563
564
G. WINNEWISSER
tions rate constants are needed. They are either determined experimentally or estimated on theoretical
grounds. Their precise determination often poses
serious difficulties and the possibility that all relevant reactions are not included in the treatment
adds considerable uncertainty to all gas-phase calculations. Barsuhn (1974) finds that by fitting the
OAO-C observations of n(H2)/ntotal and ~dus~to model
calculations that the density of these clouas must
lie between 50 <.. n (H 2 ) 1000.
Calculations considering formation on grain
surfaces (Watson and Salpeter, 1972) are considerably
more undefined and uncertain since the relevant physical processes on the surfaces of interstellar dust
qrains are still largely speculative. The information
nee'ded to describe the processes, include: condensation of the qas onto grains, sticking probability,
surface mobility, reaction rates, ejection mechanisms
and rates for molecules newly formed on grain surface.
Despite the high uncertainty whether polyatomic molecules can be formed this way, it is almost certain
that H2 is formed from atomic hydrogen on interstellar
grain surfaces (Hollenbach et al., 1974), but it is
uncertain whether polyatomi C molecules can form in
this way.
III. FUTURE PROSPECTS
The very existence of molecules in interstellar
space has provided considerable new insight into the
interstellar chemistry of our Galaxy. By providing
birth and shelter to molecules like HCN, H2 0, NH3
and H2 CO, known to be important in reactions which
synthesize amino acids , the interstellar environment
is not nearly as hostile as had oriqinally been assumed. Chemical evolution has clearly advanced much
further than had been expected !5the cold (~20 K)
and hiqh vacuum conditions (",10
Torr) of interstellar clouds.
It seems to me that the most important aspect of
the detection of interstellar molecular spectra is
that they provide probes into the interior of clouds.
Especiallv the millimeter wave emission and maser
emission emanate deep from the core of very dense
and often giant molecular clouds. Some of the molecular radiation comes from molecular species presently
outside the reach of laboratory spectroscopy.Thus two
MOLECULES IN ASTROPHYSICS
565
566
G. WINNEWISSER
REFERENCES
Barsuhn, J., Mitt.Astron.Gesellsch. 35,197,1974.
Buhl, D., Snyder, L.E. ,189, L31,1974-.Carrington, A., Microwave-Spectroscopy of Free Radicals, London, Academic Press 1974.
Cheung, A.C., Rank, D.M., Townes, C.H., Thornton, D.
D. and Welch,W.J., Phys.Rev.Lett. 21,
1701,1968.
-Cheung, A.C., Rank, D.M., Townes,C.H., Thornton, D.O.,
and Welch, W.J., Nature 221, 626,1969.
Churchwell, E., Ph.D.Thesis, Univ.of Indlana, 1970.
Churchwell, E., I.A.U. Symposium 60, ed. F.J.Kerr and
Simonson, 1974.
Costain, C.C., J.Chem.Phys.29, 864, 1958.
De Lucia, F.C., Cook, R.L.,-nelminger, D., and Gordy,
~"J., J.Chem.Phys. 55, 5334,197l.
Fourikis, N., Takagi, K. and Morimoto, N., Astrophys.
J. 191,L139, 1974.
Gardner, F.F., Ribe~J.C., Sinclair, M.W., Astrophys.
J. 169, Ll09,1971.
Gardner, F.F., and i"Vinnewisser, G., Astrophys.J. (in
press) 1975.
Gerry, M.C.L. and Winnewisser, G., J.Mol.Spectrosc.48,
1,1973.
-Green, S., Montgomery Jr., J.A. and Thaddeus, P.,
Astrophys.J.,193, L89,1974.
Helminger, P., Cook, R.L. and De Lucia, F.C., J.Mol.
Spectrosc. 40, 125,1971.
Herbst, E., and Klemperer, ~, Astrophys.J. 185,505,
1973.
-Herzberg G.,The Spectra and Structures of Simple Free
Radicals (Ithaca, Cornell University
Press.
Hills, R., Janssen, M.A., Thornton, D.O., Welch W.J.,
Astrophys.J. 175, L59, 1972.
Hocking, TT.H., Gerry, M.C.L. and lvinnewisser, G., Astrophys. J. 187, L89,1974.
Hocking, W.H., Gerry, M.C.L. anrl Winnewisser, G., to
be published Canad.J.Phys. 1975.
Hollenbach, D.J., Werner, M.W., Salpeter, E.E., Astrophys. J. 163,'165, 1971.
Kaifu, N., Morimoto, M., Nagane, K., Akabane, K., and
Takagi, K., Astrophys.J. 19l,L135,1974.
Kewley, R., Sastry, leV.L.N., Hinnewisser-;t:L and Gordy,
t-!., J.Chem.Phys. 39,2856,1963.
King, G.I':., Hainer, R.M., and Cros-s, P.C., J.Chem.Phys.
II, 27, 1943.
Kirchhoff, li!.H.,-J'.Mol.Spectrosc. Q, 333, 1972.
MOLECULES IN ASTROPHYSICS
567
568
G. WINNEWISSER