Escolar Documentos
Profissional Documentos
Cultura Documentos
1.1
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1.2
1.2.1
1.2.2
1.2.3
1.2.4
Wiens Laws . . . . . . . . . . . . . . . . . . . . . . . . . .
1.2.5
1.2.6
1.2.7
1.2.8
1.3
Photoelectric effect . . . . . . . . . . . . . . . . . . . . . . . . . . .
1.4
Compton effect . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1.5
10
1.5.1
Davisson-Germer experiment . . . . . . . . . . . . . . . . .
11
1.5.2
13
1.5.3
14
1.5.4
14
15
1.6.1
16
1.6.2
17
1.6.3
18
19
1.7.1
19
1.7.2
21
1.7.3
22
1.7.4
22
1.6
1.7
1.8
1.9
Wave mechanics . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
24
1.8.1
25
1.8.2
25
1.8.3
26
1.8.4
27
28
1.9.1
28
1.9.2
Particle in a box . . . . . . . . . . . . . . . . . . . . . . . .
28
1.9.3
32
1.9.4
Tunnel effect . . . . . . . . . . . . . . . . . . . . . . . . . .
35
1.9.5
37
1.9.6
39
1.9.7
Harmonic oscillator . . . . . . . . . . . . . . . . . . . . . . .
40
1.9.8
42
Numerical Examples . . . . . . . . . . . . . . . . . . . . . .
43
Exercise . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
51
2.2
55
Crystal Structure . . . . . . . . . . . . . . . . . . . . . . . . . . . .
55
2.1.1
55
2.1.2
Crystal systems . . . . . . . . . . . . . . . . . . . . . . . . .
57
2.1.3
Bravais lattices . . . . . . . . . . . . . . . . . . . . . . . . .
59
2.1.4
60
2.1.5
63
2.1.6
65
2.1.7
68
X-Rays . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
72
2.2.1
Origin of x-rays . . . . . . . . . . . . . . . . . . . . . . . . .
72
2.2.2
73
2.2.3
75
2.2.4
Moseleys law . . . . . . . . . . . . . . . . . . . . . . . . .
77
2.3
2.4
2.5
X-ray diffraction . . . . . . . .
2.3.1 Braggs law . . . . . . .
2.3.2 Braggs spectrometer . .
2.3.3 Structure determination .
Electron diffraction . . . . . . .
Neutron diffraction . . . . . . .
Numerical Examples . . . . . .
Exercise . . . . . . . . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
78
78
79
80
81
82
83
87
. . . . . . . .
89
89
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
90
90
95
96
97
100
103
104
105
106
107
108
110
112
115
.
.
.
.
117
117
118
119
119
4.3
4.4
4.5
4.6
5 Semiconductors
132
5.1
5.2
5.3
5.2.1
5.2.2
5.2.3
5.2.4
5.3.2
5.3.3
5.4
5.5
5.6
5.7
5.6.2
5.6.3
5.7.2
5.7.3
5.7.4
5.8
5.9
5.8.2
5.9.2
5.9.3
5.9.4
5.9.5
182
6.1
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 182
6.2
Polarization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 184
6.3
6.2.1
6.2.2
6.2.3
6.3.2
6.3.3
6.3.4
6.3.5
6.3.6
6.4
6.5
6.6
7 Magnetic Properties
7.1 Introduction . . . . . . . . . . . . . . .
7.2 Classification of magnetic materials . .
7.3 Origin of permanent dipoles . . . . . .
7.4 Magnetic hysteresis . . . . . . . . . . .
7.5 Hard and soft magnetic materials . . . .
7.6 Metallic and ceramic magnetic materials
7.7 Ferrites . . . . . . . . . . . . . . . . .
7.8 Applications of magnetic materials . . .
Numerical Examples . . . . . . . . . .
Exercise . . . . . . . . . . . . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
8 Applied Optics
8.1 Absorption and emission of radiation . . . . . . . .
8.1.1 Luminescence . . . . . . . . . . . . . . .
8.1.2 Induced absorption . . . . . . . . . . . . .
8.1.3 Spontaneous emission . . . . . . . . . . .
8.1.4 Stimulated emission . . . . . . . . . . . .
8.2 Lasers - basic principles . . . . . . . . . . . . . . .
8.2.1 Einsteins theory of stimulated emission .
8.2.2 Conditions for laser action . . . . . . . . .
8.2.3 Methods of achieving population inversion
8.2.4 Requirements of a laser system . . . . . . .
8.3 Types Of Lasers . . . . . . . . . . . . . . . . . . .
8.3.1 Ruby Laser . . . . . . . . . . . . . . . . .
8.3.2 Helium-Neon laser . . . . . . . . . . . . .
8.3.3 Semiconductor diode laser . . . . . . . . .
8.4 Applications of lasers . . . . . . . . . . . . . . . .
8.4.1 Industrial applications . . . . . . . . . . .
8.4.2 Medical applications . . . . . . . . . . . .
8.4.3 Estimation of atmospheric pollution . . . .
8.4.4 Applications in basic sciences . . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
209
209
210
212
216
218
219
220
221
222
224
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
225
225
225
226
227
227
228
228
230
231
231
234
234
236
237
239
239
240
240
240
8.5
8.6
8.7
Holography . . . . . . . . . . . . . . . . . . . . .
8.5.1 Recording of holograms . . . . . . . . . .
8.5.2 Reconstruction of images . . . . . . . . . .
8.5.3 Applications of holography . . . . . . . . .
Optical fibers . . . . . . . . . . . . . . . . . . . .
8.6.1 Materials for optical fibers . . . . . . . . .
8.6.2 Propagation of light through an optical fiber
8.6.3 Modes of propagation in a fiber . . . . . .
8.6.4 Signal distortion in optical fibers . . . . . .
8.6.5 Signal attenuation in optical fibers . . . . .
Applications of optical fibers . . . . . . . . . . . .
8.7.1 Fiber optic communication . . . . . . . . .
8.7.2 Applications in medicine and industry . . .
Numerical Examples . . . . . . . . . . . .
Exercise . . . . . . . . . . . . . . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
241
241
242
243
243
243
244
247
249
249
250
250
252
252
254
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
256
256
256
257
257
258
258
259
260
261
263
263
265
265
267
269
9.7
9.8
9.9
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
271
273
274
274
275
278
279
279
280
283
284
284
284
285
285
287
288
289
.
.
.
.
.
.
.
.
.
.
291
291
291
292
293
294
296
299
301
301
303
Chapter 1
Introduction
The nineteenth century was a very eventful period as far as Physics is concerned. The
pioneering work on dynamics by Newton, on electromagnetic theory by Maxwell, laws
of thermodynamics and kinetic theory were successful in explaining a wide variety of
phenomena. Even though a majority of experimental evidence agreed with the classical physics, a few experiments gave results that could not be explained satisfactorily.
These few experiments led to the development of modern physics. Modern physics
refers to the development of the theory of relativity and the quantum theory. Inability of the classical concepts to explain certain experimental observations, especially
those involving subatomic particles, led to the formulation and development of modern physics. Early twentieth century saw the development of modern physics. The
pioneering work of Einstein, Planck, Compton, Roentgen, Born and others formed
the basis of modern physics. The dual nature of matter proposed by de Broglie was
confirmed by experiments. The wave mechanics and quantum mechanics were later
shown to be identical in their mathematical formulation. The validity of classical concepts was explained to be the result of an extrapolation of modern theories to classical
situations. In the present chapter, experimental observations of three important phenomena black body radiation, photoelectric effect and Compton effect considered
as the beginning of modern physics, are briefly described.
1.2
When radiation is incident on material objects, it is either absorbed, reflected or transmitted. These processes are dependent on the radiation and the object involved. An
object that is capable of absorbing all radiation incident on it is called a black body.
Practically, we cannot have a perfect black body but can have objects that are only
close to a black body.
For example, a black body can be approximated by a hollow object with a very
small hole leading to the inside of the object. Any radiation that enters the object
through the hole gets trapped inside and will be reflected by the walls of the cavity till
it is absorbed. Objects that absorb a particular wavelength of radiation are also
1
T2
E
T3
(iii) As the temperature increases, the peak shifts to lower wavelengths. In other
words, at higher temperatures, maximum energy is emitted at lower wavelengths.
1.2.2 Laws of black body radiation
The initial attempts to explain black body radiation were based on classical theories
and were found to be limited in application. They could not explain the entire spectrum
of the radiation satisfactorily.
1.2.3 Stefan Boltzmann radiation law
It states that the total energy density E of radiation emitted from a black body is
directly proportional to the fourth power of its absolute temperature T . Energy density
E 0 is defined as the total of all the energy emitted at all wavelengths per unit area
of the emitter surface.
E T 4
or
E = T 4
(1.1)
m 1/T or m T = b
(1.2)
where b is called the Wiens constant and is equal to 2.9 103 mK. The energy
density emitted by a black body in the wavelength range and + d is given by
E d = c1 5 exp (c2 /T )d
(1.3)
where c1 and c2 are constants. This is known as Wiens distribution law. This law
holds good for smaller values of but does not fit the experimental curves for higher
values of (fig 1.2).
Wien s law
E
R J law
This distribution agrees well with the experimental observation of black body radiation and is valid for all wavelengths. Further, it reduces to Wiens law for lower
wavelength region and to Rayleigh Jeans law for higher wavelength region.
1.2.7 Derivation of Wiens law from Plancks law
When is small, we can consider
ehc/kT > 1
5
E d = c1 . exp (c2 /T )d
E d =
i.e.,
(1.6)
hc
kT
< 1.
h
i
ehc/kT 1 hc/kT
8hc kT
d
5
hc
8kT
E d =
d
4
E d =
i.e.,
(1.7)
1.3
Photoelectric effect
Emission of electrons from a metal surface when light of suitable energy falls on it is
called Photoelectric effect. The experimental setup for observing photoelectric effect
consists of a pair of metal plate electrodes in an evacuated tube connected to a source
of variable voltage as shown in fig.1.4.
When light of suitable energy is incident on the cathode, electrons are emitted and
a current flows across the tube. The characteristic curves for the photoelectric emission
as shown in fig. 1.5.
The important properties of the emission are as follows:
(i) There is no time interval between the incidence of light and the emission of photoelectrons.
V
A
L3
L2
L1
(iv) The photoelectron emission can be stopped by applying a reverse voltage to the
phototube, i.e. by making the emitter electrode positive and the collector negative. This reverse voltage is independent of the intensity of incident radiation but
increases with increase in the frequency of incident light. The negative collector potential required to stop the photo electron emission is called the stopping
potential.
These characteristics of photoelectron emission can not be explained on the basis
of classical theory of light but can be explained using the quantum theory of light. According to this theory, emission of electrons from the metal surface occurs when the
energy of the incident photon is used to liberate the electrons from their bound state.
The threshold frequency corresponds to the minimum energy required for the emission.
This minimum energy is called the work function of the metal. When the incident photon carries an energy in excess of the work function, the extra energy appears as the
kinetic energy of the emitted electron. When the intensity of light increases, the number of photoelectrons emitted also increases but their kinetic energy remains unaltered.
The reverse potential required to stop the photoelectron emission, i.e. the stopping potential, depends on the energy of the incident photon and is numerically equivalent to
the maximum kinetic energy of the photoelectrons.
When a photon of frequency is incident on a metal surface of work function ,
then,
!
1 2
h = + mv
(1.8)
2
max
where ( 12 mv2 )max is the maximum kinetic energy of the emitted photoelectrons. This is
known as Einsteins photoelectric equation. Since = hv , it can also be written as
!
1 2
= hv = h(v v )
(1.9)
mv
2
max
If V is the stopping potential corresponding to the incident photon frequency v, then,
!
1 2
= hv = eV
(1.10)
mv
2
max
Then, by experimental determination of V , it is possible to find out the work function of the metal.
The experimental observation of photoelectric effect leads to the conclusion that the
energy in light is not spread out over wavefronts but is concentrated in small packets
called photons. All photons of a particular frequency have the same energy. A change
in the intensity of the incident light will change the number of photoelectrons emitted
but not their energies. Higher the frequency of the incident light, higher will be the
kinetic energy of the photoelectrons. These observations confirm the particle properties
of light waves.
1.4
Compton effect
When x-rays are scattered by a solid medium, the scattered x-rays will normally have
the same frequency or energy. This is a case of elastic scattering or coherent scattering. However, Compton observed that in addition to the scattered x-rays of same frequency, there existed some scattered x-rays of a slightly higher wavelength (i.e., lower
frequency or lower energy). This phenomenon in which the wavelength of x-rays show
an increase after scattering is called Compton effect.
Compton explained the effect on the basis of the quantum theory of radiation. Considering radiation to be made up of photons, he applied the laws of conservation of
energy and momentum for the interaction of photon with electron. Consider an x-ray
photon of energy h incident on an electron at rest (fig. 1.6.) After the interaction, the
x-ray photon gets scattered at an angle with its energy changed to a value h and
the electron which was initially at rest recoils at an angle . It can be shown that the
increase in wavelength is given by
=
h
(1 cos )
m c
(1.11)
h
= 0.0242A .
m c
2h
m c
10
Experimental observation indicate that the change in the wavelength of the scattered x-rays is indeed in agreement with equation (1.11), thus providing further confirmation to the photon model.
Thus, Plancks theory of radiation, photoelectric effect and Compton effect are
experimental evidences in favour of the quantum theory of radiation.
1.5
Quantum theory and the theory of relativity are the two important concepts that led to
the development of modern physics. The quantum theory was first proposed by Planck
to explain and overcome the inadequacies of classical theories of black body radiation.
The consequences were very spectacular. Louis de Broglie made the suggestion that
particles of matter, like electrons, might possess wave properties and hence exhibit
dual nature. His hypothesis was based on the following arguments:
The Plancks theory of radiation suggests that energy is quantized and is given by
E = hv
(1.12)
where is the frequency associated with the radiation. Einsteins mass-energy relation
states that
E = mc2
(1.13)
Combining the two equations, it can be written as
E = hv = mc2
Hence, the momentum associated with the photon is given by
p = mc = hv/c = h/
Extending this to particles, he suggested that any particle having a momentum p is
associated with a wave of wavelength given by
= h/p
(1.14)
This is called de Broglies hypothesis of matter waves and is called the de Broglie
wavelength.
The de Broglie wavelength can be calculated for any particle using the above relation. In case of charged particles like electrons, a beam of high energy particles can
be obtained by accelerating them in an electric field. For example, an electron starting
11
from rest when accelerated with a potential difference V, the kinetic energy acquired
by the electron is given by
(1/2)mv2 = eV
where v is the velocity of the electron. The momentum may be calculated as
p = mv = (2meV)1/2
Using the de Broglie equation, the wavelength associated with the accelerated electron can be calculated as
= h/p = h/(2meV)1/2
(1.15)
This equation suggests that, at a given speed, the de Broglie wavelength associated
with the particle varies inversely as the mass of the particle. This concept of matter waves aroused great interest and several physicists launched experiments designed
to test the hypothesis. Heisenberg and Schrodinger proceeded on to develop mathematical theories whereas Davisson and Germer, G.P. Thomson and Kikuchi attempted
experimental verification.
1.5.1 Davisson-Germer experiment
The hypothesis of de Broglie was verified by the electron diffraction experiment conducted by Davisson and Germer in the United States. The experimental set up used by
them is shown in the figure 1.7.
12
in the form of a cylinder with a small aperture maintained at a finite positive potential
with respect to the filament. They pass through the narrow aperture forming a fine
beam of accelerated electrons. This electron beam was made to incident on a single
crystalline sample of nickel. The electrons scattered at different angles were counted
using an ionization counter as a detector. The experiment was repeated by recording the scattered electron intensities at various positions of the detector for different
accelerating potentials (Fig.1.8).
50
44ev
48ev
54ev
64ev
68ev
13
Electron Gun
Aluminium Foil
Screen
Figure 1.9 Experimental arrangement for G.P.Thomson experiment.
He allowed a beam of accelerated electrons to fall on the aluminium foil and observed a diffraction pattern consisting of a series of concentric rings around the direction of the incident beam. This pattern was similar to the Debye-Scherrer pattern
obtained for aluminium using x-ray diffraction. Using the data available on aluminium,
he calculated the wavelength of the electrons using the Braggs equation,
n = 2d sin
He also calculated the de Broglie wavelength of the electrons with the knowledge of
accelerating potential using the relation,
= h/(2meV)1/2
The value of wavelength calculated from the two equations matched well thereby experimentally proving the de Broglies relation.
14
15
3. Wavelength of matter waves also depends on the mass of the body and decreases
with increase in mass. Due to this reason, the wavelike behaviour of heavier
bodies is not very evident whereas wave nature of subatomic bodies could be
observed experimentally.
4. A wave is normally associated with some quantity that varies periodically with
the frequency of the wave. For example, in a water wave, it is the height of the
water surface; in a sound wave it is the pressure and in an electromagnetic wave,
it is the electric and magnetic fields that vary periodically. But in matter waves,
there is no physical quantity that varies periodically. We use a wave function to
define matter waves and this wave function is related to the probability of finding
the particle at any place at any instant, which varies periodically.
5. Matter waves are represented by a wave packet made up of a group of waves of
slightly differing wavelengths. Hence, we talk of group velocity of matter waves
rather than the phase velocity. The group velocity can be shown to be equal to
the particle velocity.
6. Matter waves show properties similar to other waves. For example, a beam of
accelerated electrons produces interference and diffraction effects similar to an
electromagnetic wave of same wavelength.
1.6
(1.16)
(t kx) = constant
x = constant + t/k
Phase velocity v p = dx/dt = /k
= 2/(2/) =
(1.17)
16
The de Broglie waves are represented by a wave packet and hence we have group
velocity associated with them. Group velocity is the velocity with which the wave
packet travels. In order to understand the concept of group velocity, consider the combination of two waves represented by the formula
y1 = A cos(t kx)
y2 = A cos{( + )t (k + k)x}
The resultant displacement is given by
y = y1 + y2
{( + + )t (k + k + k)x}
(t kx)
cos
2
2
!
t kx
(1.18)
The velocity of the resultant wave is given by the speed with which a reference
point, say the maximum amplitude point, moves. Taking the amplitude of the resultant
wave as constant, we have
!
t kx
2A cos
= constant
2
2
!
t kx
or
= constant
2
2
or
x = constant + (t/k)
Group velocity vg = dx/dt = (/k)
(1.19)
Instead of two discrete values for and k, if the group of waves has a continuous
spread from to ( + ) and k to (k + k), then, the group velocity is given by
vg =
d
dk
(1.20)
It can be shown that the group velocity of the wave packet is equal to the velocity of
the particle with which the wave packet is associated.
1.6.1 Relation between phase velocity and group velocity
We have the mathematical relation for phase velocity given by
v p = /k
or
= k.v p
17
vg =
= v p + (2/)
dv p
d(2/)
= v p + (2/) (2 /2).
= vp
dv p
d
dv p
d
(1.21)
In the above expression, if (dv p /d) = 0, i.e., if the phase velocity does not depend
on wavelength, then the group velocity and phase velocity are equal. Such a medium
is called a non-dispersive medium. In a dispersive medium, (dv p /d) is positive and
hence the group velocity is less than the phase velocity.
1.6.2 Relation between group velocity and particle velocity
= 2 = 2mc2 /h =
and
(1.22)
(1.23)
(1.24)
(1.25)
18
Thus, the group velocity associated with de Broglie waves is just equal to the velocity
with which the particle is moving. If we try to calculate the phase velocity,
v p = /k = c2 /v = c2 /vg
(1.26)
Since the group velocity or the particle velocity is always less than c, the phase velocity
of de Broglie waves turn out to be greater than c. This only indicates that we cannot
talk of phase velocity of de Broglie waves since they are made up of a group of waves.
Phase velocity has no physical significance for de Broglie waves.
1.6.3 Derivation of de Broglie relation
The de Broglie relation may be derived as follows. If we assume a particle having a
kinetic energy equal to mv2 /2 to have a de Broglie wavelength , we can write
h = mv2 /2 (assuming the energy of the particle to be purely kinetic)
=
or
m 2
v
2h
(1.27)
(1.28)
But we have
d
2d
d
=
= 2
dk
2d(1/)
d
v
d
= 2
vg = v =
(1.29)
= 2
h d
d
Rewriting this, we have
dv
h
= 2
d
m
(1.30)
19
h
+c
m
By applying the boundary condition that the wavelength tends to infinity as the
velocity tends to zero, we find that the constant of integration has to be zero. Hence,
we get
h
=
(1.31)
mv
which is the de Broglie relation.
1.7
20
x k
1
2
(1.33)
But we have
k = 2/
Also
= h/p
Hence,
k = 2p/h
2
k =
p
h
Substituting in equation (1.33), we get
h
4
~
x p
2
(1.34)
(1.35)
(1.36)
x p
or
(1.37)
This equation states that the product of uncertainty x in the position of an object
at some instant and the uncertainty in the momentum in the x-direction at the same
instant is equal to or greater than ~/2.
Another form of uncertainty principle relates energy and time. In the atomic process, if energy E is emitted as an electromagnetic wave during an interval of time t,
then, the uncertainty E in the measured value of E depends on the duration of the
time interval t according to the equation,
E t ~/2
(1.38)
21
It may be mentioned that these uncertainties are not due to the limitations of the
precision of the measuring methods or measuring instruments but due to the nature of
the quantities involved.
1.7.2 An illustration of uncertainty principle
We have the following Thought experiment to illustrate the uncertainty principle.
Imagine an electron being observed using a microscope (fig.1.11).
Microscope
LightSource
Electron
Figure 1.11 Schematic diagram of experimental set up to study uncertainty principle.
The process of observation involves a photon of wavelength incident on the electron and getting scattered into the microscope. The event may be considered as a twobody problem in which a photon interacts with an electron. The change in the velocity
of the photon during the interaction may be anything between zero( for grazing angle
of incidence) and 2c (for head-on collision and reflection). The average change in the
momentum of the photon may be written as equal to (h/c) or (h/). This difference in
momentum is carried by the recoiling electron which was initially at rest. The change
or uncertainty in the momentum of the electron may thus be written as (h/). At the
same time, the position of the electron can be determined to an accuracy limited by the
resolving power of the microscope, which is of the order of . Hence, the product of
the uncertainties in position and momentum is of the order of h. This argument implies
that the uncertainties are associated with the measuring process. This illustration only
estimates the accuracy of measurement, the uncertainty being inherent in the nature of
the moving particles involved.
22
or
x = / sin
23
h sin
=h
sin
(b) Nuclear beta decay: In beta decay, electrons are emitted from the nucleus of
the radioactive element. Assuming the diameter of the nucleus to represent the
uncertainty in the position of electron inside the nucleus, the uncertainty in the
momentum can be calculated as follows:
Radius of the nucleus = r = 5 1015 m
x = 2r = 1014 m.
Assuming that the electron was at rest before its emission, the change in momentum can be taken as equal to its momentum. This magnitude of change in momentum indicates large velocity for the electron. Hence, the energy of the emitted
electron will be
E = pc = 1.055 1020 3 108 = 3.165 1012 J
= 19.8MeV.
24
1.8
Wave mechanics
Quantum theory is based on the quantization of energies. It deals with the particle
nature of radiation. It implies that addition or liberation of energy will be between
discrete energy levels. It assigns particle status to a packet of energy by calling it
quantum of energy or photon and treats the interaction of radiation with matter as
a two-body problem. On the other hand, de Broglies hypothesis and the concept of
matter waves led to the development of a different formulation called Wave mechanics. This deals with the wave properties of material particles. It was shown later that
the quantum mechanics and the wave mechanics are mathematically identical and lead
to the same conclusion.
25
26
by appropriate methods. Even though the wave function itself is not directly an observable quantity, the square of the absolute value of the wave function is intimately
related to the moving body and is known as the probability density. This probability
density is the quantum mechanical method of finding the body at a particular position
at a particular time. The wave function carries information about the particles wavelike behaviour. It also provides information about the momentum and energy of the
particle at any instant of time.
1.8.3 Schrodingers wave equation
The motion of a free particle can be described by the wave equation.
= Aexp{i(t kx)}
(1.40)
2
x2
(1.42)
~
E =
i t
(1.43)
p2 = ~2
= or
t
~
p2
+U
2m
(1.44)
where U is the potential energy of the particle. Multiplying both sides of the equation
by
p2
+U
(1.45)
E=
2m
Substituting for E and p2 from equation (1.42) and (1.43)
~
~2 2
=
+ U
i t
2m x2
(1.46)
27
i
(1.47)
E exp
or
or
2
(iEt) 2m
(iEt)
exp
+
(E
U)exp
=0
x2
~
~2
~
or
2 2m
+ 2 (E U) = 0
x2
~
(1.48)
This is the Schrodingers wave equation in one dimension. In three dimensions, the
above equation may be written as
or
2 2 2 2m(E U)
+ 2 + 2 +
=0
x2
y
z
~2
2m(E U)
=0
2 +
~2
This equation is known as the steady state or time independent Schrodinger wave
equation in three dimensions.
1.8.4 Eigen values and eigen functions
These terms come from the German words and mean proper or characteristic values or
functions respectively. The values of energy for which the Schrodingers equation can
be solved are called Eigen values and the corresponding wave functions are called
Eigen functions. The eigen functions possess all the characteristics properties of
wave functions in general (see section 1.8.1).
28
1.9
29
considered to be zero inside the box and infinite outside. Since the total energy of the
particle cannot be infinite, it is restricted to move within the box. The example is an
oversimplified case of an electron acted upon by the electrostatic potential of the ion
cores in a crystal lattice.
X=O
X=L
Figure 1.13 Schematic for a particle in a box. The height of the wall extends to infinity.
Since the particle cannot exist outside the box,
= 0 for x 0 and x L
(1.50)
We have to evaluate the wave function inside the box. The Schrodingers equation
(1.48) becomes
2 2m
+ 2 E = 0 for 0 x L
x2
~
!1/2
!1/2
2mE
2mE
x + B cos
x
= A sin
~2
~2
where A and B are constants.
Applying the boundary condition that = 0 at x = 0, equation (1.52) becomes
A sin 0 + B cos 0 = 0
or
B = 0.
!1/2
L=0
(1.51)
(1.52)
30
2mE
~2
!1/2
n = 1, 2, 3, . . .
L = n where
(1.53)
where
n = 1, 2, 3, . . .
(1.54)
From this equation, we infer that the energy of the particle is discrete as n can have
integer values. In other words, the energy is quantized. We also note that n cannot be
zero because in that case, the wave function as well as the probability of finding the
particle becomes zero for all values of x. Hence, n = 0 is forbidden. The lowest energy
the particle can possess is corresponding to n = 1 and is equal to
E1 =
2 ~2
2mL2
This is called ground state energy or zero point energy. The higher excited states
will have energies like 4E 1 , 9E 1 , 16E 1, etc. This indicates that the energy levels are
not equally spaced.
The wave functions or the eigen functions are given by
or
2mE n
n = A sin
~2
n
n = A sin x
L
!1/2
(1.55)
(1.56)
Since the wave function is non-vanishing only for 0 x L, it can be shown that
Z
L
2 nx
(1.57)
dx =
sin
L
2
31
L
2
=1
or
2
A=
L
!1/2
(1.58)
(1.59)
n=3
n=2
n=1
x=0
x=L
Figure 1.14 Variation of wave function associated with an electron confined to a box
in its ground state and excited states.
Figure 1.14 shows the variation of the wave function inside the box for different
values of n and Fig.1.15 shows the probability densities of finding the particle at different places inside the box for different values of n. Thus, wave mechanics suggests
that the probability of finding any particle at the lowest energy level is maximum at
the centre of the box which is in agreement with the classical picture. However, the
probability of finding the particle in higher energy states is predicted differently by the
two formulations.
32
n=3
||2
n=2
n=1
x=0
x=L
33
E
U
x=0
Figure 1.16 Schematic for a particle in a potential well of finite depth (E greater than U).
solution results. Let U be greater than the total energy E of the electron but finite. To
analyze this case, we have to consider the three regions separately.
In region II, since U = 0, the electron is free and the Schrodingers equation is
d 2 2m
+ 2 E = 0
dx2
~
(1.60)
d 2 2m
+ 2 (E U) = 0
dx2
~
(1.61)
(1.62)
(1.63)
ix
and III = Fe
where
ix
+ Ge
in region III
2 1/2
(1.64)
= [(2mE)/~ ]
(1.65)
(1.66)
(1.67)
34
(1.68)
(1.69)
To evaluate the constants, we consider the boundary condition in the region I where
the wave function should reduce to zero as x . Then equation (1.68) becomes
0 = A + B 0 or A = 0.
I = Bex
(1.70)
Similarly, in region III, since the wave function should reduce to zero as x ,
equation (1.69) becomes
0 = F 0 + G or G = 0.
III = Fex
(1.71)
This indicates that the wave function decreases exponentially as we move away from
the potential well on either sides. Inside the potential well the wave function represented by the equation (1.63) varies sinusoidally. Further, since the wave function and
its derivative are continuous at the boundaries corresponding to x = 0 and x = L, the
wave functions are non-zero at these boundaries. The plots of the wave functions and
the probability densities are shown in Fig. 1.17 and 1.18 respectively.
n=3
n=2
n=1
x=0
x=L
35
n=3
||2
n=2
n=1
x=0
x=L
in region I
(1.72)
36
II
x=0
III
x=L
in region II
(1.73)
in region III
(1.74)
where
I = Aeix + Beix
II = Cex + Dex
in region I
in region II
(1.75)
(1.76)
in region III
(1.77)
= [(2mE)/~2 ]1/2
(1.65)
2 1/2
and
= [2m(E U)/~ ]
= i
(1.66)
(1.67)
The wavefunction in the region I is made up of two terms as evident from equation
(1.75). The first term with a positive exponent represents an incoming or incident wave
moving in the positive x-direction and the second term represents a wave reflected by
the barrier moving in the negative x-direction. Similarly, the first term in equation
(1.77) represents the transmitted wave moving in region III in the positive x-direction.
The wavefunction in the region II is given by equation (1.76). Here, the exponents are
real quantities and hence the wavefunction does not oscillate. The probability density
|II |2 is finite and represent the probability of finding the particle within the barrier.
Such a particle may emerge into region III. This is called tunneling.
37
The transmission probability T for a particle to pass through the barrier is given by
T=
|III |2 FF
=
e2L
|I |2
AA
(1.78)
The above equation represents the dependence of tunneling probability on the width of
the barrier and the energy of the particle.
1.9.5 Examples of tunneling across a finite barrier
There are a few examples of tunneling across a thin finite potential barrier in nature.
These observations are in fact proof in favour of the theory of quantum mechanical
tunneling. Let us consider a few of them.
(a) Alpha decay: Alpha particles are made up of two protons and two neutrons. In
radioactive decay, the alpha particle must free itself from the attractive nuclear
force and penetrate through a barrier of repulsive coulombic potential to be emitted
out of the nucleus(Fig.1.20). A calculation of the energy of the particle inside the
nucleus and the measurement of the energy of the emitted alpha particle indicate
that it is not possible that the particle has surmounted the barrier of coulombic
potential but must have penetrated through it.
E
Energy of particle
Nuclear surface
38
H
H
Figure 1.21 Schematic diagram of ammonia molecule. Nitrogen atom oscillates between two symmetric positions across the repulsive plane of hydrogen atoms.
(b) Ammonia inversion: In a molecule of ammonia, the three hydrogen atoms form
a plane with the nitrogen atom placed symmetrically at a finite distance from the
plane. It has been observed experimentally that the nitrogen atom oscillates between two positions on either sides of the plane (Fig. 1.21).
Classical calculations show that the nitrogen atom cannot perform such oscillation
since the hydrogen atoms form a barrier against nitrogen atom to prevent it from
moving through the plane formed by the hydrogen atoms. However, nitrogen atom
oscillates across the plane with a frequency higher than 1010 per second. This can
be explained only on the basis of tunneling process.
(c) Zener and tunnel diodes: These are diodes made out of heavily doped semiconductors with special characteristics. The currentvoltage characteristics of these
diodes can be explained only on the basis of quantum mechanical tunneling process. The high speed of operation of these devices can be explained only as due to
tunneling since the movement of charge carriers is otherwise by diffusion which is
a very slow process. The scanning tunneling microscope is another device operating on the principle of tunneling.
(d) Frustrated total internal reflection: Figure 1.22 shows a beam of light reflected
totally from the surface of glass. If a second prism of glass is brought close to the
first, the beam appears through the second glass prism indicating tunneling of light
through the surfaces of glass which were otherwise acting as barriers.
39
(1.79)
According to this, the conservation of energy does not apply for a time duration t if
the change in energy is not greater than E. If we presume that the particle borrows an
energy E and returns the borrowed energy within a time interval of t, the observer
will still believe that the energy is conserved. The time interval within which the extra
energy must be returned is given by
t = ~/E = ~/(U E + K)
(1.80)
(1.81)
If the particle travels a distance x into the forbidden region and returns, then, the total
distance travelled is 2x and hence we can write
x = v t/2
!1/2
1 2K
~
=
2 m
(U E + K)
(1.82)
As the kinetic energy K tends to zero, the value of x also tends to zero since the velocity tends to zero. Also, as K tends to infinity, x tends to zero since it is the distance
40
travelled in a time interval t tending to zero. In between these limits, there must be
a maximum value of x corresponding to a particular value of K. Differentiating x
with respect to K in equation 1.82, we can find the maximum value of x as
"
#1/2
~2
1
(1.83)
xmax =
2 2m(U E)
Or xmax = (1/2)
(1.84)
From equation 1.78, the probability of finding the particle at a distance xmax from the
step is
T = e2xmax = e1
(1.85)
Hence, we may define the maximum penetration distance as the distance at which the
transmission probability is (1/e).
It may be mentioned that the particle is never observed in the forbidden region.
The particles incident on the potential energy step will be reflected back. Some are
reflected at the step itself where as others penetrate a finite distance before returning.
If the barrier width is small, the particle will re-emerge on the other side of the barrier.
This phenomenon is known as quantum mechanical tunneling.
1.9.7 Harmonic oscillator
When a body vibrates about an equilibrium position, the body is said to be executing
harmonic motion. We have many examples of such motion which we come across, like
the vibration in a spring that is stretched and released, vibrations of atoms in a crystal
lattice, etc. Whenever a system is disturbed from its equilibrium position, it can come
back to its original position only under the influence of a restoring force. Hence, the
presence of a restoring force is a necessary condition for harmonic motion. The system
oscillates indefinitely if there is no loss of energy.
A special case of harmonic motion is simple harmonic motion. In simple harmonic
motion, the restoring force F acting on a particle of mass m is linear. In other words, the
restoring force is proportional to the displacement x of the particle from its equilibrium
position and is in the opposite direction.
i.e.,
F = kx
(1.86)
where k is called the force constant. The relation is called Hookes law. From the
second law of motion, we have,
F = ma
(1.87)
41
kx = m
d2 x
dt2
d2 x
kx
or
+
=0
(1.88)
2
dt
m
This is the equation for the simple harmonic oscillator. The solution of this equation
may be written as
where
x = A cos(t + )
(1.89)
= 2 = (k/m)1/2
(1.90)
is called the frequency of the oscillator. is the phase angle and depends on the value
of x at t = 0. The potential energy U corresponding to the restoring force F may be
calculated as equal to the work done in bringing a particle from x = 0 to x = x against
the force.
i.e., U(x) = F(x)dx = k dx = kx2 /2
(1.91)
n=2
n=1
n=0
(1.92)
42
(1.93)
as y
||2 dy = 1
i.e.,
or
a = (2n + 1)
a = 2E/h = (2n + 1)
E = (n + 1/2)h
where n = 0, 1, 2, 3, . . .
where n = 0, 1, 2, 3,
(1.94)
43
Numerical Examples
1.1 Calculate the velocity of photoelectrons emitted from a metal surface of work
function 1.5eV when the metal surface is irradiated with a light beam of wavelength 4 107 m.
Solution:
hc 6.62 1034 3 108
=
4 107
= 4.97 1019 J
Incident energy = h =
= 2.57 1019 J.
44
1.3 Calculate the maximum change in wavelength that can take place during Compton scattering of a photon.
h
Solution: Change in wavelength = =
(1 cos)
mo c
This will be maximum when cos = 1, i.e., when = 180 .
2 6.62 1034
2h
=
mo c 9.1 1031 3 108
= 4.85 1012 m(Ans.)
()max =
()max
1.4 The material of the emitter of a photocell has a work function of 2eV. Calculate
the threshold frequency.
Solution:
Work function = h0 = 2 1.6 1019 J
1.5 The threshold frequency for the material of the emitter of a Photocell is 4
1014 Hz. What is the stopping potential required to supress photo electrons emission when light of frequency 6 1014 Hz is incident on the emitter?
Solution:
Stopping potential V0 = h( 0 )/e
= 6.62 1034 6 1014 4 1014 /1.6 1019
= 0.829V (Ans).
1.6 In a photocell, a stopping potential of 2.5V is required to stop the photo electron
emission completely. Calculate the kinetic energy of the emitted photo electrons.
Solution:
Kinetic energy = potential energy = e.V
= 1.6 1019 2.5J = 2.5eV or 4 1019 J (Ans).
45
1.8 X-rays of wavelength 1.54A are Compton scattered at an angle of 60 . Calculate the change in the wavelength.
Solution:
h
Change in wavelength = =
(1 cos )
m c
h
=
(1 cos 60 ) = 1.2 1012 m (Ans.).
m c
1.9 In a Compton scattering experiment, incident photons of energy 10 KeV are scattered at 45 to the incident beam. Calculate the energy of the scattered photon.
Solution:
h
(1 cos )
Change in wavelength = =
m c
= 7.1 1013 m.
= 1.243 1010 m.
= 1.25 1010 m.
= 1.59 1015 J
= 9.93keV (Ans.).
1.10 Gamma Rays of energy 0.5 MeV are scattered by electrons. What is the energy
of scattered gamma rays at a scattering angle of 30 ? What is the kinetic energy
of scattered gamma ray?
Solution:
Wavelength of incident gamma rays = = hc/E
= 6.62 1034 3 108 /(1.6 1019 0.5 106 )
46
= 2.486 1012 m.
h
(1 cos)
Change in wavelength = =
m c
= 3.24 1013 m.
Wavelength of scattered photon = = +
2.81 1012 m.
Kinetic energy of the scattered gamma ray = hc/
= 0.442MeV (Ans.).
1.11 X-rays of wavelength 1.5A are Compton scattered. At what angle will be scattered x-rays have a wavelength of 1.506A ?
Solution:
h
Change in wavelength = =
(1 cos )
m c
cos = (1 m c./h) = (1 0.247) = 0.753
Angle of scattering, = 41.20 (Ans.).
1.12 Calculate the de Broglie wavelength associated with an electron travelling with a
velocity of 105 ms1 . Assume the mass of the electron to be 9.1 1031 kg. and
h = 6.62 1034 Js.
Solution:
h
6.62 1034
=
P 9.1 1031 105
= 7.27 109 m. (Ans.).
De Broglie wavelength =
1.13 In an electron diffraction apparatus, the electron beam is accelerated to a potential of 25 kV. Calculate the de Broglie wavelength associated with the electrons.
Given:m = 9.1 1031 kg, h = 6.62 1034 Js. e = 1.6 1019C.
Solution:
h
h
=
De Broglie wavelength =
1/2
(2mE)
(2meE)1/2
6.62 1034
=
(2 9.1 1031 1.6 1019 25 103 )
= 7.6 1012 m. (Ans.).
47
1.14 Calculate the phase velocity and group velocity associated with an electron assuming (i) non relativistic case and (ii) relativistic case.
Solution:
Non relativistic case.
(E/~) E (p2 /2m)
p
v
Phase velocity v p =
=
= =
=
=
K
p/~
p
p
2m 2
Phase velocity is half the particle velocity.
Group velocity vg =
d dE d(p2 /2m)
p
=
=
= =v
dk
dp
dp
m
Phase velocity v p =
Phase velocity is greater than c. This indicates that we cannot talk of phase
velocity of a particle since it is represented by a group of waves or a wave
packet.
Group velocity vg =
i1/2
dE
d h 2 2
p c + mo c4
=
dp dp
h
i1/2 pc2
= pc2 p2 c2 + m2o c4
=
E
=
mvc2
= v assuming E = mc2
2
mc
48
Pn =
Zx2
dx = (2/L)
x1
Zx2
sin2 (nx/L)dx
x1
x2
x1
x1
n2 h2
8mL2
Energy required to take the electron from ground state (n = 1) to the first
excited state (n = 2) is given by
49
h2
3h2
2
2
E = E2 E1 =
(2 1 ) =
8mL2
8mL2
34
3 (6.62 10 )
=
8 9.1 1031 (1010 )2
= 1.81 1017 J = 112.87eV. (Ans.)
The probability of finding the electron in the first excited state between x =
0.4A and x = 0.6A is given by
Zx2
x1
dx = (2/L)
Zx2
sin2 (2x/L)dx
x1
x2
x1
1.18 In an experimental study of nuclear decay, the emitted energy spectrum shows
a peak with a spread of energy equal to 120eV. Compute the life time of the
decaying nuclei.
~
Solution: Life time of the decaying nuclei = t
E
50
6.62 1034
120 1.6 1019 2 3.14
= 5.5 1018 s. (Ans.)
t =
= h/(2meV)1/2
1/2
= 6.62 1034 / 2 9.1 1031 1.6 1019 106
51
= 1.45 1010 m
(Ans).
1.22 A cricket ball of mass 250 gms moves with a velocity of 100ms1 . If its velocity
is measured with an accuracy of 1%, what is the accuracy of a simultaneous
measurement of its position?
Solution:
x p x > ~/2 or
px = 1% of 250 10
x px ~
(Ans).
1.23 In a gamma decay process, the life time of decaying nuclei is found to be 2 ns.
Compute the uncertainty in the energy of gamma rays emitted.
Solution: E t ~/2 or E t ~
E = ~/t = 6.62 1034 /2 3.14 2 109
= 5.27 1026 J (Ans).
Exercise
1.1 Explain photoelectric effect. Give the assumptions needed, and the laws of photoelectric emission. Calculate the velocity of the photoelectrons emitted from a
metal surface whose work function is 1.5 eV and the wavelength of the incident
light being 4 107 m.
(March 1999)
1.2 Describe the ultraviolet catastrophe. Explain how the Plancks law of radiation
overcomes it.
(March 1999)
1.3 Explain Wiens law and Rayleigh-Jeans law. Mention their drawbacks.
(August 1999)
1.4 Explain Einsteins theory of photo-electric effect.
(August 1999)
52
1.5 What is Plancks radiation law? Show how Wiens law and Rayleigh-Jeans law
can be derived from it.
(August 1999)
1.6 The work function of tungsten is 5.4 eV. When light of wavelength 170 nm is
incident on the surface, electrons with energy 1.7 eV are ejected. Estimate the
Plancks constant.
(August 1999)
1.7 What is the nature of black body radiation? Explain its significance.
(March 2000)
1.8 In Compton scattering, calculate the maximum kinetic energy of the scattered
electron for a given photon energy. A metal surface has a photoelectric cut off
wavelength of 325.6 nm. What is the stopping potential for an incident light of
wavelength of 259.8 nm?
(March 2000)
1.9 State Compton effect and explain the experimental observations of Compton
shift.
(August 2000)
1.10 The stopping potential of 5.2 V was observed for light of frequency 3 1015 Hz.
What would be the frequency of light when the stopping potential is doubled?
(August 2000)
1.11 State and explain Plancks law of radiation. Show that it reduces to Wiens law
and Rayleigh-Jeans law under certain conditions.
(August 2000)
1.12 The photoelectric threshold for a certain metal is 5000 AU. Determine the maximum energy of the photoelectrons emitted when a radiation of wavelength 3000
AU is incident on its surface.
(March 2001)
1.13 Explain Plancks radiation law. Discuss Einsteins theory of photoelectric effect.
(March 2001)
1.14 Explain Wiens Displacement law. Mention its drawbacks.
(August 2001)
(August 2001)
1.16 What is ultraviolet catastrophe? Show how it can be overcome with the help of
Plancks radiation law.
(August 2001)
1.17 What is photoelectric effect? Mention the features of photoelectric emission.
(March 2002)
53
1.18 What is Compton scattering? With necessary equations, explain Compton scattering effect.
(March 2002)
1.19 Calculate the change in wavelength in Compton scattering at an angle of 60 to
the incident direction.
(March 2002)
1.20 Calculate the wavelength associated with an electron carrying an energy 2000 eV.
(March 1999)
1.21 Set up one dimensional Schrodingers wave equation for a free particle.
(March 1999)
1.22 Explain group velocity and phase velocity. Derive a relation between the two.
(August 1999)
1.23 Explain Heisenbergs uncertainty principle. Give its physical significance.
(August 1999)
1.24 What is de Broglies concept of matter waves? An electron has a wavelength of
1.66 1010 m. Find the kinetic energy, phase velocity and group velocity of the
de Broglie wave.
(August 1999)
1.25 Explain Heisenbergs uncertainty principle. Give its physical significance. The
position of an electron can be measured with an accuracy of 1.5 1010 m. Find
the uncertainty in its position after 1 sec.
(August 2001)
1.26 What are Eigen functions and Eigen values? Find them for a particle in one
dimensional potential well of infinite height.
(August 2001)
1.27 Obtain the time independent Schrodingers wave equation for a particle in one
dimensional potential well of infinite height and discuss the solution.
(August 2000)
1.28 Write a note on group velocity and phase velocity.
(March 2001)
1.29 Explain group velocity and phase velocity. Derive relation between the two.
(August 2001)
1.30 What is a wave function? Give its physical significance. What is normalization
of a wave function?
(August 2001)
54
1.31 What is de Broglie concept of matter waves? Derive an expression for de Broglie
wavelength.
(August 2001)
1.32 Set up time independent Schrodingers wave equation and explain Eigen functions and Eigen values.
(August 1999)
1.33 What is Heisenbergs uncertainty principle? Discuss its consequences.
(March 2000)
1.34 Explain the behaviour of a particle in a one-dimensional infinite potential well
in terms of de Broglie waves.
(March 2000)
1.35 What is the physical interpretation of wave function, nature of Eigen values and
Eigen functions.
(March 2000)
1.36 Discuss phase velocity and group velocity.
(August 2000)
1.37 Explain only the conclusions drawn in photoelectric effect and DavissonGermer
experiment leading to deBroglie hypothesis.
(Feb 2005)
1.38 Show that a free electron cannot exist in a nucleus of an atom.
(Feb 2005)
1.39 What are matter waves?Show that the electron accelerated by a potential differ(July 2005)
ence V volt is = 1.226/ Vnm for nonrelativistic case.
1.40 Explain phase velocity and group velocity. Derive the expression for deBroglie
wavelength using the concept of group velocity.
(July 2005)
1.41 Explain only the conclusions drawn in photoelectric effect and DavissonGermer
experiment leading to deBroglie hypothesis.
(Feb 2005)
1.42 Show that a free electron cannot exist in a nucleus of an atom.
(Feb 2005)
Chapter 2
Crystal Structure
The behaviour of solids is much different from that of liquids and the properties of
materials differ widely in their liquid and solid states. One of the reasons for such
differences is that in a solid, depending on the nature of bonding, there will be a tendency for the atoms or molecules to arrange themselves in particular order. The extent
and range of such an ordering may of course depend on the conditions under which
the solid state is formed. The valency, the electronic configuration, the coordination
number, etc. are factors that prompt the atoms or the molecules to arrange themselves
in a way to result in a solid. If such an ordering extends only to a few atomic spacing (and some ordering is necessary due to the conditions of bonding), the material is
referred to be amorphous or non-crystalline. The crystalline solids may be classified
as polycrystalline and single crystalline depending on the range of ordering. In case
of polycrystalline materials, the arrangement of atoms or molecules along the different
directions will be disrupted by some defects in the arrangement. Thus, the ordering
will extend only upto certain distance. The material may be considered to be made
up of a large number of small single crystals, oriented randomly in three dimensions.
Each single crystalline part is called a grain and the size of the grain is used as a measure of crystallinity. On the other hand, the ordering in a single crystal will continue
through out the sample.
Many important properties of materials are found to depend on the structure of
crystals. A study of the crystal structure has led to the understanding of the behaviour
of materials and hence a correlation between physical properties and the crystal structure has been possible. The study of the crystal structure and their relation with the
properties has led to the development of new materials for electronic and structural
engineering applications.
2.1.1 Unit cell and space lattice
A crystal is a solid in which the atoms or molecules are arranged in an orderly manner.
The periodicity of ordering may be different in different directions. It is convenient
to imagine arrangement of points in space to represent atoms or molecules. These
55
56
imaginary points are called lattice points and an arrangement of these imaginary
points in three dimensions constitutes a space lattice. Each point in the space lattice
represents the location of an atom or a molecule or a collection of atoms which repeat
through out the space lattice in the same shape and orientation. Such a collection of
atoms or molecules or group of atoms which is represented by a point in the space
lattice is called a basis. The space lattice in combination with the basis gives the
crystal lattice(Fig. 2.1).
Lattice
= crystal structure
basis
57
58
a=b=c
= = = 90
(a)
a = b 6= c
= = = 90
(b)
a 6= b 6= c
= = 90 6=
(d)
a = b 6= c
= = 90 = 120
(f)
a 6= b 6= c
= = = 90
(c)
a 6= b 6= c
6= 6= 6= 90
(e)
a=b=c
= = = 90 < 120
(g)
Figure 2.4 Unit cells for the seven crystal systems. The crystal systems are (a) cubic,(b) tetragonal,(c) orthorhombic,(d) monoclinic,(e) triclinic,(f) hexagonal and (g)
rhombohedral.
59
Cubic
Tetragonal
Orthorhombic
Monoclinic
Triclinic
Hexagonal
Rhombohedral
a=b=c
a=b,c
a,b,c
a,b,c
a,b,c
a=b,c
a=b=c
= = = 90
= = = 90
= = = 90
= = 90 ,
, , , 90
= = 90 , = 120
= = , 90 < 120
60
Bravais lattices
Cubic
Tetragonal
Orthorhombic
Monoclinic
Triclinic
Hexagonal
Rhombohedral
P, I, F
P, I
P, I, F, C
P, C
P
P
P
z
Figure 2.5 Representing directions in a unit cell
61
The direction of the vector r is determined by these integers. If these numbers have
common factors, they are removed and the direction of r is denoted as [n1 n2 n3 ]. This
includes all other lines parallel to this direction. Generally, square brackets are used
to indicate a direction. A similar set of three integers enclosed in a round bracket is
used to designate planes in a crystal. The crystal lattice may be regarded as made up of
an aggregate of a set of parallel, equidistant planes passing through the lattice points.
These planes are known as lattice planes and may be represented by a set of three
smallest possible integers. These numbers are called Miller Indices, named after
the crystallographer, W.H.Miller. It can be shown that planes and normal to the planes
will have the same set of Miller indices. Thus, Miller indices can be defined as a set
of three integers used to specify planes, directions and positions of atoms or molecules
with reference to an arbitrary origin.
The steps involved in determining the Miller indices of a set of parallel planes can
be explained using the Fig.2.6.
Y
5b
4b
B
3b
2b
b
3a
c
2c
3c
2a
A
4a 5a
x
OA
y
OB
x
OC
62
pa
qb
rc
1/p
1/q
1/r
(h
l)
The Miller indices of the plane ABC are (hkl). If d is the distance of the plane ABC
from O, then planes parallel to ABC and at perpendicular distances equal to 2d, 3d, 4d,
. . . from 0 will have the same Miller indices. Hence, (hkl) represents the Miller indices
for a set of planes separated by an interplanar distance d. Some important directions
and planes are shown in Fig.2.7 along with their Miller indices.
y
(100)
[100]
O
z
[111]
[110]
(110)
O
z
O
z
(111)
63
(iii) Mark the points A, B and C at distances a/h, b/k and c/l from a reference origin
O (a corner in the unit cell) along x, y and z directions respectively.
(iv) Join the points A, B and C. ABC is the required plane.
Planes with negative Miller indices may be plotted by shifting the origin accordingly. For example, if h is negative, the origin is shifted to (1, 0, 0) position. If k is
negative, the origin is shifted to (0, 1, 0) position and so on. If h and k are both negative, the origin is shifted to (1, 1, 0) position. Examples of some planes are represented
in Fig. 2.8.
Y
(222)
(212)
(423)
Y
O
O
Z
Z
(100)
Z
(101)
(111)
64
Y
C
N
B
3
2
1
O
d2 2
(h + k2 + l2 ) = 1
a2
a
or d = 2
2
(h + k + l2 )1/2
Here, d represents the interplanar spacing for the set of planes having Miller indices
(hkl).
65
(4r)2 = a2 + a2 or a = 2 2r
In a body centred cubic lattice, there is an additional atom at the centre of the unit
66
cell so that the atoms are in contact with each other along the body diagonal. Hence,
(a)
(b)
(c)
Figure 2.10 Atomic arrangements in unit cells showing atoms in contact (a)along
cube axis in simple cubic cell, (b) along face diagonal in face centred cubic cell and
(c) along body diagonal in body centred cubic cell.
The atomic packing factor is defined as the ratio of the volume of the atoms in the
unit cell to the volume of the unit cell. It represents the fractional volume of the unit
cell filled by atoms. Hence, we can write,
Packing factor =
We can calculate the packing factor for the cubic crystals as follows :
In simple cubic crystals
Number of atoms per unit cell = 1
Volume of each atom = (4/3)r3
Volume of unit cell = a3 = (2r)3
Packing factor =
(4/3)r3
= /6 = 52%
(2r)3
67
2(4/3)r3
Packing factor =
= ( 3/8) = 68%
{4/ 3r}3
4.(4/ 3)r3
= ( 2)/6 = 74%
Packing factor =
(2 2r)3
The various parameters associated with cubic unit cells are summarized in Table 2.3.
Table 2.3 Properties of cubic unit cells.
Number of atoms
Coordination
Atomic
Type of Cell
per unit cell
number
radius
Simple cubic
1
6
a/2
3.a/4
Body centred cubic
2
8
2.a/4
Face centred cubic
4
12
Packing
density
/6
3./8
2./6
A relation between lattice constant a and the density of the crystal may be derived
as follows. Consider the case of a cubic lattice.
Volume of the unit cell = a3 .
Mass of all the atoms(molecules)in the unit cell = a3
Number of atoms(molecules) per unit cell = z
Atomic(Molecular) weight of the material = M
Avogadro number = N
Mass of each atom(molecule) = (M/N)
Mass of atoms(molecules) in unit cell = z(M/N)
Hence, a3 = z(M/N) or a = (zM/N)1/3 .
68
Thus, the lattice parameter can be calculated for any cubic crystal with the help of the
available data.
2.1.7 Some crystal structures
Materials crystallize in a variety of structures. These structures are said to be simple if
the arrangement of atoms or molecules can be identified with the Bravais lattices. But
in some cases, the structure becomes complicated with one or more atoms or molecules
occupying specific positions within the unit cell, but retaining the periodicity of the
unit cell as a whole. Many of these structures can be considered to be a combination
of two or more unit cells having the same cell parameters but translated with respect
to each other in three dimensional space through finite space. We consider a few such
examples.
(a) Sodium chloride: Sodium chloride is an ionic solid. Sodium loses its outer electron to acquire a positive charge while chlorine accepts the electron to become a
negative ion. The crystal structure of sodium chloride has sodium ion and chlorine
ions arranged side - by - side in a three dimensional orthogonal array. The result
is a face centred cubic lattice (Fig. 2.11). In fact it can be
Cl
Na
69
The superposition is such that one sub-lattice is translated through a distance equal
to half the lattice parameter along the cube edge of the face centred cubic lattice. If
the sodium and chlorine ions are replaced by identical atoms, the structure would
then be simple cubic with a lattice parameter equal to half that of the NaCl structure. The lattice parameter, being the repeat distance along crystallographic axes,
is equal to the distance between two sodium ions or two chlorine ions. Consequently, the unit cell of sodium chloride has four sodium ions and four chlorine
ions in its unit cell. In other words, there are four molecules of NaCl in the crystal
structure of sodium chloride. Assuming one corner ion (say Na) as origin, the
positional coordinates for the other ions in the unit cell can be written as follows:
Na ions: 8 corner ions (each shared by 8 unit cells) with positional coordinates
(0 0 0) (1 0 0) (0 1 0) (0 0 1)
(1 1 0) (1 0 1) (0 1 1) (1 1 1)
6 face centre ions (each shared by 2 unit cells) with positional
coordinates
(1/2 1/2 0) (1/2 0 1/2) (0 1/2 1/2)
(1/2 1/2 1) (1/2 1 1/2) (1 1/2 1/2)
Cl ions: 12 edge centre ions (each shared by 4 unit Cells) with positional
coordinates
(1/2 0 0) (0 1/2 0) (0 0 1/2)
(1/2 0 1) (1/2 1 0) (1 1/2 0)
(0 1/2 1) (1 0 1/2) (0 1 1/2)
(1/2 1 1) (1 1/2 1) (1 1 1/2)
one body centre ion at (1/2 1/2 1/2).
Thus, the structure of NaCl contains four molecules per unit cell.
(b) Caesium chloride:
Ceasium chloride is also an ionic solid and crystallizes with a simple cubic structure (Fig. 2.12).
The lattice of CsCl is made up of two sub-lattices of caesium ions and chlorine
ions each being simple cubic in arrangement. The corner of one sub-lattice is the
body centre of the other. In other words, the two sublattices are related through a
translation along the body diagonal by a distance equal to half the body diagonal. If
the caesium and chlorine ions are replaced by identical atoms, the resultant lattice
70
Cs
Cl
71
Zn
S
72
2.2
X-Rays
X rays are electromagnetic radiations, like light, but with a much smaller wavelength.
They exhibit all the characteristic properties similar to light. By virtue of their small
wavelength (corresponding to higher frequency and photon energy) they have a higher
penetration power than light and hence are immensely useful in medical applications.
Since the wavelength is of the order of atomic spacings in solids, they provide important information about the arrangement of atoms in solids through diffraction patterns.
2.2.1 Origin of x-rays
X-rays are produced when electrons with high velocity interact with atoms. When an
electron beam is incident on a solid target material, most of the energy of the electrons
will be dissipated as heat and a very small amount of energy is emitted in the form of
x-rays. A conventional x-ray tube consists of an evacuated glass tube with a filament
as cathode and the target metal as anode (Fig. 2.15).
The filament is heated with a low voltage source to emit electrons. These electrons
are accelerated with a high voltage applied between the filament and the anode. When
the high energy electrons strike target, x-rays are emitted. The wavelength of x-rays
so emitted will depend on various factors including the initial and final energy of the
incident electron, nature of the target material and the nature of interaction. Those xrays possessing higher penetration power (i.e., x-rays of lower wavelengths) are called
hard x-rays and those with lesser penetration power are called soft x-rays.
73
X Rays
Water cooled anode
with target
Filament
H. T
74
v to v during such an interaction, the change in the energy of the electron will be
1/2m(v2 v2 ) where m is the mass of the electron. This energy will be emitted as
radiation of wavelength .
hc
1
m(v2 v2 ) = h =
2
(2.1)
The short wavelength limit corresponds to the case when the electron loses all its kinetic energy.
i.e. v = 0
1
hc
m v2 = hmax =
(2.2)
2
min
Since the kinetic energy of the electron is acquired due to the application of an electric
potential V,
1
(2.3)
mv2 = eV
2
From equation (2.2) and (2.3), we have
min =
hc
eV
(2.4)
This equation is known as Duane Hunt law and this equation was used for an experimental determination of Plancks constant. Substituting the values of the constants,
equation (2.4) may be rewritten as
min =
12400
V
(2.5)
where min will be obtained in A when V is in volts. Fig. 2.17 shows the variation
of the intensity of x-rays emitted as a function of wavelength at different accelerating
potentials.
The following features are characteristic of the continuous x-ray spectrum.
(i) The intensity of continuous spectrum increases at all wavelengths as the potential
is increased.
(ii) The short wavelength limit shifts towards lower wavelengths as the potential is
increased.
(iii) The wavelength corresponding to the maximum intensity also shifts towards
lower wavelengths as the potential is increased.
75
I
30 kV
20 kV
10 kV
Figure 2.17 Intensity of continuous x-rays emitted as a function of wavelength at different accelerating potentials (characteristic x-ray peaks are also shown)
2.2.3 Characteristic x-ray spectrum
It is also observed that some of the high energy electrons, during its interaction with
the target atoms, knock off the tightly bound electrons from the innermost shells (like
K-shell, L-shell, etc) of the atoms. The vacancies so produced are filled up by electrons from the outer orbits and the energy difference is given out in the form of xrays(Fig.2.18).
76
Since the transitions are between levels of well defined energies, the emitted x-rays
will form a line spectrum which is characteristic of the material of the target. Fig.
2.18 shows an electron from the K-shell of an atom of the target being knocked off
by the incident electron. An electron from any of the higher energy levels may fill
the vacancy so created in the K-shell, i.e., from L-shell, M-shell, N-shell, etc. The
emission lines due to these transitions are named K , K . . ., etc., and form the K-series
of x-rays. Similarly, if the transitions are from a higher shell to L-shell, the x-rays
belong to L-series (Fig.2.19).
M
L L
O
N
M
L
L
K
K
Figure 2.19 Energy level diagram showing the possible emission lines.
Thus, the line spectrum depends on the target material and hence called the characteristic x-ray spectrum. The important features of line spectrum are as follows:
(i) The characteristic x-ray spectrum consists of discrete spectral lines depending
on the transitions occurring during emission. K-series consists of spectral lines
emitted due to transitions from higher shells to K-shell.
(ii) The wavelength of different lines in the spectrum are characteristic of the target
material. This fact is used in the chemical analysis of the target material.
(iii) The wavelength of the K-lines shifts towards lower values as the atomic number
of the target element increases.
(iv) There is a minimum accelerating potential required to excite the emission of different spectral lines.
(v) The intensity of a spectral line depends on the probability of that particular transition taking place and hence on the accelerating potential and the incident electron
beam current.
77
Atomic No. z
Figure 2.20 Variation of the square root of frequency of emitted x-rays as a function of
the atomic number of the target material.
= a(z b)
(2.6)
where is the frequency of a particular spectral line (say K ) from a target of atomic
number z, a and b being constants. It can be shown that this is in accordance with the
Bohrs theory with modification for the screening effect of electrons. The law may also
be expressed in the form,
!
1
1
2 1
= R(z b)
(2.7)
n21 n22
where R is called Rydberg constant, b is the screening constant n1 and n2 are the principal quantum numbers corresponding to the final and initial energy levels corresponding
to the transition. Hence, for a K-line, n1 = 1 and b = 1
!
1
1
2
= R(z 1) 1 2 Where n2 = 2, 3, ..
(2.8)
K
n2
Similarly for L-line, n1 = 2 and b = 7.4
1 1
1
= R(z 7.4)2
L
4 n22
Where n2 = 3, 4, ..
(2.9)
78
One of the important application of Moseleys law was to resolve the discrepancies in
the arrangement of elements in the periodic table. The law suggests that the physical
and chemical characteristics of elements are more closely related to the atomic number
rather than the atomic weight. Accordingly, Moseleys law helped in assigning proper
places to the elements in the periodic table. For example, Argon (18 A40 ) should be
placed before Potassium (19 K 39 ) and Cobalt (27Co58.9 ) should be placed before Nickel
(28 Ni58.7 ). It has helped in assigning atomic numbers to rare earth elements and also in
the discovery of new elements.
2.3
X-ray diffraction
When a monoenergetic x-ray beam is made to incident on a sample of a single crystal, diffraction occurs resulting in a pattern consisting of an array of symmetrically
arranged diffraction spots. The single crystal acts like a grating with a grating constant
comparable with the wavelength of x-rays, making the diffraction pattern distinctly
visible. The arrangement of atoms in a crystal results in the formation of planes with
a high density of atoms which diffract x-rays preferentially along specific directions.
Hence, a study of the diffraction pattern helps in the analysis of the crystal parameters.
2.3.1 Braggs law
Consider the diffraction of x-rays from a single crystal sample as shown in Fig.2.21.
Let XX and YY be two parallel planes with an interplanar spacing d.
A
79
between the two rays can be found by drawing perpendiculars BM and BN to the rays
DE and EF respectively.
The total path difference = ME + EN
= d sin + d sin
= 2d sin
(2.10)
The two reflected rays will be in phase if this path difference is equal to an integral
multiple of , the wavelength of x-rays.
Hence, the condition for producing a maximum becomes
n = 2d sin
(2.11)
F
C
S
80
mounted at the centre of a turn table T. The turn table also carries a side arm on which
a detector D for x-rays (usually an ionization counter like G.M. counter) is mounted.
The sample holder and the detector can be rotated about an axis perpendicular to the
direction of the incident x-rays. The rates of rotation of the crystal holder and the
detector arm are such that the detector always receives the x-rays reflected from the
crystal. When x-rays are incident on any particular plane at an angle , the diffracted
beam which will be at an angle 2 will be received by the detector. The output of the
detector is measured or recorded on a x-y recorder R to obtain a chart showing the
variation of diffracted intensity of x-rays as a function of diffraction angle.
The verification of Braggs law is carried out by mounting a standard sample on the
turn table of the spectrometer. The x-rays are allowed to fall on the sample at an angle
close to the glancing angle ( = 0) and the angle is gradually increased. The detector
output observed at angle 2 in each case is measured and the angles corresponding to
peak intensities are noted. For the standard sample, knowing the crystal structure and
the lattice parameters, the interplanar spacings may be calculated. For example, for a
cubic crystal, the permitted d-spacing can be calculated as
d=
a
(h2
k2
+ l2 )1/2
(2.12)
n2 2 2
(h + k2 + l2 )
2
4a
(2.13)
Knowing, and a, by assigning different integer values for h, k, l and order n, allowed
values of can be computed. These values of are compared with those obtained from
the Bragg spectrometer recordings. A perfect match between the computed values and
the observed values of Bragg angles will be obtained thereby establishing the validity
of Braggs law.
2.3.3 Structure determination
The Bragg spectrometer analysis is a widely used experimental technique for the routine determination of crystal structure. It is very much suitable for identification and
structure determination of crystals possessing high symmetry (eg. cubic crystals). It
is also possible to distinguish between the lattices of the cubic system using extinction
rules.
The values for different planes can be obtained from the spectrometer recording.
For a given cubic lattice, it is possible to list of all combinations of h, k and l. It is seen
81
that sin2 values will increase with (h2 + k2 + l2 ). Since is a constant and known, the
lattice parameter a can be found out for which sin2 values will be in the same ratio
as (h2 + k2 + l2 ).
The distinction between lattices of cubic system (i.e s.c., b.c.c. and f.c.c.) is possible by using the fact that not all combinations of (h2 + k2 + l2 ) lead to diffraction for a
given lattice. Some planes may not give diffraction peaks if the net diffracted intensity
add up to be zero.
It is possible to derive the extinction rules for different cubic lattices. These rules
are listed below:
Lattice type
Allowed reflections
1. Simple cubic
2. Body centred cubic
3. Face centred cubic
From the extinction rules, we can derive the ratio of (h2 +k2 +l2 ) values and compare
with the observed sin2 value to identify the lattice type. The ratio of (h2 + k2 + l2 )
values (or sin2 values) will be as follows:
Lattice type
Ratio
1. Simple cubic
1:2:3:4:5:6:8:9:....
2. Body centred cubic (2:4:6:8:10:12:14:16: ...)
1:2:3:4:5:6:7:8:...
3. Face centred cubic 3:4:8:11:12:16:19:20:...
It may be mentioned that (h2 + k2 + l2 ) = 7, 15, . . . etc., correspond to forbidden
reflections as the sum of the squares of three integers can not be 7, or 15, etc. Thus, if
7 appears in the ratio of (h2 + k2 + l2 ) values, it must correspond to 14 and the lattice
type will be b.c.c. If 7 is absent, the lattice is simple cubic. Face centred lattice can be
easily identified by the distinct ratio.
2.4
Electron diffraction
h
h
=
p mv
82
Screen
2.5
Neutron diffraction
83
The main difficulty with neutron diffraction is to obtain mono energetic neutron
beams. Neutron monochromators, which are modified versions of Bragg spectrometer,
can be used to obtain neutron beams of thermal as well as higher energies. The detection and recording of diffracted beams is also complicated since ionization counters do
not detect neutrons directly.
Numerical Examples
2.1 A cubic crystal has a lattice parameter of 5.6A . Calculate the interplanar spacing
for (110) and (111) set of planes.
Solution:
a
d= 2
2
(h + k + l2 )1/2
where h,k,l are the Miller indices of the set of planes under consideration.
For (110) planes,
d110 =
(12
5.6
5.6
= = 3.96A (Ans.)
2
2
1/2
+1 +0 )
2
(12
5.6
5.6
= = 3.23A (Ans.)
2
2
1/2
+1 +1 )
3
2.2 A cubic crystal of lattice parameter 4.31A has a set of planes with interplanar
spacing of 1.76A . Find the Miller indices of the set of planes.
Solution:
a 4.31
=
= 2.45
d 1.76
(h2 + k2 + l2 ) = (2.45)2 = 6
(h2 + k2 + l2 )1/2 =
Sum of the squares of three integers is 6. Hence, the Miller indices of the
planes are (211). Since it is cubic system, (211), (121), and (112) are identical.
2.3 Find the Miller indices of a set of parallel planes which makes intercepts in the
ratio 3a : 4b on the x and y axis , and are parallel to the z axis where a, b and c
are the primitive translation vectors of the lattice.
84
3a 4b
Take reciprocals
1/3 1/4 1/
4
3
min =
85
2.7 First order Bragg diffraction of x-rays of wavelength 1.54 A was observed at an
angle of 12 . Calculate the interplanar spacing for the set of planes responsible
for the diffraction. If the crystal has a lattice parameter of 6.41 A , find the planes
giving rise to diffraction.
Solution:
1 1.54
n
=
d=
2 sin 2 sin 12
d = a/(h2 + k2 + l2 )1/2
(h2 + k2 + l2 )1/2 = a/d = 6.41/3.70 = 1.73
(h2 + k2 + l2 ) = (1.73)2 = 3.
Hence, the set of planes are (111) (Ans.).
2.8 An x-ray diffraction pattern for cubic lead crystal is obtained with x-rays of wavelength 1.54 A . If the first order reflection from (110) planes is observed at 12.7 ,
determine the interplanar spacing for (110) planes and also the lattice parameter
of lead.
Solution:
n
1 1.54
Interplanar spacing d =
=
2 sin 2 sin 12.7
= 3.50A (Ans.)
(Cu) = 0.71
(41)2
(28)2
= 1.52A (Ans.).
2.10 Calculate the minimum accelerating potential to be applied to an x-ray tube in
order to produce continuous x-ray emission down to a wavelength of 1 Acirc .
86
2.11 In an x-ray diffraction experiment, the first order diffraction from a particular set
of planes was observed at 12 . Find the angle at which the second order diffraction
occurs for the same set of planes.
Solution:
n = 2d sin
For n = 1, sin 1 = /2d = sin 120
For n = 2, sin 2 = 2/2d = 2 sin 120
= 2 0.2079
= 0.4158
2 = sin1 (0.4158) = 24.570 (Ans.).
2.12 The Bragg diffracted beam of x-rays from a set of planes is observed at an angle
of 15 when a molybdenum target ( = 0.71A ) is used. Find the position of the
diffracted beam for the same set of planes when a copper target ( = 1.54A ) is
used.
Solution: We have
n = 2d sin
Hence, n1 = 2d sin 1 and n2 = 2d sin 2
sin 2 = 2 sin 1 /1
= 1.54 sin 150 /0.71
= 0.5614
2 = sin1 (0.5614) = 34.150 (Ans.).
2.13 The wavelength of K lines from a copper target is found to be 1.54 A . Find the
wavelength of L line from the same target.
Solution: We know that
!
1
1
2 1
= R(z b)
n21 n22
For K line, b = 1, n1 = 1 and n2 = 2
87
Exercise
2.1 State and explain Moseleys law, and explain on of its important applications.
(March 1999).
2.2 What is a space lattice ? Describe briefly the seven systems of crystals.
(March 1999).
2.3 Describe Braggs law for x-ray diffraction in crystals. Describe how Bragg spectrometer is used for determination of wavelength of x-rays.
(August 1999).
2.4 Explain Miller indices and indicate the following planes in a cubic unit cell:
(110), (101) and (111).
(August 2000).
2.5 Explain Moseleys law and mention its applications.
(August 2000).
2.6 Derive Braggs law for x-ray diffraction. Describe the determination of wavelength of x-rays using Bragg spectrometer.
(August 2000).
2.7 Explain Moseleys law and its importance.
(March 2001).
2.8 If the lattice parameter of a cubic crystal is 3 A.U.,find the interplanar spacing
between (111) planes.
(March 2001)
2.9 Derive Bragg?s law of x-ray diffraction. Explain how Bragg spectrometer is used
in the verification of the law.
(March 2001).
2.10 Define unit cell and Bravais lattice. Derive the expression for the interplanar
spacing in terms of Miller indices. The interplanar spacing of (110) planes is 2
A.U. for a cubic crystal. Find out the atomic radius.
(August 2001).
2.11 State Moseleys law. Discuss its applications. Calculate the energy of electrons
that produce Bragg diffraction of first order at an angle of 220 when incident on
crystal planes with interplanar spacing of 1.8 A.U.
(August 2001).
88
2.12 What are Miller indices? How do you find the Miller indices of a given plane?
(March 2002).
2.13 Derive Braggs law.
(July 2005)
2.14 Explain the structure of NaCl and calculate the packing factor for a bcc structure.
(July 2005)
2.15 Obtain the expression for interplanar distance in terms of Miller indices.
(July 2005)
2.16 Sketch the following planes in a cubic unit cell : (101), (121), (010). (July2005)
Chapter 3
Introduction
Metals have been known for their excellent ability to conduct heat and electricity. Such
a high conductivity is attributed to the presence of free electrons in metals and attempts
have been made to explain the conductivity of metals quantitatively. Earlier theories
were classical in nature. In the year 1900, Paul Drude proposed a theory of conductivity of metals based on the free electron model which was later developed and refined
by Hendrik Lorentz. According to the classical free electron theory, a metal can be
considered to be made up of ion cores comprising of nucleus and inner electrons excluding the valence electrons. These ion cores are immobile and do not take part in
the conduction process. The valence electrons are considered to be free and almost
independent of the ions to which they belong. These valence electrons can move under
the influence of an applied electric field and hence contribute to the conductivity.
Before discussing the classical free electron theory, let us examine the Ohms law
which is the most fundamental of the laws governing the charge transport in materials.
Consider a metal sample to which a voltage source is connected. This results in the
flow of an electric current through the sample. The magnitude of the current I is found
to be proportional to the applied voltage V and the relation is expressed as
IV
or
I = GV
(3.1)
(3.2)
This relation is known as Ohms law. The quantities involved in this relation are
sample specific and depend on the dimensions of the sample. A more general form of
Ohms law is
J = E
(3.3)
where J is the current density representing the charge moving past a reference point per
unit area of cross section of the sample per unit time, E is the Electric field equivalent
to the voltage per unit length of the sample and is the electrical conductivity of the
89
90
l
where A is the area of cross section and l is the length of the sample. The advantage
of using Ohms law in its general form is that the quantities involved are not specific
to the sample and are independent of sample dimensions. The electrical resistivity is
a constant for the material and may be used to specify the relative ease with which an
electric current passes in metals.
3.2
91
equation as
v
x
= (e/m)E
(3.6)
t f ield
The partial time derivative of velocity and the subscript field indicate that the acceleration mentioned here is due to the effect of the applied electric field. Contribution
to the acceleration due to other possible effects will be considered later. Considering a
large group of electrons, the average velocity at any instant may be written as
N
1X
< vx >=
vxi
N i=1
(3.7)
where N is the number of electrons considered. Hence, we may write equation (3.6) as
< v >
x
= (e/m)E
(3.8)
t
f ield
Considering unit area of cross section of the sample and counting the total number of
carriers crossing the unit area along the x-axis as shown in Fig. 3.1 in unit time, we
find that all the carriers present in an elemental volume of unit area of cross-section
and upto a length equal to vx will cross the reference point at origin.
Y
1
X
1
vx
Z
Figure 3.1 Schematic representation of electron flow for calculation of current density.
In other words, all the electrons present in the elemental volume vx will contribute
to the current density. If there are n conduction electrons per unit volume in the given
sample, the number of electrons contributing to current density is equal to nvx . Since
each electron is associated with a charge (e), the current density at any instant may
be written as
J = ne < vx >
(3.9)
92
(3.10)
Comparing equation (3.8) with equation (3.10), since the time invariance of current
density is an experimentally confirmed fact, we conclude that equation (3.8) does not
give the total time rate change of velocity and that there must be other processes which
make the total time rate change of velocity equal to zero.
< v >
d < vx > < vx >
x
+
=0
i.e.,
=
dt
t
t
f ield
other
(3.11)
The other process responsible for the time rate change of velocity of conduction electrons has been identified as electron- lattice interaction. Since the electrons are moving in the crystal lattice of ion cores, such an interaction is a possibility. An experimental observation that a sample tends to warm up when an electric current is passed
indicates possible transfer of energy (momentum) from electrons to the lattice.
In order to understand the nature of lattice - electron interaction, let us consider a
sample subjected to an electric field E resulting in a current density J and an associated
average electron velocity < vx >. If the applied electric field is switched off at any
instant, say t = 0, the electron flow will stop indicating that the velocity < vx > goes to
zero. This decrease in the velocity is attributed to the interaction of electrons with the
lattice. The average velocity is assumed to decay exponentially with time (Fig.3.2) in
accordance with the equation
< vx >t =< vx >o exp (t/)
(3.12)
93
< vx >
Figure 3.2 Exponential decay of the average velocity of electrons when the applied
field is removed.
where < vx >t is the instantaneous value of average velocity at time t, < vx >o is
the average velocity when the electric field is turned off. The quantity is called the
relaxation time of conduction electrons. Differentiating equation (3.12)
< v >
< vx >
x
=
(3.13)
t
coll
In other words, when the applied electric field is turned off, the velocity component
will reduce to zero due to electron-lattice collisions and the second component of acceleration mentioned as due to other processes in equation (3.11) is now identified as
due to collisions. Substituting for the two components of acceleration from equations
(3.6) and (3.13), equation (3.11) becomes
eE < vx >
d < vx >
=
=0
dt
m
eE
or < vx > =
(3.14)
m
The steady state average velocity of electrons in presence of an applied electric field is
called the drift velocity of the carriers. This is proportional to the applied electric field
and the constant of proportionality is called the mobility of the carriers. Hence, the
mobility of carriers is defined as the drift velocity per unit applied field and is given by
e
=
(3.15)
m
94
ne2 E
m
(3.16)
Comparing equation (3.16) with equation (3.3), we have the electrical conductivity of
a metal given by the expression
=
ne2
= ne
m
(3.17)
This expression for the conductivity of a metal predicts the dependence of conductivity
on the total number of valence electrons per unit volume and their mobility.
Drude made use of the kinetic theory of gases to evaluate . Considering electrons
as equivalent to molecules in a gas, he evaluated the kinetic energy of the electrons as
1 2 3
mv = kT
2
2
where v is the root mean square velocity of electrons, which can be calculated as
v = (3kT/m)1/2
(3.18)
At room temperature, the thermal velocity of electrons calculated using the above equation is 1.15 105 ms1 . The drift velocity acquired by the electron due to an applied
field is much smaller than the thermal velocity. Hence, the relaxation time may be
written in terms of the mean free path as
= /v = (m/3kT )1/2
Hence, by substituting this value of in equation (3.17), the conductivity of a metal
may be expressed as
ne2
ne2
=
(3.19)
=
m
(3mkT )1/2
This equation suggests that the conductivity of a metal must be inversely proportional
to the square root of temperature. This is in contrast to the experimental observation
that the conductivity is inversely proportional to the temperature. This discrepancy
is due to the fact that is assumed to be independent of temperature. In fact, is a
function of temperature which is not explained or accounted for by the classical free
electron theory.
95
In order to account for the validity of Ohms law in the case of metals, we have introduced the concept of electron-lattice interaction. Let us now try to understand the
nature of such an interaction and its consequences.
vx
vx
This change in velocity is negative since the velocity decreases with each collision and
96
hence,
< v >
< vx > (1 < cos >)
x
=
(3.20)
t
c
coll
Comparing this equation with equation (3.13), we see that the mean collision time is
related to the relaxation time as per the equation
=
c
(1 < cos >)
(3.21)
When the scattering is isotropic, i.e., without any preferred directions, < cos >= 0
and
= c
If the scattering is predominantly along the direction of carrier flow, as is usually the
case, < cos > will have a positive value and
> c
If v is the total velocity of an electron, we can write the average distance travelled
between successive collisions as
= vc
(3.22)
where is called the mean free path of the electron.
3.2.3 Failure of classical free electron theory
1. The classical free electron theory assumes that the valence electrons in a metal,
by virtue of being the outermost electrons in the atoms, are almost free and hence
take part in the conduction process. Hence, divalent and trivalent metals, with a
larger concentration of valence electrons should possess much higher electrical
conductivity than monovalent metals. This is contrary to the experimental observation that silver and copper are more conducting than zinc and aluminium.
2. Assuming that the kinetic energy associated with the electrons to be equal to the
thermal energy, Drude obtained the expression for the conductivity as given by
the equation
ne2
ne2
=
=
(3.19)
m
(3mkT )1/2
This equation suggests a temperature dependence of conductivity as
T 1/2
97
3.3
According to the kinetic theory of gases, a gas is assumed to consist of a large number
of rigid and perfectly elastic particles, namely molecules. These molecules are all of
same mass and are moving freely inside the container. During their random motion,
they collide with each other and acquire all possible velocities. At a given temperature,
in a state of thermal equilibrium, the distribution of molecular velocities and energy
among the molecules of a gas is given by Maxwell-Boltzman statistics. This statistics
can be applied to a system of identical particles that can be distinguished from one
another. This means that the wavefunctions associated with the particles do not overlap
98
considerably. The distribution function that denotes the fraction of the total number of
molecules possessing an energy E is given by,
F MB (E) =
1
A exp (E/kT )
1
A exp (E/kT ) 1
99
We may recall that this was the function used in Plancks theory of radiation for explaining the black body radiation.
Later, Fermi and Dirac found it necessary to modify the Bose-Einstein statistics
for the case of particles that obey Paulis exclusion principle. These are particles having half integral spin like electrons and nucleons. They are called Fermions. The
wavefunctions of Fermions also overlap considerably and hence the particles are indistinguishable. However, the wavefunction of a system of Fermions is affected by the
exchange of any pair of them. A wavefunction of this kind is called a antisymmetric
wavefunction. The system is described by the wavefunction
100
of electrons per unit volume of the sample is very large, we can expect some electrons to have relatively large kinetic energy even at ordinary temperatures. In order to
make a theoretical estimation of the electrical conductivity of a metal, it is necessary
to know the distribution of electrons in various energy levels and also a knowledge of
the number of electrons actually contributing to the conductivity.
3.3.1 Density of energy states in a metal
The electrical conductivity of a metal depends on the total number of electrons contributing to conductivity. This can be evaluated with the knowledge of the total number
of energy states available for occupation at different energy levels and their occupation.
The number of allowed energy states per unit energy interval at a particular energy
value is called the density of energy states. We will now proceed to obtain an expression for the same.
Py
Y
Py
y
Pz
Px
PM
Px
Pz
(a)
(b)
101
by
x = y = z = 1
or
1 2 p2M
E = mv =
2
2m
p M = (2mE)1/2
8
(2mE)3/2
3h3
(3.24)
102
(3.25)
If M is increased by a small value dM, the radius of the sphere will increase from p M
to (p M + d p M ) and the highest energy level from E to (E + dE). By definition, the
density of energy states g(E) is given by
dM
DE
27/2 m3/2 E 1/2
=
h3
g(E) =
(3.26)
g(E)
103
(3.27)
Fermi factor represents the probability that a particular energy level E is occupied
at a temperature T . At absolute zero, we find the probability function has a value equal
to 1 for all energies less than E F and a value equal to zero for all energies higher than
E F . Thus, E F serves as a reference level and is called the Fermi energy. Fermi energy
can be defined as that energy at which the probability of occupation is half. This can be
shown to be valid at all temperatures. The distribution of electrons at different energy
levels is shown in Fig. 3.6.
EF
T = OK
T1
T2
104
At O K, all the energy levels upto E F are completely filled and all the energy levels
above E f are empty. Thus, Fermi energy can also be defined as the energy of the top
most filled level at 0 K. An increase in temperature results in a few electrons acquiring
energy greater than E F . However, the change is prominent only at energy values around
E F . Thus, only a small fraction of valence electrons lying at energy levels close to E F
will contribute to the electrical conductivity. Since the thermal energy kT at any finite
temperature is very small compared to the Fermi energy, the effect of temperature on
the number of electrons contributing to conductivity is negligible. The velocity of
conduction electrons can be calculated as
1 2
mvF = E F or vF = (2E F /m)1/2
2
(3.28)
It may be mentioned that the relaxation time of conduction electrons refers to the
value calculated for electrons with velocity vF . Hence, the electrical conductivity given
by the equation (3.17) may be expressed as
= ne f f
e2
m
(3.29)
where ne f f refers to the number of valence electrons at the Fermi energy and with a
velocity vF . The above equation may also be obtained by considering the movement of
electron in a metal to be equivalent to the propagation of a wave in a uniform potential
resulting from the ion cores. Thus quantum free electron theory is capable of explaining a few experimental observations which could not be explained on the basis of the
classical theory. The higher electrical conductivity of monovalent noble metals over
that of some bivalent and trivalent metals can be explained on the basis of the effective
number of electrons contributing to conductivity. Further, since the relaxation time and
the mean free path of the conduction electrons are inversely related to the temperature,
it follows that the conductivity varies inversely with temperature at high temperatures.
However, the large values of mean free path, of the order of hundreds of atomic distances, cannot be explained on the basis of quantum free electron theory. The theory
also fails to explain other experimental observations like positive Hall coefficient for
some metals and the temperature dependence of conductivity of semiconductors and
insulators.
3.3.3 Band theory of metals
The quantum free electron theory of metals assumes that a conduction electron is subjected to a constant potential due to the ion cores comprising of nucleus and inner
105
electrons. This amounts to the conclusion that the ion cores have no influence on the
movement of electrons that are free to move within the sample. The band theory treats
the metal as an infinite array of positively charged ions through which the conduction
electrons move. The potential energy due to coulombic attraction between the electron
and the positive ion may be represented by periodic square well type potential. Then,
we observe that the solution of the Schrodingers wave equation leads to the following
conclusions:
The motion of electrons in the metal lattice is restricted by two factors. The first
factor is the restriction on the energy of the electrons. The distribution of energy is
determined by the Fermi-Dirac statistics which predicts discrete energy levels for the
electrons. The second factor is the restriction on the energy possessed by the electrons
as a result of the formation of allowed and forbidden energy bands.
Thus, the quantum theory leads us to the Band theory of solids, which is used very
effectively to explain various properties of metals such as electrical conduction, heat
capacity and heat conduction.
3.3.4 Merits of quantum free electron theory
(i) The discrepancy in the electrical conductivity of metals of different valences
could not be explained by the classical free electron theory. Quantum mechanically, it is observed that all valence electrons are not conduction electrons and
only those valence electrons having an energy equal to or greater than the fermi
energy will take part in conduction. On this basis, it is observed that aluminium,
in spite of having three electrons per atom, have a fewer electrons available for
conduction as compared to silver and hence a lower electrical conductivity.
(ii) The temperature dependence of electrical conductivity of metals as explained by
the classical free electron theory is based on the expression,
ne2
(3.19)
(3mkT )1/2
This expression indicates that the conductivity is inversely proportional to the
square root of absolute temperature. However, this conclusion assumes that the
mean free path of the conduction electrons is a constant and independent of temperature. Quantum theory is capable of explaining the temperature dependence
by taking into account the temperature dependence of mean free path of the electrons. According to the quantum free electron theory, the conductivity can be
shown to be inversely proportional to temperature which is in agreement with the
experimental observations.
=
106
(iii) The large value of mean free path experimentally observed is not explained by
classical free electron theory. The quantum free electron theory, with the help
of the band model, assumes that the movement of an electron in the metal lattice is equivalent to the propagation of an electron wave in a periodic potential.
The scattering of electron is equivalent to the modification of the amplitude of
this electron wave due to the deviation from the periodic potential at the sites of
impurities and imperfections. This explains the large value of mean free path of
conduction electrons. Further, the relation between the concentration of defects
or impurities and the mean free path has been experimentally established.
In addition, many other physical properties associated with electrons in metals can
be successfully explained on the basis of quantum theory.
3.4
The large values of the mean free path of conduction electrons of the order of hundreds
of atomic distances have been computed for the case of many metals at room temperature. The quantum free electron theory tries to explain the situation by attributing
wave nature to the conduction electrons moving in a periodic potential of the lattice.
The obstacles responsible for the scattering of the electron wave come in the form of
deviations from the periodicity of the lattice. The deviations from periodicity arises
due to the impurities and imperfections. Thus, the scattering of conduction electrons
may occur due to the following reasons:
(a) vacancies in the lattice,
(b) atoms occupying wrong positions in the lattice,
(c) lattice atoms replaced by impurity atoms
(d) thermal vibrations of atoms about their mean position, etc.
These various kinds of obstacles have to be considered in the scattering of electrons,
each of them giving rise to certain relaxation time separately. The probability for
scattering by a particular kind of obstacle for unit time is equal to the reciprocal of the
relaxation time associated with the obstacle and the total probability of scattering of an
electron will be the sum of the individual probabilities. Hence, we can write
1/ = 1/i + 1/T
(3.30)
107
where i and T are the effective relaxation times due to temperature independent and
temperature dependent scattering events. The resistivity of the metal may be written
as
m
= 2 (1/i + 1/T )
(3.31)
ne
or = i + T
(3.32)
where i is called the residual resistivity which is independent of temperature and
is caused by defects and impurities whose characteristics are independent of temperature. The part T is the temperature dependent resistivity with contributions from
thermal vibration of the lattice and thermal defects. This part of resistivity is called
thermal resistivity or ideal resistivity. The equation (3.31) and (3.32) are referred to
as Matthiessens rule.
3.4.1 Effect of temperature
It is well known that the electrical resistivity of metals increases with increase in temperature. Fig. 3.7 shows the variation of resistivity with temperature. The curve may
be divided into three regions.
T /D
108
Hence, the thermal resistivity may be neglected in comparison with the residual resistivity. The total resistivity will be almost equal to the residual resistivity and almost
independent of temperature.
Region II: In the intermediate temperature region, resistivity is a complex function of
temperature. The resistivity may be expressed as
(T/D )n
(3.33)
where D is the Debye temperature and the exponent n can have values between
5 and 1. (The Debye temperature D corresponds to the temperature at which the
frequency of vibration D of the lattice has a maximum value given by the relation
hD = kD . Above Debye temperature, all the modes of vibration are excited.) The
value of n is equal to 5 at the low temperature end of the region and equal to 1 at the
high temperature end.
Region III: This region corresponds to temperature above the Debye temperature. In
this region, the value of n is 1 and hence, resistivity may be expressed as
= i + aT
(3.34)
109
%N i
Figure 3.8 Effect of nickel impurity concentration on the electrical resistivity of copper.
It is found that the resistivity increases with increase in the concentration of impurity added. Now, alternatively, small amount of Cu is added to the metal Ni. It
is observed that the resistivity increases irrespective of the electrical resistivity of the
impurity added. This is due to the fact that the impurity atoms destroy the periodicity
of the lattice and thus act as scattering centres. The change in the resistivity due to
impurity is related to the concentration of impurity according to the relation
i x(1 x)
(3.35)
(3.36)
In the case of alloys with higher concentration of the impurity the resistivity versus
concentration curve shows a peak. An example of Cu Au system is shown in Fig. 3.9.
The Cu Au alloy has a maximum resistivity at 50% concentration of each component.
This is called the disordered phase of the alloy. If the atoms in the alloy are ordered (by
annealing at elevated temperature) the resistivity drops at finite composition indicating
the formation of an ordered phase. In an ordered phase, the lattice is again periodic and
hence the resistivity reduces. This is in fact a proof for the assumption that impurities
enhance scattering probability.
110
Cu3 Au
CuAu
50
% Au
100
3.5
Thermionic emission
In any metal, at any temperature above 0 K, the energy states at the top of the band
are not completely full. A few electrons close to the Fermi level will find themselves
at energy levels above the Fermi energy and are free to move about within the metal
(fig 3.6). But these electrons are not free enough to move out of the metal. For a given
metal, we can define an energy level E s , above the Fermi level, at which the electron
will be completely free from the metal. This energy level E s is called the vacuum
level, which represents the energy of an electron at rest outside the metal (Fig.3.10).
ES
EF
Figure 3.10 Energy level diagram for a metal surface showing Fermi level, vacuum
level and barrier energy for electron emission.
111
The difference in the energy levels E F and E s is known as the work function of the
material. It is defined as the energy required to raise an electron from the Fermi level
to the vacuum level of the metal and is characteristic of the metal.
As the temperature of the metal is raised, the distribution of electrons in various
electron energy levels near the Fermi energy is modified (Fig.3.11).
ES
EF
T = OK
T > OK
E
(3.37)
where A is a constant dependent on the metal and is the work function of the metal.
Equation (3.37) is called Richardson - Dushman equation. An experimental study of
the variation of saturation current density with temperature helps in the determination
of the work function of the metal surface.
112
Numerical Examples
3.1 What is the mobiity of electrons in copper which has resistivity 1.6 108 ohm m
and electron density of 8.5 1028 m3 ?
Solution:
= ne
= /ne = 1/ne where is resistivity.
1
=
28
8.5 10 1.6 1019 1.6 108
= 4.6 103 m2 v1 s1 . (Ans.).
3.2 In the above example, calculate the relaxation time of conduction electrons.
Solution: We have,
= ne2 /m
= m/ne2 = m/ne2
=
9.1 1031
8.5 1028 (1.6 1019 )2 1.6 108
3.3 The relaxation time of conduction electrons in copper is 2.6 1014 s. If copper
has a resistivity of 1.6 108 Ohm m, calculate the number of free electrons per
unit volume and Fermi velocity. Assume Fermi energy as 7 eV.
113
Solution: We have,
= ne2 /m
n = m/e2 = m/e2
9.1 1031
(1.6 1019 )2 2.6 1014 1.6 108
n = 8.5 1028 m3 (Ans.).
=
Assuming Fermi energy as the kinetic energy of the electrons at the Fermi
level,
1
E f = mv2f
2
Fermi velocity, v f = [2E f /m]1/2
[2 7 1.6 1019 ]1/2
=
[9.1 1031 ]
= 1.57 106 ms1 . (Ans.).
3.4 Estimate the relaxation time of conduction electrons in silver from the following
data:
Resistivity = 1.6 108 ohm m
Density = 10.5 103 kg/m3
Atomic weight = 107.88
9.1 1031
(5.86) 1028 (1.6 1019 ) x1.6 108
= 3.79 1014 s. (Ans.).
3.5 The resistivity of copper at 0oC is 1.6108 ohm m and changes to 2.2108ohm m
at 100oC. By what factor does the relaxation time of conduction electron change?
Solution: Relaxation time,
= m/ne2
114
= 0.036Vm1 (Ans.).
3.7 What is the drift velocity of conduction electrons in a copper wire of cross sectional
area 1 106 m2 when a current of 2A flows through it? Assume the electron
density as equal to 8.5 1028 m3 .
Solution:
Current density J = nev or v = J/ne
v = 2/(1 106 8.5 1028 1.6 1019 )
115
Exercise
3.1 Elucidate the difference between classical free electron theory and quantum free
electron theory.
(March 1999).
3.2 Explain the classical free electron theory of metals. Mention its shortcomings.
(August 1999).
3.3 Explain the motion of a free electron in a periodic lattice.
(March 2000).
3.4 What are the effects of temperature and impurity on the electrical resistivity of
metals?
(March 2000).
3.5 What are relaxation time and collision time? On the basis of free electron theory
of metals, obtain an expression for the electrical conductivity of a metal in terms
of collision time.
(March 2000).
3.6 For a metal having 6.5 1028 conduction electrons per m3 , find the relaxation
time of conduction electrons if the metal resistivity is 1.43 108 ohm.m.
(August 2000).
3.7 Based on free electron theory, derive an expression for electrical conductivity in
metals and discuss the failure of classical free electron theory. (August 2000).
3.8 Calculate the mobility and the relaxation time of electrons in copper with the
following data:
Resistivity of copper = 1.73 108 ohm m.
Atomic weight = 63.5.
Density = 8.92gm cc1
Avogadro number = 6.02 1023 per gm mole.
(March 2001).
116
3.9 Assuming the electron-lattice interaction to be responsible for scattering of conduction electrons in a metal, obtain an expression for the conductivity in terms
of relaxation time.
(March 2001).
3.10 Obtain an expression for electrical conductivity with the help of classical free
electron theory.
(August 2001).
3.11 Give a qualitative account of thermionic emission.
(March 2002).
3.12 Find the mobility of conduction electrons in copper using the data given:
Resistivity of copper = 1.7 108 ohm m.
Atomic weight of copper = 63.54
Density of copper = 8.96 103 kgm3
(March 2002).
3.13 Based on free electron theory, derive an expression for the electrical conductivity
in a metal. Explain the deficiencies of classical free electron theory.
(March 2002).
3.14 Explain density of states. Derive the expression for number of allowed energy
states for a unit volume of a solid.
(Feb 2003).
3.15 Explain the terms drift velocity, mobility and relaxation time.
(Feb 2004).
3.16 Find the temperature at which there is 1% probability that a state at an energy
level 0.5 eV above Fermi level will be occupied.
(Feb 2004).
Chapter 4
Superconductivity
4.1
Introduction
The electrical resistivity of all metals decreases as the temperature is reduced. This
is because the thermal vibration of atoms decreases with decrease in temperature. At
very low temperature, the thermal vibrations contribute very little to the scattering of
conduction electrons and the electrical resistivity will be dominated by the effect of
impurities and defects. The electrical resistivity may tend to zero as the temperature
approaches O K only in the case of pure and perfect single crystals. However, some
metals show a remarkable behaviour. They lose their electrical resisistivity completely
below a certain temperature. For example, the electrical resistance of pure mercury
suddenly drops to zero (Fig.4.1) when cooled below 4.2 K. The material is said to
be in the superconducting state below the critical temperature and the phenomenon is
known as superconductivity.
T, K
118
Superconductivity
Table 4.1 Superconducting transition temperature of some selected materials
Element T c , K
Al
1.1
In
3.4
Ta
4.4
V
5.1
Pb
7.2
Nb
9.2
4.2
Compound T c , K
Pb2 Au
7
MoN
12
NbN
15
V3 S i
17
Nb3 S n
18
Nb3 Ge
21
Superconductivity
119
lithium, sodium and potassium do not exhibit superconductivity even at temperatures as low as 0.1 K.
4.2.1 Isotope effect
When an element exhibiting superconductivity exists in various isotopic forms, it is
observed that the transition temperature decreases as the isotopic mass increases. The
variation of transition temperature with isotopic mass is described by the relation,
M 1/2 T c = constant.
(4.1)
(4.2)
T > Tc
(a)
T < Tc
B > Bc
(b)
(c)
Figure 4.2 Demonstration of Meissner effect. (a) The magnetic flux lines passing
through the sample for T > T c, (b)flux lines being expelled for T < T c and (c) the flux
lines penetrating when the field is increased above the critical value.
120
Superconductivity
When a magnetic field is applied to the material, the magnetic flux lines pass
through the material. Now, if the temperature is reduced below the critical temperature the magnetic flux lines will be expelled from inside the superconductor (Fig 4.2b).
Hence, we have,
B = o (H + M) = 0
(4.3)
where M is the magnetization in the material due to an applied magnetic field H. The
magnetic susceptibility is given by
=
M
= 1
H
(4.4)
which indicates that the material in its superconducting state is a perfect diamagnetic
material.
If a superconductor is subjected to a strong magnetic field, the material loses its
superconducting property and becomes normal. As the magnetic field is increased in
strength, at a particular value of the magnetic field, the magnetic flux starts penetrating
into the material and makes it a normal material (Fig.4.2c). Hence, we can define a
critical magnetic field, corresponding to a temperature, upto which the material remains in the superconducting phase and above which the material becomes normal. At
any temperature T below T c , the critical magnetic field Hc (T ) upto which the material
remains in the superconduting phase, is given by the relation
"
#
T2
Hc (T ) = Hc (0) 1 2
Tc
(4.5)
where T c is the critical temperature in the absence of a magnetic field and Hc (0) is a
constant representing the critical magnetic field at T = 0K (Fig.4.3). The magnetic
field that destroys superconductivity in any material need not be an externally applied
field. It may also be an induced magnetic field due to an electric current flowing
through the material. The magnetic field H induced in a wire of radius r when a
current I flows through it is given by
I = 2rH
(4.6)
If this induced magnetic field becomes equal to the critical magnetic field, the material
becomes a normal conductor. Hence, we can define a critical current Ic corresponding
to the critical magnetic field Hc that is sufficient to destroy superconductivity in the
wire.
Superconductivity
121
HC
T, K
Figure 4.3 The variation of critical magnetic field with temperature.
Ic = 2rHc
(4.7)
Equation (4.7) is called Silsbees rule. This puts a limit on the electric current that can
be passed through a superconductor.
4.3
Classification of superconductors
Superconductors are classified as Type I (or soft) superconductors and Type II (or hard)
superconductors. The classification is based on the magnetic behaviour of superconductors. Type I superconductors have a small critical magnetic field and the transition
from the superconducting to the normal phase at the critical field is abrupt. Assuming that there is a magnetization in the superconductor in a direction opposite to the
direction of the applied magnetic field, the variation of magnetization with the applied
field will be as shown in Fig.4.4. At the critical magnetic field, there will be an abrupt
HC
122
Superconductivity
decrease in the magnetization and the material becomes normal. For all values of magnetic field above the critical field, the material shows finite resistivity and the magnetic
flux penetration is complete. In other words, Type I superconductors display Meissner
effect completely.
In the case of Type II superconductors, the material is a perfect superconductor
upto a magnetic field Hc1 (Fig.4.5). The magnetic flux penetration begins at Hc1 and
is complete at a magnetic field Hc2 . The material becomes a normal conductor for
magnetic field greater than Hc2 . The material is said to be in a mixed state or vortex
state between the two critical magnetic fields Hc1 and Hc2 . Inspite of the fact that the
magnetic flux lines penetrate the material in the vortex state,
Hc1
Hc2
Type II
1. These are usually elements in their These are impure elements, alloys or
pure form.
compounds.
2. The transition to normal state occurs
abruptly at a critical magnetic field,
Hc
Superconductivity
123
5. The superconductivity or the zero re- The material remains in the resissistance state is observed upto critical tanceless state even in the intermedimagnetic field Hc .
ate state between Hc1 and Hc2 and becomes normal only at Hc2 .
6. The small value of Hc , whether applied or induced, restricts their use
4.4
Applications of superconductors
The major applications that are based on the properties of the superconducting phase
are
(i) Lossless power transmission
(ii) Superconducting magnets
(iii) Magnetic levitation
(iv) Superconducting memories and switches
(v) Superconducting quantum interference devices.
(i) Lossless power transmission: The transmission of electrical power on a large
scale is limited due to the restrictions on the current carrying capacity of transmission lines and their maintenance. Loss of power due to power dissipation in
transmission lines due to the finite resistance offered by the cables is quite considerable. Further, due to the passage of the current, the metallic cable gets heated
up thereby increasing the electrical resistance of the cables. This increases the
joule heating losses further. If the transmission lines are made of materials in
their superconducting state, their current carrying capacity increases. Since these
lines do not offer any resistance to the flow of electric current, there is no power
124
Superconductivity
dissipation due to joule heating. Thus, power losses are minimised and current
carrying capacity is improved. This is the principle used in lossless power transmission.
(ii) Superconducting magnets: The problem of joule heating again puts a limit on
the magnetic field that can be obtained in an electromagnet. The magnetic field
generated in a coil depends on the strength of the direct current flowing through it.
Any attempt to increase the magnetic field by increasing the current through the
coil results in increasing the joule heating . This puts an upper limit on the current
carrying capacity of the wire used and hence on the magnetic field produced. Use
of superconducting coils in the electromagnets can enhance the magnetic field
generated as they can carry large currents with practically no dissipation. This is
the principle used in the construction of superconducting magnets. However,
the limitation now comes in the form of the effect of magnetic field on the superconducting state of the coils. When the magnetic field generated exceeds the
critical value at the operating temperature, the metal wire used in the coils will
return to their normal state and start offering large resistance to current flow and
drastically reducing the magnetic field generated.
(iii) Magnetic levitation: The principle of the repulsion of magnetic flux from a
superconductor can be used in magnetic levitation applications. When a magnet
is brought near a superconductor, there will be a repulsion and the superconductor
tries to move away from the magnet. One of the reasons for a limitation to the
speed of a train, for example, is the friction between the wheels of the train and
the tracks on which the train moves. Superconducting wheels experience reduced
friction on magnetic tracks. Alternatively, magnetic levitation may be used to
reduce friction between the wheels of a train and the track on which it moves.
It is well known that like poles of magnets repel and the friction between the
wheels and the tracks get reduced when they are made like poles of magnets.
The large magnitude of repulsive force required may be realized with the help of
superconducting magnets. The principle can also be used in making friction-less
bearings.
Practically, Maglev is a vehicle that runs levitated from the guideway, which is
similar to the tracks in railways. This is achieved by using electromagnetic forces
between superconducting magnets on the vehicle and coils on the ground. Levitation coils shaped like figure 8 are installed on the sidewalls of guideway (Fig.4.6
a). When the superconducting magnet attached to the vehicle passes below the
centre of these coils, electromagnetic force pushes the magnets upwards thereby
Superconductivity
125
111111111111
000000000000
N
S
000000000000
111111111111
000000000000
111111111111
000000000000
111111111111
000000000000
111111111111
000000000000
111111111111
(a)
111111111111
000000000000
000000000000
111111111111
S
111111111111
000000000000
000000000000
111111111111
N
GUIDWAY
(b)
Control, Nb(9.2K)
126
Superconductivity
wire are in the superconducting phase. Hence, the resistance of the core wire will
be zero. A current can be passed through the control coil to produce a magnetic
field sufficient to make the core wire normal. Thus, the core wire can be made
to possess zero or finite resistance depending on the control current being off
or on respectively. These two states of the core wire may be considered as the
ON and OFF states. Thus, cryotrons can be used as switches.
Based on the same principle is the construction of superconducting memory. If a
current is induced in a superconducting ring, the current persists and continues to
flow until a magnetic field is applied to make the ring normal, thereby allowing
the current to decay. The direction of current flow may be used to represent the
state of memory, namely 0 or 1.
M ET AL 1
M ET AL 2
Superconductivity
127
electromagnetic radiations. A dc magnetic field applied through a superconducting circuit containing two junctions causes the maximum supercurrent to show
interference effects as a function of the applied magnetic field intensity. This is
used in the construction of Superconducting Quantum Interference Devices
(SQUIDs).
J1
JIN
JOU T
J2
128
Superconductivity
This expression indicates that the output current varies with the applied magnetic
flux and shows oscillations(Fig.4.10).
J
4.5
Superconductivity
129
is equivalent to an attractive force between the two electrons which becomes maximum when the electrons have opposite spins and associated momentum. The electrons
overcome the electrostatic force of repulsion and are bound in pairs with a weak force
of attraction assisted by the lattice. Due to this interaction, the scattering felt by one
electron is exactly nullified by the response of the other electron of the pair, so that
the electrons move without experiencing any effect of scattering. The resulting Cooper
pairs are bosons with zero spin and are not bound by the Paulis exclusion principle.
Hence, they condense into a common ground state. Such a condensation results in the
formation of an energy gap with the Cooper pairs existing below the gap and the normal electrons above it. The energy gap represents the binding energy of these Cooper
pairs and is found to be related to the transition temperature. When the temperature is
increased above the transition temperature, the energy gap disappears and the Cooper
pairs break up to form normal electrons. The BCS theory also predicts a close relation
between the electron lattice interaction and superconductivity. The isotope effect is in
support of this view. Further, the absence of superconductivity in highly conducting
noble metals like gold, silver and copper , even at very low temperature, is attributed
to the absence of strong lattice-electron interaction. In other words, the BCS theory
has been able to explain the observed characteristic features of superconductors.
4.6
Superconductivity has been observed in a few elements, but in a large number of alloys
and compounds. Until 1986, the highest transition temperature was about 23 K. In
1986, several materials were discovered with T c in the range 30-40 K. In a couple of
years, spectacular results were reported with T c upto 125 K. This led to the hope that it
might be possible to develop materials that are superconductors at room temperature.
Discovery of such materials would mean large scale application of the phenomenon.
Further, the BCS theory predicts a superconducting transition temperature T c given by
T c = 1.14D exp {1/N(E F )U}
(4.8)
where D is the Debye temperature, N(E F ) is the density of states at the Fermi level
and U is a measure of the electron-phonon interaction. This equation suggests that
materials, which are characterized by high resistivities at normal temperatures, have
a better probability of becoming superconductors at higher transition temperatures.
Accordingly, some mixed oxide systems were studied and were found to have transition temperatures above liquid nitrogen temperature (77 K). These are known as high
temperature superconductors.
130
Superconductivity
Table 4.3 Some oxide superconductors.
Compound
Tc , K
Yba2Cu3 O7
90
110
T lBa2Ca3Cu4 O11
120
T l2 Ba2Ca2Cu3 O10
125
Numerical Examples
4.1 The critical temperature and critical magnetic field for superconducting lead are
7.2 K and 800 gauss respectively. What will be the temperature up to which lead
will be in superconducting state in a magnetic field of 400 gauss?
Solution:
h
i
Hc (T ) = Hc (0) 1 T 2 /T c2
i
h
400 = 800 1 T 2 /7.22
T = 5.09K. (Ans.).
4.2 Superconducting tin has a critical magnetic field of 217 gauss at 2 K. If the critical temperature for superconducting transition for tin is 3.7 K, find the critical
magnetic field at 3 K.
Solution:
h
i
Hc (T 1 ) = Hc (0) 1 T 12 /T c2
h
i
Hc (T 2 ) = Hc (0) 1 T 22 /T c2
h
i h
i
1 T 12 /T c2
T c2 T 12
Hc (T 1 )
i
i=h
=h
Hc (T 2 )
1 T 22 /T c2
T c2 T 22
h
i
T c2 T 22
i Hc (T 1 )
Hc (T 2 ) = h
T c2 T 12
Superconductivity
131
h
=
(3.7)2 (3)2
(3.7)2 (2)2
217
4.3 The critical temperature for Hg199.5 is 4.185 K. Calculate the critical temperature
for Hg203.4 .
Assume = 0.5.
Solution:
TC (199.5)1/2 = TC (203.4)1/2
TC =
(199.5)1/2
TC (199.5)1/2
=
4.185
= 4.145K (Ans.).
(203.4)1/2
(203.4)1/2
Exercise
4.1 Compare the dependence of resistance on temperature of a superconductor with
that of a normal conductor. Describe briefly the formation of Cooper pairs.
(March 99)
4.2 Describe different types of superconductors.
(March 99)
(August 01)
4.6 What is Meissner effect? Explain the validity of Meissner effect in Type I and
Type II superconductors.
(March 02)
4.7 What are SQUIDS ? Give a brief account of their applications.
(Feb 2003)
(Aug 2003)
(Aug 2003)
4.10 Distinguish between Type I and Type II superconductors. Discuss briefly the
BCS theory of superconductivity.
(Aug 2004)
4.11 Explain Type II superconductor. Discuss Maglev vehicle.
(Feb 2004)
Chapter 5
Semiconductors
5.1
A solid contains a large number of atoms packed closely together. Each atom, when
isolated, has a discrete set of electron energy levels 1s, 2s, 2p, 3s, 3p, 3d. . . . . . and so
on. Each of these energy levels can hold a finite number of electrons with different
magnetic and spin quantum numbers. A given atom has some of these levels filled and
some empty depending on the total number of electrons associated with the atom. For
example, an atom of carbon which has six electrons will have two electrons each in 1s,
2s, and 2p levels. However, 2p level can hold a total of six electrons out of which four
levels are empty. When atoms are brought nearer, as in the case of solids, the orbitals of
the outer electrons overlap indicating that these electrons have a sort of an interaction.
In a solid, because of the close packing of atoms the outer electrons from many atoms
interact. The result of these interactions is that these electrons cannot be considered to
belong to any particular atom. In other words, the electrons are no longer localized at a
particular atom and its wave function is altered due to the influence of the neighbouring
atoms. This results in shifting and splitting of energy levels into energy bands. The
effect of reducing the inter atomic distance in carbon is illustrated in Fig.5.1.
E
2p
10 01
00000000000
11111111111
00
11
11
00
00
11
2s
00
11
00
11
00
11
11
00
01 10
11111111111
00000000000
00
00
11
00 11
11
111111111111111111111111111s 01 01
00000000000000000000000000
r
r
111111
000000
0
Semiconductors
133
The discrete energy levels 2s and 2p will broaden into bands which overlap as the
inter atomic distance is reduced. With further decrease in the distance a split occurs
producing two bands with four electron states each per atom. The lower band containing four electron states occupied by four electrons is called the valence band and the
upper band containing four empty electron states is called the conduction band. At the
inter atomic distance equal to that observed in diamond crystal, the gap between the
valence and conduction bands is quite large and of the order of 6 eV. This gap is called
the forbidden energy gap as there are no permitted energy levels for occupation by
electrons. In diamond, all the energy levels in the valence band are full and the energy levels in the conduction band are empty at O K. Hence, diamond is an electrical
insulator (Fig.5.2a).
1111111111111 CB
0000000000000
0000000000000
1111111111111
0000000000000
1111111111111
1111111111111
0000000000000
0000000000000 CB
1111111111111
1.1eV
6eV
1111111111111
0000000000000
VB
0000000000000
1111111111111
0000000000000
1111111111111
(a)
1111111111111
0000000000000
0000000000000 VB
1111111111111
(b)
Figure 5.2 Energy band diagrams for (a) insulator and (b)semiconductor.
In silicon, the 3s and 3p bands overlap and result in valence and conduction bands
separated by a forbidden energy gap of 1.1.eV (Fig.5.2b). Germanium has a forbidden
energy gap of 0.7 eV. Silicon and germanium are classified as semiconductors because
at room temperature quite a few electrons can cross the forbidden energy gap and
contribute to electrical conductivity.
In the case of metals (or conductors), the outermost band containing valence electrons is either partially filled(as in Na, K, Al, etc) or overlaps with the next higher
band(as in Be, Mg, Ca, etc). For example, in sodium there is only one electron in the
3s state. Since the 3s state can accommodate two electrons, the band formed with these
electrons is said to be half-filled. In magnesium, the 3s state is full with two electrons.
However, 3p states overlap with 3s states making it a partially filled band. In either
case, the electrons can easily take part in electrical conduction as they need not cross a
forbidden energy gap (Fig.5.3).
Thus, materials can be classified as conductors, semi-conductors and insulators on
the basis of band theory of solids. The order of magnitude of electrical conductivity
134
Semiconductors
3p
1111111111111
0000000000000
0000000000000 3s
1111111111111
2p
0000000000000
1111111111111
0000000000000
1111111111111
0000000000000 2s
1111111111111
1111111111111 3s
0000000000000
0000000000000
1111111111111
0000000000000
1111111111111
0000000000000
1111111111111
0000000000000 2p
1111111111111
1111111111111 2s
0000000000000
1s
(a)
1s
(b)
Figure 5.3 Energy band diagrams for (a) sodium showing half-filled 3s band and (b)
magnesium showing 3s and 3p bands overlapping.
serves as an indicator for the classification. There are a few exceptions like conductors
with incomplete inner orbitals(like Sc, Mn, Zr, Re, Rh, etc) showing higher resistivities
and narrow band gap semiconductors(like InS b, Bi2T e3 , etc) with exceptionally high
conductivity. Hence, conductivity alone should not be the basis for classification of
materials under these categories. The temperature coefficient of resistivity helps us to
classify and distinguish between highly conducting semiconductors and less conducting metals. The positive value of temperature coefficient of resistivity for conductors
and negative value for semiconductors can be explained on the basis of band theory.
5.2
Intrinsic semiconductors
These are semiconducting materials which are chemically pure, structurally perfect
and stoichiometric in composition if compound. They have properties characteristic
of the material alone. The carriers contributing to electrical conduction are generated
due to the breaking of covalent bonds by thermal activation process. Pure germanium
or silicon are examples of intrinsic semiconductors. These materials have four valence
electrons and require four more to complete the subshell. Hence, they have a tendency
to form four covalent bonds with neighbouring atoms. This is shown schematically in
two dimensions in Fig.5.4.
It may be mentioned that the four covalent bonds are equally inclined to each other
in three dimensions and are referred to as tetrahedral bonding. The two dimensional
representation is used only for ease of explaining the generation and motion of carriers
in the semiconductor lattice.
Semiconductors
135
Figure 5.5 Breaking of covalent bond and generation of electron - hole pairs.
The movement of electrons in the valence band is conveniently explained as equivalent to the movement of holes. Since the direction of movement of these holes is
opposite to that of electrons, holes behave as if they are positively charged. Thus,
breaking of each covalent bond results in the formation of an electron-hole pair, and
they contribute to the electrical conductivity of the material. Since the electron and the
hole are created together in pair, the number of electrons in the conduction band will
be equal to the number of holes in the valence band at any given temperature.
136
Semiconductors
1
1 + exp {(E E F )/kT }
(5.1)
T>0 K
T=0 K
EF
0.5
F(E)
Semiconductors
137
empty. It can be shown that for an intrinsic semiconductor, the Fermi level lies at the
centre of the band gap and remains invariant with temperature. We notice that the
Fermi factor has a non-vanishing value even for the energy values corresponding to
the forbidden energy gap. However, no carriers are present in the forbidden gap since
there are no sites available for occupation by the charge carriers.
5.2.3 Conductivity of an intrinsic semiconductor
When an electric field is applied to an intrinsic semiconductor, there will be movement
of electrons in the conduction band and of holes in the valence band. Hence, there will
be two currents, an electron current in the conduction band and a hole current in the
valence band. As these currents are constituted by oppositely charged carriers and are
in opposite directions, the net current will be sum of the two currents.
If ve represents the drift velocity of electrons due to the applied field E, the current
density due to electrons is given by
Je = neve
(5.2)
where n is the number of conduction electrons available per unit volume and e is the
charge associated with these electrons (compare this with equation (3.9) for the case
of a metal). Similarly, the current density due to holes is given by
Jh = pevh
(5.3)
(5.4)
since the number of conduction electrons will be equal to the number of holes in an
intrinsic semiconductor. Further,it is observed that the drift velocity is directly proportional to the applied field E.
i.e.
or
v = E
(5.5)
where is called the mobility of carriers. Hence the above equation becomes
J = neE(e + h )
(5.6)
where e and h are the the electron and hole mobilities respectively. The conductivity,
which is defined as the current density per applied field, is given by
= ne(e + h )
(5.7)
138
Semiconductors
(5.8)
where E g is the energy gap of the semiconductor and k1 is a constant. The mobility of
electrons and holes generally depend on temperature as
= k2 T 3/2
(5.9)
where k2 is a constant. It may be mentioned that equation (5.9) is valid for a case where
the scattering of charge carriers is predominantly by phonons rather than ionised impurities. Substituting from equations (5.8) and (5.9) in equation (5.7), the conductivity
can be expressed as
= k1 k2 e exp (E g /2kT )
= K exp (E g /2kT )
(5.10)
where K = k1 k2 e
A plot of ln versus (1/T ) will be a straight line (Fig.5.7) with a slope equal to
(E g /2k). Hence, the band gap energy of the semiconductor can be calculated.
5.3
Extrinsic semiconductors
Pure or intrinsic semiconductors are of theoretical importance because of their low conductivity. The electrical conductivity of a semiconductor is very sensitive to the purity
of the material. Addition of small amounts of impurities results in drastic changes
Semiconductors
139
ln
(1 / T)
1111111111111
0000000000000
0000000000000CB
1111111111111
DL
1111111111111
0000000000000
0000000000000VB
1111111111111
Figure 5.8 Effect of adding impurity atoms of Group V showing free electron and the
donor energy level.
140
Semiconductors
The impurity atom occupies one of the lattice sites and since the sizes of the atoms
are not much different, the crystal is not distorted. However, each phosphorus atom
brings five valence electrons with it whereas only four are required to form covalent
bonds with the neighbouring silicon atoms. The fifth electron will be loosely bound
to the parent atom due to the high permittivity of silicon. The energy state of this
electron is represented by an energy level just below the bottom of the conduction band
(Fig.5.8) since the electron has a very small binding energy of the order of 0.05 eV.
The electron can be easily released into the conduction band by means of thermal
excitation. Hence, phosphorus atom or any group V element is referred to as a donor
since it contributes additional electron for conduction. The semiconductor is said to be
n-type semiconductor.
When a semiconductor is doped with a group III element, the impurity atom will
have three valence electrons but occupies the lattice position of a tetravalent semiconductor atom (Fig.5.9). It will have a tendency to capture an electron to complete the
fourth covalent bond. Hence the impurity is called an acceptor.
1111111111111
0000000000000
0000000000000CB
1111111111111
AL
1111111111111
0000000000000
0000000000000VB
1111111111111
Figure 5.9 Effect of adding impurity atom of Group III showing a hole and the acceptor
energy level.
Acceptor levels lie close to the top of the valence band and capture electrons from
the covalent bonds thereby generating holes in the valence band. The captured electrons are not available for conduction and the conductivity is due to the movement of
holes in the valence band. The semiconductor is called p-type semiconductor.
In case of compound semiconductors, a perfectly stoichiometric compound will
be intrinsic in nature. Any deviation from stoichiometry will make the compound
extrinsic. For example, semiconducting compound ZnS will be intrinsic in nature if
the compound is stoichiometric, i.e., the atomic composition of Zn and S is 50:50. Any
deviation from this composition, will result in an excess concentration of one of the
components over the stoichiometry and gives the compound extrinsic nature. Hence,
by controlling the composition of the compound, it is possible to obtain intrinsic as
well as extrinsic (n-type and p-type) semiconductors.
Semiconductors
141
(5.11)
(5.12)
where Nd represents the donor concentration in the semiconductor. Since the donor
concentration is quite high compared to the number of covalent bonds broken at room
temperature, the second term in the conductivity equation may be neglected
n Nd ee
(5.13)
142
Semiconductors
(5.14)
(a)
(b)
11111111111
00000000000
00000000000
11111111111
11111111111
00000000000
00000000000
11111111111
CB
11111111111
00000000000
00000000000 DL
11111111111
FL
11111111111
00000000000
00000000000
11111111111
T=0K
11111111111
00000000000
00000000000
11111111111
T = 200 K
VB
11111111111
00000000000
00000000000
11111111111
T = 400 K
11111111111
00000000000
00000000000
11111111111
11111111111
00000000000
00000000000
11111111111
CB
11111111111
00000000000
00000000000
11111111111
11111111111
00000000000
00000000000
11111111111
T=0K
11111111111
00000000000
00000000000
11111111111
T = 200 K
VB
11111111111
00000000000
00000000000
11111111111
T = 400 K
FL
AL
Figure 5.10 Effect of temperature on the Fermi level in (a) n-type semiconductor and
(b) p-type semiconductor.
However, as the temperature is increased, more and more covalent bonds will be broken thereby increasing the minority carrier concentration. The effect of temperature is
illustrated by the energy band diagrams shown in Fig.5.10. In an n-type semiconductor, at 0 K, all the donor levels will be neutral; i.e., no electrons are contributed to the
conduction band by the impurity atoms. The Fermi level, which represents the energy
level for which the probability of occupation is half, lies mid-way between the impurity
level and the bottom of the conduction band. As the temperature is increased, the donor
levels will be ionised to a greater extent as compared to breaking of covalent bonds.
At elevated temperatures, when most of the donor levels are ionised, the carrier generation due to breaking of covalent bonds will dominate the variation of conductivity
Semiconductors
143
ln
(1 / T)
144
Semiconductors
(5.16)
(5.17)
where J is the current density and vd is the drift velocity at an applied electric field E.
The quantity (vd /E) is a constant specific to a particular type of carrier and is called
the mobility of the carrier. Since we have both electrons and holes contributing to the
conductivity,
= nee + peh
(5.7)
It may be mentioned here that the mobility of holes is usually less than the mobility
of electrons since the mobility of holes is nothing but the mobility of electrons in the
Semiconductors
145
valence band. The mobility is dependent on the scattering processes in the semiconductor. The effect of scattering by the atomic vibrations is different from the effect of
ionised impurity atoms on the mobility of the carriers. The temperature dependence of
mobility is a combination of the effect of temperature on the two scattering processes.
A comparison of the important characteristics of intrinsic and extrinsic semiconductors is given in Table 5.1.
Table 5.1 Comparison between intrinsic and extrinsic semiconductors
Intrinsic semiconductors
Extrinsic semiconductor
1. It is a chemically pure, structurally It is an impure material, obtained by
perfect and if compound, stoichio- adding a group III or group V elemetric material.
ment as dopant into an intrinsic semiconductor or structurally imperfect or
nonstoichiometric, if compound.
2. It behaves as an insulator at 0 K
It behaves as an insulator at 0 K.
6. Fermi level lies at the centre of the forbidden energy gap at all temperatures
146
5.4
Semiconductors
At a given temperature, electron-hole pairs will be continually generated due to breaking of covalent bonds. This is equivalent to the excitation of electrons from the valence
band to the conduction band thereby creating two mobile charge carriers, electron in
the conduction band and hole in the valence band. At the same time, the reverse process of recombination is also taking place in which electron recombines with a hole
annihilating each other (Fig.5.12).
CB
VB
Figure 5.12 Energy level diagram showing the processes of generation and recombination of electron hole pairs.
The direct transition of an electron from the conduction band to the valence band
will have to satisfy the conservation laws of energy and momentum. Since the probability that an electron and hole have an equal and opposite momentum is very less,
a direct recombination is a rare event. Most of the recombinations take place with
the help of an impurity atom or a lattice defect. These sites where a recombination is
favoured are called traps or Recombination Centres. In a semiconductor, these
levels are located in the forbidden energy gap.
5.5
The energy band diagrams which we have used to describe semiconductors are too
simplified and are not sufficient to explain the behaviour of semiconductors. For example, the recombination process is dependent both on the presence of electron-hole
pairs but also requires the conservation of momentum. The energy and momentum distribution of carriers may be explained using band diagrams in k-space. These are called
E-k diagrams. In the reduced zone scheme, k represents the propagation vector and is
related to the momentum of the carrier. It is observed that the E k diagrams will be
different for different directions of k, i.e the direction of travel of the carrier. Fig.5.13
shows typical E k diagrams. The forbidden energy gap for this semiconductor will
Semiconductors
147
E
E
CB
CB
VB
(a)
VB
(b)
Figure 5.13 E-k diagrams for (a) direct band gap semiconductor and (b) indirect band
gap semiconductor.
be the energy difference between the valence band maximum and the conduction band
minimum.
If these two extreme points lie at the same k-value, say k = 0 as in Fig.5.13(a),
the semiconductor is said to be a direct band gap semiconductor. If the valence and
conduction band edges do not lie at the same value of k (Fig.5.13b), the semiconductor
is called indirect band gap semiconductor. Silicon and germanium are indirect band
gap semiconductors whereas gallium arsenide, cadmium selenide, cadmium telluride
etc are examples of direct band gap semiconductors. In case of a direct band gap semiconductor, when an energy equal to the band gap energy is supplied, a direct transition
of an electron from the valence band maximum to the conduction band minimum can
occur. However, in case of an indirect band gap semiconductor, in addition to the energy requirement, a transition can occur only with the assistance of a phonon (lattice
vibration). Thus, we see that in case of a direct band gap semiconductor, the energy and
momentum conservation requirement for the transition are met in the interaction of an
electron and a photon whereas in case of an indirect band gap semiconductor, an interaction of an electron with a photon is assisted by a phonon to satisfy the conservation
conditions.
The band gap of a semiconductor can be deduced from optical absorption studies.
Such studies also give us information whether the band gap is a direct one or an indirect one. When radiation of suitable energy is incident on a semiconductor sample of
thickness x, the intensity of radiation transmitted is given by
I = I exp (x)
(5.18)
148
Semiconductors
where I is the incident intensity and is the absorption coefficient. In a typical plot
of the variation of with the energy of the incident radiation, we observe an increase
in the absorption coefficient near the band gap energy. In case of a direct band gap
semiconductor, the increase at E g is very sharp and in case of an indirect band gap
semiconductor the increase will be less steep (Fig.5.14).
This is due to the fact that in case of indirect band gap semiconductors, transitions
of electrons from valence band to the conduction band requires the assistance of a
phonon in order to achieve the momentum conservation.
The major differences between direct and indirect band gap semiconductors are
summarized in Table 5.2.
(a)
(b)
Figure 5.14 Variation of absorption coefficient as a function of energy for (a) indirect
band gap semiconductor and (b) direct band gap semiconductor.
Table 5.2 Distinction between direct band gap and indirect band gap semiconductor.
Direct band gap semiconductor
1. The valence band maximum and the
conduction band minimum are at the
same value of propagation vector
Semiconductors
2. Transition between valence band and
conduction band involves an electron
and a photon of energy equal to the
band gap energy.
149
Transition between valence band and
conduction band involving an electron and a photon of energy equal to
the band gap energy is assisted by a
phonon to conserve momentum.
150
Semiconductors
Property
Atomic number
Germanium
32
Silicon
14
Gallium Arsenide
2.
Atomic weight
72.6
28.1
144.6
3.
Crystal structure
Diamond
Diamond
Zinc blende
4.
Lattice parameter, nm
0.565
0.543
0.565
5.
Band gap, eV
0.67
(indirect)
1.1
(indirect)
1.43
(direct)
6.
Intrinsic resistivity,
m (300 K)
45
2300
3.7 106
2.5 1019
1.5 1016
1.4 109
0.39
0.14
0.85
0.19
0.05
0.045
16
11.8
11.1
958
1412
1240
70
150
350
7.
8.
9.
10
Intrinsic carrier
concentration,
m-3 (300K)
Electron mobility,
m2 v1 s1
Hole mobility
m2 v1 s1
dielectric
constant.
Semiconductors
151
Ga
As
(a)
(b)
Figure 5.15 Unit cell of (a) germanium and silicon, and (b) gallium arsenide.
and Ge have 8 atoms per unit cell where as GaAs has four Ga and four As atom per
unit cell. Because of the identical crystal structure and the lattice parameter all the
three semiconductors have an identical number of atoms per unit volume. However,
the intrinsic carrier concentration and hence the intrinsic resistivity which are functions
of the band gap energy vary considerably. Germanium has the lowest value of band
gap and hence the highest value of intrinsic carrier concentration with an associated
lowest intrinsic resistivity. This low value of intrinsic resistivity leads to large leakage
currents in reverse biased p-n junctions which may not be desirable in some device
applications. Further, germanium has the lowest transition temperature at which the
material looses its extrinsic behaviour and becomes practically an intrinsic material.
Hence, p-n junction devices fabricated using germanium can not be used above this
transition temperature of 70C. On the other hand, silicon proves to be a very useful
material for many device applications. Gallium arsenide is preferred over silicon in
some applications like light emitting diodes, Gunn diodes, etc.
5.6
Hall effect
152
Semiconductors
Face 2 (+)
t
l
I
b
Face 1 ()
(5.19)
(5.20)
For a given semiconductor, the Hall coefficient RH is defined as the Hall field generated
in the material per unit magnetic field applied when the current density is unity.
i.e.,
RH = E H /BJ
(5.21)
The current density J can be expressed in terms of concentration of carriers n and their
drift velocity as
J = nev
(5.22)
From equation (5.20) and (5.22) we get
RH = Bv/Bnev = 1/ne
(5.23)
Semiconductors
153
(5.24)
since the Hall field developed is in negative Y- direction. For a p-type semiconductor,
RH will be positive and is given by
RH = 1/pe
(5.25)
Since all the three quantities E H , B and J can be measured, the Hall coefficient and
the carrier concentration can be found out.
Equations (5.23) and (5.24) are valid for the case of free electrons moving with
steady drift velocity. However, in practice, assuming average drift velocity for carriers,
the Hall coefficient is given by
3 1
RH =
(5.26)
(5.27)
(5.28)
8
p RH
3
(5.29)
(5.30)
154
Semiconductors
where VH is the Hall voltage generated across the breadth and I is the current flowing
along the length. Substituting these values in equation (5.30), we get
n=
3 BI
8 etVH
(5.31)
Here, n represents the number of charge carriers per unit volume of the semiconductor and the negative sign signifies that the carriers are electrons.
By maintaining current I constant, the variation of Hall voltage VH with magnetic
field is studied. The graph of VH versus B will be a straight line (Fig. 5.17). The carrier
concentration can be determined as
n=
1
3 I
8 et slope
(5.32)
The experiment may also be repeated by studying the variation of Hall voltage with
current I at a constant magnetic field.
VH
Figure 5.17 Variation of Hall voltage with the applied magnetic field.
5.6.2 Hall effect in intrinsic semiconductors
In the above discussion of Hall effect in extrinsic semiconductors, the motion of majority carriers is analysed neglecting that of minority carriers. In an intrinsic semiconductor, the number of electrons and holes will be same and hence the motion of both
Semiconductors
155
type of carriers is to be considered. The expression for the current density J used in
equation (5.22) has to be modified for the intrinsic semiconductor as
or
J = neve + pevh
J = (nee + peh )E x
(5.33)
where ve and vh are the drift velocities of electrons and holes respectively due to the
applied field E x . This yields an expression for RH as
RH =
1 p2h n2e
e (ph + ne )2
(5.34)
Even though the number of electrons is equal to the number of holes in an intrinsic
semiconductor, since the mobility of electrons is greater than that of holes, the Hall
coefficient will be negative.
5.6.3 Applications of Hall effect
Hall effect can be used to identify the semiconductor type, namely n-type or p-type.
It is useful in evaluating the carrier concentration. Knowing the conductivity, it is
possible to find the mobility of the carriers. Using a standard sample of known Hall
coefficient, it is possible to measure magnetic fields. This is the principle of construction of a gauss meter.
5.7
p-n Junction
The p-n junctions are among the most simple semiconductor devices. They are useful
devices in the electronic circuits to perform functions like rectification, amplification,
switching, etc. A p-n junction may be obtained by different methods and the junctions
are known by the methods of fabrication. They are:
(a)
(b)
(c)
(d)
(e)
Grown-in-junction
Alloy junction
Diffused junction
Ion implanted junction
Deposited junction.
A grown in junction is obtained by abruptly changing the dopant during crystal growth process from melt. Initially the melt may contain n-type impurities in the
semiconductor melt. At some stage of growth, p-type impurities are added in excess to
156
Semiconductors
compensate for the n-type impurities already present so that a net p-type character is
achieved. The process is called counterdoping and the junction is known as grownin junction. Another technique for obtaining p-n junction is alloying a semiconductor
with a metal acting as an opposite type of dopant. For example, an n-type germanium
sample may be alloyed with indium by placing a pellet of indium metal on it and melting it. This results in a region of germanium rich in indium giving it a p-type character.
The most common technique for forming p-n junctions is by impurity diffusion. It
is achieved by selective masking and exposing semiconductor regions to vapours of
impurities at elevated temperatures. If a wafer of n-type silicon is placed in a furnace
at 10000C in a gaseous atmosphere containing a high concentration of boron atoms,
the boron atoms diffuse into the wafer. If the concentration of the diffused boron atoms
is higher than the concentration of the n-type impurities in the silicon, the top layer
becomes p-type and hence the p-n junction is obtained. The junction obtained will be
a graded one with the depth and concentration of doping depending on the temperature
and duration of diffusion process. Ion implantation is a technique in which the impurity ions are accelerated and made to incident on semiconductor surfaces. With this
technique, it is possible to control the depth of the junction and also obtain abrupt junctions. Another technique widely used in integrated circuit fabrication is the deposition
of thin films of semiconductor materials with the required impurity and concentration
to obtain sharp junctions.
Junctions may be abrupt or graded. An abrupt junction, also known as a step
junction is one in which the acceptor concentration and the donor concentration are
constant and extend upto the junction. In a graded junction, the impurity concentration varies with distance from the junction in some manner. Both types of junctions
are used for different applications.
5.7.1 Unbiased p-n junction
Consider a junction between an n-type and p-type semiconductor which has just been
formed. The n-type semiconductor contains some of the semiconductor atoms replaced
by donor atoms of group V. Similarly, the p-type semiconductor contains some of the
semiconductor atoms replaced by acceptor atoms belonging to group III. The concentration of the donor atoms may be different from the concentration of acceptor atoms
in a given junction.
Fig. 5.18 shows a p-n junction so formed where only the impurity atoms are shown.
A n-type impurity atom is shown to have an additional electron weakly attached to
it. Similarly, p-type impurity atoms are associated with holes. When the junction is
formed, the electrons from the n-type region diffuse into the p-type region because of
Semiconductors
157
p
10
01
01
(a)
10
01
01
10
01
01
10
01
01
10
01
01
Group V atoms
with electrons
10
01
01
(b)
Figure 5.18 Formation of a p-n junction (a) before charge transfer and (b) after charge
transfer.
the large concentration of electrons on the n-side and the lack of it on the p-side. Hence,
the electrons from the donor atoms close to the junction cross the junction to combine
with the holes on the p-side. When the electrons leave the donor atoms, they leave
behind positively charged donor ions. Similarly, the acceptor atoms become negatively
charged acceptor ions when the electrons recombine with the holes. Consequently
there is positively charged region of immobile donor ions close to the junction on the
n-side and a negatively charged region of immobile acceptor ions on the p-side. These
are referred to as space charge regions. They are also called charge depletion regions
since there are no mobile charges in these regions. However, such diffusion of carriers
stops when the space charge produced is sufficient to prevent the carriers from crossing
over. The potential existing due to the space charge regions at equilibrium is called the
barrier potential.
Fig.5.19 shows the variation of impurity concentration, charge density and potential across the junction. Since the total charge density in the space charge region on
either sides of the junction is equal, we have
Nd Xn = Na X p
(5.35)
158
Semiconductors
p
1010
Nd
10
10
1010
d
10
10N a
10
10
10
eN 1
d 0
10
1010
Xp
Xn
11111111111111111111111111111111111111111111111111
00000000000000000000000000000000000000000000000000
100
d
1
10 e N
1010 a
10
100 V
10
110
V
Xp
10
Xn
111111111111111111111111111111111111111111111111111
000000000000000000000000000000000000000000000000000
100
d
V
10
110
100
1
(a)
(b)
(c)
Figure 5.19 Variation of (a) impurity concentration (b) space charge and (c) potential
as a function of distance for an abrupt p-n junction.
where Nd and Na represent the concentration of impurities in the n-type and p-type
materials, Xn and X p are the widths of the charge depletion regions on the two sides.
This is called the charge neutrality condition. Thus, we see that if the concentration
of impurities is higher on any side, the width of the depletion region on that side will be
proportionately smaller. The effective width of the junction is the sum of the depletion
widths on the two sides. In this region of the junction, the bulk properties of the p-type
and n-type materials are not valid.
Figure 5.20 shows the energy level diagrams for the p-n junction before and after
the diffusion of carriers. When the equilibrium is established after the transfer of carriers, the Fermi levels on the n-side and p-side will be levelled since the Fermi level
Semiconductors
159
p
11111111111111
00000000000000
00000000000000
11111111111111
11111111111111
00000000000000
00000000000000
11111111111111
EF
p
EF
11111111111111
00000000000000
00000000000000
11111111111111
11111111111111
00000000000000
00000000000000
11111111111111
(a)
11111111111111
00000000000000
00000000000000
11111111111111
11111111111111
00000000000000
00000000000000
11111111111111
eVB
11111111111111
00000000000000
00000000000000
11111111111111
11111111111111
00000000000000
00000000000000
11111111111111
00000000000000
11111111111111
(b)
Figure 5.20 Energy level diagrams for a p-n junction (a) before the formation of the
junction and (b) after the formation of the junction.
represents the energy level where the probability of occupation is half and is same on
either sides of the junction close to it. As a result, the energy bands on the p-type side
will be at a higher energy level as compared to those on the n-side. An electron at the
bottom of the conduction band on the n-side has to acquire an energy equal to eVB to
cross over to the p-side, where VB represents the barrier potential. It can be shown that
the barrier potential is related to the concentration of impurities on the two sides of the
junction and is given by
kT
Na Nd
VB =
loge 2
(5.36)
e
ni
The width of the charge depletion regions is also a function of the impurity concentration and are given by
#1/2
2 VB
Xp =
eNa (1 + Na /Nd )
#1/2
"
2 VB
and Xn =
eNd (1 + Nd /Na )
"
(5.37)
(5.38)
160
Semiconductors
Semiconductors
161
Figure 5.21 The energy level diagram for forward biased p-n junction.
The junction is said to be reverse biased when the p-type region is made negative
with respect to the n-type region (Fig.5.22a). The applied voltage now adds to the
barrier potential to present an increased barrier (VB + V) to the diffusion of majority
carriers (Fig.5.22b). If the reverse bias is large enough, the majority carriers will be
prevented from moving across the junction. The minority carrier current of electrons
from p-type to n-type and of holes from n-type to p-type will now be constituting the
total current across the junction. Since the drift current is independent of the barrier
height, this reverse current is called the reverse saturation current. Thus, the p-n junction offers high resistance to the flow of current when reverse biased. The voltagecurrent characteristics of a p-n junction under forward and reverse bias conditions is
shown in Fig.5.23. The total current across the junction is given by the diode equation,
Figure 5.22 The energy level diagram for reverse biased p-n junction.
I = Io [exp (eV/kT ) 1]
(5.39)
where V is the applied bias.
162
Semiconductors
I
(5.41)
(5.42)
Semiconductors
163
(5.43)
where n is called the ideality factor. The value of ideality factor is close to one for
germanium diode but is close to 2 for silicon diode under small current operation.
5.7.3 Incremental junction capacitance
The expression for the depletion width on the two sides of the p-n junction under biased
condition may be written as
#1/2
2 (VB V)
Xp =
eNa (1 + Na /Nd )
#1/2
"
2 (VB V)
and Xn =
eNd (1 + Nd /Na )
"
e Na Nd
C=
2(Na + Nd )
=
(Xn + X p )
#1/2
V 1/2
Thus, we see that for an abrupt junction, the capacitance varies as the square root of
the reverse bias Vr . In a graded junction, the capacitance can be written in the form
C Vrn
164
Semiconductors
The value of the exponent n is equal to (1/3) for a linearly graded junction. Thus, the
voltage sensitivity of capacitance is greater for an abrupt junction than for a linearly
graded junction. By choosing various doping profiles, it is possible to control the
dependence of C on Vr .
N
m=1
Na
m=0
m = 3/2
P
Semiconductors
165
the current begins to increase very rapidly (Fig.5.23). The reverse biased p-n junction
which was offering a high resistance to the flow of current suddenly starts conducting
heavily. This phenomenon in which the current increases very rapidly at a particular
reverse voltage is called breakdown. The mechanism of breakdown is due to impact
ionisation of lattice atoms in the depletion region by the energetic minority carriers.
For example, an electron from the p-side may acquire enough kinetic energy from the
applied electric field to cause ionising collision with lattice atoms. This results in the
breaking of covalent bonds leading to the generation of electron-hole pairs. The carriers so generated are further accelerated by the applied field and may cause ionsiation
events thereby resulting in an avalanche process and hence a large current. This type
of breakdown is called Avalanche breakdown and is normally observed in lightly
doped p-n junctions.
5.8
Zener diode
A zener diode is a heavily doped p-n junction. When the concentration of impurities in
the p and n type semiconductors used for the junction is high, the width of the charge
depletion region reduces. The voltage-current characteristics of a zener diode is identical to that of an avalanche diode with a forward region where the current increases
exponentially with voltage, a reverse region where the current shows a saturation and
a breakdown region. However, the mechanism of breakdown in a zener diode is distinctly different.
5.8.1 Zener breakdown mechanism
The generation of a large current at breakdown is due to a quantum mechanical phenomenon called tunneling. Tunneling can take place when the following two conditions are satisfied.
(i) The width of the depletion region is very small.
(ii) Under the biased condition, there are vacant sites in the conduction band of nside at the same energy level where valence electrons are present in the valence
band of p-side.
Application of reverse bias will result in the lowering of all energy levels on the
n-side (Fig.5.25). At the breakdown voltage, the bottom of the conduction band on
the n-side will be just below the top of the valence band on the p-side. Since the
barrier width is very small, electrons will tunnel from the valence band of p-side to
166
Semiconductors
p
111111111
000000000
000000000
111111111
CB
EF
111111111
VB000000000
000000000
111111111
111111111
000000000
000000000CB
111111111
n
EF
111111111
000000000
000000000
111111111
000000000VB
111111111
Figure 5.25 Energy level diagram for a Zener diode operating in the breakdown region.
the conduction band of n-side. In other words, zener breakdown can be considered
to be equivalent to field ionsiation where the applied field is large enough to pull the
electrons out of the covalent bonds and accelerate them to the n-side of the junction.
An electric field of the order of 106 V/cm is required for zener breakdown. However,
such a high field is realised even at low voltages (usually below 6V) since the width of
the depletion region is very small. If the breakdown occurs at a higher voltage (above
6V) the mechanism is avalanche breakdown. The avalanche breakdown voltage is very
stable and is useful as a reference voltage in electronic circuits. The p-n junction diodes
used for this purpose are called zener diodes even though they are avalanche diodes.
5.8.2 Identification of breakdown mechanism in a p-n junction
The temperature coefficient of breakdown voltage is helpful in identifying the mechanism of breakdown in a given diode. If the breakdown voltage is below 6V, the mechanism is zener breakdown and the temperature coefficient of breakdown voltage is
negative. In other words, the breakdown voltage decreases as the temperature is increased. This is because an increase in temperature increases the energy of the valence
electrons and hence lesser voltage is sufficient to pull them out of the covalent bonds. If
the breakdown voltage is above 6V, the breakdown is due to avalanche mechanism and
the temperature coefficient of breakdown voltage is positive. The breakdown voltage
increases with an increase in temperature. This is due to the fact that the vibrational
motion of the lattice atoms increases with temperature thereby increasing the probability of the charge carriers being scattered due to collisions. Thus, the charge carriers
have lesser probability gaining sufficient energy to start the avalanche multiplication.
Semiconductors
167
The distinction between avalanche diodes and zener diodes is summarized in Table 5.4.
Table 5.4 Distinction between avalanche diodes and zener diodes.
Avalanche Diodes
Zener Diodes
1. The doping level is low. The concen- The doping level is high. The concentration of impurity atoms is usually of tration of impurity atoms is of the orthe order of 1018 to 1020 m3 in silicon. der of 1022 to 1024 m3 in silicon.
2. The width of the depletion region (or
space charge region) is quite large and
is of the order of 106 m in silicon.
5.9
168
Semiconductors
(Fig.5.26). However, the output will be pulsating and far from being a substitute for
the d.c power supply. The out put may be made more smooth using filters. Capacitors
are normally used as filters in rectifier circuits.
D1
RL
D2
RL
Vi
Semiconductors
169
+
VZ1
+
Vi
(VZ1 + VZ2 )
VZ2
VZ
Vz
VZ
Vz
(a)
(b)
(c)
Figure 5.29 Use of zener diode for squarewave generation. (a) input wave form (b)
circuit diagram (c) output wave form.
5.9.3 Photo diode
When bulk semiconductors are irradiated by light of energy higher than the energy
band gap of the semiconductors, they show change in their electrical conductivity.
This is termed as photoconductivity and the excess conductivity is proportional to the
optical absorption rate. Thus, semiconductors can be used to detect and measure incident photon intensity through a measurement of the electrical conductivity. However,
junction devices show a better sensitivity and speed of response as detectors. A photodiode is basically a reverse biased p-n junction embedded in clear plastic with all
sides painted black except a small aperture through which light is concentrated on to
170
Semiconductors
0
L1
L2
L3
Semiconductors
171
The advantages of using a reverse biased p-n junction as a photo detector are as
follows:
(i) The reverse saturation current is almost independent of the applied reverse bias
voltage and hence variations in this voltage will not affect the measurements.
(ii) The change in the magnitude of the reverse saturation current due to incident
light is linearly related to the intensity of radiation. The dark current, which is
normally very small at low temperatures, may even be neglected.
(iii) The electron-hole pairs produced in the depletion region due to illumination are
separated by the field existing at the junction and the recombination probability
is reduced. The reverse bias condition results in wider depletion regions ensuring
complete collection of photo generated carriers.
Applications of photodiodes include detection and measurement of radiations including infrared, visible, laser, x-ray, -rays etc. They are used in high speed counting
applications like reading punched cards and tapes for computers. The are used in high
speed switching applications to convert time varying optical signals to electrical signals. They are also used in reading optical tracks of recorded sound, light operated
switches, burglar alarms, etc.
5.9.4 Photovoltaic effect and solar cell
It is observed that in case of a photodiode, a constant reverse saturation current flows
across the junction for large reverse voltages. As the reverse voltage is reduced, the
potential barrier existing at the junction is also reduced. This increases the probability
of majority carriers diffusing across the junction. This flow of majority carriers reduces the reverse saturation current as the reverse voltage approaches zero. However,
under illuminated condition, the reverse saturation current does not reduce to zero even
when the applied voltage goes to zero (Fig.5.31). In fact, the reverse current can be
reduced to zero only with an applied forward bias. As the forward bias is increased, a
point is reached when the majority carrier current in the forward direction just equals
the minority carrier current in the reverse direction. The net current becomes zero.
This forward voltage at which the net current becomes zero is called the photovoltaic
potential. Under open circuit condition (i.e., when no load is connected across the
p-n junction), the net current is zero and hence, the photovoltaic potential or the open
circuit photovoltage, Voc , is observed across the p-n junction. The reverse current observed when no bias is applied is called the short circuit photo current Isc (Fig.5.32).
172
Semiconductors
0
VOC
I SC
1
Semiconductors
173
3. The thickness of the n and p regions should be small compared to the diffusion
lengths of the minority carriers in the respective regions to reduce recombination.
4. The series resistance of the device should be small so that the power is not lost
due to ohmic losses in the device itself.
5. It is desirable to have a large contact potential to have large photovoltage. Open
circuit photovoltage is less than the contact potential which in turn is less than
the energy band gap equivalent of the material. The contact potential can be
increased by suitable doping of n and p regions. But, heavy doping reduces the
carrier lifetime, which in turn reduces diffusion length of carriers.
Antireflection
coating
Light
Top contact
00
11
11
00
00
11
00
pSi 11
nSi
(a)
1111111111111
0000000000000
0000000000000
1111111111111
Ohmic contact
1111111111111
0000000000000
0000000000000
1111111111111
0000000000000
1111111111111
0000000000000
1111111111111
0000000000000
1111111111111
0000000000000
1111111111111
0000000000000
1111111111111
0000000000000
1111111111111
0000000000000
1111111111111
0000000000000
1111111111111
0000000000000
1111111111111
0000000000000
1111111111111
0000000000000
1111111111111
0000000000000
1111111111111
(b)
Figure 5.33 Construction of a solar cell. (a) cross-sectional view b) top view.
Fig. 5.33 shows the construction of a solar cell. The p-n junction has appropriate
thickness of p and n regions. The p- region is made thin enough to allow sufficient
number of photons to reach the junction. An antireflection coating is given on top
of the p- region to reduce possible reflection of the incident radiation. The top metal
contact is made in the form of a grid to ensure effective collection of carriers.
The conversion efficiency of a solar cell is defined as
=
(5.44)
174
Semiconductors
The efficiency of a solar cell is also expressed in terms of quantum efficiency which
is defined as
number of electron-hole pairs generated
(5.44)
=
number of photons absorbed
A well made silicon solar cell can have a conversion efficiency of 15% ( = 0.15)
and can provide about 150 W of electrical power per m2 of cell area.
In space applications, solar cells are used for satellite power supply. In terrestrial
application, they are used in earth communication equipments, for lighting, heating,
water pumping, etc.
5.9.5 Light emitting diode
These are p-n junctions which emit visible light when forward biased (Fig.5.34).
Semiconductors
175
Numerical Examples
5.1 Calculate the conductivity of silicon doped with 1021 atoms m3 of boron if the
mobility of holes is 0.048m2 v1 s1 .
Solution: Conductivity of the extrinsic semiconductor with an acceptor concentration of Na is given by
= Na eh
= 1021 1.6 1019 0.048
5.2 Calculate the resistivity of intrinsic germanium if the intrinsic carrier density is
2.5 1019 m3 assuming electron and hole mobilities of 0.38 and 0.18 m2 v1 s1
respectively.
Solution: Resistivity is given by
=
1
1
=
ne(e + h)
176
Semiconductors
1
2.5
1.6 1019 (0.38 + 0.18)
= 0.45 ohm m (Ans.).
=
1019
5.3 Calculate the electron and hole densities in n-Si doped with 1019 donors m3 if
the intrinsic carrier concentration in silicon is 1.4 1016 m3 .
Solution: Density of electrons,
n = 1019 + 1.4 1016 1019 m3
Density of holes,
p = n2i /n
where ni is the intrinsic concentration.
(1.4 1016 )2
1019
= 1.96 1013 m3 (Ans.).
p=
5.4 Calculate the conductivity of germanium with the given data. What is the effect
of doping germanium with donor - type impurity to the extent of one atom per
108 germanium atoms?
Given data: Intrinsic carrier concentration in germanium at 300 K,
ni = 2.4 1019 m3
Electron mobility,e = 0.39m2v1 s1
Hole mobility,h = 0.19m2v1 s1
Number of atoms/m3 = 4.4 1028
= 2.23 ohm1 m 1
Semiconductors
177
3 BI
8 net
178
Semiconductors
3 3.14 0.2 100 103
8 1023 1.6 1019 1.2 104
VH = 0.0123 Volt or 12.3 mV (Ans.).
=
5.7 Calculate the barrier potential existing at the junction of p-type silicon with an
acceptor concentration of 1023 m3 and n-type silicon with a donor concentration
of 1020 m3 if the intrinsic carrier concentration at 300 K is 1.4 1016 m3 .
Solution: The barrier potential is given by
VB =
kT Na Nd
ln 2
e
n1
5.8 Intrinsic silicon has a carrier concentration of 1.1 1016 m3 . If the mobilities of
electrons and holes are 0.17 and 0.035m2v1 s1 respectively at room temperature,
compute the resistivity of silicon.
Solution: Conductivity is given by
i = ni e(e + h )
= 1.1 1016 1.6 1019 (0.17 + 0.035)
Semiconductors
179
5.11 The resistivity of germanium at 20C is 0.5 ohmm. What will be its resistivity at
40C if the band gap of germanium is 0.7 eV?
Solution: We have the relation,
(ln) E g
=
(l/T ) 2k
(ln) = E g .(1/T )/2k
5.12 sample of silicon is doped with 1022 phosphorus atoms m3 . What would you
expect to measure for its resistivity? What Hall voltage would you expect in
a sample 100 m thick when a current of 1 mA is passed perpendicular to a
magnetic field of 0.1 T. [Given: e = 0.07m2 V 1 s1 ]
Solution:
Conductivity = Nd ee
(5.1)
= 112 ohm m .
3 BI
8 net
(5.2)
(5.3)
(5.4)
(5.5)
180
Semiconductors
3 3.14 0.1 1 103
8 1022 1.6 1019 100 106
= 0.735 mV (Ans.).
(5.6)
(5.7)
5.13 An intrinsic semiconductor has an energy gap of 0.7 eV. Calculate the probability
of occupation of the lowest level in conduction band at 0 , 50 and 100C.
Solution: The lowest level in conduction band corresponds to
(E E F ) = 0.35 eV.
The probability of occupation of this level is given by
F(E) =
1
1 + exp (E E F )/kT
Exercise
5.1 Explain the formation of a potential barrier in a p-n junction.
(March 1999).
Semiconductors
181
(March 2000).
5.8 What is Hall effect? How is it useful in the study of conduction in materials?
(March 2000).
5.9 What is Hall effect? Derive an expression for the Hall coefficient. (August 2000).
5.10 Derive an expression for the conductivity of an intrinsic semiconductor. The
electron and hole mobility of silicon are 0.14 and 0.05 m2 v1 s1 respectively
at a given temperature. If the electron density is 1.5 1016 m3 , calculate the
resistivity of silicon.
(August 2000).
5.11 Discuss the I-V characteristics of a zener diode. Discuss the different types of
breakdown at the p-n junction.
(March 2001).
5.12 Describe a p-n junction and explain its voltage-current characteristics. What is
the effect of temperature on the conductivity of semiconductors.
(August 2001).
5.13 Explain how a potential barrier is formed in a p-n junction.
(March 2002).
5.14 Distinguish between direct band gap and indirect band gap semiconductors.
(March 2002).
5.15 Distinguish between zener breakdown and avalanche breakdown.
(March 2002).
Chapter 6
Introduction
In this chapter we study the properties of insulator materials in static electric fields. Insulators possess very few free charge carriers and hence are bad conductors of electric
current. However, application of an electric field results in some interesting processes
involving rearrangement of charges in the material. These characteristic features of
insulators are called dielectric properties. These properties and the behaviour of dielectric materials in a static electric field may be explained reasonably well with the
help of a simple atomic theory.
Gauss theorem is one of the most fundamental theorems related to static electric
fields. The theorem may be explained by considering a closed surface containing a
large number of charges. If Q1 , Q2 , Q3 , . . . . . . Qn represent the charges enclosed by a
surface, then the total electric flux emerging from the surface is given by
=
n
X
Qi =
i=1
D ds
(6.1)
We note that the total flux is expressed in units of charge, namely coulombs which
may also be written as equal to the surface integral where D represents the flux density
expressed in coulombs m2 and ds is the surface element where the flux density is
evaluated. The flux density in vacuum is proportional to the electric field strength E
according to the relation
D = o E
(6.2)
Since the electric field is measured in units of volts m1 , o will have the units farad m1 .
This constant of proportionality is called the permittivity of vacuum. If the electric
field and the flux density are considered in any other medium, the relation between
them will be modified as
D = E
(6.3)
where is the permittivity of the medium. This relation may also be expressed as
D = o r E
182
(6.4)
183
where r is a dimensionless quantity called relative permittivity or dielectric constant of the medium. For a dielectric medium, this is an important parameter and can
be easily evaluated by simple measurements. Consider a parallel plate condenser of
area A and separation between the plates, d (Fig.6.1).
+ + + + + + +
+ + + + + + +
Figure 6.1 A parallel capacitor with an applied electric field E. The flux lines will be
directed from positive plate to the negative plate.
Let q be the charge density on the plates which will be numerically equal to the
flux density D. Hence, we have the electric field existing between the plates given by
E = D/ r = q/ r
(6.5)
C = r A/d
(6.6)
(6.7)
If the dielectric medium between the two parallel plates is removed and the space
evacuated, the capacitance will be
Cvac = A/d
(6.8)
From equation (6.7) and (6.8), the dielectric constant of the medium is given by
r = C/Cvac
(6.9)
184
6.2
Polarization
An electric dipole, in its simplest form, consists of two equal point charges of opposite
sign, Q and Q, separated by a distance d (Fig.6.2). The dipole moment associated
with this electric dipole is equal to Qd and is represented by a vector pointing from the
negative charge towards the positive charge.
i.e. Dipole moment = Qd
(6.10)
+Q
Q
d
= Qd
Figure 6.2 Dipole constituted by two equal and opposite charges separated by a distance. The direction and magnitude of the dipole moment are indicated.
In a dielectric material, such electric dipoles may be present even in the absence of
an applied electric field. For example, in a diatomic molecule AB, if the atom A has a
tendency to give up an electron to B during the formation of a bond, atom A is said to
be more electropositive than atom B. Hence, the molecule AB carries a dipole moment
even in the absence of an applied electric field. This is called a permanent dipole. On
the other hand, the dielectric material may not have a permanent dipole but a dipole
may be induced under the influence of an applied electric field. For example, in an
atom, the centre of the electron cloud coincides with the nucleus and hence there is no
dipole moment associated with it. On application of an electric field, the electron cloud
and the nucleus move in opposite directions and the centre of the electron cloud will be
shifted through a finite distance from the nucleus, thus resulting in a dipole. The dipole
moment associated with this atom is said to be induced dipole moment. Depending on
whether a dielectric contains permanent dipoles or not, the materials are called polar
materials and non-polar materials respectively. The dipole moment per unit volume of
the material is called Polarization.
Thus we see that when a dielectric medium is subjected to an electric field, there
will be polarization occurring in the medium due to the presence of permanent dipoles
or induced dipoles or both. If we consider a parallel plate condenser with an applied
electric field E there will be a total charge +Q on the positive plate and Q on the
negative plate (Fig.6.3).
185
We have
E = Q/ A
+ + + + + + + + + + + + + + + + + + +
1111111111111111111
0000000000000000000
0000000000000000000
1111111111111111111
0000000000000000000
1111111111111111111
0000000000000000000
1111111111111111111
0000000000000000000
1111111111111111111
0000000000000000000
1111111111111111111
0000000000000000000
1111111111111111111
0000000000000000000
1111111111111111111
+ + + + + + + + + + + + + + + + + + +
0000000000000000000
1111111111111111111
0000000000000000000
1111111111111111111
(6.11)
+Q
Q
+Q
Q
Figure 6.3 A parallel plate capacitor showing the induced charges on the opposite
faces of the dielectric due to polarization.
where A is the area of the plate. When a dielectric medium is introduced between the
parallel plates, due to polarization in this medium, there will be induced charges at the
surface of the dielectric. The charge on the surface near the positive electrode will be
negative (Q ) and vise versa. These induced charges will tend to weaken the applied
field and the resultant electric field in presence of the dielectric medium is given by
E = (Q/ A) (Q / A)
(6.12)
(6.13)
where r is the dielectric constant of the medium. From equation (6.12) and (6.13) we
have
Q
Q
Q
=
r A A A
Q/r = Q Q
(6.14)
(6.15)
This indicates that the induced charge is less than the charge on the electrodes due to
the applied field and Q = 0 for r = 1.
186
(6.16)
(6.17)
Thus, we see that the polarization produced in the dielectric medium is proportional
to the applied electric field as long as the dielectric constant r is independent of E.
Further, equation (6.17) is valid for gaseous, liquid and solid dielectrics. The quantity
(r 1) is sometimes referred to as the electric susceptibility and denoted by the
symbol . Hence, we can write
= (r 1) = P/ E
or P = E
The term electric susceptibility is rarely used where as the term dielectric constant is
more popular and widely used.
6.2.1 Mechanisms of polarization
There are four mechanisms by which polarization can be produced in dielectric materials. They are
a) Electronic polarization
b) Ionic polarization
c) Orientational polarization
d) Space charge polarization
187
The effective polarization produced in a dielectric material is the net result of the contributions and can be explained on the basis of atomic theory.
a) Electronic polarization: Consider a single atom consisting of a positive nucleus
of charge Ze and z electrons moving around the nucleus. Taking into account the
size of the nucleus and that of the electron orbits we can consider the atom to be
made up of a point positive charge surrounded by a negative electron cloud. In the
absence of any external influence, the centre of the electron cloud coincides with
the central positive charge (Fig.6.4).
111111111
000000000
000000000
111111111
000000000
111111111
000000000
111111111
+ Ze
000000000
111111111
000000000
111111111
000000000
111111111
000000000
111111111
(a)
111111111
000000000
000000000
111111111
000000000
111111111
000000000
111111111
+ Ze
000000000
111111111
000000000
111111111
000000000
111111111
000000000
111111111
(b)
188
(a)
(b)
Figure 6.5 Effect of an applied electric field E on an ionic solid. Open circles indicate
negative charges and closed circles indicate positive charges. A dipole moment is
induced due to the field.
B
11001100
000000000000000
111111111111111
11111111111111
00000000000000
(a)
10111111111
000000000
111111111
10000000000
000000000
000000000
111111111
111111111
00000000010111111111
000000000
111111111
0
1
000000000
000000000
111111111
10111111111
000000000111111111
000000000
111111111
1010
000000000111111111
000000000
111111111
1010
000000000
000000000
111111111
000000000111111111
000000000
111111111
111111111
10
000000000111111111
000000000
111111111
(b)
Figure 6.6 Dipole moments in a molecule of the type ABA. In (a), the resultant dipole
moment is zero whereas in (b), the resultant dipole moment is the vector sum of the two
individual dipole moments.
The molecule consists of two dipole moments directed from B to A. It is observed
that the resultant dipole moment may be zero if the molecule has a centre of symmetry. Molecules of carbon dioxide (CO2 ) and carbon disulphide (CS 2 ) are examples.
However, in the absence of a centre of symmetry, the molecule carries a resultant
dipole moment given by the vector sum of the dipole moments of the individual
189
dipoles. An example is water molecule (H2 O) where the two OH bonds and the
dipoles associated with them are inclined at an angle of about 105 . When an external field is applied to a molecule carrying a permanent dipole moment, the dipoles
will tend to align in the direction of the field. The contribution to the polarization
from the process of orientation of permanent dipoles is called orientational polarization or molecular polarization.
d) Space charge polarization: In dielectric materials, there can be deviations from
periodic arrangement of atoms due to vacancies or presence of impurity atoms.
There can be a few inclusions of an additional phase. When an electric field is
applied, these charges may move through the material to its surfaces. Immobile
inclusions get polarized. This results in the development of charges near the electrodes (Fig.6.7).
E
++
++
+
++
+
++
-
+
++
+
+
+
190
polarization is due to the presence of permanent dipoles. In the absence of an electric field, the dipoles are randomly oriented with no net polarization. When a field is
applied, the permanent dipoles tend to align parallel to the field with the result that
there will be a net polarization. However, its contribution decreases at higher temperature due to greater thermal agitation. The orientational polarization is inversely
proportional to the temperature and is given by
P =
N2p E
3kT
(6.18)
where p is the permanent dipole moment associated with each molecule, N is the
number of molecules per unit volume and E is the applied electric field.
6.2.3 Effect of frequency on polarization
When an electric field is applied to a dielectric the dipoles tend to align parallel to the
field. However, if the electric field is alternating the dipoles will have to keep changing
their directions of alignment. At low frequencies, the dipoles will have sufficient time
to follow the applied field. As the frequency of the applied electric field increases,
the dipoles cannot keep up with the changing field. This results in a decrease in the
polarization with increasing frequency of the applied field.
The ability of the dipoles to follow the applied alternating field depends on the
mechanisms by which the dipoles are produced. Space charge polarization involves
the movement of ions in the dielectric medium and hence is the slowest process. Its
contribution is normally observed in static (or dc) fields and may vanish in alternating
fields of even very low frequency. Orientational polarization involving the rotation
of molecules is also a slow process and its contribution is observed upto frequencies
of the order of 106 108 Hz. Ionic polarization involving displacement of ions has
considerable contribution upto frequencies of 1013 Hz. Electronic polarization, which
is the fastest mechanism of polarization as it involves the motion of electron cloud,
is observed even upto 1015 Hz. Assuming that all the four types of polarization are
possible in a dielectric medium, the variation of total polarization with the frequency
of the applied electric field may be as shown in Fig.6.8.
Thus, the space charge polarization will contribute at power frequencies, orientational polarization in audio and radio frequencies, ionic polarization in infrared region
and electronic polarization in the optical region of frequency. Accordingly, the material will have different dielectric constants in different regions of frequency of the
applied electric field.
191
12
log f
Figure 6.8 Dependence of polarization on the frequency of the applied electric field
showing the contributions from different polarization mechanisms.
6.3
Dielectric Constant
01
+ Ze
11111
00000
00000
11111
00000R
11111
00000
11111
x + Ze
00000
11111
1
0
11111
00000
E
Figure 6.9 Shift of the electron cloud in an inert gas atom due to the applied electric
field.
When this atom is placed in an electric field E, the nucleus and the electron cloud
will try to move in opposite directions. However, the coulombic force of attraction
192
between the electron cloud and the nucleus will prevent them from pulling apart from
each other and an equilibrium will be obtained. Let x be the displacement of the
centre of the electron cloud from the nucleus at equilibrium. The electron cloud can be
divided into two regions by drawing a sphere of radius x with the centre of the electron
cloud as the centre. The nucleus of charge ze will lie on the surface of this inner sphere.
The force exerted on the nucleus will be due to the negative charge inside the sphere
of radius x since the charges in the region between the two spherical surfaces of radius
x and R do not exert any force, according to the Gauss theorem. The negative charge
present inside the sphere of radius x is equal to zex3 /R3 and the force exerted by this
charge on the nucleus is
F coulomb =
Ze(Zex3 /R3 )
1
4
x2
(6.19)
At equilibrium, this must be equal to the force acting on the nucleus due to the applied
field.
F f ield = ZeE
(6.20)
From equation (6.19) and (6.20) we get
x = (4 R3 /Ze)E
(6.21)
This equation indicates that the displacement of the electron cloud is directly proportional to the strength of the applied electric field. The dipole moment induced due to
the applied field is given by
ind = Ze.x = 4 R3 E = e E
(6.22)
The induced dipole moment is also proportional to the applied electric field and the
proportionality factor e is called the electronic polarizability of the atom.
If the gas contains N atoms per unit volume, the polarization produced in the gas
is given by
P = Ne E
(6.23)
Comparing this with equation (6.17) which relates polarization with the dielectric constant, we obtain
(r 1) = Ne = 4 NR3
or r = 1 + 4NR3
(6.24)
(6.25)
This is the expression for the dielectric constant of the gas. An experimental verification of the dielectric constant of an inert gas matches very well with the calculated
193
value from equation (6.25), thereby indicating that the simple atomic interpretation is
quite valid. Further, the electronic polarizability e , which is equal to 4 R3 , is found
to be independent of temperature in agreement with the experimental observations.
6.3.2 Dielectric constant of polyatomic gases
The case of mono atomic gases considered above is the simplest of the cases where the
polarization of the gas is only due to the shift of the electron cloud under the influence
of the applied electric field. In other words, only electronic polarizability contributes to
the polarization and hence to the dielectric constant of the gases. In the case of polyatomic gases, we have to consider the possibility of other polarization mechanisms
also contributing to the dielectric constant. We have seen that the electronic and ionic
contributions to the total polarization are independent of temperature. Hence, a study
of the temperature dependence of polarization will help us in identifying the presence
of permanent dipoles in the molecules. For example, the dielectric constant of gases
like CO2 , CS 2 , CH4 etc., is independent of temperature whereas gases like HCl, H2 O,
H2 S , CH3 Cl show temperature dependence of dielectric constant. Hence the net permanent dipole moment for molecules like CO2 , CS 2 , CH4 etc., is zero and molecules
like HCl, H2 O, H2 S , CH3Cl etc., have a net permanent dipole moment which will align
itself parallel to the applied electric field thereby contributing to the dielectric constant
of the gas. This, measurement of dielectric constant has been very much useful in the
investigation of molecular structure.
6.3.3 Internal field in solids and liquids
The atoms or molecules in a gas when subjected to an electric field will have dipole
moments, induced or permanent, aligning parallel to the applied field. These dipoles
were considered to be independent of each other in a gas. However, in a solid or a
liquid, these dipoles are close enough to influence the local field acting at the location
of each atom. The field seen by each atom or molecule in a solid or a liquid is called
the local field or internal field, E i . This internal field is larger than the applied field E.
Let us evaluate the internal field for the one dimensional case using Epsteins model of
a linear chain of similar, equidistant atoms (Fig.6.10).
Let e be the polarizability of atoms. When they are subjected to an electric field
E, along the chain of atoms, each atom will carry an induced dipole moment ind given
by
ind = e E i
(6.26)
194
Figure 6.10 A linear chain of atoms carrying induced dipole moments in presence of
an applied electric field.
where E i is the internal field which is written as
Ei = E + E
(6.27)
E being the additional field due to effect of neighbouring dipoles. Our task is to
evaluate E so that E i can be determined. Consider a dipole of dipole moment be
situated at the point O (Fig. 6.11).
Er
E
P
1
cos
4
r2
(6.28)
195
The electric field due to this potential has two components given by
V
1
2 cos
=
r
4
r3
1
sin
1 V
=
E =
r
4
r3
Er =
(6.29)
(6.30)
The electric field intensity at the point A due to the dipole on the atom at B can be
calculated by putting r = a and = 0 in the equations (6.29) and (6.30). This yields
the component E to be zero. Hence, electric field intensity at the atom A due to the
induced dipole at atom B is given by
EB =
ind
(2 a3 )
(6.31)
where a is the inter-atomic spacing along the chain of atoms. Similarly, the electric
field intensity at the atom A due to the induced dipole at B is also equal to the value
given by equation (6.31). The total electric field due to atoms B and B will be the sum
of the two since the dipoles on atoms B and B are similarly directed. Therefore, the
electric field at A due to dipoles on atoms B and B is
E BB =
ind
a3
(6.32)
ECC =
(6.33)
E DD
(6.34)
and so on. The net additional field E due to the induced dipoles is given by
i.e.,
E = E BB + ECC + E DD + . . .
1.2ind
ind X
1/n3 =
E =
3
a n=1
a3
(6.35)
Substituting for E and ind from equations (6.35) and (6.26) respectively in equation (6.27), we get
Ei = E +
1.2e E i
a3
(6.36)
196
Ei =
E
(1 1.2e / a3 )
(6.37)
Since the term (1.2e / a3 ) is positive, we conclude that the internal field is larger
than the applied field. In the three dimensional case, the calculation of the internal field
will be complicated. Proceeding similarly, we may show that for a three dimensional
case (1.2/) may be replaced by where is called the internal field constant. A
particular case which is often encountered is the one in which the atoms are cubically
arranged. For this particular case, the internal field constant has a value equal to
(1/3). The internal field for the case of such materials is called the Lorentz field.
Further, (1/a3 ) will be equal to N where N represents the number of atoms/unit volume.
Hence,
E i = E + P/3
(6.38)
It should be noted that only electronic polarization has been considered in this derivation. However, it can be extended to include ionic polarization but will not be valid for
the case of polar liquids.
6.3.4 Dielectric constant of elemental solids
Elemental solids are those containing only one kind of atoms and the polarization is
essentially due to the electronic polarizability of the atoms. The polarization in such a
solid is given by
P = Ne E i
(6.39)
where E i is the internal field and N is the number of atoms per unit volume of the solid.
Substituting for E i from equation (6.38), we have
Or
P = Ne [E + (1/3 )P]
Ne E
P=
1 (Ne /3 )
(6.40)
(6.41)
(6.42)
(6.43)
197
(6.44)
198
270
280
290
()
(a)
300
100
150
200
()
(b)
Figure 6.12 Variation of dielectric constant with temperature for (a) nitrobenzene and
(b) hydrogen chloride.
In the case of hydrogen chloride(HCl), a transition from liquid state to the solid
state results in a small, abrupt increase in the dielectric constant as shown in Fig. 6.12(b).
199
This increase is explained as due to a change in the density of the material on solidification. In the solid state, the dielectric constant increases with decrease in temperature indicating the continued contribution from orientational polarization. At a lower
temperature, the dielectric constant decreases suddenly indicating that the permanent
dipoles have become immobile.
Hence, a study of the variation of dielectric constant with temperature is helpful in
evaluating the contribution from orientational polarization and also to gain insight into
the molecular structure of materials.
6.4
Ferroelectric materials
(6.17)
There are some substances for which the polarization is not a linear function of the
applied field. These materials exhibit hysteresis in the variation of polarization with the
applied field. Such materials are said to be ferroelectric. Fig.6.13 shows the hysteresis
loop exhibited by a ferroelectric material.
P
Ps
Pr
Ec
200
When an electric field is applied to an unpolarized sample, the polarization increases with the applied electric field in a non-linear manner. When the field is reduced, the polarization also reduces but will not retrace its path. It is observed that a
certain amount of polarization is still present when the field is reduced to zero. This is
called remanent polarization, Pr . In order to make the polarization zero, a field in the
opposite direction has to be applied. The electric field required to bring the remanent
polarization to zero is called the coercive field, E c . The existence of hysteresis indicates that the material must be undergoing spontaneous polarization. This behaviour
of ferroelectric materials is explained on the basis of domain theory. According to this
theory, the specimen may be considered to be made up of a large number of small
regions called domains within which the dipoles are oriented parallel resulting in a
spontaneous polarization. In a virgin specimen the direction of polarization varies
from one domain to another and the resultant polarization is zero. When an electric
field is applied, the domains with parallel orientation grow in size at the expense of
others. Ultimately, the entire specimen may become a single domain resulting in saturation polarization, P s . Beyond this value of polarization, the increase is due to the
normal dielectric behaviour.
Hysteresis in polarization is the most characteristic property of ferroelectric materials. Material show this property only upto a certain temperature called ferroelectric
curie temperature, f . Above this temperature, the material no longer exhibits hysteresis but behaves like a normal dielectric. The dielectric constant is given by
r = C/(T ) for T > f
(6.45)
201
Ps
250
Ps
270
T, K
(a)
290
100
Ps
110
T, K
(b)
120
200
300
T, K
400
(c)
202
6.5
Piezoelectric Effect
Application of an electric field generally displaces ions and consequently, the dimensions of the sample may undergo changes. On the other hand, mechanical stress also
changes the dimensions of the sample but normally does not produce polarization. In
most of the materials, polarization is observed to produce deformation but the deformation does not lead to polarization. This electro-mechanical effect, present in all
materials, is called Electrostriction. A simple example is shown in Fig. 6.16.
E
+
+
+
+
(b)
+
+
(a)
+
(c)
Figure 6.16 Illustration of electrostriction showing (a) an ionic lattice, (b) effect of an
applied electric field and (c) the effect of an applied compressive force.
Application of an electric field along x-axis results in the movement of positive ions
along x-direction and the negative ions in the opposite direction. Hence, a distortion
of the lattice occurs. However, when a tensile or a compressive force is applied to the
sample, there will be no polarization. This property of electrostriction is characteristic
of all crystalline materials which have a centre of symmetry.
There are some dielectric materials in which application of an electric field produces deformation and conversely, a deformation of the sample produces a polarization. This effect is called piezoelectric effect. An example is shown in Fig. 6.17.
Application of an electric field along x-axis produces a deformation. A tensile or a
compressive force will result in a change in the angle , there by resulting in polarization. A tensile force increases the angle and a compressive force decreases the angle.
The direction of polarization will change depending on the force being tensile or compressive. This effect is observed only in materials which lack a centre of symmetry.
Piezoelectric materials are used as transducers to convert mechanical energy into electrical energy or electrical energy into mechanical energy. Some applications are in the
making of microphones, strain gauges and ultrasonic generators.
203
+ +
+
(b)
(a)
(c)
Figure 6.17 Illustration of piezoelectric effect showing (a) an ionic lattice, (b) effect of
an applied electric field and (c) the effect of an applied compressive force.
6.6
Dielectric losses
I
C
V
(a)
(b)
Figure 6.18 Equivalent circuit for a capacitor and the corresponding phasor diagram.
The current through the capacitor leads the voltage by 90 and hence is 90 out of
phase with the current through the resistor arm. Hence, the resultant current will be
204
leading the voltage by an angle less than 90 . The change in the lead angle is called the
loss angle, (Fig.6.18b). The dielectric loss is expressed as loss factor which is equal
to tan .
tan = IR /Ic
(6.46)
where IR and Ic are the components of current that would have flown through an equivalent parallel resistance and the capacitor respectively.
i.e., tan =
(V/R)
Xc
=
(V/Xc )
R
(6.47)
(6.48)
Hence, the power loss in the capacitor will be equal to VI or V 2 /R which may be
written in terms of tan as
P = CV 2 tan
(6.49)
Thus, the dielectric loss depends on the value of capacitance, the equivalent resistance
and the frequency of the applied field. A good dielectric would have a loss factor of
the order of 105 .
Numerical Examples
6.1 A parallel plate capacitor of area 650mm2 and a plate separation of 4mm has a
charge of 2 1010C on it. What is the voltage required when a material of
dielectric constant 3.5 is introduced between the plates.
Solution:
Capacitance = A/d
=
= 5.033 1012 F
2.0 1010
Voltage required V = Q/C =
5.033 1012
V = 39.73V. (Ans.).
205
Solution:
P = (r 1)E
6.3 The polarizability of a neon atom is 3.5 1041 Fm2 . Find the dielectric constant
of neon gas at OC and normal pressure.
Solution: No of atoms in neon gas per unit volume at NTP
Avogadro number
Molar volume
6.023 1023
i.e., N =
atom m3
3
22.4 10
= 2.69 1025 atoms m3
=
We have
(r 1) = Ne
8.85 1012
4
= 1.06 10
R = 1.000106 (Ans.).
Or (r 1) =
6.4 The electronic polarizability of argon atom is 1.43 1040 Fm2 . Find the induced
dipole moment and the relative shift of the electron cloud when it is subjected to
an electric field of 105 Vm1 .
Solution:
Induced dipole moment = ind = e E
= 1.43 1040 105
ind
1.43 1035
=
Ze
18 1.6 1019
x = 4.97 1018 m (Ans.).
x=
206
6.5 A solid contains 2 1028 identical atoms per m3 each with a polarizability of
5 1040 Fm2 . Assuming Lorentz force field to be operative, calculate the ratio
of the internal field to the applied field.
Solution:
E i = E + (/ )P = E + (/ )Ne E i
"
#1
Ei
2 1028 5 1040
1
= 1
=
E
1 (/ )Ne
3 8.85 1012
= (1 0.377)1
Ei
= 1.604 (Ans.).
E
6.6 An elemental solid containing 2 1028 atoms m3 shows an electronic polarizability of 2 1040 Fm2 . Assuming a Lorentz force field to be operative, calculate
the dielectric constant of the material.
Solution: From Clausius - Mosotti equation,
(r 1) Ne
=
(r + 2)
3
2 1028 2 1040
=
3 8.85 1012
= 0.1506
(r 1) = (0.1506)(r + 2)
r (1 0.1506) = (1 + 2 0.1506)
1.3012
r =
= 1.52 (Ans.).
0.8494
6.7 Capacitor consists of two conducting plates of area 200cm2 each separated by a
dielectric material ( = 3.7) of thickness 1 mm. Find the capacitance and the
electric flux density when a potential of 300 V is applied.
Solution:
Capacitance C = r A/d
Flux density D = r E
207
6.8 What is the polarization produced in sodium chloride by an electric field of 500
Vm1 if it has a relative permittivity of 6?
Solution:
Polarization P = (r 1)E
6.9 The electronic polarizability of argon is 1.5 1040 Fm2 . If the gas contains
2.7 1025 atoms m3 , calculate the relative permittivity of the gas.
Solution: We have
(r 1) = Ne
8.85 1012
= 4.58 104
R = 1.000458 (Ans.).
Or (r 1) =
6.10 A solid containing 8 1028 atoms m3 shows an internal field which is 1.5 times
the applied field. If the solid has an internal field of Lorentz type, calculate the
polarization associated with each atom.
Solution: Polarization associated with each atom is the polarizability of the
atom.
Ei
1
=
E
1 (/ )Ne
(1 E/Ei) (1 1/1.5) 8.85 1012
Or e =
=
N
(1/3) 8 1028
= 1.106 1040 Fm2 (Ans.).
208
Exercise
6.1 Describe different polarization mechanisms.
(March 1999)
6.2 A parallel plate capacitor consists of two plates each of area 5 104 m2. They
are separated by a distance of 1.5 103 m and filled with a dielectric of relative
permittivity 6. Calculate the charge on the capacitor if it is connected to a 100 V
D.C. supply.
(March 1999)
6.3 Explain the properties of ferroelectric materials.
(March 1999)
6.4 What are dielectrics? Arrive at an expression for internal field in the case of
solids and liquids.
(March 1999)
6.5 What are dielectric materials? Explain the dependence of polarizability and dielectric loss on frequency.
(March 2000)
6.6 What should be the voltage required to introduce a material of dielectric constant
4 between the plates of a parallel plate capacitor of area 1000 mm2, having plate
separation of 5 mm and a charge of 3 1010C.
(August 2000)
6.7 Define the term Internal field and derive an expression for the same in the case
of liquids and solids.
(August 2000)
6.8 Explain with theory how static dielectric constant of a material is determined.
(March 2001)
6.9 Write a note on dielectric loss in a dielectric material.
(March 2001)
6.10 What is electrical polarization of an atom? Obtain an expression for the internal
field developed in solids when an electric field is applied to it.
(March 2001)
6.11 Obtain an expression for internal field in a one-dimensional array of atoms in
solids. Discuss the properties of ferro-electric materials.
(August 2001)
6.12 Discuu different polarization mechanisms.
(Feb 2003)
6.13 Define dielectric polarization. Derive an expression for internal field in case of
solid and liquid dielectric by considering one dimensional array of atoms.
(Aug 2003)
6.14 What is polarization and dielectric loss in dielectric materials? Explain the dependence of polarizability and dielectric loss on frequency.
(Aug 2004)
Chapter 7
Magnetic Properties
7.1
Introduction
The magnetic behaviour of materials is due to atomic magnetic dipoles present in these
materials. The orbital motion or spin of charged particles is equivalent to an electric
current and hence is associated with magnetic effects. For example, the orbital motion
of an electron around a nucleus, the spin of an electron and the spin of a nuclear particle will all constitute magnetic dipoles and carry a dipole moment. An atom may be
considered to contain many such magnetic dipoles carrying different dipole moments.
These are called permanent dipoles as they exist even in the absence of an applied magnetic field. However, when a magnetic field is applied, the orbital angular velocity of
electrons is modified resulting in a change in the magnetic dipole moment. This can be
considered to be equivalent to an induced dipole moment acting in a direction opposite
to the applied magnetic field. The magnetic behaviour of all materials is the result of
the contributions of the induced and permanent magnetic dipoles. A reasonably correct interpretation of the magnetic behaviour is possible on the basis of a simple atomic
theory.
The magnetic flux density B due to a magnetic field H applied in vacuum is
given by
B = H
(7.1)
wherecirc is called permeability of vacuum (or free space). The magnetic flux density
is also known as magnetic induction and is expressed in units of tesla or weber per
square metre. Magnetic field strength has the units ampere per metre. The units of
permeability are henry per metre.
When the magnetic field is applied to a solid medium, the magnetic flux density
will have an additional contribution from the magnetic dipole moments present in the
solid. The net flux density is given by
B = H + M
(7.2)
where M is the magnetic dipole moment per unit volume of the material. It can be
considered to be the internally produced magnetic field and is called intensity of magnetisation. The magnetic flux density may also be expressed as
B = H = r H
209
(7.3)
210
Magnetic Properties
where is the permeability of the solid and r is called the relative permeability.
From equations (7.2) and (7.3) we have
r =
M
H+M
= 1+
=1+
H
H
Or (r 1) =
(7.4)
(7.5)
The term which is equal to (M/H) is called magnetic susceptibility of the material.
The relative permeability and magnetic susceptibility are important quantities used to
describe the magnetic behaviour of materials.
7.2
Materials are classified into various types depending on the magnetic properties of the
dipoles present and the interaction between them. The first distinction that can be made
is between materials possessing permanent dipoles and those without them. Materials which lack permanent dipoles are called diamagnetic. These materials, however,
show weak magnetisation when placed in a magnetic field due to induced dipoles. The
magnetisation is very small and is opposite in direction to the applied magnetic field.
Hence, the diamagnetic materials have a small negative susceptibility value. Examples of diamagnetic materials are metals like copper, gold, silver, semiconductors like
silicon, germanium, ionic slats like NaCl, oxides like alumina, organic solids like benzene, napthalene, rare gases like helium, neon, argon, etc. All these materials have a
magnetic susceptibility of the order of 105 . An important exception is the case of materials in their superconducting phase when the diamagntic susceptibility will be equal
to 1. Superconductors are called perfect diamagnetic materials. Materials which have
permanent dipoles are further classified on the basis of the interaction between them.
If the interaction between the permanent dipoles is absent or negligibly small, then the
material is called paramagnetic (Fig. 7.1).
Metals like aluminium, magnesium, tungsten, titanium, platinum, oxides like CoO,
FeO, Cr2 O3 , chlorides like CrCl2 , FeCl2 , sulphates like CoS O4 , MnS O4 , NiS O4 and
gases like oxygen, nitrogen etc. are examples of paramagnetic materials.
If the dipoles interact strongly leading to their parallel alignment, the material is
ferromagnetic. Examples of ferromagnetic materials are the elements of iron group
(iron, cobalt, and nickel) and alloys containing these elements. The strong interaction among the permanent dipoles leads to large value of magnetisation and hence
high value of magnetic susceptibility. If the result of strong interaction is to align the
neighbouring dipoles antiparallel, the material is antiferromagnetic or ferrimagnetic
in nature. In the case of antiferromagnetic materials, the dipole moments on the
Magnetic Properties
211
PARA
FERRO
ANTIFERRO
FERRI
TN
212
Magnetic Properties
Permanent
dipole
Interaction of
neighbouring dipoles
105
Diamagnetic
Absent
Nil
Paramagnetic
Present
Negligible
Ferromagnetic
Present
Large
Antiferromagnetic
Present
Negligible
Ferrimagnetic
Present
Large
7.3
103
Permanent dipole moments arise due to the angular momentum associated with the
charged particles. Accordingly, permanent dipoles will be present in an atom due to
the following three contributions to the angular momentum:
(i) Orbital angular momentum of electrons
(ii) Spin angular momentum of electrons
(iii) Spin angular momentum of nucleons
Consider the simplest case of a hydrogen atom in which an electron is going round
a proton in an orbit of radius R (Fig. 7.3).
The current associated with this orbital motion is equal to e f where f is the
frequency of rotation given by
f = /2
(7.6)
Magnetic Properties
213
R
+e
= (e/2) R2 = 1/2eR2
(7.7)
(7.8)
(7.9)
(7.10)
(7.11)
The negative sign included in equation (7.11) is due to the fact that m and Ma have
opposite direction as a consequence of the negative charge of the electron. It can be
shown that equation (7.11) is valid not only for the simplest case of a hydrogen atom
214
Magnetic Properties
but also for any charge distribution. The angular momentum of an electron can be
expressed in terms of the magnetic quantum number ml as
Ma = (h/2) ml
(7.12)
It may be mentioned here that the orbital motion of an electron may be described by
the quantum numbers n, l and ml . The principal quantum number n accounts for the
energy of the electron; the orbital quantum number l determines the orbital angular
momentum and the magnetic quantum number ml represents the component of the
angular momentum along the direction of the applied external magnetic field. The
quantum numbers are interrelated and have discrete values as follows:
n = 1, 2, 3, . . .
l = 0, 1, 2, . . . (n 1)
ml = l, (l 1), (l 2), . . . . . . 0, 1, 2, . . . l.
Hence, it is possible to compute the permanent dipole moment associated with an atom.
For example, the electrons in the K-shell corresponding to n = 1 will have l value zero
and ml value also zero indicating that these electrons do not contribute to the permanent
dipole moment. For the electrons in the L-shell (n = 2), the possible l values are zero
and one. For l = 0, ml = 0. For l = 1, ml can have values 1,0 and 1. Each of these ml
values can be possessed by two electrons with their spin quantum numbers m s equal to
1/2 and 1/2. Hence, the distribution of electrons in different shells can be as follows:
n=1
l=0
ml = 0
n=2
l=0
ml = 0
l=1
ml = 1
ml = 0
ml = 1
m s = 1/2
2 electrons.
m s = 1/2
2 electrons.
m s = 1/2
2 electrons.
m s = 1/2
2 electrons.
m s = 1/2
2 electrons.
eh
4m
(7.13)
Magnetic Properties
215
This is an atomic unit of magnetic moment and called Bohrmagneton, . The numerical value of magnetic moment corresponding to one Bohr magneton is equal to
= 9.27 1024 ampm2
Thus, contribution to the permanent dipole moment due to orbital angular momentum
of electrons can be expected only from partially filled electronic shells. Hence, the iron
group (atomic number 21 to 28 ) and rare earths (atomic numbers 39 to 45 and 58 to
71) are expected to show contributions from orbital angular momentum. However, in
the solid state, these dipole moments will be frozen in and will not be in a position to
orient themselves in presence of an external magnetic field. In the case of iron group
of elements, the incompletely filled shells are close to the outside of the atoms and
hence interact strongly with the neighbouring atoms. Thus, their contribution to the
magnetic dipole moment is negligible. In the case of rare earths, the incompletely
filled shells are relatively deep inside the atoms and interact to a lesser degree with
those of neighbouring atoms. Hence, they do contribute to the magnetic properties of
the material.
The second and the most important contribution to the magnetic behaviour of materials comes from the spin angular momentum of electrons. It can be shown that the
dipole moment due to the spin of an electron is equal to
m(spin) = (e/m) Ma(spin)
h(1/2)
where Ma(spin) =
= (h/4)
2
m(spin) = (eh/4m) =
(7.14)
(7.15)
Thus, each electron contributes a magnetic moment equal to depending on the spin
of the electron. In an atom with many electrons, the resultant magnetic moment due to
spin depends on the number of electrons with spin up and those with spin down. The
calculated values of resultant magnetic moments for some atoms is given in Table 7.2.
These values are valid for individual free atoms. In the metallic state, the observed
value differs from the calculated value.
The angular momentum associated with the spin of the nucleons is of the same order as that due to the spin of electrons. However, since the mass of the nucleon is larger
than that of an electron by a factor of 103 , the magnetic dipole moment associated with
the spin of nucleon is smaller by the same factor. Hence, the effect of nuclear spin is
usually negligible.
216
Magnetic Properties
Table 7.2 Dipole moments of atoms due to spin of electrons
7.4
Atomic
No.
Element
No. of III
electrons
20
21
22
23
24
25
26
27
28
Calcium
Scandium
Titanium
Vanadium
Chromium
Manganese
Iron
Cobalt
Nickel
0
1
2
3
4
5
6
7
8
Spin
alignment
0
Dipole
moment
0
2
3
4
5
4
3
2
Magnetic hysteresis
Materials with a large value of magnetic susceptibility namely ferromagnetic and ferrimagnetic materials exhibit hysteresis in the B versus H curve (or M versus H curve).
When a magnetic field H is applied to a typical sample, the magnetic flux density B
(or the magnetisation M) will vary in a non-linear manner. The permeability or the
magnetic susceptibility will no longer be constant but will be functions of the applied
magnetic field. As the magnetic field is increased the flux density (Magnetisation)
increases and reaches a saturation value Bsat (M sat ). When the field intensity H is reduced to zero, the flux density will not go to zero but will have a finite value called
remanent flux density, Br (or remanent magnetisation, Mr ). This remanent flux density may be reduced to zero by applying a magnetic field in the opposite direction. The
field required to reduce the flux density to zero is called the coercive field, Hc . The
variation of magnetic flux density (or magnetisation) with the applied magnetic field is
represented by the hysteresis loop shown in Fig. 7.4.
This is the prominent and characteristic feature of ferromagnetic and ferrimagnetic
materials. It is exhibited by materials below a characteristic temperature. For example,
a ferromagnetic material will show hysteresis in its magnetisation below a temperature
f called the ferromagnetic transition temperature. Above this temperature, the material behaves like a paramagnetic material. In the paramagnetic phase the magnetic
susceptibility may be expressed as
Magnetic Properties
217
B sat
Br
Hc
= C/T
(7.17)
and the relation is called the Curie law. The paramagnetic Curie temperature is usually
higher than the ferromagnetic Curie temperature f (Fig. 7.5).
The spontaneous magnetisation and hysteresis observed in ferromagnetic and ferrimagnetic materials are explained on the basis of domain theory. A virgin sample of
these materials consists of a large number of regions or domains which are spontaneously magnetised; i.e., the magnetic dipoles within the domains are aligned parallel
and have a resultant dipole moment. However, the direction of spontaneous magnetisation varies from domain to domain so that the resultant magnetisation in the sample
is zero (Fig.7.6). When an external magnetic field is applied to this sample, the domain with parallel orientation will grow in size at the expense of others. As the field
is increased, all the domains may orient in one direction leading to a saturation value
of magnetisation. Removal of the applied field leaves the sample in its magnetised
condition and a field in the opposite direction will be needed to re-orient the domains.
218
Magnetic Properties
1/
Antiferro
Para
Ferro
Figure 7.5 The reciprocal of susceptibility as a function of temperature for para, ferro
and antiferromagnetic materials.
(a)
(b)
Figure 7.6 Domain orientation in a ferromagnetic sample (a) in the absence of applied
magnetic field and (b) after application of a magnetic field.
The saturation magnetisation and the coercive field are important parameters that
define the characteristics of magnetic materials and their possible applications.
7.5
Ferromagnetic and ferrimagnetic materials are known for their high magnetic susceptibility and high value of saturation magnetisation. These materials are classified as
hard or soft depending on the coercive field associated with them. Materials with a
Magnetic Properties
219
low value of coercive field are called soft magnetic materials and those with a high
value of coercive field are called hard magnetic materials. A study of the hysteresis
loop will help us in identifying these materials (Fig.7.7).
B
Soft
Hard
Figure 7.7 Hysteresis loops for hard and soft magnetic materials.
Hard magnetic materials with their large values of magnetisation and also coercive
field have fat hysteresis loop. They require a large magnetic field in the opposite direction to alter their parallel alignment. For soft magnetic materials, on the other hand, a
low coercive field is sufficient to reduce the magnetisation to zero.
7.6
220
Magnetic Properties
such as Fe, Co, Mn, Zn, Cd, Mg, etc. Garnets have a chemical formula M3 Fe5 O12
where M is a rare earth ion such as S m, Eu, Gd, Y etc. Y Al garnet, Y Gd garnet,
Y Al Gd garnet, etc., are soft magnetic materials useful for microwave applications.
Ceramic magnetic materials, because of their high electrical resistivity, have low
eddy current losses.
7.7
Ferrites
Ferrites are solid solutions of two oxides with formula MO and Fe2 O3 . The magnetic
properties of ferrites are intimately related to their structure. By a suitable selection
of the composition, it is possible to design materials with the required magnetisation.
Ni Zn ferrite and Mn Zn ferrite are examples of soft magnetic materials, where as
barium ferrite is an example of hard magnetic material.
Let us consider the case of magnetite. This has a chemical formula Fe3 O4 and
may be written as a solid solution of two oxides FeO and Fe2 O3 . Each formula unit
of the ferrite has one Fe2+ ion and two Fe3+ ions. A unit cell of the compound has
eight molecules with oxygen ions forming a close packed face centred cubic structure.
It is observed that there are two types of interstitial sites available for the metal ions
to occupy. They are octahedral holes called B-sites (an interstitial site surrounded by
six oxygen ions) and tetrahedral holes called A-sites(an interstitial site surrounded by
four oxygen ions). Corresponding to one formula unit of magnetite, there are two
octahedral sites and one tetrahedral site available for occupation by the metal ions.
There exists an anti-ferromagnetic interaction between the metal ions present in Asites and B-sites. In other words, the spin on the ion present in A-site is oppositely
oriented as compared to the spin on the ion present in B-site. In magnetite, Fe2+ ions
occupy half the octahedral sites. Fe3+ ions occupy the other half octahedral sites and
the tetrahedral sites (Fig.7.8).
Fe 2+
Fe 3+
Octahedral
B Sites
5
Tetrahedral
A Sites
Fe
3+
Figure 7.8 Distribution of Fe2+ and Fe3+ ions in octahedral and tetrahedral holes.
Magnetic Properties
221
Hence the magnetic moments due to Fe3+ ions get cancelled and the magnetization
of Fe3 O4 is due to the magnetic moment of Fe2+ ions alone, i.e., 4 per molecule.
The magnetic properties of ferrites are very sensitive to atomic arrangements. It
is observed that replacing some of the Fe2+ ions with Zn2+ or Cd 2+ ions will lead to
an increase in the magnetization. This is because Zn2+ ions go preferentially into the
tetrahedral holes there by forcing Fe3+ ions to occupy octahedral holes. Figure 7.9
shows the situation when 50% of the Fe2+ ions are replaced by Zn2+ ions (i.e., one
Fe2+ ion from two formula units). Since Zn2+ ions have no magnetic dipole moment,
Fe 3+
Fe 3+
Fe 2+
Fe 3+
Octahedral
B Sites
5
Tetrahedral
A Sites
2+
Zn
Fe
3+
Figure 7.9 Distribution of Fe2+ , Fe3+ and Zn2+ ions in the case of a compound with
50% of the Fe2+ ions replaced with Zn2+ ions.
the net magnetization increases. Thus it is possible to control the magnetic behaviour
of ferrites by suitably modifying the chemical composition.
7.8
Diamagnetic materials are those which do not possess permanent dipoles and the induced dipole moment is extremely small. Hence, these materials find application
where the magnetic effects are undesirable.
Paramagnetic materials possess permanent dipoles but there is negligible interaction among them. The magnetic susceptibility is small ( 103 ) but positive. Further, the
susceptibility is inversely proportional to the temperature and decreases with increase
in temperature. Paramagnetic salts are used in obtaining very low temperatures (less
than 1 K) by adiabatic demagnetisation. They are also used in solid state masers.
Ferromagnetic and ferrimagnetic materials show large magnetisation and hence
are characterised by large values of magnetic susceptibility. They are classified as
hard or soft magnetic materials depending on the coercive field required to remove the
222
Magnetic Properties
Numerical Examples
7.1 A magnetic field of 2000 Am1 is applied to a material which has a susceptibility
of 1000. Calculate
(i) relative permeability of the material,
(ii) intensity of magnetisation and
(iii) flux density.
Solution:
(i) Relative permeability,
r = 1 +
= 1 + 1000 = 1001 (Ans.).
(ii) Intensity of magnetisation,
M = H
= 1000 2000 = 2 106 Am1 (Ans.).
(iii) Flux density,
B = o r H
= 4 107 1001 2000
= 2.52Wbm2 (Ans.).
Magnetic Properties
223
7.2 Iron shows a saturation magnetisation of 1.75 106 Am1 . Calcualte the contribution of each atom to the magnetisation if iron has a b.c.c structure with a lattice
parameter of 2.86A .
Solution:
Volume of unit cell = a3 = (2.86 1010 )3 = 2.34 1029 m3
Number of atoms
per unit cell = 2
Number of atoms
per unit volume = (2/a3 ) = 8.54 1028 m3
Magnetisation due
to each atom =
1.75 106
8.54 1028
2.05 1023
= 2.2 (Ans.).
9.27 1024
7.4 Atomic number of nickel is 28. Calculate the dipole moment per atom due to
spin of electron. If the dipole moment for an atom in the bound state is just half
the value for the free atom, find the saturation magnetisation in nickel. Assume
nickel to have 5 1028 atom m3.
Solution: The electron configuration for Ni is
1s2 2s2 2p6 3s2 3p6 3d 8 4s2
There are eight 3d electrons with their spins aligned as 5 UP and 3 DOWN
producing a resultant dipole moment of 2. Hence, in the bound state, each
atom contributes a dipole moment of .
224
Magnetic Properties
Saturation magnetization is the product of the dipole moment on each
atom and the total number of atoms per unit volume.
MS = 9.27 1024 5 1028
= 463500Am1 (Ans.).
Exercise
7.1 Describe the nature of hard and soft magnets.
(March 1999).
7.2 What are ceramic magnets? Discuss their properties and the reason for classifying ferrites as ceramic magnets. Explain two important applications.
(March 1999)
7.3 Explain magnetic hysteresis on the basis of domain theory.
(August 1999).
7.4 What are ceramic and metallic magnets? Give their properties and applications.
(August 1999).
7.5 Explain the properties and applications of ceramic magnets.
(March 2000).
7.6 What are the contributions to the magnetic dipole moment of an atom? How are
the magnetic materials classified? Discuss the changes in magnetic properties of
a ferromagnetic material as a function of external magnetic field. (March 2000).
7.7 Explain the classification and properties of magnetic materials. (August 2000).
7.8 Explain orbital angular momentum, spin angular momentum and nuclear spin.
(August 2001).
7.9 What are ceramic and metallic magnets? Give their properties.
(August 01).
7.10 How are magnetic materials classified on the basis of susceptibilities? Discuss
the contribution of angular momentum to the magnetic properties of materials.
(March 2002).
7.11 Discuss the properties of hard and soft magnetic materials with the help of hysteresis loop.
(Feb 2003, Aug 2003, Feb 2004).
7.12 Explain the properties and applications of ferrites.
(Aug 2004).
Chapter 8
Applied Optics
8.1
When radiation interacts with matter under appropriate conditions, it leads to an abrupt
transition of the quantum system such as an atom or molecule from one energy state
to another. If the transition is from a higher state to a lower state, the system gives
out a part of the energy and if the transition is in the reverse direction, then, it absorbs
the incident energy. When a substance absorbs energy, a part of the energy may be
re-emitted in the form of electromagnetic radiation in the visible or near-visible region
of the spectrum. This phenomenon is known as luminescence.
8.1.1 Luminescence
Luminescence involves the following two steps:
(i) The excitation of the electrons in the atoms of the solid, and
(ii) The subsequent emission of photons.
These two major steps may be associated with some intermediate steps. Depending on
the time interval between the excitation and the emission processes, luminescence is
classified as fluorescence and phosphorescence. Fluorescence is a process in which
the emission occurs during excitation or the time interval between the two processes of
excitation and emission being less than 108 s. The emission of light after the excitation
has ceased or with a lapse of time larger than 108 s is called phosphorescence. It is
observed that in majority of cases, the decay time is independent of temperature for
fluorescence and is temperature dependent for phosphorescence.
Luminescence is found to be associated with the presence of activators in materials. These activators are either impurity atoms present in minute quantities or a small
excess of one of the constituent elements in a compound. It is interesting to note that
the presence of certain other type of impurity may inhibit luminescence. Such impurities are called killers. Addition of activator impurity in a crystalline solid will give rise
to localized energy levels in the forbidden energy gap (Fig. 8.1).
225
226
Applied Optics
Figure 8.1 Absorption and emission processes and the role of localized energy states
in the band gap.
These levels may be due to the activator atoms themselves or due to the host atoms
influenced by the presence of activator atoms. They may also be associated with lattice
defects induced due to the addition of activator atoms. In case of fluorescence, the
excitation and emission processes involve these localized energy levels in addition to
inter band transition. In case of phosphorescence, the localized energy levels present
in the band gap may be metastable states. A transition to a metastable state results in
trapping of the electron and an emission can occur when the electron is released back
into the conduction band. This accounts for the delay in the emission process.
The process of luminescence is further classified on the basis of the mechanism
of excitation. If the carriers are excited by photon absorption, the resulting emission
is called photoluminescence. In cathodo-luminescence, the excitation is by bombardment with high-energy electrons. If the excitation is due to the passage of an electric
current, the resulting luminescence is called electro-luminescence. Other types of luminescence are tribo-luminescence, chemi-luminescence, radio-luminescence,
magneto-luminescence, etc. In all these cases, the nomenclature refers to the mechanism by which the excitation is achieved. On the other hand, thermo-luminescence
refers to the case of luminescence in which, irrespective of the method of excitation,
the emission is stimulated by thermal process. Luminescence is an important property
exhibited by certain materials, which makes them useful for device applications.
8.1.2 Induced absorption
Induced absorption is the absorption of an incident photon by a system as a result of
which the system is excited from a lower energy state to a higher energy state wherein
Applied Optics
227
the difference in energy of the two states is precisely the energy of the photon. Consider
an atomic system in which an outer electron has a ground state energy of E 1 and excited
energy state E 2 . An absorption process takes the electron from E 1 to E 2 when a photon
of energy E, precisely equal to (E 2 E 1 )) is incident on the atom. As a result, its
energy becomes
E 2 = E 1 + E
The atom is said to have made a transition to the excited state and is indicated as atom .
This phenomenon is called induced absorption which is represented as
Atom + photon atom
8.1.3 Spontaneous emission
Spontaneous emission is the emission of photon when a system (an atom) transits
from an excited energy state to the ground state without the aid of any external agency.
The rate at which a transition from a higher energy level to a lower energy level takes
place will be proportional to the number of electrons present in the higher energy level
at any instant. In the absence of simultaneous excitation, the number of electrons in
the excited state goes on reducing exponentially with increase in energy. Consider an
atom in the excited state of energy E 2 . A spontaneous emission results in a photon of
energy h when the electron makes a transition from excited state E 2 to the ground
state E 1 . The energy of the photon is given by
h = E 2 E 1
8.1.4 Stimulated emission
The above-mentioned process of emission of radiation occurs randomly and on its
own. Therefore it is classified as spontaneous emission. However, the electrons in
the excited state need not wait for spontaneous emission to occur if it is stimulated
to emit a photon and go to a lower energy level. This stimulation may be brought
about by photons themselves of proper energy. Such an emission of radiation is called
stimulated emission. The emitted photon will be in phase with the photon stimulating
the emission. If the process continues, a monochromatic, coherent radiation will result.
Consider an atomic system in which an outer electron has a ground state of energy E 1 and an excited state of energy E 2 . An absorption process involving supply of
necessary energy takes the electron from E 1 to E 2 . The excited electron returns to the
ground state by a spontaneous emission of a photon of energy h given by
h = E 2 E 1
(8.1)
228
Applied Optics
The emission may also be stimulated with the help of photons of similar energy. If
n1 and n2 represents the number of electrons present in the energy levels E 1 and E 2 at
any instant at any temperature, then, by Boltzmanns distribution,
n2 /n1 = exp {(E 2 E 1 )/kT } = exp {h/kT }
(8.2)
This equation indicates that the population at energy level E 2 is much smaller than
that at E 1 at equilibrium and most of the electrons are in the lower energy state. If
by some mechanism, the population at E 2 is increased, a stimulating radiation can be
used to produce emission of radiation in the form of a strong coherent beam. This is
the principle used in the construction of a laser.
8.2
Let represent the energy density of the radiation field. It is defined as the total energy
in the radiation field per unit volume per unit frequency. This may produce absorption
and subsequent emission in an atomic system (Fig. 8.2).
Applied Optics
229
Figure 8.2 Steady state processes of (a) absorption, (b) spontaneous emission and (c)
stimulated emission. The rates of the three processes are mentioned.
The rates of different processes that can occur may be written as follows:
(i) The rate of absorption depends on the number of electrons n1 available at the
lower level E 1 and the energy density of radiation field assisting absorption,
(ii) The rate of spontaneous emission depends on the number of electrons n2 present
in the excited energy state E 2 , and
(iii) The rate of stimulated emission depends on the number of electrons n2 in the
excited energy state E 2 and the energy density of the radiation field assisting
emission.
At steady state, we can write
Rate of absorption = Rate of spontaneous emission + Rate of stimulated emission
K1 n1 = K2 n2 + K3 n2
(8.3)
This relation is called Einsteins equation and the constants K1 , K2 and K3 are called
Einsteins coefficients. These coefficients K1 , K2 and K3 represent the probability of
the processes of absorption, spontaneous emission and the stimulated emission taking
place respectively.
In the absence of any stimulated emission, the above equation reduces to
K1 n1 = K2 n2
K2 n2 K2
Or =
=
exp {h/kT }
K1 n1 K1
(8.4)
230
Applied Optics
But, according to the Plancks theory of radiation, (Please refer equation (1.5) of
Chapter 1), we have,
1
8h5
(8.5)
=
4
h/kT
c (e
1)
It is clear that the two equations, (8.4) and (8.5) are not in agreement. In order to rectify the discrepancy, Einstein proposed another kind of emission, known as stimulated
emission. Equation (8.3) may be rewritten as
K2 n2
K1 n1 K3 n2
K2
=
h/kT
K1 e
K3
(8.6)
K2 8h5
=
K1
c4
Hence, the ratio of the rate of stimulated emission to the rate of spontaneous emission
is given by
1
K3 n2
= h/kT
(8.7)
K2 n2
(e
1)
Thus, the existence of stimulated emission is proved. The energy density of the radiation field may be written as
=
8h5
1
4
h/kT
c (e
1)
Applied Optics
231
The first condition can be achieved by using a radiation field of very large energy
density. This is made possible by using an optical resonant cavity in which the photon
density can be enhanced by multiple reflections. The second condition requires a population inversion in which the number of electrons in the excited state is higher than
that in the lower state. This is a non-equilibrium condition and is facilitated by the
presence of metastable states. A metastable state is one which has a relatively longer
lifetime and electrons excited to these levels will come down to lower levels at a much
smaller rate than the rate at which they are excited. However, a stimulated emission
can be induced using a suitable radiation field.
8.2.3 Methods of achieving population inversion
The process of exciting the active atoms in a medium to higher energy states in order
to achieve population inversion is called pumping. There are several ways of pumping
but the most commonly used methods are the following:
(i) Optical pumping or excitation using photons: In this case, an external optical
source like a flash lamp is used to produce radiation field from which the energy
may be absorbed to produce excited atoms. This type of pumping is used in ruby
laser and Nd:YAG laser.
(ii) Use of electric discharge: In this case, direct electron excitation in a gaseous
discharge is used. This type of pumping is used in gas ion lasers like argon laser.
(iii) Use of atomic collisions: In this type of pumping, a mixture of two gases having
same excited state is subjected to an electric discharge. An excited atom upon
collision with an atom of the second gas transfers the energy and sends it to the
excited state. This type of excitation is used in Helium-Neon laser.
(iv) Use of chemical reactions: In this method, a chemical reaction results in a product whose atoms or molecules will be left in an excited state. Under such a condition, population inversion may occur. This type of pumping is used in chemical
lasers like Hydrogen fluoride laser.
8.2.4 Requirements of a laser system
A laser requires three major components for its operation. They are:
(i) Pumping mechanism,
(ii) Active medium and
(iii) Resonant cavity.
232
Applied Optics
Applied Optics
233
11
00
00
11
00
11
00
11
00
11
00
11
00
11
00
11
00
11
00
11
00
11
00
11
PUMPING MEDIUM
ACTIVE MEDIUM
100 %
REFLECTOR
11
00
00
11
00
11
00
11
00
11
00
11
00
11
00
11
00
11
00
11
00
11
LASER
99.8 %
REFLECTOR
E3
E4
E2
E3
LASER
LASER
E1
(a)
E2
E1
(b)
E1
(c)
Figure 8.4 Transitions in a (a) two level, (b) three level and (c) four level lasers.
In the case of two level laser, the population inversion will not be possible. This
is because, as more and more electrons are pumped to the metastable state, there will
be more stimulated emission and at any instant of time, there will be no more than
half the number of electrons in each state. As a result, no laser amplification occurs.
234
Applied Optics
Population inversion will be possible only when the absorption is to a level higher than
the metastable state as in the case of a three level laser. Pumping takes the electrons
from the ground state to the excited state. From this excited state, the electron makes a
transition to the metastable state by spontaneous emission. The laser emission occurs
when the electron in the metastable state is stimulated to undergo a transition to the
ground state. However, a reverse transition is also possible from the ground state to
the metastable state by a re absorption of the emitted radiation. This removes some
photons from the laser beam and hence the efficiency of the laser decreases. This
problem is overcome in a four level laser. In this case, the laser transition occurs to
an unstable intermediate state rather than the ground state. The electron decays to the
ground state rapidly. Thus, the initial and the final laser levels are separated from the
higher excited state and the ground state respectively. The probability of re absorption
of laser radiation is thus avoided which improves the efficiency of the laser.
8.3
Types Of Lasers
Lasers are generally classified as continuous wave lasers and pulsed lasers. Continuous
wave lasers give a continuous output of constant intensity and the pulsed lasers give
out pulses of output with a definite frequency. Pulsed lasers can have very high peak
power and also very short pulse durations. These lasers have their own advantages and
disadvantages and are selected depending on the application for which they are used.
Further, depending on the active material used for laser action, the lasers are classified
as solid state lasers, gas lasers, chemical lasers, semiconductor lasers, etc.
8.3.1 Ruby Laser
The first working laser was constructed using a ruby crystal. Ruby, chemically aluminium oxide with about 0.05 weight percent of chromium impurity, in a single crystalline form is used as an optical cavity. A cylindrical rod of the crystal, 5 to 20 cm in
length and 0.5 to 2 cm in diameter, with its ends cut parallel and polished is surrounded
by a helical xenon flash lamp (Fig. 8.5).
The ends of the rod are coated with a highly reflecting material like silver so that
the rod acts like a resonant cavity in which light intensity can build up by multiple
reflections. One end of the rod is made highly reflecting (100% reflectivity) and the
other partially transmitting (50 to 80% reflectivity) so that the laser beam emerges out
of the ruby rod.
In ruby laser, chromium ions are the active centres responsible for laser transition.
The energy level diagram for chromium ions in ruby crystal is shown in Fig. 8.6. The
Applied Optics
235
Ruby rod
Flash lamp
Fully
reflecting
surface
Partially
transmitting
surface
Power supply
Figure 8.5 Schematic diagram of a ruby laser.
absorption of energy from the flash lamp occurs in a band of energies in the green
region. This results in electrons from the ground state E 1 being raised to the band
E 3 . These excited levels are highly unstable and the electrons decay to the level E 2 .
During this decay process, energy equivalent to (E 3 E 2 ) will be given out as heat.
The level E 2 is a metastable state with a lifetime of the order of 5 ms. This results in
a population inversion when the rate of absorption is large enough and a stimulated
emission from the metastable state E 2 to ground state E 1 produces the laser output
with a peak intensity at 6943A . E 2 is a doublet and hence the output has an additional
emission line at 6928A .
Figure 8.6 Energy level diagram for chromium ions in ruby laser.
Once the stimulated emission begins, the metastable state is quickly depopulated.
Thus, the output from ruby laser is in pulses, closely related to the rate of build-up and
depletion of population in the metastable state. Further, the flash lamp used for optical
pumping produces light of many wavelengths not useful for pumping and hence is a
236
Applied Optics
waste of energy. Thus, ruby laser is inefficient and also unsuitable for many applications.
8.3.2 Helium-Neon laser
Helium-Neon laser is a gas laser useful for continuous operation. In gas lasers such
as He-Ne laser, the atoms are characterized by sharp energy levels and an electric
discharge is useful for pumping. The laser consists of a discharge tube, 10 to 50 cm
in length and 0.5 to 2 cm in diameter, filled with Helium and Neon gases in the ratio
10:1 upto a pressure of 1 torr. The electrodes are connected to a high voltage source
of a few kilovolts dc supply (Fig. 8.7). The reflecting mirrors are placed outside the
discharge tube to make the alignment easier.
Brewster
window
Fully
reflecting
mirror
Discharge tube
Power supply
Partial
reflecting
mirror
Applied Optics
237
Neon
Laser
Transitions
Spontaneous
emission
Figure 8.8 Energy level diagrams for Helium and Neon atoms.
atoms increases in number. As these levels are metastable having relatively longer
lifetime, they remain in excited state till they collide with Neon atoms. This results in
the transfer of electrons from E 1 to E 4 and E 6 levels in Neon atoms. In fact, all the
excited states of Neon consist of many closely spaced sub-levels and the energy of E 4
and E 6 levels in Neon is close to E 2 and E 3 levels in Helium respectively. Thus, E 4 and
E 6 levels in Neon will be preferentially populated over other levels. Laser emission
occurs due to three allowed transitions, namely E 6 to E 5 , E 6 to E 3 and E 4 to E 3 . Of
these, the transition from E 6 to E 3 is the prominent one with an emission wavelength of
6328A . The other two laser transitions are in the infrared region. The laser radiation
is also associated with a spontaneous emission of wavelength about 6000A due to
tansition from E 3 to E 2 levels. The deexcitation from E 2 to E 1 level of Neon occurs by
atomic collisions with the walls of the tube.
8.3.3 Semiconductor diode laser
The basic principle of working of a semiconductor diode laser is exactly the same as
the principle of operation of a light emitting diode. In case of a light emitting diode, the
phenomenon of injection electroluminescence results in the emission of radiation from
a forward biased p-n junction. However, the emission is incoherent and less intense.
As the current across the junction is gradually increased, a stage will be reached at
which stimulated emission also begins along with spontaneous emission.
In a diode laser, the p-n junction is formed using degenerate semiconductors. When
the dopant concentration is increased, the impurity levels will expand into bands and
238
Applied Optics
then merge with the nearest energy band. In an n-type semiconductor, the impurity
band overlaps with the conduction band and for a p-type semiconductor, the impurity
band overlaps with the valence band. Such semiconductors behave like metals and are
called degenerate semiconductors. When p-n junctions made out of these degenerate
semiconductors are forward biased, electrons and holes are injected across the junction
in large concentration (Fig. 8.9).
p
n
n
EF
Ec
Ec
p Ga As
Laser
n Ga As
Ev
Ev
p
EF
Inversion
region
Figure 8.9 Diode laser (a) basic structure and (b) energy level diagram.
This results in a condition identical to population inversion in the region close to
the junction and transitions from the bottom of the conduction band to the top of the
valence band can take place with the emission of radiation of energy equal to the band
gap energy.
In selecting the material for the fabrication of a diode laser, it is essential that the
recombination is a direct process without the involvement of traps or recombination
centers. Hence, direct band gap semiconductors are suitable materials for the diode
laser. Further, it must be possible to dope these semiconductors to very high impurity
concentration and to form a p-n junction. An efficient resonant cavity has to be constructed at the junction to ensure laser emission. For this purpose, the front and back
surfaces are made flat and parallel.
The device described above contains a single p-n junction made out of a single material. Hence, it is called a homojunction laser. The efficiency of laser may be increased
by using multiple layers(junctions) of different materials. Such devices are called heterojunction lasers. For example, GaAs-AlGaAs heterojunctions and InGaAsP/InP systems are particularly well suited for lasers used in fiber optic communication. By
choice of composition, lasers can be made in the infrared region of wavelength 1.3
to 1.55 m required for fiber optics. In many other applications, different wavelength
Applied Optics
239
ranges may be required for laser output. In pollution diagnostics, ternary alloy PbSnTe is used to provide laser output wavelength from 7 to 30 m. For intermediate
wavelengths, InGaSb system can be used.
8.4
Applications of lasers
In contrast to ordinary light, laser radiation is highly directional, monochromatic, coherent and very intense. The applications are base on these distinguishing features.
One of the major applications is in pure science to investigate the interaction of matter
with intense electromagnetic radiation. The directionality and coherence of the laser
beam are useful in the measurement of distances based on interferometric methods.
In the field of communication, laser has been used with proper modulation for information transmission. Because of the high intensity and energy associated with laser
beams, they can be utilized for applications such as welding, cutting and ablation of
materials. Laser has also been used for medical applications like treatment of dental
decay, destruction of tumors, treatment of skin diseases and eye surgery. Holography is another important application which helps in recording the amplitude as well
as the phase of light reflected from objects thereby preserving the three dimensional
information.
8.4.1 Industrial applications
Two lasers widely used for industrial applications are Nd:YAG laser and CO2 laser.
They are used for a variety of applications in industries. Laser drilling is used to
produce holes with very small diameters. In general, pulsed laser is used for drilling
applications. Each pulse of laser heats the surface very rapidly leading to the evaporation locally. Since there is no physical contact, there is no wear and breakage of
the tool. The holes may be located with high precision and even hard materials can be
drilled easily. It is also possible to drill holes in area difficult to reach and at difficult
angles.
Another industrial application of lasers is in laser cutting. It involves heating of
the metal to its melting point with the laser and applying a jet of a gas to remove the
molten metal. Laser cutting involves local heating and hence, the heat affected zone is
minimum in size.
A focused high intensity beam can be used for laser welding applications. When
focused to a small spot size, the heat generated melts the metal producing a welded
joint. It is necessary to control the laser power precisely during welding application to
restrict the molten zone and also to avoid loss of material through evaporation. On the
240
Applied Optics
other hand, laser welding has the advantage that the heat affected area is small, offers
better control of the welding process and can be used in otherwise inaccessible areas.
8.4.2 Medical applications
Laser has been used in medical sciences for surgery. The advantage of a laser beam
is that it can be focused into a fine beam of very small size thereby concentrating the
optical energy to a narrow spot. This can be used for the vapourization of tissues
which need to be removed. Typically continuous wave lasers like CO2 laser or Ar
laser of 50-100 watt power are used for surgical purposes. Lasers are widely used in
ophthalmology for welding detached retina and also in cataract surgery. It is also used
to correct the defects of the eye. Recently, lasers are finding applications in dentistry
and dermatology.
8.4.3 Estimation of atmospheric pollution
Pollution has become a major concern, especially in urban areas where industries and
automobiles have contributed to the presence of various harmful chemicals in atmosphere. There are various methods available for the analysis of atmospheric pollutants.
A laser beam also may be used to estimate the concentration of particulate suspension
like dust or smoke in atmosphere. For this purpose, a beam from a pulsed laser is made
to incident on the particulate matter. The back-scattered rays are collected using wide
area concave mirror and analysed for intensity. The intensity of the back-scattered
radiation is a measure of the density of the suspended particulate matter. It is also possible to locate the pollutants by measurement of the time delay between the emission
of the laser pulse and its receipt after scattering.
A chemical analysis of the pollutants is more involved and complex. It can be
carried out by a spectral analysis of the emitted or scattered radiation. Changes in
the wavelength of the received radiation are indicative of the possible interactions and
hence characteristic of the pollutants and their composition.
The method is neither convenient nor accurate to deserve a more detailed discussion.
8.4.4 Applications in basic sciences
Lasers have been used for various applications in basic sciences. This is mainly because lasers are monochromatic sources of high intensity and hence are useful tools
in the study of interaction of radiation with matter. High temperature environments,
Applied Optics
241
which are otherwise difficult to produce, may be easily achieved in localized regions
to study the physical and chemical changes. Processes like surface modification and
surface ablation can be studied with the help of laser radiation. Laser has been used to
accelerate certain chemical reaction and to synthesize certain new compounds. More
recently, lasers have been used to simulate situations similar to the ones existing in sun
and stars, and hence to study the thermonuclear fusion reactions in laboratories. The
construction of various types of lasers has widened the scope of their use for applications. It will not be surprising if we find them useful in our daily life. Laser printers
and scanners for reading bar codes are some examples.
8.5
Holography
242
Applied Optics
REAL
IMAGE
VIRTUAL
IMAGE
LASER
HOLOGRAM
This produces two images. One image, called the primary image, is virtual and the
second image, called the conjugate image, is real. The real image may be recorded on
a screen or photographed.
Applied Optics
243
8.6
Optical fibers
The use of optical fibers for transmitting light signals over large distances is now a
well known phenomenon. With the development of laser and flexible fibers, optical
fibers are being used extensively for various communication applications. Fiber optic communication has significant advantages over the transmission by conventional
coaxial cables. The loss of signal strength is considerably less in optical fibers and
hence permits transmission over long distances. Use of light waves in place of radio
and microwaves has improved the speed of communication.
8.6.1 Materials for optical fibers
An optical fiber consists of a central core surrounded by a cladding (Fig. 8.12). The
refractive index of the core material is higher than that of the cladding. The core
and cladding are made of either glass or plastic. Additional covering is provided to
improve the strength and also to protect it from moisture and mechanical wear. With
all the protective coverings, the optical fibers are still very thin, less than 0.1 mm in
diameter typically.
244
Applied Optics
Core
Cladding
Reinforcing and
protective coverings
(8.8)
where i and r are the angle of incidence and the angle of refraction respectively. When
the ray of light is traveling from a medium of higher refractive index to a medium of
lower refractive index, i.e., n1 > n2 , we can define a critical angle of incidence at which
the refracted ray just grazes the interface, i.e., r = 90 . The critical angle of incidence
ic is given by
sin ic = n2 /n1 for n1 > n2
(8.9)
The ray will be totally internally reflected into the denser medium when the angle of
incidence is greater than the critical angle. Thus, if we have a fiber consisting of a core
Applied Optics
245
n2
n2
n2
ic
t
n1
n1
n1
(a)
(b)
(c)
Figure 8.13 Propagation of light from a medium of higher refractive index n1, to a
medium of lower refractive index n2. When the angle of incidence is less than the
critical value, it is refracted (a), grazes the interface when incident at the critical
angle (b) and totally internally reflected when the angle of incidence is more than the
critical value (c).
of higher refractive index surrounded by a cladding, the light ray may be confined to
travel inside the fiber.
Consider a ray of light incident on the entrance aperture of an optical fiber at an
angle as shown in Fig. 8.14.
i
r
Core ( n1)
AIR
(n = 1)
Cladding (n2 )
246
Applied Optics
= n1 cos i
= n1 (1 sin2 i)1/2
(8.10)
When corresponds to the maximum angle m of entrance at which the ray is capable
of propagating inside the fiber, the angle of incidence at the core-cladding interface will
be equal to the critical angle ic . The ray will be propagated by total internal reflection
for all < m . Thus, m defines the semi-angle of the cone of acceptance. Substituting
for sin ic from equation (8.9), we get
n0 sin m = n1 (1 n22 /n21 )1/2
= (n21 n22 )1/2
(8.11)
The quantity sin m is called numerical aperture of the optical fiber. The numerical
aperture may also be expressed in terms of relative refractive index difference
which is defined as
(n1 n2 )
(n1 + n2 )/2
(n1 n2 )(n1 + n2 )
=
(n1 + n2 )2 /2
2
(n1 n22 )
2n21
(8.12)
Substituting the value of (n21 n22 )1/2 from this equation in equation (8.11), we get,
sin m = n1 (2)1/2
(8.13)
The numerical aperture is an important parameter and is a measure of the light collecting ability of the optical fiber. It is interesting to note that the parameter is independent
of the dimensions of the core. Typical values of the semi-angle of the cone of acceptance and the numerical aperture of optical fibers are 10 and 0.2 respectively.
We may define a quantity called normalized frequency V given by
V = ka(n21 n22 )1/2
(8.14)
where k is the propagation vector for the incident radiation, a is the core radius and n1
and n2 are the refractive indices of the core and cladding respectively. This can also be
Applied Optics
247
expressed in terms of numerical aperture and the relative refractive index difference
as
V = (2/) a (NA)
(8.15)
1/2
= (2/) a n1 (2)
(8.16)
The normalized frequency is a dimensionless quantity and is also called the V number
of the fiber. It gives the combined information about the three design variables for
the fiber namely, operating wavelength, core radius and the relative refractive index
difference.
The number of modes that can be propagated through a multi mode step index fiber
is given by
M = 1/2(2a/)2 (n21 n22 )
(8.17)
248
Applied Optics
(a)
(b)
(c)
Figure 8.15 Variation of refractive index of core and cladding in (a) single mode step
index fiber, (b) multimode step index fiber and (c) multimode graded index fiber.
(a)
(b)
(c)
Figure 8.16 Propagation of light in (a) single mode step index fiber, (b) multimode step
index fiber and (c) multimode graded index fiber.
Applied Optics
249
In a multimode fiber, there are many such paths possible for the propagation. The
propagation in step index fibers is also referred to as due to reflective confinement.
In a graded index fiber, the refractive index of the core is maximum at the centre
of the core and decreases gradually and becomes equal to that of the cladding at the
interface between core and cladding. The propagation in such fibers is by refractive
confinement.
8.6.4 Signal distortion in optical fibers
Propagation of a signal through the optical fiber involves total internal reflection of
light rays many times. Further, the rays are reflected at various angles. The rays
reflected at higher angles travel greater distances than the rays reflected at low angles.
As a result, all the rays do not arrive at the end of the fiber simultaneously and the light
pulse broadens as it travels through the fiber. Since the output pulse does not match
with the input pulse, the signal is said to be distorted.
When white light source is used instead of a monochromatic radiation, another kind
of distortion occurs. Since radiation of different wavelengths has different velocities,
they do not arrive at the output simultaneously. This distortion is called chromatic
dispersion.
The signal distortion is quite considerable in multimode step index fibers. In graded
index fibers, the light travels with different velocities in different parts of the core as
the refractive index varies radially along the core. The rays travel faster near the outer
edge of the core(since the refractive index is smaller) than near the centre. Hence, all
the rays arrive at the output almost at the same time and the signal distortion is reduced.
In a single mode step index fiber also the distortion is less than that in multi mode step
index fibers.
8.6.5 Signal attenuation in optical fibers
Attenuation is the loss of optical power as light travels down a fiber. It is measures
in decibels (dB/km). Over a set distance, a fiber with a lower attenuation will allow
more power to reach its receiver than a fiber with higher attenuation. If Pin is the input
power and Pout is the output power after passing through a fiber of length L, the mean
attenuation constant of the fiber, in units of dB/km, is given by
= (10/L)log10 (Pin /Pout )
(8.18)
Attenuation can be caused by several factors, but is generally placed in one of two
categories: intrinsic or extrinsic.
250
Applied Optics
8.7
Optical fibers find extensive applications in communication of optical signals. Transmission using optical frequencies has the advantage of higher speed of transmission
and hence greater volume of information carried. They find application in all experimental arrangements where light is transmitted from one point to another without loss.
A few important applications of fiber optics are mentioned here.
8.7.1 Fiber optic communication
A conventional communication system consists of a transmitter which is used to transmit a carrier wave, usually at radio or microwave frequency, modulated by the signal.
The medium of transmission may be wires or free space. The signal is recovered by
demodulation at the receiving end.
In a conventional optical fiber communication system, the input signals, audio,
video or other digital data, are used to modulate light from a source like a light emitting
Applied Optics
251
TRANSDUCER
A/D
MODULATOR
(MICROPHONE)
CONVERTER
(LASER DIODE)
DEMODULATOR
(PHOTO DIODE)
D/A
CONVERTOR
TRANSDUCER
(RECEIVER)
DESTINATION
OF
INFORMATION
252
Applied Optics
(vii) The life span of optical fibers is considerably high at 25-30 years as compared to
around 15 years of life of copper cables.
(viii) Optical fiber links are more economical compared to copper cables because of
the longer life span, higher resistance to temperature and corrosion and ease of
maintenance of optical fibers.
8.7.2 Applications in medicine and industry
Optical fibers are also useful for medical applications for visualization of internal portions of the human body. They can also be used for the examination of visually inaccessible regions for engineering applications. A typical example of a flexible fiberscope
(endoscope) is shown in Fig. 8.18.
OBSERVATION
PROBE
SOURCE OF
ILLUMINATION
Numerical Examples
8.1 Calculate the numerical aperture and hence the acceptance angle for an optical
fiber whose core and clading has refractive index of 1.45 and 1.40 respectively.
Solution: If n1 and n2 are the refractive indices of core and clading respectively, then, numerical aperture is given by,
sin m = (n21 n22 )1/2
Applied Optics
253
= 0.38 (Ans.).
Acceptance angle m = sin1 (0.38)
= 22.3 (Ans.).
8.2 A He-Ne laser emits light at a wavelength of 632.8nm and has an output power
of 5mW. How many photons are emitted each second by this laser?
Solution:
Wavelength of emitted light = 632.8 109 m.
5 103
Number of photons emitted =
3.138 1019
= 1.59 1016 photons/sec (Ans.).
8.3 Find the ratio of population of the two energy states of the Ruby laser the transition between which is responsible for the emission of photons of wavelength
694.3nm. Assume the ambient temperature as 27C.
Solution:
Energy of the emitted radiation = (E 2 E 1 ) = hc/
The ratio of population (N2 /N1 ) between two energy states E 2 and E 1 is given
by the Boltzman relation,
(N2 /N1 ) = exp [(E 2 E 1 )/kT ]
2.86 1019
1.38 1023 300
= exp 69.08
= exp
254
Applied Optics
Note: If this is the ratio of population, then there is no population inversion
and there will be no laser emission. It should be noted that in ruby laser, the
energy state E 2 is a meta stable state and hence Boltzman relation cannot be
used to compute the population in that state.
Exercise
8.1 What are the advantages of optical communication over the other conventional
types of communication?
(March 1999).
8.2 Explain the terms spontaneous, stimulated emissions and population inversion.
Briefly explain the construction and working of a ruby laser with energy level
diagram.
(March 1999).
8.3 With a neat diagram, explain numerical aperture, and ray propagation in an optical fiber. Describe the types of optical fibers and modes of transmission.
(March 1999).
8.4 Describe the principle on which optical fiber works. Mention their applications.
(August 1999).
8.5 What is lasing action? Describe the working of He-Ne laser with the help of
energy level diagram. Mention industrial applications of lasers. (August 1999).
8.6 Distinguish between luminescence, fluorescence and phosphorescence. How are
these phenomena explained on the basis of energy band picture? (March 2000).
8.7 Explain the terms spontaneous emission, stimulated emission and population
inversion. With energy level diagram explain the working mechanism of a HeNe gas laser.
(August 2000).
8.8 Explain types of optical fibers. Mention advantages and limitations of optical
communication system.
(August 2000).
8.9 Explain how transmission of light takes place in optical fibers. Discuss different
types of optical fibers.
(March 2001).
8.10 Explain how lasing action takes place in He-Ne laser.
(August 2001).
8.11 Explain the principle of optical fibers. Derive an expression for the numerical
aperture. Calculate the numerical aperture of a given optical fiber if the refractive
indices of core and cladding are 1.623 and 1.522 respectively. (August 2001).
Applied Optics
255
8.12 Explain the conditions required for laser action. Describe the construction and
working of a He-Ne laser with necessary energy level diagram. (March 2002).
8.13 Explain the terms modes of propagation, cone of acceptance and numerical aperture.
(March 2002).
8.14 Calculate the numerical aperture and angle of acceptance for an optical fiber
having refractive indices of 1.565 and 1.498 for core and cladding respectively.
(March 2002).
8.15 Discuss the point to point optical fiber communication system. Mention the
advantages of optical fiber communication over the conventional communication
system.
(Feb 2003).
8.16 Obtain an expression for energy density of photons in terms of Einstein?s coefficients.
(Aug 2003).
8.17 Explain the construction and working of He-Ne gas laser.
(Aug 2003).
(Aug 2003).
8.19 An optical glass fiber of refractive index 1.450 is to be clad with another glass to
ensure total internal reflection that will contain light travelling within 5o of the
fiber axis. What maximum index of refraction is allowed for the cladding?
(Feb 2004).
8.20 Discuss the applications of laser.
(Aug 2004).
8.21 Find the ratio of population of two energy levels out of which one corresponds
to metastable state if the wavelength of light emitted at 330 K is 632.8 nm.
(Aug 2004).
[Note: It is not possible to solve this example if we assume a level to be metastable since we cannot use Boltzman relation].
Chapter 9
9.1
Ceramics
Ceramics are compounds between metallic and non-metallic elements. The term ceramics comes from the greek word Keramikos, which means burnt stuff. These
materials are usually bad conductors of heat and electricity. They are resistant to high
temperature and corrosive environments, but are usually brittle. Chemically, they are
usually oxides, nitrides or carbides.
Depending on the application for which they are used, ceramics are classified as
glasses, clay products, refractories and cements. The properties and applications of the
various categories of ceramics are discussed below.
9.1.1 Glasses
Glasses are non-crystalline silicates. Oxides like Na2 O, CaO, Al2 O3 , B2 O3 , etc., are
added to modify their properties to suit various applications.They are optically transparent and easy to fabricate. Glasses are produced by heating the raw materials to a
temperature above the melting point of all the constituents. In order to ensure transperancy, the melt should be mixed homogeneously and made free from bubbles. Upon
cooling, the melt becomes more and more viscous, passes through a state of supercooled liquid and then solidifies into glass.
The process of formation involves inclusion of internal stress, called thermal stresses.
These defects are eliminated by heat treatments like annealing and tempering. Anneal256
257
ing is a process in which the glass is heated to a temperature called annealing point.
At this temperature, atomic diffusion is rapid and any residual stress may be removed.
Then, it is slowly cooled to room temperature. The temperature at which transition
occurs from supercooled melt into glass is called the glass transition temperature. In
tempering, the glass is heated to a temperature above glass transition temperature but
below the softening point. Then it is cooled to room temperature in a jet of air or in
an oil bath. Tempered glasses are used for applications where high strength is needed.
This includes doors, automobile windshields, eye lenses, etc.
9.1.2 Clay products
Clay is one of the most widely used raw material. They are used in building bricks,
tiles, porcelain tableware and sanitary ware. They are alumino silicates containing
chemically bound water. For applications, they are mixed with minerals like flint,
quartz and a flux like feldspar. For example, porcelain used for tableware may have a
composition of 50% clay, 25% quartz and 25% feldspar.
The formation of clay products includes grinding the raw materials into a fine powder and mixing with water to a desirable consistency. They are drawn into required
shape by extrusion technique or moulded using a porous mould when water evaporates
to leave solid material. The product so obtained may contain some water and needs
to be dried. The process of drying is assisted by controlled air flow at suitable temperature. It may also be subjected to firing by heating it at a temperature in the range
90014000C to enhance its mechanical strength. During the process of heat treatment,
formation of liquid glass may lead to filling up of pores with an increase in the density
of the material. This process is known as vitrification.
9.1.3 Refractories and abrasives
Refractories are materials capable of withstanding high temperature without melting
or decomposing. They also remain inert to the atmosphere. They are useful in the manufacture of bricks and as lining material for furnaces. Chemically, they are mixtures
of oxides like S iO2 , Al2 O3 , MgO, etc. For commercial applications, the raw material
with the required composition is taken in the form of a powder. Upon firing, the particles undergo a bonding phase thereby increasing the strength. Porosity is an important
parameter and must be controlled to obtain suitable material properties.
Abrasive ceramics are materials which are known for their hardness or wear resistance. They possess a high degree of toughness. Diamond is the best known abrasive
with a high value for its hardness, but is relatively expensive. Other common ceramic
258
abrasives include silicon carbide, tungsten carbide, aluminum oxide, silica, etc. They
are used in several forms depending on the application as a powder, as a coating bonded
to grinding wheels, etc. Grinding, lapping and polishing wheels make use of abrasive
powders in a liquid medium.
9.1.4 Cements
There are some inorganic ceramic materials grouped as cements. They have a characteristic property of forming a paste when mixed with water, which subsequently
hardens. Examples are Portland cement, plaster of paris, lime, etc. They are useful in
solid and rigid structures as construction material.
9.1.5 Cermets
A cermet is a composite material composed of ceramic (cer) and metallic (met) materials. A cermet is ideally designed to have the optimal properties of both a ceramic and
those of a metal. Ceramics possess basic physical properties such as a high melting
point, chemical stability, and especially oxidation resistance. The basic physical properties of metals include ductility, high strength and high thermal conductivity. The
metal is used as a binder for an oxide, boride, carbide, or alumina. Generally, the
metallic elements used are nickel, molybdenum, and cobalt. Depending on the physical structure of the material, cermets can also be metal matrix composites, but cermets
are usually less than 20% metal by volume.
Cermets are used in the manufacture of resistors (especially potentiometers), capacitors, and other electronic components which may experience high temperatures.
They are used for the high-temperature sections of jet engines as well as high temperature turbine blades. Spark plug is also another example of a cermet. It is an electrical
device used in some internal combustion engines to ignite the fuel by means of an
electric spark. Ceramic parts have been used in conjunction with metal parts as friction materials for brakes and clutches. Bioceramics are materials that can be used in
the human body. They can be in the form of thin layers on metallic implants, composites with a polymer component, or even just porous networks. These materials work
well within the human body for several reasons. Cermets are also used in dentistry as
a material for fillings and prostheses.
9.2
259
Composite Materials
260
9.3
Smart Materials
Smart materials have one or more properties that can be dramatically altered by varying
the condition to which they are subjected. Most everyday materials have physical properties which cannot be significantly altered. A variety of smart materials already exist
and are being studied extensively. These include piezoelectric materials, magnetorheostatic materials, electro-rheostatic materials and shape memory alloys.
Each individual type of smart material has a different property which can be significantly altered, such as viscosity, volume, and conductivity. The property that can
be altered influences what types of applications the smart material can be used for.
Piezoelectric materials have a unique property which makes them useful as a smart
material. When a piezoelectric material is deformed, it produces a small but measurable electrical charge on the sample. Conversely, when an electrical signal is applied to
a piezoelectric material it experiences a significant change in size. Hence, piezoelectric
materials are most widely used as sensors.
261
9.4
Shape memory alloys are metals, which exhibit two very unique properties, namely,
elasticity and shape memory effect. Arne Olander first observed these unusual properties in 1938, but no were any serious research work was made in the field of shape
memory alloys until the 1960s. The most effective and widely used shape memory
alloys include Nitinol (NiT i), CuZnAl, and CuAlNi.
The two unique properties mentioned above are made possible through a solid state
phase change which occurs in the shape memory alloy. A solid state phase change is
one in which a molecular rearrangement occurs with the material remaining a solid. In
most shape memory alloys, a temperature change of only about 10C is sufficient to
initiate the phase change. The two phases, which occur in shape memory alloys, are
Martensite, and Austenite.
Martensite is the low temperature phase of the material. It is the relatively soft and
may be easily deformed. The molecular structure in this phase is twinned. Austenite
has a cubic structure and occurs at higher temperatures. The un-deformed Martensite
phase is the same size and shape as the cubic Austenite phase on a macroscopic scale,
so that no change in size or shape is visible in shape memory alloys until the Martensite
is deformed.
The shape memory effect is observed when a piece of shape memory alloy is cooled
below a critical temperature (Fig.9.1). At this stage the alloy is composed of Martensite
in its twinned form. This sample can be easily deformed by applying a suitable load.
The deformed Martensite is transformed to the cubic Austenite phase by heating it
to above a critical temperature. By cooling the alloy, the original twinned martensite
structure may be regained.
262
Temparature
Austenite
Heating
Cooling
Loading
Twinned martensite
Deformed martensite
Load
Pseudo-elasticity occurs in shape memory alloys, without a change in temperature,when the alloy is completely composed of Austenite. If the load on the shape
memory alloy is increased, the Austenite gets transformed into Martensite simply due
to the loading. When the loading is decreased, the Martensite begins to transform back
to Austenite and the sample springs back to its original shape.
Some of the main advantages of shape memory alloys include their attractive mechanical strength and corrosion resistant properties. These alloys are expensive compared to other materials such as steel and aluminum. They have poor fatigue properties
and a steel component may be hundred times more durable than a SMA element.
Shape memory alloys are being used for a variety of applications. They have been
used for military, medical and robotics applications. Nitinol couplers are used in F-14
fighter planes to join hydraulic lines tightly. It is used in robotics actuators and ma-
263
9.5
Microelectromechanical Systems
264
(a)
(b)
Figure 9.2 Working of a piezoelectric sensor crystal sample (a) in the absence of applied force and (b) in presence of applied force.
(a)
(b)
Figure 9.3 Schematic of a strain gauge (a) and the experimental set up (b).
A strain gauge consists of a very thin, long metal wire, arranged in the form of a
meandering pattern and bonded to the surface under study. It can also be in the form of
a thin film deposited on the surface. Any mechanical deformation of the surface results
in a change in the dimensions of the metal wire leading to a corresponding change in
its electrical resistance. The strain gauge may form one of the arms of a Wheatstones
net to determine accurately the change in the resistance. The change in the resistance
will be proportional to the stress to which the surface is subjected.
265
Semiconductor materials, when used as Hall probes, will work as magnetic sensors.
There are a few materials which show a large magneto-resistance, a phenomenon in
which application of a magnetic field results in changes in the electrical resistance of
the materials. Such materials can also be used as magnetic sensors.
Radiation sensors are very common and are used in various instruments as detectors of radiation. They are referred to as photo detectors and can be made sensitive
to radiations in the range from infrared to x-rays. Photo conducting materials and
photodiodes are used in these sensors.
9.5.2 Actuators
Actuators are devices which produce mechanical effects as a response to an applied
electrical signal. Reverse piezoelectric effect, i.e., deformation of a solid when subjected to an electric field, is an example.
Bi-metallic strip is an example of a thermal actuator. It consists of two metals
having widely different values for the coefficient of thermal expansion. The metals are
taken in the form of strips and are bonded to each other. When this bi-metallic strip is
heated, because of the unequal expansion of the two metals, it bends into an arc. The
device can be used to make or break circuits.
In piezoelectric actuators, application of an electric field to the opposite faces of
a single crystalline piezoelectric crystal like quartz results in a change in the dimensions of the crystal. The crystal regains its original shape and size when the applied
field is removed. Further, the change in size may be positive or negative(increase or
decrease) depending on the direction of the applied electric field. This is an example
of a mechanical actuator.
9.6
Nano Materials
Nano technology deals with structures of matter having dimensions of the order of a
nano meter. Even though the term nano technology is relatively new, structures and
devices of nanometer dimensions are known for a long time. Roman glass makers
are known to have used nano sized metal particles to produce coloured glasses. Photographic films contain silver halide which decomposes when exposed to light producing
nano particles of silver. Richard Feynman, in a lecture at a meeting of the American
Physical Society, predicted the potential applications of nano materials.
Nano materials exhibit properties strikingly different from those of bulk materials. This is because every property of a material has a characteristic length associated
with it. For example, the electrical resistance of a material is due to the scattering of
266
conduction electrons away from the direction of flow by collision with lattice atoms,
impurities, etc. The effect of scattering on the resistance depends on how many such
collisions takes place per unit distance of travel or the average distance travelled by the
electron before getting scattered. This characteristic length is called the mean free path
of conduction electron. When the dimension of the solid sample becomes comparable
to this characteristic length, the fundamental property of electrical resistance changes
and becomes a function of the dimension of the sample. This is called quantum size
effect and is responsible for the characteristic properties exhibited by the nano phase
of materials.
Nano materials are classified as quantum wells, quantum wires and quantum dots.
In a three dimensional structure, if one dimension, say thickness, is of nano size, then
the structure is called quantum well. If two dimensions are of nano size, then it is called
a quantum wire and if all the three dimensions are of nano size, then it is called a quantum dot. The word quantum is associated with the structures because the properties
exhibited by them are described by quantum mechanics.
In a bulk metal, for example, the conduction electrons are free to move throughout
the entire conducting medium. In other words, the electrons are completely delocalized. When one or more dimensions of the sample become small, say of the order of
a few atomic spacings, the delocalization of the electrons is restricted. The electrons
experience confinement as their movement is restricted in those dimensions which are
small. The number of electrons with a particular energy becomes a function of the size
of the sample. The density of states for a bulk metal, which is a measure of the number
of allowed energy states available for occupation at various energy values, is shown in
Fig.3.5. In the case of nano structures, the dependence of density of states on energy
will also get modified depending on the degree of confinement.
The dependence of density of states on energy for different quantum structures is
shown in the figure. Density of states g(E) versus Energy E in the case of (a) bulk material, (b) a quantum well, (c) a quantum wire, (d) a quantum dot and (e) an individual
atom.
In the case of a bulk sample, since the number of electrons interacting is very large,
the density of states as a function of energy is a continuum of available states at various
energies. As the size of the sample reduces in one or more dimensions, there will be
restrictions on the number of available states at different energies. As we go from bulk
to quantum well to quantum wire to quantum dot, the density of states approaches that
for an individual atom. This results in drastic modifications in the electrical, thermal
and other properties of nano structures.
g(E)
(a) bulk
material
267
g(E)
(b) a quantum
well
g(E)
(c) a quantum
wire
g(E)
g(E)
268
ambient to 500 C. These deposits have particles or grains in the range of 1 to 100 nm
size. The advantages associated with vacuum deposition process are high deposition
rates and economy. However, the deposition of many compounds is difficult.
Chemical Vapour Deposition (CVD) is a well known process in which a solid is
deposited on a heated surface via a chemical reaction from the vapour or gas phase.
The energy necessary for the chemical reaction to take place can be provided by several
methods. In thermal CVD the reaction is activated by a high temperature above 900
C. A typical apparatus comprises of gas supply system, deposition chamber and an
exhaust system. The reaction may be activated by plasma at elevated temperatures. The
chemical reaction may also be induced by laser radiation which has sufficient photon
energy to break the chemical bond in the reactant molecules. Typical nanocrystalline
materials like SiC, Si3 N, Al2 O3 , TiO2, SiO2 , ZrO2 with average particle size of a few
nanometre have been synthesized by this technique.
Sol-gel processing is a wet chemical synthesis approach that can be used to generate nanoparticles. It involves formation of colloidal suspension (sol) in continuous
liquid phase (gel). A catalyst is used to start reaction and control pH. Sol-gel formation
occurs in four stages namely, hydrolysis, condensation, growth and agglomeration of
particles. By controlling the growth parameters, it is possible to vary the structure and
properties of sol-gel derived inorganic networks. The significant potential of nanomaterial synthesis and their applications is yet to be explored. Understanding more of
synthesis would help in designing better materials.
Unlike many of the methods mentioned above, mechanical attrition produces its
nanostructures not by cluster assembly but by the structural decomposition of coarser
grained structures (top down approach). The ball milling and rod milling techniques
belong to the mechanical alloying process which has received much attention as a
powerful tool for the fabrication of several advanced materials. Mechanical alloying is
a unique process, which can be carried out at room temperature.
A ball mill, a type of grinder, is a cylindrical device used in grinding (or mixing)
materials like ores, chemicals, ceramic raw materials and paints. Ball mills rotate
around a horizontal axis, partially filled with the material to be ground plus the grinding
medium. Different materials are used as media, including ceramic balls, flint pebbles
and stainless steel balls. An internal cascading effect reduces the material to a fine
powder. Industrial ball mills can operate continuously, fed at one end and discharged
at the other end. High-quality ball mills are potentially expensive and can grind mixture
particles to as small as 5 nm, enormously increasing surface area and reaction rates.
The grinding works on principle of critical speed. The critical speed is the speed after
which the steel balls (which are responsible for the grinding of particles) start rotating
269
along the direction of the cylindrical device thus causing no further grinding.
9.6.2 Applications of nano materials
Although there has been much hype about the potential applications of nanotechnology, the current applications are limited to the ones developed in yesteryears. These
include nanoparticles of titanium dioxide, silver, zinc oxide etc in sunscreen, cosmetics, food packaging, clothing, disinfectants, household appliances, paints and outdoor
furniture varnishes. One proposed application is the development of so-called smart
materials. This term refers to any sort of material designed and engineered at the
nanometer scale to perform a specific task, and encompasses a wide variety of possible
commercial applications. A nanosensor would be a smart material, involving a small
component within a larger machine that would react to its environment and change in
some fundamental, intentional way.
A promising field for the applications of nanotechnology is in the practice of
nanomedicine. This involves the creation of nanoscale devices for improved therapy and diagnostics. Such nanoscale devices are known as nanorobots or nanobots.
These nanobots have the potential to serve as vehicles for delivery of medicine to repair metabolic or genetic defects. Similar to the conventional or macroscopic robots,
nanobots would be programmed to perform specific functions and be remotely controlled, but possess a much smaller size, so that they can travel and perform desired
functions inside the human body.
Band gap engineering involves tailoring of band gaps with the intent to create unusual electronic transport and optical effects, and novel devices. Most of the devices
based on semiconductor nanostructures are band gap engineered quantum devices.
Lasers fabricated using single or multiple quantum wells as the active region have been
extensively studied over the last two decades. Quantum well lasers offer improved performance with lower threshold current and lower spectra width as compared to that
of regular double heterostructure lasers. Quantum dots have been used in lasers and
detectors. Nanostructures have been used for making photo-electrochemical cells for
high efficiency conversion of light to electrical power due to its large surface area at
which photo-electrochemical processes occur.
Applications of molecular nanotechnology to mechanical engineering will be aimed
at realizing some mechanical systems on nano scale. The simplest mechanical system
we can think of is a mechanical bearing. A conventional bearing consists of a shaft and
a sleeve, with the relative motion of these components being facilitated by a suitable
lubricant. The efficiency of such mechanical bearing lies in ensuring minimum wear
and tear of the components along with minimum friction between them. A design of
270
(a) shaft
Figure 9.4 Components of a nano mechanical bearing showing (a) shaft and (b) sleeve
as seen along their axis.Each circle represents a group of atoms.
271
(9.1)
where L represents a linear dimension. Expressed in nano scale, a stress of 1010 Nm2
will be equal to (1010 /1018 )Nnm2 , i.e., 10nNnm2 . Similarly, shear stiffness which
increases with sample area but decreases with increasing length, is proportional to L.
272
A sheer stiffness of 1012 Nm1 can be expressed as 103 Nnm1 . Hence, deformation can
be written as
de f ormation f orce/stiffness L.
(9.2)
Assuming the density to be constant,
Mass volume L3 .
(9.3)
The mass of a cubic nanometre block of a material of density 5 103 kgm3 will be
5 1024 kg. Hence,
Acceleration (force/mass) (L2 /L3 ) L1
108 Nnm2 /5 1024 kg
2 1015 ms2 .
(9.4)
(9.5)
Thus, a cubic nano metre sample will experience large acceleration as compared to
macroscopic systems. Also, the effect of gravitational acceleration will be negligible
on nano mechanical systems.
These calculations are based on the assumption that scaling laws are applicable
even when we consider nano systems. It is to be realized that a transition from macro
to micro and nano scales are associated with major changes in the conditions of construction and operation. In nano systems, for example, the surface becomes more
important than the volume. However, the influence of the surfaces on the properties
are neglected.
In electromagnetic systems, classical scaling laws have to be applied more carefully
because quantum effects become dominant at small dimensions. If we assume the
electrical resistivity as a material constant,
Resistance (length/area) L1
(9.6)
Assuming an electrical resistivity of 1.5107ohmm, the resistance of a cubic nanometre of copper would be 150 ohms. This result has to be examined carefully for its validity since the calculation has ignored the effect of size on the electrical resistivity of
the material.
Similarly, the models for thermal conductivity in solids also breakdown for nano
systems since the thermal phonons are associated with a mean free path much larger
than the dimensions of the structure itself.
It may be concluded that the scaling laws based on classical models are not suitable
for describing the behaviour of nano systems.
273
274
C60 is a yellow powder which turns pink when dissolved in certain solvents such
as toluene. When exposed to strong ultraviolet light, it polymerizes forming bonds between adjacent balls. In the polymerized state, the C60 no longer dissolves in toluene.
This property makes it useful as a photoresist in photolithographic processes. Other
physical and chemical properties are being studied to evaluate the material for further
applications.
9.6.5 Carbon nano tubes
Carbon nano tubes are cylindrical fullerenes. They are tubes of carbon with a diameter
of a few nanometer and several millimeters in length. They may be obtained as single
walled tube or multi-walled tube with open or closed ends (Fig.9.6).
9.7
Liquid Crystals
The intermediate phase between the solid and the liquid is called the liquid crystal
phase. Liquid crystal materials generally have some unique characteristics. They usually have a rod-like molecular structure, with the molecules showing a tendency to
275
point along a common axis. This common axis is called the director. This is in contrast to molecules in the liquid phase, which have no intrinsic order. In the solid state,
molecules are highly ordered and have little translational freedom (Fig.9.7). The characteristic ordering in the liquid crystal state is between the traditional solid and liquid
phases. Substances that arent as ordered as a solid, but do have some degree of alignment are called liquid crystals.
(a)
(b)
(c)
Figure 9.7 Arrangement of molecules in the three phases, (a) solid, (b) liquid crystal
and (c) liquid.
To quantify the order present in a liquid crystalline material, an order parameter S
is defined as
S = (1/2) < 3 cos2 1 >
(9.7)
where is the angle between the director and the axis of each individual molecule.
In an isotropic liquid, the order parameter will be zero and for a perfect solid, it will
be equal to 1. Typical value for the order parameter of a liquid crystal depends on
temperature and may be anything between 0.3 and 0.9.
9.7.1 Classification of liquid crystals
There are many types of liquid crystal states, depending upon the nature and amount
of order in the material. The prominent ones are :
(1) Nematic phase
(2) Smectic phase
(3) Cholesteric phase.
276
The nematic liquid crystal phase is characterized by molecules that have orientational order but no positional order (Fig. 9.8). The molecules can flow and their
position is randomly distributed in the liquid. But, they have a tendency to point towards a particular direction leading to a finite orientational order. As a consequence,
nematic liquid crystals are strongly anisotropic.
Figure 9.8 Orientational order in nematic liquid crystal. The arrow represents the
director.
The smectic phase is another distinct phase of liquid crystal substances. Molecules
in this phase show some degree of translational order in addition to the orientational
order (Fig.9.9). In the smectic state, the molecules maintain the general orientational
order of nematics, but also tend to align themselves in layers or planes. The motion
is restricted to within these planes but the layers themselves can move past each other.
The increased order means that the smectic state is more solid-like than the nematic.
There are many types of smectic phases possible with different kinds of ordering.In the
smectic-A mesophase, the director is perpendicular to the smectic plane, and there is
no positional order within the layer. In smectic-B, the smectic plane is perpendicular to
the director, but the molecules are arranged in a network of hexagons within the layer.
In the smectic-C phase, molecules are arranged similar to that in the smectic-A phase,
but the director is not perpendicular to the smectic plane.
The cholesteric liquid crystal phase is composed of molecules containing a chiral
center which produces intermolecular forces that favor alignment between molecules
at a slight angle to one another. This leads to the formation of a structure which can
be visualized as a stack of very thin 2-D nematic-like layers with the director in each
layer twisted with respect to those above and below (Fig.9.10).
277
(a)
(b)
(c)
Figure 9.9 Three major types of smectic liquid crystals. (a) Smectic A phase, (b) Smectic B phase as seen along the director axis and (c) Smectic C phase.
1
P
2
Figure 9.10 Cholesteric liquid crystal showing the rotation of the director in a distance
equal to half the pitch.
278
11111
00000
00000
11111
00000
11111
00000
11111
00000
11111
00000
11111
00000
11111
00000
11111
00000
11111
00000
11111
00000
11111
6
Figure 9.11 Schematic of a liquid crystal display unit consisting of (1) polarizer,
(2)electrode, (3) liquid crystal layer, (4) back electrode, (5) second polarizer crossed
with the first and (6) reflector.
It consists of a liquid crystal layer sandwiched between two polarizers that are crossed
with each other. The liquid crystal selected is a twisted one so that the light passing
through the first polarizer is reoriented to pass through the second polarizer also. When
an electric field is applied between the two transparent indium-tin oxide electrodes, all
the molecules in the liquid crystal align parallel to the electric field. Since the two
polarizers are crossed, the light does not get transmitted through the second polarizer.
A reflector is placed after the second polarizer to reflect any light incident on it. Thus,
electric field can be used to switch a pixel on or off.
Liquid crystals have a many other uses. They can be used as thermal sensors since
many of their properties are very sensitive to temperature. They are now being explored for optical imaging and recording applications. They are used for nondestructive mechanical testing of materials under stress. As new properties and types of liquid
279
crystals are being investigated, these materials are gaining increasing importance in
industrial and scientific applications.
9.8
Non destructive testing(NDT)is the technique of determining the quality of a product without in any way affecting its properties. It is emerging as an important interdisciplinary technique useful for meeting the requirements of reliability and safety.
Based on the principle used for testing, the technique may be classified as follows:
(1) Radiographic methods
The basic principle involved in non destructive testing is to allow energy in some
form to pass through the test sample and measure the effects like changes in amplitude,
intensity, velocity, frequency, phase, etc of the transmitted signal.
9.8.1 Radiographic methods
In radiographic methods of NDT, radiations like x-rays or gamma rays are used as the
probe. Recently, electron and neutron beams are also being used as probes for certain
applications. When a radiation of suitable energy and intensity is made to incident
on the specimen under test, it penetrates through the material. The intensity of the
transmitted radiation will be modified by the presence of defects. The defects may be
in the form of deviations from the periodic arrangement of atoms in the lattice or may
be in the form of inclusion of foreign materials in the specimen. Such defects absorb
or scatter the radiation differently from the other regions of the sample and hence
result in a modification of the transmitted intensity. These changes in the intensity
of transmitted radiation may be recorded and analyzed using photography or other
methods.
Radiation for a specific NDT application is selected on the basis of their suitability
and efficiency. X-rays are often used for the analysis of defects in crystalline solids.
Since the wavelength of x-rays can be of the order of magnitude of the inter-atomic
distances in crystalline solids, they produce significant diffraction contrast leading to
easy detection of crystal defects. X-rays are also used in the analysis of flaws, inclusions, pinholes, cracks etc generated in welding or casting processes. Gamma rays are
more energetic and can be used for thicker samples. In the case of materials of high
atomic number, absorption poses a serious limitation to NDT techniques using x-rays
280
or gamma rays. Alternatively, electron or neutron radiography may be suitable for such
applications.
9.8.2 Ultrasonic methods
Ultrasonic waves are mechanical waves, like sound waves, with a frequency above
the audible range. Similar to sound waves, they produce local changes in the density
of the medium in which they are travelling and propagate with a velocity determined
by the medium. Ultrasonic waves are generated by making use of piezoelectric effect
or magnetostriction. Piezoelectric oscillators are widely used since they give a wide
range of frequencies from 20 kHz to 10 GHz, where as magnetostriction oscillators
can be used normally upto 100 kHz. Piezoelectric transducers are made from materials
showing piezoelectric effect, like quartz, tourmaline, lithium sulphate, etc. A typical
experimental set up is shown in Fig.9.12.
HT
QUARTZ
CRYSTAL
281
Ferromagnetic
rod
HT
1
0
0
1
0
1
0
1
SAMPLE
AMPLIFIER
A
B
HORIZONTAL
SWEEP
GENERATOR
282
The velocity of ultrasonic waves in a solid medium depends on the density of the
material and also on the elastic constants. In the case of a thin rod like sample whose
diameter is much smaller than the wavelength of ultrasonic waves, the velocity is given
by
v = (E/)1/2
(9.8)
Where E is the Youngs modulus and is the density of the solid sample.
If the sample under consideration is not piezoelectric, then, the velocity of ultrasonic waves in that sample may be determined as follows:
A quartz crystal in the form of a rod of cross sectional area A and length Lq is used
in an experimental set up shown in Fig. 9.15. The resonance frequency fq corresponding to the maximum amplitude of the well-defined wave pattern observed on the CRO
is
CRO
Oscillator
circuit
Sample
Metal
electrodes
Quartz crystal
Speciman
Figure 9.15 Experimental set up to find the velocity of ultrasonic waves through a solid
sample.
noted. The given sample is also taken in the rod form with the same area of cross
section A and length Ls . The sample is attached to the quartz crystal using a glue and
the resonance frequency fc of the composite (quartz + sample) is determined as before.
The natural frequency f s of vibration of the sample is calculated as
f s = fc + (mq /m s )( fc fq )
where fq , f s and fc are the resonance frequencies for the quartz crystal, sample under
study and the composite, mq and m s are the mass of the quartz crystal and the sample.
The velocity of ultrasonic waves in the sample can be calculated as
vs = 2 fs Ls
283
284
the sample may be scanned with a search coil to detect changes in the induced voltage
at the defect sites. Alternatively, magnetic particles such as iron oxide in the form of
fine powder or a colloidal solution are spread over the sample surface. The magnetic
particles arrange differently at the defects thereby revealing their presence.
9.8.4 Electrical methods
It is possible to gain insight into the quality of the sample by a study of different
electrical properties of the material. However, for an analysis of a localized defect,
some special techniques are used. For example, in eddy current testing, a test coil
carrying a high frequency alternating current is made use of. When the coil is brought
close to a conductor, eddy currents are induced in the coil which in turn change the
impedance of the coil. The presence of a defect in the conductor modifies the way
in which the eddy currents are induced and hence result in corresponding changes in
the impedance of the test coil. The method is suitable for detection of surface cracks,
variations in thickness and conductivity of coatings.
9.8.5 Optical methods
Optical microscopy is a widely used technique for surface examination and evaluation. Surface morphology and microstructures can be effectively studied by optical
microscopy. Modifications to the conventional microscopy includes interference microscopy and phase contrast microscopy which are based on the optical phenomena
of interference and polarization respectively yield valuable information on the sample surface. Holographic interferometry technique using laser helps in the study of
strained surfaces and surfaces subjected to vibration. Optical examination of transparent samples in transmission helps in the analysis of the internal features of the sample.
9.8.6 Thermal methods
The principle involved in thermal methods is to supply heat to a specimen and observe
the resulting temperature distribution. The temperature at the location of defects will
be different from that at other regions. The temperature distribution may be measured
using sensitive thermal detectors like thermocouples, bolometers or photo conducting
materials. Thermography is a related technique in which the thermal radiation emitted
by the test sample is analyzed. By virtue of the temperature at which the sample is
maintained, there will be emission of infra red radiation from the sample which can
be detected and measured using an infrared detector. At defect sites, the temperature
285
and hence the emission of infrared radiation will vary in intensity and wavelength. An
image may be constructed with the data to show the defects in the sample.
9.9
Quantum Computation
Quantum computation is the study of information processing using quantum mechanical systems. Quantum mechanics was developed when the classical physics was not
in a position to explain many phenomena. Quantum mechanics has been indispensable
since then and has been used in various fields with success. Any bulk sample may be
considered to be made up of a large number of quantum systems and the behaviour of
the bulk sample may be predicted by consolidating the behaviour of individual quantum systems. In an attempt to control the bulk behaviour, attempts are being made to
control individual quantum systems. Quantum computation is a result of the study of
quantum systems and its application to information processing.
9.9.1 Properties of quantum bits
In classical computation, the information is addressed and processed using bits. Bit
is defined as a unit of information. On a similar concept, quantum information is built
using quantum bits or qubits. A classical bit can have two states- 0 and 1. Similarly,
the qubit can be in any of the two states - |0 > and |1 >. This is referred to as Dirac
notation. A classical bit can exist in any one of the two states where as the quantum
bit can exist in states that are linear combination or superposition of the two states.
This may be represented as
|>= 0 |0> +1 |1>
(9.10)
where 0 and 1 are complex numbers. In other words, a qubit represents a vector in
two dimensional space. The states |0> and |1> are called basis states and are opposite
to each other in vector space.
Classical bit can be examined and its state may be determined to be either 0 or 1.
In the case of qubit, we can only find the probability that it exists in the state |0 > or
|1 >. The probability that it exists in the state |0 > is equal to |0 |2 and the probability
that it exists in the state |1> is equal to |1 |2 . Since the total probability has to be 1, we
have
|0 |2 + |1 |2 = 1
(9.11)
This is called the normalization condition for the qubit. However, it is emphasized that
the qubit exists in either |0 > state or |1 > state only, but with finite probabilities. For
286
(9.12)
we only mean that a measurement gives a value |0 > for the qubit in 50%[|1/ 2|2 ] of
the trials and a value |1> in the remaining 50% of the trials.
We can find an example of a quantum mechanical system in an electron. The
electron can exist in either ground state or an excited state. We can call these states
|0> and |1> respectively. During the excitation process in presence of a radiation field,
the electron may be found in |0 > state or |1 > state. The qubit represented by a linear
combination of these states indicates the probability of the two states being occupied.
Z
where and are real numbers representing the angular coordinates in the three dimensional sphere of unit radius. This sphere is called the Bloch sphere and is used to
visualize the state of a single qubit.
287
(9.15)
The normalization condition for the quantum state 0 |0 > +1 |1 > requires that
|0 |2 + |1 |2 = 1
(9.16)
This must also be true for the quantum state after the action of the gate. It can be shown
that
X+ X = I
(9.17)
288
in which the operation results in the change in the sign of |1 > to |1 >. The H-gate is
represented as
#
"
1 1 1
H=
2 1 1
(9.19)
1
in which the operation results in |0 > being changed into (|0 > +|1 >) and |1 >
2
1
being changed into (|0 > |1 >).
2
Thus, the operation of three single qubit quantum gates are represented as follows:
0 |0 > +1 |1 > X 1 |0 > +0 |1 >
0 |0 > +1 |1 > Z 1 |0 > 0 |1 >
0
1
0 |0 > +1 |1 > H (|0 > +|1 >) + (|0 > |1 >)
2
2
(9.20)
(9.21)
(9.22)
Since there are an infinite number of two by two unitary matrices, we can have an
infinite number of single qubit gates. But the properties of the complete set can be
understood from the properties of a smaller set. In other words, quantum computation
can be generated on any number of qubits using a finite universal set of gates. Such a
universal set requires the use of quantum gates involving multiple qubits.
9.9.3 Multiple qubits
If we have two classical bits, then there would be four possible states, namely 00, 01,
10 and 11. Similarly, a two qubit system has four states denoted by |00 >, |01 >, |10 >
and |11 >. Since the pair of qubits can also exist in super positional states, the state
vector describing the qubit system may be represented as
| >= 00 |00 > +01 |01 > +10 |10 > +11 |11 >
(9.23)
(9.24)
289
This state has the property that the measurement on the first qubit gives a result
0 with probability (1/2) and 1 with probability (1/2). A measurement on the second
qubit also gives the same result. This indicates that the results of measurement are
correlated. Such correlations have been a subject of intense study. It has been observed
that the measurement correlations are much stronger in the Bell state as compared to
what is observed in classical systems.
In general, if we consider a system of n qubits, the quantum state of the system
will be specified by 2n amplitudes. This indicates the enormous potential of quantum
computation.
Exercise
9.1 Write a note on nano-scale systems
(Jan 2003)
9.2 What are composite materials and give a brief account of classification of composite materials.
(Jan 2003)
9.3 Give a brief account of smart materials.
(Jan 2003)
(Aug 2003)
(Aug 2003)
(Aug 2003)
9.7 What are smart materials? Explain the functional properties of smart materials.
(Aug 2003, July 2006)
9.8 Explain the basic principles of quantum computation.
9.9 Discuss the nanoscale systems.
(Feb 2004)
(Aug 2004)
9.12 Explain nano-tubes and its applications by giving their physical properties.
(Aug 2004)
9.13 Explain smart materials with two examples.
(Aug 2004)
(Aug 2004)
290
(Feb 2005)
9.16 What are the advantages and disadvantages of composite materials? (Feb 2005)
9.17 Explain the term MEMS. Give a brief account of smart materials.
(Feb 2005)
9.18 Explain the term MEMS. Discuss different materials used for MEMS.
(Aug 2005)
9.19 Explain the advantages and disadvantages of composite materials.
(Aug 2005, July 2006)
9.20 Discuss the different types of nano-scale systems.
(Aug 2005)
9.21 What are composites? Discuss their merits in the context of modern applications.
(Jan 2006)
9.22 Write a note on nanotechnology and its importance.
(Jan 2006)
(Jan 2006)
Chapter 10
Introduction
When such quantities as length, time interval, and mass are considered in elementary
physics, no special point is made about how they are measured. Since a standard unit
exists for each quantity, who makes a certain determination would not seem to matter
everybody ought to get the same result. For instance, there is no difficulty in finding the
length of a rocket when it is stationary and on earth. But what if the rocket is in flight
and we are on the ground? We have standard methods to determine the length of a
distant object with knowledge of trigonometry. However, when we measure the length
of a moving rocket from the ground, we find it to be shorter than it is to somebody in
the rocket itself. In order to understand how this unexpected difference arises we must
analyze the process of measurement when motion is involved.
10.1.1 Frames of Reference
When we say that something is moving, what we mean is that its position relative to
something else is changing. A passenger moves relative to an airplane; the airplane
moves relative to the earth; the earth moves relative to the sun; the sun moves relative
to the galaxy of stars (the Milky Way) of which it is a member; and so on. In each case
a frame of reference is part of the description of the motion. To say that something is
moving always implies a specific frame of reference. A frame of reference is usually
a Cartesian coordinate system and the position of any object is defined with respect to
the frame. The choice of the frame of reference is determined by our own convenience.
An inertial frame of reference is one in which Newtons first law of motion holds.
In such a frame, an object at rest remains at rest and an object in motion continues to
move at constant velocity (constant speed and direction) if no force acts on it. Any
frame of reference that moves at constant velocity relative to an inertial frame is itself
an inertial frame.
All inertial frame are equally valid. Suppose we see something changing its position with respect to us at constant velocity. Is it moving or are we moving? Suppose we
are in a closed laboratory in which Newtons first law holds. Is the laboratory moving
or it is at rest? These questions are meaningless because all constant velocity motion
291
292
And
x
y
z
t
= x vt
=y
=z
= t.
This is Galilean transformation and it provides space-time relation of an event in different inertial frames. In this transformation, we have assumed that the time of an event
for an observer in S is the same as the time for the same event in S . This assumption
holds good for all classical cases where the velocity v is much smaller than the velocity
of light c. For example, consider the case of a person in a train moving with a speed
v. If he throws a ball with a speed u in the direction of motion of the train, the speed
of the ball to a stationary observer outside the train will be (v + u). What happens if
we replace the ball with a flash of light? Will the stationary observer find the speed of
light to be (c + v) ? Experiments were carried out by Michelson and Morley to verify
the addition rule for the velocities of light and of the earth. The speed of earth in its
orbit around the sun is about 30 km/sec which is about (1/10,000) of the velocity of
293
light. With precision experimental set up, they tried to measure the differences in the
velocity of light along and perpendicular to the direction of earths motion.
10.1.3 Michelson-Morley experiment
Albert A. Michelson (1852-1931) was born in Germany but came to the United states
at a very tender age with his parents, who settled in Nevada. He attended the U.S.
Naval Academy at Annapolis where he became a science instructor. To improve his
knowledge of optics, in which he wanted specialize, Michelson went to Europe and
studied in Berlin and Paris. Then he left the Navy to work first at the Case school of
Applied Science in Ohio, then at Clark University in Massachusetts, and finally at the
University of Chicago, where he headed the physics department from 1892 to 1929.
Michelsons specialty was high-precision measurement, and for many decades his successive figures for the speed of light were the best available. He redefined the meter
in terms of wavelengths of a particular spectral line and devised an interferometer that
could determine the diameter of a star (stars appear as points of light in even the most
powerful telescopes).
Michelsons most significant achievement, carried out in 1887 in collaboration with
Edward Morley, was an experiment to measure the motion of the earth through the
ether, a hypothetical medium pervading the universe in which light waves were supposed to occur. The notion of the ether was a hangover from the days before light
waves were recognized as electromagnetic, but nobody at the time seemed willing to
discard the idea that light propagates relative to some sort of universal frame of reference.
CP
294
10.2
Based on all the theoretical and experimental data available, Einstein put forward his
Special Theory of Relativity. Two postulates underlie the special theory of relativity:
The laws of physics are the same in all inertial frames of reference.
This postulate follows from the absence of a universal frame of reference. If the
laws of physics were different for different observers in relative motion, the observers
could find from these differences which of them were stationary in space and which
were moving. But such a distinction does not exist, and principle of relativity expresses this fact. The second postulate is based on the results of many experiments:
The speed of light in free space has the same value in all inertial frames of
reference.
295
The speed of light is 2.998 108 m/s to four significant figures. This means that
the velocity of light has the same value for all observers and is independent of their
motion or of the motion of the light source.
To appreciate how remarkable these postulates are, let us look at a hypothetical
experiment, basically no different from actual ones that have been carried out in a
number of ways. Suppose person A turns on a searchlight just as person B takes off in
a spacecraft at a speed of 2 108 m/s. Both of them measure the speed of light waves
from the searchlight using identical instruments. From the ground person A finds their
speed to be 3 108 m/s as usual. Common sense tell us that person B ought to find
a speed of (3 2) 108 m/s, or only 1 108 m/s for the same light waves. But he also
find the speed to be 3 108 m/s, even though to person A, person B seems to be moving
parallel to the waves at 2 108 m/s.
There is only one way to account for these results without violating the principle
of relativity. It must be true that measurements of space and time are not absolute but
depend on the relative motion between an observer and what is being observed. If
person A were to measure from the ground the rate at which person Bs clock clicks
and the length of his meter stick, person A would find that the clock ticks more slowly
than it did at rest on the ground and that the meter stick is shorter in the direction of
motion of the spacecraft. To person B, his clock and meter stick are the same as they
were on the ground before he took off. To person A they are different because of the
relative motion, different in such a way that the speed of light person B measures is
the same 3 108 m/s as person A measures. Time intervals and lengths are relative
quantities, but the speed of light in free space is the same to all observers.
Thus, the Galilean transformation equations relating the space and time coordinates in one frame of reference to those in the other frame of reference are not valid
for cases where the velocity v approaches the velocity of light. Transformation equations that apply to all speeds and also incorporate the constancy of the velocity of
light were derived by the German physicist H.A. Lorentz. These equations, known as
Lorentz transformation equations, for the case considered earlier of two inertial frames
of reference moving relative to one another with a velocity v along x direction are as
follows:
x = (x vt)
y = y
z = z
and
where = 1/(1 v2 /c2 )1/2 .
t = (t vx/c2 )
296
Before Einsteins work, a conflict had existed between the principles of mechanics, which were then based on Newtons laws of motion, and those of electricity and
magnetism, which had been developed into a unified theory by Maxwell. Newtonian
mechanics had worked well for over two centuries. Maxwells theory not only covered
all that was then known about electric and magnetic phenomena but had also predicted
that electromagnetic waves exist and identified light as an example of them. However, the equations of Newtonian mechanics and those of electromagnetism differ in
the way they relate measurements made in one inertial frame with those made in a
different inertial frame.
Einstein showed that Maxwells theory is consistent with special relativity whereas
Newtonian mechanics is not, and his modification of mechanics brought these branches
of Physics into accord. As we will find, relativistic and Newtonian mechanics agree for
relative speeds much lower than the speed of light, which is why Newtonian mechanics
seemed correct for so long. At higher speeds Newtonian mechanics fails and must be
replaced by the relativistic version.
10.3
Time Dilation
2LO
c
(10.1)
297
0
Figure 10.3 Passage of a light pulse between two mirrors moving with a velocity v.
Now, let us consider the case of the two mirrors and the clock in motion with a
velocity v in a direction perpendicular to the direction of motion of the light pulse
(Fig. 10.3). The time taken by the pulse for the return journey is t. Because the
clock is moving, the light pulse, as seen from the ground, follows a zigzag path. On
its way from the lower mirror to the upper one in the time t/2, the pulse travels a
horizontal distance of v(t/2) and a total distance of c(t/2). Since L0 is the vertical
298
= L20 +
vt 2
2
t2 2
(c v2 ) = L20
4
4L2
(2L0 )2
t2 = 2 0 2 = 2
c v
c (1 v2 /c2 )
2L0 /c
t= p
1 v2 /c2
(10.2)
But 2L0 /c is the time interval t0 between ticks on the clock on the ground, as in
Eq. (10.1) and so
t0
(10.3)
t= p
1 v2 /c2
Here is a reminder of what the symbols in Eq. (10.3) represent:
t0
1 v2 /c2
So the effect is reciprocal: every observer finds that clocks in motion relative to
him tick more slowly than clocks at rest relative to him.
Our discussion has been based on a somewhat unusual clock. Do the same conclusions apply to ordinary clocks that use machinery spring controlled escapements,
299
tuning forks, vibrating quartz crystals, or whatever to produce ticks at constant time
intervals? The answer must be yes, since if a mirror clock and a conventional clock in
the spacecraft agree with each other on the ground but not when in flight, the disagreement between them could be used to find the speed of spacecraft independently of any
outside frame of reference which contradicts the principle that all motion is relative.
10.4
Length Contraction
2.2 106 s
= p
= 34.8 106 s = 34.8s
2
2
2
2
1 (0.998c) /c
1 v /c
t0
300
The moving muons have lifetimes almost 16 times longer than those at rest. In a
time interval of 34.8s, a muon whose speed is 0.998c can cover the distance
v t = (2.994 108 m/s)(34.8 106 s) = 1.04 104 m = 10.4 km
Although its life time is only t0 = 2.2s in its own frame of reference, a muon can
reach the ground from altitudes of as much as 10.4km because in the frame in which
these altitudes are measured, the muon lifetime is t = 34.8s.
What if somebody were to accompany a muon in its descent at v = 0.998c, so that
to him or her the muon is at rest? The observer and the muon are now in the same frame
of reference, and in this frame the muons lifetime is only 2.2s. To the observer, the
muon can travel only 0.66km before decaying. The only way to account for the arrival
of the muon at ground level is if the distance it travels, from the point of view of an
observer in the moving frame, is shortened by virtue of its motion. The principle
of
p
2
relativity tells us the extent of shortening it must be by the same factor of 1 v /c2
that the muon lifetime is extended from the point of view of a stationary observer.
We therefore conclude that an altitude on the ground find to be h0 must appear in
the muons frame of reference as the lower altitude
p
h = h0 1 v2 /c2
In our frame of reference the muon can travel h0 = 10.4km because of time dilation. In the muons frame of reference, where there is no time dilation, this distance
abbreviated to
p
h = (10.4 km) 1 (0.998c)2 /c2 = 0.66 km
301
p
ground by the same factor of 1 v2 /c2 that the spacecraft appears shorter to somebody at rest. The proper length L0 found in the rest frame is the maximum length any
observer will measure. As mentioned earlier, only lengths in the direction of motion
undergo contraction. Thus to an outside observer a spacecraft is shorter in flight than
the ground, but it is not narrower.
10.5
Twin Paradox
We are now in a position to understand the famous relativistic effect known as the twin
paradox. This paradox involves a twin, one of which remains on the earth while the
other goes on a voyage onto space at the speed v and returns. Dick is 20y old when he
takes off on a space voyage at a speed of 0.80c to a star 20 light years away. To Jane,
who stays behind, the pace of Dicks life is slower than hers by a factor of
p
p
1 v2 /c2 = 1 (0.80c)2 /c2 = 0.60 = 60%
To Jane, Dicks heart beat only 3 times for every 5 beats of her heart; Dick takes
only 3 breaths for every 5 of hers; Dick thinks only 3 thoughts for every 5 of hers.
Finally Dick returns after 50 years have gone by according to Janes calendar, but to
Dick the trip has taken only 30y. Dick is therefore 50y old whereas Jane, the twin who
stayed home, is 70y old.
To look at Dicks voyage from his perspective, we must take into account that the
distance L he covers shortened to
r
p
(0.80c)2
= 12 light years
L = L0 1 v2 /c2 = (20 light years) 1
c2
To Dick, time goes by at the usual rate, but his voyage to the star has taken L/v =
15y and his return voyage another 15y, for a total of 30y. Of course, Dicks lifespan
has not been extended to him, because regardless of Janes 50-y wait, he has spent only
30y on the roundtrip.
The twin paradox has been verified by experiments in which accurate clocks were
taken on an airplane trip around the world and then compared with identical clocks that
had been left behind. An observer who departs from an inertial system and then returns
after moving relative to that system will always find his or her clocks slow compared
with clocks that stayed in the system.
10.6
Relativity of Mass
When a force is applied to an object free to move, the force does work on the object
that increases its kinetic energy. The object goes faster and faster as a result. Because
302
the speed of light is the speed limit of the universe, however, the objects speed cannot
keep increasing in proportion as more work is done on it. But conservation of energy
is still valid in the world of relativity. As the objects speed increases, so does its mass,
so that the work done continues to become kinetic energy even though v never exceeds
c.
To investigate what happens to the mass of an object as its speed increases, let us
consider an elastic collision (that is, a collision in which kinetic energy is conserved)
between two particles A and B, as witnessed by observers in the reference frames S
and S which are in uniform relative motion with S moving in +ve x-direction with
respect to S with a velocity v. The properties of A and B are identical when determined
in reference frames in which they are at rest. Before collision, particle A had been at
rest in frame S and particle B in frame S . Then, at the same instant, A was thrown in
the +y direction at the speed VA while B was thrown in the y direction at the speed
VB , where
VA = VB
(10.5)
Hence the behavior of A as seen from S is exactly the same as the behavior of B as
seen from S .
When the two particles collide, A rebounds in the y direction at the speed VA ,
while B rebounds in the +y direction at the speed VB .
If linear momentum is conserved in the S frame, it must be true that
mA VA = mB VB
(10.6)
Inserting these expressions for VA and VB in Eq. (10.6), we see that momentum is
conserved provided that
p
mA = mB 1 v2 /c2
(10.7)
Because A and B are identical when at rest with respect to an observer, the difference between mA and mB means that measurements of mass, like those of space and
time, depend upon the relative speed between an observer and whatever he or she is
observing.
In the example above both A and B are moving in S . In order to obtain a formula
that gives the mass of a body measured while in motion in terms of its mass m0 when
measured at rest, we need only consider a similar example in which VA and VB are very
small compared with . In this case an observer in S will see B approach A with the
velocity v, make a glancing collision (since VB v), and then continue on. In S
m A = m0
and
mB = m
303
and so
m0
(10.8)
m= p
1 2 /c2
The mass of a body moving at the speed v relative to an p
observer is larger than
its mass when at rest relative to the observer by the factor 1/ 1 v2 /c2 . This mass
increase is reciprocal; to an observer in S , mA = m and mB = m0 . Measured from the
earth, a spacecraft in flight is shorter than its twin still on the ground and its mass is
greater. To somebody on the spacecraft in flight the ship on the ground also appears to
be shorter and to have a greater mass.
Relativistic mass increases are significant only at speeds approaching that of light.
At a speed one - tenth that of light the mass increase amounts to only 05 percent,
but this increase is over 100 percent at a speed nine - tenths that of light. Only atomic
particles such as electrons, protons, mesons, and so on have sufficiently high speeds for
relativistic effects to be measurable, and in dealing with these particles, the ordinary
laws of physics cannot to used. Historically, the first confirmation of Eq. (10.8) was the
discovery by Bucherer in 1908 that the ratio e/m of the electrons charge to its mass is
smaller for fast electrons than for slow ones. This equation, like the others of special
relativity, has been verified by so many experiments that it is now recognized as one of
the basic formulas of physics.
p
As v approaches c, 1 v2 /c2 in Eq. (10.8) approaches 0, and the mass m approaches infinity. If v = c, m = , from which we conclude that v can never equal
to c: no material object can travel as fast as light. But what if a spacecraft moving at
v1 = 0.5c relative to the earth fires a projectile at v2 = 0.5c in the same direction? We
on earth might expect to observe the projectiles speed as v1 + v2 = c. Actually, velocity addition in relativity is not so simple a process, and we would find the projectiles
speed to be only 0.8c in such a case.
10.7
Massless Particles
Can a massless particle exist? To be more precise, can a particle exist which has no
rest mass but which nevertheless exhibits such particle like properties as energy and
momentum? In classical mechanics, a particle must have rest mass in order to have
energy and momentum, but in relativistic mechanics this requirement does not hold.
Let us see what we can learn from the relativistic formulas for total energy and
linear momentum: Totaly energy,
E= p
m0 c2
1 2 /c2
(10.9)
304
Relativistic momentum
m0 v
p= p
1 v2 /c2
(10.10)
p2 c2 =
m20 v2 c2
1 v2 /c2
2 2
(10.11)
305
Numerical examples
1. A spacecraft is moving relative to the earth. An observer on earth finds that,
according to his clock, 3601 seconds elapse in a period of one hour as per the
clock on the spacecraft. What is the spacecrafts speed relative to earth?
Here, t0 = 3600s, t = 3601s
Or v = c 1 (t0 /t)
t= p
t0
1 v2 /c2
2. Solar energy reaches the earth at the rate of about 1.4 kW m2 of the surface
perpendicular to the direction of the sun. By how much does the mass of the sun
decrease per second due to this energy loss? The mean radius of the earths orbit
round the sun is 1.5 1011 m.
Energy lost per second = (1.4 103 ) (4)(1.5 1011 )2 = 4 1026 J.
3. An electron and a photon have a momenta of 2 MeV/c. Find the total energy of
each.
q
Electron energy E e = m20 c4 + p2 c2 = 2.064 MeV
Photon energy E p = pc = 2 MeV.
306
Physical Constants
Avogadro number, NA = 6.025 1023 per gram mole