Você está na página 1de 495

ENCYCLOPAEDIA OF MATHEMATICS

Volume 1
ENCYCLOPAEDIA OF MATHEMATICS

Managing Editor

M. Hazewinkel

Scientific Board

J. F. Adams, S. Albeverio, J. B. Alblas, S. A. Amitsur, I. J. Bakelman, J. W. de Bakker,


C. Bardos, H. Bart, M. Bercovier, M. Berger, E. A. Bergshoeff, E. Bertin, F. Beukers,
A. Beutelspacher, K.-D. Bierstedt, H. P. Boas, J. Bochnak, H. J. M. Bos, B. L. J. Braaksma,
A. E. Brouwer, M. G. de Bruin, R. G. Burns,Bums, P. Cartier, J. M. C. Clark, Ph. Clement,
A. M. Cohen, J. W. Cohen, R. F. Curtain, M. H. A. Davis, M. V. Dekster, C. Dellacherie,
G. van Dijk, H. C. Doets, I. Dolgachev, A. Dress, L. P. D. van den Dries, J. J. Duistermaat,
D. van Dulst, H. van Duyn, H. Dym, A. Dynin, W. Eckhaus, J. Eells, P. van Emde Boas,
H. Engl, G. Eskin, G. Ewald, V. I. Fabrikant, A. Fasano, M. Fliess, R. M. Fossum,
M. Freidlin, B. Fuchssteiner, G. B. M. van der Geer, R. D. Gill, V. V. Goldberg, J. de Graaf,
J. Grasman, P. A. Griffiths, L. Gross, P. Gruber, E. J. Hannan, K. P. Hart, G. Heckmann,
A. J. Hermans, W. H. Hesselink, C. C. Heyde, K. Hirsch, M. W. Hirsch, A. T. de Hoop,
P. J. van der Houwen, N. M. Hugenholtz, C. B. Huijsmans, A. Isidori, E. M. de Jager,
P. T. Johnstone, D. Jungnickel, M. A. Kaashoek, V. Kac, W. L. J. van der Kallen,
D. Kanevsky, Y. Kannai, H. Kaul, M. S. Keane, E. A. de Kerf, W. Klingenberg, T. Kloek,
C. Koeman, J. A. C. Kolk, G. Komen, T. H. Koornwinder, Koomwinder, S. Kotz, L. Krop,
B. Kuperschmidt, H. A. Lauwerier, J. van Leeuwen, J. Lennox, H. W. Lenstra Jr.,
J. K. Lenstra, H. Lenz, M. Levi, J. Lindenstrauss, J. H. van Lint, A. Liulevicius, M. Livshits,
W. A. J. Luxemburg, R. M. M. Mattheij, L. G. T. Meertens, P. Mekenkamp, A. R. Meyer,
J. van Mill, I. Moerdijk, H. Neunzert, G. Y. Nieuwland, G. J. Olsder, F. van Ostayen,
B. Pareigis, K. R. Parthasarathy, I. I. Piatetskii-Shapiro, H. G. J. Pijls, N. U. Prabhu,
G. B. Preston, E. Primrose, A. Ramm, C. M. Ringel, J. B. T. M. Roerdink, K. W. Roggenkamp,
G. Rozenberg, W. Rudin, F. H. Ruymgaart, S. N. M. Ruysenaars, A. Salam, A. Salomaa,
P. Saunders, C. L. Scheffer, R. Schneider, J. A. Schouten, A. Schrijver, F. Schurer,
I. A. Segal, J. J. Seidel, A. Shenitzer, V. Snaith, T. A. Springer, J. H. M. Steenbrink,
J. D. Stegeman, F. W. Steutel, P. Stevenhagen, I. Stewart, L. Streit, K. Stromberg,
L. G. Suttorp, D. Tabak, F. Takens, R. J. Takens, N. Temme, S. H. Tijs, B. Trakhtenbrot,
N. S. Trudinger, L. N. Vaserstein, M. L. J. van de Vel, F. D. Veldkamp, W. Vervaat,
P. M. B. Vitanyi, H. A. van der Vorst, J. de Vries, F. Waldhausen, B. Wegner,
J. J. O. O. Wiegerinck, J. Wiegold, J. C. Willems, J. M. Wills, B. de Wit, S. A. Wouthuysen,
S. Yuzvinskii, L. Zalcman, S. I. Zukhovitzkii
ENCYCLOPAEDIA
OF
MATHEMATICS
Volume 1
A-B

An updated and annotated translation of the Soviet


'Mathematical Encyclopaedia'

Reidel
KLUWER ACADEMIC PUBLISHERS
Science and Technology Division
Dordrecht / Boston / Lancaster / Tokyo
Library of Congress Cataloging-in-Publication Data

Matematicheskaia entsiklopediia. English.


Encyclopaedia of mathematics.

1. Mathematics- - Dictionaries. I. Hazewinkel, Michie!. II. Title.


QA5.M3713 1987 510'.3'21 87-26437

Published by Reidel
an imprint of
Kluwer Academic Publishers,
P.O. Box 17, 3300 AA Dordrecht, Holland.

Sold and distributed in the U.S.A. and Canada


by Kluwer Academic Publishers,
101 Philip Drive, Norwell, MA 02061, U.S.A.

In all other countries, sold and distributed


by Kluwer Academic Publishers Group,
P.O. Box 322, 3300 AH Dordrecht, Holland.

ISBN 978-90-481-8235-0 ISBN 978-94-015-1239-8 (eBook)


DOI 10.1007/978-94-015-1239-8
All Rights Reserved
1988 by Kluwer Academic Publishers, Dordrecht, Holland
Softcover reprint of the hardcover 1st edition 1988
No part of the material protected by this copyright notice may be reproduced or
utilized in any form or by any means, electronic or mechanical
including photocopying, recording or by any information storage and
retrieval system, without written permission from the copyright owner
SOVIET MATHEMATICAL ENCYCLOPAEDIA

Editor-in-Chief

I. M. Vinogradov

Editorial Board

S. 1. Adyan, P. S. Aleksandrov, N. S. Bakhvalov, A. V. Bitsadze, V. 1. Bityutskov (Deputy


Editor-in-Chief), L. N. BOl'shev, A. A. Gonchar, N. V. Efimov, V. A. Il'in, A. A. Karatsuba,
L. D. Kudryavtsev, B. M. Levitan, K. K. Mardzhanishvili, E. F. Mishchenko, S. P. Novikov,
E. G. Poznyak, Yu. V. Prokhorov (Deputy Editor-in-Chief), A. 1. Shirshov, A. G. Sveshnikov,
A. N. Tikhonov, P. L. UI'yanov, S. V. Yablonskii

Translation Arrangements Committee


V. I. Bityutskov, R. V. Gamkrelidze, Yu. V. Prokhorov

'Soviet Encyclopaedia' Publishing House


PREFACE

This ENCYCLOPAEDIA OF MATHEMATICS aims to be a reference work for all parts of mathema-
tics. It is a translation with updates and editorial comments of the Soviet Mathematical Encyclo-
paedia published by 'Soviet Encyclopaedia Publishing House' in five volumes in 1977 - 1985.
The annotated translation consists of ten volumes including a special index volume.
There are three kinds of articles in this ENCYCLOPAEDIA. First of all there are survey-type
articles dealing with the various main directions in mathematics (where a rather fine subdivision
has been used). The main requirement for these articles has been that they should give a reason-
ably complete up-to-date account of the current state of affairs in these areas and that they
should be maximally accessible. On the whole, these articles should be understandable to
mathematics students in their first specialization years, to graduates from other mathematical
areas and, depending on the specific subject, to specialists in other domains of science, en-
gineers and teachers of mathematics. These articles treat their material at a fairly general level
and aim to give an idea of the kind of problems, techniques and concepts involved in the area in
question. They also contain background and motivation rather than precise statements of pre-
cise theorems with detailed definitions and technical details on how to carry out proofs and con-
structions.
The second kind of article, of medium length, contains more detailed concrete problems, re-
sults and techniques. These are aimed at a smaller group of readers and require more back-
ground expertise. Often these articles contain more precise and refined accounts of topics and
results touched on in a general way in the first kind of article.
Finally, there is a third kind of article: short (reference) definitions.
Practically all articles (all except a few of the third kind) contain a list of references by means
of which more details and more material on the topic can be found. Most articles were specially
written for the encyclopaedia and in such cases the names of the original Soviet authors are
mentioned. Some articles have another origin such as the Great Soviet Encyclopaedia (Bol,' shaya
Sovieticheskaya Entsiklopediya or BSE).
Communication between mathematicians in various parts of the world has certainly greatly
improved in the last decennia. However this does not mean that there are so-to-speak 'one-to-
one onto' translations of the terminology, concepts and tools used by one mathematical school
to those of another. There also are varying traditions of which questions are important and
which not, and what is considered a central problem in one tradition may well be besides the
point from the point of view of another. Even for well established areas of mathematical inquiry,
terminology varies across languages and even within a given language domain. Further, a concept,
theorem, algorithm, ... , which is associated with one proper name within one tradition may
well have another one in another; especially if the result or idea in question was indeed disco-
vered independently and more-or-Iess simultaneously. Finally, mathematics is a very dynamic
science and much has happened since the original articles were finalized (mostly around 1977).
This made updates desirable (when needed). All this, as well as providing additional references
to Western literature when needed, meant an enormous amount of work for the board of

vii
PREFACE

experts as a whole; some indeed have done a truly impressive amount of work. I must stress
though that I am totally responsible for what is finally included and what is not of all the mate-
rial provided by the members of the board of experts.
Many articles thus are provided with an editorial comment section in a different and some-
what smaller typeface. In particular, these annotations contain additional material, amplifica-
tions, alternative names, additional references, .... Modifications, updates and other extra
material provided by the original Soviet authors (not a rare occurrence) have been incorporated
in the articles themselves.
The final (lO-th) volume of this ENCYCLOPAEDIA OF MATHEMATICS will be an index volume.
This index will contain all the titles of the articles (some 6600) and in addition the names of all
the definitions, named theorems, algorithms, lemmas, scholia, constructions, ... , which occur
in the various articles. This includes, but is by no means limited to, all items which are printed
bold or italic. Bold words or phrases, by the way, always refer to another article with (precisely)
that title.
All articles have been provided with one or more AMS classification numbers according to
the 1980 classification scheme (not, for various reasons, the 1985 revision). So have all items
occurring in the index. A phrase or word out of an article which is included in the index always
inherits all the classification numbers of the article in question. In addition, it may have been
provided with its own classification numbers. In the index volume these numbers will be listed
with the phrase in question. Thus e.g. the Quillen - Suslin theorem of algebraic K-theory will
have its own main classification numbers (these are printed in bold; in this case that number is
18F25) as well as a number of others, often from totally different fields, pointing e.g. to parts of
mathematics where the theorem is applied, or where there occurs a problem related to it. In this
case e.g. 93D15. The index volume will also contain the inversion of this list which thus for each
number will provide a list of words and phrases which may serve as an initial description of the
'content' of that classification number (as far as this ENCYCLOPAEDIA is concerned). For more
details on the index volume, its structure and organisation, and what kind of things can be done
with it, d. the (future) special preface to that volume.
Classifying articles is a subjective matter. Opinions vary greatly as to what belongs where and
thus this attempt will certainly reflect the tastes and opinions of those who did the classification
work. One feature of the present classification attempt is that the general basic concepts and de-
finitions of an area like e.g. 55N (Homology and Cohomology theories) or 601 (Markov proces-
ses) have been assigned classification numbers like 55NXX and 60JXX if there was no finer clas-
sification number different from ... 99 to which it clearly completely belongs.
Different parts of mathematics tend to have differences in notation. As a rule, in this
ENCYCLOPAEDIA in a given article a notation is used which is traditional in the corresponding
field. Thus for example the (repeated index) summation convention is used in articles about
topics in fields where that is traditional (such as in certain parts of differential geometry (tensor
geometry)) and it is not used in other articles (e.g. on summation of series). This pertains espe-
cially to the more technical articles.
For proper names in Cyrillic the British Standards Institute transcription system has been
used (cf. Mathematical Reviews). This makes well known names like S. N. Bernstein come out
like Bernshte'in.
In such cases, especially in names of theorems and article titles, the traditional spelling has
been retained and the standard transcription version is given between brackets.
Ideally an encyclopaedia should be complete up to a certain more-or-Iess well defined level of
detail. In the present case I would like to aim at the completeness level that every theorem,

viii
PREFACE

concept, definition, lemma, construction which has a more-or-Iess constant and accepted name
by which it is referred to by a recognizable group of mathematicians occurs somewhere, and can be
found via the index. It is unlikely that this completeness ideal will be reached with this present
ENCYCLOPAEDIA OF MATHEMATICS, but it certainly takes substantial steps in this direction.
Everyone who uses this ENCYCLOPAEDIA and find items which are not covered, which, he feels,
should have been included, is invited to inform me about it. When enough material has come in
in this way supplementary volumes will be put together.
The ENCYCLOPAEDIA is alphabetical. Many titles consist of several words. Thus the problem
arises how to order them. There are several systematic ways of doing this of course. For inst-
ance using the first noun. All are unsatisfactory in one way or another. Here an attempt has
been made to order things according to words or natural groups of words as they are daily used
in practice. Some sample titles may serve to illustrate this: Statistical mechanics, mathematical
problems in; Lie algebra; Free algebra; Associative algebra; Absolute continuity; Abstract
algebraic geometry; Boolean functions, normal forms of. Here again taste plays a role (and us-
ages vary). The index will contain all permutations. Meanwhile it will be advisable for the read-
er to tryout an occasional transposition himself. Titles like K-theory are to be found under K,
more precisely its lexicographic place is identical with 'K theory'. I.e. '-' = 'space' and comes
before all other symbols. Greek letters come before the corresponding Latin ones, using the
standard transcriptions. Thus X2-distribution (chi-squared distribution) is at the beginning of the
letter C. A * as in C* -algebra and *-regular ring is ignored lexicographically. Some titles involve
Greek letters spelled out in Latin. These are of course ordered just like any other 'ordinary'
title.
This volume has been computer typeset using the (Unix-based) system of the CWI, Am-
sterdam. The technical (mark-up-Ianguage) keyboarding was done by Rosemary Daniels,
Chahrzade van 't Hoff and Joke Pesch. To meet the data-base and typesetting requirements of this
ENCYCLOPAEDIA substantial amounts of additional programming had to be done. This was done by
lohan Wolleswinkel. Checking the translations against the original texts, and a lot of desk edit-
ing and daily coordination was in the hands of Rob Hoksbergen. All these persons, the mem-
bers of the board of experts, and numerous others who provided information, remarks and
material for the editorial comments, I thank most cordially for their past and continuing efforts.
The original Soviet version had a printrun of 150.000 and is completely sold out. I hope that
this annotated and updated translation will turn out to be comparably useful.

Bussum, August 1987 MICHIEL HAZEWINKEL

ix
A
A -INTEGRAL - One of the generalizations of the countable intersection, and is idempotent. With respect
Lebesgue integral, given by E. Titchmarsh [1] for the to ~ -operations, the Haire property (of subsets of an
integration of functions conjugate to summable ones. A arbitrary topological space) and the property of being
measurable function f(x) is called A -integrable over Lebesgue measurable are invariant.
[a, b] if
m{x: If(x) 1>n} = 0 !
[ ) References
[IJ Al.EKsANDRov, P.S.: c.R. Acad. Sci. Paris 162 (1916), 323-325.
[2J Al.EKsANDROV, P.S.: Theory offunctions of a real variable and
and if the theory of topological spaces, Moscow, 1978 (in Russian).
b [3] KOI.MOGOROV, A.N.: 'P.S. Aleksandrov and the theory of as-
I = lim j[f(x)ln fix operations', Uspekhi Mat. Nauk 21, no. 4 (1966), 275-278 (in
n~oo a
Russian).
exists, where [4] SusLIN, M.YA.: c.R. Acad. Sci. Paris 164 (1917),88-91.
[5] LUZIN, N.N.: Selected works, Vol 2, Moscow, 1958, p. 284.
f(X) if 1 f(x) 1 "';;n, [6J KURATOWSICI, K..: Topology, Acad. Press, 1966-1968 (translated
[f(x)ln = {0 if 1 f(x) 1 >n. from the French). A. G. El'kin
The number I is called the A -integral. It is denoted by Editorial comments. The d -operation is in the West
b usually attributed to M.Ya. Suslin [4], and is therefore also
(A) jf(x)dx. called the Suslin operation, the Suslin d -operation or the
d
Suslin operation d. d -sets are usually called analytic sets.
References AMS 1980 Subject Classification: 04A15
[IJ 1iTcHMARSH, E.G.: 'On conjugate functions', Proc. London
Math. Soc. 29 (1928),49-80.
[2J VINOGRADOVA, IA.: 'Generalized integrals and Fourier series', A POSTERIORI DISTRIBUTION - A conditional
ftogi Nauk. Mat. Anal. 1970 (1971),65-107 (in Russian).
probability distribution of a random variable, to be
LA. Vinogradova contrasted with its unconditional or a priori distribu-
AMS 1980 Subject Classification: 26A42 tion.
Let 8 be a random parameter with an a priori den-
.s;/-oPERATION, operation ~ - A set-theoretical sity p(8), let X be a random result of observations and
operation, discovered by P.S. Aleksandrov [1] (see also let p(x 18) be the conditional density of X when 8=0;
[2], [3D. Let {E", ... lit} be a system of sets indexed by then the a posteriori distribution of 8 for a given
all finite sequences of natural numbers. The set X = x, according to the Bayes formula, has the density
co
P = U nEil, ...... ' p(fJ 1x) = +}(8)p(x 18)
" .. ___ ..... __ k=J
j p{fI)p(x l8)dfJ
where the union is over all infinite sequences of natural -co
numbers, is called the result of the ~ -operation If T(x) is a sufficient statistic for the family of distribu-
applied to the system {E", ... lit}. tions with densities p (x I 8), then the a posteriori distri-
The use of the ~-operation for the system of inter- bution depends not on x itself, but on T(x). The
vals of the number line gives sets (called ~ -sets in asymptotic behaviour of the a posteriori distribution
honour of Aleksandrov) which need not be Borel sets p(Olx), ... ,x,,) as n~oo, where Xj are the results of
(see ~ -set; Descriptive set theory). The ~ -operation is independent observations with density p(x I(0 ), is
stronger than the operation of countable union and 'almost independent' of the a priori distribution of 8.

1
A POSTERIORI DISTRIBUTION

For the role played by a posteriori distributions in A PRIORI PROBABILITY of an event - The proba-
the theory of statistical decisions, see Bayesian bility of the event, considered in contrast to the condi-
approach. tional probability of the same event under certain addi-
References tional conditions. The latter is called the a posteriori
[1] BERNSHTEiN, S.N.: Probability theory, Moscow-Leningrad, 1946 probability. This terminology is, as a rule, used in con-
(in Russian). v V.. P rokh orov nection with the Bayes fonnula.
~u. Yu. V. Prokhorov
Editorial comments. AMS 1980 Subject Classification: 60A 15
References
[A1] SVERDRUP, E.: Laws and chance variations, 1, North-Holland,
1967, p. 214ft.
sI-SET, analytic set, in a complete separable metric
space - A continuous image of a Borel set. Since any
AMS 1980 Subject Classification: 62F15 Borel set is a continuous image of the set of irrational
numbers, an d -set can be defined as a continuous
A POSTERIORI PROBABILITY of an event - The image of the set of irrational numbers. A countable
conditional probability of an event taking place under intersection and a countable union of d -sets is an d-
certain conditions, to be contrasted with its uncondi- set. Any d -set is Lebesgue-measurable. The property
tional or a priori probability. There is no difference of being an d -set is invariant relative to Borel-
between the meaning of the terms 'conditional' and 'a measurable mappings, and to d -operations (cf. d -
posteriori'. The former term is employed if the condi- operation). Moreover, for a set to be an d -set it is
tion itself is hypothetical and is not directly observed in necessary and sufficient that it can be represented as
the course of the experiment. The latter term is the result of an d -operation applied to a family of
employed if it is desired to stress that the condition in closed sets. There are examples of d -sets which are
question is actually observed. The a posteriori probabil- not Borel sets; thus, in the space 21 of all closed sub-
ity is connected with the a priori probability by the sets of the unit interval I of the real numbers, the set of
Bayes fonnula. all closed uncountable sets is an d -set, but is not
Yu. V. Prokhorov
Borel. Any uncountable d -set topologically contains a
AMS 1980 Subject Classification: 62F15 perfect Cantor set. Thus, d -sets 'realize' the contin-
uum hypothesis: their cardinality is either finite, No or
A PRIORI DISTRIBUTION - The probability distri- 2No. The Luzin separability principles hold for d -sets.
bution of a random variable, to be contrasted with the References
conditional distribution of this random variable under [1] KURATOWSKl, K.: Top%gy, 1, Acad. Press, 1966-1968
certain additional conditions. Usually the term 'a priori (translated from the French).
distribution' is used in the following way. Let (8, X) be [2] LUZIN, N.N.: Lectures on analytic sets and their applications,
Moscow, 1953 (in Russian).
a pair of random variables (random vectors or more B.A. Efimov
general random elements). The random variable 8 is Editorial comments. Nowadays the class of analytic sets
considered to be unknown, while X is considered to be is denoted by };1, and the class of co-analytic sets (cf.
the result of an observation to be used for estimation of Cd-set) by 111.
e. The joint distribution of e and X is given by the
References
distribution of 8 (now called the a priori distribution) [A1] JECH, T.J.: The axiom of chOice, North-Holland, 1973.
and the set of conditional probabilities Po of the ran- [A2] MOSCHOVAKIS, Y.N.: Descriptive set theory, North-Holland,
dom variable X given e=o. According to the Bayes 1980.
fonnula, one can calculate the conditional probability AMS 1980 Subject Classification: 04A 15, 54H05
of 8 with respect to X (which is now called the a pos-
teriori distribution of 8). In statistical problems, the a
priori distribution is often unknown (and even the A -SYSTEM - A 'countably-ramified' system of sets,
assumption on its existence is not sufficiently founded). i.e. a family {Ani'" n,} of subsets of a set X, indexed
For the use of the a priori distribution, see Bayesian by all finite sequences of natural numbers. An A-
approach. system {Ani" . nJ IS called regular if
Yu. V. Prokhorov Ani' .. n,n CAn n,'
I ... A sequence
HI

Editorial comments. Ani' .. ,Ani "n,'" of elements of an A-system


References indexed by all segments of one and the same finite
[A1] SVERDRUP, E.: Laws and chance variations, 1, North-Holland, sequence of natural numbers is called a chain of this
1967, p. 214ft.
A-system. The intersection of all elements of a chain is
AMS 1980 Subject Classification: 62F15 called its kernel, and the union of all kernels of all

2
ABEL DIFFERENTIAL EQUATION

chains of an A -system is called the kernel of this A- is convergent.


system, or the result of the .s;/ -operation applied to this 2) Abel's criterion for series offunctions. The series
A -system, or the .s;/ -set generated by this A -system.
Every A-system can be regularized without changing n=l
the kernel (it suffices to put converges uniformly on a set X if the series
A~, "'n, =AI n '" nAn, n'). If At is a system of
sets, then the kernels of the A -system composed from
n=1
the elements of At are called the .s;/-sets generated by
At . The d-sets generated by the closed sets of a topo- converges uniformly on X and if the functions an(x),
logical space are called the .s;/-sets of this space. n = 1,2, ... , for any x EX, form a monotone sequence
that is uniformly bounded on X An Abel criterion for
References
[1] ALEKSANDROV, P.S.: An introduction to set theory and general the uniform convergence of integrals
topology, Moscow, 1977 (in Russian). 00

[2] KURATOWSKI, K.: Topology, 1, Acad. Press, 1966 (translated ja(n, x)b(n, x)dn,
from the French).
A.G. El'kin
which depend on a parameter x EX, can be formulated
Editorial comments. Thed-operation is an important
in a similar manner.
tool in descriptive set theory. It was introduced by M.Ya.
The Abel criteria can be strengthened (see, for exam-
Suslin, hence it is also known as the Sus/in operation (also
So us/in operation). In this connection, an A-system is also
ple, Dedekind criterion (convergence of series. See
called a Sus/in (Souslin) scheme. See also d-operation; also Dirichlet criterion (convergence of series); Abel
d-set. transformation.
While [2] is the standard reference for classical results, a References
modern approach can be found in [A1]. [1] FICHTENHOLZ, G.M.: Differential und Integralrechnung, I,
Deutsch. Verlag Wissenschaft., 1964.
References [2] KUDRYAVTSEV, L.D.: Mathematical analysis, I, Moscow, 1973
[A 1] CHRISTENSEN, J.P.R.: Topology and Borel structure, North-
(in Russian).
Holland, 1974.
[3] WHITTAKER, E.T. and WATSON, G.N.: A course of modern
AMS 1980 Subject Classification: 54H05, 04A 15 analysis, 1-2, Cambridge Univ. Press, 1952.
L.P. Kuptsov
AMS 1980 Subject Classification: 40A05, 40A10
ABACUS - 1) A calculating frame used for arithmeti-
cal calculations in ancient Greece, ancient Rome and
later, up to the 18-th century, in Western Europe. The ABEL DIFFERENTIAL EQUATION - The ordinary
frame was subdivided into strips, inside which the differential equation
counters (bones, pebbles, etc.) were moved. The y' = !O(X)+!I(X)Y +/2(x)y 2 +!3 (x)y 3
Chinese equivalent of the abacus - swan-pan - was
(Abel's differential equation of the first kind) or
used in the Far East, while the 'schety' was employed
in Russia. [go(x)+gl(X)Y}y' = !O(X)+!I(X)y+/2(x)y 2+/J(X)y 3
2) In nomography an abacus is a special sketch (the (Abel's differential equation of the second kind). These
so-called net chart nomogram). equations arose in the context of the studies of N.H.
BSE-3
Abel [1] on the theory of elliptic functions. Abel's dif-
Editorial comments. ferential equations of the first kind represent a natural
References generalization of the Riccati equation.
[A 1] ROUSE BALL, W. W.: A short account of the history of
If fl EC(a, b), fz and h ECI(a, b) and h(x)i=O for
mathematics, Dover, reprint, 1960, pp. 123-125.
x E[a, b], then Abel's differential equation of the first
AMS 1980 Subject Classification: OOA08, OOA25, kind can be reduced to the normal form
01A20 dz / dt=z3 +cI>(t) by substitution of variables [2]. In
the general case, Abel's differential equation of the first
ABEL CRITERION - 1) Abel's criterion for series of kind cannot be integrated in closed form, though this is
numbers. If the series possible in special cases [2]. If gO,gIECI(a,b) and
gl(X)i=O, gO(X)+gl(X)yi=O, Abel's differential equa-
n=l
tion of the second kind can be reduced to Abel's dif-
is convergent and if the numbers an form a monotone ferential equation of the first kind by substituting
bounded sequence, then the series go(x)+gl(x)y= 1 / z.
Abel's differential equations of the first and second
kinds, as well as their further generalizations

3
ABEL DIFFERENTIAL EQUATION

n m n Let A~ be the class of functions 1 (z) of the form


y' = ~(x>'y;, y'~gj(x>'yj = ~(x>.y;, co _
;=0 j=O ;=0
f(z) = ~(n!)-"a"zn, lim I a" Illn :E;; T,
have been studied in detail in the complex domain (see, n=O n-+oo

for example, [3]). O<r < 00, O~a< 00, and let Aa. be the class of all pos-
References sible sequences {)..,,} such that I)"" I ~(n + 1 I , r-
[I] ABEL, N.H.: 'Precis d'une th60rie des fonctions elliptiques', J. n =0, 1, .... By definition, the bound 01 convergence for
Reine Angew. Math. 4 (1829), 309-348.
[2] K.uoo!, E.: Differentialgleichungen: LOsungsmethoden und
the class Aa. is the supremum S of the values of r for
/I(

LOsungen, 1, Chelsea, reprint, 1971. which every function 1 (z) EA~ can be represented by a
[3] GoLUBEV, V.V.: Vorlesungen iiber Differentialgleichungen im series (*). The infimum W of the r-values for which
/I(

Komplexen, Deutsch. Verlag Wissenscbaft., 1958 (translated there exist a function I(Z)EA~ and a sequence
from the Russian). N.Kh. Rozov {)..,,}EA.. such thatj<n)()..,,)=O, n=O, 1, ... , l(z)"O,
is called the bound 01 uniqueness. The quantities WI
AMS 1980 Subject Classification: 34AXX
and Wo are known, respectively, as the Whittaker con-
stant and the Goncharov constant. It has been shown
ABEL - GoNCHAROV PROBLEM, Goncharov prob- that S I = WI (cf. [6]); the more general statements
lem - A problem in the theory of functions of a com-
plex variable, consisting of finding the set of all func- S .. = WI> W.. = WI> O:E;;a<oo,
tions 1 (z) from some class satisfying the relations have also been proved [5], [10].
j<n)()..,,)=A n (n=O, 1, ... ). Here {An} and {)..,,} are Thus, if {~n}EA.., the Abel-Goncharov problem is
sequences of complex numbers admissible for the given reduced to finding the constant WI. Its precise numeri-
class. The problem was posed by V.L. Goncharov [2]. cal value is not known, but one has obtained the
A function 1 (z) is put in correspondence with the bounds 0.7259< WI <0.7378 [9].
series Relative to the Abel- Goncharov problem for the
""
~j<n)(An)Pn(z), (*) class A'i of functions which are regular in the region
n=O
I z I ;;;;.1 and such that I( 00) = 0, the following has been
the Abel-Goncharov interpolation series, where Pn(z) is shown: For any set of numbers {~}, I ~ I;;;;'1, which
the Goncharov polynomial defined by the equalities satisfy the condition
P~k)(At) = 0, k=O, ... ,n-I; p~n)(z) - l. . nk
N.H. Abel gave a formal treatment of the case k~1A;f = 0,
)""=a+nh, n=O, 1, ... , where a and h are real where {nk} is an ~creas~~ ~ence ::.f natural
numbers (h=j=O) (cf. [1]). Here numbers, the equatIons f
(~k)-O, k-O, 1, ... ,
Pn(z) = -\-(z-a)(z-a-nhr- I imply l(z)=O. Moreover, for any number b>O there
n. exist a sequence {~n}
The series (*) is used to study the zeros of the succes-
Jim_n_ =b
sive derivatives of regular functions. The set of func- I An
n .... "" I
tions 1 (z) representable by a series (*) is called the con- and a function l(z) .. O,/(z)EAi, for which
vergence class of the Abel- Goncharov problem.
J<n)(~n)=O, n =0, 1, . .. [7].
In the case when limn . . . oo I ~n I = 00, the The Abel- Goncharov problem includes the so-called
Abel- Goncharov convergence class has been expressed two-point problem posed by J.M. Whittaker [12]. Let the
in terms of bounds on the order and the type of entire sequences {Jlk } and {p.,. } be such that
functions I(z), depending on the growth of the quan- n
{Jld U {p.,.} = {n} and {Jld {p.,.} = 0. The problem
tity ~~:~ I ~k+ I - ~k I [2]. is to determine conditions under which there exists a
If ).." =nl/Pl(n), where l(n) is a slowly increasing function l(z)"O, re~ar on the segment [0, 1], satisfy-
function, O<p< 00, the Abel- Goncharov convergence ing the conditions f r.)(1) =0, r)(O) =0. This problem
class has, in a sense, been exactly determined [6]. Other was solved for various subclasses of the class of func-
Abel- Goncharov convergence classes for entire func- tions regular in the disc I z I<R (R> 1). The resulting
tions of finite and infinite order have been identified in conditions, which are in a sense precise, are expressed
terms of various constraints on the indicators of the in terms of various bounds imposed on the coefficients
respective classes of functions. The Abel- Goncharov a,. of the expansions
problem has also been studied for entire functions of
00
several variables. Exact estimates of Goncharov polyno- j(z) = ~a z
mials were obtained for some classes of interpolation k=O

nodes. depending on {Jld [3]. This problem was generalized

4
ABEL INTEGRAL EQUATION

and solved using methods of the theory of infinite sys- Abel's inequality is proved by means of the Abel
tems of linear equations [4]. In the particular case transfonnation.
where the sequence {pd forms an arithmetic progres- LD. Kudryavtsev
sion and one is dealing with entire functions of AMS 1980 Subject Classification: 26015
exponential type, the two-point problem has, in a cer-
tain sense, been solved completely [8]. ABEL INTEGRAL EQUATION - The integral equa-
References tion
[1] ABEL, N.H.: 'Sur les fonctions generatrices et leurs
determinants', in Oeuvres completes, Vol. 2, Christiania, 1839,
I
x cp(s) ds = f(x),
Yx-s
(1)

pp. 77-88. Edition de Holmboe. to which the solution of the Abel problem is reduced.
[2] GoNTCHAROFF, W.L. [V.L. GoNCHAROV]: 'Recherches sur les
derivees successives des fonctions analytiques. Generalisation The generalized Abel integral e~tion is the equation
de la serie d'Abel', Ann. Sci. Ecole Norm. Sup. (3) 47 (1930),
1-78. Ja(x-sf
cp(s) ds = f(x), a';;;x ~b, (2)
[3] GEL'FOND, A.O. and IBRAGIMOV, I.I.: 'On functions, the
derivatives of which vanish in two points', Izv. Akad. Nauk
where a>O and O<a<l are known constants, f(x) is
SSSR Ser. Mat. 11 (1947), 547-560 (in Russian).
[4] DZHRBASHYAN, M.M.: 'Uniqueness and representability a known function and <p(x) is the unknown function.
theorems for analytic functions', Izv. Akad. Nauk SSSR Ser. The expression (x - s) - a is called the kernel of the
Mat. 16 (1952), 225-252 (in Russian). Abel integral equation, or Abel kernel. An Abel
[5] DRAGILEV, M.M. and CHuIrnLOVA, O.P.: 'On convergence of
certain interpolation series', Sibirsk. Mat. Zh. 4, no. 2 (1963), integral equation belongs to the class of Volterra equa-
287-294 (in Russian). tions of the first kind (d. Volterra equation). The equa-
[6] EVGRAFov, M.A.: The Abel-Goncharov interpolation problem, tion b

falx-sla
Moscow, 1954 (in Russian).
[7] KAz'MIN, YU.A.: 'On zeros of sequences of derivatives of ana- p(s) ds = f(x), a~x~b, (3)
lytic functions', Vestnik Moskov. Univ. Ser. I Math. Mekh., no.
1 (1963), 26-34 (in Russian). is called Abel's integral equation with fixed limits.
[8] KAz'MIN, Yu.A.: 'On a problem of Gel'fond- Ibragimov',
If f (x) is a continuously-differentiable function, then
Vestnik Moskov. Univ. SeT. I Math. Mekh. 19, no. 6 (1965),
37-44 (in Russian). the Abel integral equation (2) has a unique continuous
[9] MAcINTYRE, S.S.: 'On the zeros of successive derivatives of solution given by the formula
integral functions', Trans. Amer. Math. Soc. 67 (1949),241-251.
[10] SUETIN, Yu.K.: 'Coincidence of constants of uniqueness and cp(x) = sina'IT ~fx (t)dt (4)
convergence of certain interpolation problems', Vestnik 'IT dx a (x-t)l-a'
Moskov. Univ. Ser. I Math. Mekh. 21, no. 5 (1966), 16-25 (in
Russian). or
[11] OsKOLKOV, V.A.: 'On estimates for Gonwov polynomials', cp(x) = sina'IT [ (a) +fx t)dt ] (5)
Math. USSR-Sb. 21, no. 1 (1973), 57-62. (Mat. Sb. 92, no. 1 'IT (x-a)l-a a (x-t)l-a '
(1973), 55-59)
[I2] WHITTAKER, J.M.: Interpolatory function theory, Cambridge which is the same thing. Formula (5) also gives the
Univ. Press, 1935. solution of an Abel integral equation (2) under more
V.A. Oskolkov
general assumptions [3], [4]. Thus, it has been shown [3]
Editorial comments. For an introduction to the general that if f (x) is absolutely continuous on the interval
area, cf. [A 1]. [a, b], the Abel integral equation (2) has the unique
References solution in the class of Lebesgue-integrable functions
[A1] BoAS, RP.: Entire functions, Acad. Press, 1954. given by formula (5). For the solution of the Abel
AMS 1980 Subject Classification: 30E05 integral equation (3) see [2]; see also [6].
References
ABEL INEQUALITY - An estimate for the sum of [1] BOCHER, M.: 'On the regions of convergence of power-series
which represent two-dimensional harmonic functions', Trans.
products of two numbers. If sets of numbers ak and bk
Amer. Math. Soc. 10 (1909),271-278.
are given such that the absolute values of all sums [2] CARLEMAN, T.: 'Ueber die Abelsche Integralgleichung mit kon-
Bk=b) + ... +bko k=l, ... ,n, are bounded by a stanten Integrationsgrenzen', Math. Z. 15 (1922),111-120.
number B, i.e. I Bk I ~B, and if either ai ;;:'ai +) or [3] ToNELU, L.: 'Su un problema di Abel', Math. Ann. 99 (1928),
183-199.
ai ~ai + J, i = 1, ... ,n - 1, then [4] TAMAllKIN, J.D.: 'On integrable solutions of Abel's integral

Ik~lakbk I.;;; B(I al 1+21 an I).


equation', Ann. of Math. (2) 31 (1930), 219-229.
[5] MnrnuN, S.G.: Linear integral equatiOns, Hindushtan Publ.
Comp., Delhi, 1960 (translated from the Russian).
If the ak are non-increasing and non-negative, one [6] GAKHOV, F.D.: Boundary value problems, Pergamon, 1%6
has the simpler estimate: (translated from the Russian). B V. Kh -1 l'd
.. veue I ze

Ik~lakbk I.;;; Bal' Editorial comments. The left-hand side of (2) is also
known as a Riemann - Liouville fractional integral, where

5
ABEL INTEGRAL EQUATION

Rea<1, cf. [A1]. If one integrates in (1) and (2) from x to as M(p, </ approaches P(1, </>0) non-tangentially inside the
00 and replaces x - s by s - x, then one obtains left-hand disc, cf. [A2], pp. 129-130.
sides which are known as Abel transform and Weyl frac- References
tional integral (ct. [A1)), respectively. This Abel transform [A1] ZYGMUND, A.: Trigonometric series, 1-2, Cambridge Univ,
is the case SL(2, R) of the Abel transform on a real semi- Press, 1979.
simple Lie group, cf. [A2]. [A2] Tsun, M.: Potential theory in modern function theory,
Maruzen, 1959.
A general source for integral equations is [A3].
References AMS 1980 Subject Classification: 40G 10
[A1] ERDELYI, A., MAGNUS, W., OBERHETINGER, F. and TRIcOMl,
F.G.: Tables of integral transforms, II, McGraw-Hili, 1954, ABEL PROBLEM - To find, in a vertical plane (s, r),
Chapt. 13.
[A2] GoDEMENT, R.: 'Introduction aux travaux de A. Selberg', in
a curve such that a material point moving along it
sem. Bourbaki, Vol. 144, 1957. under gravity from rest, starting from a point with
[A3] FENYO, S. and STOLLE, H.W.: Theorie und Praxis der linearen ordinate x, will meet the r-axis after a time T = f (x),
Integralgleichungen, 3, Birkhauser, 1984, Sect. 13.2.4.
where the function f (x) is given in advance. The prob-
AMS 1980 Subject Classification: 45E10 lem was posed by N.H. Abel in 1823, and its solution
involves one of the first integral equations - the Abel
ABEL - POISSON SUMMATION MEfHOD - One integral equation - which was also solved. In fact, if w
of the methods for summing Fourier series. The Fourier is the angle formed by the tangent of the curve being
series of a function fEL[O,271"] is summable by the sought with the r-axis, then
Abel- Poisson method at a point cp to a number S if
~ =- V2g(x -s) sinw.
lim j(p, </ = S,

where
p-->i-O
Integrating this equation between and x and putting
aO _.1_ = <t>(s), - -y2g<I>(x) = j(x),
= 2+ ~(akcosk</>+bksink</l,
00
j(p,</ smw
k=i
one obtains the integral equation
ji( "') =
p, 'I'
1- j" ji("'+ t)
'1T _" 'I'
1- p2 dt
2(1-2pcost+p2)'
(*)
f p(s)ds
o Vx-s
= j(x)
If fEC(O, 271"), then the integral on the right-hand side
for the unknown function cp(s), the determination of
is a harmonic function for I z I I pe i </> I <1, which is,
as has been shown by S. Poisson, a solution of the Diri- which makes it possible to find the equation of the
chlet problem for the disc. The Abel summation method curve being sought. The solution of the equation intro-
applied to Fourier series was therefore named the duced above is:
Abel- Poisson summation method, and the integral (*)
was named the Poisson integral.
<t>(x) =~
'1T
[1(Ql+ f ((T)dr].
Vx VX-T
0

If (p, cp) are polar coordinates of a point inside the


disc of radius one, then one can consider the limit of References
[1] ABEL, N.H.: 'Solutions de quelques problemes Ii l'aide
f(p, cp) as the point M(p, cp) approaches a point on the d'integrales defines', in Oeuvres completes, nouvelle Iid., Vol. I,
bounding circle not by a radial or by a tangential but Grondahl and Son, Christiania, 1881, pp. 11-27.
rather along an arbitrary path. In this situation the B. V Khvedelidze
Schwarz theorem applies: If fbelongs to L[O, 271"] and is Editorial comments. In the case that f(x)=const, this is
continuous at a point </>0, then the famous tautochrone problem first solved by Chr,
Huyghens, who showed that this curve is then a cycloid.
(P,</>~,</>o/(p, </ = j(</>o)
References
irrespective of the path along which the point M(p, cp) [A 1] JERRI, A.J.: Introduction to integral equations with applica-
approaches the point P(l, </>0) as long as that path tions, M. Dekker, 1985, Sect. 2.3,
remains within the disc with radius one. [A2] HOCHSTADT, H.: Integral equations, Wiley (Interscience),
1973.
References [A3] MOISEIWITSCH, B.L.: Integral equations, Longman, 1977.
[I] BARY, N.K. [N.K. BARI]: A treatise on trigonometric series, Per-
gamon, 1964 (translated from the Russian). A A Z kh AMS 1980 Subject Classification: 70D05, 45E10
. . a arov
Editorial comments. A theorem related to Schwarz' ABEL SUMMATION METHOD - One of the
theorem stated above is Fatou's theorem: If fEL[O, 2'1T], then methods for the summation of series of numbers. The
for al most all </>0 series
lim
(p, </-->(1, </>0)
f(p, </ = f(</>o)

6
ABEL THEOREM

can be summed by the Abel method (A -method) to the number R E[O, 00] such that if I z - b I <R the series is
number S if, for any real x, O<x < 1, the series convergent, while if I z - b I > R the series is divergent.
The number R is called the radius of convergence of the
series (*), while the disc I z - b I <R is known as the
disc of convergence of the series (*).
is convergent and
3) Abel's continuity theorem: If the power series (*)
converges at a point z 0 on the boundary of the disc of
convergence, then it is a continuous function in any
This summation method can already be found in the closed triangle T with vertices z 0, Z 1 , z 2, where z J, z 2
works of L. Euler and G. Leibnitz. The name 'Abel are located inside the disc of convergence. In particu-
summation method' originates from the Abel theorem lar
lim S(z) = S(ZO).
on the continuity of power series. The Abel summation z-+zo,
ZET
method belongs to the class of totally regular summa-
This limit always exists along the radius: The series (*)
tion methods and is more powerful than the entire set
converges uniformly along any radius of the disc of
of Cesaro summation methods. The Abel summation
convergence joining the points b and z o. This theorem
method is used in conjunction with Tauberian theorems
is used, in particular, to calculate the sum of a power
to demonstrate the convergence of a series.
series which converges at the boundary points of the
A closely related summation method is the A-
disc of convergence.
method. Let z be a complex number, I z I < 1; the
4) Abel's theorem on Dirichlet series: If the Dirichlet
series 00
series
~ak .p(S) = ~ane-'\"s, s=a+it, An>O,
k=O
n=1
is summed by the A -method to the number S if converges at the point So =00 +ito, then it converges in
00

lim ~akzk = S, the half-plane 0>00 and converges uniformly inside


k=O any angle I arg(s-so) I ~O<7T /2. It is a generaliza-
where z~ 1 along any path not tangent to the unit cir- tion of Abel's theorem on power series (take An =n and
cle. put e -s = z). It follows from the theorem that the
domain of convergence of a Dirichlet series is some
References
[I] HARDy, G.H.: Divergent series, Clarendon, 1949. half-plane o>c, where c is the abscissa of convergence
[2] BARY, N.K. [N.K. BARI]: A treatise on trigonometric series, Per- of the series.
gamon, 1964 (translated from the Russian). A A Z kh The following theorem is valid for an ordinary Diri-
. . a arov
chlet series (when An =lnn) with a known asymptotic
AMS 1980 Subject Classification: 40G10
behaviour for the sum-function An =al + ... +an of
the coefficients of the series: If
ABEL THEOREM - 1) Abel's theorem on algebraiC
equations: Formulas expressing the solution of an arbi- An = B nSI(lnn)a +O(n P),
trary equation of degree n in terms of its coefficients where B, s 1 ,0: are complex numbers, (3 is a real
using radicals do not exist for any n ;:'5. The theorem number, 01-1<{3<01, 01 =Resl, then the Dirichlet
was proved by N.H. Abel in 1824 [1]. Abel's theorem series converges for 01 <0, and the function cp(s) can be
may also be obtained as a corollary of Galois theory, regularly extended to the half-plane (3<0 with the
from which a more general theorem follows: For any exception of the point s = s I. Moreover
n;:' 5 there exist algebraic equations with integer coeffi- .p(s) = B f(u+ l)s(s -sd- a - I +g(s)
cients whose roots cannot be expressed in terms of rad-
if 0:*-1, -2, ... ,and
icals of rational numbers. For a modern formulation of
Abel's theorem for equations over an arbitrary field, see AJ) -- B (_u_l)!ss-si
'i'\S (_l}-a (
ns-si )+ gs
)-a-Il ( ()
Algebraic equation.
2) Abel's theorem on power series: If the power series if 0: = - 1, - 2, . . .. Here g (s) is a regular function for
00 0>{3.
S(z) = ~ak(z-bt, (*) E.g., the Riemann zeta-function t{s) (An =n, B = 1,
s 1 = I, 0: =0, (3)0) is regular at least in the half-plane
k=O

where ab b, z are complex numbers, converges at 0>0, with the exception of the point s = 1 at which it
z = z 0, then it converges absolutely and uniformly has a first-order pole with residue 1. This theorem can
within any disc I z -b I ~p of radius p< I Zo -b I and be generalized in various ways. E.g., if
with centre at b. The theorem was established by N.H. k
Abel [2]. It follows from the theorem that there exists a An = ~B)nSj(lnntj +O(n P),
)=1

7
ABEL THEOREM

where Bj , Sj' aj (l:S;;;;j:S;;;;k), are arbitrary complex some of the characteristic properties of the category of
numbers, and Ok -1 <{3<Ok < ... <01, then the Diri- all Abelian groups. Abelian categories were introduced
chlet series converges for 0>0J, and <Pes) is regular in as the basis for an abstract construction of homological
the domain 0> {3 with the exception of the points algebra [4]. A category ~ is said to be Abelian [2] if it
S 1, . ,Sk at which it has algebraic or logarithmic satisfies the following axioms:
singularities. Theorems of this type yield information AO. A null object exists (cf. Null object of a
on the behaviour of the Dirichlet series in a given half- category).
plane, based on the asymptotic behaviour of An. AI. Each morphism has a kernel (d. Kernel of a
References morphism in a category) and a cokernel.
[1] ABEL, N.H.: Oeuvres completes, Vol. 1, Christiania, 1881. A2. Each monomorphism is a normal monomorphism,
[2] ABEL, N.H.: 'Untersuchungen tiber die Reihe so that it occurs as the kernel of a morphism; each epi-
l+mx /2+m(m-l)x 2 /(2,1)+", ',J. Reine Angew. Math.
I (1826), 311-339. morpliism is a normal epimorphism.
[3] MARKUSHEVICH, A.!.: Theory offunctions of a complex variable, A3. A product and a coproduct exist for each pair of
1, Chelsea, 1977 (translated from the Russian). objects (cf. Product of a family of objects in a category).
L.P. Kuptsov In defining an Abelian category it is often assumed
Editorial comments. More on Abel's theorems 2) - 4) that ~ is a locally small category (d. Small category).
can be found in [A 1]. The coproduct of two objects A and B of an Abelian
References category is also know as the direct sum of these objects
[A1] HARDY, G.H.: Divergent series, Clarendon, 1949. and is denoted by A EB B, A liB or A -+ B.
AMS 1980 Subject Classification: 10HXX, 12AXX, Examples of Abelian categories. 1) The dual category
12E12, 30830, 30815,40G10 of an Abelian category is also an Abelian category.
2) The category R we of all unitary left modules over
ABEL TRANSFORMATION, summation by parts - A an arbitrary associative ring R with a unit element and
transformation all R-module homomorphisms is an Abelian category
N N-' (e.g. the category of all Abelian groups).
~akbk = aNBN-a,B o - ~Bk(ak+,-ak)'
3) Any full subcategory of an Abelian category, that
k=' k='
is, a subcategory that contains for each one of its mor-
where ak> bk are given, Bo is arbitrarily selected, and
phisms also the kernel and cokernel of that morphism,
Bk = Bk-,+bk =Bo+b,+ ... +bb k=I, ... ,N. and that contains for each pair of objects A and B also
The Abel transformation is the discrete analogue of the their product and coproduct, is an Abelian category.
formula for integration by parts. The small Abelian categories are exhausted by the
If aW"~O and if the sequence {Bk} is bounded, then subcategories of the above type of categories R we of
the Abel transformation can be applied to the series unitary left modules. Accordingly, the following
00 00 Mitchell theorem is valid: For each small Abelian
~akbk = ~(ak -ak+,)Bk -a,Bo category there exists a full exact imbedding into some
k=' k='
category R we.
The Abel transformation is used to prove several cri-
4) Any category of diagrams i5{'tI, 21), with diagram
teria of convergence of series of numbers and functions
scheme 'tI over an Abelian category ~, is an Abelian
(cf. Abel criterion). The Abel transformation of a series
category. In the scheme 'V one may distinguish the set
often yields a series with an identical sum, but with a
C of commutativity relations, i.e. the set of pairs (cp, !f;)
better convergence. It is also regularly used to obtain
of paths CP=(CP1,'" ,CPn), !f;=(!f;I,'" ,!f;m) in 'tI with a
certain estimates (cf. Abel inequality), in particular, for
common begin and end. Then the complete subcategory
investigations on the rate of convergence of a series. It
of the category i5{'tI, ~) generated by all those diagrams
was introduced by N.H. Abel [1].
D : 'tI~~ that satisfy
References
[I] ABEL, N.H.: 'Untersuchungen tiber die Reihe D(</ = D(</>,) ... D(</>n) = D(l/;,) ... D(l/;m) = D(l/;)
I+mx /2+m(m-1)x 2 /(2'1)+'" ',l. Reine Angew. Math.
1(1826),311-339. is an Abelian category. In particular, if 1) is a small
[2] MARKUSHEVICH, A.I.: Theory offunctions of a complex variable, category and if the set C consists of all pairs of the
I, Chelsea, reprint, 1977 (translated from the Russian).
[3] WHITTAKER, E.T. and WATSON, G.N.: A course of modern
form (a/3, y) with y=a/3, then the corresponding sub-
analysis, 1-2. Cambridge Univ. Press, 1952. category is the Abelian category of one-place covariant
L.P. Kuptsov
functors from 'tI into ~ (cf. Functor).
AMS 1980 Subject Classification: 65810 Suppose that a null object exists in a small category
1). A functor F: '1)~~ will then be called normalized if
ABELIAN CATEGORY - A category that displays it takes a null object into a null object. The complete

8
ABELIAN DIFFERE]\iTIAL

subcategory of the category of functors generated by ~ with finite products is Abelian if and only if it is
the normalized functors will then be an Abelian additive and if any morphism ex has a kernel and a co-
category. In particular, if ~ is a category the objects of kernel and can be decomposed into a product
which are all integers and the null object is N, while the a = Coker(Kera) 8 ker(Cokera),
non-null non-identity morphisms form a sequence
L, do d,
in which () is an isomorphism.
"'~(-1)~O~1~2~'" The Mitchell theorem quoted above constitutes the
underlying principle of the so-called 'diagram-chasing'
in which dndn + 1 =0, n=O, -+-1, -+-2, ... , then the
corresponding subcategory generated by the normalized method in an Abelian category: Any proposition about
functors is called the category of complexes over ~. On commutative diagrams that is valid for all categories of
the category of complexes there are defined the additive left modules R fIR and that is a consequence of the
functors zn, B n , H n of n-dimensional cycles, n- exactness of certain sequences of morphisms, is valid in
dimensional boundaries and n-dimensional homologies, all Abelian categories.
respectively, with values in~. These constitute the In a locally small Abelian category, the ~-subobjects
basis for the development of homological algebra. of an arbitrary object form a Dedekind lattice. If pro-
ducts (or coproducts) of any family of objects exist in
5) A complete subcategory ~I of an Abelian category
~, this lattice will be complete. These conditions are
~ is said to be dense if it contains all subobjects (d.
known to be met if there is a generating object U in ~
Subobject) and quotient objects (d. Quotient object) of
and if the coproducts
its objects, and if for the exact sequence
O~A~B---,>C~O
II
iEJ
Vj, V, = V,

BEOb~1 if and only if A, CEOb~I' The quotient exist for any set I. These conditions are satisfied, for
category ~ / ~I is then constructed as follows. Let instance, by Grothendieck categories (d. Grothendieck
(R, J.t] be a subobject of the direct sum A EB B with pro- category), which are equivalent to the quotient
jections '171, '172 and let the square categories of the category of modules by a localizing
subcategory (the Gabriel- Popescu theorem).
R ---'> A
References
/L'172 ~ ~a
[1] BUCUR, I. and DELEANU, A.: Introduction to the theory of
B ---'> X categories and functors, Wiley, 1968.
fJ
[2] FREYD, P.: Abelian categories: An introduction to the theory of
be co-universal (i.e. a co-fibred product). The subobject functors, Harper and Row, 1964.
(R, J.t] is called an ~I-subobject if Cokeq.L'17I, Kerj3E~I' [3] GABRIEL, P.: 'Des categories Abeliennes', Bull. Soc. Math.
France 90 (1962), 323-448.
Two ~I-subobjects are equivalent if they contain some [4] GROTHENDIECK, A.: 'Sur quelques points d'algebre homo1o-
~I-subobject. By definition, the set H iJI / iJI , (A, B) con- gique', Tohoku Math. J. 9 (1957), 119-221. M Sh T. I k
. . sa en 0
sists of the equivalence classes of ~I-subobjects. The
usual multiplication of binary relations is compatible Editorial comments. Composition of morphisms is writ-
with the equivalence relation thus introduced, which ten from left to right in this article; i.e. # denotes the com-
position of <jl: A~B, >J;: B~C. A dense subcategory is more
makes it possible to construct the quotient category
often called a Serre subcategory.
~ / ~I' which is an Abelian category. The faithful
functor T: ~~~ / ~I is defined by assigning to each References
[A 1] MITCHELL, 8.: Theory of categories, Acad. Press, 1965.
morphism ex:A-'?B its graph in A EBB. A subcategory [A2] POPESCU, N.: Abelian categories with applications to rings
~I is said to be a localizing subcategory if T has a com- and modules, Acad. Press, 1973.
plete univalent right-adjoint functor Q: ~ / ~I -'?~.
AMS 1980 Subject Classification: 18EXX
6) For any topological space X the category of left
G-modules over X, where G is a sheaf of rings with unit ABELIAN DIFFERENTIAL - A holomorphic or
element over X, is an Abelian category. meromorphic differential on a compact, or closed,
It is possible to introduce into any Abelian category Riemann surface S (d. Differential on a Riemann sur-
~ a partial sum of morphisms so that A becomes an face).
additive category. For this reason the product and the Let g be the genus of the surface S (d. Genus of a
coproduct of any pair of objects in an Abelian category surface); let alb I ... agbg be the cycles of a canonical
are identical. Moreover, in defining an Abelian basis of the homology of S. Depending on the nature
category it suffices to assume the existence of either of their singular points, one distinguishes three kinds of
products or coproducts. Any Abelian category is a Abelian differentials: I, II and III, with proper inclu-
bicategory with a unique bicategorical structure. These sions I clI CIII. Abelian differentials of the first kind
properties characterize an Abelian category: A category are first-order differentials that are holomorphic every-

9
ABELIAN DIFFERENTIAL

where on S and that, in a neighbourhood U of each a_ n


[- + ... a_I]
+-+f(z) dz, (3)
point POES, have the form w=pdz=p(z)dz, where zn z
z = x + iy is a local uniforrnizing variable in U, where f (z) is a regular function, n is the order of the
dz =dx +i dy, and p(z) is a holomorphic, or regular,
pole (if a -n=l=O), and a - I is the residue of the pole. If
analytic function of z in U. The addition of Abelian n = 1, the pole is said to be simple. An Abelian dif-
differentials and multiplication by a holomorphic func- ferential of the second kind is a meromorphic differen-
tion are defined by natural rules: If tial all residues of which are zero, i.e. a meromorphic
w = pdz, 'TT = qdz, a = a(z), differential with local representation
then
w+'TT = (p+q)dz, aw = (ap)dz. [ a_
n + ... + a-2 +f(Z)]dZ,
zn Z2
The Abelian differentials of the first kind form a g-
An Abelian differential of the third kind is an arbitrary
dimensional vector space 21. After the introduction of
the scalar product Abelian differential.
Let w be an arbitrary Abelian differential with A-
(w, 'TT) = f f w*"iT,
periods A I, . . . ,Ag; the Abelian differential
s
w' = w - A I </>1 - ... - Ag</>g then has zero A -periods
where w*7i is the exterior product of w with the star-
and is known as a normalized Abelian differential. In
conjugate differential 7i, the space 21 becomes a Hilbert
particular, if PI and P 2 are any two points on S, one
space.
can construct a normalized Abelian differential wI,2
Let A I, B I, . . . ,Ag, Bg be the A- and B-periods of
the Abelian differential of the first kind w, i.e. the with the singularities (1 / z)dz in PI and (-1/ z)dz in
P 2 , which is known as a normal Abelian differential of
integrals
the third kind. Let w be an arbitrary Abelian differential
Aj = fw, Bj = fw, j=l, ... ,g. with residues c I, . . . ,Cn at the respective points
aJ hJ
PI,'" ,Pn ; then, always, CI + ... +cn=O. If Po is
The following relation then holds:
any arbitrary point on S such that Po=l=Pj ,
II w 112 = ((A)ij-BjA;) ;;;. O. (1) ) = 1, ... ,n, then w can be represented as a linear
j=\ combination of a normalized Abelian differential of the
If A'I, B'I, ... ,A~, B~ are the periods of another second kind W2, a finite number of normal Abelian dif-
Abelian differential of the first kind 'TT, then one has ferentials of the third kind Wj,O, and basis Abelian dif-
ferentials of the first kind </>k:
i(w, "iT) = (A;B) - BjA)) = O. (2)
j=\
w = W2 + c;w;,o + Ak<l>k'
The relations (1) and (2) are known as the bilinear j = \ k =\

Riemann relations for Abelian differentials' of the first Let W3 be an Abelian differential of the third kind with
kind. A canonical basis of the Abelian differentials of only simple poles with residues Cj at the points Pj'
the first kind, i.e. a canonical basis </>1, ... , </>g of the ) = 1, ... , n, and let WI be an arbitrary Abelian dif-
space 21, can be chosen so that ferential of the first kind:
Aij = f<l>l = B,; , Ak = fW\, Bk = fW\,
Q) Qk hk

where 8;; = 1 and 8ij =0 if )=I=i. The matrix (B;), A~ = fW3' B~ = fW3' k=l, ... ,g,
i,) = 1, ... ,g, of the B-periods Ok hA,

where the cycles ak, bk do not pass through the poles of


BIj = f<l>l
b, W3' Let the point Po ES not lie on the cycles ab bk and
IS then symmetric, and the matrix of the imaginary let L j be a path from Po to Pj' One then obtains bi-
parts (1m B;j) is positive definite. An Abelian differen- linear relations for Abelian differentials of the first and
tial of the first kind for which all the A-periods or all third kinds:
the B-periods are zero is identically equal to zero. If all
the periods of an Abelian differential of the first kind w (AkB~-BkA~)
=I
= 27Td>JWl'
are real, then w = O. k ; - I l'j

Abelian differentials of the second and third kinds are, Bilinear relations of a similar type also exist between
in general, meromorphic differentials, i.e. analytic dif- Abelian differentials of the first and second kinds.
ferentials which have on S not more than a finite set of In addition to the A- and B-periods Ab Bb
singular points that are poles and which have local k = I, ... ,g, known as the cyclic periods, an arbitrary
representations Abelian differential of the third kind also has polar

10
ABELIAN FUNCTION

periods of the form 27Ticj along zero-homologous cycles cept of an eUiptic function of one complex variable to
which encircle the poles Pj' One thus has, for an arbi- the case of several complex variables. A function f (z)
trary cycle 'I, in the variables z 1, . . . ,Zp' Z =(z 1, . . . ,zp), which is
meromorphic in the complex space a, p;;;. I, is called
fW3 = (lkA k+ lg+k B k)+27Tim/j,
y k=1 ;=1
an Abelian function if there exist 2p row vectors in a

where h, 19+k' and mj are integers.


Important properties of Abelian differentials are which are linearly independent over the field of real
described in terms of divisors. Let (w) be the divisor of numbers and are such that fez +w.)= fez) for all
the Abelian differential w, i.e. (w) is an expression of ZEa, p=I, ... ,2p. The vectors w. are called the
the type (w) = p~l ... P~', where the Prs are all the periods or the system of periods of the Abelian function
zeros and poles of wand where the ars are their multi- f (z). All periods of the Abelian function f (z) form an
plicities or orders. The degree d[(w)]=a] + ... +an of Abelian group r under addition, which is known as the
the divisor of the Abelian differential w depends only period group (or the period module). A basis of this
on the genus of S, and one always has d[(w)]=2g-2. group is known as a basis system of periods of the
Let a be some given divisor. Let n(a) denote the com- Abelian function, or also as a system of basic (or primi-
plex vector space of Abelian differentials w of which tive) periods. An Abelian function fez) is said to be
the divisors (w) are multiples of a, and let L(a) denote degenerate if there exists a linear transformation of the
the vector space of meromorphic functions f on S of variables z], ... ,Zp which converts f (z) into a func-
which the divisors (f) are multiples of a. Then tion of fewer variables; otherwise f (z) is said to be a
dimQ(a)=dimL(a / (w)). Other important information non-degenerate Abelian function. Degenerate Abelian
on the dimension of these spaces is contained in the functions are distinguished by having infinitely small
Riemann - Roch theorem: The equality periods, i.e. for any number >0 it is possible to find a
dimL(a -I )-dim~(a) = d[a]- g + 1 period w. for which
is valid for any divisor a. It follows from the above, for
example, that if g = 1, i.e. on the surface of a torus, a
II wp II = Vk~1 I wkp 12 < {.
meromorphic function cannot have a single simple pole. If p = I, the non-degenerate Abelian functions are ellip-
Let S be an arbitrary compact Riemann surface on tic functions of one complex variable. Each Abelian
which there are meromorphic functions z and w which function with period group r is naturally identified
satisfy an irreducible algebraic equation F(z, w) = O. with a meromorphic function on the complex torus
Any arbitrary Abelian differential on S can then be a/ r, i.e. the quotient space a/
r (cf. Quasi-Abelian
expressed as w=R(z, w)dz where R(z, w) is some function).
rational function in z and w; conversely, the expression The study of Abelian functions began in the 19-th
w=R(z, w)dz is an Abelian differential. This means century in connection with the inversion of Abelian
that an arbitrary Abelian integral integrals (cf. Abelian integral) of the first kind (cf.
fR(z, w)dz = fw Jacobi inversion problem, [1], [2]). The Abelian func-
tions obtained in solving this problem are called speCial
is the integral of some Abelian differential on a com-
Abelian functions; sometimes, in earlier work, only such
pact Riemann surface S.
functions were considered to be Abelian functions. Let
See also Algebraic function.
u], ... ,up be linearly independent normal Abelian
References integrals of the first kind, constructed on a Riemann
[I] SPRINGER, G.: Introduction to Riemann surfaces, Addison-
Wesley, 1957, Chapt.lO.
surface F:
x x
= f du I, . . , up = f dup'
[2] NEVANLINNA, R.: Uniformisierung, Springer, 1953. Chapt.5.
[3] CHEBOTAREV, N.G.: The theory of algebraic functions, U1
c] Cp
Moscow-Leningrad, 1948, Chapt.3;8 (in Russian).
E.D. Solomentsev and let
Editorial comments. Another good reference, replacing XI Xp

[3], is [A1]. Zj = fdu;+ ... + fdu;, )=1, ... ,p,


Cl Cp

References
[A1] LANG, S.: Introduction to algebraic and abelian functions, be a given system of sums in which the lower integra-
Addison-Wesley, 1972. tion limits c 1, . . . ,cp are considered as fixed on the
AMS 1980 Subject Classification: 14H05, 30F30 surface F. It is then possible to define the special
Abelian functions as all rational functions in the p
ABELIAN FUNCflON - A generalization of the con- coordinates of the upper limits x], ... ,xp ' the latter

11
ABELIAN FUNCTION

being in turn considered as functions of p points wi


z 1, . . . ,zp on F. In the symbolic notation introduced
W=
by K. Weierstrass, any special Abelian function Al(z) o
may be represented as
The Riemann relations between the elements of the
Al(z) = Al(zb ... ,zp) = matrix "ajk ,I, j, k = 1, ... ,p, now ensure the sym-
= R[X\(Zb ... ,zp), ... ,XP(Zb ... ,zp)]. metry of the matrix, ajk =akj' and the positive definite-
ness of the matrix of real parts II Re ajk II,
The complex tori a / r corresponding to special j, k = 1, ... ,p. However, for p>3 there will be only
Abelian functions are the Jacobi varieties of algebraic 3(p - 1) independent elements among the elements ajk
curves. of II ajk II, i.e. as many as the number of conformal
The matrix W whose columns form a period basis of moduli for the Riemann surface F on which the inver-
the Abelian function f (z) has dimension p X 2p and is sion problem is solved (cf. Moduli of a Riemann sur-
known as the period matrix of the Abelian function face). In addition to the Riemann relations there are in
f (z). A necessary and sufficient condition for a given such a case (p - 2)(P - 3) / 2 transcendental relations
matrix W of dimension p X 2p to be the period matrix between the ajb the explicit form of which for the case
of some non-degenerate Abelian function f (z) is for p =4 was first found by E. Schottky in 1886; for a
the matrix to satisfy the following conditions (the review of subsequent advances in the field see [5].
Riemann- Frobenius conditions). It must be a Riemann Special Abelian functions can be represented as the
matrix, i.e. there must exist an anti-symmetric non- quotient of two entire theta-functions with half-integral
degenerate square matrix M with integer elements, of characteristics of a special kind. These representations
order 2p, and such that 1) WMW T =0, where WT is yield a number of properties of special Abelian func-
the transposed matrix of W; and 2) the matrix tions which generalize many properties of elliptic func-
iWMW T defines a positive-definite Hermitian form [3]. tions. Thus, derivatives of an Abelian function f (z)
If the conditions 1) and 2) are expressed as equations with respect to any argument Zj are Abelian functions;
and inequalities respectively, a system of p (p - 1) / 2 any p + 1 Abelian functions are related by some alge-
Riemann equations and p (p - 1) / 2 Riemann inequalities braic equation; any Abelian function can be rationally
is obtained. The number p is called the genus of the expressed in terms of some p + 1 Abelian functions, e.g.
matrix W and of the corresponding Abelian function by means of an arbitrary Abelian function and its p
f(z). The columns w v = Re w v + i 1m w v of W, regarded partial derivatives of the first order; Abelian functions
as vectors in the real Euclidean space R2p, define the satisfy the addition theorem, i.e. the value of an Abelian
period parallelotope of f (z). function at the point a + b E a may be rationally
All Abelian functions corresponding to the same expressed in terms of the values of a certain (p + 1)-
period matrix W form an Abelian function field Kw. If tuple of Abelian functions at the points a, b EO'.
the field Kw contains a non-degenerate Abelian func- Abelian functions are very important in algebraic
tion, its degree of transcendence over the field C is p; geometry as a means of uniformization of algebraic
the torus 0' / r is then an Abelian variety, and Kw varieties of certain types.
turns out to be its field of rational functions. If, on the
References
other hand, all Abelian functions of Kw are degenerate, [1] SPRINGER, G.: Introduction to Riemann surfaces, Addison-
then K w is isomorphic to the field of rational functions Wesley, 1957.
on an Abelian variety of dimension lower than p. See [2] CHEBOTAREV, N.G.: The theory of algebraic functions,
Moscow-Leningrad, 1948 (in Russian).
also Quasi-Abelian function. [3] SIEGEL, c.L.: Automorphe Funktionen in mehrerer Variablen,
As in the case of elliptic functions, any Abelian func- Math. Inst. Gbttingen, 1955.
tion can be represented as the quotient of two entire [4] WElL, A.: Introduction a l'hude des varihes kahleriennes, Her-
mann, 1958.
transcendental theta-functions (d. Theta-function),
[5] MUMFORD, D.: Curves and their Jacobians, Univ. of Michigan
which in turn can be represented as theta-series. A Press, 1975.
given Riemann matrix W determines a class of theta- [6] STAHL, H.: Theorie der Abelschen Funktionen, Leipzig, 1896.
series by means of which all Abelian functions of the [7] CLEBSCH, A. and GORDAN, P.: Theorie der Abelschen Funk-
tionen, Teubner, 1866.
field Kw can be constructed. [8] CONFORTO, F.: Abelsche Funktionen und algebraische
For special Abelian functions, the matrix W can Geometrie, Springer, 1956.
E.D. Solomentsev
always be brought - by means of a linear transforma-
tion of the independent variables z 1, . . . , Zp - to the Editorial comments. A basis system of periods is also
form known as a period basis. Instead of [5] one can also con-

12
ABELIAN GROUP

suit the more recent [A 1]. The Riemann - Frobenius condi- ite and primary, while the others are infinite (G. Fro-
tions are also known as the Riemann bilinear relations or benius, L. Stickelberger). In particular, finite Abelian
simply as the Riemann relations. The Schottky problem con- groups split into a direct sum of primary cyclic groups.
cerns the description ot the space of all Jacobians in the Such splittings are, in general, not unique, but any two
space of matrices satisfying the Riemann conditions. It has splittings of a finitely generated Abelian group into
recently been solved, ct. Jacobi variety. direct sums of non-split cyclic groups are isomorphic,
References so that the number of infinite cyclic summands and the
[A1] ARBELLO, E., CORNALBA, M., GRIFFITHS, P.A. and HARRIs, 1.: collection of the orders of the primary cyclic summands
Geometry of algebraic curves, 1, Springer, 1985.
do not depend on the splittings chosen. These numbers
AMS 1980 Subject Classification: 33A25, 32A20 are called invariants of the finitely generated Abelian
group. They constitute a complete system of invariants,
ABELIAN GROUP - A group whose operation is in the sense that two (finitely generated) Abelian
commutative (cf. Commutativity). They are named after groups are isomorphic if and only if they have the same
N.H. Abel, who used such groups in the theory of solv- invariants. Each subgroup of a finitely generated
ing algebraic equations by means of radicals. It is cus- Abelian group is itself finitely generated.
tomary to write the operation in an Abelian group in Not every Abelian group is a direct sum of (even
additive notation, i.e. to use the plus sign (+ ) for that infinitely many) cyclic groups. For primary groups
operation, called 'addition', and the zero sign (0) for there is a necessary and sufficient condition for the
the neutral element, named zero (in multiplicative nota- existence of such splittings, the so-called Kulikov cri-
tion it is called the unit element). terion. Let A be a primary Abelian group for a prime
Examples of Abelian groups. All cyclic groups (cf. number p. A non-zero element a of A is said to be an
Cyclic group) are Abelian, in particular, the additive element of infinite height in A if for any integer k the
group of integers. All direct sums (cf. Direct sum) of equation p k X = a is solvable in A, and a is called an ele-
cyclic groups are Abelian. Also the additive group of ment of height n if this equation is solvable for k ~n
rational numbers Q is Abelian; it is moreover a locally only. The Kulikov citerion states that a primary Abelian
cyclic group, i.e. a group in which all finitely generated group A is a direct sum of cyclic groups if and only if
subgroups are cyclic. Finally, the groups of type p 00 it is the union of an increasing sequence of subgroups
(or the quasi-cyclic groups Zp~ ), where p is an arbi- such that the set of heights of the elements in each one
trary prime number, are Abelian (cf. Group of type poo; of these subgroups is bounded. Any subgroup of an
Quasi-cyclic group). Abelian group that is a direct sum of cyclic groups
The free composition in the variety of Abelian groups itself is such a direct sum. The indecomposable (non-
coincides with the direct sum. A free Abelian group is a splittable) into direct sums primary Abelian groups are
direct sum of infinite cyclic groups. Every subgroup of the primary cyclic groups and the group Zp~.
a free Abelian group is free Abelian. The set of all ele- A finite set of elements gl, ... , gk in an Abelian
ments of finite order in an Abelian group forms a sub- group is called linearly dependent if there exist integers
group, which is called the torsion subgroup (periodic n I, . . . , nk, not all equal to zero, such that
part) of the Abelian group. The quotient group of an ~~=lnigi=O. If such numbers do not exist, the set is
Abelian group by its torsion subgroup is a group said to be linearly independent. An arbitrary collection
without torsion. Thus, every Abelian group is an exten- of elements of A is said to be linearly dependent if
sion of a torsion-free group by a torsion group. This there exists a finite linearly dependent subcollection.
extension does not always split, i.e. the torsion group is An Abelian group that is not a torsion group has maxi-
usually not a direct summand. An Abelian torsion mal linearly independent sets. The cardinality of all
group in which the order of every element is a power of maximal linearly independent collections of elements is
a fixed prime number p is said to be primary with the same and is called the rank (Profer rank) of the
respect to p (the term p-group is used in general group given Abelian group. The rank of a torsion group is
theory). Every torsion group splits uniquely into a considered to be zero. The rank of a free Abelian group
direct sum of primary groups that correspond to dis- coincides with the cardinality of a set of free generators
tinct prime numbers. of it.
A complete classification is known for finitely gen- Every torsion-free Abelian group of rank 1 is iso-
erated Abelian groups. This is given by the fundamental morphic to some subgroup of the additive group of
theorem of finitely generated Abelian groups: Every fin- rational numbers. There exists a complete description
itely generated Abelian group is a direct sum of finitely of such groups in the language of types. Each element
many non-split cyclic subgroups some of which are fin- of an Abelian group without torsion determines a

13
ABELIAN GROUP

characteristic, which is a countable sequence consisting of all extensions of one group by another. The theory
of non-negative numbers and the symbol 00. This of modules is closely connected with Abelian groups
sequence is constructed in the following manner. Sup- regarded as modules over the ring of integers. Many
pose that the prime numbers are enumerated in increas- results in the theory of Abelian groups can be applied
ing order PI.P2, . ... An element a determines the to the case of modules over a principal ideal ring.
sequence whose k-th element is equal to Sk if the equa- Owing to their relative simplicity and to the fact that
tion a =I'i: x is solvable in the group but the equation they have been very thoroughly studied (which is con-
a = pi: + x is not solvable, and the k-th element is equal firmed, for instance, by the solvability of the elemen-
to 00 if a =PkX is solvable for all s. Two characteristics tary theory of Abelian groups), and to the availability
are equivalent if they are equal except possibly for a of a sufficient variety of objects, Abelian groups serve
fInite number of places and if the symbol 00 occurs in as a constant source of examples in various fields of
exactly the same places in both sequences. The charac- mathematics.
teristics of two linearly dependent elements are equal. References
A class of equivalent characteristics is said to be a type. [I] KUROSH, AG.: The theory of groups, 1-2, Chelsea, reprint,
To each torsion-free Abelian group of rank I there 1955-1956 (translated from the Russian).
[2] FUCHS, L.: Abelian groups, Hungarian Acad. Sci., Budapest,
corresponds a uniquely determined type, called the type 1958.
of the given group; non-isomorphic groups having dif- [3] FUCHS, L.: Infinite abelian groups, I, Academic Press, 1970.
ferent types. [4] KAPLANSKY, I.: Infinite Abelian groups, Michigan Univ. Press,
1954.
A torsion-free Abelian group that splits into a direct [5] Itogi Nauk. Algebra Topol. Geom 1965 (1967), 9-44.
sum of groups of rank I is said to be completely split. [6] MiSHINA, AP.: 'Abelian groups', J. Soviet Math. 2, no. 3
Not every subgroup of a completely split group is (1974), 239-263. (Itogi Nauk. i Tekhn. Algebra Topol. Geom 10
completely split, though every direct summand is. For (1972),5-45) v
~u.
L E hrs ov
each integer n there exists a torsion-free Abelian group Editorial comments. For an introduction to the theory of
of rank n that cannot be decomposed (or split) into a finite rank torsion-free Abelian groups as it has developed
direct sum. For countable torsion-free Abelian groups since the publication of the second volume [A2] of [3] in
one can to construct a complete system of invariants. 1973 cf. [A1]. The proceedings [A3] - [A5] give a good idea
An Abelian group is called complete, or divisible, if of recent results and current research in Abelian groups.
for anyone of its elements a and for any integer m the References
equation mx =a has a solution in the group. All divisi- [A1] ARNOLD, D.M.: Finite rank torsion free abelian groups and
ble Abelian groups turn out to be direct sums of groups rings, Lect. Notes in Math., 931, Springer, 1982.
[A2] FUCHS, L.: Infinite abelian groups, 2, Acad. Press, 1973.
isomorphic to Q and the groups Zp~ , and the cardinal- [A3] GOBEL, R. and WALKER, E. (EDS.): Abelian group theory,
ities of the sets of components isomorphic to Q, as well Lect. Notes in Math., 874, Springer, 1981.
as to Zp~ (for each p), from a complete and indepen- [A4] GOBEL, R., LADY, L. and MADER, A (EDS.): Abelian group
dent system of invariants of the divisible group. Every theory, Lect. Notes in Math., 1006, Springer, 1983.
[A5] GOBEL, R., METELLI, c., ORSAITI, A and SALCE, L. (EDS.):
Abelian groups can be isomorphically imbedded in Abelian groups and modulus, CISM courses and lectures,
some divisible Abelian group. The divisible Abelian 287, Springer, 1984.
groups and only they are the injective objects in the AMS 1980 Subject Classification: 20KXX
category of Abelian groups. A divisible Abelian group
is a direct summand of each Abelian group containing ABELIAN INTEGRAL, algebraic integral - An
it. Therefore, an Abelian group is a direct sum of a integral of an algebraic function, i.e. an integral of the
divisible Abelian group and a so-called reduced group, form z]

i.e. a group that contains no non-trivial divisible sub- jR(z, w)dz, (1)
groups. A classification of reduced Abelian groups is Zo

known only in certain special cases. Thus, VIm's where R(z, w) is some rational function in variables
theorem ([ I]) gives the classification of all countable z, w that are related by an algebraic equation
reduced Abelian torsion groups.
The theory of Abelian groups has its origins in
F(z, w) - ao(z)w n +al(z)w n -I + ... +an(z) = 0 (2)

number theory, and is now extensively applied in many with coefficients a/z) that are polynomials in z,
modern mathematical theories. Thus, the duality theory j = 0, ... , n. To equation (2) there corresponds a com-
of characters for finite Abelian groups has been consid- pact Riemann surface F which is an n-sheeted covering
erably extended to the duality theory of locally com- of the Riemann sphere. On this Riemann surface z, w,
pact Abelian groups. The development of homological and consequently also R(z, w), can be considered as
algebra has made it possible to solve a whole series of single-valued functions of the points on F.
problems in Abelian groups, such as classifying the set The integral (1) is then given as the integral w of i
14
ABELIAN INTEGRAL

the Abelian differential w=R(z, w)dz on F taken along the second kind is said to be an Abelian integral of the
some rectifiable path L. In general, specifying only the second kind. Considered as a function of the upper
initial and end points z 0 and z 1 of the path L does not bound, it has no singularities anywhere on F except for
completely determine the value of the Abelian integral poles. An Abelian integral of a normalized Abelian
(1), or, which is the same, the integral (1) turns out to differential of the second kind is known as a normal
be a multi-valued function of the initial and end points Abelian integral of the second kind.
of the path L. An Abelian integral of the third kind is an arbitrary
The behaviour of an Abelian integral on F depends, Abelian integral. It usually has logarithmic singularities
first of all, on the topological structure of F, in particu- on F; however, such singularities can only occur in
lar on the topological invariant called the genus g of pairs. An Abelian integral of a normal Abelian dif-
the surface F (cf. Genus of a surface). The genus g is ferential of the third kind is called a normal Abelian
connected with the number of sheets n and with the integral of the third kind. Any Abelian integral can be
number of branch points II (counted with multiplicities) represented as a linear combination of normal Abelian
by the relation g = (II / 2) - n + 1. For g = 0 the vari- integrals of the first, second and third kinds. Unlike
ables z and w can be rationally expressed in some Abelian integrals of the first and second kinds, Abelian
parameter t, and the Abelian integral becomes the integrals of the third kind usually also have the so-
integral of a rational function in t. This will happen, called polar periods, beside the A- and B-periods (which
for example, in the elementary cases w 2 =aOz+al and are called cyclic periods). Polar periods are taken along
w 2 =aoz 2 +alz +a2' cycles which are homologous to zero, but encircle the
If g:? 1, any Abelian integral can be expressed in the logarithmic singularities of the Abelian integral. They
form of a linear combination of elementary functions are caused by the poles of the Abelian differential w
and canonical Abelian integrals of the three kinds. The with non-zero residues.
integral i w is called an Abelian integral of the first A number of relations depending on the topological
and conformal structure of F exist for arbitrary Abelian
kind if w is an Abelian differential of the first kind. In
other words, Abelian integrals of the first kind are integrals on the same Riemann surface F. Thus, if wP,P J

characterized by the fact that for a fixed initial point z 0 is a normal Abelian differential of the third kind with
of the path L they are a function of the upper bound simple poles in Pi and Pi' then the following theorem
z 1 that is an everywhere finite, usually multi-valued, on the permutation of the parameters and the bounds of
function on F. Such a characterization may be used, an Abelian integral of the third kind holds for all
for example, to construct analogues of Abelian integrals points PI, P 2, P 3, P 4 :
of the first kind on non-compact Riemann surfaces. P2 P4

Any Abelian integral of the first kind can be f


PI
WP J P 4 == f
PJ
WP1P2'

represented in the form of a linear combination of g


linearly-independent normal Abelian integrals of the first Relations which connect Abelian integrals with
kind rational functions on F are known as Abelian theorems.
In terms of divisors, for example, the Abelian theorem
for Abelian integrals of the first kind has the following
of differentials cf>1, ... ,cf>g that constitute a canonical
form: A divisor a on F is the divisor of a meromorphic
basis for the Abelian differentials of the first kind. If
function if and only if there exists a chain L with
the surface F is cut along the cycles alb 1 . . . agbg of a
canonical basis for the homology, a simply-connected
i
aL = a and w = 0 for all Abelian differentials of the
domain F* is obtained. The integrals i- cf>i are single-
first kind on F. There also exist variants of Abelian
theorems for Abelian integrals of the second and third
valued functions of the upper bound z 1 for all paths
kinds [4]. Abelian integrals and, in particular, Abelian
L * CF* with fixed initial point Zo and fixed end point
z I. The multi-valuedness of the integrals Ui = i cf>i along theorems, are the basis of the transcendental construc-
tion of the Jacobi variety of a Riemann surface. The
an arbitrary path L C F joining z 0 with z 1 is now com- question of the inversion of an Abelian integral as a
pletely described by the fact that it differs from the function of its upper bound also leads to the concepts
integrals i- cf>i only by an integral linear combination of an Abelian function; an elliptic function; and theta-
of the A-periods Aij and the B-periods Bij of a basis of functions (cf. Theta-function; Jacobi inversion problem).
the differentials of the first kind. These constitute the Historically, the theory of Abelian integrals followed
period matrix of dimension g X 2g, which satisfies the from the consideration of a surface of genus g = 1. If
bilinear Riemann relations (cf. Abelian differential). one writes the corresponding equation in the form
iL
An integral w where w is an Abelian differential of F(z, w) _ w 2 - fez) = 0,

15
ABELIAN INTEGRAL

where f (z) is a polynomial in z of the third or fourth S. A number of fundamental properties of Abelian
degree, then one obtains elliptic integrals (d. Elliptic varieties carry over to Abelian schemes. For instance,
integral) as the respective Abelian integrals. They first an Abelian S-scheme A is a commutative group S-
appeared at the end of the 17-th and the beginning of scheme [1], and if S is a normal scheme, A is projective
the 18-th century as the result of the rectification of over S, [2].
curves of the second order in the studies of Jacob and Abelian schemes are used in the context of moduli
Johann Bernoulli and of G. Fagnano. L. Euler tackled schemes of Abelian varieties with various auxiliary
the addition theorem of elliptic integrals, which is a structures, and also in the theory of reduction of
special case of a theorem of N.H. Abel (1752). Abel Abelian varieties (d. Neron model).
and e.G.J. Jacobi (1827) stated the problem of inver-
sion of elliptic integrals and obtained the solution. The References
beginnings of the theory of elliptic functions were thus [1] MUMFORD, D.: Geometric invariant theory, Springer, 1965.
[2] RAYNAUD, M.: Faisceaux ampIes sur les schemas en groupes et
established. However, some facts concerning this theory les espaces homogenes, Springer, 1970.
had been established by e.F. Gauss early in the 18-th I. V Dolgachev
century. Abel and Jacobi dealt with the much more dif-
ficult case of inversion of Abelian integrals in the case AMS 1980 Subject Classification: 14KXX
g> 1. During the very first stages of development stress
was laid on hyper-elliptic integrals, where ABELIAN VARIETY - An algebraic group that is a
F(z, w)=w2 - f(z) with f(z) a polynomial of the fifth complete algebraic variety. The completeness condition
or sixth degree without multiple roots. Here g = 2 and implies severe restrictions on an Abelian variety. Thus,
the difficulty of the inversion problem can already be an Abelian variety can be imbedded as a closed sub-
noticed. The principal advances in the theory of inver- variety in a projective space; each rational mapping of
sion of Abelian integrals are due to B. Riemann (1851), a non-singular variety into an Abelian variety is regu-
who introduced the concept of Riemann surfaces and lar; the group law on an Abelian variety is commuta-
formulated and gave proofs of a large number of tive.
important results. The theory of Abelian varieties over the field of com-
Multi-dimensional generalizations of the theory of plex numbers e is, in essence, equivalent to the theory
Abelian integrals form the subject matter of algebraic of Abelian functions founded by e.G.J. Jacobi, N.H.
geometry and the theory of complex manifolds. Abel and B. Riemann. If en denotes n-dimension vec-
tor space, r c en is a lattice (cf. Discrete subgroup) of
References
[I] CHEBOTAREV, N.G.: The theory of algebraic functions, rank 2n, then the quotient group X = en / r is a com-
Moscow-Leningrad, 1948, Chapt. 8;9 (in Russian). plex torus. Meromorphic functions on X are the same
[2] SPRINGER, G.: Introduction to Riemann surfaces, Addison- thing as meromorphic functions on en that are invari-
ant with respect to the period lattice r. If the field K
Wesley, 1957, Chapt.lO.
[3] NEVANLINNA, R.: Unijormisierung, Springer, 1953, Chapt 5.
[4] BLISS, G.A.: Algebraic functions, Amer. Math. Soc., 1933. of meromorphic functions on X has transcendence
[5] STAHL, H.: Theorie der Abelschen Funktionen, Leipzig, 1896. degree n, then X can be given the structure of an alge-
[6] SHAFAREVICH, I.R.: Basic algebraic geometry, Springer, 1977
braic group. This structure is unique by virtue of the
(translated from the Russian).
E.D. Solomentsev compactness of X, and it is such that the field of
rational functions of this structure coincides with K.
Editorial comments. An interesting and useful additional
The algebraic groups formed in this way are Abelian
reference is [A 1].
varieties, and each Abelian variety over the field e
References arises in this way. The matrix which defines a basis of
[A 1] LANG, S.: Introduction to algebraic and abelian functions,
Addison-Wesley, 1972.
r can be reduced to the form (E IZ), where E is the
identity matrix and Z is a matrix of order n X n. The
AMS 1980 Subject Classification: 30F30, 26C15, complex torus X = en / r is an Abelian variety if and
26E99 only if Z is symmetric and has positive-definite ima-
ginary part. It should be pointed out that, as real Lie
ABELIAN SCHEME - A smooth group scheme over groups, all varieties X are isomorphic, but this is not
a base scheme S, the fibres of which are Abelian true for their analytic or algebraic structures, which
varieties (cf. Abelian variety). The following is an vary strongly when deforming the lattice r. Inspection
equivalent definition: An Abelian scheme over S, or an of the period matrix Z shows that its variation has an
Abelian S-scheme, is a proper smooth group S-scheme analytic character, which results in the construction of
all fibres of which are geometrically connected. Intui- the moduli variety of all Abelian varieties of given
tively, an Abelian S-scheme may be understood as a dimension n. The dimension of the moduli variety is
family of Abelian varieties parametrized by the scheme n(n + 1) / 2 (cf. Moduli problem).

16
ABNORMAL SUBGROUP

The theory of Abelian vanetIes over an arbitrary References


field k is due to A. Wei! [1], [2]. It has numerous appli- [1] WElL, A.: Courbes algebriques et varietes abeliennes. Varietes
abeliennes et courbes algebriques, Hermann, 1971.
cations both in algebraic geometry itself and in other [2] WElL, A.: Courbes algebriques et varietes abeliennes. Sur les
fields of mathematics, particularly in number theory courbes algebriques et les varietes qui s'en deduisent, Hermann,
and in the theory of automorphic functions. To each 1948.
[3] WElL, A.: Introduction a /'etude des varietes kahteriennes, Her-
complete algebraic variety, Abelian varieties (cf. mann,1958.
Albanese variety; Picard variety; Intennediate Jacobian) [4] LANG, S.: Abelian varieties, Springer, 1983.
can be functorially assigned. These constructions are [5] MUMFORD, D.: Abelian varieties, Oxford Univ. Press, 1974.
[6] MANIN, Yu.I.: }I-Adie automorphic functions', J. Soviet Math.
powerful tools in studying the geometric structures of 5, no. 3 (1976), 279-333. (Itog; Nauk. i Tekhn. Sovrem Prob-
algebraic varieties. E.g., they were used to obtain one of lemy 3 (1974), 5-93)
the solutions of the Liiroth problem. Another applica- [7] SERRE, J.-P.: Groupes algebrique e{ corps des classes, Hermann,
tion is the proof of the Riemann hypothesis for alge- 1959.
[8] SIEGEL, C.L.: Automorphe Funktionen in mehrerer Variablen,
braic curves over a finite field - the problem for which Math. Inst. Gottingen, 1955.
the abstract theory of Abelian varieties was originally B.B. Venkov
developed. It was also one of the sources of I-adic A.N. Parshin
cohomology. The simplest example of such a cohomol- Editorial comments. For recent information on the Tate
ogy is the Tate module of an Abelian variety. It is the conjectures see [A3]. For the theory of Tate-Honda see
projective limit, as n-'HXl, of the groups X[ln] of points also [A4]. Mumford's theory of uniformization is developed
of order In. The determination of the structure of such in [A 1], [A2].
groups was one of the principal achievements of the References
theory of Wei!. In fact, if m is coprime with the charac- [A 1] MUMFORD, D.: 'An analytic construction of degenerating
curves over complete local rings', Compos. Math. 24 (1972),
teristic p of the field k and if k is algebraically closed, 129-174.
then the group X[ m] is isomorphic to (Z / mZ)2dirnX. If [A2] MUMFORD, D.: 'An analytic construction of degenerating
m =p, the situation is more complicated, which resulted Abelian varieties over complete local rings', Compos. Math.
24 (1972), 239-272.
in the appearance of concepts such as finite group [A3] FALTINGS, G.: 'Endlichkeitssatze fi.ir abelsche Varietaten uber
schemes, formal groups and p-divisible groups (cf. Fin- Zahlkorpern', Invent. Math. 73 (1983), 349-366. (Errratum:
ite group scheme; Fonnal group; p-divisible group). The Invent. Math. 75 (1984), p. 381)
study of the action of endomorphisms of Abelian [A4] TATE, J.: 'Classes d'isoganie des variates aooliennes sur un
corps fini (d' apres T. Honda)', in sem. Bourbaki, Vol. 21,
varieties, in particular of the Frobenius endomorphism 1968-1969.
on its Tate module, makes it possible to give a proof of
AMS 1980 Subject Classification: 14KXX
the Riemann hypothesis (for algebraic curves over finite
fields, ct. Riemann hypotheses) and is also the principal ABNORMAL SUBGROUP - A subgroup A of a
instrument in the theory of complex multiplication of group G such that g E <A, A g > for any element g E G.
Abelian varieties. Another circle of problems connected Here <A, A g > is the subgroup generated by A and its
with the Tate module consists of a study of the action conjugate subgroup A g = gAg -1. As an example of an
of the Galois group of the closure of the ground field abnormal subgroup of a finite group G one can take
on this module. There resulted the Tate conjectures the normalizer (ct. Nonnalizer of a subset) NG(P) of
and the theory of Tate - Honda, which describes any Sylow p-subgroup peG, and even any maximal
Abelian varieties over finite fields in terms of the Tate subgroup NeG which is not normal in G. In the
module [5]. theory of finite solvable groups (cf. Solvable group),
The study of Abelian varieties over local fields, where many important classes of subgroups are abnor-
including p-adic fields, is proceeding at a fast rate. An mal, use is made of the concept of a subabnormal sub-
analogue of the above-mentioned representation of group A of a group G, which is defined by a series of
Abelian varieties as a quotient space en / r, usually subgroups
known as uniformization, over such fields, was con- A =A0 C A Ie .. C An = G,
structed by D. Mumford and M. Raynaud. Unlike the
complex case, not all Abelian varieties, but only those where Ai is abnormal in Ai + 1, i =0, ... ,n-1.
having a reduction to a multiplicative group modulo p, References
are uniformizable [6]. The theory of Abelian varieties [1] HUPPERT, B.: Endliche Gruppen, I, Springer, 1967.

over global (number and function) fields plays an A.!, Kostrikin


important role in Diophantine geometry. Its principal Editorial comments. Nowadays, Ag is mostly defined as
result is the Mordell- Wei! theorem: The group of A9=g-1Ag. Instead of 'solvable' also 'soluble' is frequently
rational points of an Abelian variety, defined over a used.
finite extension of the field of rational numbers, is fin- AMS 1980 Subject Classification: 20D25, 20D30,
itely generated. 20E07

17
ABSCISSA

ABsassA - One of the Cartesian coordinates of a a non-discrete extremally disconnected space does not
point. contain any convergent sequence of pairwise distinct
AMS 1980 Subject Classification: 51 N 10 points, the absolute of any non-discrete space is non-
metrizable (and does not even satisfy the first axiom of
ABsOLUTE - l) The absolute of a regular topological countability).
space X is the space aX which is mapped perfectly and Of the numerous ways in which the absolute aX of a
irreducibly onto X, and is such that any perfect irredu- given (regular) space X can be constructed, the follow-
cible inverse image of the space X is homeomorphic to ing method is one of the simplest.
aX. Each regular space X has a unique absolute. The A family ~= {A} of non-empty canonical /Ca-sets, i.e.
absolute of a space X is always extremally disconnected of canonical closed sets A of the space X, is called a
and completely regular, and is perfectly and irreducibly thread if it is inclusion-directed, i.e. if for each two ele-
mapped onto X by means of a transformation ments A, A' of the family ~ there exists an element A"
'lTx: aX~x. If two spaces X and Y are connected by a contained in A n A '. A thread ~ is called a maximal or
single-valued or multi-valued perfect irreducible map- an end thread if it is not a subfamily of any thread dif-
ping f : X ~ Y, then their absolutes are homeomorphic, ferent from it. It can be shown that threads exist;
and there exists a homeomorphism fa: aX~a Y such moreover, it can be shown that for each non-empty /Ca-
that f='lTyfa'ITx 1. set A the set DA of all threads containing the set A as
If a homeomorphism fa: aX~a Y is given, the map- an element is non-empty. Each thread is contained in
ping f='lTyfa'ITX 1 will be, in the general case, multi- some maximal thread. The intersection of all sets which
valued, irreducible and perfect. Thus, absolutes and are elements of a maximal thread ~ is either empty or
their homeomorphisms 'control' the entire class of per- contains a single point x(~); in the latter case the
fect irreducible mappings of regular spaces. The mean- thread ~ is said to be convergent (to the point x(~)). A
ing of this fundamental property is that absolutes of topology is introduced in the set aX of all ends (i.e.
regular topological spaces are projective objects in the maximal threads), by taking the collection of all sets
category of regular spaces and perfect irreducible map- DA as a basis for the closed sets. The resulting topology
pings. If a regular space X is compact, respectively, is Hausdorff and compact. The convergent ends in the
finally compact or complete in the sense of Cech, the compactum aX form an everywhere-dense subspace.
respective property is also displayed by the absolute of The subspace of the space aX consisting of the conver-
this space. The absolute of a paracompact space is even gent ends is at the same time the absolute aX of X; it
strongly paracompact, and is, moreover, perfectly zero- turns out that aX is identical with the Stone-tech
dimensional. However, the absolute of a normal space compactification paX of ax' If X is not only regular,
need not itself be normal. If X is a completely regular but completely regular, the formula of commutativity of
space, then the Stone - Cech compactification of its the operators a and p is valid:
absolute is the absolute of any compactification of X.
Two spaces are called co-absolute if their absolutes are aX = paX = apX
homeomorphic. v.I. Ponomarev
Thus, the class of regular spaces is subdivided into Editorial comments. The definition of the absolute of a
disjoint (pairwise not intersecting) classes of co- regular topological space X given above is slightly impre-
absolute spaces. A space X is co-absolute with some cise. A better (more precise) definition is: The absolute of a
metric space if and only if it is a paracompact feath- regular topological space X is a pair (aX, 7Tx), where TTX is a
ered space containing a dense a-discrete system of open perfect irreducible mapping of aX onto X, such that for
sets. A compact space is co-absolute with some metriz- every regular topological space Y and every perfect irredu-
able compactum if and only if it has a countable 'IT- cible mapping f of Yonto X there is a mapping g of aX onto
weight. If a compact space has a countable 'IT-weight Y such that 7TX= fg.
In Western literature finally compact spaces are also
and has no isolated points, then and only then it will
called Lindelof spaces and canonical closed set are also
be co-absolute with the perfect Cantor set. Conse-
called regular closed sets (or sometimes regularly closed
quently, all metrizable compacta without isolated points sets). The maximal or end threads are commonly called reg-
are co-absolute with the perfect Cantor set. The abso- ular closed ultra filters.
lute of a countable metrizable compactum is an exten- The simple construction of ax given above can be
sion of the Stone - Cech compactification of the natural rephrased as follows: The family of regular closed sets
numbers. The absolute of an extremally disconnected forms, in a natural way, a complete Boolean algebra. The
space is homeomorphic to it. Thus, the class of abso- space aX is then simply the Stone space of this Boolean
lutes (whatever this may be) of regular spaces coincides algebra (the set of all ultrafilters (cf. Ultrafilter) on it, topolo-
with the class of extremally disconnected spaces. Since gized using the sets OA as a base for the closed sets).

lk
ABSOLUTE CONTINUITY

References l0 such that the integral I if d/L I < for any set e
[A1J AlurnANGEL'SKIT, A.V. and PONOMAREV, V.I.: Fundamentals
of general topology. Problems and exercises, Reidel, 1984. with !L(e)<l>. In the general case the integral with
The O-absolute of a O-proximity space (X, l is the pair respect to a finitely-additive set function with scalar or
(X 8, '1Tx) consisting of a proximity space X8 and a pro- vectorial for /L is an absolutely continuous function.
jection '1Tx: X8~X which is a regular O-mapping. Here A.P. Terekhin
O-mapping is a term denoting any O-perfect, irreducible, VF. Emelyanov
O-proximity-continuous mapping. Any O-proximity 2) Absolute continuity of a measure as a concept in
space has a unique O-absolute. Any regular O-mapping the theory of measures. A measure p is absolutely con-
on a O-absolute is a proximity equivalence. The 0- tinuous with respect to a measure /L if p is an absolutely
absolute of a space (X, l is the maximal inverse image continuous set function with respect to /L. Thus, let p be
of the space (X, l under regular O-mappings. For each a finite measure, given together with /L on some fixed
regular O-mapping f: (X, l~(Y, l>') there exists a a-algebra G; p will then be absolutely continuous with
proximity equivalence F: X8~ Y 8' such that the respect to /L if it follows from !L(A) = 0, A E G, that
diagram F II{A ) = O. A generalized finite measure (d. Charge) p is
Xs ~ Ys' absolutely continuous with respect to a generalized
'1Tx~ ~'1Ty measure /L if II{A) = 0, provided that I /L I(A) = 0, where
X ~ Y I /L I is the total variation of /L. A.P. Terekhin
f
is commutative. 3) Absolute continuity of a function is a stronger
For maximal O-proximities on regular topological notion than continuity. A function f defined on a seg-
spaces the concept of a regular O-mapping is identical ment [a, b] is said to be absolutely continuous if for any
with that of a perfect irreducible mapping, while the >0 there exists a l0 such that for any finite system
concept of a O-absolute is identical with that of the of pairwise non-intersecting intervals (ak, bk)C(a, b),
absolute of a regular topological space. k = 1, ... , n, for which
n
References ~ (bk -ak) < 8,
[1) PONOMAREV, V.I.: 'On spaces co-abolute with metric spaces', k=i
Russian Math. Surveys 21, no. 4 (1966),87-114. (Uspekhi Mat.
Nauk 21, no. 4 (1966), 101-132)
the inequality
[2) GLEASON, A.M.: 'Projective topological spaces', Illinois J.
Math. 2, no. 4A (1958), 482-489. ~ Ij(bd-j(ad I<
k=i
[3) PONOMAREV, V.I.: 'Paracompacta, their projection spectra and
holds. Any absolutely continuous function on a seg-
continuous images', Mat. Sb. 60 (102), no. 1 (1963), 89-119 (in
Russian).
ment is continuous on this segment. The opposite
[4) FEDORCHUK, V.V.: 'Perfect irreducible mappings and general-
implication is not
ized proximities', Math. USSR-Sb. 5, no. 4 (1968), 498-508. true: e.g. the function
(Mat. Sb. 74 (118), no. 4 (1968),513-536) V V r:' d f(x)=x sin(l / x) if O<x~ 1 and f(O)=O is continu-
h k
. . re orc u
ous on the segment [0, 1], but is not absolutely continu-
2) The absolute in projective geometry is the curve
ous on it. If, in the definition of an absolutely continu-
(surface) of the second order constituting the set of
ous function, the requirement that the pairwise intersec-
infinitely-distant points in the Klein interpretation of a
tions of intervals (ab bk ) are empty be discarded, then
hyperbolic plane (space). The absolute can be used to
the function will satisfy an even stronger condition: A
introduce a metric on a projective plane (space) (d.
Lipschitz condition with some constant.
Projective detennination of a metric). For instance, the
If two functions f and g are absolutely continuous,
projective measure of a segment AB is defined as a
then their sum, difference and product are also abso-
quantity which is proportional to the natural logarithm
lutely continuous and, if g does not vanish, so is their
of the double ratio (ABCD) of four points, where C and
quotient f / g. The superposition of two absolutely con-
D are the points of intersection of the straight line AB
tinuous functions need not be absolutely continuous.
with the absolute.
A.B. Ivanov However, if the function f is absolutely continuous on a
AMS 1980 Subject Classification: 54C99, 51 N15 segment [a, b] and if A~f(x)~B, xE[a, b], while the
function F satisfies a Lipschitz condition on the seg-
ABSOLUTE CONTINUITY - 1) Absolute continuity of ment [A, B], then the composite function F[f(x)] is
an integral as a property of the (Lebesgue) integral. absolutely continuous on [a, b]. If a function f, which
Let a function f be /L-integrable on a set E. The integral is absolutely continuous on [a, b], is monotone increas-
of f over /L-measurable subsets e CE is an absolutely ing, while F is absolutely continuous on [f (a),f (b )],
continuous set function (see Subsection 3 below) with then the function F[f (x)] is also absolutely continuous
respect to the measure /L if for any >0 there exists a on [a, b].

19
ABSOLUTE CONTINUITY

An absolutely continuous function maps a set of the relations I) " p.; 2) ,,+ p., ,,- p.; 3)
measure zero into a set of measure zero, and a measur- I " IEO; I p. I are equivalent. If the measure " is finite,
able set into a measurable set. Any continuous function "p. if and only if for any t:>0 there exists a 8>0
of finite variation which maps each set of measure zero such that I p. I <8 entails I" I <t:o According to the
into a set of measure zero is absolutely continuous. Radon-NikocJYm theorem, if p.," are (completely) 0'-
Any absolutely continuous function can be represented finite, (i.e. XES and there exists a sequence {En},
as the difference of two absolutely continuous non- n=I,2, ... , such that
decreasing functions.
A function f that is absolutely continuous on the seg- n91En = X, I p.(En) I, I P(En) I < 00 ]

ment [a, b] has a finite variation on this segment and


has a finite derivative / (x) at almost every point. The and if "p., then there exists on X a finite measurable
derivative / (x) is summable over this segment, and function f such that
x P(E) = ildp" EES.
I(x) = I(a) + jl(t)dt.
a
Conversely, if p. is (completely) a-finite and the
If the derivative of an absolutely continuous function is
almost everywhere equal to zero, then the function
I
integral dp. makes sense, then lei
dp. as a function of
the set E is absolutely continuous with respect to p.. If p.
itself is constant. On the other hand, for any function cp
that is summable on [a, b] the function f
cp(t)dt is
and " are (completely) a-finite measures on (X, S);
there exist uniquely' defined (completely) a-finite meas-
absolutely continuous on this segment. Accordingly, the ures "1 and "2 such that "="1 +"2, "1 p. and "2 is
class of functions that are absolutely continuous on a singular with respect to p. (i.e. there exists a set A E S
given segment coincides with the class of functions that such that 1"2 I(A)=O, I p.1 (X\A)=O) (Lebesgue's
can be represented as an indefinite Lebesgue integral, theorem). A measure, defined on the Borel sets of a
i.e. as a Lebesgue integral with a variable upper limit of finite-dimensional Euclidean space (or, more generally,
a certain summable function plus a constant. of a locally compact group), is called absolutely con-
If f is absolutely continuous on [a, b], then its total tinuous if it is absolutely continuous with respect to the
variation is b Lebesgue (Haar) measure. A non-negative measure p.
VV(x) = j II (x) I dx. on the Borel sets of the real line is absolutely continu-
a
ous if and only if the corresponding distribution func-
The concept of absolute continuity can be general- tion F(x)=p.{( - 00, x)} is absolutely continuous (as a
ized to include both functions of several variables and function of a real variable). The concept of absolute
set functions (see Subsection 4 below). continuity of a set function can also be defined for
finitely-additive functions and for functions with vector
References values.
[I) KOI.MOGOROV, A.N. and FOMIN, S.V.: Elements of the theory
offunctions and functional analysis, Graylock, 19571961 References
(translated from the Russian). [I] HALMos, P.R.: Measure theory, v. Nostrand, 1950.
[2) SMIRNOV, V.I.: A course of higher mathematics, 5, Addison- [2) NEVEU, J.: Bases mathematiques du calcul des probabilites, Mas-
Wesley, 1964 (translated from the Russian).
~I~ V.V.S
. . azonov
L.D. Kudryavtsev
4) Absolute continuity of a set function is a concept Editorial comments.
usually applied to countably-additive functions defined References
on a a-ring S of subsets of a set X. Thus, if p. and " are [A 1] ROYDEN, H.L.: Real analysis, Macmillan, 1968.
[A2] ZAANEN, A.C.: Integration, North-Holland, 1967.
two countably-additive functions defined on Shaving [A3] RUDIN, W.: Principles of mathematical analysis, McGraw-Hili,
values in the extended real number line [ - 00, 00], then 1953.
" is absolutely continuous with respect to p. (in symbols [A4] RUDIN, W.: Real and complex analysis, McGraw-Hili, 1966.
this is written as "p.) if I p.1 (E)=O entails P(E)=O. [AS] TAYLOR, A.E.: General theory of functions and integration,
Blaisdell, 1965.
Here I p. I is the total variation of p.: [A6] ALIPRANTZ, C.D. and BURLEINSHAW, 0.: Principles of real
analysis, North-Holland, 1981.
1p,1 =p,++p,-,
p,+ = sup{p.(F): E-:JFES}, AMS 1980 Subject Classification: 26A 15, 28A 10

p,- = -inf{p.(F): E~FES},


ABSOLUTE ERROR - See Error.
p.+ and p.- are measures, known as the positive and AMS 1980 Subject Classification: 65GXX
negative variations of p.; according to the
Jordan - Hahn theorem, p. =p. + - p. -. It turns out that ABSOLUTE GEOMETRY - A geometry based on the

20
ABSOLUTE SUMMABILITY

axioms of Euclidean geometry, except for the axiom on closed subset. A space satisfying the property in the article
parallel straight lines (postulate V). The assumptions above is then called an absolute extensor CAE) for normal
underlying absolute geometry are common to Euclidean spaces. One then proves that a space is an CAR) if and only
geometry and to Lobachevskii geometry. The term if it is an CAE). Absolute retracts and extensors can be
'absolute geometry' was introduced by J. Bolyai in defined for any class of spaces (i.e. not just for normal
spaces).
1832.
A.B. Ivanov A binormal space is a space X for which the product X x I
AMS 1980 Subject Classification: 51-01 is normal. It can be proved that a space X is binormal if and
only if X is normal and countably paracompact Ccf.
ABSOLUTE MOMENT of a random variable X - The Paracompactness criteria).
mathematical expectation of I X I r, r >0. It is usually References
[A1] Hu, S.T.: Theory of retracts, Wayne State Univ. Press,
denoted by fin so that Detroit, 1965.
f3, = EI X I' AMS 1980 Subject Classification: 54C55
The number r is called the order of the absolute
moment. If F(x) is the distribution function of X, then ABSOLUTE SUMMABILITY - A special type of
summability of series and sequences, which differs from
f3, = f
-xc
I x I' dF(x), (I) ordinary summability in that additional restrictions are
imposed. For matrix summation methods the require-
and, for example, if the distribution of X has density ment is that the series and sequences obtained as a
p(x), one has + CI.) result of the transformation corresponding to the given
f3, = f
-co
Ix I'p(x)dx. (2) summation method must be absolutely convergent (d.
Matrix summation method). Let a summation method A
In relation to the equations (I) and (2) one also be defined as a transformation of sequences {sn} into
speaks, respectively, of the absolute moments of the sequences {(In} by means of the matrix II ank II :
distribution function F(x) and the density p(x). The 00

existence of fir implies the existence of the absolute an = ~ankSb n=O, 1, ....
k=O
moment fir and also of the moments (d. Moment) of
order /. for 0</ ~r. Absolute moments often appear The sequence {sn} is then absolutely summable by the
in estimates of probability distributions and their method A (I A I-summable) to the limit s if it is A-
characteristic functions (d. Chebyshev inequality in pro- summable to this limit, i.e. if
bability theory; Lyapunov theorem). The function logfir lim an = s,
is a convex function of r, and the function fi~ / r is a n-->oo

non-decreasing function of r, r>O. and if the sequence {an} is of bounded variation:


Yu. V Prokhorov
AMS 1980 Subject Classification: 60A05, 60E05, (1)
n=l
60E10
If the Sn are the partial sums of a series
ABSOLUTE REfRACf FOR NORMAL SPACES - A
(2)
topological space X such that every mapping g: A ~ X
of any closed subset A of an arbitrary normal space Y then the series (2) is absolutely summable by the method
can be extended to the entire space Y. A direct product A to the sum s. Condition (1) is an additional require-
of absolute retracts for normal spaces is an absolute ment which makes absolute summability different from
retract for normal spaces, as is any retract (cf. Retract ordinary summability. Absolute summability is defined
of a topological space) of an absolute retract for normal in a similar manner for methods involving matrix
spaces. In particular, the following spaces are absolute transformations of series into sequences. If the summa-
retracts for normal spaces: the unit interval I; the n- tion method is defined by a transformation of the series
dimensional cube In; and the Hilbert cube IW. Any two (2) into a series 00

mappings of a binormal space into an absolute retract ~ an (3)


for normal spaces are homotopic; while a binormal n=O

absolute retract for normal spaces is contractible into a by means of a matrix II bnk II:
point. 00
M.l. Vottsekhovskit an = ~bnkUk'
k=O
Editorial comments. Usually an absolute retract CAR) for
normal spaces is defined to be a normal space which is a then the supplementary restriction is absolute conver-
retract of every normal space in which it is imbedded as a gence of the series (3). In the particular case in which

21
ABSOLUTE SUMMABILITY

the method A corresponds to the identity transforma- problems involved in the multiplication of summable
tion of a sequence into a sequence (or of a series into a series, absolute summability was defined and studied
series), absolute summability of a series is the same as by the method of Cesaro IC I, Ik I)-summability).
absolute convergence. The general definition of absolute summability is more
The supplementary requirements are suitably modi- recent, and has found extensive application in studies
fied for non-matrix summation methods. Thus, in the on the summation of Fourier series.
case of the Abel summation method such a requirement References
is that the function [l] HARDy, G.H.: Divergent series, Clarendon, 1949.
00
[2] KOGBETLIANTZ, E.: Sommation des series et integrales diver-
I(x) = ~UkXk gentes par les moyennes arithmetiques et typiques, Gauthier-
k=O Villars, 1931.
[3] KNopp, K. and LORENTZ, G.G.: 'Beitriige zur absoluten Limi-
is of bounded variation on the semi-interval O:S;;;x < 1. tierung', Arch. Math. (Basel) 2 (1949-1950), 10-16.
In the case of integral summation methods, absolute [4] KANGRO, G.F.: 'Summability theory of series and sequences',
summability is distinguished by the requirement of ftogi Nauk. i Tekhn. Mat. Anal. 12 (1974),5-70 (in Russian).
absolute convergence of the correspondiong integrals. 1.1. Volkov
Thus, for the Borel summation method the integral Editorial comments.
00 00 U Xk References
[e-xk~o-h-dx [A1] ZELLER, W. and BEEKMANN, W.: Theorie der Limitierungsver-
tahren, Springer, 1970.
must be absolutely convergent.
AMS 1980 Subject Classification: 40F05
A summation method is said to preserve absolute con-
vergence of a series if it absolutely sums each absolutely
ABSOLUTE TOPOLOGICAL PROPERTY - An
convergent series. If each such series is summable by
obsolete term denoting the properties of a given set as
this method to a sum equal to the sum to which it con-
a topological space in contrast to properties related to
verges, then the method is called absolutely regular. For
an imbedding of it in other spaces.
instance, the Cesaro summation methods (C, k) are
P.s. Aleksandrov
absolutely regular for k ~O. The Abel method is abso-
lutely regular. The following are necessary and suffi- AMS 1980 Subject Classification: 54A05
cient conditions for the absolute regularity of a summa-
tion method defined by a transformation of a series ABSOLUTE VALUE, modulus, of a real number a -
into a series by means of a matrix I bnk I : The non-negative number, denoted by Ia I, which is
00 00 defined as follows: If a~O, lal=a; if a<O,
~lbnkl';;;M, ~bnk=l,k=O,I, ... I a I = -a. The absolute value (modulus) of a complex
n=O n=O
number z =x +ry, where x and yare real numbers, is
(the Knopp- Lorentz theorem). Summation methods by
the number + Vx 2 +y2 . Absolute values obey the fol-
means of other types of transformations are subject to
lowing relations
analogous restrictions.
A generalization of absolute summability is absolute Ia I = I -a I, Ia 12 = Ia 2 = a 2 (a real),
1

summability of degree p where p ~ 1. Here the condition lal-Ibl"';; la+bl",;; lal+lbl,


~ p-ll IJn-IJn-1 IP < lal-Ibl"';; la-bl"';; lal+lbl.
00

...:.,n 00.
n:=:: 1

is the supplementary restriction by which absolute sum- labl = lal'lbl, andifb*o* = 1*1.
mability of degree p is distinguished from ordinary
summability in the case of, say, a summation method A generalization of the concept of the absolute value
defined by a transformation of a sequence {sn} into a to the case of arbitrary fields exists, cf. Norm on a field.
sequence {on}. AMS 1980 Subject Classification: 26A03
The concept of absolute summability was introduced
by E. Borel for one of his methods, but was stated dif- ABSOLUTELY CONVERGENT IMPROPER
ferently from its modem formulation: absolute summa- INTEGRAL - An improper integral such that the
bility was distinguished by imposing the condition integral of the absolute value of the integrand con-
verges. If an improper integral is absolutely convergent,
eLf
oe - x
I
k~O
00 Uk'
/!xk I dx < 00 it is also convergent. To take a concrete example, let an
improper integral be given by
for each p =0, 1, .... Absolute summability was 101-
h ~
tially used in the study of the summability of power fl(x)dx = Iimfl(x)dx, -oo<a<b",;;+oo, (*)
series outside the disc of convergence. In view of the (J l}-h {1

22
ABSOLUTEL Y CONVERGENT SERIES

where the function f (x) is Riemann- (or Lebesgue-) The absolute convergence of the integral then implies
integrable on all intervals [a, 1]], a';;;;1]<b. its convergence, but the converse proposition is not
A necessary and sufficient condition for the absolute true.
convergence of the integral (*) (Cauchy's criterion for References
the absolute convergence of an improper integral) is [I] IL'IN, V.A. and POZNYAK, E.G.: Foundations of mathematical
that for any >0 there must exist an 1]0 a ';;;;1]( <b, such analysis, 1, Moscow, 1971 (in Russian).
[2] KUDRYAVTSEV, L.D.: Mathematical analysis, 1, Moscow, 1973
that for any 1]' and 1]", 1]( ';;;;1]' <b, 1]( ';;;;1]" <b, the ine-
(in Russian).
quality [3] NIKOL'sKIi, S.M.: A course of mathematical analysis, 1, Mos-
cow, 1973 (in Russian).
L.D. Kudryavtsev

is true. Editorial comments.


If the improper integral is absolutely convergent, References
then it is equal to the Lebesgue integral of the (A1] ApOSTOL, T.M.: Mathematical analysis, Addison-Wesley,
1969.
integrand. There exist improper integrals which are
convergent but not absolutely convergent, for example AMS 1980 Subject Classification: 40A 10, 26A42
+00

j SlllX dx. ABSOLUTELY CONVERGENT SERIES - A series


o x

In order to find out whether or not a given integral is n~1

absolutely convergent it is expedient to use the tests for with (in general) complex terms for which the series
convergence of improper integrals of non-negative func- 00

tions; for example, the absolute convergence of ~ IUn I (2)

f
n=i

sin\lnx dx converges.
i x For a series (1) to converge absolutely, a necessary
is established with the aid of the comparison criterion and sufficient condition (Cauchy's criterion for the
of convergence. absolute convergence of a series) is that for any >0
For most of the available definitions of multiple there exist a number n( such that for all numbers
improper integrals there is a different relation between n >n( and for all integers p;;;'O the inequality
convergence and absolute convergence. Let a function
nfluk I< (
f (x) be defined on an open set G in the n-dimensional k=n
Euclidean space. If, for any sequence of cube-filled is true. If a series is absolutely convergent, it is also
domains Gk , k = 1, 2, ... , which monotonically convergent. The series
exhaust G (i.e. Gk C Gk C Gk + I, and
00 ~4,
U Gk = G), n=ln
k=i
where i = v=T, is absolutely convergent; the series
the limit of the Riemann integrals 00 i=JL
~
jf(x)dx, n=1 n
G,
is convergent, but not absolutely convergent. Let
as k~oo, exists and if this limit is independent of the
choice of the above sequence of domains, then it is usu- (3)
ally called the improper integral
be a series consisting of the same terms as the series (1)
jf(x)dx. but taken, in general, in a different order. Since the
G
series (1) is absolutely convergent, so is the series (3),
The integral thus defined is convergent if and only if it and the sum of the series (3) is equal to the sum of the
is absolutely convergent. There are also other defini- series (I). If the series
tions of multiple improper integrals. For example, for a 00 00

function f (x) which is defined on the whole space En ~un and ~vn
n= I n= I
and which is Riemann-integrable on any n-dimensional
sphere Q, of radius r < + 00 with centre at the origin, are both absolutely convergent, so is anyone of their
one can define the improper integral over En by the linear combinations
equation:
jf(x)dx = r~ If(X)dx.

23
ABSOLUTELY CONVERGENT SERIES

The series of all possible pairwise products Urn Vn of the ABSOLUTELY INTEGRABLE FUNCflON - A func-
terms of these series, arranged in an arbitrary order, is tion such that its absolute value is integrable. If a
also absolutely convergent, and its sum is equal to the function is Riemann-integrable on the segment [a, b],
product of the sums of the original series. These pro- a <b, its absolute value is also Riemann-integrable on
perties of absolutely convergent series are also this interval, and

[I I
displayed by multiple series:
~ Un!> ... ,nk' (4) I[f(X)dX I,,;; f(x) dx.
(nJ, ... n,)

If a multiple series is absolutely convergent, it is con- A similar assertion is also valid for a function of n vari-
vergent, for example, both in the sense of spherical and ables which is Riemann-integrable on a cube-filled
of rectangular partial sums, and its sums will be the domain in the n-dimensional Euclidean space. The con-
same in both cases. If the multiple series (4) is abso- verse theorem for Riemann-integrable functions is not
lutely convergent, the iterated series true: The function f which is equal to I for rational
values of x and is equal to - I for irrational values of x
(5) is not Riemann-integrable, but its absolute value is
nt=l nl=l
integrable. The situation is different for Lebesgue-
is absolutely convergent, i.e. all series obtained by suc- integrable functions: A Lebesgue-measurable function f
cessive summation of terms of the series (4) by the is Lebesgue-integrable (Lebesgue-summable) on a
indices n 1, . . . ,nk are absolutely convergent; more- measurable set E in the n-dimensional space if and only
over, the sums of the multiple series (4) and the iterated if its absolute value is Lebesgue-integrable on this set.
series (5) are identical with the sum of any simple series The following inequality is valid:
formed by all terms of the series (4).
If the terms of the series (1) are elements of some I!(x)dx I ,,;; I f(x) I dx.
Banach space with norm II . II, the series (1) is said to
be absolutely convergent if the series In the case of improper one-dimensional Riemann or
Lebesgue integrals on a half-open interval [a, b),
a <b,,;;; + 00 (provided that the function f is, respec-
n=i
tively, Riemann-integrable or Lebesgue-integrable on
is absolutely convergent. The properties of absolutely any segment [a,1)], a<1)<b), the existence of the
convergent series of numbers discussed above can also improper integral of the absolute value of the function,
be generalized to include the case of absolutely b
convergent series of elements of a Banach space. In j I f(x) I dx,
particular, (the partial sums of) an absolutely a

convergent series in a Banach space converge(s) in that implies the existence of the integral
space. The concept of an absolutely convergent series is b

applied in a similar manner to multiple series in a jf(x)dx,


a
Banach space.
but the converse statement is not true (d. Absolutely
References
[I] IL'IN, V.A. and POZNYAK, E.G.: Foundations of mathematical
convergent improper integral). In this connection it
analySiS, I, Moscow, 1971 (in Russian). should be noted that, if the improper integral
[2] KUDRYAVTSEV, L.D.: Mathematical analySiS, I, Moscow, 1973
b ~
(in Russian).
[3] NIKOL'SKlI, S.M.: A course of mathematical analySiS, I, Mos- j I f(x) I dx = lim j I f(x) I dx
a 'r1-b a
cow, 1973 (in Russian).
L.D. Kudryavtsev exists, then f is Lebesgue-integrable on [a, b), and its
Editorial comments. A useful Western reference is [A 1]. improper integral is equal to the Lebesgue integral.
References In the case of functions of several (more than one)
[A1] ApOSTOL, T.M.: Mathematical analysis, Addison-Wesley, variables, improper integrals are usually so defined that
1969. the existence of the improper integral of the absolute
AMS 1980 Subject Classification: 40A05 value of a function is equivalent to the existence of the
improper integral of the function itself.
ABSOLUTELY-FLAT RING - A ring over which any
Let the values of a function f be in some Banach
(right or left) module is a flat module. This class is
space with norm II II. The function f is then called
identical with the class of von Neumann regular rings absolutely integrable on a measurable set E if the
(d. Regular ring (in the sense of von Neumann. integral
AMS 1980 Subject Classification: 13C11, 16A30 jllf(x)lldx
E

24
ABSORPTION LAWS

exists; also, if f is integrable on E, the relationship An example of a Markov chain with absorbing state 0
is a branching process.
II [f(x)dX II.;;; [II f(x) II dx The introduction of additional absorbing states is a
convenient technique that enables one to examine the
is true. properties of trajectories of a Markov chain that are
References associated with hitting some set.
[1] IL'IN, V.A. and POZNYAK, E.G.: Foundations of mathematical Example. Consider the set S of states of a homo-
analysis, 1, Moscow, 1971 (in Russian).
[2] KUDRYAVTSEV, L.D.: Mathematical analysis, I, Moscow, 1973
geneous Markov chain ./) with discrete time and tran-
(in Russian). sition probabilities
[3] NIKOL'sKII, S.M.: A course of mathematical analysis, 1, Mos-
cow, 1973 (in Russian). Pi} = P{~t+1)=jl~t)=i},
[4] ScHWARTZ, L.: Cours d'analyse, I, Hennann, 1967. in which a subset H is distinguished and suppose one
L.D. KudryavIsev has to find the probabilities
Editorial comments. qih = P{7(H=h IO)=i}, iES, hEH,
References
[A1) ROYDEN, H.L.: Real analysis, Macmillan, 1968.
where T(H) = min{1 >0: T(t)EH} is the moment of first
hitting the set H. If one introduces the auxiliary Mar-
r
[A2) ZAANEN, A.C: Integration, North-Holland, 1967.
[A3) RUDIN, W.: Principles of mathematical analysis, McGraw-Hili, kov chain (I) differing from ./) only in that all states
1953.
[A4) RUDIN, W.: Real and complex analysis, McGraw-HIli, 1966.
r
h EH are absorbing in (I), then for h EH the proba-
[A5) TAYLOR, A.E.: General theory of functions and integration, bilities
Blaisdell, 1965.
[A6) ALIPRANTZ, C.D. and BURLEINSHAW, 0.: Principles of real = P{'T(H)';;;t, ~'T(H=h I~O)=i}
analysis, North-Holland, 1981.
are monotonically non-decreasing for It 00 and
AMS 1980 Subject Classification: 26A39, 26A42
qih = limp~h(t). iES, hEH. (*)
1-->00

ABSOLUTELY-UNBIASED SEQUENCE A By virtue of the basic definition of a Markov chain


sequence of random variables X\, ... , Xn for which
p~h(t+1) = llijP~h(t), t;;'O, iES\H, hEH,
the conditions JES

E(Xd = 0 and E(Xn+! I Xl> ... ,Xn) = 0 Phh(t) = 1, h EH; P~h(t) = 0, i, h EH, i=h.
are fulfilled, for n = 1, 2, .... The partial sums The passage to the limit for I~oo taking into acount
Sn =X\ + ... + Xn of an absolutely-unbiased sequence (*) gives a system of linear equations for qih:
form a martingale. These two types of sequences are qih = llUqih, iES\H, hEH,
interconnected as follows: The sequence {Yn } forms a jES

martingale if and only if it is of the form qhh = 1, hEH; qih = 0, i, hEH, i=h.
Yn =X\ + ... + Xn +c (n = 1, 2, ... , and c=E(Y\) is
a constant), where {Xn} is an absolutely-unbiased References
[1) W.FELLER: An introduction to probability theory and its applica-
sequence. Thus, all martingales are related to partial tions, 1, Wiley, 1968.
sums of certain absolutely-unbiased sequences. Simple A.M. Zubkov
examples of absolutely-unbiased sequences are AMS 1980 Subject Classification: 60J10
sequences of independent random variables with
mathematical expectation zero. Besides the term ABSORPTION LAWS - Identities of the form
'unbiased' the term 'fair' - with the related concept of xA(xVy) = x, xV(xAy) = x,
a 'fair play', is also employed.
A. V. Prokhorov where 1\ and V are two-place operations on some set
Editorial comments. In [A 1] the term 'absolutely fair' is L. If these operations satisfy also the laws of commuta-
used instead of absolutely-unbiased. tivity and associativity, then the relation x ~y defined
References by the equivalence
[A 1] FELLER, W.: An introduction to probability theory and its x.;;;y ~-uVy=y (*)
applications, 2, Wiley, 1966, p. 210.
(or equivalently, by the equivalence x.;;;y~xl\y=x) is
AMS 1980 Subject Classification: 60G42
an order relation for which x 1\Y is the infimum of the
elements x and y, while x Vy is the supremum. On the
ABSORBING STATE of a Markov chain ./) - A state other hand, if the ordered set (L,';;;;) contains an
i such that
infimum x 1\y and a supremum x Vy for any pair of
P{t)=i I~s)=i} = 1 for any t;;'s. elements x and y, then for the operations V and 1\ the

25
ABSORPTION LAWS

laws of absorption, commutativity and associativity, as theory (Weil's 'general points' ('generic points')
well as the equivalence (*) apply. language) to be generally accepted as a base of abstract
References algebraic geometry for a long time thereafter.
[11 RAsIOWA, E. and SIKORSKI, R.: The mathematics of In the early nineteen-fifties powerful methods of
metamathematics, Polska Akad. Nauk, 1963. VN G . h' commutative algebra [6], [8] were brought to bear on
. TIS In
abstract algebraic geometry. The subject was further
Editorial comments. Instead of absorption laws one also modified by J.-P. Serre in his study [7] on coherent
uses the term absorptive laws, cf. [A1], Chapt. 2, Sect. 4.
algebraic sheaves (cf. Coherent algebraic sheaf). The
References ideas and methods of homological algebra were here
[A 1] CoHN, P.M.: Universal algebra, Reidel, 1981.
introduced into algebraic geometry for the first time.
AMS 1980 Subject Classification: 04A05, 08AXX The development of abstract algebraic geometry
paralleled that of the concept of an algebraic variety.
ABSTRACf ALGEBRAIC GEOMETRY - The branch Following the definition of an abstract algebraic variety
of algebraic geometry dealing with the general proper- given by Weil, various generalizations of this concept
ties of algebraic varieties (cf. Algebraic variety) over were proposed, and the concept of a scheme proved to
arbitrary fields and with schemes (cf. Scheme), which be the most useful. A systematic exposition of these
are their generalizations. The first studies in abstract ideas and the development of the theory of schemes
algebraic geometry appeared as early as the nineteenth was initiated in 1960 by A. Grothendieck in a series of
century, but the main development of the subject dates memoirs [5], in which the language of functors and of
back to the nineteen-fifties, with the creation of the the theory of categories was introduced into abstract
general theory of schemes by A. Grothendieck. Interest algebraic geometry, and many classical constructions in
in algebraic geometry over arbitrary fields arose in the the field were radically transformed.
context of number-theoretical problems and, in particu- The rapid development of abstract algebraic
lar, in the theory of equations with two unknowns. geometry was due to the recognition of the fact that it
Very important in the development of abstract alge- is possible, in the framework of the theory of schemes,
braic geometry was the introduction of the concept of to apply to the 'abstract case' practically all of the
the zeta-function of an algebraic curve by E. Artin in known concepts of the classical complex case and, in
1924 (cf. Zeta-function in algebraic geometry), and the particular, the homology theory of complex-analytic
proof of the analogue of the Riemann hypothesis for manifolds. An important role in the development of
elliptic curves by H. Hasse in 1933. The theory of alge- abstract algebraic geometry was played by the Weil
braic curves over an arbitrary field of constants, which conjecture (1947), which postulated the existence of a
was developed at the same time, was of fundamental cohomology theory in which the Lefschetz formula for
importance in this proof. the number of fixed points of a mapping would be
The general development of the theory of rings and valid, and which established an intimate connection of
fields in the first two decades of the twentieth century this hypothesis with purely arithmetical problems of
prepared the ground for a systematic development of algebraic varieties (cf. Zeta-function in algebraic
higher-dimensional algebraic geometry over arbitrary geometry).
fields. In his series of articles (1933 - 1938), B.L. van The concept of a topologized category (a Grothen-
der Waerden based abstract algebraic geometry on the dieck topology) found numerous applications, the
theory of polynomial ideals. In particular, he developed development of which laid the foundations of new
the intersection theory on a non-singular projective branches of abstract algebraic geometry: the theory of
algebraic variety. The results of the studies conducted representable functors (cf. Representable functor), for-
in this field are summarized in [4]. mal geometry, Weil cohomology; K-theory, and the
It was noted in 1940 by A. Weil that a proof of the theory of group schemes (cf. Group scheme). The ideas
Riemann hypothesis for algebraic curves of arbitrary and methods developed in this way had their influence
genus involves the theory of higher-dimensional in many branches of mathematics (commutative alge-
varieties over arbitrary fields. He accordingly developed bra, category theory, the theory of analytic spaces and
the theory of abstract algebraic varieties (not neces- topology).
sarily projective) over an arbitrary ground field, the The recent generalization of the concept of an alge-
theory of divisors, the intersection theory for such braic variety to that of an algebraic space, which was
varieties, and the general theory of Abelian varieties, proposed in the late nineteen-sixties, made it possible
which had been previously studied only from the ana- to extend the framework of abstract algebraic
lytic point of view. The appearance in 1946 of the book geometry, and to connect it even more closely with
by Weil [9] made the theory of valuations and field other branches of algebraic geometry.

26
ABSTRACT ANALYTIC FUNCTION

References tion f (z) of a complex variable z with values in a


[I] GROTHENDIECK, A.: 'The cohomology theory of abstract alge- Banach space or even in a locally convex linear topo-
braic varieties', in Proc. Internat. Math. Congress Edinburgh,
1958, Cambridge Univ. Press, 1960, pp. 103-118.
logical space Y. In such a case any weakly analytic
[2] DOLGACHEV, LV.: 'Abstract algebraic geometry', J. Soviet function f (z) in a domain D of the complex plane C is
Math. 2, no. 3 (1974), 264-303. (Itogi Nauk. i Tekhn. Algebra an abstract analytic function. One can also say that a
Topol. Geom. 10 (1972),47-112)
function f (z) is an abstract analytic function in a
[3] SERRE, J.-P.: Matematika 7, no. 5 (1963), 3-93.
[4] GROBNER, W.: Moderne algebraische Geometrie; die domain DeC if and only if f (z) is continuous in D
Idealtheoretische Grundlagen, Springer, Wien, 1949. and if for any simple closed rectifiable contour LcD
[5] GROTHENDIECK, A. and DIEUDONNE, J.: 'Elements de
geometrie algebrique', Pub!. Math. IRES 4.
the integralkf (z) dz vanishes. For an abstract analytic
[6] SAMUEL, P.: Methodes d'algebre abstraite en geometrie function f (z) of a complex variable z the Cauchy for-
algebrique, Springer, 1955. mula (cf. Cauchy integral) is valid.
[7] SERRE, J.-P.: 'Faiseaux algebriques coherentes', Ann. of Math.
(2) 61, no. 2 (1955), 197-278. Let f (x) be a weakly analytic function in a domain
[8] ZARISKI, 0.: 'Algebraic geometry. The fundamental ideas of D of a Banach space X. Then f(x + ~h), as a function of
abstract algebraic geometry', in Proc. Internat. Congress the complex variable ~, has derivatives of all orders in
Mathematicians Cambridge, 1950, Vol. 2, Amer. Math. Soc.,
1952, pp. 77-89. the domain D = {~: x +~h ED}, hEX, t!Iese derivatives
[9] WElL, A.: Foundations of algebraic geometry, Arner. Math. Soc., being abstract analytic functions from D into Y. If the
1946.
I. V Dolgachev set {x +~h: I ~ I.;;;; 1} belongs to D, then
00 1
Editorial comments. The systematic development of the I(x +h) = ~ -, f)n/(x, h),
concept of a topologized category and its applications was n=O n.
carried out by Grothendieck, P. Deligne and many others where the series converges in norm, and
(cf. [A1], [A2]).
References f)n/(x, h) = ~/(x +~h)1
[A 1] DELIGNE, P.: 'La conjecture de Weil I', Publ. Math, IHES 43 dt f=O
(1974), 273-307,
[A2] DELIGNE, P.: 'La conjecture de Weilil', Publ Math, IHES 52 = ~ ( I(x+~h~-n-l d~.
(1980), 237-252, 2m Id=l
A function y =P(x) from X into Y is called a
AMS 1980 Subject Classification: 14-XX
polynomial with respect to the variable x of degree at
ABSTRACf ANALYTIC FUNCflON, analytic map- most m if, for all x, h EX and for all complex t one has
ping of Banach spaces - A function f (x) acting from m
some domain D of a Banach space X into a Banach P(x+~h) = ~P.(x,hW',
.=0
space Y that is differentiable according to Frechet every-
where in D, i.e. is such that for any point a ED there where the functions P .(x, h) are independent of ~. The
exists a bounded linear operator 8f(a, .) from X into Y degree of P (x) is exactly m if Pm(x, h );~:O. A power
for which the following relation is true: series is a series of the form ~:=oPn(x) where Pn(x)
II h ,,-I. "/(a +h)-l(a)-f)/(a, h)" = 0,
are homogeneous polynomials of degree n so that
lim
II h 11--+0 Pn(ax)=il Pn(x), x EX, for all complex a. An arbitrary
where 11'11 denotes the norm on X or on Y; 8f(a, h) is weakly convergent power series ~:=oPn(x) in a
called the Frechet differential of f at a. domain D converges in norm towards some weakly
Another approach to the notion of an abstract ana- analytic function f(x) in D, and Pn(x) = 8nf (0, x) / n!,
lytic function is based on differentiability according to OED. A function f(x) is an abstract analytic function
Gateaux. A function f (x) from D into Y is weakly ana- if and only if it can be developed in a power series in a
lytic in D, or differentiable according to Gateaux in D, if neighbourhood of all points a ED
for each continuous linear functional y' on Y and each
element hEX the complex function y'(f(x+~h)) is a
holomorphic function of the complex variable ~ in the n=O

disc I ~ I<p(x, h), where p(x, h)=sup{ I ~ I: where all Pn(h) are continuous in X.
x + ~h ED}. Any abstract analytic function in a domain Many fundamental results in the classical theory of
D is continuous and weakly analytic in D. The converse analytic functions - such as the maximum-modulus
proposition is also true, and the continuity condition principle, the uniqueness theorems, the Vitali theorem,
can be replaced by local boundedness or by continuity the Liouville theorem, etc. - are applicable to abstract
according to Baire. analytic functions if suitable changes are introduced.
The term 'abstract analytic function' is sometimes The set of all analytic functions in a domain D forms a
employed in a narrower sense, when it means a func- linear space.

27
ABSTRACT ANALYTIC FUNCTION

The notion of an abstract analytic function can be abstractions is one of the principal tasks of the founda-
generalized to wider classes of spaces X and Y, such as tions of mathematics. A careful consideration of the
locally convex topological spaces, Banach spaces over problems related to this circle of questions resulted in
an arbitrary complete valuation field, etc. the recognition of the fundamental importance of the
References following factors: 1) the evaluation of abstract objects
[1] HILLE, E. and PHILLIPS, R.: Functional analysis and semi resulting from the superposition of far-reaching ideali-
groups, Amer. Math. Soc., 1957. zations requires the development of special means for
[2] EDWARDS, R.E.: Functional analySiS: theory and applications.,
Holt, Rinehardt, Winston, 1965.
their understanding; such a development is a difficult
[3] ScHWARTZ, L.: COUTS d'analyse, 2, Hermann, 1967. task, which forms the subject of a special discipline:
A.A. Danilevich semantics; and 2) the logical apparatus which may be
E.D. Solomentsev applied to any given mathematical theory essentially
depends on the nature of the fundamental concepts of
AMS 1980 Subject Classification: 46G20, 58812,
this theory, and hence on the nature of the
58C10,32A30
mathematical abstractions accepted in the formulation
ABSTRACflON, MATHEMATICAL - Abstraction in of these concepts. See Intuitionism; Constructive
mathematics, or mental abstraction, is a significant mathematics.
component of the mental activity aimed at the formula- Major contributions to the analysis of abstractions
tion of basic mathematical concepts. The most typical used in mathematics were made by L.E.J. Brouwer [I],
abstractions in mathematics are 'pure' abstractions, H. Weyl [2], D. Hilbert [3], A.A. Markov [4], and oth-
idealizations and their various multi-layered superposi- ers.
tions (see [5]). References
The mental act of 'pure' abstraction consists of the [1] BROUWER, L.E.J.: 'De onbetrouwbaarheid der logische prin-
cipes', Tijdschrift voor Wijsbegeerte 2 (1908), 152-158.
fact that in a certain situation being considered, atten- [2] WEYL, H.: Das Kontinuum, Chelsea, reprint, 1973.
tion is fixed only on certain (essential) features of the [3] HILBERT, D.: Grundlagen der Geometrie, Springer, 1913,
objects being considered and on the interrelationships Appendix VIII.
[4] MARKov, A.A.: On the logic of constructive mathematics, Mos-
between them, with the exclusion of other properties cow, 1972 (in Russian).
and relationships which are considered as irrelevant. [5] SHANIN, N.A.: 'Constructive real numbers and constructive
The result of such an act of abstraction, firmed in an function spaces', Trudy Mat. Inst. Steklov. 67 (1962), 15-294 (in
Russian).
appropriate language, begins to play the part of a gen- N.M. Nagornyf
eral concept. A typical example of mathematical
AMS 1980 Subject Classification: 03A05
abstraction of this kind is abstraction by identification.
The mental act of idealization means that, in a cer- ABSTRACflON BY IDENTIFICATION - A manner
tain situation being considered, one's imagination gen- of forming general abstract concepts in which, when
erates a certain concept which becomes the object con- considering concrete initial objects, only those differ-
sidered by one's consciousness. The properties imparted ences are considered that, for any reason, are relevant
to the concept by one's imagination are not only those to the given situation, while other differences that are
actually posessed by the initial objects as a result of the regarded as irrelevant are ignored. Initial objects differ-
act of 'pure' abstraction, but also other, imagined pro- ing from one another only in irrelevant aspects are con-
perties, which reflect the original properties of the ini- sidered as identical. Linguistically, abstraction by iden-
tial objects in a modified manner, or even properties tification is manifested by the fact that two similar ini-
altogether absent in reality. One of the most tradi- tial objects are identified with each other, and are spo-
tional mathematical idealizations is the abstraction of ken of as a single object, after having been assigned
actual infinity, which leads to the idea of an actual some suitable term. For instance, by identifying similar
infinity. This abstraction is the base of the set-theoretic letters (words, alphabets) with each other, one arrives
development of mathematics. Another traditional ideal- at the concept of an abstract letter, word or alphabet
ization - the abstraction of potential realizability - (see [l]); by identifying equivalent fundamental
leads to the idea of a potential infinity. This abstraction, sequences of rational numbers one arrives at the con-
in conjunction with the refusal to use the abstraction of cept of real numbers (cf. Real number); by identifying
actual infinity, forms the base of constructive founda- isomorphic groups with each other, one arrives at the
tions of mathematics. concept of an abstract group, etc.
The nature of any mathematical theory is largely See also Abstraction, mathematical; Equivalence; Iso-
determined by the nature of the mathematical abstrac- morphism.
tion on which the formulation of the fundamental con- References
cepts of this theory is based. The analysis of such [I] MARKOV, A.A.: Theory of algorithms, Israel Progr. Sci. Trans!.,

28
ACCELERATION OF CONVERGENCE

1961 (translated from the Russian). Also: Trudy Mat. Inst. principle, be indefinitely extended (e.g. the successive
Steklov. 42 (1954). generation of positive integers starting from zero), the
[2] MARKov, A.A.: On the logic of constructive mathematics, Mos-
cow, 1972 (in Russian). abstraction of potential realizability consists in ignoring
N.M. Nagornyi any possible spatial, temporal or material obstacles to
AMS 1980 Subject Classification: 03A05 the realization of each successive step of the process,
and to consider each step as potentially realizable. The
ABSTRACfION OF ACfUAL INFINITY - A application of the abstraction of potential realizability
mathematical idealization, related to a certain form of to the example given above is tantamount to assuming
the idea of infinity in mathematics - the idea of the that a unit can be added to any natural number, that it
so-called actual infinity. is possible to form the sum of any two natural
As applied to constructive processes of potentially numbers, etc., but it does not imply the idea of the
infinite duration (e.g. the generation of successive posi- existence of the natural sequence as an actual 'infinite
tive integers starting from zero), abstraction of actual object'.
infinity consists of ignoring the fact that such processes The acceptance of the abstraction of potential real-
do not terminate in principle and in considering the izability logically leads to the principle of mathematical
results of such processes on the assumption that they induction.
have terminated, viz. that the sets of objects have been Abstraction of potential realizability plays a special
generated. The resulting sets (objects) are then mentally role in constructive mathematics, in which propositions
regarded as actual 'finished' objects. The application of concerning the existence of constructive objects that
the abstraction of actual infinity to the above example satisfy given conditions are regarded as propositions on
makes it possible to consider the set of all non-negative the potential realizability of such objects.
integers - the natural sequence - as a mathematical See also Abstraction, mathematical.
N.M. Nagornyi
object.
Logically, acceptance of the abstraction of actual AMS 1980 Subject Classification: 03A05
infinity leads to the acceptance of the law of the
excluded middle as a logical principle. ACCELERATION OF CONVERGENCE (for an itera-
Abstraction of actual infinity is particularly impor- tive method) - The construction of a modification of
tant in the construction of mathematics on the base of the iterative method, by means of that method, that
the general theory of sets, established by G. Cantor. possesses a greater rate of convergence. The accelera-
Abstraction of actual infinity, being a far-reaching tion methods (acceleration processes) used are quite
idealization, especially so when repeatedly employed in varied (see [1] - [6]) and depend both on the problem to
conjunction with other idealizations, generates objects be solved and on the type of iterative method. In those
whose 'tangibility' is indirect, as a result of which the cases where the iterative method can be considered as a
problems involved in understanding propositions con- particular case of a class of iterative methods contain-
cerning such objects meet with certain difficulties. ing free iteration parameters, acceleration of conver-
Unrestricted use of abstraction of actual infinity in gence can be reduced to the problem of the optimal
mathematics as a legitimate method of generating choice of these parameters. The optimization problem
mathematical objects met with objections by a number can take various forms, and may reduce, for example,
of mathematicians (L. Kronecker, c.F. Gauss, D. Hil- to a transition from a method of simple iteration,
bert, H. Weyl, and others). Positive programs of u n+ 1 = un-T(Lun-f), (1)
mathematical construction based on abstraction of
potential realizability, without recourse to abstraction for the solution of a system of linear algebraic equa-
of actual infinity, were proposed by L.E.J. Brouwer (see tions
Lu = j, L = L' > 0, (2)
Intuitionism) and by A.A. Markov (d. Constructive
mathematics). either to the Richardson method with Chebyshev
See also Abstraction, mathematical. parameters or to the method of conjugate gradients.
N.M. Nagornyi The rate of convergence of similar classical iterative
AMS 1980 Subject Classification: 03A05 methods depends on the condition number v(L) of the
matrix L and can be fairly slow for large v(L). In such
ABSTRACfION OF POTENTIAL REALIZABILITY - situations, and particularly for the solution of systems
A mathematical idealization, related to a certain form of grid equations, modifications of these methods are
of the concept of infinity in mathematics - the idea of often used, these can be defined by the fact that they
a potential infinity. are used not for (2), but for an equivalent system
As applied to constructive processes which can, in B-ILu = B-Ij,

29
ACCELERATION OF CONVERGENCE

where B =B* >0 is a specially selected operator (see [2] References


[A1] AxELSSON, 0.: 'Conjugate gradient type methods for unsym-
- [4]).
metric and inconsistent systems of linear equations', Linear
The operator B- 1L is self-adjoint and positive on algebra and its Appl. 34 (1980), 1-66.
some Euclidean space and the rate of convergence of [A2] GoLUB, G.H. and LooN, C.F. VAN: Matrix computations,
the modifications obtained depends on P(B -1 L). Simi- North Oxford Academic, 1983.
[A3] GoLUB, G.H. and VARGA, R.S.: 'Chebyshev semi-iterative
lar modifications are also used for more general prob- methods, successive over-relaxation methods, and second-
lems, including non-linear problems (see Non-linear order Richardson iterative methods. Part I, II', Numerical
equation, numerical methods). In order to realise these Math. 3 (1961),147-156; 157-168.
modifications it is important to be able to solve the sys- [A4] MANTEUFFEL, T.A.: 'The Tchebychev iteration for nonsym-
metric linear systems', Numerical Math. 28 (1977),
tems Bv = g effectively, since, for example, modification 307-327.
of (1) leads to the relation [A5] MEIJEllINK, J.A. and VORST, H.A. VAN DER: 'An iterative solu-
tion method for linear systems of which the coefficient matrix
(3) is a symmetric M-matrix', Math. Compo 31 (1977), 148-162.
(see Minimization of the labour of calculation). [A6] STEWART, G.W.: 'Conjugate gradients methods for solving
systems of linear equations', Numerical Math. 21 (1973),
One of the traditional general methods for the 284-297.
acceleration of convergence for methods (1) is the 82 _ [A7] VARGA, R.S.: Matrix iterative analysis, Prentice Hall, 1962.
process. It is used along with a whole series of other AMS 1980 Subject Classification: 65F10
acceleration methods (see [1]) in iterative methods for
the partial eigen value problem. ACCUMULATION OF ERRORS in the numerical
In solving non-linear problems, acceleration of con- solution of algebraiC equations - The overall effect of
vergence is often achieved by choosing a special initial rounding-off at the various stages of a computation
approximation on the basis of methods of continuation procedure on the accuracy of the computed solution to
by a parameter. For these same problems acceleration a system of algebraic equations. The most commonly
of convergence is also sometimes realized by the use of employed technique for a priori estimation of the total
iterative methods of a higher order (the effect of rounding-off errors in numerical methods of
Newton - Kantorovich method and others). linear algebra is the scheme of inverse (or backward)
Various methods for the acceleration of convergence analysis. Applied to the solution of linear algebraic
are also used in probabilistic iterative methods of equations
Ax = b (1 )
Monte-Carlo type (see [2]).
References the scheme of inverse analysis is as follows. On the
[I] FADDEEV, D.K. and FADDEEVA, V.N.: Computational methods assumption that some direct method M has been used,
of linear algebra, Freeman, 1963 (translated from the Russian). the computed solution XM does not satisfy (1), but it
[2] BAKHVALOV, N.S.: Numerical methods: analySis, algebra, ordi-
nary differential equations, Mir, 1977 (translated from the Rus- can be expressed as the exact solution of a perturbed
sian). system
(2)
[3] MARCHUK, GJ.: Methods of computational mathematics, Mos-
cow, 1980 (in Russian). The quality of the direct method is estimated in terms
[4] SAMARSKIl, A.A. and NIKOLAEV, E.S.: Methods of solving differ-
ence equatiOns, Moscow, 1978 (in Russian).
of the best a priori estimate that can be found for the
[5] GLOWINSKI, R.: Numerical methods for nonlinear variational norms of the matrix FM and the vector k M . These 'best'
problems, Springer, 1984. FM and kM are known as the equivalent perturbation
[6] D'YAKONOV, E.G.: 'On iterative methods with saddle opera- matrix and vector, respectively, for the method M.
tors', Dokl. Akad. Nauk. SSSR, New Series 292 (1987), 1037-
If estimates for FM and kM are available, the error of
1041 (in Russian). E.G. D'yakonov
the approximate solution XM can be estimated theoreti-
Editorial comments. In the Western literature, cally by the inequality
Richardson's method with Chebyshev parameters is usually
called the Chebyshev semi-iterative method. Chebyshev II X-XM II,,;::: cond(A) [II FM II + II kM II] 3
iteration is extensively analyzed in [A3] and [A7] , Chapt. 5. II x II "" I -II A -I II II FM II II A II IfbT I )
A further analysis for the case when L is non-symmetric can Here cond(A) = II A II II A -1 II is the condition number
be found in [M]. of the matrix A, and the matrix norm in (3) is assumed
Unlike Chebyshev iteration, the method of conjugate gra-
to be subordinate to the vector norm II II.
dients does not need accurate estimates of the largest and
In reality, an estimate for II A -1 II is rarely known
smallest eigen values of the iteration matrix 1- TL in (1).
Good references are [A2] , pp. 362-379, and [A6]. Further
in advance, and the principal meaning of (2) is the pos-
developments are discussed in [AS].
sibility that it offers of comparing the merits of dif-
The pre-multiplication by a matrix B- 1 of the system (2) ferent methods. In the sequel some typical estimates for
is often called pre-conditioning. An up-to-date survey of this the matrix FM are presented.
technique is given in [A 1]. For methods with orthogonal transformations and

30
ACCUMULATION OF ERRORS

floating-point arithmetic (A and b in the system (l) are recent developments (interval analysis and automatic error
assumed to be real) analysis) can be found in [A2], [A3] and [A8]. A complete
introduction to interval analYSis is given in [A4].
(4)
References
In this estimate, is the relative precision of in the [A1] FORSYTHE, G.E. and MOLER, CB.: Computer solution of
computer, II A liE = (~at)1 /2 is the Euclidean matrix linear algebraic systems, Prentice Hall, 1967.
[A2] LARSEN, I. and SAMEH, A.: 'Efficient calculation of the effects
norm, and f(n) is a function of type Cnk, where n is of roundoff errors', ACM Trans. Math. Software 4 (1978),
the order of the system. The exact values of the con- 228-236.
stants C and the exponents k depend on such details of [A3] MILLER, W. and SPOONER, D.: 'Software for roundoff
analysis II', ACM Trans. Math. Software 4 (1978), 369-390.
the computation procedure as the rounding-off method, [A4] STEWART, G.W.: 'Conjugate gradients methods for solving
the use made of accumulation of inner products, etc. systems of linear equations', Numer. Math. 21 (1973), 284-
Most frequently, k = I or 3/2. 297.
[AS] WILKINSON, I.H.: 'Error analysis of direct methods of matrix
In Gauss-type methods, the right-hand side of the inversion', J. Assoc. Comput. Machinery 8 (1961),281-330.
estimate (4) involves yet another factor g(A), reflecting [A6] WILKINSON, I.H.: 'Modern error analysis', SIAM Review 13,
the possibility that the elements of A may increase at 548-588.
[A7] YOHE, I.M.: 'Software for interval arithmetic', ACM Trans.
intermediate steps of the method in comparison with
Math. Software 5 (1979), 50-63.
their initial level (no such increase occurs in orthogonal
methods). In order to reduce g(A) one resorts to vari- Accumulation of rounding-off errors (or of errors)
0us ways of pivoting, thus putting bounds on the occurs when one is solving problems in which the solu-
increase of the matrix elements. tion is the result of a large number of consecutively
In the square-root method (or Cholesky method), performed arithmetic operations.
which is commonly used when the matrix A is positive A significant proportion of such problems arises in
definite, one has the sharper estimate the solution of linear or non-linear algebraic problems
(see above). In turn, one is most commonly concerned
IIFMIIE';;; ellA IIE(' with algebraic problems thai arise from the approxima-
There exist direct methods (the methods of Gordan, tion of differential equations. These problems display
bordering and of conjugate gradients) for which a certain specific features.
direct application of a scheme of inverse analysis does Errors accumulate in accordance with the same - or
not yield effective estimates. In these cases different even simpler - laws as those governing the accumula-
arguments are utilized to investigate the accumulation tion of computational errors; they may be investigated
of errors (see [6] - [9]). when analyzing a method for the solution of a problem.
References There are two different approaches to investigating
[I] GIVENS, W.: 'Numerical computation of the characteristic the accumulation of computational errors. In the first
values of a real symmetric matrix', Us. Atomic Energy Com
mission Reports. Ser. ORNL, no. 1574 (1954).
case one assumes that the computational errors at each
[2] WILKINSON, I.H.: Rounding errors in algebraic processes, Pren- step are introduced in the most unfavourable way and
tice Hall, 1962. so obtains a majorizing estimate for the error. In the
[3] WILKINSON, I.H.: The algebraic eigenvalue problem, Oxford
second case, one assumes that the errors are random
Dniv. Press, 1969.
[4] VOEVODIN, V.V.: Rounding-off errors and stability in direct and obey a certain distribution law.
methods of linear algebra, Moscow, 1969 (in Russian). The nature of the accumulation of errors depends on
[5] VOEVODIN, V.V.: Computational fundamentals of linear algebra, the problem being solved, the method of solving and a
Moscow, 1977 (in Russian).
[6] PETERS, G. and WILKINSON, I.H.: 'On the stability of
variety of other factors that appear at first sight to be
Gauss-Gordan elimination with pivoting', Commun. ACM 18, rather inessential: such as the type of computer arith-
no. I (1975), 20-24. metic (fixed-point or floating-point), the order in which
[7] BROYDEN, CG.: 'Error propagation in numerical processes', J.
the arithmetic operations are performed, etc. For exam-
Inst. Math. and Appl. 14, no. 2 (1974), 131-140.
[8] REID, I.K.: Large sparse sets of linear equations, London New ple, in computing the sum of N numbers

[9]
York, 1971, pp. 231-254.
IKRAMov, KH.D.: 'The condition for intermediate matrices for
AN = aj + ... +aN
Gauss, Gordan and optimal elimination methods', Zh. Vychisl. the order of the operations is significant. Suppose that
Mat. i Mat. Fiz. 18, no. 3 (1978), 531-545 (in Russian).
the computation is being done in a computer with
Kh.D.lkramov floating-point arithmetic with t binary digits, all
Editorial comments. Reference [3] above contains a numbers lying in the interval 1/2< I an I.;;; 1. In a
detailed account of rounding-off error analysis (also called direct computation of AN via the recurrence formula
roundoff error analysis) of algebraic processes. In addition
one can mention [A1], pp. 87-97, and [A5], pp. 69-82. A
historical analysis of roundoff analysis is given in [A7]. More the majorizing error estimate is of the order 2- 1N. But

31
ACCUMULATION OF ERRORS

one can proceed otherwise (see [1]). First compute sums ous step, as an error in initial data. A lower bound for
of pairs, Al =a2k-1 +a2k (if N=21 + 1 is odd, one puts A(h) is therefore dependent on the behaviour of the
AJ+I =a2/+1)' Then compute further sums of pairs, difference between slightly different solutions of the
A~=A!k-1 +A!k> etc. If 2m - I <N.;;;;,2m , then m steps differential equation, as defined by the small-
of pairwise addition using the formulas disturbance equation.
AZ = A~-~l +A~-l, A2 _ ak>
In the numerical solution of an ordinary differential
equation y' = f(x, y), the small-disturbance equation
yield AT =AN; the majorizing error estimate is of the has the form ,
order 2-I logz N. 11 = hex, y)1! + S,
In typical problems, the numbers am are computed and so, when solving the problem on an interval
according to formulas, in particular recurrence formu- (Xo, X), one cannot reckon on the constant A(h) in a
las, or appear consecutively in the operating memory of majorizing estimate for the computational error being
the computer; in such cases use of the scheme just essentially better than
described increases the load on the computer memory. x
X [h(t.Y(I))dl
However, the sequence of computations can be so fe dx.
organized that the load on the operating memory is Xo

never more than ""' logz N cells. When solving this problem, therefore, the most com-
In the numerical solution of differential equations monly used methods are one-step methods of the
one may encounter the following cases. As the grid Runge- Kutta or Adams type (see [3], [7]), where the
spacing h tends to zero, the error increases as (a(h)i-', accumulation of errors is basically determined by the
where q >0, while limh-->o I a(h) I > 1. Such methods of solution of the small-disturbance equation.
solution fall into the category of unstable methods. They In many methods, the principal term of the error
are of little practical interest. increases according to a similar law, while the computa-
In stable methods the error will characteristically tional error builds up significantly more rapidly (see
increase at the rate A (h)h -q, where [3]). The domain of practical applicability for such
li~_o I A (h) I < 00. The error in such methods is usu- methods is substantially narrower.
ally estimated in the following way. One constructs an The accumulation of computational errors essentially
equation for the perturbation, whether due to the intro- depends on the method employed to solve the grid
duction of rounding-off errors or other errors, and then problem. For example, when solving grid boundary
investigates the solution of this equation (see [2], [3]). value problems corresponding to ordinary differential
In more complicated cases, one uses the method of equations by the shooting or double-sweep method, the
equivalent perturbations (see [1], [4]), developed in con- accumulation of errors behaves like A (h)h -q, where q
nection with the investigation of the accumulation of is the same. The magnitudes A (h) in these methods
computational errors in the solution of differential may differ to such an extent that in a certain situation
equations (see [3], [5], [6]). Computations according to one of the methods becomes useless. In the shooting
some standard scheme with rounding-off are treated as method for a grid boundary value problem, the accu-
computations without rounding-off but for equations mulation of errors behaves like c 1/ h, C >0, while for
with perturbed coefficients. Comparing the solution of the double-sweep method this is Ah -q. If one adopts a
the original difference equation with the solution of the probabilistic approach to the investigation of the accu-
equation with perturbed coefficients, one obtains an mulation of errors, there are cases in which one
estimate for the error. assumes a priori a certain distribution law for the
Considerable care is taken to select a method in errors (see [2]); in others, one introduces a measure on
which q and A(h) are as small as possible. When a the space of problems under consideration and, on the
fixed method of solution is being used, the standard basis of this measure, derives a distribution law for the
formulas can usually be transformed in such a way that rounding-off errors (see [8], [9]).
q';;; I (see [3], [5]). This is particularly important in the When measuring the accuracy of the solution of a
case of ordinary differential equations, where the problem, the majorizing and probabilistic approaches
number of steps in the various cases turns out to be to the estimation of the accumulation of computational
very high. errors usually yield qualitatively identical results: The
The magnitude A (h) can increase strongly as the errors either accumulate in both cases within admissible
integration interval is enlarged. The goal is therefore to bounds, or they exceed them in either case.
use methods with A (h) as small as possible. When deal-
References
ing with Cauchy problems, the rounding-off error at [1) VOEVODIN, V.V.: Computational foundations of linear algebra,
each specific step can be regarded, relative to the previ- Moscow. 1977 (in Russian).

32
ACTION OF A GROUP ON A MANifOLD

[2] SHURA-BURA, M.R.: Prikl. Mat. i Mekh. 16, no. 5 (1952), 575- integral of a function, the stationary values of which
588. determine a real motion of a mechanical system acted
[3] BAKHVALOV, N.S.: Numerical methods: analysis, algebra, ordi-
nary differential equations, Mir, 1977 (translated from the Rus- upon by given active forces, in the class of kinemati-
sian). cally possible motions between some two final positions
[4] WILKINSON,l.H.: The algebraic eigenvalue problem, Oxford Po and P I in space and satisfying certain conditions.
Univ. Press, 1969.
[5] BAKHVALOV, N.S.: Computational methods and programming, One distinguishes between Hamiltonian, Lagrangian
Vol. I, Moscow, 1962, pp. 69-79 (in Russian). and Jacobian actions, which appear in the correspond-
[6] GoDUNOV, S.K. and RYABEN'KIi, V.S.: The theory of difference ing principles of stationary action.
schemes, North-Holland, 1964 (translated from the Russian).
The Hamiltonian action
[7] BAKHVALOV, N.S.: 'On an estimate of the error at numerical
integration of differential equations by Adams' extrapolation 1\

method', Dokl. Akad. Nauk SSSR 104, no. 5 (1955), 683-686 j(T+U)dt
(in Russian). 10

[8] BAKHVALOV, N.S.: 'Optimal convergence bounds for quadratic


processes and integral methods of Monte Carlo type for classes
is defined in the class of kinematically possible motions
of functions', Zh. Vychisl. Mat. i Mat. Fiz. 4, no. 3 (1964), of a holonomic system for which the initial and the
399-404 (in Russian). final positions of the system, as well as the time of
[9] LAPSHIN, E.A.: 'A statistical investigation of roundoff errors in
motion between them, are the same as the respective
the numerical solution of an ordinary differential equation',
Zh. Vychisl. Mat. i Mat. Fiz. 11, no. 6 (1971), 1425-1436 (in ones for real motion.
Russian). The Lagrangian action
N. S. Bakhvalov
t
Editorial comments. Statistical treatment of rounding-off j2Tdt
(or roundoff) errors can be found in the references below. It 10

should be remarked that statistical analyses give more real- and the Jacobian action
istic bounds, but are much more difficult to compute.
References
[A 1] P.: Discrete variable methods in ordinaT'/ differential
HENRIe!,
equations, Wiley, 1962.
[A2] HENRIe!, P.: Error propagation for difference methods, Wiley,
1963.
[A3] HULL, T.E. and SWENSON, J.R.: 'Tests of probabilistic models
are defined in the class of kinematically possible
for propagation of roundoff errors', Commun. ACM 9 (1966),
108-113. motions of a holonomic conservative system for which
the initial and the final positions of the system, and the
AMS 1980 Subject Classification: 65GXX, 65F35,
constant energy h, are the same as the respective mag-
65LXX
nitudes for a real motion. Here T is the kinetic energy
ACCUMULATION POINT of a set A - A point x in a of the system, and for a conservative system
topological space X such that in any neighbourhood of
x there is a point of A distinct from x. A set can have
1
T = -2 ~ a;/J;qj'
;.j=l
many accumulation points; on the other hand, it can where qi are the generalized Lagrange coordinates, and
have none. For example, any real number is an accu- U(q) is the force function of active forces.
mulation point of the set of all rational numbers in the For more details see Variational principles of classical
ordinary topology. In a discrete space, no set has an mechanics; Hamilton - Ostrogradski principle; Lagrange
accumulation point. The set of all accumulation points principle; Jacobi principle.
of a set A in a space X is called the derived set (of A). V V Rumyantsev
In a T I-space, every neighbourhood of an accumula- AMS 1980 Subject Classification: 70-01, 70BXX
tion point of a set contains infinitely many points of
the set. ACTION OF A GROUP ON A MANIFOLD - The
The concept just defined should be distinguished best-studied case of the general concept of the action of
from the concepts of a proximate point and a complete a group on a space. A topological group G acts on a
accumulation point. In particular, any point of a set is a space X if to each g E G there corresponds a homeomor-
proximate point of the set, while it need not be an phism CPg of X (onto itself) satisfying the following con-
accumulation point (a counterexample: any point in a ditions: I) CPg 'CPh = CPgh; 2) for the unit element e E G the
discrete space). mapping CPe is the identity homeomorphism; and 3) the
A. V Arkhangel'skif mapping cp: G XX~X, cp(g, x)=CPg(x) is continuous. If
AMS 1980 Subject Classification: 54AXX X and G have supplementary structures, the actions of
G which are compatible with such structures are of spe-
ACTION - A functional expressed by the definite cial interest; thus, if X is a differentiable manifold and

33
ACTION OF A GROUP ON A MANIFOLD

G is a Lie group, the mapping cp is usually assumed to Recent results (mid-nineteen-seventies) mostly con-
be differentiable. cern: 1) the determination of types of orbits with vari-
The set {cpg(xo) }gEG is called the orbit (trajectory) of ous suplementary assumptions concerning the group G
the point Xo EX with respect to the group G; the orbit and the manifold X ([6]); 2) the classification of group
space is denoted by X / G, and is also called the quo- actions; and 3) finding connections between global
tient space of the space X with respect to the group G. An invariants of the manifold X and local properties of the
important example is the case when X is a Lie group group actions of G in a neighbourhood of fixed points
and G is a subgroup; then X / G is the corresponding of XG. In solving these problems an important part is
homogeneous space. Classical examples include the played by: methods of modem differential topology
spheres S n -I = 0(n) / O(n - I), the Grassmann mani- (e.g. surgery methods); KG-theory [1], which is the
folds O(n) / (O(m)XO(n -m, and the Stiefel mani- analogue of K-theory for G-vectorbundles; bordism and
folds O(n) / O(m) (cf. Grassmann manifold; Stiefel cobordism theories [3]; and analytical methods of
manifold). Here, the orbit space is a manifold. This is studying the action of the group G based on the study
usually not the case if the action of the group is not of pseudo-differential operators in G-bundles [2], [7].
free, e.g. if the set X G of fixed points is non-empty. A References
free action of a group is an action for which g = e fol- [II ATIYAH, M.F.: K-theory: lectures, Benjamin, 1967.
lows if gx =x for any x EX. On the contrary, X G is a [2) ATIYAH, M.F. and SINGER, I.M.: 'The index of elliptic opera-
tors', Ann. of Math. (2) 87 (1968), 484-530.
manifold if X is a differentiable manifold and the
[31 BUKHSHTABER, V.M., MISHCHENKO, A.S. and NOVIIWV, S.P.:
action of G is differentiable; this statement is valid for 'Formal groups and their role in the apparatus of algebraic
cohomology manifolds over Zp for G = Zp as well topology', Russian Math. Surveys 26 (1971), 63-90. (Uspekhi
(Smith's theorem). Mat. Nauk 26, no. 2 (1971), 131-154)
[4) CONNER, P.E. and FLOYD, E.E.: Differentiable periodic maps,
If G is a non-compact group, the space X / G is usu- Springer, 1964.
ally inseparable, and this is why a study of individual [5) BREDON, G.: Introduction to compact transformation groups,
trajectories and their mutual locations is of interest. Acad. Press, 1972.
[6) HSIANG, Wu YI: Cohomology theory of topological transforma-
The group G = R of real numbers acting on a differenti- tion groups, Springer, 1975.
able manifold X in a differentiable manner is a classical [7) ZAGIER, D.B.: Equivariant Pontryagin classes and applications to
example. The study of such dynamical systems, which orbit spaces, Springer, 1972.
in terms of local coordinates is equivalent to the study [8) Proc. conf. transformation groups, Springer, 1968.
[9) Proc. second conf. compact transformation groups, Springer,
of systems of ordinary differential equations, usually 1972.
involves analytical methods. A. V. Zarelua
If G is a compact group, it is known that if X is a Editorial comments.
manifold and if each gEG, g=l=e, acts non-trivially on References
X (i.e. not according to the law (g, X)-H), then G is a [A1] PETRIE, T. and RANDALL, J.D.: Transformation groups on
manifolds, M. Dekker, 1984.
Lie group [8]. Accordingly, the main interest in the
action of a compact group is the action of a Lie group. AMS 1980 Subject Classification: 57SXX
Let G be a compact Lie group and let X be a com-
pact cohomology manifold. The following results are ACYCLIC CONTINUUM - A continuum with trivial
typical. A finite number of orbit types exists in X, and homology groups (d. Homology group).
the neighbourhoods of an orbit look like a direct pro- P.s. Aleksandrov
duct (the slice theorem); the relations between the coho- AMS 1980 Subject Classification: 54F15
mology structures of the spaces X, X / G and X G are
of interest. ACYCLIC ELEMENT - A component of the set of
If G is a compact Lie group, X a differentiable mani- dividing points and terminal points of a continuum.
fold and if the action A.A. Mal'tsev

</>:GXX ~ X AMS 1980 Subject Classification: 54F15

is differentiable, then one naturaly obtains the follow-


ADAMS METHOD - A finite-difference method for
ing equivalence relation: (X, CP)--(X', CP') if and only if
solving Cauchy's problem for systems of first-order dif-
it is possible to find an (X", CP") such that the boundary
ferential equations
ax" has the form ax" =X U X' and such that
cp" I x = cp, cp" I x = cp'. If the group G acts freely, the y' = I(x,y), y(xo) = Yo
equivalence classes can be found from the one-to-one In integration over a grid with a constant step
correspondence with the bordisms Q.(B G ) of the classi- Xn = X 0 + nh the computational formulas may be based
fying space BG (cf. Bordism). a) on extrapolation

34
ADDITION

k step size. However, Adams' method proves to be more


Yn+\ = Yn +h ~:U-A!(Xn-A'Yn-~J; economical than the Runge - Kutta method in most
A=O
cases; it was first introduced by J.e. Adams in 1855.
or b) on interpolation
k-\ References
Yn+\ = Yn +h ~ v-A!(xn-A,Yn-A). [1) BEREZIN, I.S. and ZHIDKOY, N.P.: Computing methods, 2, Per-
A=-\ gamon Press, 1973 (translated from the Russian).
[2) BAKHVALOY, N.S.: Numerical methods: analysis, algebra, ordi-
For a given k, the latter formula gives more accurate
nary differential equations, Mir, 1977 (translated from the Rus-
results but a non-linear system of equations must be sian).
solved in order to find Yn + I. [3) TnrnONOY, A.N. and GoRBUNOY, A.D.: 'Asymptotic expan-
In practical work, an approximation by the former sions of the error in the difference method of solving Cauchy'S
problem for systems of differential equations', USSR Comput.
method is found and one or two corrections are per- Math. Math. Phys. 2, no. 4 (1962), 565-586. (Zh. Vychisl. Mat.
formed according to the formula i Mat. Fiz. 2, no. 4 (1962),537-548)
k-\ [4) LOZINSKI!, S.M.: 'An estimate of the error of numerical
y~tP = Yn +h ~ v-A!(Xn-A,Yn-A)+hvt!(xn+"y~~\), integration of differential equations 1', Izv. Vyssh. Uchebn.
A=O Zaved. Mat. 5, no. 6 (1958), 52-90 (in Russian).
which converge pointwise if h I v I I I (3f / 3y) I < l. [5) BAKHYALOY, N.S.: 'On an estimate of the error at numerical
integration of differential equations by Adams' extrapolation
The initial conditions Y I, . . . ,Yk for Adams' method, method', Dokl. Akad. Nauk SSSR 104, no. 5 (1955), 683-686
which are needed to begin the calculations by formula (in Russian).
N. S. Bakhvalov
a), must be found by some special method. The error
in the solution can be written as Editorial comments. The Adams method is a special k
or (k+ 1) multistep method. It yields explicit difference

x.. H [I. g(x, Ijy(" "(I) dl 1h" , +O(h"'), equations (predicator formula) in the extrapolation case a)
and implicit difference equations (corrector formula) in the
interpolation case b). Certain special case are known as
where Q(x, t) is the solution of the system Adams- Bashforth methods, viz. the series of explicit
methods k =0, Uo = 1 (Euler's: method);
dfJ}:, t) = /y(x,y(xfJ(x, t) k=1, uo=3/2, U-1 =-1 /2;
k=2, uo=23/12, U-1 =-4/3, u_2=5/12; etc.
when Q(t, t)=E.
A certain series of implicit methods is known as
The structure of the term O(h k +2) is such that, for Adams- Moulton methods, viz. k= 1, V1 = 1/2, Vo =1/2
small values of h, it is usually uniformly small as com- (the'trapezoidal rule);
pared to the main term on large intervals of integra- k=2, V1 =5/12, vo=2/3, V-1 =-1 /12; etc.
tion. This means that Adams' method can be applied References
over a large integration interval in the case of an [A1) Fox, L. and MAYERS, D.F.: Computing methods for scientists
absolutely-stable solution of the differential problem. In and engineers, Clarendon Press, 1968.
[A2] ROSSER, J.B.: 'Solving differential equations on a hand-held
particular, as distinct from the Milne method, it may be
programmable calculator', in G.H. Golub (ed.): Studies in
used for finding stable periodic solutions of differential numerical analysis, Math. Assoc. Amer., 1984, pp. 199-242.
equations. The standard Adams procedure for integra-
AMS 1980 Subject Classification: 65LXX, 65M10,
tion with automatic step selection is much more
65N10
involved than the standard Runge - Kutta method, since
the step-changing algorithm is much more involved and ADDmON - One of the basic arithmetic operations.
the selection of the initial values of Yk is not standard- The result of addition is called the sum. The sum of
ized. two numbers a and b is denoted by a + b, and a and b
In the case of the equations y' = -ay, a>O, the are called the summands. Addition of numbers is com-
extrapolation formula a) has the form mutative: a + b = b + a, and assocIative:
k
(a + b) + c = a + (b + c). The operation inverse to addi-
Yn+\ = Yn+h~U-A(-aYn-A).
10.=0 tion is called subtraction.
Particular solutions of this equation are Yn = CJln, where Addition is also the name usually given to the opera-
Jl is a root of the equation tion in an Abelian group (in additive notation) and to
the binary operation in a ring under which its elements
form an Abelian group. Here also, addition is associa-
tive and commutative. Sometimes, a non-commutative
If ah '2/ _ I u - A I > 2, then one root of this equation operation in a group is called addition, such as in the
is I JlI=>\ and the rounding-off errors rapidly case of a multi-operator group.
O.A. Ivanova
increase. When integrating with automatic step selec-
tion, this results in an unjustified diminution of the AMS 1980 Subject Classification: 00A25, 20XX

35
ADDITION OF SETS

ADDITION OF SETS - Vector addition and certain [3) FIREY, W.J.: 'Blaschke sums of convex bodies and mixed
other (associative and commutative) operations on sets bodies', in Proc. Coli. Convexity Copenhagen, 1965,
Copenhagen Univ. Mat. Inst., 1967, pp. 94-101.
Ai' The most important case is when the Ai are convex [4) DINGHAS, A.: Minkowskische Summen und Integrale. Superad-
sets in a Euclidean space Rn. ditive Mengenfunktionale. Isoperimetrische Ungleichungen, Paris,
The vector sum (with coefficients Ai) is defined in a 1961.
VP. Fedotov
linear space by the rule
AMS 1980 Subject Classification: 03EXX, 04AXX,
S = ~AiAi = U {~AiXi}'
x,EA j I
52A25, 52A20

where the Ai are real numbers (see [1)). In the space Rn, ADDITION THEOREM for weights - If a Hausdorff
the vector sum is called also the Minkowski sum. The compactum X can be represented as the union over a
dependence of the volume S on the Ai is connected set of infinite cardinality ';;;'T of its subspaces of weight
with mixed-volume theory. For convex Ai, addition ';;;'T, then the weight of X does not exceed 'T. The addi-
preserves convexity and reduces to addition of support tion theorem (which was formulated as a problem in
functions (cf. Support function), while for C 2 -smooth [I)) was established in [3] for 'T=~o and in [4] in com-
strictly-convex Ai eRn, it is characterized by the addi- plete generality. Cf. Weight of a topological space.
tion of the mean values of the radii of curvature at
points with a common normal. References
[1) ALEKSANDROV, P.S. and URYSOHN, P.: Memoire sur les espaces
Further examples are addition of sets up to transla- topologiques compacts, Koninkl. Nederl. Akad. Wetensch.,
tion, addition of closed sets (along with closure of the Amsterdam, 1929.
result, see Convex sets, linear space of; Convex sets, [2) ENGELKING, R.: General topology, PWN, 1977 (translated from
the Polish).
metric space of), integration of a continual family of [3) SMIRNOV, Yu.M.: 'On metrizability of bicompacta, decompos-
sets, and addition in commutative semi-groups (see [4]). able as a sum of sets with a countable base', Fund Math. 43
Firey p-sums are defined in the class of convex bodies (1956), 387-393 (in Russian).
Ai eRn containing zero. When p ;;;;'1, the support func- [4) ARKliANGEL'sKIi, A.V.: 'An addition theorem for weights of
sets lying in bicompacta', Dokl. Akad Nauk SSSR 126, no. 2
tion of the p-sum is defined as C'L./IIf)l / P, where Hi (1959), 239-241 (in Russian).
are the support functions of the summands. For p.;;; - 1 [5) ARKHANGEL'sKIi, A.V. and PONOMAREV, V.I.: Fundamentals of
general topology: problems and exercises, Reidel, 1984
one carries out (-p )-addition of the corresponding (translated from the Russian).
polar bodies and takes the polar of the result (see [2]). A. V Arkhangel'skii
Firey p-sums are continuous with respect to Ai and p. AMS 1980 Subject Classification: 54A25
The projection of a p-sum onto a subspace is the p-sum
of the projections. When p = 1, the p-sum coincides
ADDITIVE ARITHMETIC FUNCflON - An arith-
with the vector sum, when p = -1 it is called the
metic function of one argument that satisfies the fol-
inverse sum (see [1]), when p = + 00 it gives the convex
lowing conditions for two relatively prime integers m, n
hull of the summands, and when p = - 00 it gives their
intersection. For these four values, the p-sum of polyhe- f(mn) = f(m) +f(n).
dra is a polyhedron, and when p = +2, the p-sum of An additive arithmetic function is said to be strongly
ellipsoids is an ellipsoid (see [2)). additive if f(pa)= f(P) for all prime numbers p and all
The Blaschke sum is defined for convex bodies positive integers a. An additive arithmetic function is
Ai eRn considered up to translation. It is defined by said to be completely additive if the condition
the addition of the area functions [3]. f(mn)= f(m)+ f(n) is satisfied for relatively non-
The sum along a subspace is defined in a vector space prime integers m, n as well; ill such a case
X which is decomposed into the direct sum of two sub- f(pa)=af(p)
spaces Y and Z. The sum of Ai along Y is defined as Examples. The function Q(m), which is the number
of all prime divisors of the number m (multiple divisors
zl(z {~(Yz nAi+ are counted according to their multiplicity), is an addi-
tive arithmetic function; the function w(m), which is
where Y z is the translate of Y for which Yz n z = {z } the number of different prime divisors of the number
(see [1 D m, is strongly additive; and the function log m is com-
pletely additive. J.P. Kubilyus
References
[I] ROCKAFELLER, R.T.: Convex analySiS, Princeton Univ. Press, Editorial comments. An arithmetic function is also called
1970.
[2] FlREY, W.J.: 'Some applications of means of convex bodies',
a number-theoretic function.
Pacif. 1. Math. 14 (1964), 53-60. AMS 1980 Subject Classification: 10A20

36
ADDITIVE GROUP

ADDITIVE CATEGORY - A category (; in which for integer different from zero and n is a sufficiently large
any two objects X and Y an Abelian group structure is number. In particular 'T2(m)=T(m) is the number of
defined on the set of morphisms HOIllQ:(X, y), such divisors of the number m. Sums of the form (1) express
that the composition of morphisms the number of solutions of the equations
HOIIlq(X, Y)XHoIIlq(Y, Z) ~ HOIIlq(X, Z) X I . . . Xk, - YI . . . Yk, = a, (2)
is a bilinear mapping. Another necessary condition is xI . . . Xk, +Y I . . . Yk, = n, (3)
that (; includes a null object (zero object, cf. Null
respectively. Particular cases of the additive divisor
object of a category) as well as the product X X Y of
problem (k l =k2 =2, kl =2 and k2 =3) are considered
any two objects X and Y.
in [1] - [3]. The additive divisor problem with kl =2
In an additive category the direct sum XEB Y of any
and an arbitrary positive integer k2 was solved using
two objects exists. It is isomorphic to their product
the dispersion method of Yu.V. Linnik [4].
XX Y. The dual category to an additive category is also
an additive category. References
[1] INGHAM, A.E.: 'Some asymptotic formulae in the theory of
A functor F: (;~(;I from an additive category (; into numbers', J. London Math. Soc. (1) 2 (1927),202-208.
an additive category (;1 is said to be additive if, for any [2] EsTERMAN, T.: 'On the representations of a number as the sum
objects X and Y in (;, the mapping of two products', Proc. London Math. Soc. (2) 31 (1930), 123-
F: HOIIl(;r(X, Y)~HoIllQ:, (F(X), F(Y) is a homomor- m.
[3] HooLY, C.: 'An asymptotic formula in the theory of numbers',
phism of Abelian groups. An additive category is said Proc. London Math. Soc. (3) 7 (1957), 396-413.
to be pre-Abelian if a kernel (cf. Kernel of a morphism [4] LINNIK, YU.V.: The dispersion method in binary additive prob
lems, Amer. Math. Soc., 1963 (translated from the Russian).
in a category) and a cokernel exist for any morphism.
B.M. Bredikhin
If for a morphism u: X ~ Y in an additive category
there exists an image Im(u) and a co-image Coim(u), Editorial comments. The function T2(m)=T(m) is also
then there exists a unique morphism denoted by d(m) or ao(m), cf. [A 1], Sect. 16.7.
u: Coim(u)~Im(u) such that the morppism u splits as References
the composition [A1] HARDY, G.H. and WRIGHT, E.M.: An introduction to the
theory of numbers, Clarendon Press, 1979.
X ~ Coim(u) ~ Im(u) ~ Y.
AMS 1980 Subject Classification: 10JXX
An Abelian category is additive by definition. Exam-
ples of non-Abelian additive categories are the category ADDITIVE FUNCTION, finitely-additive function (on
of topological modules over a given topological ring sets, on domains) - A real-valued function JL defined on
with respect to the morphisms which are continuous a system of sets E and such that
linear mappings, and also the category of Abelian I-'< U E;) = ~I-'<E;) (*)
groups r with a filtration r = r0 ::J r I::J . . . ::J r n = {O}
with respect to the morphisms which are group for any finite number of pairwise-disjoint sets
homomorphisms preserving the filtration. {E I, . . . ,En} of E whose union also belongs to E.
Countably-additive set functions are an important kind
References of additive functions (cf. Countably-additive set func-
[1] BUCUR, I. and DELEANU, A.: Introduction to the theory of
categories and functors, Wiley, 1968. tion).
[2] GROTHENDIECK, A.: 'Sur quelques points d'algebrique homolo-
A.P. Terekhin
gique', Tohoku Math. J. 9 (1957), 119-121. Editorial comments. Suppose that E is a a-algebra on a
[3] GRUSON, L.: 'Completion abelienne', Bull. Sci. Math. (2) 90 set X. Then a non-negative function p. (taking, possibly, the
(1966), 17-40. 1. V. Dogac
I hev val ue + 00) on E is an additive (finitely-additive, countably-
additive) measure if it satisfies (*) for an arbitrary (respec-
AMS 1980 Subject Classification: 18A05, 18816, tively, finite, countable) number of disjoint sets E; in E.
18E05 Usually, a measure (sic) is a countably-additive measure.
References
ADDITIVE DmSOR PROBLEM - The problem of [A1] ROYDEN, H.L.: Real analysis, Macmillan, 1968.
finding asymptotic values for sums of the form:
AMS 1980 Subject Classification: 28A 10
~ Tk, (mfrk,(m +a),]
m~n

(1) ADDITIVE GROUP of a ring - The group formed by


~ Tk,(mfrk,(n -m), all elements of the given ring with respect to the addi-
m<n
tionoperation in the ring. The additive group of a ring
where Tk(m) is the number of different factorizations of is always Abelian.
an integer mink factors, counted according to multi- o.A. Ivanova
plicity. Here kl and k2 are integers ~2, a is a fixed AMS 1980 Subject Classification: 16AXX, 13AXX

37
ADDITIVE NOISE

ADDITIVE NOISE - An interference added to the prime number and two squares (cf. Hardy - Littlewood
signal during its transmission over a communication problem). Problems in additive number theory are
channel. More precisely, one says that a given commun- solved by analytic, algebraic, elementary and mixed
ication channel is a channel with additive noise if the methods, and also by methods based on probabilistic
transition function Q(y, .) of the channel is given by a concepts. Depending on the method selected, additive
density q(y, y), y E !lJI, YE CfII =!lJI (!lJI and CfII are the problems form a part of various branches of number
spaces of the values of the signals at the input and out- theory - analytic, algebraic and probabilistic number
put of the channel, respectively) depending only on the theory.
difference y-y, i.e. q(Y,Y)=q(j-y). In this case the The first systematic results in this field were obtained
signal ij at the output of the channel can be represented in 1748 by L. Euler, who employed power series to
as the sum of the input signal ." and a random variable study the partition of integers into positive summands;
r independent of it, called additive noise, so that in particular, he investigated the problem of partition-
ij=.,,+r. ing a given number into a given number of summands.
If one considers channels with discrete or continuous Many classical problems in additive number theory
time over finite or infinite intervals, the notion of a are solved by means of generating functions. This
channel with additive noise is introduced by the rela- method was introduced by Euler and forms the basis of
tion ij(t) =.,,(t) +W), where t is in the given interval, the analytic methods developed by G.H. Hardy, J.E.
.,,(t), ij(t) and W) are random processes representing Littlewood and LM. Vinogradov. As a first step, assign
the signals at the input and the output of the channel to given sequences Ai = {ad (where ai;;;;'O is an integer,
with additive noise, respectively; moreover, the process and i = I, 2, ... ) the power series
W) is independent of .,,(t). In particular, if is a nt) j;(z) = L zai
Gaussian random process, then the considered channel O=S;;;;aj<oo

is called a Gaussian channel. and consider the corresponding generating function


References k 00

[I] R.: Information theory and reliable communication,


GALLAGER, F(z) = IIf;(z) = Lr(n)z",
i=J "=0
McGraw-Hill, 1968.
[2] KHARKEVICH, A.A.: Channels with noise, Moscow, 1%5 (in where r(n)=rk,A(n) is the number of representations of
Russian). the number n in the form
P.L. Dobrushin
v. V. Prelov n = aJ + ... +ako ai E Ai, A = {AIoA2'" .}.
Editorial comments. More generally, especially in sys- Here r (n) can be calculated using the Cauchy integral.
tem and control theory and stochastic analysis, the term In Vinogradov's method trigonometric sums
additive noise is used for describing the following way noise
j;(a) = L i7Tiaai,
enters a stochastic differential equation or observation equa-
tion: dx= f(x, t) dt+ dw, dy= h(x, t) dt+ dv, where wand v
f F(a)e27Tiaai da,
J
are Wiener noise processes. The general situation of a sto- r(n) =
chastic differential equation of the form o
dx = f(x, t) dt + g(x, t) dw is referred to as having multiplica- are used instead of power series. A main part of r (n ),
tive noise. located in intervals distributed over neighbourhoods of
AMS 1980 Subject Classification: 94AXX rational points, is extracted. Instead of the analytic pro-
perties of F(z) which, in several problems of additive
ADDITIVE NUMBER THEORY - The branch of number theory, necessitate the use of hypotheses
number theory dealing with problems involving the related to the Riemann hypothesis (cf. Riemann
partitioning (or decomposition) of integers into sum- hypotheses), a crucial role in the calculation of r(n) is
mands of a given type, and also with the algebraic and played by purely arithmetic estimates of trigonometric
geometric analogues of such problems concerning alge- sums by Vinogradov's method, and by the law of the
braic number fields and sets of lattice points. These distribution of prime numbers in arithmetic progres-
problems are said to be additive, and the ones usually sions, obtained by the analytic methods of the theory
considered are partition problems of large integers. of Dirichlet L-functions. It was found that, depending
Classical problems in additive number theory on k, either r(n)i=O for all n;;;;.l or r(n)i=O for suffi-
include: the representation of a number as a sum of ciently large n (n;;;.no(A, or the relation r(n)i=O is
four squares, nine cubes, etc. (cf. Waring problem); the valid for almost all values of n, i.e.
representation of a given number as a sum of not more
than three prime numbers (cf. Goldbach problem); and L 1)
n~x

the representation of a given number as a sum of one lim


X--,lo'X)
]. ~nlJO
X
= 1,

38
ADDITIVE NUMBER THEORY

or, finally, r(n) can be expressed by an asymptotic for- use of elementary concepts of probability theory, which
mula. The smallest number k satisfying one of the pro- were also employed by P.L. Chebyshev in his proof of
perties above is denoted by g(A), G(A), Go(A), or the law of large numbers. It consists of reducing the
ko(A), respectively. If {a;} = {P}, where {P} is the given binary equation to a large number of auxiliary
sequence of prime numbers, and k = 3, then one equations for which the expected number of solutions
obtains Vinogradov's theorem: Every sufficiently large S;(n) and the true numbers of solutions Q;(n) of the
odd integer can be represented as a sum of three prime equations are connected. If it is shown by a computa-
numbers; if k=2, one obtains Chudakov's theorem: tion of the dispersion that Q;(n) does not differ much
Almost all even integers can be represented as a sum of from S;(n) 'in the mean', then Q(n)= ~S;(n) (within
two prime numbers. an acceptable error). The dispersion method was also
Certain problems in additive number theory are employed in the study of the general
solved by studying the structure of the sets obtained by Hardy- Littlewood equation.
addition of sequences A; = {a;} given only in terms of The range of application of the dispersion method
their densities d(A;)=infnA;(n) / n, where intersects with the range of application of the method
A;(n)=~1
~ai~n
1. If Al = ... =Ak=A, then the posi- of the large sieve, developed in 1941 by Linnik. The
tivity of d(A;) implies that g(A)<oo. The application method makes it possible to sieve out sequences with
of this result to problems in additive number theory in the aid of prime numbers, with an increasing number
which sequences of density zero are added, is realized of discarded residues. Actually, the method of the large
by constructing a new sequence, with a positive density, sieve is a consequence of the laws of distribution of
from the given sequence. This is done, mainly, by sieve weakly-dependent random variables.
methods (cf. Sieve method), by which it can be proved Additive number theory includes problems whose
that d(A;) is positive. It was shown by L.G. systematic study belongs to other branches of number
Shnirel'man in this way that positive integers can be theory: the problem of representing integers by qua-
represented as a sum of a bounded number of prime dratic or higher-degree expressions; and the study of
summands, while Yu.V. Linnik found an elementary Diophantine equations, which may be treated in the
solution of Waring's problem. framework of general additive number theory.
The elementary sieve methods of Y. Brun (cf. Brun In modern number theory various branches of addi-
sieve) and of A. Selberg (cf. Selberg sieve), if applied to tive number theory are now being intensively
a number of problems in additive number theory, yield developed. There is a trend to transfer the problems
results which are as yet unobtainable by analytic tools. and methods of additive number theory to arbitrary
However, the most advanced solutions of certain prob- algebraic number fields.
lems in additive number theory are obtained by com- References
bining analytic and elementary methods. In sieve [1) VINOGRADOV, I.M.: Selected works, Springer, 1985 (translated
methods, the principle of sieving the prime numbers from the Russian).
out of the sequence of positive integers (cf. Era- (2) LINNIK, YU.V.: The dispersion method in binary additive prob
lems, Amer. Math. Soc., 1963 (translated from the Russian).
tosthenes, sieve of) is extended to other sequences as (3) HUA, L.-K.: 'Abschlitzungen von Exponentialsummen und ihre
well. Thus, a simultaneous, suitab~ accurate, sieving of Anwendung in der Zahlentheorie', in Enzyklopaedie der
the prime numbers ,,;;;n 8\ and ";;;n 2, respectively (where Mathematischen Wissenschaften mit Einschluss ihrer Anwen
dungen, Vol. 1, 1959. Helft 13, Teil 1.
01 < 1 and O2 < 1 are suitably chosen positive constants) (4) OSTMANN, H.H.: Additive Zahlentheorie, Springer, 1956.
out of the sequences {m} and {2n - m} yields a solu- (5) CHUDAKOV, N.G.: Uspekhi Mat. Nauk 4 (1938), 14-33.
tion of the so-called Goldbach - Euler quasi-problem on (6) BREDIKHIN, B.M.: 'The dispersion method and definite binary
additive problems', Russian Math. Surveys 20, no. 2 (1965),
the representation of an even integer as a sum of two 85-125. (Uspekhi Mat. Nauk 20, no. 2 (1965), 89-130)
numbers one of which has at most k I, while the other (7) DAVENPORT, H.: Multiplicative number theory, Springer, 1980.
has at most k2 prime factors. (8) HALBERSTAM, H. and RICHERT, H.E.: Sieve methods, Acad.
Press, 1974.
Linnik in 1959 successfully solved the B.M. Bredikhin
Hardy- Littlewood problem by the dispersion method
Editorial comments. Elementary formulas for the number
developed by him. He showed [2] that any sufficiently of partitions of an integer can be found in [A1], Chapt. XIX.
large integer can be represented as the sum of a prime To derive such formulas the method of generating functions
number and two squares of integers. The dispersion (cf. Generating function) can be used.
method yielded solutions of several so-called binary The standard reference on the early historical aspects is,
problems, viz. how to find the number Q(n) of solutions of course, [A2].
of the equation a+/3=n, where 0: and /3 run through References
given sequences which are sufficiently well distributed [A1] HARDy, G.H. and WRIGHT, E.M.: An introduction to the
in arithmetic progressions. Linnik's method involves the theory of numbers, Clarendon Press, 1979.

39
ADDITIVE NUMBER THEORY

[A2] DICKSON, L.E.: History of the theory of numbers, 1-3, Chel- cially powerful results are obtained by the method of
sea, reprint, 1971. the large sieve and by the dispersion method, due to
AMS 1980 Subject Classification: 10JXX Yu.V. Linnik. Additive problems of the type 6) above
are also binary. They are studied by special arithmetic-
ADDITIVE PROBLEMS - Problems in number geometric methods of the theory of quadratic forms.
theory concerning the decomposition (or partition) of References
integers into summands of a given kind. The solution [1] VINOGRADOV, I.M.: The method of trigonometric sums in the
of classicru additive problems resulted in the develop- theory of numbers, Interscience, 1954 (translated from the Rus-
ment of new methods in number theory. The classical sian).
[2] LINNIK, YU.V.: The dispersion method in binary additive prob-
additive problems include: lems, Amer. Math. Soc., 1963 (translated from the Russian).
1) The Goldbach problem on representing an odd [3] LINNIK, YU.V.: Ergodic properties of algebraic fields, Springer,
integer, larger than 5, as the sum of three prime 1968 (translated from the Russian).
[4] HUA, L.-K.: 'Abschatzungen von Exponentialsummen und ihre
numbers, and the Euler - Goldbach problem on the Anwendung in der Zahlentheorie', in Enzyklopaedie der
representation of an even integer, larger than 2, as the Mathematischen Wissenschaften mit Einschluss ihrer Anwen-
sum of two prime numbers. These problems were first dungen, Vol. I, 1959. Heft 13, Teil I.
B.M. Bredikhin
posed in 1742.
2) The Waring problem on representing any positive AMS 1980 Subject Classification: 10JXX
integer as a sum of s=s(k) non-negative k-th powers,
where k~2 is given. First posed in 1770. ADDITIVE RELATION - A sub module r of the
Other additive problems are, for example: direct sum A EB B of two modules A and B over some
3) The problem on the representation of posItIve ring R. An additive relation can thus be regarded also
integers as the sum of a bounded number of prime as a (not necessary single-valued) mapping r: A ~B or,
numbers (the weak Goldbach problem). more exactly, as a 'many-valued' homomorphism, i.e. a
4) The Hardy - Littlewood problem on representing homomorphism rO of the submodule Def r into the
any integer, larger than 1, as the sum of a prime quotient module B /Ind(r) where
number and two squares (posed in the nineteen- Defr = {aEA: 3bEB (a, b)Er},
twenties).
5) Problems concerning the representation of all suf-
Ker r = {a EA: (a, O)Er},
ficiently large numbers as sums of two numbers with a Ind r = Ker r -I .
bounded number of prime factors. Here, r -\ : B ~A is the relation inverse to r; it consists
6) Problems concerning the representation of integers of all pairs (b, a)EBEBA such that (a, b)Er. Con-
by quadratic forms in three and four variables, and versely, if a sub module S CA, a quotient module B / L
similar problems. of the module B and a homomorphism {3: S~B / L are
Additive problems may be solved by analytic, alge- given, then there also exists a unique additive relation
braic, elementary and mixed methods (cf. Additive r : A ~B such that rO = {3.
number theory). A large number of additive problems H two additive relations r:A~B and s:B~C are
belong to one of the following two classes: given, then, as in the case of other binary relations, it is
a) Ternary additive problems of the type possible to define their product, sr: A ~C, which is the
n=a+{3+y, where a and {3 belong to sufficiently set of all pairs (a, c) EA EB C such that there exists an
dense sequences of integers which are well distributed element bEB for which (a, b)Er and (b, C)ES. This
in arithmetic progressions, while y belongs to a multiplication is associative (wherever defined) and,
sequence which may be thin, but for which the moreover, the additive relations form a category with
corresponding relevant trigonometric sums behave involution r~r -] .
properly. Additive relations are used in natural definitions of
b) Binary additive problems of the type n = a +{3, connecting homomorphisms for exact sequences of
with a and {3 subject to the same conditions as those complexes. The above considerations are valid not only
stated in a). in the category of modules, but also in any other
A universal tool for solving ternary additive prob- Abelian category.
lems for sufficiently large n is the general analytic
method of Hardy - Littlewood - Vinogradov: the References
[1] MAC LANE, S.: Homology, Springer, 1963.
method of trigonometric sums (cf. Vinogradov method). [2] PuPPE, D.: 'Korrespondenzen in Abelschen Kategorien', Math.
Binary additive problems usually cannot be solved by Ann 148 (1962),1-30.
A. V. Mikhalev
such methods, and are solved by different variants of
the elementary sieve method (cf. Sieve method). Espe- AMS 1980 Subject Classification: 18810

40
ADDITIVE UNIFORM STRUCTURE

ADDITIVE THEORY OF IDEALS - A branch of existence theorem, too, remains valid. Such a generali-
modern algebra. Its principal task is to represent any zation has in fact been found [4], and is known as ter-
ideal of a ring (or of another algebraic system) as the tiarity (cf. Tertiary ideal). It was subsequently shown
intersection of a finite number of ideals of special type that, within certain natural limitations, tertiarity is the
(primary, tertiary, primal, uniserial, etc.). The type of only 'good' generalization of the concept of primarity
the representation is so chosen that: I) for any ideal [6], [7], [8].
there exists a representation, in other words, some During the nineteen-sixties the additive theory of
'existence' theorem holds; 2) the representations chosen ideals developed further within the framework of lattice
must be unique apart from certain limitations or, in theory, of systems with fractions and of multiplicative
other words, some 'uniqueness' theorem must hold. The systems [4], [5], [6]; this stimulated the development of
fundamental principles of the additive theory of ideals the additive theory of ideals for non-associative rings,
were introduced in the nineteen-twenties and the normal divisors of a group and submodules of a
nineteen-thirties by E. Noether [I] and W. Krull [2]. module.
All special features of the additive theory of ideals References
are clearly manifested in the case of rings. Let R be a [I) NOETHER, E.: 'Idealtheorie in Ringbereichen', Math. Ann. 83
Noetherian ring, i.e. an associative ring with the max (1921),24-66.
imum condition for ideals. If A is an ideal of R, then [2) KRULL, W.: 'Idealtheorie in Ringen ohne Endli-
chkeitsbedingung', Math. Ann. 101 (1929), 729-744.
there exists a largest ideal N of R for which N k CA for [3) ZARISKl, O. and SAMUEL, P.: Commutative algebra, I, Springer,
some integer k;;;,.l. This ideal N is known as the pri- 1975.
mary radical of A (in R) and is denoted by pr(A). An [4) LESIEUR, L. and CROISOT, R.: Algebre noetherienne noncommu
tative, Gauthier-Villars, 1963.
ideal Q of R is said to be primary if for any two ideals [5) MURATA, K.: 'Additive ideal theory in multiplicative systems',
A and B of R, the condition J. Inst. Poiytechn. Osaka City Univ. (AJ 10, no. 2 (1959), 91-
115.
AB CQ, A Q ~ B cpr(Q) [6) ANDRUNAKIEVICH, V.A. and RYABUKHIN, YU.M.: 'The addi-
is satisfied. The intersection theorem is valid for primary tive theory of ideals in systems with residuals', Math. USSR
Izv. 1, no. 5 (1967), 1011-1040. (Izv. Akad. Nauk SSSR Ser.
ideals: The intersection of two primary ideals having Mat. 31 (1967), 1057-1090)
the same primary radical P is itself a primary ideal [7) RILEY, l.A.: 'Axiomatic primary and tertiary decomposition
with radical P. This theorem is used to prove an theory', Trans. A mer. Math. Soc. 105 (1962), 177-201.
[8) GoYAN, I.M. and RYABUKHIN, Yu.M.: 'On the axiomatic addi-
existence theorem: If the ring R is commutative, then tive theory of Riley ideals', Mat. Issl. 2, no. I (1967), 14-25 (in
for any ideal A =l=R there exists a representation of A as Russian).
the intersection of a finite number of primary ideals Ai: [9) FUCHS, L.: 'On primal ideals', Proc. A mer. Math. Soc. 1 (1950),
1-6.
(1) [lO) Itogi Nauk. Algebra Topol. Geom. 1965 (1967), 133-180.
such that none of the ideals Ai contains the intersection V.A. Andrunakievich
of the other ones, and such that the primary radicals Editorial comments. A primary representation is also
pr(Ai) are pairwise different. Such representations are called a primary decomposition. More generally one has pri-
known as non-contractible or primarily reduced [1], [4]. mary decompositions for submodules N of modules Mover
The uniqueness theorem holds for such representations: a Noetherian ring R. This is a representation
If (I) holds and
A = B, n ... nBm (2)
where each Ass(M / OJ) consists of a single prime ideal P"
is a second primarily-reduced representation of the (By definition Ass(M) for a module M, the set of prime ideals
ideal A of the ring R, then m =n and pr(Ai)=pr(Bi) for associated to M, is the collection of all prime ideals P for
l,,;;;;i ";;;;n, provided the ideals Bi are suitably renum- which there exists an xEM such that p={rER: rx=O}.)
bered. There is also a corresponding uniqueness theorem, stating
The additive theory of ideals of Noetherian commu- that there is a reduced decomposition, which of course
tative rings (the classical additive theory of ideals) has means that n.J7'='
OJ' C OJ holds for no i E I and that the
found numerous applications in various branches of Ass(M / OJ)=P, are all different.
mathematics. References
If the ring R is non-commutative, the above- [A1) BOURBAKI, N.: Aigebre commutative, Hermann, 1961,
mentioned existence theorem is no longer valid, but the Chapt. 3; 4.

uniqueness and intersection theorems still hold. This is AMS 1980 Subject Classification: 13CXX
why, ever since the nineteen-thirties, repeated attempts
have been made to find a generalization of classical ADDITIVE UNIFORM STRUCfURE of a topological
primarity to the non-commutative case such that the skew-field K - The uniform structure of its additive

41
ADDITIVE UNIFORM STRUCTURE

group. A base of neighbourhoods for the uniform struc- called the idele group of k (the idele group is the group
ture of the commuta!ive topological group of K is of units in the adele ring Ak)' 3) If Gk = GL(n, k) is the
formed by the sets V of all pairs (x, y) such that general linear group over k, then GA consists of the ele-
x - y E V, where V is an arbitrary neighbourhood of ments g=(gv)E IIVE vGv for which gv EGL(n, Ov) for
zero. A covering of a topological skew-field K is uni- almost all valuations v.
form for the additive uniform structure if a covering of The concept of an adele group was first introduced
the type {Vy:yEK} refining it can be found, where V by C. Chevalley (in the nineteen-thirties) for algebraic
is an arbitrary neighbourhood of zero and number fields, to meet certain needs of class field
Vy = {x +y: x E V}. In particular, a base of the addi- theory. It was generalized twenty years later to alge-
tive uniform structure of the real line is formed by all braic groups by M. Kneser and T. Tamagawa [1], [2].
coverings by intervals of given fixed length. The real They noted that the principal results on the arithmetic
line is the completion of the field of rational numbers of quadratic forms over number fields can be con-
with respect to its additive uniform structure. veniently reformulated in terms of adele groups.
The uniform structure of the group Rn , which is The image of the diagonal imbedding of Gk in GA is
known as its additive uniform structure, is the product a discrete subgroup in GA , called the subgroup of princi-
of the uniform structures of its factors R. A base is pal adeles. If 00 is the set of all Archimedean valuations
constituted by all coverings by (open) spheres of a
given fixed radius.
of k, then C = IIc k,
X IIc 0,
v. V. Fedorchuk
A(ao)
VEOO VEOO

Editorial comments. A uniform structure is also called a is known as the subgroup of integer adeles. If Gk =k',
uniform topology. A base for a uniform structure is also then the number of different double cosets of the type
called a uniformity. See also Uniform space. GkxGA(oo) of the adele group GA is finite and equal to
References the number of ideal classes of k. The naturally arising
[A1] ISBELL, l.R.: Uniform spaces, Amer. Math. Soc., 1964. problem as to whether the number of such double
AMS 1980 Subject Classification: 54E15, 12JXX classes for an arbitrary algebraic group is finite is con-
nected with the reduction theory for subgroups of prin-
ADDITIVITY - The property of magnitudes that can cipal adeles, i.e. with the construction of fundamental
be defined as follows. The value of a magnitude domains for the quotient space GA / Gk . It has been
corresponding to a whole object is equal to the sum of shown [5] that GA / Gk is compact if and only if the
the values of the magnitudes corresponding to its parts group G is k-anisotropic (cf. Anisotropic group).
for any division of the object into parts. For instance, Another problem that has been solved are the cir-
additivity of volume means that the volume of a whole cumstances under which the quotient space GA / Gk
object is equal to the sum of the volumes of its consti- over an algebraic number field has finite volume in the
tuent parts. Cf. Additive function; Countably-additive Haar measure. Since GA is locally compact, such a
measure always exists, and the volume of GA / Gk in
set function.
the Haar measure is finite if and only if the group G
AMS 1980 Subject Classification: 00A05, 00A25 has no rational k-characters (cf. Character of a group).
The number T( G) - the volume of GA / Gk - is an
ADELE - An element of the adele group, i.e. of the important arithmetical invariant of the algebraic group
restricted topological direct product G (cf. Tamagawa number). It was shown on the
IIck.(cO,) strength of these results [5] that the decomposition
VE V m

of the group Gk with distinguished invariant open sub- CA = U CkXjCA(ao)


j=1
groups Go.. Here Gk is a linear algebraic group, defined is valid for an arbitrary algebraic group G. If k is a
over a global field k, V is the set of valuations (cf. function field, it was also proved that the number of
Valuation) of k, kv is the completion of k with respect double classes of this kind for the adele group of the
to v E V, and Ov is the ring of integer elements in kv. algebraic group is finite, and an analogue of the reduc-
The adele group of an algebraic group G is denoted by tion theory was developed [6]. For various arithmetical
GA' Since all groups Gk. are locally compact and since applications of adele groups see [4], [7].
Go. is compact, GA is a locally compact group. References
Examples. 1) If Gk is the additive group k + of the [I] WElL, A.: Ade/es and algebraiC groups, Princeton Univ. Press,
field k, then GA has a natural ring structure, and is 1961.
[2AJ TAMAGAWA, T.: 'AdeJes', in AlgebraiC groups and discontinuous
called the adele ring of k; it is denoted by A k . 2) If Gk subgroups, Proc. Symp. Pure Math., Vol. 9, Amer. Math. Soc.,
is the multiplicative group k' of the field k, then GA is 1966, pp. 113-121.

42
ADIABATIC INVARIANT

[2B] KNESER, M.: 'Strong approximation', in Algebraic groups and


discontinUOWi subgroups, Proc. Symp. Pure Math., Vol. 9, Amer.
':i; = v, ~ - -W2(ft)X, (*)
Math. Soc., 1966, pp. 187-198.
[3] CASSELS, l.W.S. and FROm.ICH, A. (EDS.): Algebraic number
where ( is a small parameter and (,.)(s) is a positive, suf-
theory, Acad. Press, 1967. ficiently smooth function, the adiabatic invariant is
[4] PLATONOV, V.P.: 'Algebraic groups', J. Soviet Math. 4, no. 5
v
2
(1975),463-482. (Itogi Nauk. Algebra Topol. Geom 11 (1973), I = wx 2 +-.
W
5-37)
[5] BoREL, A.: 'Some finiteness properties of adele groups over If (")=const, the system (*) describes an ordinary har-
number fields', Publ. Math. IHES, no. 16 (1963), 5-30. monic oscillator with frequency (,.). Thus, in this case, if
[6] HARDER, G.: 'Minkowskische Reduktionstheorie tiber
Funktionenkorpern', Invent. Math. 7 (1969), 33-54. the variation of the parameters of the system is slow in
[7] PLATONOV, V.P.: 'The arithmetic theory of linear algebraic comparison to the oscillation period, then its energy
groups and number theory', Trudy Mat. Inst. Steklov. 132 (v 2 + ,i x 2 ) / 2 varies proportion'ally to the frequency.
(1973), 162-168 (in Russian).
[8] WElL, A.: Basic number theory, Springer, 1974. In this example, as well as in most other cases, it is
v.P. Platonov assumed that if the parameters did not change at all,
the Hamiltonian system under consideration would be
Editorial comments. Let I be an index set. For each VEl
completely integrable and its motion would be quasi-
let Gy be a locally compact group and Oy on open compact
subgroup. The restricted (topological) direct product of the
periodic (in this case simply periodic); other generaliza-
Gy with respect to the Oy, above denoted by tions are also known. In the publications by many
mathematicians and experts in celestial mechanics on
G = II GvC Oy) , Hamiltonian systems which are close to being com-
YEI

consists (as a set) of all (Xy)EII IGy such that Xy in Oy for pletely integrable, the term 'adiabatic invariant' is not
YE
used at all, even though the information presented in
all but finitely many v. The topology on G is defined by tak-
ing as a basis at the identity the open subgroups II
IUy
these publications seems to indicate that certain magni-
YE
tudes are in certain cases in fact adiabatic invariants.
with Uy an open neighbourhood of Gy for all V and Uy= Oy
for all but finitely many v. This makes G a locally compact
'Approximate invariance' of an adiabatic invariant
topological group. l(t) means that for all t under consideration the differ-
ence I (t) - 1 (0) remains small. (The derivative
AMS 1980 Subject Classification: 12A85, 20GXX,
dI(t) / dt can be a magnitude of the same order as the
22E55
derivatives of the other parameters of the system, but
ADIABATIC FLOW - A flow for which there is no the variations of the adiabatic invariant are not cum-
heat exchange between the individual parts of the mulative with time.) Such approximate invariance may
liquid and the bodies which surround it. If the flow of take place over a very large but finite period of time
a compressed liquid is adiabatic and continuous, then (temporal adiabatic invariants) or over the entire infinite
the entropy of a liquid element is constant. If the t-axis (stationary or permanent adiabatic invariants [I D.
motion is steady, i.e. if the characteristics of the field of The notion of 'slow variation of the parameters of the
the flow are independent of time, then the sum of the system' may be rendered more precise in two ways: a)
kinetic energy and the enthalpy for any liquid element the Hamiltonian H is an explicit function of the time t
is constant. (non-autonomous system), but the derivative dH / dt is
small; b) the system under consideration, with canoni-
References
[l] LANDAU, L.D. and LIFSHITZ, E.M.: Mechanics, Pergamon,
cal variables p, q, is a subsystem of a large system with
1960 (translated from the Russian). A P 17 k'~ variables p, q, p' , q', which is itself autonomous and is
. . ravors /l
such that either p', q' vary slowly, or else their
AMS 1980 Subject Classification: 76N10 variation does not materially affect the subsystem.
In view of the above restrictions, the existence of an
ADIABATIC INVARIANT - A term of physical origin adiabatic invariant can be affirmed only under various
whose meaning in mathematics is not unambiguous. additional assumptions, a precise general formulation
The adiabatic invariant is usually defined as a quantita- of which is difficult [I]. Temporal adiabatic invariants
tive characteristic of the motion of a Hamiltonian sys- for systems of the type just described in fact belong to
tem which remains practically unchanged during an the field of asymptotic methods of perturbation theory
adiabatic (i.e. very slow in comparison to the time scale (if the problem is posed in a more general manner, a
typical of the motion in the system) change of its few rigorous results can also be obtained [4]). To prove
parameters. (The change may last for as long as the the temporal asymptotic invariance of some magnitude
values of these parameters - unlike the adiabatic 1 (t), the usual procedure is to construct another magni-
invariant - undergo significant changes.) Thus, for the tude J(t) such that the values of l(t) oscillate around
simplest system J (t), while dJ (t) / dt has a lower order of magnitude

43
ADIABATIC INVARIANT

than the derivatives of the other parameters of the sys- zero consists of the powers ~n of some two-sided ideal
tem. Thus, in the example (*) for J = I + f.W' xv / w2 (the ~. The topology is then said to be ~-adic, and the ideal
prime denotes the derivative of w with respect to the ~ is said to be the defining ideal of the topology. The
argument s =f.t) direct differentiation yields closure of any set F CA in the ~-adic topology is equal
dJ (t) / dt = O(~); this is a proof that I is a temporal to nn;;'O(F+~n); in particular, the topology is separ-
adiabatic invariant. The existence of permanent adia- able if, and only if, n n;;'
0 ~n = (0). The separable com-
batic invariants is open to doubt in case a) above if H
A

pletion A of the ring A in an ~-adic topology is iso-


depends on t in some general manner (in particular, in
morphic to the projective limit lim(A / ~n).
the absence of periodicity or limits as t ~ + 00). In case <-
b) above, the problem of permanent adiabatic invari- The ~-adic topology of an A-module M is defined in
ants is connected with small denominators [2]. a similar manner: its fundamental system of neighbour-
Historically, adiabatic invariants played an important hoods of zero is given by the submodules ~n M; in the
role in the quantum theory of Bohr - Sommerfeld, in ~-adic topology M becomes a topological A-module.
which quantizable magnitudes must be adiabatic invari- Let A be a commutat~ve ring with identity with an
ants. This rule lost its importance with the subsequent ~-adic topology and let A be its completion; if ~ is an
advances in quantum mechanics. Adiabatic invariants i,?eal of finite type, the topology in A is f-adic, and
A

are used in modern physics in the study of motion of ~n = ~n A. If ~ is a maxjmal ideal, then A is a local
charged particles in electromagnetic fields [3]. Here, the ring with maximal ideal ~. A local ring topology is an
important magnitude is the ratio vi / H, where H is adic topology defined by its maximal ideal (an m-adic
the magnetic field strength and v ~ is the velocity com- topology).
ponent of the particle lying in a plane perpendicular to A fundamental tool in the study of adic topologies of
the vector H; this ratio is an adiabatic invariant if the rings is the Artin - Rees lemma: Let A be a commuta-
magnetic field strength does not significantly vary tive Noetherian ring, let ~ be an ideal in A, let E be an
along the Larmor radius. A-module of finite type, and let F be a submodule of
It should also be noted that, during an adiabatic E. Then there exists a k such that, for any n ~O, the
change of state in quantum mechanics, certain magni- following equality is valid:
tudes retain their values (e.g. quantum numbers, except mncmkE nF) = mk+nE nF.
for processes which bring about a degenerate state of
the system). It is therefore possible to speak of adia- The topological interpretation of the Artin - Rees
batic invariants in quantum mechanics as well, but this lemma shows that the ~-adic topology of F is induced
concept does not play a significant part in quantum by the ~-adic topology of E. It follows that the com-
mechanics and is not ordinarily used in that science. pletion A of a ring A in the ~-adic topology is a flat A

A-module (d. Flat module), that the completion E <;?f


References
[1] MANDEL'SHTAM, L.I., ET AL.: Zh. Russk. Fiz.-Khim. Obshch. 60, the A-module E of finite type is identical with EAA,
no. 5 (1928), 413-419. and that Krull's theorem holds: The ~-adic topology of
[2] ARNoL'n, V.I.: 'Small denominators and problems of stability a Noetherian ring is separable if and only if the set
of motion in classical and celestial mechanics', Russian Math.
Survrys 18, no. 6 (1963), 91-191. (Uspekhi Mat. Nauk 18, no. 6
I + ~ contains no zero divisors. In particular, the
(1963),91-192) topology is separable if ~ is contained in the (Jacob-
[3] NORTHROP, T.: The adiabatic motion of charged particles, Inter- son) radical of the ring.
science, 1963.
[4] KASUGA, T.: 'On the adiabatic theorem for the Hamiltonian References
system of differential equations in the classical mechanics I, II, [I] ZARISKl, O. and SAMUEL, P.: Commutative algebra, 2, Springer,
III', Proc. Japan Acad. 37, no. 7 (1961), 366-382'D V A 1975.
. . nosov [2] BOURBAKI, N.: Elements of mathematics. Commutative algebra,
A.P. Favorskir Addison-Wesley, 1972 (translated from the French).

AMS 1980 Subject Classification: 70HXX, 81 CXX VI. Danilov


AMS 1980 Subject Classification: 54H13, 13JXX
Amc COHOMOLOGY, 1- - See l-adic cohomology.
ADJOINT CONNECTIONS - Linear connections r
AMS 1980 Subject Classification: 14F30
and r such that for the corre_sponding operators of
Amc NUMBER, p- - See p-adic number. covariant differentiation v and v there holds
ZB(X, Y) = B(\lzX, Y)+B(X, \lzY)+2w(Z)B(X, Y),
-
AMS 1980 Subject Classification: 12BXX
where X, Yand Z are arbitrary vector fields, B(, .) is a
Amc TOPOLOGY - A linear topology of a ring A in quadratic form (i.e. a symmetric bilinear form), and
which the fundamental system of neighbourhoods of wO is a I-form (or covector field). One also says that

44
ADJOINT DIFFERENTIAL EQUA nON

V and V are adjoint with respect to B. In coordinate


form (where X, Y, Z~ai' B~bij' W~Wi, v~rt),
akbij - fLbsj - f~As = 2Wkbij'
For the curvature _operators R and R and torsion
- If y(t) and W) are arbitrary solutions of 1(y)=0 and
n~)=O, respectively, then
operators T and T of the connections v and v, n k-l
respectively, the following relations hold: ~ ~(-ly(an_kI)U)y<k-j-l) const, tEl.
k = Ij =0
B(R(U,Z)X, Y)+B(X, R(U, Z)Y) = A knowledge of m (..;,n) linearly independent solutions
= 2{ w([U, Z])- Uw(Z)+Zw(U)}B(X, y), of the equation t(~)=O enables one to reduce the order
B(Z, AT(X, Y) - B(AT(Z, Y)X) = of the equation 1(y)=0 by m (see [1] - [3]).
For a system of differential equations
= B(AT(Z, X), y), AT = T- T.
L(x) = 0, L(x) - x+A(t)x, tEl,
In coordinate form,
where A(t) is a continuous complex-valued (n Xn)-
R':;Am + R~ibjm = -2(a,w, - asw,)bij' matrix, the adjoint system is given by
ATijbsk-ATk)bsi-An,bsj = 0. L'(I/J) - -~+A'(tN = 0, tEl
References (see [l], [4]), where A * (t) is the Hermitian adjoint of
[I] NORDEN, A.P.: Spaces with an affine connection, Nauka,
A(t). The Lagrange identity and the Green formula
Moscow-Leningrad, 1976 (in Russian). Ml 17 - kh
. . ~ o/tse ovsk'-
II take the form
Editorial comments. Instead of the name adjoint con- - -,- d-
nections one also encounters conjugate connections.
(I/J,L(x-(L (I/J), x) = dt(I/J,x),
Sometimes the 1-form w is not mentioned in the notion of
adjoint connections. Strictly speaking this notion of an
'adjoint connection' should be called 'adjoint with respect to
[[(~'L(X-(L'(I/J)'X)ldt = (~,x{~~;
Band w'.
where (', .) is the standard scalar product (the sum of
AMS 1980 Subject Classification: 53805
the products of coordinates with equal indices). If x(t)
ADJOINT DIFFERENTIAL EQUATION to an ordi- and 1/I(t) are arbitrary solutions of the equations
nary linear differential equation 1(y)=0 - The ordinary L(x)=O and L *(1/;)=0, then
linear differential equation n~) = 0, where
(~t), x(t _ const, tEl.
/(y) ao(t)y<n)+ ... + an (t)y, (1)
The concept of an adjoint differential equation is
d>
y< = ~, YEC(I), akEC-k(I), closely connected with the general concept of an adjoint
operator. Thus, if I is a linear differential operator act-
ao(t) =j=. 0, tEl; ing on c n (l) into C(1) in accordance with (1), then its
Crn(I) is the space of m-times continuously- adjoint differential operator t maps the space C* (I)
differentiable complex-valued functions on 1 =(a, {3), adjoint to C(1) into the space C n* (1) adjoint to en(l).
and The restriction of t to en(l) is given by formula (2)
t(~)-(-I)n(ao~)<n)+(-I)n-l(al~)<n-l)+ ... +an~' (2)
(see [5]).
Adjoints are also defined for linear partial differen-
~ E C(l) tial equations (see [6], [5]).
(the bar denotes complex conjugation). It follows at Let ~=[to, tdeI, and let Uk be linearly independent
once that linear functionals on cn(~). Then the boundary value
(/1 +/ 2)' = I; +1;, CAl)' = Xl', problem adjoint to the linear boundary value problem
l(y) = 0, tEA, Uk(y) = 0, k= I, ... ,m, m<2n, (3)
for any scalar A. The adjoint of the equation n~)=O is
1(y)=0. For all n-times continuously-differentiable is defined by the equations
functions yet) and t), Lagrange's identity holds: t'm = 0, Uj(~) = 0, j= I, ... ,2n -m. (4)
-
$(y)-t'(~)y = -
d {~n k-l }
~(-IY(an_kI)u>y<k-j-l) . Here the Uj are linear functionals on cn(~) describing
dt k=lj=O the adjoint boundary conditions, that is, they are defined
It implies Green's formula in such a way that the equation (see Green fonnulas)


1\
T

j[II(y)- I' (~)y I dt = j[II(y)-t(~Yldt =


S 10

45
ADJOINT DIFFERENTIAL EQUATION

holds for any pair of functions y, gEcn(~) that satisfy For the linear boundary value problem
the conditions Uk(y)=O, k=l, ... ,m; Vj(~)=O, L(x) _ x+A(t)x = 0, U(x) = 0, tEt!., (7)
j=l, ... ,2n-m.
If where U is an m-dimensional vector functional on the
Uk(y)
n
= ~[O:kpy(p-I)(to)+,8kpy(p-I)(tl)l space Cn(~) of continuously-differentiable complex-
p=1 valued n-dimensional vector functions with m <2n, the
are linear forms in the variables adjoint boundary value problem is defined by
y(P-l)(to), y(P-I)(t l ), p=I, ... ,n, L*(~) = 0, U*(~) = 0, tEt!. (8)
then Vj mare linear forms in the variables (see [ 1D. Here U* is a (2n - m )-dimensional vector
&-I)(tO), &-I)(tl), p= 1, ... ,no functional defined such that the equation

Examples. For the problem (~t), X(tII:I\


I-to
=0
y+a(t)y = 0, O.,;;;t.,;;;l,
holds for any pair of functions x, "'ECk(~) satisfying
y(O)+ay(1)+,8y(1) = 0, the conditions
y(0)+yy(I)+8y(l) = 0, U(x) = 0, u*(~) = o.
with real a(t), n, {3, y, 8, the adjoint boundary value The problems (7), (8) possess properties analogous to
problem has the form those listed above (see [I D.
~+a(t~ = 0, O";;;t.,;;;l, The concept of an adjoint boundary value problem is
o:O)+yO)+I) = 0, closely connected with that of an adjoint operator [5].
Adjoint boundary value problems are also defined for
,80)+80)+1) = o. linear boundary value problems for partial differential
If problem (3) has k linearly independent solutions equations (see [6], [7]).
(in this case the rank r of the boundary value problem References
is equal to n - k), then problem (4) has m - n + k [1] KAMKE, E.: Differentialgleichungen: LOsungsmethoden und
linearly independent solutions (its rank is LOsungen, 1, Chelsea, reprint, 1971.
r' = 2n - m - k). When m = n, problems (3) and (4) [2] NAiMARK, M.A.: Linear differential operators, Moscow, 1969
(in Russian).
have an equal number of linearly independent solu- [3] CODDINGTON, E.A. and LEVINSON, N.: Theory of ordinary dif-
tions. Therefore, when m = n, problem (3) has only a ferential equatiOns, McGraw-Hill, 1955.
trivial solution if and only if the adjoint boundary [4] HARTMAN, P.: Ordinary differential equations, Birkhauser, 1982.
[5] DUNFORD, N. and ScHWARTZ, J.T.: Linear operators. Spectral
value problem (4) has the same property. The Fredholm theory, 2, Interscience, 1963.
alternative holds: The semi-homogeneous boundary [6] MIKHAiLOv, V.P.: Partial differential equations, Moscow, 1976
value problem (in Russian).
[7] WLADIMIROW, W.S. [V.S. VLADIMIROV]: Gleichungen der
l(y) = f(t), Uk(y) = 0, k=I, ... ,n, mathematischen PhYSik, Deutsch. Verlag Wissenschaft., 1973
(translated from the Russian).
has a solution if J(t) is orthogonal to all non-trivial E.L. Tonkov
solutions t) of the adjoint boundary value problem AMS 1980 Subject Classification: 34A30, 34805
(4), i.e. if 1\

j"&t)f(t) dt = 0 ADJOINT FUNCfOR - A concept expressing the


10 universality and naturalness of many important
(see [1] - [3], [7]). mathematical constructions, such as a free universal
For the eigen value problem algebra, various completions, and direct and inverse
limits.
l(y) = '>"y, Uk(y) = 0, k=l, ... ,n, (5)
Let F: S1'~G' be a covariant functor in one argument
the adjoint eigen value problem is defined as from a category S1' into a category CL F induces a func-
I'm =~, Vjm = 0, J=l, ... ,n. (6) tor
If A is an eigen value of (5), then Jl=X is an eigen value
of (6). The eigen functions yet), t)
corresponding to
where ~* is the category dual to st', is the category of
sets, and H dX, Y): st'* X G'~ is the basic set-valued
eigen values A, Jl of (5), (6), respectively, are orthogonal
functor. The functor HF is contravariant in the first
if A=FJl (see [l] - [3]): argument and covariant in the second. Similarly, any
1\
covariant functor G: G'~~ induces a functor
jy(t)t)dt = 0.
to H c;CX, Y) = H\l(X, G( Y: sf X IT __ ;S,

46
ADJOINT GROUP

which is also contravariant in the first argument and arbitrary variety of universal algebras into the category
covariant in the second. The functors F and G are of sets. The functor P has a left adjoint F: ~~,
adjoint, or form an adjoint pair, if HF and HG are iso- which assigns to every set X the free algebra of the
morphic, that is, if there is a natural transformation variety ~ with X as set of free generators.
(): HF ~HG that establishes a one-to-one correspon- 5) The imbedding functor Ido:, Sf: (~Sf of an arbi-
dence between the sets of morphisms H \i;(F(X), Y) and trary reflexive subcategory ( of a category Sf is the
H Sf (X, G(Y)) for all objects X EOb Sf and Y EOb (. right adjoint of the (-reflector. In particular, the
The transformation () is called the adjunction of F with imbedding functor of the category of Abelian groups in
G, F is called the left adjoint of G and G the right the category of groups has a left adjoint, which assigns
adjoint of F (this is written (): FG, or simply FG). The to every group G its quotient group by the commutator
transformation ()-I: HG~HF IS called the co- subgroup.
adjunction. Properties of adjoint functors. The left adjoint functor
Let ():FG. For all XEObSf and YEOb(, let of a given functor is uniquely determined up to isomor-
phism of functors. Left adjoints commute with co-
x = O(IF(X), 1jy = 0- 1(lG(y'
limits (e.g. co-products) and send null objects and null
The families of morphisms {(x} and {1jy} define morphism into null objects and null morphisms, respec-
natural transformations (: IdSf~GF and 1j: FG~Ido:, tively.
called the unit and co-unit of the adjunction (). They Let Sf and ( be categories that are complete on the
satisfy the following equations: left and locally small on the left. A functor G: @~Sf
G(1jY}G(Y) = IG(y), 1jF(x)F(x) = IF(X)' has a left adjoint F: Sf~( if and only if the following
In general, a pair of natural transformations conditions hold: a) G commutes with limits; b) for
cp: IdSf~GF and 1/;: FG~Id\i; leads to an adjoint pair (or every X EOb Sf, at least one of the sets H(X, G(y)),
adjunction) if the following equations hold: Y EOb Sf, is non-empty; and c) for every X EOb Sf,
there is a set S cOb ( such that every morphism
G(I/IY)</>G(Y) = IG(Y)' I/IF(x)F(c[>x) = IF(X) a: X ~G(Y) is representable in the form a = G(a')cp,
for all objects X and Y. A natural transformation where cp: X ~G(B), BE S, a' : B ~ Y.
cp: IdSf~GF is the unit of some adjunction if and only By passing to dual categories, one may establish a
if for any morphism a: X~G(Y) in Sf there is a unique duality between the concepts of a 'left adjoint functor'
morphism a':F(X)~Yin ( such that a=G(a')x. This and a 'right adjoint functor'; this enables one to deduce
property expresses the fact that F(X) is a free object the properties of right adjoints from those of left
over X with respect to the functor G in the sense of the adjoints.
following definition. An object Y E Ob ( together with The concept of an adjoint functor is directly con-
a morphism (: X~G(Y) is free over an objectXEOb Sf if nected with the concept of a triple (or monad) ill a
every morphism a: X~G(Y') can be written uniquely category.
in the form a = G( a')( for some morphism a' : Y ~ y'. A
References
functor G: (~Sf has a left adjoint if and only if for (I] TSALENKO, M.SH. and SHUL'GE1FER, E.G.: Fundamentals of
every X EOb Sf there is an object Y that is free over X category theory, Moscow, 1974 (in Russian).
[2] MACLANE, S.: Categories for the working mathematician,
with respect to G.
Examples of adjoint functors. 1) If G: (~, where Springer, 1971. M.Sh. Tsalenko
is the category of sets, then G has a left adjoint if and Editorial comments. A category is called complete on
only if it is representable. A representable functor the left if small diagrams have limits. A category is called
Gc:::::.HA =H dA, Y) has a left adjoint if and only if all locally small on the left if it has small hom-sets. The state-
co-products IIxExAx exist in (, where X EOb and ment that a functor has a left adjoint if and only if a), b) and
Ax =A for all x EX. c) above holds, is called the Freyd adjoint functor theorem.
2) In the category of sets, for any set A the basic AMS 1980 Subject Classification: 18A10
functor HA(Y)=H(A, Y) is the right adjoint of the
functor XXA. ADJOINT GROUP of a group G - The linear group
3) In the category of Abelian groups, the functor Ad G that is the image of the Lie group or algebraic
Hom(A, Y) is the right adjoint of the functor XA of group G under the adjoint representation (d. Adjoint
tensor multiplication by A, and the imbedding functor representation of a Lie group). The adjoint group Ad G
of the full subcategory of torsion groups is the left is contained in the group Aut g of automorphisms of
adjoint of the functor of taking the torsion part of any the Lie algebra g of G, and its Lie algebra coincides
Abelian group. with the adjoint algebra ad g of g. A connected semi-
4) Let P: ~~ be the forgetful functor from an simple group is a group of adjOint type (i.e. is iso-

47
ADJOINT GROUP

morphic to its adjoint group) if and only if its roots ADJOINT MATRIX, Hermitian adjoint matrix, of a
generate the lattice of rational characters of the maxi- given (rectangular or square) matrix A = II aik II over
mal torus; the centre of such a group is trivial. If the the field C of complex numbers - The matrix A' whose
ground field has characteristic zero and G is connected, entries a;k are the complex conjugates of the entries aki
then Ad G is uniquely determined by the Lie algebra g of A, i.e. a;k =aki. Thus, the adjoint matrix coincides
and is either called the adjoint group or the group of with its complex-conjugate transpose: A' = (A ') where-
inner automorphisms of g. In particular, if G i1: serni- denotes complex conjugation and the ' denotes transpo-
simple, Ad G coincides with the connected component sition.
of the identity in Aut g. Properties of adjoint matrices are:
References (A+B)" = A'+B', (M)' = M',
[1) PONTRYAGIN, L.S.: TopolOgical groups, Princeton Univ. Press,
1958 (translated from the Russian). (AB)' = B'A', (A')~l = (A~l)", (A')' = A.
(2) SERRE, J.-P.: Lie algebras and Lie groups, Benjamin, 1965
(translated from the French). Adjoint matrices correspond to adjoint linear
(3) HUMPHREYS, J.E.: Linear algebraic groups, Springer, 1975. transformations of unitary spaces with respect to ortho-
A.L. Onishchik normal bases.
Editorial comments. For references, see Matrix.
TS. Pogolkina
References
[A1] BOURBAKI, N.: Elements of mathematics. Lie groups and Lie AMS 1980 Subject Classification: 15A04, 10C05
algebras, Addison-Wesley, 1975, Chapt. 2; 3 (translated
from the French).
ADJOINT MODULE, contragradient module, dual
AMS 1980 Subject Classification: 20G05, 22E10, module - The module of homomorphisms of a given
22E15,22E20 module into the ground ring. More precisely, let M be
a left module over a ring R. The Abelian group
ADJOINT LINEAR TRANSFORMATION of a linear HomR(M, R) of homomorphisms of Minto R regarded
transformation A - The linear transformation A' on a as a left R-module can be made into a right R-module
Euclidean space (or unitary space) L such that for all M' by putting
x, y EL, the equality
x(cpA) = (x </A, xEM, </>EHomR(M, R), AER.
(Ax, y) = (x, A 'y)
This right module M' is called the adjoint of M. For
between the scalar products holds. This is a special case x EM, one can define an element x EM" by putting
of the concept of an adjoint linear mapping. The x(cp)=x(cp) for all cpEM'. This defines a homomor-
transformation A' is defined uniquely by A. If L is phism of Minto M'. For any left R-module C, the
finite-dimensional, then every A has an adjoint A', the mapping~: M' RC~HomR(M, C) given by
matrix fJB of which in a basis e 1, . . . ,en is related to
the matrix .;;I of A in the same basis as follows: x((cpc)n = (x </c, xEM, </>EM', CEC,

fJ6 = G~I.9l'G, is also a homomorphism. Both of these are isomor-


phisms when M is a finitely-generated projective
where .;;I' is the matrix adjoint to .;;I and G is the module [2]. It follows from the properties of the functor
Gram matrix of the basis e 1, . . . ,en. S II
Hom that (~M ~ M" (where ~ is the direct
In a Euclidean space, A and A' have the same
characteristic polynomial, determinant, trace, and eigen
sum, and II
the direct product) and that there is a
homorphism of M" into M'. The composite map-
values. In a unitary space, their characteristic polyno-
ping M' ~M'" ~M' is the identity, but M" need
mials, determinants, traces, and eigen values are related
not be isomorphic to M. The torsionjree modules in
by complex conjugation.
TS. Pigolkina the sense of Bass are those for which the above
Editorial comments. More generally, the phrase 'adjoint homomorphism of Minto M" turns out to be a
transformation' or 'adjoint linear mapping' is also used to monomorphism. This property is equivalent to the
signify the dual linear mapping cf,": M' --,>L ' of a linear map- imbeddability of M in a direct product of copies of the
ping cp: L--,>M. Here M' is the space of (continuous) linear ground ring. If R is right and left Noetherian, then the
functionals on M and cp'Cm'CI))=m'CCP(/)). The imbeddings mapping Mf->M' defines a duality between the
L--,>L', M--,>M', 1...... (-./) connect the two notions. Cf. also
categories of finitely-generated left and right R-modules
Adjoint operator.
if and only if R is a Quasi-Frobenius ring.
References
[A 1] REED, M. and SIMON, B.: Functional analysis, 1, Acad. Press, References
1972, Sect. 2. [1] BOURBAKI, N.: Elements of mathematics. Algebra: Modules.
Rings. Forms, 2, Addison-Wesley, 1975, Chapt. 4; 5; 6
AMS 1980 Subject Classification: 15A04, 47 A05 (translated from the French).

48
ADJOINT SURFACE

[2] MAc LANE, S.: Homology, Springer, 1963. where [ , ] is the bracket operation in the algebra g.
[3] MISHINA, A.P. and SKORNYAKOV, L.A.: Abelian groups and The kernel Ker ad is the centre of the Lie algebra g.
modules, Arner. Math. Soc., 1976 (translated from the Russian).
The operators ad x are derivations of g and are called
L.A. Skornyakov
inner derivations. The image ad g is called the adjoint
AMS 1980 Subject Classification: 13C99, 16A53 linear Lie algebra and is an ideal in the Lie algebra
Der g of all derivations of g, moreover Der g / ad g is
ADJOINT OPERATOR A linear operator
the one-dimensional cohomology space HI(g, g) of g,
A': Y' ~X' (where X' and y' are the strong duals of
defined by the adjoint representation. In particular,
locally convex spaces X and Y, respectively), con- ad g = Der g if g is a semi-simple Lie algebra over a
structed from a linear operator A : X ~ Y in the follow- field of characteristic zero.
ing way. Let the domain of definition DA of A be
everywhere dense in X. If for all XEDA, References
[I] JACOBSON, N.: Lie algebras, Interscience, 1962.
<Ax, g> = <x, g'>, (*) [2] PONTRYAGIN, L.S.: Topological groups, Princeton Univ. Press,
1958 (translated from the Russian).
where AXEY, gEY' and g'EX', then A'g=g' is a [3] SERRE, J.-P.: Lie algebras and Lie groups, Benjamin, 1965
uniquely defined operator from the set DA' of elements (translated from the French).
g satisfying (*) into X'. If DA = X and A is continuous, [4] HUMPHREYS, J.E.: Linear algebraic groups, Springer, 1975.
then A' is also continuous. If, in addition, X and Y are A.L. Onishchik
normed linear spaces, then II A' II = II A II. If A is com- Editorial comments.
pletely continuous, then so is A'. Adjoint operators are References
of particular interest in the case when X and Yare Hil- [A1] BOURBAKI, N.: Elements of mathematics. Lie groups and Lie
bert spaces. algebras, Addison-Wesley, 1975, Chap!. 2; 3 (translated
from the French).
References
[I] YOSIDA, K.: Functional analysis, Springer, 1980. AMS 1980 Subject Classification: 17805, 20G05,
[2] RIEsz, F. and SZOKEFALVI-NAGY, B.: Lefons d'analyse fonc- 22E10, 22E15, 22E20
tionelle, Acad. Sci. Hongrie, 1953.
v.I. Sobolev
Editorial comments. In Western literature the adjoint ADJOINT SPACE of a topological vector space E -
operator as defined above is usually called the dual or con- The vector space E' consisting of continuous linear
jugate operator. The term adjoint operator is reserved for functions on E. If E is a locally convex space, then the
Hilbert spaces, in which case it is defined by functionals fEE' separate the points of E (the
(Ax, g) = (x, A' g), Hahn- Banach theorem). If E is a normed space, then
where (-, .) denotes the Hilbert space inner product. E' is a Banach space with respect to the norm

References Ilfll = sup I [ex) I.


[A1] TAYLOR, A.E. and LAY, D.C.: Introduction to functional x#! II x II
analysis, Wiley, 1980. There are two (usually different) natural topologies on
AMS 1980 Subject Classification: 47 A05 E' which are often used: the strong topology deter-
mined by this norm and the weak-*-topology.
ADJOINT REPRESENTATION OF A LIE GROUP References
or algebraic group G - The linear representation Ad of [I] RAiKov, D.A.: Vector spaces, Noordhoff, 1965 (translated from
G in the tangent space Te( G) (or in the Lie algebra g of the Russian).
G) mapping each a E G to the differential Editorial comments. Instead of the term adjoint space
Ada = d(Inta)e of the inner automorphism one more often uses the term dual space. The weak-*-
Int a : x ~axa -I. If G <: GL( V) is a linear group m a topology on E' is the weakest topology on E' for which all
space V, then the evaluation mappings f..... f(z), fEE', XEE, are continu-
ous.
(Ada)X = aXa- 1 , XETe(G) =g C End(V).
References
The kernel Ker Ad contains the centre of G, and if G is [A 1] ScHAEFER, H.H.: Topological vector spaces, Macmillan,
connected and if the ground field has characteristic 1966.
zero, coincides with this centre. The differential of the AMS 1980 Subject Classification: 46A20, 46810
adjoint representation of G at e coincides with the
adjoint representation ad of g. ADJOINT SURFACE - A surface Y that is in Peter-
The adjoint representation of a Lie algebra g is the son correspondence with a given surface X and is,
linear representation ad of the algebra g into the moreover, such that the asymptotic net on Y
module g acting by the formula corresponds to a conjugate net a on X with equal
(adx)y = [x,y], X,yEg, invariants, and vice versa. The adjoint surface Y is the

49
ADJOINT SURFACE

rotation indicatrix for X, and vice versa. If (J is a princi- Editorial comments. The adjustment method is also
pal base for a deformation of X, then Y is a Bianchi called the time-stepping method. Such a time-stepping
surface. method with m =1 arises quite naturally when an elliptic
I.Kh. Sabitov boundary value problems is viewed as a steady state of a
AMS 1980 Subject Classification: 53A05 (dissipative) parabolic problem (ct. [A2]). Because of the
inherent (numerical) stiffness an implicit discretization
ADJUSTMENT MEfHOD - A method in which the method, such as BDF, should be advocated.
Similar ideas can be used in optimization theory by e.g.
solution u of a stationary problem
constructing a mechanical system whose stable equilibrium
Au =j (1) state is the desired optimum. Again this leads to all kinds of
is regarded as the steady-state limit solution for t~oo iteration algorithms, as in [A3].
of a Cauchy initial value problem for a non-stationary References
[A 1] BABUSHKA, I. and SoBOLEY, S.L.: 'The optimization of numeri-
evolution equation involving the same operator A (cf.
cal processes', Appl. Mat. 10 (1965),96-130.
Cauchy problem). This evolution equation may e.g. be [A2] KUBICEK, M. and HLAVACEK, Y.: Numerical solution of non-
of the form m diu(t) linear boundary value problems with applications, Prentice-
i~l Ci dt i = j - Au(t), (2) Hall,1983.
[A3] RAzUMIKHIN, B.S.: Physical models and equilibrium methods

dku
dt k I = UOb k=O, ... ,m-I.
in programming and economics, Reidel, 1984.
[A4] RHEINBOLDT, W.e.: Numerical analysis of parametrized non-
1=0 linear equations, Wiley, 1986.

Here the Ci are suitable operators which guarantee the AMS 1980 Subject Classification: 65HXX
existence of the 'adjustment limit' liml-+OO u(t)=u.
ADSORPTION - 'Consumption' of matter from a gas
A result of using adjustment is that it permits one to
use approximate solution methods of (2) in order to or from an opening of the interface between them (or
construct iteration algorithms for solving equation (1) from the surface of a solid body). In other words,
(cf. Iteration algorithm). Thus, for the non-stationary
adsorption is the 'consumption' by an adsorbate from a
volume of gas on the surface of the adsorbent. Adsorp-
equation (2) one could employ a discretization (dif-
tion is a particular case of sorption.
ferencing) with respect to t solution method which is
The molecules of the adsorbate falling on the surface
convergent and stable to obtain approximate solutions.
of the adsorbent are retained by the surface force for a
For example, for m = 1, an explicit method of the form
period of time, which depends on the natures of the
C 1 U(tn+l;-U(t n) = j-Au(tn) adsorbent and adsorbate, on the temperature T and on
n the pressure p, after which they leave the surface (are
where Tn =tn +3 - tn >0. And then this method can be desorbed). Under conditions of thermodynamic and
interpreted as an iteration algorithm molecular equilibrium, the rates of adsorption and
C1(un+1-u n) = Tn(j-Au n), n =0,1, ... , UO = uoo, desorption are equal. The relation between the relative
pressure cf>=p / ps of the adsorbent and the relative
for solving equation (1), in which C I and Tn are now concentration O=c / cs , where the index s stands for
seen as characterizing this (iteration) method. the limit value at a constant temperature, is known as
Varying the form of the operators Ci and considering the adsorption isotherm.
different discretizations with respect to t in equation (2) Langmuir's equation of mono-molecular adsorption
(explicit schemes, implicit schemes, splitting schemes, has the form
etc.) gives the possibility of obtaining a wide variety of 8
</> = k(l-8)'
iteration methods for solving equation (1). For these
methods equation (2) will be the closure of the compu- where k is the equilibrium constant which roughly
tational algorithm (d. Oosure of a computational algo- describes the interaction between the adsorbent and the
rithm). A generalization of the adjustment method IS adsorbate.
the continuation method (to a parametrized family). Brunauer's equation [l] is commonly used in the case
of a homogeneous surface of the adsorbent and poly-
References molecular adsorption.
[1] BAKHVALOY, N.S.: Numerical methods: analysis, algebra, ordi
nary differential equations, Mir, 1977 (translated from the Rus Posnov's formula [2], which is empirical, is widely
sian). employed for capillary bodies:
[2] GoDUNOY, S.K. and RYABEN'Kli, V.S.: The theory of difference
schemes, North-Holland, 1964. 7i1 = A In</>+ 1,
[3] MARCHUK, G.!. and LEBEDEY, V.I.: Nt:merical methods in the
theory of neutron transport, Moscow, 1971 (in Russian). where A is a coefficient which varies with the tempera-
VI. Lebedev ture and with the structure of the adsorbent.

50
AERODYNAMICS. MATHEMATICAL PROBLEMS OF

References general equations by making the Reynolds number -


[I) BRUNAUER, S.: Adsorption of gases and vapors, Princeton Univ. which describes the order of the ratio between the iner-
Press, 1943.
[2) POSNOV, V.A.: Zh. Tekhn. Fiz., no. 23 (1953), 865. tial and the viscous forces - grow without limit.
[3) IL'IN, B.V.: The nature of adsorption forces, Moscow-Leningrad, In most problems of applied aerodynamics of aircraft
1952 (in Russian). and rockets, and in seacraft hydrodynamics, the Rey-
[4) BOER, J.H. DE: The dynamical character of adsorption, Claren-
don Press, 1968.
nolds numbers are very large (l(f - 108 ), so that the
A. V. Lykov model of an ideal fluid yields an accurate description of
Editorial comments. The Brunauer- Emmett- Teller flow processes around solid bodies under these condi-
equation, or BET-equation, [A1] generalizes the Langmuir tions, except for the boundary layer zone. The system
equation. It assumes multi-molecular adsorption for which of aerodynamic equations of an ideal fluid are:
each layer obeys a Langmuir equation. There are several
modifications, cf. [A2]. p-
dV = pg-Vp
dt
References
[A1] BRUNAUER, S., EMMETT, P.H. and TELLER, E.: J. Amer. Chem. (equation of motion);
Soc. 60 (1938), 309.
[A2] BRUNAUER, S. and COPELAND, L.E.: 'Surface tension, adsorp- !!e.
dt
+ p div V = 0
tion', in E.U. Condon and H. Odishaw (eds.): Handbook of
physics, Vol. 2, McGraw-Hili, 1967, Chapt. 7. (continuity equation);
AMS 1980 Subject Classification: 76S05, 80A20,
82A99 p![';2+V) = p(g, V)-div(pV)

AERODYNAMICS, MATHEMATICAL PROBLEMS (equation of conservation of energy). Here V is the


OF - Problems involved in the solution of fundamental velocity vector, p is the density, p is the pressure, g is
equations in aerodynamics, which are an accurate the acceleration due to gravity and U is the internal
description of the laws of motion of a gas-like medium energy. It is a system of equations of the first order
and its interaction forces with solids moving in this with respect to the functions sought. This is also true
medium. Turbulence, for which no satisfactory for a mixture of chemically interacting gases: The equa-
mathematical model has yet been established, forms an tions of the transfer of the mixture components are
exception to this rule. Processes without turbulence or added to the above equations, while the heat generation
in which the turbulence can be neglected may, in prin- due to chemical reactions is added to the energy equa-
ciple, be sufficiently completely investigated by a tion.
mathematical solution of the relevant equations. The A number of simplified models can be derived from
real possibilities of solving such equations differ from the model of an ideal fluid. The most important ones
the theoretical; accordingly, the orders of the individual are the model of a weight-less fluid and the model of
terms in the equations are estimated, and on this basis an incompressible fluid. The former model is the limit-
simplified mathematical models are established for dif- ing case of the general model as the Froude number
ferent classes of aerodynamic processes, in which one increases without limit; the Froude number describes
or more 'small' terms in the complete set of aero- the order of the ratio of the inertial to the gravity
dynamic equations are discarded. Even for such simpli- forces. This model is very accurate when applied to the
fied models analytic solutions can be obtained only for aerodynamics of aircraft and rockets (with Froude
the simplest cases, and numerical methods are widely numbers of order 1()2 - 103 ). However, the force of
employed in practical work in aerodynamics. These gravity becomes significant in seacraft hydrodynamics
methods have become prevalent with the introduction and especially so in meteorological applications (with
of electronic computers. Froude number of order ~ 1).
As regards the nature of the equations involved in The equation of an incompressible fluid is obtained
aerodynamical problems, these can be subdivided into from the general system by putting p = const. This form
four large classes. of the equation of motion is obtained as the Mach
1) Problems in the aerodynamics of an ideal (non- number tends to zero (which is the relationship of the
viscous and thermally non-conducting) fluid, the word characteristic velocity of sound in a fluid medium). In
'fluid' being used in this context to denote both liquids the general case this is not true; it is only valid for
and gases. Froude numbers ~ 1 (i.e. if MaFr), and also for
2) Problems in the aerodynamics of viscous fluids. quasi-stationary processes, in which the Strouhal
3) Problems in the aerodynamics of radiant gases. number (the criterion of similitude of non-stationary
4) Problems in the aerodynamics of rarefied gases. motions of the liquid or the gas) is of order ~ 1.
The equations of an ideal fluid are obtained from the The simplest model is that of an incompressible

51
AERODYNAMICS, MATHEMATICAL PROBLEMS OF

fluid. In this model the entire system of equations is solutions have been found for planar problems of the
reduced to the single Laplace equation for the velocity theory of waves only. Little progress has been made on
potential cI> (\7 cI> = V). However, it is only in the case the spatial problem of waves with a finite amplitude.
of plane-parallel motions of a weight-less fluid and On the other hand, the linearized problem (small wave
axi-symmetric motion around solids of revolution that amplitudes) has been largely solved, and the calculation
the solution of aerodynamic problems is reduced to of the wave resistance to the motion of a ship is actu-
problems in classical potential theory. ally based on the linear theory.
The problem of flow around a given shape is solved The possibility of aircraft travelling at near-sonic and
by classical methods of the theory of conformal map- supersonic speeds stimulated the search for solutions of
ping. The flow around solids of revolution at zero angle aerodynamic problems of compressible fluids. The
of incidence is determined by the classical solution of problems involving subsonic, transsonic and supersonic
the exterior Neumann problem for the Laplace equa- velocities are all treated in substantially different
tion. In view of the practical importance of these prob- manners. In the subsonic range the aerodynamic equa-
lems (determination of the characteristics of wing pro- tions remain elliptic and the solutions qualitatively
files and fuselages) many methods for their solutions resemble those obtained for incompressible fluids.
have been developed that achieve an acceptable accu- In the first period of research on the aerodynamics of
racy by means of relatively simple calculations. If elec- compressible fluids much research was done on the
tronic computers are employed, there is no point in effect of compressibility on the aerodynamic charac-
using these methods, since accurate numerical methods teristics of the solids around which the flow takes place.
for solving these problems belong to the simplest prob- All these methods involve some sort of linearization of
lems of computational mathematics. the velocity of the unperturbed flow. Khristianovich's
In the spatial case the formal solution of the Neu- method is an exception, permitting as it does to obtain
mann problem for the velocity potential will accurate solutions of the equations of subsonic aero-
correspond to the physical picture of the flow only in dynamics, though for a solid whose shape is unknown
very exceptional cases. In a viscous fluid a solid is fol- in advance.
lowed by a turbulent trail. As the Reynolds number In practice, the difficulties involved in the problems
increases, this trail becomes thinner (for a continuous of subsonic aerodynamics are the same as those to be
flow) and in the limit it becomes an infinitely thin tur- dealt with when investigating incompressible fluids:
bulent surface, the intensity of the turbulent layer of problems involving plane-parallel and axi-symmetric
which vanishes only in exceptional cases (e.g. in the flows may be fairly easily solved by numerical methods
case of plane-parallel motion). Accordingly, the prob- (cf. Integral-relation method; Adjustment method). In
lems involving discontinuous velocity potentials in the the case of spatial flows with discontinuous velocity
region of the flow zone represent the real spatial prob- potential, solutions can only be obtained for linearized
lems of an incompressible fluid. The location of this equations. These equations can be converted, by an ele-
discontinuity surface is unknown, and for this reason mentary transformation, into equations for incompres-
the exact problem of flow around solid bodies in the sible fluids in which the boundary conditions and the
presence of a discontinuous potential surface is a very conditions on the surface of discontinuity are the same
complicated non-linear problem. It is only in the linear as those for incompressible fluids.
approximation, i.e. on the assumption that the solid The zone of trans sonic aerodynamics is, from the
around which the fluid is flowing disturbs the basic point of view of mathematics, including numerical
mainstream only to a small extent (a thin wing at a mathematics, the most difficult to study. The presence
small angle of incidence) that the problem could be of local supersonic (parabolic) zones which almost
solved. In such a case the surface of turbulence can be always terminate in shock waves precludes any analytic
considered as horizontal, and it may be assumed that approximation. Moreover, these problems display the
the potential along the stream has a constant discon- principal drawback of equations of elliptic type - the
tinuity. The normal derivatives of the velocity poten- propagation of the effect of any particular disturbance
tial are given on the projection plane of the solid. Ana- throughout the space. The most suitable method for
lytic solutions have been obtained on these asumptions solving the problems of transsonic aerodynamics is the
(for a circular and an elliptic wing). Numerical methods adjustment method, in which a non-stationary aero-
of solving this problem for other shapes of wings are dynamic problem is solved (starting from an arbitrary
available and simplify the task to a considerable extent initial state), and the solution of the stationary problem
(cf. Wing theory). of trans sonic flow around a solid is obtained as a limit
The principal difficulty in problems involving a solution of the non-stationary problem as the time
heavy fluid is that the boundary conditions for the tends to infinity. This method is very laborious, but is
potential on the free surface are non-linear. Exact quite feasible if electronic computers are employed.

52
AERODYNAMICS, MATHEMATICAL PROBLEMS OF

As regards mathematical methods of study, super- the local subsonic zone (including the supersonic part
sonic aerodynamics may be subdivided into three fields. of the region of influence) is followed by calculations of
1) Pure supersonic flow. the flow by methods of purely supersonic aerodynam-
2) Mixed flow, with formation of local subsonic lCS.
zones. There is another type of problems, which are con-
3) High-temperature hypersonic flow producing nected with hypersonic flow around thin solids (prior
chemical reactions in the gas. to the initiation of chemical reactions). Hypersonic
At higher velocities and, consequently, at higher tem- flows around thin solids (excepting the blunt-shaped
peratures ionization takes place and radiation processes regions) are distinguished by small changes in the velo-
become significant. This part of hypersonic aerodynam- city component along the main stream. For this reason
ics may be said to form a separate field, since the the equations may be simplified so that the problem of
mathematical problems involved are quite different flow around a thin body of a given shape (in the planar
from those arising in the case of 'transparent' gases. and axi-symmetric cases) becomes identical to the one-
Problems in pure supersonic flow have been most dimensional stationary problem. Many important qual-
thoroughly investigated. This included the develop- itative characteristics of hypersonic flows have been
ment of numerical methods of characteristics, finite obtained in this way and approximate similitude laws
difference methods and methods of semi-characteristics, have been established; these are also extensively
by which relatively simple solutions may be obtained employed in the analysis of the results of numerical cal-
not only for plane-parallel or axi-symmetric but also culations, which are then reduced to very compact rela-
for spatial flows. The linear theory of supersonic flow tions in a very wide range of Mach numbers and
has also been exhaustively studied. Analytic solutions geometric parameters of the solid.
can be obtained of numerous practical problems. The Aerodynamic problems concerned with gases under-
latter may be somewhat complicated by the occurrence going chemical reactions involve the simultaneous solu-
of weak shock waves, but even these difficulties are of a tion of the equations of motion and the equations of
computational and not of a principle character. chemical kinetics. While the resulting systems of equa-
Practically speaking, these problems are limited to tions are more complicated, the numerical methods
flows around sharply pointed solids and to the prob- employed in their solution are essentially the same as
lems of internal aerodynamics (calculations of jet noz- those employed in the aerodynamics of an ideal gas,
zles). However, if the supersonic velocities are very except that they are much more laborious. Such calcu-
high, sharp-edged solids are not employed (charring of lations are extensively carried out in practical work,
the edges), while a local subsonic flow always occurs at and since modelling natural conditions in aerodynamic
the blunt end. Having computed this region, the tubes cannot be realized in this temperature range, the
remaining (pure supersonic) flows are calculated by the aerodynamic characteristics of hypersonic instruments
methods of supersonic aerodynamics. are mainly obtained by numerical methods.
As compared with the aerodynamics of transsonic There are two principal trends in the theory of
flow, the advantage of the problems of aerodynamics of viscous fluids - the theory of the complete equations
supersonic flow with local subsonic zones (even though of a viscous fluid (Navier - Stokes equations), and
such zones are mixed) consists in the fact that such boundary-layer theory. The boundary-layer equations
zones usually represent bounded narrow domains in the represent the principal term of the asymptotic expan-
vicinity of the blunt ends. This is why it was possible to sion of the Navier-Stokes equations in the vicinity of
develop effective numerical methods for the computa- the boundary of the solid to which the fluid particles
tion of local subsonic zones (the method of integral adhere, completely or in part. It is this fact which radi-
relations, the method of inverse problems and the cally disturbs the solutions obtained for ideal fluids
adjustment method). It should be noted, however, that near the surface of the solid around which the flow
up to the time of writing (1977) no rigorous takes place. The order of the error involved in the
mathematical investigations of these problems have boundary-layer equations is 1 / 5e, which means
ever been performed. There is also no proof that their that the theory is valid for large Reynolds numbers
solutions do exists and are unique. Accordingly, numer- only, and even then only in the regions of smooth con-
ical methods are developed on the assumption that the tinuous flow. Despite the fact that the boundary-layer
conditions of continuous velocities and accelerations equations contain all the main terms of viscous stresses,
during the transition from the subsonic to the super- their mathematical structure is much simpler. While the
sonic zone ensure the uniqueness of a solutions, which complete equations are of elliptic type, the boundary-
is physically realistic. This hypothesis is confirmed by layer equations are parabolic with characteristics
all the results of numerical calculations. directed along the normal to the surface of the solid. It
Up till here, it has been said that the computation of is therefore possible to conduct the computations 'layer

53
AERODYNAMICS, MATHEMATICAL PROBLEMS OF

by layer', i.e. to pass from one section of the boundary only be obtained by some kind of iterative process. The
layer to another, irrespective of the conditions prevail- iterative process is so constructed that separate equa-
ing outside the range between the sections. tions for the vortex velocity and for the flow function
One consequence of the practical importance of the are obtained at each iteration step. Such a separation
theory of boundary layers (computations of the resis- must take place not only in the equations, but in the
tance, of the surface temperatures, of the rate of des- boundary conditions as well. Thus, by choosing the
troyment of the surface under hypersonic flight condi- solution of the ideal fluid equations as the first
tions) was the development of numerous approximate approximation, one obtains the following set of itera-
methods for performing such calculations (the tions:
Pohlhausen method, the one-parameter A _ aWn +1 _ [a<pn aWn _ a<pn aWn J
Kochin - LoitsyanskiI method, etc.). However, if elec- I.1Wn +1 2 ax - 2 ay ax ax ay ,
tronic computers are employed, all these approximate
methods become superfluous, since an accurate numeri- ~o/n+l = Wn+h

cal solution of the boundary-layer equations presents with the following boundary conditions on the solid
no difficulties, even in the difficult case of a hot gas in boundaries:
which chemical reactions are taking place. The numeri- alj;n
o/n+l = const, Wn+l = aa;-+wn'
cal routines may be so constructed that at each step the
where CP=t/;-y, which means that a separate set of
system of equations is split, with respect to the x-
equations for the vortex flow and the flow function is
variable (along the tangent to the surface of the solid),
thus obtained in the (n + 1)-st approximation. The solu-
into a number of separate second-order differential
tion may be obtained in a similar manner by the
equations which, from the computational point of view,
adjustment method, but it is not necessary in such a
is highly advantageous.
case to take a real system of non-stationary equations;
What was said above applies to plane-parallel and to
rather, one may consider the system of parabolic equa-
axi-symmetric flows. The structure of three-
dimensional boundary layers is more complicated. The tions: J
equations of the three-dimensional layers themselves
~
at = ~W-2[~~-~~
ay ax ax ay ,
vary significantly with the geometry of the solid around
~
at = ~,,-w
which the flow takes place. The development of 't',

methods of calculation of three-dimensional boundary which is more convenient to handle in numerical calcu-
layers is much less advanced, partly because calcula- lations.
tions on plane sections are often used in practice; this, The problem of flow around a body by a viscous
despite the lack of a theoretical basis, often yields suffi- fluid involves not merely the development of computa-
ciently accurate results. There are no principal difficul- tion methods, but also research on fundamental ques-
ties in solving the exact equations of a three- tions. Thus, it is unclear whether a stationary solution
dimensional layer. exists at large Reynolds numbers (e.g. for the case of
Except for a few special cases, in which analytic solu- flow around a cylinder). Numerical methods yield
tions could be obtained, the development of methods acceptable solutions up to Reynolds numbers of several
for solving the complete system of equations for a hundreds, but above these values of the Reynolds
viscous liquid is due to the introduction of electronic number the numerical calculations no longer converge.
computers. The principal difficulty in solving such Problems in the aerodynamics of radiant gases and
equations is their high order and the fact that the of rarefied gases have been solved only for the simplest
region of influence is unbounded. In the simplest case cases. However, in view of the practical importance of
of stationary motion of a viscous incompressible fluid these problems (in both these fields), mathematical
with a constant viscosity coefficient one obtains a sys- models of the relevant phenomena have been proposed.
tem of two equations of the second order with respect Such models very often are no limiting cases of the
to the stream function t/; and vortex velocity w. They complete equations (model of gray radiation, model of
may be written in the dimension-less form as follows: diffusive radiation in the theory of radiant gases,
Crook's model in the theory of rarefied gases), but
~w = 2 [2f~-~~J
ay ax ax ay , merely qualitative reflections of the relationships
described in the complete equations. The volume of the
~Ij; = w. computations involved in solving these problems is very
The boundary conditions on solid boundaries are only large, and success in this field will depend on the possi-
for the stream function t/; = const, at/; / an = O. The solu- bilities for increasing the effectiveness of electronic
tion of this non-linear system of the fourth order can computers.

54
AFFINE CONNECTION

References any affine algebraic set can be defined by a finite


[1] KocHIN, N.E., KmEL', LA. and ROZE, N.V.: Theoretical hydro- number of polynomials (fl, ... ,fi)Ek[T]. The equal-
dynamics, Interscience, 1964 (translated from the Russian).
[2] KHRISTIANOVICH, S.A.: Gas flow around bodies with large sub-
ities fl = ... = fi =0 are called the equations of X. The
sonic speed, Moscow, 1940 (in Russian). affine algebraic sets of 'C form a lattice with respect to
[3] SEDOV, L.J. [L.I. SEDOV]: Two-dimensional problems in hydro- the operations of intersection and union. The ideal of
dynamics and aerodynamics, Acad. Press, 1965 (translated from
the Russian).
the intersection X n Y is identical with the sum of
[4] HAYES, W.: Hypersonic flow theory, Acad. Press, 1966. their ideals Wx + Wy , while the ideal of the union
[5] CHERNY!, 0.0.: Gas flow with large supersonic speed, Moscow, X U Y is identical with the intersection of their ideals
1959 (in Russian).
[6] BELOTSERKOvsKIi, O.M. and CHusHKIN, P.I.: 'The numerical Wx n Wy. Any set 1(n is an affine algebraic set, called
solution of problems in gas dynamics', in Basic developments in an affine space over k and denoted by AZ; to it
fluid dynamics, Vol. 1, Acad. Press, 1965, pp. 1-126. corresponds the zero ideal. The 'empty subset of In is
[7] CHusHKIN, P.I.: The method of characteristics for spatial super-
sonic flows, Moscow, 1968 (in Russian). also an affine algebraic set with the unit ideal. The
[8] Flow around blunted bodies of a supersoniC gas flow, Moscow, quotient ring k[X]=k[TJ/ Wx is called the coordinate
1967 (in Russian). ring of X. It is identical with the rin~ of k-regular func-
[9] Numerical methods in fluid mechanics, Wroclaw, 1969 (in Pol-
ish). tions <,:n X, i.e. with the ring of k-valued functions,
A.A. Dorodnitsyn f: X~k, for which there exists a polynomial FEk[T]
Editorial comments. In the equation of the conservation such thatf(x)=F(x) for all XEX. An affine algebraic
of energy (g, V) denotes the scalar product of two vectors. set is said to be irreducible if it is not the union of two
For some material on the branch or aerodynamics dealing affine algebraic proper subsets. An equivalent defini-
with such things as plasmas and the mechanics of stellar tion is that the ideal Wx is prime. Irreducible affine
gases (magneto- and electro-aerodynamics) see the article
algebraic sets together with projective algebraic sets
Magneto-hydrodynamics, mathematical problems in.
were the subjects of classical algebraic geometry. They
References were called, respectively, affine algebraic varieties and
[A1] BACHELOR, O.K.: An introduction to fluid dynamics, Cam-
bridge Univ. Press, 1970. projective algebraic varieties over the field k (or k-
[A2] PAl, S.I.: Viscous flow, v. Nostrand, 1956. varieties). Affine algebraic sets have the structure of a
[A3] HOWARTH, L. (ED.): Modern development in fluid dynamics. topological space. The affine algebraic subsets are the
High speed flow, 1-2, Oxford Univ. Press, 1953.
[A4] BlRKHOFF, G.: Hydrodynamics, a study in logic, fact and
closed sets of this topology (the Zariski topology). An
similitude, Princeton Univ. Press, 1960. affine algebraic set is irreducible if and only if it is
[A5] KARMAN, TH. VON, ET AL. (EDS.): High speed aerodynamics irreducible as a topological space. Further development
and jet propulsion, 1-2, Princeton Univ. Press, 1964. of the concept of an affine algebraic set leads to the
[A6] PAl, S.I.: Radiation gas dynamics, Springer, 1966.
[A7] FERRARO, V.C.A. and PLUMPTON, c.: An introduction to concepts of an affine variety and an affine scheme.
magneto-fluid mechanics, Oxford Univ. Press, 1966. References
[A8] KOGAN, M.V.: Rarified gas dynamics, Acad. Press, 1969 [1] ZARISKI, O. and SAMUEL, P.: Commutative algebra, 2, Springer,
(translated from the Russian). 1975.
[A9] BERS, L.: Mathematical aspects of transsonic gas dynamics, [2] SHAFAREVICH, I.R.: Basic algebraic geometry, Springer, 1977
Wiley, 1958. (translated from the Russian).
[3] HARTSHORNE, R.: Algebraic geometry, Springer, 1977.
AMS 1980 Subject Classification: 73JXX, 76GXX
1. V. Dolgachev
AFFINE ALGEBRAIC SET, affine algebraic k-set - V.A. Iskovskikh
The set of solutions of a given sy~em of algebraic Editorial comments. A topological space is irreducible if
equations. Let k be a field and let k be its algebraic it is not the union of two closed proper subspaces.
closure. A subset X of the Cartesian product 1(n is said AMS 1980 Subject Classification: 14A25, 14A 10,
to be an affine algebraic k-set if its points are the com- 14A05
mon zeros of some family S of the ring of polynomials
k[T]=k[TI' ... ,Tn]. The set Wx of all polynomials AFFINE CONNECTION - A differential-geometric
in k [TI' ... ,Tn] that vanish on X forms an ideal, the structure on a smooth manifold M, a special kind of
so-called ideal of the affine algebraic k-set. The ideal Wx connection on a manifold (cf. Connections on a mani-
coincides with the radical of the ideal I (S) generated fold), when the smooth fibre bundle E attached to M
by the family S, i.e. with the set of polynomials has the affine space An of dimension n = dim M as its
fEk[TJ, ... ,Tn] such that fmEI(S) for some natural typical fibre. The structure of such an E involves the
number m (Hilbert's Nullstellensatz; cf. Hilbert theorem assignment to each point x EM of a copy of the affine
3)). Two affine algebraic sets X and Y coincide if and space (An)x, which is identified with the tangent
only if Wx = Wy. The affine algebraic set X can be centro-affine space TAM). In an affine connection
defined by a system of generators of Wx. In particular, each smooth curve L EM with origin Xo and each one

55
AFFINE CONNECTION

of its points XI is thus provided with an affine mapping A smooth field of frames is chosen in a coordinate
(An)x, ~(An)xo which satisfies the condition formulated neighbourhood of the origin Xo of the curve L, and the
below. Let M be covered with coordinate domains, image of the frame at point XI is defined as the solution
each provided with a smooth field of affine frames in {x(t), ei(t)} of the system
(An)x. The origin of these frames coincides with X (i.e. n
smooth vector fields, linearly independent at each point du = (W')X(I}(x(tu" 1 (5)
X of the domain, are given). The requirement is that, as du) = (wj)X(t)(x(tu"
t~O, when XI moves along L towards Xo, the mapping
for the initial conditions u(O)=O, ui(O)=ei' where
(An)x, ~(An)xo tends to become the identity mapping,
Xi =Xi(t) are the defining equations of the curve L. The
and that the principal part of its deviation from the
curve which is described in (An)x o by the point with
identity mapping be defined, with respect to some
position vector X (t) with respect to Xo is known as the
frame, by the system of linear differential forms
development of L. The field of frames in the coordinate
w' = ndx k , det I n 1=1= O,} neighbourhood may be so chosen that Wi =dx i ; then
wj = fJkwk. (1)
w; = r)k dxk. In the intersection of the coordinate
Thus, for (An)x, ~(An)xo' the image of the frame at XI is neighbourhoods, dx i =(OXi /oxj)wI, i.e.
the system consisting of the point in (An)x o with posi- Aj =ox i /ox j and
tion vector ei[wi(X)t +t:i(t)] and n vectors r L
,
, = axi ax] axk,
--~~f' +
2 x' ax' a (6)
e;[8j+wj(X)t+t:j(t)], where Xis the tangent vector to L ]k ax' ax) axk ]k ax) axk ax"
at Xo, and
Sjk = fjk-n),
af'1 ar 'k
1
a;' + r~kr'i/- r~/r'ik'
(7)
lim M = 0, lim (j(t) = 0.
t~O t t~O t Rjkl = ax~ -
A manifold M with an affine connection defined on Here Sjk and Rjkl are, respectively, the torsion tensor
it is called a space with an affine connection. During the and the curvature tensor of the affine connection on M.
transformation of a frame of the field at an arbitrary An affine connection on M may be defined by a system
point, XEM according to the formulas ei =A{-ej, of functions fjk on each coordinate neighbourhood
ej = A j ei, i.e. when passing to an arbitrary element of which transforms in the intersection of two neighbour-
the principal fibre bundle P of frames in the tangent hoods according to formula (5). The system fjk is
spaces (An)x with origins at the point x, the forms (1) called the object of the affine connection. The mapping
are replaced by the following I-forms on P: (An)x, is obtained with the aid of (5) into which
w' = Ajw', } w' = dx', wj = rjk dXk,
wi,] = AidA k + A i A I, wk (2)
is to be substituted.
k] k] I,

while the 2-forms If, in some neighbourhood of the point Xo, a vector
field X=~iei is given, then, when (An)x,~(An)xo' the
Q', = dw ,+W ,I\w' , }
l ]I
vector Xx (I) is mapped into the vector ~i(xl)ei(t) (where
(3)
Qj = dwj+wkl\wj, {ei(t)) is the solution of system (5. The differential of
are transformed as follows: this in (An)x o at t =0:

where Qi and Qj are composed from the forms (2)


(d~'+~'wj)e, = [aa;~ +~rjk] dxke"
according to (3). The equations (3) are called the struc- is called the covariant differential of the field X with
ture equations of the affine connection on M. Here the respect to the given affine connection. Here
left-hand sides - the so-called torsion forms Qi and
curvature forms Qj - are semi-basic (cf. Torsion form; \l t'
kS
= ~+tif'
axk""]k
Curvature form), i.e. they are linear combinations of the
form a tensor field, called the covariant derivative of the
wk Aw':
QI = .lSI
2 ] k w'/\ W k , 1 field X=~iei' If a second vector field Y=1{ek is given,
the covariant derivative of the field X in the direction
(4)
i_I I k 1
of Y is defined as
Q) - '2R]kIW I\w.

All I-forms Wi and w;,


defined on P and satisfying
which may also be defined with respect to an arbitrary
equations (3) with left-hand sides of type (4), define a
certain affine connection on M. The mapping field of frames by the formula
(An)x, ~(An)xo for a curve L EM is obtained as follows. w'(\l yX) = Y W'(X)+ w,,(Y)d(X).

56
AFFINE COORDINATE FRAME

An affine connection on M may also be defined as a The concept of an affine connection arose in 1917 in
bilinear operator V which assigns a vector field V y X Riemannian geometry (in the form of the Levi-Civita
to each two vector fields X and Y, and which possesses connection); it found an independent meaning in 1918 -
the properties: 1924 owing to work of H. Weyl [1] and E. Cartan [2].
\7 y(jX) = (Yj)X + f\7 yX, \7 jYx = f\7 yX, References
[1) WEYL, H.: Raum, Zeit, Materie, Springer, 1923.
where f is a smooth function on M. The relation [2A) CARTAN, E.: 'Sur les varietes a connexion affine et la throrie
between these definitions is established by the formula de la reIativite generaIisee (premiere partie)', 40 (1923), 325-
V ek ej = f)k ej where {ej} is the field of frames. The
412.
[2B) CARTAN, E.: 'Sur les varietes a connexion affine et la theorie
fields of the torsion tensor and curvature tensor de la relativite generaIisee (premiere partie suite)" 41 (1924),
1-25.
S(X, Y) = S)kI'!T/ei = Oi(X, Y)ei' [2C] CARTAN, E.: 'Sur les varietes a connexion affine et la throrie
R(X, Y)Z = (R)k/~rI')I!ei = O)(X, y)wi(Z)ei de la relativite generaIisee (deuxieme partie)', 42 (1925), 17-88.
[3A) CARTAN, E.: 'Sur les varietes it connexion projective', Bull. Soc.
are defined by the formulas: Math. France 52 (1924), 205-241.
[3B) CARTAN, E.: 'Sur les espaces it connexion conforme', Ann. Soc.
S(X, Y) = \7x Y -\7yX-[X, Y], Polon. Math. 2 (1923), 171-221.
R(X, Y) = \7x\7yZ-\7y\7xZ -\7[X,y]z. [4) RASHEWSKI, P.K. [P.K. RAsHEVSKI11: Riemannsche Geometrie
und Tensorana/yse, Deutsch. Verlag Wissenschaft., 1959
A vector field X is said to be parallel along the curve L (translated from the Russian).
if V x(t)Xx(t) =0 holds identically with respect to t, i.e. [5) POSTNIKOV, M.M.: The variational theory of geodeSiCS, Moscow,
1965 (in Russian).
if, along L, (j Lumiste
d~i +I'!wj = 0,
Editorial comments. Instead of the articles [3A], [3B],
Parallel vector fields are used to effect parallel displace- one may consult [A 1]. Useful additional up-to-date refer-
ment of vectors (and, generally, of tensors) in an affine ences in English are [A2] and [A3].
connection, representing a linear mapping of the References
tangent vector spaces Tx,(M)~Txo(M), defined by the [A 1] LICHNEROWICZ, A.: Theorie globale des connexions et des
groupes d'holonomie, Cremonese, 1955.
mapping (An)x, ~(An)xo' In this sense any affine con-
[A2] KOBAYASHI, S. and NOMIzu, K.: Foundations of differential
nection generates a linear connection on M. geometry. 1, Wiley (Interscience), 1963.
A curve L is called a geodesic line in a given affine [A3] STERNBERG, S.: Lectures on differential geometry, Prentice-
Hall, 1964.
connection if its development is a straight line; in other
words, if, by a suitable parametrization, its tangent vec- AMS 1980 Subject Classification: 53B05
tor field x(t) is parallel to it. Geodesic lines are AFFINE COORDINATE FRAME - A set of n
defined with respect to a local coordinate system by the linearly-independent vectors ej (i = 1, ... ,n) of n-
system dimensionru affine space An, and a point O. The point
d 2 Xi dxi dx k
o is called the initial point, while the vectors ej are the
.
-2-+ f j kd - - d = O.
dt t t
scale vectors. Any point M is defined with respect to
Through each point, in each direction passes one geo- the affine coordinate frame by n numbers - coordi-
desic line. nates x j , occuring in the decomposition of the position
There is a one-to-one correspondence between affine vector OM by the scale vectors: OM=xses (summation
connections on M and connections in principal fibre convention). The specification of two affine coordinate
bundles of free affine frames in (An)x, x EM, generated frames defines a unique affine transformation of the
by them. To closed curves with origin and end at x space An which converts the first frame into the second
there correspond affine transformations (An)x ~(An)x' (see also Affine coordinate system).
which form the non-homogenous holonomy group of A.P. Shirokov
the given affine connection. The corresponding linear Editorial comments. An equivalent, and more usual,
definition is as follows. An affine coordinate frame in affine
automorphisms TX<M)~TX<M) form the homogeneous
n-space is a set of n + 1 points Po, ... ,Pn which are
holonomy group. In accordance with the holonomy
linearily independent in the affine sense, i.e. the vectors
theorem, the Lie algebras of these groups are defined by
Po Pi , i = 1, ... ,n, are linearly independent in the
the 2-forms of torsion OJ and curvature OJ. The Bianchi corresponding vector space. Independence of the vectors e;
identities apply to the latter: in the definition should be understood as independence in a
dO i = Oil\w
]
i -(.lI\W
J'
dO;J = Oik I\wJk -Wik I\ok
.OJ'
corresponding vector space.

In particular, for torsion-free affine connections, when References


[A1) SNAPPER, E. and TROYER, R.J.: Metric affine geometry,
OJ =0, these identities reduce to the following: Academic Press, 1971.

AMS 1980 Subject Classification: 51 N10

57
AFFINE COORDINATE SYSTEM

AFFINE COORDINATE SYSTEM - A rectilinear For references, see Affine differential geometry.
coordinate system in an atTme space. An affine coordi- A.P. Shirokov
nate system on a plane is defined by an ordered pair of AMS 1980 Subject Classification: 53A15
non-collinear vectors e) and e2 (an affine basis) and a
point 0 (the coordinate origin). The straight lines pass- AFFINE DIFFERENTIAL GEOMETRY - The branch
ing through the point 0 and parallel to the basis vec- of geometry dealing with the differential-geometric pro-
tors are known as the coordinate axes. The vectors e) perties of curves and surfaces that are invariant under
and e2 define the positive direction on the coordinate transformations of the affine group or its subgroups.
axes. The axis parallel to the vector e) is called the The differential geometry of equi-affine space has been
abscissa (axis), while that parallel to the vector e2 is most thoroughly studied.
called the ordinate (axis). The affine coordinates of a In an equi-affine plane any two vectors a, b have an
point M are given by an ordered pair of numbers (x, y) invariant (a, b) - the surface area of the parallelogram
which are the coefficients of the decomposition of the constructed on a and b. With the aid of this concept,
vector OM by the basis vectors: the invariant parameter
OM = xel +ye2. s = jl (r,r) 11/3dt,
The first number x is called the abscissa, while the 10

second number y is called the ordinate of M. known as the equi-affine arc length, can be constructed
An affine coordinate system in three-dimensional for a non-rectilinear curve r=r(t). The differential
space is defined as an ordered triplet of linearly- invariant
independent vectors e), e2, e3 and a point O. As in the k = [dds 2r ' .!!!.!.]
ds
2
3
case of the plane, one defines the coordinate axes -
abscissa, ordinate and applicate - and the coordinates is called the equi-affine curvature of the plane curve.
of a point. Planes passing through pairs of coordinate Constant equi-affine curvature characterizes curves of
axes are known as coordinate planes. the second order. A natural equation k = f (s) deter-
A.S. Parkhomenko mines a curve up to an equi-affine transformation. The
vector n =d 2 r / ds 2 is directed along the affine normal
AMS 1980 Subject Classification: 51 N1 0
to a plane curve; the affine normal at a point M, k=j=O,
is the tangent to the locus of the mid-chords of the
AFFINE CURVATURE - A differential invariant of a curve parallel to the tangent at M, and coincides with
plane curve in the geometry of the general aff'me group the diameter of the parabola which has third-order con-
or a subgroup of it. The affine curvature is usually tact with the curve at M.
understood to mean the differential invariant of the Passing to the general affine group, two more invari-
curve in the geometry of the unimodular affine (or ants of the curve are considered: the affine arc length a
equi-affine) group. In this geometry the affine (or, more and the affine curvature K. They can be expressed in
exactly, the equi-affine) curvature of a plane curve terms of the invariants s and k introduced above:
y = y (x) is calculated by the formula
a = jkl/2ds K = _l_.dk
, k 3 / 2 ds'
k = -t[(Y")-2/3f,
(In equi-affine geometry, the magnitudes sand k them-
while the affine (or, more exactly, equi-affine) arc selves are called the affine arc length and the affine
length of the curve is curvature, for the sake of brevity.) The centro-affine arc
s = f(y'')1/3 dx. length, centro-affine curvature, equi-centro-affine arc
length and equi-centro-affine curvature of a plane curve
There is a geometrical interpretation of the affine cur- are constructed in a similar manner.
vature at a point M 0 of the curve: Let M be a point on In equi-affine space it is possible to assign to any
the curve close to M 0, let s be the affine length of the three vectors a, b, c the invariant (a, b, c), which is the
arc MoM and let (J be the affine length of the arc of volume of the oriented parallelepiped defined by these
the parabola tangent to this curve at M 0 and M. The vectors. The natural parameter (equi-affine arc length)
affine curvature at M 0 then is of a curve r=r(t) (rEe 3) is defined by the formula
ko = lim "' /nO(a-s).
M~Mo VS5 s = j I (r, r, r) 11/ 6 dt.
10
In the affine theory of space curves and surfaces there
are also notions of affine curvature which resemble the The differential invariants K=(r', r"', r"''),
respective notions of Euclidean differential geometry. T= -(r", r''', r"''), where the primes denote differentia-

58
AFFINE GEOMETRY

tion with respect to the natural parameter, are called, Two such forms, which satisfy supplementary differen-
respectively, the equi-affine curvature and the equi-affine tial conditions, determine the surface up to equi-affine
torsion of the spatial curve. The study of the curve is transformations. All these statements have appropriate
reduced to selecting some moving frame; the frame generalizations in the multi-dimensional case.
formed by the vectors Many specific classes of surfaces are distinguished in

{r, r, 'r+K! } affine and equi-affine spaces: affine spheres (for which
the affine normals form a bundle), affine surfaces of
revolution (the affine normals intersect one proper or
and defined by the fourth-order differential neighbour-
improper straight line), affine minimal surfaces, etc.
hood of the curve being studied, is especially impor-
In addition to curves and surfaces, other geometrical
tant. The centro-affine theory of spatial curves has been
objects in equi-affine space are. also studied, such as
developed [5].
congruences and complexes of straight lines, vector
The following tensor is constructed for a non-
fields, etc.
developable surface r=r(u 1 , u 2) in equi-affine space:
In parallel with equi-affine differential geometry,
_ aij
development is also in progress of the differential
gij - laI 1 / 4 '
geometry of the general affine group and of its other
where aij =(rJ, r2, rij), a =det(ai), ri = air, rij = aijr. The subgroups both in three-dimensional and in multi-
vector dimensional spaces (centro-affine, equi-centro-affine,
affine-symplectic, bi-affine, etc.).
where V k is the symbol of the covariant derivative with References
respect to the metric tensor gij' determines the direction [I] BLASCHKE, W.: Vorlesungen aber Differentialgeometrie und
geometrische Grundlagen von Einsteins Relativitatstheorie. Affine
of the affine normal to the surface. The affine normal Differentialgeometrie, 2, Springer, 1923.
passes through the centre of the osculating Lie quadric. [2] SALKOWSKI, E.: Affine Differentialgeometrie, W. de Gruyter,
The derivational equations 1934.
IS [3] NORDEN, A.P.: Spaces with an affine connection, Nauka,
ajri = fijr, +gijN Moscow-Leningrad, 1976 (in Russian).
1k [4] LAPTEV, G.F.: 'Differential geometry of multi-dimensional sur-
define an intrinsic connection of the first kind fij of faces', Itogi Nauk. Geom 1963 (1965), 3-64 (in Russian).
[5] SHIROKOV, P.A. and SHIROKOV, A.P.: Differentialgeometrie,
the surface. There also arises at the same time an
2k Teubner, 1962 (translated from the Russian). A P Sh k
.. Iro ov
intrinsic connection of the second kind f ij , defined by
the derivational equations Editorial comments. For the development of affine dif-
2' ferential geometry after W. Blaschke, see [A1].
a/Ii = fijJl, + AijJl, References
where JI is a covariant vector defining the tangent plane [A1] SIMON, U.: 'Zur Entwicklung der affine Differentialgeometrie
nach Blaschke', in Wilhelm Blaschke gesammelte Werke, Vol.
to the surface and subject to the normalization condi-
4, Thales Verlag, 1985, pp. 35-88.
tion NJI= 1. The connections
lk 2k AMS 1980 Subject Classification: 53A 15
fij and fij
AFFINE DISTANCE - An invariant determined by
are conjugate with respect to the tensor gij in the sense
two line elements in an equi-aff'me plane. A point M
of A.P. Norden [3]. The tensor
together with a straight line m passing through it is
TkIJ = .12 [V-g]
IJ
called a line element (M, m). For two line elements
(M, m) and (N, n) the affine distance is 2j1 /3, where f
Ij'

which also plays a major part in projective differential is the surface area of the triangle MNP and P is the
geometry, makes it possible to construct the symmetric
point of intersection of the straight lines m and n. The
covariant tensor
Tijk = gksT'fj . affine distance for two elements tangent to a parabola
The two principal surface forms are also constructed: is equal to the affine arc length of this parabola (cf.
the quadratic form Affine parameter). In the three-dimensional equi-affine
space the affine distance may also be defined in terms
q, = gij du i du j of elements consisting of pairwise incident points,
and the Fubini - Pick cubic form straight lines and planes.
A.P. Shirokov
'" = Tijk du i du j du k .
AMS 1980 Subject Classification: 51 N1 0, 51 N25
These forms are connected by the apolarity condition
gijT;jk = o. AFFINE GEOMETRY - The branch of geometry

59
AFFINE GEOMETRY

whose subject are the properties of figures that are AFFINE MORPIDSM - A morphism of schemes
invariant with respect to affine transformations (cf. f : X ~S such that the pre-image of any open affine
Affine transfonnation). Examples are the simple rela- subscheme in S is an affine scheme. The scheme X is
tion for three points to lie on a straight line, or the called an affine S-scheme.
parallelity of straight lines (planes). A.F. Mobius, in the Let S be a scheme, let A be a quasi-coherent sheaf of
first half of the 19-th century, was the first to study the (') s-algebras and let Ui be open affine subschemes in S
properties of geometric images that go over into each which form a covering of S. Then the glueing of the
other as a result of affine transformations; however, the affine schemes Spec f( Ui , A) determines an affine S-
concept of 'affine geometry' only arose following the scheme, denoted by SpecA. Conversely, any affine S-
appearance in 1872 of the Erlangen program, according scheme definable by an affine morphism f: X ~S is
to which each group of transformations has its own isomorphic (as a scheme over S) to the scheme
geometry, the subject of which are the properties of the SpecJ.( (') x). The set of S-morphisms of an S-scheme
figures that are invariant with respect to the transfor- f: Z ~S into the affine S-scheme SpecA is in bijective
mations of this group. The group of affine transforma- correspondence with the homomorphisms of the
tions contains various subgroups; as a result, in addi- sheaves of (') s-algebras A ~J. ( (') z).
tion to general affine geometry, subordinated Closed imbeddings of schemes or arbitrary mor-
geometries equi-affine geometry, centro-affine phisms of affine schemes are affine morphisms; other
geometry, etc. - arose, corresponding to these groups. examples of affine morphisms are entire morphisms
Affine geometry also deals with problems of differential and finite morphisms. Thus the morphism of normali-
geometry corresponding to specific transformation sub- zation of a scheme is an affine morphism. Under com-
groups (cf. Affine differential geometry). position and base change the property of a morphism
References to be an affine morphism is preserved.
[1] ALEKSANDROV, P.S.: Lectures on analytical geometry, Moscow, References
1968 (in Russian). [1] GROTHENDIECK, A.: 'The cohomology theory of abstract alge-
[2] EFiMOV, N.V.: Hohere Geometrie, Deutsch. Verlag Wissen- braic varieties', in Proc. Internat. Math. Congress Edinburgh,
schaft., 1960 (translated from the Russian). 1958, Cambridge Univ. Press, 1960, pp. 103-118.
E. V. Shikin [2] DIEUDONNE, J. and GROTHENDIECK, A.: 'Elements de
Editorial comments. Two English-language references geometrie algebrique', Pool. Math. IHES 4 (I960).
are [A 1], [A2]. v.I. Danilov
References I. V. Dolgachev
[A 1] BORSUK, K.: Multidimensional analytic geometry, PWN, 1969. Editorial comments. f: X~S is a finite morphism if there
[A2] MESERVE, B.E.: Fundamental concepts of geometry, exist a covering (Sa) of S by affine open subschemes such
Addison-Wesley, 1955.
that f- 1 (Sa) is affine for all IX and such that the ring Ba of
AMS 1980 Subject Classification: 51 N 10 f- 1 (Sa) is finitely generated as a module over the ring Aa of
Sa The morphism is entire if Ba is entire over A a, i.e. if
AFFINE GROUP - The fundamental group of every xEBa integral over A a , which means that it is a root
transformations of an affine space. It is a subgroup of of a monic polynomial with coefficients in A a , or,
the projective group and is represented by those projec- equivalently, if for each xEBa the module Aa[x] is a
tive transformations that map a fixed hyperplane of the finitely-generated module over Aa.
projective space into itself (cf. Projective transfonna- References
tion). [A1] HARTSHORNE, R.: Algebraic geometry, Springer, 1977.
A.P. Shirokov
AMS 1980 Subject Classification: 14A 15, 14E40
AMS 1980 Subject Classification: 51 N1 0
AFFINE NORMAL - A straight line defined in an
AFFINE HULL of a set M in a vector space - The affine-invariant manner at each point of a hypersurface
intersection of all translated subspaces containing M. in an affine space with the aid of the third-order dif-
V.A. Zalgaller ferential neighbourhood of this hypersurface; it is
essential that the principal quadratic form of the hyper-
AMS 1980 Subject Classification: 15A03, 51 N1 0
surface does not degenerate. The affine normal at a
AFFINE MINIMAL SURFACE - A surface whose point M of a plane curve coincides with the diameter of
affine curvature is zero. As distinct from ordinary the parabola that has third-order contact with the curve
minimal surfaces, which consist of saddle points only, at M. If tangent hyper-quadrics are employed, a similar
an affine minimal surface may contain elliptic points as interpretation can be given to the affine normal to a
well. Thus, an elliptic paraboloid consists only of ellip- hypersurface. In particular, the affine normal of a
tic points, and is an affine minimal surface. hyper-quadric coincides with its diameter.
E. V. Shikin A. P. Shirokov
AMS 1980 Subject Classification: 53A 15 AMS 1980 Subject Classification: 53A 15

60
AFFINE SCHEME

AFFINE PARAMETER, affine arc length - A parame- AFFINE SCHEME - A generalization of the concept
ter on a curve which is preserved under transformations of an affine variety, which plays the role of a local
of the affine group, for the determination of which the object in the theory of schemes. Let A be a commuta-
derivatives of the position vector of the curve of the tive ring with a unit. An affine scheme consists _of a
lowest order must be known. The best known is the topological space SpecA and a sheaf of rings A on
parameter which is invariant with respect to the equi- SpecA. Here, SpecA is the set of all prime ideals of A
affine transformations, i.e. with respect to the affine (called the points of the affine scheme), provided with
unimodular group. For a plane curve r=r(t) this affine the Zariski topology (or, which is the same, with the
parameter is computed by the formula: spectral topology), in which the basis of open sets is
constituted by the subsets D(f)={pESpecA: fflp},
s = 11 (r,r) 11/3dt, where f runs t~ough the elements of A. The sheaf of
to
local ~gs A is defined by the condition
where (r, r) is the skew product of the vectors rand r. f(D(f),A) = A f , where Af is the localization of the
In particular, for the affine length of an arc MoM 1 of a ring A with respect to the multiplicative system
parabola, the affine p'arameter is s = 2l /3, where f is {i}n;;;.O, cf. Localization in a commutative algebra.
the area of the triangle formed by the chord MoM 1 Affine schemes were first introduced by A. Grothen-
and the tangents to the parabola at the points M 0 and dieck [1], who created the theory of schemes. A scheme
MI' It is possible to introduce in a similar manner the is a ringed space which is locally isomorphic to an
affine parameter of a space curve in the geometry of affine scheme.
the general affine group or anyone of its subgroups.
A,P' Shirokov An affine scheme SpecA is called Noetherian
(integral, reduced, normal, or regular, respectively) if the
Editorial comments. The arc length given by the formula
ring A is Noetherian (integral, without nilpotents,
above is sometimes refered to as the special affine arc
integrally closed, or regular, respectively). An affine
length.
The notion of an 'affine parameter' is also used in the
scheme is called connected (irreducible, discrete, or
theory of geodesics. An affine parameter of a geodesic (an quasi-compact, respectively) if the topological space
autoparaliel curve) of an affine connection is a parametriza- SpecA also has these properties. The space SpecA of
tion x(t) of the geodesic such that for the corresponding an affine scheme is always compact (and usually not
covariant derivative \l the equation Hausdorff).
\l ;(t)x(t) =0 The affine schemes form a category if the morphisms
is valid (cf. [A3]). of these schemes, considered as locally ringed spaces,
References
are considered as morphisms of affine schemes. Each
[A1] SPIVAK, M.: Differential geometry, 2, Publish or Perish, 1970. homomorphism of rings </>; A ~B define_s a morphism of
[A2] BUCHIN, S.: Affine differential geometry, Science Press; Gor- affine schemes: (SpecB, B)~(SpecA, A), consisting of
don and Breach, 1983.
the continuous mapping a</>: SpecB~SpecA
[A3] O'NEILL, B.: Semi-Riemannian geometry, Academic Press,
1983. (a<f>(p)=</I- 1(p) for _p~SpecB)! and a homomorphism of
sheaves of rings </I: A ~a</l:(B), which transforms the
AMS 1980 Subject Classification: 53A 15, 53B05
section a / f of the sheaf A over the set D (f) into the
section <f>(a) / <P(f). The morphisms of an arbitrary
AFFINE PSEUDO-DISTANCE The number
scheme (X, f) x) into an affine scheme (SpecA, A)
p=(MM', t), equal to the modulus of the vector pro-
(which are also called X-valued points of SpecA) are in
duct of the vectors MM' and t, where M' is an arbi- a one-to-one correspondence with the homomorphisms
trary point in an equi-affme plane, M is a point on a of rings A~f(X, f) x); thus, the correspondence
plane curve r = r(s), s is the affine parameter of the
A ~(SpecA, A) is a contravariant functor from the
curve and t=dr / ds is the tangent vector at the point category of commutative rings with a unit into the
M. This number p is called the affine pseudo-distance
category of affine schemes, which establishes an anti-
from M' to M. If M' is held fixed, while M is moved
equivalence of these categories. In particular, in the
along the curve, the affine pseudo-distance from M' to
category of affine schemes there are finite direct sums
M will assume a stationary value if and only if M' lies
and fibre products, dual to the constructions of the
on the affine normal of the curve at M. An affine
direct sum and the tensor product of rings. The mor-
pseudo-distance in an equi-affine space can be defined
phisms of affine schemes which correspond to surjective
in a similar manner for a given hypersurface.
homomorphisms of rings are called closed imbeddings of
A.P. Shirokov
affine schemes.
AMS 1980 Subject Classification: 53A15, 51 N25 The most important examples of affine schemes are

61
AFFINE SCHEME

affine varieties; other examples are affine group point, which is denoted by a + I, i.e. the additive group
schemes (cf. Group scheme). of vectors of the space L acts freely and transitively on
In a manner similar to the construction of the sheaf the affine space corresponding to L.
A it is p~ssible to ~nstruct, for any A-module M, a Examples. 1) The set of the vectors of the space L is
sheaf of A-modules M on SpecA for which the affine space A (L); the space associated to it con-
cides with L. In particular, the field of scalars is an
f(D(j), M) = Mf = M AAf . affine space of dimension 1. If L =k n, then A (kn) is
Such sheaves are called quasi-coherent. The category of called the n-dimensional affine space over the field k, and
A-modules is equival~ent to the category of quasi- its points a=(a1,'" ,an) and b=(b 1, ... ,bn) deter-
coherent sheaves of A-modules on SpecA; projective mine the vector ab=(b 1 -a1,' .. ,bn-an)
modules correspond to locally free sheaves. The coho- 2) The complement of any hyperplane in a projective
mology spaces of quasi-coherent sheaves over an affine space over the field k is an affine space.
scheme are described by Serre's theorem: 3) The set of solutions of a system of linear (alge-
Hq(SpecA, M) = 0 if q>O. braic or differential) equations is an affine space the
associated space of which is the space of solutions of
The converse of this theorem (Serre's criterion for affin-
the corresponding homogeneous set of equations.
ity) states that if (X, (!) x) is a compact separable
A subset A' of an affine space A is called an affine
scheme, and if H1(X, F)=O for any quasi-coherent
subspace (or a linear manifold) in A if the set of vectors
sheaf of (!) x-modules F, then X is an affine scheme.
ab, a, b EA " forms a subspace of L. Each affine sub-
Other criteria for affinity also exist [1], [4].
space A' CA has the form a+L' ={a+/: IEL'}, where
References L' is some subspace in L, while a is an arbitrary ele-
[I] GROTHENDIECK, A. and DIEUDONNE, J.: 'Elements de ment of A'.
A mapping J :A 1 ~A 2 between affine spaces A 1 and
geometrie algebrique', Publ. Math. IHES 4 (1960).
[2] DIEUDONNE, J.: 'Algebraic geometry', Adv. in Math. 1 (1969),
233-321. A 2 is called affine if there exists a linear mapping of
[3] MANIN, Yu.I.: Lectures on algebraic geometry, 1, Moscow, 1970 the associated vector spaces cp: L 1~L2 such that
(in Russian).
J(a+I)=J(a)+<p(I) for all aEA 1, IEL 1. A bijective
[4] GooDMAN, J. and HARTSHORNE, R.: 'Schemes with finite-
dimensional cohomology groups', Amer. J. Math. 91 (1969), affine mapping is called an affine isomorphism. All
258-266. affine spaces of the same dimension are mutually iso-
v.I. Danilov
morphic.
I. V. Dolgachev
The affine isomorphisms of an affine space A into
Editorial comments. Reference [A1] is, of course, stan- itself form a group, called the affine group of the affine
dard. It replaces [3]. An alternative to [1] is [A2].
space A and denoted by Aff(A). The affine group of
References the affine space A (kn) is denoted by Affn(k). Each ele-
[A1] HARTSHORNE, R.: Algebraic geometry, Springer, 1977,
[A2] GROTHENDIECK, A. and DIEUDONNE, J.: Elements de
ment JEAffn(k) is given by a formula
geometrie algebrique, 1, Springer, 1971,
fu], ... ,an = (b l , ,bn),
AMS 1980 Subject Classification: 14A15, 14L17,
14K99 where
b, = ~a1aj +c;,
AFFINE SPACE over a field k - A set A (the ele- j

ments of which are called the points of the affine (a{) being an invertible matrix. The affine group Aff(A)
space) to which corresponds a vector space Lover k contains an invariant subgroup, called the subgroup oj
(which is called the space associated to A) and a map- (parallel) translations, consisting of the mappings
ping of the set A XA into the space L (the image of an J: A ~A for which cp: L~L is the identity. This group
element (a, b)EA XA is denoted by ab and is called the is isomorphic to the additive group of the vector space
vector with beginning in a and end in b), which has the L. The mapping J~CP defines a surjective homomor-
following properties: phism of Aff(A) into the general linear group GL, with
a) for any fixed point a the mapping x~a1, x EA, is the subgroup of parallel translations as kernel. If L is a
a bijection of A on L; Euclidean space, the pre-image of the orthogonal group
b) for any points a, b, c EA the relationship is called the subgroup oj Euclidean motions. The pre-
ab+bt:+w = 0, image of the special linear group SGL is called the
equi-affine subgroup (d. Affine unimodular group). The
where -0 denotes the zero vector, is valid. The dimen- subgroup Ga CAffn(A) consisting of the mappings
sion of L is taken for the dimension of the affine space f:A~A such that J(a +l)=a+<p(l) for a given aEA
A. A point a EA and a vector IE L define another and arbitrary I EL is called the centro-affine subgroup; it

62
AFFINE TRANSFORMA nON

is isomorphic to the general linear group GL of the AFFINE TORSION - One of the differential invari-
space L. ants of a curve in equi-affine space:
In algebraic geometry an affine algebraic set is some- T = - (r", r"', r"''),
times called an affine space. A finite-dimensional affine
space can be provided with the structure of an affine where the primes denote differentiation with respect to
the affine parameter of the position vector of the curve.
variety with the Zariski toplogy (cf. also Affine scheme).
Affine spaces associated with a vector space over a Similar concepts are introduced for curves in spaces
with other fundamental groups such as centro-affine.
skew-field k are constructed in a similar manner.
A.P. Shirokov
References
[I] BOURBAKI, N.: Elements 0fmathematics. Algebra: Algebraic Editorial comments.
structures. Linear algebra, I, Addison-Wesley, 1974, References
Chapt.l; 2 (translated from the French). [1] BUCHIN, S.: Affine differential geometry, Science Press; Gor-
l. V. Dolgachev
don and Breach, 1983.
A.P. Shirokov
Editorial comments. An affine isomorphism is also AMS 1980 Subject Classification: 53A 15
called an affine collineation. An equi-affine group is also
called a Euclidean group. AFFINE TRANSFORMATION of a Euclidean space -
References A one-to-one point mapping of the plane or space into
[A1] BERGER, M.: Geometry, 1, Springer, 1987, Chap!. 2 itself, where to three points lying on a straight line
(translated from the French).
correspond three points also lying on a straight line.
AMS 1980 Subject Classification: 51 N1 0, 15A03 Thus, by an affine transformation straight lines are
transformed into straight lines. An affine transforma-
AFFINE SPHERE - A surface the affine normals (cf. tion of the plane transforms intersecting lines into
Affine normal) of which intersect at one point. An intersecting lines, and parallel lines into parallel lines.
affine sphere is said to be improper if this point is infin- An affine transformation of space results in an affine
itely distant (or improper (ideal; otherwise it is proper. mapping of each plane onto some plane; intersecting
In particular, any surface of the second order is an planes are mapped into intersecting planes, while paral-
affine sphere. An improper convex affine sphere is an lel planes are mapped into parallel planes. Moreover,
elliptic paraboloid. the mutual location of two straight lines is preserved:
E. V. Shikin
intersecting straight lines are mapped into intersecting
AMS 1980 Subject Classification: 53A 15
straight lines, parallel straight lines are mapped into
AFFINE TENSOR - An element of the tensor pro- parallel straight lines, while skew lines are mapped into
duct of p copies of an n-dimensional vector space En skew lines.
and q copies of the dual vector space E~. Such a tensor Under an affine transformation the ratio between
is said to be of type (p, q), the number p + q defining directed segments lying on the same line or on parallel
the valency, or degree, of the tensor. Having chosen a straight lines is equal to the ratio of their images. The
basis {e;} in En, one defines an affine tensor of type ratio between the surface areas of two measurable fig-
(p, q) with the aid of nP +q components 1)', '..... J, which ures (on the Euclidean plane) and the ratio between the
transform as a result of a change of basis e; = A Jes volumes of two measurable bodies (in the Euclidean
according to the formula space) are preserved as well. Under an affine transfor-
mation a set of vectors in the plane (in space) is one-
r", ...... ~ =A'I, "'A'ipAt, "'A/qT.' 'Sp
} 1}q SI Sp JI Jq tI ... 1'1 ' to-one mapped on a set of vectors in the plane (in
where AjA; =8). It is usually said that the tensor com- space), and this mapping is linear. An affine transfor-
ponents undergo a contravariant transformation with mation is defined in an affine coordinate system by a
respect to the upper indices, and a covariant transfor- non-degenerate (non-homogeneous) linear transforma-
mation with respect to the lower. tion; thus, in the case of a plane, an affine transforma-
A.P. Shirokov
tion is analytically expressed by the formulas
Editorial comments. An affine tensor as described
above is commonly called Simply a tensor. x = CIIX +CIV +C13,
References y' = C21 X + C2V+ C23,
[A 1] DUBROVIN, B.A., FOMENKO, A.T. and NOVIKOV, S.T.: Modern with the supplementary condition
geometry - methods and applications, Springer, 1984
(translated from the Russian).
[A2] GREUB, W.H.: Multilinear algebra, Springer, 1967.
[A3] DODSON, c.T.J. and POSTON, T.: Tensor geometry, Pitman,
1977.
Affine transformations in space are defined in a similar
AMS 1980 Subject Classification: 15A69, 53A45 manner.

63
AFFINE TRANSFORMATION

Under an affine transformation an algebraic curve is one obtains a centro-affine unimodular group of
converted into an algebraic curve, the order of the transformations isomorphic to the group of all matrices
curves being preserved. In particular, a curve of the of order n with determinant equal to one. Such a
second order is converted into a curve of the second group of matrices is called the unimodular group or spe-
order, moreover, ellipses are converted into ellipses, ciallinear group of order n and is denoted by SL(n).
hyperbolas into hyperbolas, parabolas into parabolas, A.P. Shirokov
etc. AMS 1980 Subject Classification: 20H05
Examples of affine transformations: isometric
transformations, similarity transformations, and uni-
AFFINE VARIETY, affine algebraic variety - A gen-
form contractions of a plane to a straight line. Each
eralization of the concept of an affme algebraic set. An
affine transformation in a plane is a product of an
affine variety is a reduced affme scheme X of finite
isometric transformation and two uniform contractions
type over a field k, i.e. X = SpecA, where A is a com-
to two mutually-perpendicular straight lines. Each
mutative k-algebra of finite type without nilpotent ele-
affine transformation in a space is a product of an
ments. The affine variety X=Speck[T 1 , . . , Tn],
isometric transformation and three uniform contrac-
where k [T 1, . . . , Tn] is the ring of polynomials over k,
tions to three pairwise-perpendicular planes.
is called affine space over k and is denoted by AZ. An
The affine transformations form a group; the similar-
affine scheme is an affine variety if and only if it is iso-
ity transformations constitute a subgroup of this group;
morphic to a reduced closed subscheme of an affine
the set of isometric transformations is a subgroup of
space. Each system of generators Xl, . . . ,Xn of a k-
the group of similarity transformations.
algebra A defines a surjective homomorphism
Affine transformations are the most general one-to-
one mappings of a plane (a space) into itself under
cp: k[TJ, ... ,Tn]~A, defined by the formula CP(Ti)=Xi'
Let "k be the algebraic closure of k. The subset of the
which straight lines are preserved.
set "k n consisting of the common zeros of all the poly-
References nomials of the ideal ker cp is an affine algebraic set over
(1) ALEKSANDROV, P.S.: Lectures on analytical geometry, Moscow,
1968 (in Russian). k. The coordinate ring of such an affine algebraic set is
[2) POSTNIKOV, M.M.: AnalytiC geometry, Moscow, 1973 (in Rus- isomorphic to the ring A. Each affine algebraic set over
sian).
A.S. Parkhomenko k in turn defines an algebraic variety Speck[X], where
k[X] is the coordinate ring of X. The set of points of
Editorial comments. In terms of coordinates like above
similarity transformations take the form
an affine variety is in a one-to-one correspondence with
the irreducible subvarieties of the corresponding affine
x' = C11X+C12Y+C13, Y' = -ec12x+ec11y+c23
algebraic set.
Similitudes, or homothetic transformations, which in terms of To each affine variety X=SpecA a functor on the
coordinates take the form x' = C11 x+ C13, i
= C22Y+ C23' category of k-algebras is assigned. It is defined by the
constitute another interesting subgroup of the group of
correspondence:
affine transformations.
References B ~ X(B) = Homk -alg(A, B).
[A 1) COXETER, H.S.M.: Introduction to geometry, Wiley, 1961. If ~=k (respectively, if B=k), the elements of the set
[A2] MESERVE, B.E.: Fundamental concepts of geometry,
Addison-Wesley, 1955. X(k) (respectively, X(k)) are called ge<!..metric (respec-
tively, rational) points of X. The set X (k) is in a bijec-
AMS 1980 Subject Classification: 51 N1 0
tive correspondence with the set of maximal ideals
Specm (A) of the ring A, and with the set of points of
AFFINE UNIMODULAR GROUP, equi-affine group -
an algebraic set V whose coordinate ring is isomorphic
The subgroup of the general affine group consisting of
to A. The spectral topology in the space X induces on
the affine transformations of the n-dimensional affine
the everywhere-dense subset Specm (A ) a topology
space
X 1-+ X = Ax+a (*) which corresponds to the Zariski topology on V.
that satisfy the condition det(A) = 1. If the vectors x I. V. Dolgachev
and x are interpreted as rectangular coordinates of Editorial comments. Frequently the name 'variety'
points in the n-dimensional Euclidean space En, then means a reduced and irreducible scheme of finite type over
the transformation (*) will preserve the volumes of n- an algebraically closed field.
dimensional domains of P. This makes it possible to References
introduce the concept of volume in an equi-affine [A 1] SHAFAREVICH, I.R.: BaSIC algebraic geometry, Springer, 1977
(translated from the Russian).
space, which is a space with a fundamental affine uni-
modular group. If, in formulas (*), one puts a = 0, then AMS 1980 Subject Classification: 14A 10

64
AIRY FUNCTIONS

AFFINITY - A brief synonym for an affine transfor- so are w( wz) and w( w2 z), where w = e 2m j3, and any
mation, defined as a non-degenerate linear substitution two of these solutions are linearly independent. The fol-
X' = A~xs +a'. lowing identity holds:
A.P. Shirokov w(z)+w(wz)+w(w 2z) _ O.
AMS 1980 Subject Classification: 51 N10
References
[1) AIRY, G.B.: 'On the intensity of light in the neighbourhood of
AFFINOR - An affine tensor of type (1, 1). Specify- a caustic', Trans. Cambridge Phi/os. Soc. 6 (1838), 379-402.
ing an affinor with components f} is equivalent to [2) BABICH, V.M. and BULDYREv, V.S.: Asymptotic methods in
specifying an endomorphism of the vector space problems of diffraction of short waves, Moscow, 1972 (in Rus-
according to the rule vi =/1 vS To the iden ti ty sian).
[3) ABRAMOWITZ, M. and STEGUN, l.A. (EDS.): Handbook of
endomorphism there corresponds a unique affinor. The mathematical functions, Appl. Math. Series, Vol. 55, Nat.
correspondence by which the matrix If} I is assigned Bureau of Standards, 1964.
M. V. Fedoryuk
to each affinor realizes an isomorphism between the
algebra of affinors and the algebra of matrices. An affi- AMS 1980 Subject Classification: 34A30
nor is sometimes defined in the literature as a general
(affine) tensor. AIRY FUNCflONS - Particular solutions of the Airy
A.P. Shirokov
equation.
Editorial comments. I.e. one is concerned here with the The first Airy function (or simply the Airy function)
isomorphism VI8i V~End(V) of linear algebra. is defined by

Icos [3-T +
AMS 1980 Subject Classification: 15A69, 53A45
Ai(x) = -;1
00
]
xt dt.
AFFIx OF A COMPLEX NUMBER Z = a + bi in its
geometric representation - The point in the complex For complex values of z
plane corresponding to this number, i.e. the point with
Cartesian coordinates (a, b). The affix is sometimes 1 [exp [
Ai(z) = 2", zt-t3 ] dt,
3

identified with the complex number itself.


where y = (ooe -2?Ti /3,0] U [0, + 00) is a contour in the
AMS 1980 Subject Classification: 30-01
complex t-plane. The second Airy function is defined
AIRY EQUATION - The second-order linear ordi- by
Bi(z) = iw
2 Ai(w2Z)-iwAi(wz), w 27Ti 3 =e / .
nary differential equation
The functions Ai(x) and Bi(x) are real for real x.
y" -xy = O. A second collection of Airy functions was introduced
It occurred first in G.B. Airy's research in optics [1]. Its by V.A. Fock [V.A. Fok]:
general solution can be expressed in terms of Bessel
functions of order +- I / 3: v(z) = V; Ai(z),

w,(z) = 2e i7T / 6v(wz),


y(x) = C,Vxl'/3 [fiX3/2]+C2Vxl_,/3 [fiX3/2].
W2(Z) = 2e- i7T / 6v(w-'z);
Since the Airy equation plays an important role in vari- in this case v(z) is called the Airy- Fok function
ous problems of physics, mechanics and asymptotic (Airy- Fock function). The following identities hold:
analysis, its solutions are regarded as forming a distinct
w,(Z)-W2(Z) --
class of special functions (see Airy functions). v(z) = 2i ,w,(z) = W2(Z)' (1)
The solutions of the Airy equation in the complex
plane z, Any two of v(z), wJ(z) and W2(Z) are linearly indepen-
w"-zw = 0, dent.
have the following fundamental properties: The most important Airy function is v(z) (or Ai(z.
I) Every solution is an entire function of z and can Its asymptotic behaviour on the real axis is given by
be expanded in a power series
vex) = ~ X-.;!;4 exp[-fx3/2][l+O(x-3/2)], x~+oo,
w(z) = w(o) (l+~+ Z6
23 (2,3)'(5'6)
+ ... ] +
v(x) = IX~/4 [Sin[tlxI3/2+~]+O(IXI-3/2)J.
+w'(O) [z+ ;.: + (3.4;.:6'7) + ... ], x~-oo,


which converges for all z.
2) If w(z) 'jE is a solution of the Airy equation, then
so v(x) decreases rapidly for x >0, x I and oscillates
strongly for x<O, I x I 1. The functions Wl(X) and

65
AIRY FUNCTIONS

W2(X) increase exponentially as x~+oo. For complex where q(x) is a smooth real-valued function on the
z the Airy functions have the following asymptotic interval I=[a, b] and A>O is a large parameter. The
expansions as I z I~ 00: zeros of q(x) are called turning points (or transfer

v(z) '" 2~z-'/4exp [_f 3/2] x


z (2)
points) of the equation (3). Let
a < Xo < b, q(xo) = 0, q' (xo) =1=
(such a point is called simpfe),
for I argz I,,;;;;,w-,
00
x ~ (-Iranz -3n /2
n =0
q(x) =1= for xEI, x=l=xo, q'(xo) > O.
Set

00
X ~ anz -3n /2
n =0
for I argz - -
W

3
I ";;;;'W-, ((x) ~ [~!. Vq(t) dtr" ';80 ((x) ~ ';8o(x - x,>
where Yo(x) = ((x-'/2Ai(-A2 / 3x,
_ r [3n+t]9- n Y,(x) = ((x-'/2Bi(-A2 / 3X.

an - (2n)! Equation (3) has linearly independent solutions Yo(x)


andYI(x) such that, as A~+oo,
The asymptotic expansion of W2(Z) is of the form (2),
but it is valid in the sector
Yi x ) = Yi x ) [1+0 [~ J]. a";;;;'x";;;;'xo, j=O, I,

iarg [z+;] I,,;;;;, W-.

yo(x) = Yo(x) [I+O[ ~]]+Y,(x)o[f],


Here E(O,77) is arbitrary, the branches of and Vz
z 1 / 4 are positive on the semi-axis (0, 00), and the
asymptotic expansions are uniform with respect to argz y,(x) = Y,(x) [I +0 [~]] + Yo(x)O [fJ,
and can be differentiated term by term any number of
Xo ,,;;;;, x , ;;;;, b,
times. In the remaining sector I arg - z I < the asymp-
totic expansion of v(z) is expressed in terms of those uniformly with respect to x.
of WI (z) and W 2 (z) by means of (1); hence, the asymp- This result has been generalized in various directions:
totic expansion of v(z) has a different form in different asymptotic series have been obtained for the solutions,
sectors of the complex z-plane. This fact was first esta- the case q = q(x, A) has been studied (for example, if
blished by G.G. Stokes [2] and is called the Stokes q(x, A) can be expanded in an asymptotic series
phenomenon. q'" '2/::=OA -nqn(x) as A~+ 00), and the asymptotic
The Airy functions occur in the study of integrals of behaviour of the solutions near multiple turning points
rapidly-oscillating functions, of the form has been investigated. Other generalizations concern the
b
equation w" +A2 q(X)W = 0, (4)
leA, a) = jtfAS(x,a)f(x, a)dx,
a where the function q(z) is analytic in a domain D of
for A>O, A~oo. Here f and S are smooth functions, S the complex z-plane. Let f be the maximal connected
is real and 0: is a real parameter. If for small values of component of the level line
0:;;00 the phase S has two close non-degenerate station- Z

ary points XI(O:) and X2(0:) that coincide for 0:=0, for Re j v'q(t) dt = 0,
example, if Zo

emanating from a turning point z 0 and containing no


sex, a) = ax - x 3 + 0(X4) as x-)oO,
other turning points; then f is called a Stokes line. If
then for small values of 0:;;00, as A--+ + 00, the contribu- q = - z (that is, (4) is the Airy equation), then the
tion to the asymptotics of the integral coming from a Stokes lines are the rays (- 00,0) and (0, ei1T /3).
neighbourhood of the point x = 0 can be expressed in Analogously, if zo is a simple turning point of (4), then
terms of the Airy function v and its derivative (see [6]). there are three Stokes lines fl' f2 and f3 emanating
Integrals of this kind occur in the study of short-wave from it and the angle between adjacent lines at z 0 is
fields near a simple focus (see [7] and [8]); the Airy equal to 277 /3. Let Si be a neighbourhood of z 0 from
functions arose in connection with the study of this which a neighbourhood of the Stokes line fi' j = 1, 2, 3,
problem [1]. has been removed. For a suitable numbering of the Si'
Consider the second-order differential equation equation (4) has three solutions w/z), j = 1, 2, 3, such
y" +A 2q(X)y = 0, (3) that, as ;\--+ + 00,

66
ALBANESE VARIETY

values of two interpolation polynomials of consecutive


degrees coincide in the required number of decimal
for Z ESj. places. The Aitken scheme is convenient for interpolat-
The Airy functions also occur in the study of asymp- ing the values of a function given in the form of a table
totic solutions of ordinary differential equations and (of values), by renumbering the interpolation nodes in
systems of higher order near simple turning points. the order in which I x - Xi I increases.
References References
[I] AIRY, G.B.: Trans. Cambridge Phi/os. Soc. 6 (1838), 379-402. [I] BEREZIN, I.S. and ZHIDKOV, N.P.: Computing methods, I, Per-
[2] STOKES, G.G.: Trans. Cambridge Phi/os. Soc. 10 (1857), 105- gamon Press, 1973 (translated from the Russian).
(2) BAKHVALov, N.S.: Numerical methods: analysis, algebra, ordi-
128.
[3] Fmc, V.A.: 'Tables of the Airy functions', Moscow, 1946 nary differential equations, Mir, 1977 (translated from the Rus-
(in Russian). sian).
M,K Samarin
[4] SOOUN, A. and ABRAMOWITZ, M.: Handbook of mathematical
functions, Appl. Math. Ser., Vol. 55, Nat. Bur. Stand., 1970. Editorial comments. Aitken published the gist of his
[5] BABICH, V.M. and BULDYREv, V.S.: Asymptotic methods in method in [A1]. Aitken's scheme is disadvantageous when a
problems of diffraction of short waves, Moscow, 1972 (in Rus- number of interpolations must be carried out over the same
sian).
[6] FEDORYUK, M.V.: The saddle-point method, Moscow, 1977 (in
range. In such cases, an alternative to the computation of
Russian). the Lagrange polynomials is the use of divided differences
[7] LANDAU, L.D. and LIFSHITS, E.M.: The classical theory of in the construction of Newton's interpolation formula.
fields, Addison-Wesley, 1951 (translated from the Russian).
References
[8] MASLOV, V.P. and FEDORYUK, M.V.: Quasi-classical approxi-
[A1] AITKEN, A.c.: 'On interpolation by iteration of proportional
mation for the equations of quantum mechanics, Moscow, 1976
parts, without the use of differences', Proc. Edingburgh
(in Russian).
Math. Soc. 3, no. 2 (1932), 56-76.
[9] DORODNITSYN, A.A.: 'Asymptotic laws of distribution of the
[A2] HILDEBRAND, F.B.: Introduction to numerical analysis,
characteristic values for certain types of second-order differen-
McGraw-Hili, 1974.
tial equations', Uspekhi Mat. Nauk 6, no. 7 (1952),3-96 (in
Russian). AMS 1980 Subject Classification: 65D05
[10] WASOW, W.: Asymptotic expansions for ordinary differential
equations, Interscience, 1965.
(II] FEDORYUK, M.V.: Asymptotic methods for linear ordinary dif- ALBANESE VARIETY - An Abelian variety Alb(X)
ferential equations, Moscow, 1983 (in Russian). canonically attached to an algebraic variety X, which is
M. V. Fedoryuk the solution of the following universal problem: There
Editorial comments. The Airy function can be expressed exists a morphism cJ>: X ~Alb(X) such that any mor-
in terms of modified Bessel functions of the third kind: phism f : X ~A into an A~elian variety A factors into a
product f= fcJ>, where f:A~Alb(X) (so named in
Ai(x) = ?T~ VxK, j3 [ ; X2/3]. honour of G. Albanese). If X is a complete non-
singular variety over the field of complex numbers, the
The function Ai(z) satisfies the differential equation
Albanese variety can be described as follows. Let [21 be
w"(z)=zw(z), cf. [A2].
the space of everywhere-regular differential forms of
References degree I on X. Each one-dimensional cycle y of the
[A1] OLVER, F.W.J.: Asymptotics and special functions, Acad.
topological space X determines a linear function
Press., 1974.
[A2] LEBEDEV, N.N.: Special functions and their applications, 1
W~ w on [21. The image of the mapping
y
HI(X, Z)~([2I) thus obtained is a lattice r in ([21)*,
Dover, reprint, 1972 (translated from the Russian).

AMS 1980 Subject Classification: 33A40 and the quotient space ([21)* / r coincides with the
Albanese variety of X. From the algebraic point of
AITKEN SCHEME - A method for computing the view, an Albanese variety may be considered as a
value at a point x of the interpolation polynomial method of defining an algebraic structure on some quo-
4(x) with respect to the nodes Xo, ... ,Xn, based on tient group of the group Z of zero-dimensional cycles
the successive application of the formula of degree 0 on X. If X is a non-singular complete alge-
Lk(X) = L(o,. ,k)(X) = (*) braic curve, both its Picard variety and its Albanese
variety are called its Jacobi variety. If the ground field
1 IL(O, ... ,k-l)(X) xo-XI has characteristic zero, then the equalities
Xk-XO L (l, ... ,k) (x) xk-x,k=1,2, ... ,
dimAlb(X) = dimk HO(X, Q.\-) = dimk H1(X, @ x)

where L(i, ... ,m)(x) is the interpolation polynomial with are valid. The number dimAlb(X) is called the irregu-
interpolation nodes Xi, .. , X m , in particular, larity of the variety X. If the field has finite characteris-
L(i)(x) = f(Xi) (see Interpolation fonnula). The process tic, the inequalities
of computations by means of (*) may finish if the irr(X) ,;;;;; dimHo(X, Q.\-) and irr(X)';;;;; dimH1(X, @ x)

67
ALBANESE VARIETY

hold. If the ground field has positive characteristics it addition are equal to the respective limit of the homol-
can happen that ogy (cohomology) groups of the compacta X).. The
dimHl(X, tJ x) *- dimHo(X, n.\} Aleksandrov-Cech theory is the only theory satisfying
the Steenrod - Eilenberg axioms (with the exception
The Albanese variety is dual to the Picard variety. indicated above) and this condition of continuity. On
References the category of paracompact spaces, the usual charac-
[1) BALDASSARRI, M.: Algebraic varieties, Springer, 1956. terization by mappings into Eilenberg - MacLane
[2) LANG, S.: Abelian varieties, Springer, 1983.
spaces is valid for cohomology; while the cohomology
A.N. Parshin
itself is equivalent to the cohomology defined in sheaf
AMS 1980 Subject Classification: 14K30, 14KXX theory. Cohomology may also be defined as cohomol-
ogy of some cochain complex, which makes it possible
ALBEDO METHOD (in transport theory) - A to operate with sheaves of cochains. Similar ideas,
method of solving boundary value problems for a tran- applied to homology, are contained in the homology
sport equation. The Albedo method is a variant of the theory originating from N. Steenrod, A. Borel and oth-
matrix factorization method, the role of factorization ers, which satisfies all axioms including the exactness
coefficients being played by the matrices of reflection axiom (but ty.e property of continuity is lost).
and transmission of a sequence of layers of increasing Aleksandrov-Cech homology and cohomology, includ-
thickness. Practically speaking, the Albedo method is ing the above modification, are employed in homologi-
used only in problems in one-dimensional spaces, both cal problems in the theory of continuous mappings, in
to find the reflection and transmission coefficients and the theory of transformation groups (a connection with
to find the solution of the transport equation in a quotient spaces), in the theory of generalized manifolds
medium. (in particular, in various duality relations), in the
References theory of analytic spaces (e.g. in defining the funda-
[1) BEREZIN, I.S. and ZHIDKOY, N.P.: Computing methods, Per- mental classes of homology), in homological dimension
gamon Press, 1973 (translated from the Russian). theory, etc.
[2) GERGOMENOYA, T.A., ET AL.: Al'bedo neutrons, Moscow, 1973
(in Russian). References
TA. Germogenova [1) ALEKSANDROFF, P.S. [P.S. Ai.EKSANDROY): 'Untersuchungen
AMS 1980 Subject Classification: 35-XX, 82A70 tiber Gestalt und Lage abgeschlossener Mengen beliebiger
Dimension', Ann. of Math. (2) 30 (1929), 101-187.
[2) CECH, E.: 'Theorie genet-ale de l'homologie dans un espace
ALEKSANDROV-CECH HOMOLOGY AND que1conque', Fund Math. 19 (1932), 149-183.
COHOMOLOGY, spectral homology and cohomology - [3) STEENROD, N.E. and E1LENBERG, S.: Foundations of algebraiC
Homology and cohomology theories which satisfy all topology, Princeton Univ. Press, 1966.
[4) SKLYARENKO, E.G.: 'Homology theory and the exactness
Steenrod - Eilenberg axioms (except, possibly, the axiom', Russian Math. Surveys 24, no. 5 (1969), 91-142.
exactness axiom) and a certain continuity condition. (Uspekhi Mat. Nauk 24, no. 5 (1969), 87-140)
The Aleksandrov-Cech homology groups (modules) E. G. Sklyarenko
Hn(X, A ; G) [1], [2] are defined as the inverse limit
Editorial comments. Often one ~peaks also of Cech
lim..... Hn(a, a'; G) over all open coverings a of the space cohomology instead of Aleksandrov-Cech cohomology.
X; here a signifies not only the covering, but also its
nerve, and a' is the subcomplex in a that is the nerve AMS 1980 Subject Classification: 55N05
of the restriction of a on the closed set A (cf. Nerve of
a family of sets). The possibility of passing to the limit ALEKSANDROV COMPACflFICATION, Aleksan-
is ensured by the existence of simplicial projections drov compact extension - The unique Hausdorff compac-
(/3, /3')~(a, a') defined, up to an homotopy, by the tifieation aX of a locally compact, non-compact, Haus-
inclusion of {1 in a. The Aleksandrov - tech cohomology dorff space X, obtained by adding a single point 00 to
groups Hn(x, A ; G) are defined as the direct limit X. An arbitrary neighbourhood of the point 00 must
linL. Hn(a, a'; G). The homology groups satisfy all the then have the form {oo} U (X \ F), where F is some
Steenrod - Eilenberg axioms except for the exactness compactum in X. The Aleksandrov compactification aX
axiom. All axioms are valid for cohomology, and, is the smallest element in the set B (X) of all compactif-
partly for this reason, cohomology is often more useful. ications of X. A smallest element in the set B (X) exists
The exactness axiom is also valid for homology on the only for a locally compact space X and must coincide
category of compacta if G is a compact group or field. with ax'
In addition, Aleksandrov-Cech homology and coho- The Aleksandrov compactification was defined by
mology groups have the property of continuity: For P.S. Aleksandrov (1) and plays an important role in
X=lim.- X h the homology (cohomology) groups in topology. Thus, the Aleksandrov compactification aRn

68
ALEXANDER DUALITY

of the n-dimensional Euclidean space is identical with a is a limit ordinal, and if /3<cf(a), then
the n-dimensional sphere; the Aleksandrov compactifi- K!(p
a
= ~
~
K!(p
<.
cation aN of the set of natural numbers is a
homeomorphic to the space of a convergent sequence Here cf(a) denotes the confinal character of the ordinal
together with the limit point; the Aleksandrov compac- number a. As in the case of cardinal numbers, one dis-
tification of the 'open' Mobius strip coincides with the tinguishes between singular alephs, regular alephs, limit
real projective plane RP2. There are pathological situa- alephs, weakly inaccessible alephs, strongly inaccessible
tions connected with the Aleksandrov compactification, alephs, etc. For example, ~a is singular if a is a limit
e.g. there exists a perfectly-normal, locally compact and ordinal and if cf(a)<a. There is no largest aleph
countably-compact space X whose Aleksandrov com- among all alephs. It was shown by Cantor that the set
pactification has the dimensions dim aX < dim X and of all alephs is meaningless, i.e. that there is no such
Ind aX < Ind X. set. See also Totally weD-ordered set; Continuum
References hypothesis; Set theory; Ordinal number; Cardinal
[I] ALEKSANDROFF, P.S. [P.S. ALEKSANDROV]: 'Ueber die Metrisa- number.
tion der im Kleinen kompakten topologischen Raumen', Math.
Ann. 92 (1924), 294-301.
References
v. V. Fedorchuk [I] ALEKSANDROV, P.S.: EinfUhrung in die Mengenlehre und die
Theorie der reelen Funktionen, Deutsch. Verlag Wissenschaft.,
Editorial comments. The Aleksandrov compactification 1956 (translated from the Russian).
is also called the one-point compactification. [2] HAUSDORFF, F.: Set theory, Chelsea, reprint, 1978.
[3] COHEN, P.J.: Set theory and the continuum hypothesis, Benja-
References min, 1966.
[A1] DUGUNDJI, J.: Topology, Allyn and Bacon, 1966. Theorem [4] KURATOWSKI, K. and MOSTOWSKI, A.: Set theory, North-
8.4. Holland, 1968.
B.A. Efimov
AMS 1980 Subject Classification: 54035
Editorial comments. A more recent theorem on the
exponentiation of alephs was proved by J. Silver in 1974, cf.
ALEPH, ~ - The first letter of the Hebrew alphabet.
[A2]. A particular case says that if
As symbols alephs were introduced by G. Cantor to
denote the cardinal numbers (the cardinality cf. Cardi- 2!( = K~+1 for all ~<W1
nal number) of infinite well-ordered sets. Each cardinal then
number is some aleph (a consequence of the axiom of
choice). However, many theorems about alephs are A reasonable up-tO-date additional reference for this topic is
[A 1].
demonstrated without recourse to the axiom of choice.
For each ordinal number a by ~a=w(wa) one denotes References
[A1] LEVY, A.: Basic set theory, Springer, 1979.
the cardinality of the set of all ordinal numbers smaller [A2] SILVER, J.: 'On the singular cardinals problem', in Proc. Inter-
than Wa' In particular, ~o is the cardinality of the set of nat. Congress Mathematicians Vancouver, 1974, Vol. 1,
all natural numbers, ~I is the cardinality of the set of 1975, pp. 265-268.
all countable ordinal numbers, etc. If a</3, then AMS 1980 Subject Classification: 03E55, 04A 10,
~a<~fl' The cardinal number ~a+1 is the smallest 04A30
cardinal number which follows ~a' The generalized con-
tinuum hypothesis states that 21'1 = ~a + I for any ordinal ALEPH-ZERO - A cardinal number which is the car-
number a. If a = 0, the equation assumes the form dinality of all countably-infinite sets. Denoted by ~o.
21'1 =~], and forms the content of the continuum
0 P.s. Aleksandrov
hypothesis. The set of all alephs smaller than ~a is AMS 1980 Subject Classification: 03E10, 04A 10
totally ordered according to magnitude, and its order
type is a. The definitions of the sum, the product and a ALEXANDER DUALITY - The connection between
power of alephs are obvious. One has the homological properties of complementary subsets of
Ka + Kil = Ka 'KIl = Kmax(a. Ill' a topological space, owing to which the homological
properties of a set can be defined by certain properties
The following formulas are most-frequently encoun-
of its complement. The first theorems of this kind were
tered. The recursive Hausdorff formula:
formulated in terms of set-theoretic rather than alge-
braic topology. It was shown by C. Jordan in 1892 that
a particular case of which, for a=O, is the Bernshtefn a simple closed curve divides the plane into two
formula: domains and forms the common boundary of the two
(the Jordan theorem). This theorem was extended in
The recursive formula of Tarski: If an ordinal number 1911 by H. Lebesgue and L.E.J. Brouwer (indepen-

69
ALEXANDER DUALITY

dently of each other) to the case of an n-dimensional References


manifold in an (n + I)-dimensional spherical (or [1] ALExANDER, II, J.W.: 'A proof of the invariance of certain
constants of analysis situs', Trans. Amer. Math. Soc. 16 (1915),
Euclidean) space; a connection was also established
148-154.
between this fact and the property of an r-dimensional [2] ALEKsANDROV, P.S.: Combinatorial topology, Graylock, Roches-
manifold (in an n-dimensional space) being linked with ter, 1956 (translated from the Russian).
an (n-r-l)-dimensional manifold (Lebesgue). It was [3] PONTRYAGIN, L.S.: 'The general topological theorem of duality
for closed sets', Ann. of Math. 35, no. 4 (1934), 904-914.
shown by Brouwer in 1913 that the number of domains [4] LEFSCHETZ, S.: Algebraic topology, Arner. Math. Soc., 1955.
into which a plane is subdivided by a closed subset [5] ALEKSANDROV, P.S.: 'Fundamental duality theorems for non-
depends only on the topological properties of this set. closed sets of an n -dimensional space', Mat. Sb. 21 (63), no.
2 (1947), 161-232 (in Russian).
This type of duality was first expressed by J.W. Alex- [6A] ALEKSANDROY, P.S.: 'Topological duality theorems I. Closed
ander in 1922 [1] in purely homological terms. sets', Trudy Mat. Inst. Steklov. 48 (1955) (in Russian).
Alexander's theorem [2], [3], [4] states that the r- [6B] ALEKSANDROY, P.S.: 'Topological duality theorems II. Non-
closed sets', Trudy Mat. Inst. Steklov. 54 (1959) (in Russian).
dimensional Betti number mod 2 of a (finite)
[7] CHOGOSHYILI, G.S.: 'Homology theory of compact spaces', in
polyhedron <P in an n-dimensional spherical space is Proc. 4-th. All-Union Math. Congress, Vol. 2, Moscow, 1964,
equal to the (n -r-l)-dimensional Betti number mod pp. 57-62 (in Russian).
2 of its complement. P.S. Aleksandrov in 1927 extended [8] KAPLAN, S.: 'Homology properties of arbitrary subsets of
Euclidean spaces', Trans. Amer. Math. Soc. 62 (1947), 248-271.
this theorem to the case of any closed set <P. The dual- [9] BOURGIN, D.G.: Modern algebraic topology, Macmillan, 1963.
ity expressed by this theorem is known as Alexander [10] CHoGOSHYILI, G.S.: 'On homology theory of non-closed sets',
duality. in General topology and its relations to modern analysis and
algebra. Proc. Symp. Prague, 1961, pp. 123-132.
The next important step in the development of dual-
ity of this kind was the theorem of Pontryagin [2], [3], G.S. Chogoshvili
[4] (1934), which states that the r-dimensional homology AMS 1980 Subject Classification: 55NXX, 54FXX
group H,(A, X) of a closed set A in an n-dimensional
spherical manifold M n with a compact group X of coef- ALEXANDER INVARIANTS - Invariants connected
ficients, and the (n - r - 1)-dimensional homology with the module structure of the one-dimensional
group Hn-,-J(B, Y) of the complement B=Mn\A homology of a manifold M, freely acted upon by a free
with the (discrete) group Y of coefficients which is dual Abelian group Ja of rank a with a fixed system of gen-
to X in the sense of the theory of characters, are duals, erators t J, ,ta _
and their scalar product defines the linking coefficient The projection of the manifold M onto the space M
of arbitrary cycles from the homology classes scalarly of orbits (cf. Orbit) is a covering which corresponds to
multiplied. This theorem is known as the the kernel Ka of the homomorphism y: G-,,;Ja of the
Alexander- Pontryagin theorem; the duality formulated fundamental group 'TTJ(M)=G of the manifold M. Since
in it is known as Alexander- Pontryagin duality or as Ka='TTJCM), the group Ba=Ka/ Ka, where Ka is the
Pontryagin duality. A series of subsequent publications commutator subgroup of the kernel K a , is iSOlporphic to
led to Aleksandrov's theorem [5], [7], the formulation of the one-dimensional homology group H J (M, Z). The
which differs from that of Pontryagin's theorem in that extension l-,,;Ka-,,;G-,,;Ja-,,;I generates the extension
A may be an arbitrary subset of M n , the group X may (*): I-,,;Ba-,,;G / Ka-,,;Ja-,,;I, which determines on Ba
be both compact or discrete, and H,(A, X) and the structure of a module over the integer group ring
Hn -, -J (B, Y) are understood to mean the Z(JG) of the group Ja (cf. Group algebra). The same
Aleksandrov-Cech homology groups (cf. structure is induced on Ba by the given action of Ja on
Aleksandrov-Cech homology and cohomology), one of M. Fixation of the generators ti in Ja identifies Z(JG)
which has compact support, while the other is of with the ring
spectral type. The forms of Alexander duality for arbi- La=La(tJ, ... ,ta)=Z[tJ,t- J, ... ,ta,t;J] of Laurent
trary sets are obtained by replacing the latter type of polynomials in the variables ti . Purely algebraically the
groups by their dual cohomology groups of the same extension (*) defines and is defined by the extension of
dimension over the dual group of coefficients. modules (**): O-";Ba--,,Aa-,,;fa-";O [5]. Here fa is the ker-
In subsequent generalizations, spherical manifolds nel of the homomorphism f.: La-";Z (f.tj = I). The
were replaced by more general manifolds (homology mod~le Aa is called the Alexander module of the cover-
manifolds which are acyclic in certain dimensions, cf. ing M --"M. In the case first studied by J.W. Alexander
Homology manifold), the Aleksandrov-Cech groups [1] when M=M(k) is the complementary space of
were replaced by Sitnikov - Steenrod groups (groups of some link k of multiplicity JL in the three-dimensional
projective type) and other groups; the groups of coeffi- sphere S3, while the covering corresponds to the com-
cients were replaced by modules, sheaves, etc. mutation homomorphism Y/l: G(k)--"P of the link

70
ALEXANDER INVARIANTS

group, A I' is the Alexander module of the link k. The which Ll(O) is invertible [7], in particular over the field
principal properties of G which are relevant to what of rational numbers, and, if fl(O) = + 1, then also over
follows are: G / G' is a free Abelian group, the defect Z. In such a case fl(t) is the cparacteristic_polynomial
of the group G is 1, G has the presentation of the transformation t:HI(M;R)~HI(M;R). The
{XI, ... ,Xm+l;rJ, ... ,rm } for which Yp.(XJ=ti' degree of fll (t) is equal to the rank of HJ (M; R); in
1~i ~JL; Y.(Xi) = 1, i>JL (cf. Knot and link diagrams). In particular, fll(t)=l if and only if HI(M;Z)=O. If
the case of links the generators ti EP correspond to the n = 3, the link ideals have the following symmetry pro-
meridians of the components k i C k and are fixed by perty: Ei =Ei' where the bar denotes that the image is
the orientations of these components and of the sphere. taken under the automorphism generated by replacing
As a rule, M is the complementary space M(k) of k, all ti by t i- I. It follows that
consisting of JL (n - 2)-dimensional spheres k i in S n. In A ( -I
I.li t I , ... , tI'-I) -- tN,
I ... tI'N I.li
A (
t J, . . ,t I' ) f or certaIn
.
addition to the homomorphism Ym' one also considers integers N i This symmetry is the result of the
the homomorphism yo: G(k)~J, where y(x) is equal to Fox - Trotter duality for knot and link groups. It may
the sum of the link coefficients of the loop representing also b~ deduced from the Poincare duality for the mani-
x with all ki . fold M, taking into account the free action of Ja [3]. If
The matrix [)(a of the module relations of a module fll (t I, ... ,t1')*0, then the chain complex C.(M) over
Aa is called the Alexander covering matrix and, in the the field of fractions PI' of the ring L I' is acyclic
case of links, the Alexander link matrix. It may be (n = 3), and the Reidemeister torsion TEP I' / II
obtained as the matrix corresponding to the imbedding Lp. cP 1" where II is
the group of units of Lp., is defined accordingly. If
[ ~]Y"<P,
aXj JL=2, then T=fl l ; if JL= 1, then T=fll / t -1 (up to
units of Lp.). The symmetry of fll for n = 3 is a conse-
where {Xi; rd is a presentation of the group G. If JL= 1,
quence of the symmetry of T. If {t= 1, it follows from
the matrix ina of module relations for Ba is obtained
the symmetry of fli (t) and from the property fli ( 1) = + 1
from [)(a by discarding the zero column. The matrices
that the degree of fli(t) is even. The degree of V(t) is
[)(a and ina are defined by the modules Aa and Ba up
also even [4]. The following properties of the knot poly-
to transformations corresponding to transitions to other
nomials fli(t) are characteristic: Lli(l) = +1;
presentations of the module. However, they can be used
fli(t) = t 2k fli(t -I); fli+ I divides fl i ; and fl; = 1 for all i
to calculate a number of module invariants. Alexander
greater than a certain value N, i.e. for each selection
ideals are ideals of the module A a, i.e. series of ideals
fli(t) with these properties there exists a knot k for
Ei(Aa) of the ring
which they serve as the Alexander polynomials. The
La: (O)C,E o C, ... c,Ei - 1 C,Eic' ... C,(l), where Ei is
Hosokawa polynomials [4] are characterized by the
generated by the minors of [)(a of order
property V(t)=t 2k V(t-I) for any {t~2; the polynomi-
(m-i)X(m-i) and Ei=La for m-i<1. The opposite
als Lli of two-dimensional knots by the property
numbering sequence may also be employed. Since La is
fl l (1)=1.
both a Gaussian ring and a Noetherian ring, each ideal
Alexander invariants, and in the first place the poly-
Ei lies in a minimal principal ideal (Ll i); its generator fli
nomials, are powerful tools for distinguishirJg knots and
is defined up to unit divisors t~. The Laurent polyno-
links. Thus, Lli fails to distinguish between only three
mial Lli(t J, ... ,t1') is simply call~ the Alexander poly-
pairs out of the knots in a table containing fewer than
nomial of k (or of the covering M~M). If fli7'~::O, it is
9 double points (cf. Knot table). See also Knot theory;
multiplied by tt, , ... ,t~ so that fli(O, ... ,0)*0 and
*00. To the homomorphism YrJ there correspond a
Alternating knots and links.

module A, ideals Ei and polynomials ai, designated, References


respectively, as Alexander's reduced module, Alexander's [1) ALEXANDER, J.W.: 'Topological invariants of knots and links',
reduced ideals and Alex..ander's reduced polynomials of.! Trans. A mer. Math. Soc. 30 (1928), 275-306.
[2) REIDEMEISTER, K.: Knotentheorie, Chelsea, reprint, 1948.
(or of the covering M rJ~M). If JL= 1, then A =A. [3) BLANCHFIELD, R.C.: 'Intersection theory of manifolds with
[)(A) is obtained from [)( by replacing all ti by t. If operators with applications to knot theory', Ann. of Math. (2)
JL~2, a l is divisible by (t -l)r2. The polynomial 65, no. 2 (1957), 340-356.
[4) HOSOKAWA, F.: 'On 'V -polynomials of links', Osaka J. Math.
V(t)=a l (t) / (t -1)1'-2 is known as the Hosokawa 10 (1958),273-282.
polynomial. The module properties of A (k) have been [5) CROWELL, R.H.: 'Corresponding groups and module
studied [4], [8], [10]. The case of links has not yet _been sequences', Nagoya Math. J. 19 (1961),27-40.
[6) CROWELL, R.H. and Fox, R.H.: Introduction to knot theory,
thoroughly investigated. For JL=l, the group HI(M;R) Ginn, 1963.
is finitely generated over any ring R containing Z in [7) NEUWIRTH, L.: Knot groups, Princeton Univ. Press, 1965.

71
ALEXANDER INVARIANTS

[8) CROWELL, R.H.: 'Torsion in link modules', J. Math. Mech. 14, tion of a procedure's formal parameters, the description
no. 2 (1965), 289-298. of identifiers (except for labels) prior to their use, and a
[9) LEVINE, J.: 'A method for generating link polynomials', Amer.
J. Math. 89 (1967),69-84.
simplified structure of the naming expressions. These
[10) MILNOR, J.W.: 'Multidimensional knots', in Conference on the restrictions are related to those imposed on Algol-60 in
topology of manifolds, Vol. 13, Boston, 1968, pp. 115-133. the unified language known as the Algol-60 subset. At
A. V. Chernavskif the same time new features were introduced into
AMS 1980 Subject Classification: 55NXX, 57TXX, Algams, including both external and part identifiers,
57MXX which do not exist in Algol-60. External identifiers are
names of arrays recommended for inclusion in the
AL'FA - An algorithmic language developed in the external memory of the machine. Reading and writing
USSR to solve scientific and technical problems with of the external arrays is achieved by a standard
the aid of computers. Al'fa is to some extent an exten- exchange procedure. Part identifiers, preceding the
sion of the reproducing properties of Algol. To the arrays, isolate parts of the program which may be
quantities appearing in Algol, complex numbers and stored in the external memory, and are recalled to the
component quantities with an internal dimension (vec- working memory upon entry into the respective array.
tor, matrices, etc.) have been added. Other additions In this way the effectiveness of the language is
include descriptions which serve to introduce designa- enhanced, while the capacity of the working memory of
tions for components of complex and multi- the machine remains relatively small. Moreover,
dimensional quantities. A special class of function rou- Algams contains detailed descriptions of input and out-
tines, the body of which is defined by an expression, put procedures, as well as a more precise definition of
has been introduced. It is possible to display the count- procedure bodies in the case of a description of them in
ing of natural numbers by dots, to use summation and other languages.
product signs and chains of inequalities. There is a sup- References
plementary kind of loops in which the desired number (II 'Description of the language ALGAMS', in Algorithms and
of repetitions can be performed without introducing an algorithmic languages, Moscow, 1968, pp. 3-56 (in Russian).
[21 LYUBIMsKIi, E.Z. and MARTYNYUK, V.V.: 'Refinement of the
enumeration parameter. The principal programming definition of the Algams language', Programming and Computer
systems include Al'fa (for M-20 type machines), Algibr Software 2 (1976), 73-74. (Programmirovanie 1 (1976), 87-88)
(for the complex M-220/BESM-6) and Al'fa-6 (for V. V. Lutsikovich
BESM-6 machines).
AMS 1980 Subject Classification: 68BXX, 6SF05
References
[1) ERSHOV, A.P., KOZHUKHIN, G.L and VOLOSHIN, YU.M.:
Operating languages for systems of automatic programming, ALGEBRA - 1) A branch of mathematics (see Alge-
Novosibirsk, 1964 (in Russian). bra). The term is used in combinations such as homo-
[2) Al'fa - a system for automatization ofprogramming, Novosi-
birsk, 1967 (in Russian).
logical algebra, commutative algebra, linear algebra,
(3) ERSHOV, A.P., KOZHUKHIN, G.L and POTIOSIN, LV.: Hand- multilinear algebra, and topological algebra.
book for the use of the system Al'fa, Novosibirsk, 1968 (in Rus- 2) A special case of an operator ring: An algebra
sian).
(4) Manual for the use of systems of automatization of programming,
(occasionally, a linear or vector algebra) over a field,
Novosibirsk, 1975 (in Russian). over a skew-field or over a commutative ring. Associa-
A.P. Ershov
tive algebras (formerly described as 'hypercomplex sys-
AMS 1980 Subject Classification: 68B05 tems') and non-associative algebras are algebras in this
sense.
ALGAMS - An algorithmic language developed for 3) A synonym of universal algebra. This includes
use mainly on medium-power electronic computers. It algebras such as Boolean algebras, unary algebras, etc.
was developed in 1963 - 1966 by the Group for (d. Boolean algebra; Unary algebra).
Automatic Programming for Medium-type Computers AMS 1980 Subject Classification: OS-XX, 12-XX, 13-
(GAMS), appointed by the commission for comprehen- XX, 15-XX, 16-XX, 17 -XX, 18-XX, 20-XX
sive collaboration of the acadeInies of sciences of
socialistic countries. It is supposed to be a standard ALGEBRA - The branch of mathematics in which
language for the exchange of algorithms between social- one studies algebraic operations (cf. Algebraic opera-
istic countries. It is based on the language Algol-60 (d. tion).
Algol), on which certain restrictions were imposed in Historical survey. The simplest algebraic operations -
order to facilitate the translation process. The most arithmetic operations on positive integers and positive
important restrictions include the prohibition of a rational numbers - can be encountered in the oldest
recursive use of procedures, the demand of a specific a- mathematical texts, which indicates that the principal

72
ALGEBRA

properties of these operations were known even in early fourth-degree equations (cf. Ferrari method). During
antiquity. In particular, the Arithmetic of Diophantus the following three .::enturies fruitless efforts were made
(third century A.D.) had a major influence on the to find similar formulas for solving equations of higher
development of algebraic ideas and symbols. The term degrees; in this connection, the problem of finding at
'Algebra' originates from the work AI jabr al-muqabala least a 'formula-free' proof of the existence of a com-
by Mohammed al-Khwarizmi (nineth century A.D.), plex root of an arbitrary algebraic equation with com-
which describes general methods for solving problems plex coefficients became of major interest. This theorem
which can be reduced to algebraic equations of the first was first stated in the seventeenth century by A.
and second degree. Towards the end of the fifteenth Girard, but was rigorously proved by C.F. Gauss only
century the cumbersome verbal descriptions of towards the end of the eighteenth century (cf. Algebra,
mathematical operations which had previously pre- fundamental theorem of). Finally, it was established by
vailed began to be replaced by the contemporary sym- N.H. Abel in 1824 that equations of degree higher than
bols '+' and '-', and subsequently symbols for four cannot, in general, be solved by radicals, and E.
powers, roots, and parentheses appeared. F. Viete, at Galois in 1830 stated a general criterion for the solva-
the end of the sixteenth century, was the first to use the bility of algebraic equations by radicals (cf. Galois
letters of the alphabet to denote the constants and the theory). Other problems were neglected at that time,
variables in a problem. Most of the present-day sym- and algebra was understood to mean the 'analysis of
bols of algebra were known as early as the mid- equations', as noted by J. Serret in his course of higher
seventeenth century, which marks the end of the algebra (1849).
'prehistory' of algebra. The development of algebra Studies on algebraic equations in one unknown were
proper took place during the next three centuries, dur- accompanied by studies on algebraic equations in
ing which views as to the proper subject matter of this several unknowns, in particular of systems of linear
discipline kept radically changing. equations. The study of linear equations resulted in the
In the seventeenth and eighteenth centuries 'Algebra' introduction of the concepts of a matrix and a deter-
was understood to mean the science of computations minant. Matrices subsequently became the subject of an
carried out on algebraic symbols - 'identity' transfor- independent theory, the algebra of matrices, and their
mations of formulas consisting of letters, solving alge- scope of application was extended beyond the solution
braic equations (cf. Algebraic equation), etc. - as dis- of systems of linear equations.
tinct from arithmetic, which dealt with calculations per- From the mid-nineteenth century onwards studies in
formed on explicit numbers. It was assumed, however, algebra gradually moved away from the theory of equa-
that the symbols stood for actual numbers: integers or tions towards the study of arbitrary algebraic opera-
fractions. A brief table of the contents of one of the tions. The first attempts at an axiomatic study of alge-
best textbooks of that time, L. Euler's Introduction to braic operations dates back to the 'theory of relations'
algebra includes integers, ordinary and decimal frac- of Euclid, but no progress was made in this direction,
tions, roots, logarithms, algebraic equations of degrees since a geometrical interpretation of even the simplest
one to four, progressions, additions, Newton's binomial arithmetic operations - ratios of lengths or of areas -
and Diophantine equations. Thus, by the mid- is impossible. Further progress only became possible
eighteenth century, algebra corresponded, more or less, as a result of gradual generalization and intensive study
to the 'elementary' algebra of our own days. of the concept of a number, and of the appearance of
The principal subject dealt with by the algebra of the arithmetic operations performed on objects entirely
eighteenth and nineteenth centuries were polynomials. unlike any number. The first such examples were
Historically, the first problem was the solution of alge- Gauss' 'composition of binary quadratic forms', and P.
braic equations in one unknown, i.e. equations of the Ruffini's and A.L. Cauchy's multiplication of permuta-
type: tions. The abstract concept of an algebraic operation
appeared in the mid-nineteenth century in the context
The purpose was to derive formulas expressing the of studies on complex numbers (cf. Complex number).
roots of the equation in terms of its coefficients, by There appeared G. Boole's algebra of logic, H.
means of addition, multiplication, subtraction, division Grassmann's exterior algebra, W. Hamilton's quatern-
and extraction of roots ('solution by radicals'). ions (cf. Quaternion) and A. Cayley's matrix calculus,
Mathematicians were able to solve first- and second- while C. Jordan published a major treatise on pennuta-
degree equations even in the earliest times. Substantial tion groups.
advances were made in the sixteenth century by Italian These studies prepared the way for the transition of
mathematicians: a formula was found for solving algebra at the turn of the nineteenth century into its
third-degree equations (cf. Cardano fonnula) and modern stage of development, which is characterized by

73
ALGEBRA

the combination of previously separate algebraic ideas fields are the main objects studied in commutative alge-
on a common axiomatic basis and by a considerable bra and in the closely related field algebraic geometry.
extension of the scope of its applications. The modern Another important type of algebra with two binary
view of algebra, and of the general theory of algebraic operations is a lattice. Typical examples of lattices
operations, crystallized at the beginning of the twen- include: the system of subsets of a given set with the
tieth century under the influence of D. Hilbert, E. operations of set-theoretic union and intersection, and
Steinitz, E. Artin and E. Noether, and was fully esta- the set of positive integers with the operations of taking
blished by 1930 with the appearance of B.L. van der the least common multiple and the greatest common
Waerden's Modern algebra. divisor.
The subject matter of algebra, its principal branches and Linear (or vector) spaces over a field may be treated
its connection with other branches of mathematics. The as universal algebras with one binary operation (addi-
subject matter of modern algebra are sets and algebraic tion) and with a unary operation (multiplication by
operations on these sets (i.e. algebras or universal alge- scalars of the ground field). Linear spaces over skew-
bras, cf. the terminology in Algebra; Universal algebra), fields have also been studied. If a ring is considered
considered up to an isomorphism. This means that, instead of a set of scalars, the more general concept of
from the point of view of algebra, the sets themselves a module is obtained. An important part of algebra,
and the sets as carriers of algebraic operations are linear algebra, studies linear spaces, modules and their
indistinguishable, and in this sense the proper subject linear transformations, as well as problems related to
of study are the algebraic operations themselves. them. A part of it, the theory of linear equations and
For a long time the studies that were actually carried the theory of matrices, was formulated as early as the
out concerned only a few basic types of universal alge- nineteenth century. A closely related subject is that of
bras which naturally appeared in the development of multilinear algebra.
mathematics and its applications. Initial studies on the general theory of arbitrary
One of the most important and most thoroughly stu- universal algebras (this theory is sometimes called
died type of algebras is a group, i.e. an algebra with 'universal algebra') date back to the nineteen-thirties
one associative binary operation, containing a unit ele- and were carried out by G. Birkhoff. At the same time
ment and, for each element, an inverse element. The A.I. Mal'tsev and A. Tarski laid the foundations for the
concept of a group was, historically, the first example theory of models (cf. Model), i.e. sets with marked rela-
of a universal algebra and in fact served, in many tions on them. Subsequently, the theory of universal
respects, as a model for the construction of algebra and algebras and the theory of models became so closely
of mathematics in general at the turn of the nineteenth linked that they gave rise to a new discipline, inter-
century. Independent studies on generalizations of mediate between algebra and mathematical logic, called
groups such as semi-groups, quasi-groups and loops the theory of algebraic systems (cf. Algebraic system),
began only much later (d. Loop; Quasi-group; Semi- the subject of which are sets with algebraic operations
group). and relations defined on them.
Rings and fields are very important types of algebras A number of disciplines intermediate between alge-
with two binary operations. The operations in rings and bra and other fields of mathematics have been created
fields are usually called addition and multiplication. A by the introduction into universal algebras of comple-
ring is defined by Abelian group axioms for the addi- mentary structures compatible with the algebraic opera-
tion, and by distributive laws for multiplication with tions. These include topological algebra (including the
respect to addition (d. Rings and algebras). Originally, theory of topological groups and Lie groups, cf. Lie
only rings with associativity multiplication were stu- group; Topological group), the theory of normed rings
died, and this requirement of associativeness occasion- (cf. Normed ring), differential algebra, and theories of
ally forms part of the definition of a ring (d. Associa- various ordered algebraic formations. Homological alge-
tive rings and algebras). The study of non-associative bra, which originates both from algebra and from
rings (cf. Non-associative rings and algebras) is today a topology, arose in the nineteen-fifties as a discipline in
fully recognized independent discipline. A skew-field is its own right.
an associative ring in which the set of all non-zero ele- The role of algebra in modern mathematics is
ments is a multiplicative group. A field is a skew-field extremely important, and there is a trend towards
in which multiplication is commutative. Number fields, further 'algebraization' of mathematics. A typical way
i.e. sets of numbers closed under addition, multiplica- of studying many mathematical objects that are some-
tion, subtraction and division by non-zero numbers, times far removed from algebra is to construct alge-
were implicitly included in the very first studies of alge- braic systems which adequately represent the behaviour
braic equations. Associative-commutative rings and of these objects. Thus, the study of Lie groups can be

74
ALGEBRA OF FUNCTIONS

largely reduced to the study of their algebraic counter- Editorial comments.


parts: Lie algebras (cf. Lie algebra). A similar method References
is used in topology: to each topological space is [A1] LIDL, R. and PILZ, G.: Applied abstract algebra, Springer,
assigned, in some standard manner, an infinite series of 1984.
[A2] JACOBSON, N.: Lectures in abstract algebra, 1-3, v. Nostrand,
homology groups (cf. Homology group), and these series 1951-1964.
of algebraic 'reflections of them' make it possible to
AMS 1980 Subject Classification: 01-XX, 10-XX, 12-
evaluate, very accurately, the properties of the spaces
XX, 13-XX, 15-XX, 20-XX
themselves. The recent major discoveries in topology
were made using algebra as a tool (cf. Algebraic topol-
ogy). ALGEBRA, C * - - See C -algebra.
It would appear at first sight that the translation of AMS 1980 Subject Classification: 22D25, 46L05
problems into the language of algebra, solving them in
this language and translating them back is merely a ALGEBRA, FUNDAMENTAL THEOREM OF - The
superfluous complication. In fact, such a method turns theorem that states that any polynomial with complex
out to be highly convenient, and occasionally the only coefficients has a root in the field of complex numbers.
possible one. This is because by algebraization one The theorem was first stated by A. Girard in 1629 and
solves problems not only by purely verbal by R. Descartes in 1637 in a formulation different from
considerations, but also by using the powerful the one employed today. C. MacLaurin and L. Euler
apparatus of formal algebraic calculations, so that one made the formulation more precise and gave it a form
may occasionally overcome highly involved complica- which is equivalent to the one in use today: Any poly-
tions. This role of algebra in mathematics may be com- nomial with real coefficients can be decomposed into a
pared with the role of modern computers in the solu- product of linear and quadratic factors with real coeffi-
tion of practical problems. cients. A proof of the fundamental theorem of algebra
Algebraic concepts and methods are widely employed was first given by J. d' Alembert in 1746. There fol-
in number theory (cf. Algebraic number theory), in lowed the proofs of Euler, P. Laplace, lL. Lagrange
functional analysis, in the theory of differential equa- and others in the second half of the eighteenth century.
tions, in geometry (cf. Invariants, theory of; Projective All these proofs are based on the assumption that some
geometry; Tensor algebra), and in other mathematical 'ideal' roots of the polynomial in fact exist, after which
disciplines. it is demonstrated that at least one of them is a com-
Besides its fundamental role in mathematics, algebra plex number. C.F. Gauss was the first to prove the fun-
is very important from the point of view of its applica- damental theorem of algebra without basing himself on
tions; examples are applications within physics (the the assumption that the roots do in fact exist. His proof
representation theory of finite groups in quantum essentially consists of constructing the splitting field of
mechanics; discrete groups in crystallography), cyber- a polynomial. All proofs of the theorem involve some
netics (cf. Automata, theory of) and mathematical form of topological properties of real and complex
economics (linear inequalities, d. Linear inequality). numbers. The role of topology has ultimately been
reduced to the single assumption that a polynomial of
References
[I] The history of mathematics from the earliest times to the begin- odd degree with real coefficients has a real root.
ning of the 19-th century, Vol. 1-3, Moscow, 1970-1972 (in Rus- References
sian). [I] KUROSH, A.G.: Higher algebra, Mir, 1972 (translated from the
[2] MAL'TSEV, A.I.: 'On the history of algebra in the USSR during Russian).
her first twenty-five years', Algebra and LogiC 10, no. I (1971), [2] LANG, S.: Algebra, Addison-Wesley, 1974.
68-75. (Algebra i Logika 10, no. I (1971), 103-118) [3] BASHMAKOVA, I.G.: 'On a proof of the fundamental theorem of
[3] Mathematics, its content, methods and meaning, Vol. 1-3, Mos- algebra', Istor. Mat. Issled., no. 10 (1957), 257-304 (in Rus-
cow, 1956 (in Russian). Collection of articles.
[4] KUROSH, A.G.: Higher algebra, Mir, 1972 (in Russian). sian). VN
. . RemesIenm'k ov
[5] BOURBAKI, N.: Elements of mathematics. Algebra: AlgebraiC
structures. Linear algebra, I, Addison-Wesley, 1974, Editorial comments. For a proof based on the Brouwer
Chapt. I; 2 (translated from the French). fixed-point theorem cf. [A 1].
[6] WAERDEN, B.L. VAN DER: Algebra, 1-2, Springer, 1967-1971 References
(translated from the German). [A 1] ARNOLD, B.H.: 'A topological proof of the fundamental
[7] LANG, S.: Algebra, Addison-Wesley, 1974. theorem of algebra', Amer. Math. Monthly 56 (1949), 465-
[8] MAL'TSEv, A.I.: Algebraic systems, Springer, 1973 (translated 466.
from the Russian).
For references, see also the articles on individual AMS 1980 Subject Classification: 26CXX, 12005
algebraic disciplines.
Yu.I. Merzlyakov ALGEBRA OF FUNCflONS, function algebra - A
A.I. Shirshov semi-simple commutative Banach algebra A, realized as

75
ALGEBRA OF FUNCTIONS

an algebra of continuous functions on the space of all x=#=xo. In the present case any point in the space of
maximal ideals 9n. If a EA and if / is some function maximal ideals has a representing measure concen-
defined on the spectrum of the element a (i.e. on the set trated on fo.
of values of the function a=a), then/(a) is some func- A function algebra is said to be analytic if all func-
tion on 9n. Clearly, it is not necessarily true that tions of this algebra that vanish on a non-empty open
/(a)EA. If, however, / is an entire function, then subset of the space of maximal ideals vanish identically.
/(a)EA for any aEA. The use of the Cauchy integral Algebras that are analytic with respect to the boundary
formula permits a considerable strengthening of this are defined in a similar manner. Any analytic algebra is
result: If the function / is analytic in some neighbour- analytic with respect to the Shilov boundary; the con-
hood of the spectrum of the element a, then /(a)EA verse is usually not true.
and the mapping / ~/ (a) is a homomorphism of the A function algebra A is said to be regular if, for any
algebra of functions which are analytic in some neigh- closed set F in the space X of maximal ideals of A and
bourhood of the spectrum of a EA into A. This proposi- for any point Xo not contained in F, it is possible to
tion is valid for non-semi-simple commutative Banach find a function / EA such that / (x) = 1 for all x EF and
algebras as well. Moreover, this class of functions that /(xo)=O. All regular algebras are normal, i.e. for any
are analytic in a neighbourhood of the spectrum of a pair of non-intersecting closed sets F, F 0 EX there
given element cannot be enlarged, in general. For exists an element/EA such that/(x)=1 for all XEF
example, if A =L1(Z) and /(a)EA for all aEA with and I (x) = 0 for all x EF o. In a regular algebra, for
spectrum in the interval [0, 1], then / is analytic in any finite open covering {U;}, 1:S;;;i~m, of the space X
some neighbourhood of this interval. there exists a partition of unity belonging to A, i.e. a
In a few cases lea) can also be defined for multi- system of functions /J, ... ,f" EA for which
valued analytic functions f, but such a definition has fl(x)+ ... +f,,(x) 1
inherent difficulties. Thus, let A be the algebra of con-
tinuous functions in the disc I Z I ~ 1 that are analytic and
Ji(x) =0 if x f1- Ui.
in the disc I z I < 1 and that satisfy the condition A function g is said to belong locally to the function
/ (0) = O. The unit disc is naturally identified with the algebra A if for any point Xo EX there exists a neigh-
space of maximal ideals of A. The function /I(Z)=Z, bourhood in which this function coincides with some
which is continuous on the space of maximal ideals, function of the algebra. Any function which locally
does not belong to A, but is a solution of the quadratic belongs to a regular algebra is itself an element of this
equation algebra.
fl- Z 2 = 0,
An element / of a function algebra is called real if
where z2EA. /(x) is real for all x EX. If A is an algebra with real
If A is a semi-simple algebra with space of maximal
generators /a and if
ideals X, if /EC(X) and if
d
f In II exp(itfa) II -t-2
TOO

p(j) - j"+alj"-l + ... +an = 0, aiEA,


-00 1 +t
< 00

with p' (f) Ef.(A ), the group of units of A (a simple for all la, then A is regular.
root), then /EA. Similarly, if /EC(X) and if An ideal in a Banach algebra is said to be primary if
exp(f)EA, then/EA. it is contained in only one maximal ideal. If A is a reg-
A function algebra is said to be a uniformly- ular function algebra, then each maximal ideal Xo con-
convergent algebra (or uniform algebra) if the norm in tains a smallest closed primary ideal J(xo) which is
this algebra defines a notion of convergence equivalent contained in any closed primary ideal contained in Xo.
to the uniform convergence of the functions on the a The ideal J(xo) is the closure of the ideal formed by
space of maximal ideals. If II a 2 II = II a 112 for all a EA, the functions lEA that vanish in some neighbourhood
then A is a uniform algebra. The general example of a (depending on f) of x 0 EX.
uniform algebra is a closed subalgebra of the algebra of In the algebra of absolutely convergent Fourier series
bounded continuous functions on some topological with an adjoined identity any maximal ideal coincides
space, provided with the natural sup-norm. with the corresponding primary ideal.
If A is a uniform algebra and if its space of maximal Let A be a closed subalgebra of the algebra C (X),
ideals is metrizable, then among the boundaries (not where X is a compactum (which does not necessarily
only the closed ones) there is a minimal boundary f 0, coincide with the space of maximal ideals of A). Let A
the closure of which is the Shilov boundary. The set fo separate the points of X, i.e. for any two different
consists of 'peak points': Xo is a peak point if there points Xl, X 2 E X there exists a function I in A for
exists a function /EA such that Ilex) I < I I(xo) I for which I(x d=#=I(x 2) The algebra A is called symmetric

76
ALGEBRA OF LOGIC

if both a function I (x) and the function I(x) belong to belong to the same Gleason part can be described in
it. According to the Stone- Weierstrass theorem, if A is terms of the representing measures on the Shilov boun-
symmetric, then A =C(X). The algebra A called anti- dary: Two such points have two mutually absolutely
symmetric if it follows from the conditions f,lEA that I continuous representing measures with bounded deriva-
is a constant function. In particular, algebras of ana- tives. An algebra for which Re(A Ir) is dense in C(f)
lytic functions are anti-symmetric. A subset S eX is is called a Dirichlet algebra; if P is a Gleason part in
called a set 01 anti-symmetry (with respect to the alge- the space of maximal ideals of a Dirichlet algebra
bra A) if any function lEA that is real on S is constant which contains more than one point, then there exists a
on this set. It follows from this definition that the alge- continuous one-to-one mapping 1/1 of the disc I z I < I
bra A is anti-symmetric if the whole set X is a set of into P such that for any function lEA the function
anti-symmetry. In the general case the space X can be I(I/I(z is analytic in I z I <l. Thus, P has a structure
represented as the union of non-intersecting, closed, with respect to which the functions lEA are analytic;
maximal sets of anti-symmetry. Each maximal set of the mapping 1/1 is not a homeomorphism in general if P
anti-symmetry is an intersection of peak sets (a set P is is endowed with the ordinary topology of the space of
called a peak set if there exists a function lEA such maximal ideals, but 1/1 is a homeomorphism if P is
that II p = 1 and II (x) I < 1 if x tl.P). It follows that endowed with the metric PA.
the restriction A I y of the algebra A to a maximal set For references see Banach algebra.
E.A. Gorin
of anti-symmetry is a closed (anti-symmetric) subalge-
Editorial comments.
bra of the algebra C (Y). If X is the space of maximal
ideals of the algebra A, the maximal sets of anti- References
[A1] GAMELIN, T.W.: Uniform algebras, Prentice-Hall, 1969.
symmetry are connected. If a continuous function is [A2] STOUT, E.L.: The theory of uniform algebras, Bogden and
such that on each maximal set of anti-symmetry it coin- Quigley, 1971.
cides with some function in the algebra A, then the
AMS 1980 Subject Classification: 46J 10
function itself belongs to A. This generalization of the
Stone- Weierstrass theorem makes it possible, in prin- ALGEBRA OF LOGIC - The branch of mathematical
ciple, to reduce the study of arbitrary uniform algebras logic that deals with propositions from the aspect of
to the study of anti-symmetric algebras A. However, their logical meanings (true or false) and with logical
the study of arbitrary algebras A cannot be reduced to operations on them.
the study of analytic algebras: There exists an example The algebra of logic originated in the middle of the
of an algebra of type R (X) (a closed subalgebra of the nineteenth century with the studies of G. Boole [1], [2],
algebra C(X which does not coincide with C(X), and and was subsequently developed by C.S. Peirce, P.S.
is anti-symmetric and regular. Poretskii, B. Russell, D. Hilbert, and others. The
Let Re(A) be the real space of functions of the form development of the algebra of logic was an attempt to
Re(f), where lEA; if Re(A) is an algebra or if Re(A) is solve traditional logical problems by algebraic methods.
closed in C(X), then A =C(X). The space X can be With the birth of the theory of sets (in the eighteen-
regarded as a part of the space of maximal ideals of the seventies), propositions and logical operations on them
algebra A; accordingly, not only the ordinary topology became the principal subject of algebra of logic. Propo-
of the space of maximal ideals, but also the metric sitions are understood to mean statements about which
induced by the imbedding of X into the dual space to it is meaningful to ask whether they are true or false.
A can be considered on X. The distance in the sense of For instance, the proposition 'a whale is an animal' is
this metric will be denoted by PA. For any points true, while the statement 'all angles are right angles' is
x I> X2 EX the inequality PA (x I> x2):S;;;2 is valid; the false. The connectives 'and', 'or', 'if ... then', 'is
relation PA(XI> x2)<2 is an equivalence relation, and equivalent to', the particle 'not', etc., which are com-
the equivalence classes are known as Gleason parts. If X monly used in the language of logic, make it possible to
is the disc I z I :s;;; 1 and A is the closed sub algebra in construct new, more 'complicated', propositions from
C (X) consisting of the functions analytic in I z I < 1, those already established. Thus, given that 'x> 2' and
then the metric PA is non-Euclidean, and the one-point 'x:S;;;3', it is possible to obtain, by using the connective
sets on the circle and in the interior of the disc serve as 'and', the proposition 'x>2 and x:S;;;3'; by using the
the Gleason parts. Gleason parts do not necessarily connective 'or' it is possible to obtain the proposition
have an analytic structure: Any a-compact completely- 'x>2 or x:S;;;3', etc.
regular space is homeomorphic to the Gleason part of The truth or the falsehood of the propositions
the space of maximal ideals of some algebra, such that obtained in this way depend on the truth or falsehood
the restriction of the algebra to this part contains all of the initial propositions and on a corresponding treat-
bounded continuous functions. The fact that two points ment of the connectives as operations on propositions.

77
ALGEBRA OF LOGIC

Often truth is symbolized by the digit '1' and falsehood below:


by the digit '0'. The connectives 'and', 'or', 'if '" x y x x&y xVy x~y x-y
then', and 'is equivalent to' are denoted, respectively,
0 0 1 0 0 1 1
by the symbols & (conjunction), V (disjunction), ~
0 1 1 0 I 1 0
(implication), and (equivalence); negation is
I 0 0 0 I 0 0
represented by a bar - above the symbol. In addition to
1 1 0 1 1 1 1
individual propositions, examples of which are given
above, one also uses variable propositions, i.e. variables Tables for arbitrary functions of the algebra of logic
whose values may be any individual statements given in are constructed in a similar manner. This is the so-
advance. As the next step, the concept of a formula called tabular way of specifying functions of the algebra
was inductively introduced. It formalizes the concept of of logic. The tables themselves are sometimes called
a compound proposition. Let A, B, C, ... , denote truth tables. The following equalities play an important
individual propositions, and let x, y, z, ... , denote part in the transformations of formulas into equivalent
variable propositions. Each one of the above letters is formulas:
known as a formula. Let the symbol * denote anyone x&y = y&x, xVy = yVx (1)
of the connectives listed above, and let ~ and ?E denote (the law of commutativity);
formulas; then (~*?E) and i will be formulas (e.g. (x&y)&z = x&(y&z), (xVy)Vz = xV(yVz) (2)
x&y)~Z). The connectives and the particle 'not' are
(the law of associativity);
looked upon as operations on quantities which assume
the values 0 and 1, the results of these operations are x&(xVy) = x, xV(x&y) = x (3)
also the digits 0 and 1. The conjunction x&y is equal to (the law of absorption);
1 if and only if both x and y equal 1; the disjunction x&(yVz) = (x&y)V(x&z), (4)
xVy equals 0 if and only if x and y are both equal to
xV(y&Z) = (xVy)&(xVz)
0; the implication x~y is equal to 0 if and only if x = I
and y = 0; the equivalence x -y equals 1 if and only if (the laws of distributivity);
the values of x and y are identical; and the negation X- x&x = 0 (5)
is equal to 1 if and only if x = O. If the values of the
(the law of contradiction);
propositions in each formula are given, the value 0 or 1
can be assigned to the formula. This means that any xVx = 1 (6)
formula can be considered as a way of stating or as a (the law of the excluded middle);
way of realizing a function of the algebra of logic, i.e. a
x~y = xVy, (7)
function which is defined on samples of zeros and ones,
and with as values also 0 or 1. Two formulas ~ and ?E x~y = (x&y) V (x&y).

are said to be equal (~=?E) if they realize equal func- If these equalities are used, it is possible to obtain new
tions. The subject matter of the algebra of logic is the equalities, even without the use of tables. These new
treatment of functions of the algebra of logic and the equalities are ottained by the so-called identity transfor-
operations on these functions. In what follows below, mations which, generally speaking, alter the expression,
the class of functions of the algebra of logic will be but not the function realized by the expression. For
extended to the class of functions whose arguments, as instance, the absorption laws yield the law of idempo-
well as the functions themselves, assume the values of tency x V x =x. The above equalities frequently sim-
elements of a finite fixed set E; the number of opera- plify the notation of formulas by eliminating some of
tions on the functions will also be extended. The alge- the parentheses. Thus, the relations (1) and (2) make it
bra of logic is sometimes understood to mean this latter possible to replace the formulas
concept. However, from the aspect of practical applica- ... (~l &~2)& ... )&~s) and
tions, the most important case is that of sets E of cardi- ... (?E 1 V?E 2 )V ... )V?Es) by the more compact
nality two, and this case will therefore be discussed in notation ~1&~2&'" &~s and ?El V?E2 V ... V?Es'
greatest detail. The results presented here are also The former relation is known as the conjunction of the
closely connected with a second approach to the study factors ~t. ... ,~s, while the latter is called the disjunc-
of propositions - the so-called propositional calculus. tion of the terms ?E 1 , ,?Es' It is also seen from the
In specifying the functions of the algebra of logic one equalities (5), (6) and (7) that, if the constants 0 and I,
often uses tables which contain all the combinations of the implication and the equivalence, are regarded as
the values of the variables and the values of the func- functions, they may be expressed by conjunctions, dis-
tions on these combinations. Thus, the synoptic table of junctions and negations. Thus, any function of the alge-
the functions X-, x&y, xVy, x~y, and x-y is given bra of logic may be realized by a formula written using

78
ALGEBRA OF LOGIC

only the symbols &, V and-. form. Thus, the contracted disjunctive normal form for
The set of all formulas involving variable proposi- the formula x"'(y~z~(x&z is x&yVzVx&y. The
tions, some of the symbols &, V, ~, "', -, and the formulas u \ and U2 are equal if and only if their con-
constants 0 and 1 is called a language over these symbols tracted disjunctive normal forms are identical. In addi-
and constants. Equations (1) - (7) show that, for any tion to the disjunctive normal form, use is also made of
formula in the language over &, V, ~, "', -, 0, and 1, the conjunctive normal form, i.e. of expressions which
it is possible to find an equivalent formula in the are obtained from a disjunctive normal form by substi-
language over &, V, -, 0, and 1; for example tuting the symbol V for &, & for V, and 0 for 1. For
instance, the disjunctive normal form x&y VX&Z yields
(x~y)~z = x Vy)&Z)Vx Vy)&z). the conjunctive normal form (x Vy)&(xV z). An opera-
In the latter language a special role is played by a class tion (or function) f is said to be- the dual of an opera-
of formulas which may be written in the form tion I/; if the table defining f is obtained from the table
2r\ V ... V2r s, 0 or 1, where s;;;;.l and where each 2rj which defines I/; by substituting 1 for 0 and 0 for 1
is either a variable proposition, its negation or a con- throughout (including interchange in the values of the
junction of them, so that none of the 2rj contains equal functions). Thus, the conjunction and the disjunction
x
factors or factors of the form x and together, and all are mutually dual, the negation is dual to itself, the
2rj are pairwise unequal. Here the parentheses are omit- constants 0 and 1 are mutually dual, etc. A transforma-
ted, since it is assumed that the conjunction operation tion of a formula which consists in replacing the sym-
is 'stronger' than disjunction, i.e. when calculating with bols of all operations and expressions by the symbols
respect to the given values of the variables, the values of their dual operations, and in exchanging 0 for 1 and
of 2r \, ... , 2rs are calculated first. These expressions 1 for 0, is called a duality transformation. If the equality
are called disjunctive normal forms. Each formula u in a 2r = ~ is true, and if 2r. is the dual of 2r while ~. is the
language over &, V, ~, "', -, 0, 1 which realizes a dual of ~, then it is also true that 2r. =~., and this
non-zero function of the algebra of logic, may be con- equality is called the dual of the above equality; this is
verted with the aid of the equalities (1) - (7) into an the so-called duality prinCiple. The pairs of the laws (1),
equivalent disjunctive normal form, containirJg all vari- (2) and (3) are examples of dual equalities; equality (5)
able formulas of u and any number of other variables, is dual to equality (6), and each conjunctive normal
and each 2rj in this disjunctive normal form containirJg form is the dual of some disjunctive normal form. A
the same variables. Such a disjunctive normal form is perfect conjunctive normal form and a contracted con-
said to be a perfect disjunctive normal form of the for- junctive normal form are defined as conjunctive normal
mula u; the perfect disjunctive normal form of 0 is the forms such that their dual expressions are a perfect dis-
formula 0 itself. The possibility of reduction to a per- junctive normal form and a contracted disjunctive nor-
fect disjunctive normal form forms the basis of an algo- mal form, respectively. Perfect and contracted disjunc-
rithm determining the equality or inequality of two tive and conjunctive normal forms are used for survey-
given formulas. The procedure of this algorithm is as ing all hypotheses and all consequences of a given for-
follows: The given formulas u \ and U2 are reduced to mula. A hypothesis of a formula 2r is understood to be a
perfect disjunctive normal forms which contain all the formula ~ such that (~~2r) = I; a consequence of a for-
variables comprised in both u \ and U2 and the two mula 2r is a formula ~ such that (2r~~)= 1. A
expressions are compared; if they are identical, u\ =U2; hypothesis of a formula 2r is said to be simple if it is a
if not, u \ =;FU 2. An important role in the algebra of conjunction of variable formulas or their negations and
logic and its applications is played by contracted dis- if, after discarding anyone of its factors, it is no longer
junctive normal forms, i.e. disjunctive normal forms a hypothesis of the formula 2r. Similarly, a consequence
which satisfy the following conditions: 1) it does not of 2r is called simple if it is a disjunction of variable
contain pairs of elements 2rj and 2rj such that any fac- formulas or their negations and if, after one of its ele-
tor of 2rj is contained in 2rj ; 2) for any two elements 2rj ments is discarded, it is no longer a consequence of 2r.
and 2rj one of which contains some variable as factor, The survey of hypotheses and consequences is based on
while the other contains the negation of this variable the indication of an algorithm which transforms a given
(under the condition that the pair of elements does not formula into all of its simple hypotheses and
contain a second variable for which the same is true), consequences and, with the aid of the laws (2) - (7),
there exists (in the same disjunctive normal form) an into all remainirJg hypotheses and consequences. The
element 2rk which is equal to the conjunction of the algorithm is based on the following facts. If 2r =~, then
other factors of these two elements. Any disjunctive 2r and ~ have identical hypotheses and identical conse-
normal form may be reduced with the aid of the equali- quences. An element of a disjunctive normal form is a
ties (1) - (7) to an equal contracted disjunctive normal hypothesis of that disjunctive normal form, while a fac-

79
ALGEBRA OF LOGIC

tor of a conjunctive normal form is a consequence of Here it is considered, as before, that conjunction is a
that conjunctive normal form. If m: is a hypothesis of a stronger connective than the sign +. These equalities
proposition 58, then m:&~ is also a hypothesis for 58; if are sufficient to deduce any valid equality in the
m: is a consequence of 58, then m:V~ is also a conse- language over &, +, 1, with the aid of identity
quence of 58. If m: and ~ are hypotheses of a proposi- transformations, just as in the case of the language over
tion 58, then m:V ~ is also a hypothesis for 58; if m: and &, V, ~, "', -, +, 1. A proposition in this language is
~ are consequences of 58, then m:& ~ is also a conse- called a reduced polynomial if it is of the form
quence of 58. A perfect disjunctive normal form has no m: 1 + ... +m:s , when m: j is equal to 1 or is a variable or
hypotheses (not containing letters which are not con- a conjunction of various variables without negations,
tained in that disjunctive normal form) other than the m:j=#=m:j if i=/=j, s;;:.l, or else is equal to 1+1. Thus, the
disjunctions of some of its elements or of disjunctive expression x&y&z +x&y + 1 is a reduced polynomial.
normal forms equal to them. A perfect conjunctive nor- Any formula of the algebra of logic may be converted
mal form has no consequences other than the conjunc- to a reduced polynomial by identity transformations.
tions of some of its factors or of propositions equal to The equality m: = 58 is valid if and only if the reduced
them. A contracted disjunctive normal form is the dis- polynomial for m: is identical with the reduced polyno-
junction of all its simple hypotheses; a contracted con- mial for 58. In addition to the languages mentioned
junctive normal form is the conjunction of all its simple above there are also other languages equivalent to
consequences. A contracted disjunctive normal form them. Two languages are called equivalent if each for-
has important applications. The first problem which mula in one language can be converted, by means of
should be mentioned is the minimization of functions certain conversion rules, into another equivalent for-
of the algebra of logic, which is a part of the problem mula in the second language, and vice versa. It is suffi-
of synthesis of control (switching) systems. The minimi- cient to base such a language on an arbitrary system of
zation offunctions of the algebra of logic consists in con- operations (and constants) such that the operations
structing a disjunctive normal form for the given func- (and constants) of the system may be used to represent
tion of the algebra of logic which realizes this function any function of the algebra of logic. Such systems are
and has the smallest sum of the number of factors in its said to be functionally complete. Examples of complete
elements, i.e. has minimal complexity. Such disjunctive systems are {xVy}, {xVy,x}, {x+y, l,x&y}, etc.
normal forms are called minimal. Each minimal dis- There exists an algorithm which can be used to estab-
junctive normal form for a given function of the alge- lish the completeness or incompleteness of an arbitrary
bra of logic which is not a constant is obtained from finite system of functions of the algebra of logic. It is
the contracted disjunctive normal form of this function based on the following fact. A system of functions of
by discarding a number of elements. For certain func- the algebra of logic is complete if and only if it con-
tions the contracted disjunctive normal form may coin- tains functions fI (x, y, ... , v) and 12 (x, y, ... , v) such
cide with the minimal disjunctive normal form. Such a that fl (0,0, ... ,0)= 1 and 12(1, 1, ... ,1)=0, as well
case occurs, for example, with monotone functions, i.e. as functions h, f4 and f5, where h =#=h' f4 is not
functions realizable by formulas over &, V, 0, 1. monotone, and the reduced polynomial of f5 contains
In the language over &, V, ~, ~, 0, 1, +, when the an element m: with more than one factor. There are
sign + is interpreted as addition modulo 2, the follow- also other languages based on systems of operations
ing relations are valid: which are not functionally complete, and the number of
xVy = x&y)+x)+y, (8) such languages is infinite. They contain infinitely many
pairwise incomparable languages (in the sense that
x~y = i&y, x~y = (x+y)+I, (9)
there is no way of translating from one language to
x+y = (x&y) V (i&y), I = xVi. (10) another by means of identity transformations). How-
These formulas make it possible to translate formulas ever, for any language based on some operations of the
from the language over &, V, ~, -, 0, and 1 into algebra of logic there exists a finite system of equations
equivalent formulas in the language over &, +, 1, and in this language such that any equality can be deduced
vice versa. The identity transformations in the latter with the aid of identity transformations of the equa-
language are realized with the aid of equalities esta- tions of this system. Such a system called a deductively
blished for the conjunction and the following additional complete system of equalities in this language. Thus, the
equalities: equalities (1) - (6) constitute a complete system of
x+y = y+x, (II) equalities of the language over &, V, -, 0, 1.
(x+y)+z = x+(y+z), (12)
In considering one of the languages discussed above
in conjunction with some complete system of equalities
x&(y+z) = x&y+x&z, (13) in this language, a tabular statement of the basic opera-
x&y = x, x+(y+y) = x, x&1 = x. (14) tions in the language and the requirement that proposi-

80
ALGEBRA OF MEASURES

tions be the values of its variables are sometimes dis- M (G) has a unit which is the 8-measure located at the
carded. Instead, various interpretations of the language zero of the group. The set of discrete measures con-
are permitted. These interpretations involve some set of tained in M (G) forms a closed subalgebra.
objects (which are used as values of the variables) and To each function f which belongs to the group alge-
some system of operations over the objects of this set, bra L 1(G) may be assigned a corresponding measure
which satisfy the equalities from a complete system of J.LfEM(G) in accordance with the rule
equalities of the language. Thus, such a procedure
results in the transformation of the language over &, p./E) = ! dx
V, -, 0, I into the language of a Boolean algebra; the
(integral with respect to the Haar measure). The result
language over &, +, I is transformed into the language
is an isometric isomorphic imbedding Ll(G)~M(G).
of a Boolean ring (with a unit element); the language
Under this imbedding the image is a closed ideal in
over &, V, - is transformed into the language of a dis-
M(G).
tributive lattice, etc.
The Fourier- Stieltjes transform of a measure
Historically, the development of the algebra of logic
J.LEM(G) is the function ~ on the dual group Gdefined
has been stimulated mainly by the problems to which it
by the formula
has been applied. An important application is the
theory of electric circuits. In certain cases circuits can-
not be described in terms of the ordinary two-valued
algebra of logic, and a many-valued logic must be con- Then ~=~.~ and II J.L II =0 if ~ O. In particular,
sidered (cf. Many-valued logic). M (G) is an algebra without a radical.
If the group G is not discrete, then the structure of
References
[I) BOOLE, G.: The mathematical analysis of logic: being an esstry
the measure algebra M(G) is extremely complicated: It
towards a calculus of deductive reasoning, Macmillan, 1847. is not symmetric and its space of maximal ideals has a
[2) BOOLE, G.: An investigation of the laws of thought, on which are number of pathological properties. For instance, this
founded the mathematical theories of logic and probabilities,
Dover, reprint, 1951.
space contains infinite-dimens~ona1 analytic sets, and
[3) PORETSKII, P.S.: 'On methods of solving logical equalities and the naturally imbedded group G in it is not dense even
on an inverse method of mathematical logic', in Collection of in the Shilov boundary. Nevertheless, the idempotent
protocols of sessions of the FizMat. section of the Society of
Natural Scientists at Kazan. Univ., Vol. 2, Kazan', 1884, pp.
measures, i.e. measures for which J.L*J.L=J.L, are known.
161-330 (in Russian). Each idempotent measure is a finite integer combina-
[4) HILBERT, D. and ACKERMAN, W.: Grundzuge der theoretischen tion n 1J.Ll + ... + nkJ.Lb where J.Li = Xi Pi, and where Pi is
Logik, Dover, reprint, 1946. the Haar measure of a compact subgroup, and Xi is a
[5) NOVIKOV, P.S.: Elements of mathematical logic, Edinburgh,
1964 (translated from the Russian). character. In the case G = Z this means that a sequence
[6) YABLONSKIi, S.V., GAVRILOV, G.P. and KUDRYAVTSEV, V.B.: (cm ) of zeros and ones is the Fourier- Stieltjes
Functions of the algebra of logic and Post classes, Moscow, 1964 transform of some measure on the circle if and only if
(in Russian).
V.B. Kudryavtsev (cm ) differs from a periodic sequence by not more than
Editorial comments. In the theory of Boolean functions a finite number of terms.
contracted disjunctive normal forms are also called abridged In the general case the theorem on idempotent meas-
disjunctive normal forms, cf. Boolean functions, normal ures can be naturally interpreted in terms of the coho-
forms of. mology spaces of dimension zero of the space of maxi-
AMS 1980 Subject Classification: 03GXX mal ideals. A satisfactory description is also known for
other cohomology groups of the space of maximal
ALGEBRA OF MEASURES, measure algebra - The ideals of the measure algebra. This makes it possible to
algebra M (G) of complex-valued regular Borel meas- tell, in particular, if a logarithm of an invertible meas-
ures on a locally compact Abelian group G that have ure from M (G) can be taken (one-dimensional integral
bounded variation, with the ordinary linear operations cohomology).
and with convolution as multiplication (cf. Harmonic References
analysis, abstract). The convolution A*J.L of the meas- [1) RUDIN, W.: Fourier analysis on groups, Interscience, 1962.
ures A, J.LEM(G) is completely defined by the condition [2) TAYLOR, J.L.: 'The cohomology of the spectrum of a measure
that, for any continuous function f on G with compact algebra', Acta Math. 126 (1971), 195-225.
E.A. Gorin
support,
Editorial comments.
jfd(h*P.) = j jf(x+y)dh(X)dp.(y). References
G GG
[A 1] GRAHAM, c.c. and MCGEHEE, O.c.: Essays in commutative
If the total variation of a measure is taken as norm, harmonic analysis, Springer, 1979,
M (G) becomes a commutative Banach algebra over the [A2] TAYLOR, J.L.: Measure algebras, Amer. Math. Soc., 1972.

field of complex numbers. The algebra of measures AMS 1980 Subject Classification: 28A60, 43A20

81
ALGEBRA OF SETS

ALGEBRA OF SETS - A non-empty collection of generates an associative sub algebra. The set of all alge-
subsets of some set n that is closed under the set- bras with associative powers over a given field F forms
theoretic operations (of union, intersection, taking com- a variety of algebras which, if the characteristic of the
plements), carried out a finite number of times. In field F is zero, is defined by the system of identities
order for a class of subsets of a set n to be an algebra
(x, x, x) = (x 2 , x, x) = 0,
of sets, it is necessary (and sufficient) for it to be closed
under finite unions and taking the complement. An where (x,y, z)=(xy)z-x(yz). If Fis an infinite field of
algebra of sets that is closed under countable unions is prime characteristic p, then the variety of algebras with
known as a a-algebra of sets. Any a-algebra of sets is associative powers cannot be defined by any finite sys-
closed under the set-theoretic operations carried out a tem of identities, but an independent, infinite system of
countable number of times. identities which defines it is known [3]. If a commuta-
Examples. 1) The collection of finite subsets of an tive algebra A with associative powers of characteristic
arbitrary set n and their complements is an algebra of other than 2 has an idempotent e=;FO, then A can be
sets; the collection consisting of the at most countable decomposed according to Peirce into a direct sum of vec-
subsets of n and their complements is a a-algebra of tor subspaces:
sets. A = Ao(e)EflA 1/2(e)ElM I(e), (*)
2) The collection of finite unions of intervals of the where A >..(e) = {a EA: ea =;\a}, ;\=0, 1/2, 1. Here
type A o(e) and A I (e) are sub algebras, A o(e)A I (e)=O,
A 1/2(e)A 1/2(e)CAO(e)+A I(e),
forms an algebra of sets.
A>..(e)AI/2(e)CA I / 2(e)+A j ->..(e) for ;\=0,1. The
3) let n be a topological space; the a-algebra B of
decomposition (*) plays a fundamental role in the
sets generated by the open subsets of n (in other words,
structure theory of algebras with associative powers.
the smallest a-algebra of sets containing all open sub-
sets of n) is known as the Borel a-algebra of subsets of References
n, while the sets belonging to B are known as Borel [I] ALBERT,A.A.: 'Power-associative rings', Trans. A mer. Math.
Soc. 64 (1948), 552-593.
sets. [2] GAiNOV, A.T.: 'Identity relations for binary Lie rings', Uspekhi
4) let n=RT , where T is an arbitrary set (i.e. n is the Mat. Nauk 12, no. 3 (1957), 141-146 (in Russian).
set of all real functions on 1); the class A of sets of the [3] GAiNOV, A.T.: 'Power-associative algebras over a finite-
characteristic field', Algebra and Logic 9, no. 1 (1970),5-19.
type (Algebra i Logika 9, no. 1 (1970), 9-33)
A.T. Gainov
where E is a Borel subset of Rk, is an algebra of sets;
Editorial comments. An algebra with associative powers
in the theory of random processes a probability measure
is also called a power-associative algebra. The fact that the
is often originally defined only on an algebra of this set of algebras with associative powers over a field of non-
type, and is their subsequently extended to a wider zero characteristic forms a variety defined by
class of sets (to the a-algebra generated by A). (x, x, x)=(x 2 , x, x)=O was proved in [A1].
5) The collection of Lebesgue-measurable subsets of
Rk is a a-algebra of sets. References
[A 1] ALBERT, A.A.: 'On the power associativity of rings', Summa
Algebras (respectively, a-algebras) are the natural Brasiliensis Math. 2 (1948), 21-33.
domain of definition of finitely-additive (respectively,
AMS 1980 Subject Classification: 17A 10, 17A05, 16-
a-additive) measures. According to the theorem of
XX
extension of measures, any a-finite, a-additive measure,
defined on an algebra A, can be uniquely extended to a
ALGEBRAIC ALGEBRA - An algebra A with associ-
a-additive measure defined on the a-algebra generated
ative powers (in particular, an associative algebra) over
by A.
a field F all elements of which are algebraic (an ele-
References ment a EA is called algebraic if the subalgebra F[a]
[I] DUNFORD, N. and SCHWARTZ, J.T.: Linear operators. General generated by it is finite-dimensional or, equivalently, if
theory, I, Interscience, 1958.
[2] HALMOS, P.R.: Measure theory, v. Nostrand, 1950.
the element a has an annihilating polynomial with coef-
[3] NEVEU, J.: Bases mathematiques du calcul des probabilites, Mas- ficients from the ground field F). An algebra A is called
son, 1970. an algebraic algebra of bounded degree if it is algebraic
v. V. Sazonov
and if the set of degrees of the minimal annihilating
AMS 1980 Subject Classification: 28A05
polynomials of its elements is bounded. Subalgebras
and homomorphic images of an algebraic algebra (of
ALGEBRA WITH ASSOCIATIVE POWERS - A bounded degree) are algebraic algebras (of bounded
linear algebra A over a field F each element of which degree).

82
ALGEBRAIC CURVE

Examples: locally finite algebras (in particular, neighbourhood of a by a generalized Laurent series
finite-dimensional ones), nil-algebras and associative (Puiseux series):
skew-fields with a countable set of generators over an 00

uncountable field. j(z) = ~ cn(z-at/ k , m~O.


n=-m
The algebras considered below are associative. The
Jacobson radical of an algebraic algebra is a nil-ideal. A The point at infinity, a = 00, is called an algebraic
primitive algebraic algebra A is isomorphic to a dense branch point for a function 1 (z) if the point b 0 is an =
algebra of linear transformations of a vector space over algebraic branch point of the function g(w)= l(l / w).
a skew-field; if, in addition, A is of bounded degree, There may exist several (and even an infinite number
then A is isomorphic to a ring of matrices over a skew- of) different algebraic branch points and regular points
field. An algebraic algebra without non-zero nilpotent of a complete analytic function with a given affix a.
elements (in particular, a skew-field) over a finite field References
is commutative. It follows that finite skew-fields are [1] MARKUSHEVICH, A.I.: Theory offunctions of a complex variable,
commutative. An algebraic algebra of bounded degree 3, Chelsea, 1977, Chapt.8 (translated from the Russian).
satisfies a polynomial identity (cf. PI-algebra). An alge- [2] HURWITZ, A. and COURANT, R.: Vorlesungen iiber allgemeine
Funktionentheorie und elliptische Funktionen, 4, Springer, 1968.
braic PI-algebra is locally finite. If the ground field is
uncountable, then the algebras obtained from an alge- E.D. Solomentsev
braic algebra by extension of the ground field, and the AMS 1980 Subject Classification: 30AXX, 32830
tensor product of algebraic algebras, are algebraic alge-
bras. ALGEBRAIC CLOSURE 01 a field k - An algebraic
extension (cf. Extension of a field) of k that is an alge-
References
[I] JACOBSON, N.: Structure of rings, Arner. Math. Soc., 1956. braically closed field. Such an extension exists for each
[2] HERSTEIN, I.: Noncommutative rings, Math. Assoc. Amer., 1968. field and is unique up to isomorphism. The algebraic
VN. Latyshev closure of the field of real numbers is the field of com-
AMS 1980 Subject Classification: 16A16, 12F05
plex numbers (cf. Algebra, fundamental theorem of).
VN. Remeslennikov
ALGEBRAIC BRANCH POINT, algebraic singular AMS 1980 Subject Classification: 12F05
point - An isolated branch point a of finite order of an
analytic function 1 (z), having the property that the ALGEBRAIC CURVE - An algebraic variety of
limit limZ-+ 1 (z) exists for any regular element of con-
Q dimension one. An algebraic curve is the most fre-
tinuation of 1 in a domain for which a is a boundary quently studied object in algebraic geometry. In the
point. More exactly, a singular point a in the complex sequel, an algebraic curve means an irreducible alge-
z-plane for the complete analytic function 1 (z), under braic curve over an algebraically closed field.
continuation of some regular element eo of this func- The simplest and clearest concept is that of a plane
tion with centre z 0 along paths passing through a, is affine algebraic curve. This is a set of points in an
called an algebraic branch point if it fulfills the follow- affine plane A~ satisfying the equation l(x,y)=O,
ing conditions: I) There exists a positive number p such where I(x, y) is a polynomial with coefficients from an
that the element eo may be extended along an arbitrary algebraically closed field k. The field of rational func-
continuous curve lying in the annulus D {z: = tions of an irreducible algebraic curve over k is the
0< I z-a I <p}; 2) there exists a positive integer k>1 field of algebraic functions in one variable of the form
such that if z 1 is an arbitrary point of D, the analytic k(x, y), where x and y are connected by the equation
continuation of the element eo in D yields exactly k l(x,y)=O, where I(x,y) is a polynomial over k. This
different elements of the function 1 (z) with centre z 1 ; means that every algebraic curve is birationally iso-
if e 1 is an arbitrary element with centre z 1, all the morphic to a plane affine curve.
remaining k - I elements with centre z 1 can be It has been known for a long time that even when
obtained by analytic continuation along closed paths studying affine curves fundamental relationships can
around the point a; and 3) the values at the points z of only be revealed by considering points at an infinite
D of all elements which are obtainable from eo by con- distance, and by a detailed study of the singular points.
tinuation in D tend to a definite, finite or infinite, limit In order to study all the points of an affine curve, the
as z tends to a while remaining in D. curve is imbedded into a projective space pn, with sub-
The number k - I is said to be the order of the alge- seqlloent closure in the Zariski topology. In this way a
braic branch point. All branches of the function 1 (z) projective curve X is obtained, and the initial affine
obtainable by analytic continuation of the element eo curve Y may be obtained from X by deleting a finite
in the annulus D may be represented in a deleted number of points. If Y is irreducible, then X and Y are

83
ALGEBRAIC CURVE

birationally isomorphic. Every complete algebraic curve where d is a non-negative integer which is a measure of
is projective. If X is a smooth projective curve, then all the smoothness on X. If X has only ordinary double
valuation rings of the field k(x) are given by the local points, d is simply the number of singular points.
rings (!) X> X EX (cf. Local ring). If two smooth projec- In particular, the genus of a plane smooth projective
tive curves are birationally equivalent, then they are curve is given by
isomorphic. A normal algebraic curve is smooth. In _ (m -l)(m -2)
g - 2 '
particular, any irreducible algebraic curve is biratio-
nally equivalent to a smooth projective curve. The pro- which means that not every smooth projective curve is
jective model of an algebraic curve, obtained by the plane. For a curve X in space the following estimate
process of normalization, lies in some space pn. Any holds:
smooth projective curve is isomorphic to a curve (n - 2)2
for even n,
situated in p3. Any plane algebraic curve can be con- 4
(n -l)(n -3) (I)
verted by a Cremona transformation into a curve with for odd n,
ordinary singular points. 4
Divisors on a smooth algebraic curve are represented
by linear combinations of points with integer coeffi- where n is the degree of X. Curves of degree n of maxi-
cients mal genus exist for each value of n and lie on a quadric
(G. Halphen, 1870).
where nx=O for almost all x (cf. Divisor). If all nx;;'O, The degree of the canonical class Kx of a smooth
the divisor D is called positive, or effective, which is projective curve X is connected with the genus of the
written as D ;;.0. The degree of the divisor D is the curve by means of the formula degKx =2g-2. If a
number smooth projective curve X lies on a smooth algebraic
degD = ~nx' surface F, the adjunction formula Kx=X(X+KF ) holds.
The principal divisors form a subgroup P(X) of the In particular, degKx=(X)2 + (XKF ). For an arbitrary
group Div X of all divisors on X. The quotient group divisor D on X, one can consider the subset of the field
Div(X) / P(X) is called the group of'divisor classes and k(X) consisting of zero and of the functions f for
is denoted by CI(X). The group CI(X) is isomorphic to which (f) + D ;;.0. This is a linear space over k of finite
the group Pic(X) of classes of one-dimensional vector dimension I(D). The dimension of the complete linear
bundles on X (cf. Vector bundle, algebraic). The degree system defined by the divisor D is I(D)-I and the cal-
of the principal divisors on a smooth projective curve is culation of I(D) is an important task of the theory of
zero, and thus all divisors in one class are of the same algebraic curves. The strongest relevant result is the
degree. In particular, one can speak of the degree of a Riemann - Roch theorem. For smooth projective curves
divisor class, and of the subgroup Clo(X) of divisor this theorem is the equality
classes of degree zero. The following equality is valid: /(D)-/(K-D) = deg(D)-g+l,
Cl(X) / Clo(X) = Z. where g is the genus of the curve X. If I(K - DO (or
For the line pI, CI(PI)=Z, ClO(pl)=O, i.e. any divisor if I(K-D)=O), one says that D is special (or, respec-
of degree zero is a principal divisor. This property is tively, non-special). For non-special divisors the
characteristic for rational smooth projective curves. Riemann - Roch theorem yields I(D)=deg(D)- g+ l.
For any complete algebraic curve X, the number Each divisor of degree higher than 2g - 2 is non-special.
'7T=dimHI(X, (!) x) is known as the arithmetic genus of The class of divisors which are linearly equivalent to
the algebraic curve X. If X is smooth, '7T is identical with a divisor D on a smooth projective curve X defines a
the dimension of the space HO(X, Q~) of all regular dif- point on the Jacobi variety J (X) of X. This variety is
ferential forms on X; this dimension is known as the identical with the Albanese variety and with the Picard
genus of X. By definition, the genus of an algebraic variety of X. Points which correspond to classes of spe-
curve is equal to the genus of its non-singular model. cial divisors are singular points of the Poincare divisor
For any non-negative integer g there exists an algebraic on J (X). If G~ denotes the subset of points of J (X)
curve of genus g. Rational curves are distinguished by which correspond to the classes of divisors D with
the equality g = O. If X is a projective plane curve of degD =n and I(D)=r, then G~ forms a subscheme in
order m, then J(X) and
17= (m-l)(m-2) dimG~ ;;:. r(n-r+l)-(r-l)g
2 ' (the Riemann - Brill- N oether theorem). This theorem
and its genus is given by the formula: has numerous applications, one of which will now be
g = (m-l)(m-2)_d described. Any divisor D for which I (D)?d defines a
2 ' rational mapping CPD of the curve X into the projective

84
ALGEBRAIC CURVE

space pl(D)-I. The mapping <PD depends on the class of ponent of the unit of which coincides with the group of
D. If deg(D)~2g+ 1, then <PD defines an isomorphic points of X(k). 3) If X is a curve of genus g>l, then
imbedding of X into pm, while <PD(X) is not contained Aut(X) is always a finite group. Its order is bounded
in any proper subspace of the space pm (m I(D)-I). by the number 84(g - 1) [6]. Weierstrass points (d.
Mappings <P which correspond to a multiple nK of the Weierstrass point) on X play an important role in the
canonical class of X are the most interesting from the study of the group Aut(X) in the latter case.
point of view of the birational classification of curves. Another way of studying Aut(X) is based on the fact
If g> 1, the class 3K defines an isomorphic imbedding that all smooth projective curves are finite (ramified)
of the smooth projective curve into pSg -6. Two curves coverings of the projective line.
X and Yare birationally equivalent if and only if their Let X be a smooth projective curve defined over the
images <P3K(X) and <P3K(Y) are obtained from each field C. The set of points of the curve X(C) has the
other by a projective transformation of pSg -6. The natural structure of a one-dimensional compact analytic
study of the mapping <PK yielded a more precise charac- manifold, which is also known as a compact Riemann
teristic of curves of genus g> 1. For these curves surface. The converse is also true, i.e. any compact
<PK: X~pg-I is an isomorphic imbedding if and only if Riemann surface is obtained from some smooth projec-
X is not a hyper-elliptic curve. If <PK is an isomorphism, tive curve. Usually one uses the same symbol X for the
the curve <PK(X) is called canonical; it is defined smooth projective curve and its corresponding one-
uniquely up to projective transformations in pg -I. A dimensional complex manifold. Any connected co~plex
very important task of the theory of algebraic curves is manifol~ X can be represented as a quotient X / G,
their classification up to a birational isomorphism. A where X is a simply-connected complex manifold, an?
number of important results have been obtained in this G is a group of automorphisms of X which acts on X
field, but an adequate solution of the problem is not discretely and freely. It is noteworthy that there are
available now (1977). only three one-dimensional simply-connected connected
Smooth projective curves are subdivided into four analytic manifolds, up to an isomorphism. These are
classes: the projective line CP I (the Riemann sphere), the affine
1) curves of genus 0, birationally equivalent to PI; straight line C (the finite plane) and the interior of the
2) curves of genus I (elliptic curves), birationally unit disc D = {z: I z I < I} (the Lobachevskii plane).
equivalent to a smooth cubic curve in p2; All smooth projective curves are subdivided into three
3) hyper-elliptic curves; classes, depending on which one of the three types their
4) non-hyper-elliptic curves of 8enus g> I, biratio- universal covering belongs to.
nally equivalent to a canonical curve in pg -I (algebraic The problem of classifying smooth projective curves
curves of basic type). of a given type can be reduced to the study of discrete
The genus of a curve does not fully characterize the groups of transformations of the universal coverings
birational class of an algebraic curve. The only excep- which act freely with a relatively compact fundamental
tion are curves of genus zero. If k is the field of com- domain. In the case of the projective line, G is the iden-
plex numbers C, the set of classes of mutually- tity group; in the case of the affine straight line, G is
isomorphic elliptic curves is described by points in the isomorphic to a subgroup Q of the additive group C
quotient space H / G, where H is the upper half-plane which is a two-dimensional lattice in C; in the case of
and G is the modular group consisting of rational-linear the interior of the unit disc, G is a subgroup of motions
transformations with integral coefficients and with in the Lobachevskii plane which can be defined by
determinant equal to + 1. The space H / G has the some non-Euclidean bounded polygon. Thus, the first
structure of an analytic manifold isomorphic to C (d. class above contains a unique curve pI, the second
Elliptic curve). Classes of birationally equivalent curves class consists of complex tori C / Q that all have the
of genus g> 1 are described by points belonging to structure of a one-dimensional Abelian variety (elliptic
some algebraic variety vi( g of dimension 3g - 3, which curve), the addition of points on the torus defining the
is called the moduli variety of curves of genus g. This group structure on the respective curve. All smooth
variety is irreducible. According to one conjecture vi( g elliptic curves are obtained in this way. The field of
is unirational; however, this has been proved (by F. rational functions C(X) on an elliptic curve X~C/Q
Severi) for g< 11 only. is isomorphic to the field of meromorphic doubly-
The following results are valid for the group Aut(X) periodic (elliptic) functions with period group Q. If
of automorphisms of a smooth projective curve X. I) If f(x,y)=O is the equation of the affine model of the
X is pL then Aut(X) is the group of rational-linear curve X, then there exists a parametrization x = <j>(z),
transformations PGL(I, k). 2) If X is an elliptic curve, y = I/;{z) by elliptic functions of it (a uniforrnization of
then Aut(X) is an algebraic group, the connected com- X). The third class consists of all smooth projective

85
ALGEBRAIC CURVE

curves X of genus g> 1. In this case the field C(X) is [6] CHEBOTAREV, N.G.: The theory of algebraiC functions,
isomorphic to the field of meromorphic functions on D Moscow-Leningrad, 1948 (in Russian).
[7] SPRINGER, G.: Introduction to Riemann surfaces, Addison-
that are invariant with respect to the group G. Such
Wesley, 1957.
functions are known as automorphic. Every algebraic [8] DoLGACHEV, I.V. and ISKovsKIKH, V.A.: 'Geometry of alge-
curve of genus g> I is uniformized by automorphic braic varieties', J. Soviet Math. 5, no. 6 (1976), 803-864. (Itogi
Nauk. i Tekhn. Algebra Topol. Geom. 12 (1974),77-170)
functions (cf. Uniformization). The problem of the
classification of elliptic curves also led to a study of the VE. Voskresenskil
quotient D / G, but this situation is substantially dif- Editorial comments. Estimate (1) above is due to G.
ferent from the one just discussed. First, the group G Castelnuovo [A1]. A proof can also be found in [A2], [A3],
has fixed points in D; secondly, the manifold D / G is [A4]. Reference [A2] also contains new results on the
non-compact, though containing a finite LobachevskiI Riemann-Noether-Brill theorem, e.g. it has been proven
plane. The study of the general case of such groups, that equality holds in the theorem for a generiC curve in the
sense of moduli; this reference also gives a survey of recent
and of the respective quotients, plays an important role
developments in the theory of algebraic curves.
in modem arithmetical research.
The moduli space of curves .A 9 is of general type, thus
If an algebraic curve X is defined over a non-closed not unirational, if 9 is odd and g~25 (J.E. Harris and D.
field k, one of the most important problems is that of Mumford) or if 9 is even and g~40 (Harris), [A5].
the existence and location of the set of rational points The Mordell conjecture, i.e. every curve of genus at least
X(k) of X. In the case of a smooth projective curve two over a number field has only a finite number of rational
over a finite field k a proof has been given of the ine- points, has been proved by G. Faltings [A6].
quality I N - q -1 I :S;;;2g vq,
where N is the number of References
points on X that are rational over a finite extension L [A 1) CASTELNUOVO, G.: 'Studies on the geometry of algebraic
of k, q is the number of elements of Land g is the curves', Atti R. Acad. Sci. Tarin 24 (1889), 196-223 (in
Italian).
genus of X. This inequality is equivalent to the
[A2) ARilELLO, E., CORNALBA, M., GRIFFITHS, P.A. and HARRIS,
Riemann hypothesis concerning the zeros of the t- J.E.: Geometry of algebraic curves, 1, Springer, 1985.
function of X, viz. that all zeros of the t-function lie on [A3) GRIFFITHS, P.A. and HARRIs, J.E.: Principles of algebraic
the vertical line (J = 1 / 2 (cf. Zeta-function in algebraic geometry, Wiley, 1978.
[A4) HARTSHORNE, R.: Algebraic geometry, Springer, 1977.
geometry). [AS] MUMFORD, D. and HARRIS, J.: 'On the Kodaira dimension of
N ow let X be an algebraic curve defined over the the moduli space of curves', Invent. Math. 67 (1982), 23-88.
field of rational numbers Q. For curves of genus zero [A6) FALTINGS, G.: 'Endlichkeitssatze fUr abelsche Varietaten Liber
Zahlk6rpern', Invent. Math. 73 (1983), 349-366.
the points of X(Q) are relatively easily found, for ellip-
tic curves the rational points constitute a finitely- AMS 1980 Subject Classification: 14HXX
generated group (if X(Q) is non-empty), while for
curves of genus g;;. 2 there is the Mordell conjecture to ALGEBRAIC CYCLE on an algebraic variety - An ele-
the effect that X(Q) is finite. ment of the free Abelian group the set of free genera-
If the ground field k is the field of rational functions tors of which is constituted by all closed irreducible
ko(B) of a smooth projective curve B, each smooth pro- subvarieties of the given algebraic variety. The sub-
jective curve X over k is isomorphic to the general fibre group of the group C(X) of algebraic cycles on a
X1J of the morphism f: V ~B of a smooth projective variety X generated by a subvariety of codimension p is
algebraic surface V over ko. This morphism is uniquely denoted by cP(X). The group C(X) can be represented
defined if it is assumed that its fibres do not contain as the direct sum
exclusively curves of genus 1. The set of rational points ceX) == EBcP(X).
X(k) is in one-to-one correspondence with the set of p

sections V(B) of f, and X(k) is finite for curves of The subgroup C l (X) is identical with the group of Weil
genus g > 2. Curves of genus 0 and lover the field divisors (d. Divisor) on X.
ko(B) are studied in the theory of algebraic surfaces In what follows X will denote a non-singular projec-
(d. Elliptic surface; Ruled surface). tive algebraic variety of dimension n over an algebrai-
cally closed field k. If k is the field of complex numbers
References
[I] SHAFAREVICH, I.R.: Basic algebraiC geometry, Springer, 1977 C, then each algebraic cycle Z E cP (X) defines a
(translated from the Russian). (2n - 2p )-dimensional homology class
[2] WALKER, RJ.: Algebraic curves, Springer, 1978. [ZlEH 2n -2p(X, Z) and, in accordance with Poincare
[3] MUMFORD, D.: Lectures on curves on an algebraic surface,
Princeton Univ. Press, 1966.
duality, a cohomology class y(Z)EH 2p(X, Z). The
[4] CHEVALLEY, c.: Introduction to the theory of algebraiC functions homology (or, respectively, cohomology) classes of type
of one variable, Amer. Math. Soc., 1951. [Zl (or y(Z) are called algebraic homology (respectively,
[5] SERRE, J.-P.: Groupes algebrique et corps des classes, Hermann,
1959.
cohomology) classes. Each analytic cycle is homologous

86
ALGEBRAIC CYCLE

with an algebraic cycle. It is believed (Hodge's conjec- of the variety X (cf. Intersection theory).
ture) that an integral (2n - 2p )-dimensional cycle r on For any Wei! cohomology theory H(X) there exists
X is homologous with an algebraic cycle if and only if a uniquely defined homomorphism of groups
the integrals of all closed differential forms of type y: cP(X) ~ H"'(X).
(2p -q, q), q=/=p, over r are equal to zero. This conjec-
ture has only been proved for p = 1 (for n =2 [6], and Two algebraic cycles Z and Z' are called homologically
for all n [7]), for p =n -1, and for isolated classes of equivalent (which is denoted by Z "-'homZ,) if
varieties [4]. y(Z)=y(Z'). The subgroup of algebraic cycles that are
If W = ~ni W; is an algebraic cycle on the product of homologically equivalent with zero is denoted by
two varieties X X T, then the set of cycles on X of the Chom(X). The imbedding Calg(X)CChom(X) is valid.
form The quotient group C(X) / Chom(X) is finitely gen-
erated, and is a subring in the ring H (X), which is
is known as a family of algebraic cycles on X denoted by A (X) and is known as the ring of algebraic
parametrized by the base T. The usual requirement in Wei! cohomology classes. It is not known (1986) whether
this connection is that the projection of each subvariety or not A (X) depends on the Weil cohomology theory
W; on T be a flat morphism. If W = Wi is defined by that has been chosen.
an irreducible subvariety, the corresponding family of Two algebraic cycles Z and Z' are called T-equivalent
algebraic cycles on X is called a family of algebraic sub- (which is denoted by Z "-'TZ') if there exists an m ~ I
varieties. In particular, for any flat morphism f: X ~ Y such that mZ "-'algmZ'. The subgroup of algebraic
of algebraic varieties its fibres Xy form a family of alge- cycles that are T-equivalent to zero, is denoted by
braic subvarieties of X parametrized by the base Y. A CT(X). Two algebraic cycles Z and Z' from cP(X) are
second particular case of this concept is that of a linear called numerically equivalent (which is denoted by
system. All members of a family of algebraic sub- Z"-'numZ') if the equality WZ= WZ' is valid for any
varieties (or, respectively, algebraic cycles) of a projec- WE C n -P(X), provided both sides of the equality are
tive variety X, parametrized by a connected base, have defined. The subgroup of algebraic cycles numerically
the same Hilbert polynomial (respectively, virtual arith- equivalent with zero is denoted by C num(X). The
metic genus). imbeddings
Two algebraic cycles Z and Z' on a variety X are
CT(X) C Chom(X) C Cnum(X)
algebraically equivalent (which is denoted by Z "-'algZ')
if they belong to the same family, parametrized by a are valid. For divisors the groups CT(X) C1(X), n
connected base. Intuitively, equivalence of algebraic Chom(X) n C1(X) and Cnum(X) n
C1(X) are identical
cycles means that Z may be algebraically deformed into [6]. However, in accordance with the counterexample in
Z'. If this definition includes the condition that the [5] for the case k =c
base T is a rational variety, the algebraic cycles Z and C(X) =1= Chom(X),
Z' are called rationally equivalent (which is denoted by
Z"-'ratZ'). If Z, z' EC1(X), the concept of rational where Chom(X) is considered with respect to the ordi-
equivalence reduces to the concept of linear equivalence nary cohomology theory with rational coefficients. A
of divisors. The subgroup of algebraic cycles rationally similar counterexample was established for a field k of
(or, respectively, algebraically) equivalent to zero, is arbitrary characteristic and for the l-adic theory of Wei!
denoted by C rat (X) (respectively, C alg (X. Each of cohomology. The question as to the equality of the
these groups is a direct sum of its components groups Chom(X) and Cnum(X) has been solved [9].
Let X be imbedded in a projective space and let Lx
qat(X) = n CP(X),
Crat(X)
be the cohomology class of a hyperplane section. An
CPrug(X) = Calg(X) n CP(X). algebraic cohomology class
The quotient group Cl(X)/C~g(X) is finitely gen- x E AP(X) = A' (X) n H"'(X)
erated and is called as the N eron - Severi group of the
is called primitive if xL'X -P =0. In such a case, if k is
variety X. The problem of the quotient group
the field of complex numbers C, the bilinear form
cP(X) / C~g (X) being finitely generated for p> 1
remains open at the time of writing (1977). The quo- (a, b) ~ (-1)" Lx-"'ab
tient group C~g (X) / C:at (X) has the structure of an is positive definite on the subspace of primitive classes
Abelian variety (cf. Picard scheme). The operation of in AP(X). A similar proposition for arbitrary k, which
intersection of cycles makes it possible to define a mul- is closely connected with the Wei! conjectures on the
tiplication in the quotient group C(X) / Crat(X), con- zeta-function of an algebraic variety, has been proved
verting it into a commutative ring, called the Chow ring for n.;;;;2 only.

87
ALGEBRAIC CYCLE

If a variety X is defined over a field ~ that is not ALGEBRAIC EQUATION - An equation of the type
algebraically closed, the Galois group G(k / k) of the f" =0, where f" is a polynomial of degree n in one or
separable algebraic closure of the field k acts on the more variables (n ~o). An algebraic equation in one
Weil cohomology H*(X), where X=XQ!)"kk. Each ele- variable is an equation of the form
ment of A' (X) is invariant with respect to some sub- (1)
group of finite index of the group G(f / k). It is
Here n is a non-negative integer, ao, ... ,an are given,
believed (Tate's conjecture on algebraic cycles) that the
the so-called coefficients of the equation, while x is an
converse proposition is also true if k is finitely gen-
unknown which has to be found. It is assumed that the
erated over its prime subfield. Many conjectures on the
coefficients of the algebraic equation (1) are not all
zeta-function of algebraic varieties are based on this
equal to zero. If ao*O, then n is called the degree of
assumption [2]. the equation.
References The values of the unknown x which satisfy equation
[1] BALDASSARRl, M.: Algebraic varieties, Springer, 1956. (1), i.e. the values which, if substituted for x, will con-
[2] TATE, J.: 'Algebraic cohomology classes', in Summer school of vert this equation into an identity, are known as the
algebraic geometry Woods Hole, 1964.
[3] DoLGACHEV, LV. and ISKovsKlKH, V.A.: 'Geometry of alge-
roots of the equation (1), or as the roots of the polynomial
braic varieties', J. Soviet Math. 5, no. 6 (1976), 803-864. (ftogi j,,(x) = aOxn+alx n- 1+ ... +an- (2)
Nauk. i Tekhn. Algebra Topol. Geom. 12 (1974), 77-170)
[4] KLEIMAN, S.L.: 'Algebraic cycles and the Wei! conjecture', in The roots of a polynomial are related to its coefficients
Dix exposes sur la cohomologie des schemas, North-Holland, by Viete's formulas (cf. Viete theorem). To solve an
1968, pp. 359-386.
equation means to find all its roots contained in the
[5] GRIFFITHS, P.A.: 'On the periods of certain rational integrals
II', Ann. of Math. (2) 90, no. 3 (1969),496-541. range of values of the unknown(s) under consideration.
[6] LEFSCHETZ, S.: L'anaiysis situs et la geometrie algebrique, Paris, As far as applications are concerned, the most
1924. important case is that of coefficients and roots of an
[7] HODGE, W.V.D.: The theory and applications of harmonic
integrals, Cambridge Univ. Press, 1952. equation that are numbers of a certain kind (e.g.
[8] 'Groupes de monodromie en geometrie,algebrique', in M. Ray- rational, real or complex). The case of the coefficients
naud, D.S. Rim and A. Grothendieck (eds.): Sem. Geom. Alg., and roots being elements of an arbitrary field may also
Vol. 7, Springer, 1972-1973.
[9] DELIGNE, P.: 'La conjecture de Wei! 1', Pub!. Math. fHES 43
be considered.
(1974), 273-308. If a given number (or element of a field) c is a root
/. V. Dolgachev of the polynomial f,,(x) then, in accordance with the
Editorial comments. In 1983 H. Clemens proved that Bezout theorem, f,,(x) is divisible by x -c without
CP(X) / cglg (X) is not finitely generated [A 1]. He also remainder. The division may be performed according to
proved that Chom(X) / C,(X) is not finitely generated, even the Horner scheme.
after tensoring with the field of rational numbers [A1]. A number (or element of a field) c is called a root of
A state-of-the-art survey concerning the Hodge conjec- multiplicity k of a polynomial f (x) (where k is a non-
ture is in [A2]. See also [A3]. i,
negative integer) if f (x) is divisible by (x - c but is
not divisible by (x - ci + I. Roots of multiplicity one
Much of the recent progress of the theory of algebraic
cycles is related to algebraic K-theory, see [A4].
are called simple roots of the polynomial, other roots are
References called multiple roots.
[A1] CLEMENS, H.: 'Homological equivalence modulo algebraic
equivalence is not finitely generated', Publ. Math. IHES 58 Each polynomial f (x) of degree n >0 with coeffi-
(1983), 19-38. cients in a field P has at most n roots in this field P,
[A2] SHIADO, T.: What is known about the Hodge conjecture, each root being counted the number of times equal to
North-Holland - Kinokuniya, 1983.
[A3] ATIYAH, M.F. and HIRZEBRUCH, F.: 'Analytic cycles on com-
its multiplicity (consequently, there are not more than n
plex manifolds', Topology 1 (1961),25-45. different roots).
[A4] BLOCH, S.: Lectures on algebraic cycles, IV, Duke Univ., In an algebraically closed field any polynomial of
1980.
degree n has exactly n roots (counted according to their
AMS 1980 Subject Classification: 14C25 multiplicity). In particular, this statement also applies
to the field of complex numbers.
ALGEBRAIC DIMENSION of a topological space - Equation (1) of degree n with coefficients from a
One of the different numerical invariants of the ring of field P is called irreducible over P if the polynomial (2)
continuous functions on the given space, related to the is irreducible over this field, i.e. cannot be represented
concept of dimension. as the product of other polynomials of degrees lower
P.s. Aleksandrov
than n over P. Otherwise, both the polynomial and the
AMS 1980 Subject Classification: 54F45, 55M10 corresponding equation are called reducible. Polynomi-

88
ALGEBRAIC EQUATION

als of degree zero and zero itself are not considered to equation of degree n >4 with coefficients a 0, . . . ,an;
be reducible or irreducible. Whether a given polynomial let K be an arbitrary field and let P be the field of
is reducible or irreducible over a field P depends on the rational functions in ao, ... ,an with coefficients from
field in question. Thus, the polynomial x 2 - 2 is irredu- K; then the roots of equation (1) (lying in some exten-
cible over the field of rational numbers, since it has no sion of P) cannot be expressed in terms of the coeffi-
rational roots, but is reducible over the field of real cients of this equation by a finite number of operations
numbers: x 2 -2=(x + V2)(x - V2). Similarly, the of addition, subtraction, multiplication and division
polynomial x 2 + I is irreducible over the field of real (operations which are meaningful in P) and root extrac-
numbers, but is reducible over the field of complex tion (which is meaningful in the extension of P). In
numbers. Only polynomials of the first degree are other words, the general equation of degree n >4 is
irreducible over the field of complex numbers, and any unsolvable by radicals ([3]).
polynomial can be decomposed into linear factors. However, Abel's theorem does not contradict the fact
Only polynomials of the first degree and polynomials that some algebraic equations with numerical coeffi-
of the second degree without real roots are irreducible cients (or with coefficients from a given field) are solv-
over the field of real numbers (and all polynomials can able by radicals. Some special equations of degree n
be decomposed into products of linear and irreducible are solvable by radicals (e.g. a two-term equation). A
quadratic polynomials). Irreducible polynomials of all complete solution of the problem when an algebraic
degrees exist over the field of rational numbers; exam- equation is solvable by radicals was given by E. Galois
ples are the polynomials of the form xn +2. The irredu- about the year 1830.
cibility of a polynomial over the field of rational The fundamental theorem on the solvability of alge-
numbers can often be established by Eisenstein's cri- braic equations by radicals in Galois theory can be
terion: If, for a polynomial (2) of degree n >0 with stated as follows. Let f (x) be a polynomial with coeffi-
integral coefficients, there exists a prime number p such cients from a field K that is irreducible over K. Then,
that the leading coefficient ao is not divisible by p, all 1) if at least one root of the equation f (x) = 0 can be
the remaining coefficients are divisible by p, and the expressed by radicals in terms of the coefficients of this
constant term an is not divisible by p2, then this poly- equation and if the exponents of the radicals are not
nomial is irreducible over the field of rational numbers. divisible by the characteristic of K, then the Galois
Let P be an arbitrary field. For each polynomial group of this equation is solvable over K; 2) conversely,
f (x) of degree n> 1 that is irreducible over P, there if the Galois group of the equation f (x) = 0 is solvable
exists an extension of P containing at least one root of over the field and the characteristic of K is zero or is
f(x) (cf. Extension of a field); moreover, there exists a higher than all orders of the constituent factors of this
splitting field of f (x), i.e. a minimal extension of P in group, then all the roots of the equation can be
which this polynomial can be decomposed into linear represented by radicals in terms of its coefficients, all
factors. Every field has an algebraically closed exten- exponents of the roots a I / n involved can be taken to
SIOn. be prime numbers, and the binomial equations
Solvability by radicals. Any algebraic equation of xn -a =0 which correspond to these roots can be taken
degree not exceeding four is solvable by radicals. Solu- to be irreducible over the fields to which these radicals
tions of problems involving special forms of equations are adjoined.
of the second and third degrees were already known to This theorem was proved by Galois for the case in
ancient Babylonians (2000 B.C.) (cf. Quadratic equa- which K is the field of rational numbers; in this case all
tion; Cubic equation). The theory of the solution of conditions involving the characteristic of the field K in
quadratic equations was first expounded in the book the above formulation become superfluous.
Arithmetica by Diophantus (third century A.c.). In the Abel's theorem is a consequence of Galois' theorem,
sixteenth century, Italian mathematicians obtained the since the Galois group of equations of order n with
solution by radicals of equations of the third and coefficients (that are letters) over the field P of rational
fourth degrees with coefficients that are letters (cf. Car- functions in the coefficients of the equation with coeffi-
dano formula; Ferrari method). During the 300 years cients from an arbitrary field K, is the symmetric group
which followed, fruitless efforts were made to find solu- Sn, which is unsolvable for n >4. For any n >4 there
tions by radicals of equations of the fifth and higher exist equations of degree n with rational (and integral)
degrees with coefficients that are letters. It was finally coefficients that are unsolvable by radicals. An example
demonstrated by N.H. Abel in 1826 that such solutions of such an equation for n = 5 is the equation
do not exist. x 5- p2 X - P = 0, where p is a prime number. In Galois
Abel's theorem in modern formulation: Let (1) be an theory an algebraic equation is solved by reducing it to

89
ALGEBRAIC EQUATION

a chain of simpler equations, which are called resol- I t is sometimes desirable to find roots of a special
vents (cf. Resolvent) of the original equation. type. For instance, Hurwitz' criterion is a necessary
The solvability of equations by radicals is closely and sufficient condition for all the roots of an equation
connected with problems involving geometrical con- (with complex coefficients) to have negative real parts
structions with ruler and compasses; in particular, with (cf. Routh- Hurwitz criterion).
the problem of the division of the circle into n equal There exists a method for calculating all rational
parfs (cf. Cyclotomic polynomial; Primitive root). roots of a polynomial with rational coefficients. A poly-
Algebraic equations in one unknown with numerical nomial f(x) with rational coefficients has the same
coefficients. Methods of approximate calculations (e.g. roots as the polynomial g(x) with integral coefficients
the parabola method) are generally used to find the obtained from f (x) by multiplication by a common
roots of algebraic equations of degree higher than two multiple of all denominators of the coefficients of f (x).
with coefficients from the field of real or complex The only rational roots of a polynomial
numbers. It is convenient to begin by getting rid of the g(x)=boxn +b)x n -) + ... +bm bn=FO, with integral
multiple roots. A number c is a root of multiplicity k coefficients are found among the irreducible fractions
of a polynomial f (x) if and only if the polynomial and p / q in which P is a divisor of the number bn and q is
its derivatives up to the order k -1, inclusive, vanish if a divisor of the number b o (and only those fractions
x =c, and if j<k)(c)=FO. If f(x) is divided by the among them such that, for any integer m, the number
greatest common divisor d(x) of this polynOInial and gem) is divisible by p-mq). If b o=l, then all rational
its derivative, then the resulting polynomial has the roots of g(x) (if any) are integers, divisors of the con-
same roots as f (x), but only of multiplicity one. It is stant term, and can be found by trial and error.
further possible to construct the polynomials whose Systems of algebraic equations. Concerning systems of
simple roots are all the roots of the polynomial f (x) of algebraic equations of the first degree, see Linear equa-
equal (given) multiplicity. A polynomial has multiple tion.
roots if and only if its discriminant vanishes. A system of any two algebraic equations of any
The determination of the number of roots and degree in two unknowns x and y may be written as:
bounds on their size are frequently occurring problems. f(x,y) = ao(x)yn +al(X)yn-1 + ... +an(x) = a,}
The number g(x,y) = bo(x)ys+bl(x)y,-l + ... +bs(x) = 0, (3)
max I a; I
1+ ;>0
where ai(x), b/x) are polynomials in one unknown x.
I ao I
If a certain numerical value is assigned to x, a system
can be taken as an upper bound for the modulus of of two equations in one unknown y with constant coef-
each root (both real and complex) of the algebraic ficients ai, bj is obtained. The resultant of this system is
equation (1) with arbitrary complex coefficients. The the following determinant:
Newton method usually yields a more exact bound if ao al an 0 0 0
the coefficients are real. The determination of a lower 0 ao an
bound for the positive roots, and of upper and lower 0 0
bounds for the negative roots, are reduced to the deter- 0 0 0 ao an
mination of an upper bound for the positive roots. R(f, g) = bo bl bs 0 0 0
The simplest method to determine the number of real
0 bo bs
roots is to use the Descartes theorem. If it is known
that all the roots of a given polynomial are real (for 0 0
0 0 0 bo bs
example, for the characteristic polynomial of a real
symmetric matrix), then Descartes' theorem yields the (There are s rows of a's and n rows of b's in this deter-
exact number of roots. By considering the polynomial minant.) The following statement is true: A number Xo
f( - x), the number of negative roots of f (x) can be is a root of the resultant R(f, g) if and only if the poly-
found using the same theorem. The exact number of nomials f(xo,Y) and g(xo,y) have a common root yo
real roots located in a given interval (in particular, the or if the two leading coefficients ao(xo) and bo(xo) are
number of all real roots) of a polynomial with real equal to zero.
coefficients without multiple roots can be found by the Thus, in order to solve the system (3) one must find
Sturm theorem. Descartes' theorem is a special case of all roots of the resultant R(f, g), substitute each one of
the Budan - Fourier theorem, which yields an estimate these roots into the system (3) and find the common
from above for the number of real roots of a polyno- roots of these two equations in one unknown y. One
mial with real coefficients lying in a certain fixed inter- must also find the common roots of the two polynomi-
val. als ao(x) and bo(x), substitute them into (3), and verify

90
ALGEBRAIC FUNCTION

if the resulting equations in one unknown y have com- For k = 2, 3, 4, an algebraic function can
Xl, . . . ,Xn .
mon roots. In other words, the solution of a system of be expressed as square and cube roots of rational func-
two algebraic equations in two unknowns is reduced to tions in the variables x j, . . . , Xn; if k >4, this is
the solution of one equation in one unknown and the impossible in general.
calculation of the common roots of two equations in The theory of algebraic functions was studied in the
one unknown (the common roots of two or more poly- past from three different points of view: the function-
nomials in one unknown are the roots of their largest theoretical point of view taken, in particular, by N.H.
common divisor). Abel, K. Weierstrass and B. Riemann; the arithmetic-
Systems of any number of algebraic equations with algebraic point of view taken by R. Dedekind, H.
any number of unknowns are solved in a similar Weber and K. Hensel; and the algebraic-geometrical
manner. This task involves complicated calculations. It point of view, which originated with the studies of A.
is connected with the so-called elimination theory. Clebsch, M. Noether and others (cf. Algebraic
References geometry). The first direction of the theory of algebraic
[I] KUROSH, A.G.: Higher algebra, Mir, 1972 (in Russian). functions of a single variable is connected with the
[2] SUSHKEVICH, AX.: Foundations of higher algebra, Moscow- study of algebraic functions over the field of complex
Leningrad, 1941 (in Russian).
[3] WAERDEN, B.L. VAN DER: Algebra, 1-2, Springer, 1967-1971
numbers, in which they are regarded as meromorphic
(translated from the German). functions on Riemann surfaces and complex manifolds;
[4] MANIN, YU.I.: Encyclopaedia of elementary mathematics, Vol. 4, the most important methods applied are the geometri-
Moscow, 1963, pp. 205-227 (in Russian).
cal and topological methods of the theory of analytic
[5] DoMORYAD, A.P.: Encyclopaedia of elementary mathematics,
Vol. 2, Moscow, 1951, pp. 313-511 (in Russian). functions. The arithmetic-algebraic approach involves
/. V Proskuryakov the study of algebraic functions over arbitrary fields.
The methods employed are purely algebraic. The theory
Editorial comments. Abel's theorem mentioned above
(the general equation of degree n >4 is unsolvable by radi- of valuations and extensions of fields are especially
cals) is also called the Abel- Ruffini theorem. important. In the algebraic-geometrical approach alge-
Good up-to- date textbooks are [A 1] and [A2]. braic functions are considered to be rational functions
References on an algebraic variety, and are studied by methods of
[A 1] N.: Lectures in abstract algebra. Field theory and
JACOBSON, algebraic geometry (cf. Rational function). These three
Galois theory, 3, Freeman, 1975. points of view originally differed not only in their
[A2] JACOBSON, N.: Basic algebra, 1-2, Freeman, 1974-1980.
methods and their ways of reasoning, but also in their
AMS 1980 Subject Classification: 26CXX, 12AXX terminology. This differentiation has by now become
largely arbitrary, since function-theoretical studies
ALGEBRAIC FUNCflON A function involve the extensive use of algebraic methods, while
y =f(x l, . . . , xn) of the variables x j, Xn that satis-
... , many results obtained at first using function-theoretical
fies an equation and topological methods can be successfully applied to
(1) more general fields using algebraic analogues of these
methods.
where F is an irreducible polynomial in y, Xl, . . . , Xn
with coefficients in some field K, known as the field of
constants. The algebraic function is said to be defined Algebraic functions of one variable. Over the field C of
over this field, and is called an algebraic function over complex numbers, an algebraic function of one variable
the field K. The polynomial F(y, Xl, . . . ,xn ) is often y = f(x) (or y(x) for short) is a k-valued analytic func-
written in powers of the variable y, so that equation (1) tion. If D (x) is the discriminant of the polynomial
assumes the form F(x,y) = Pk(X)yk+ ... +PI(x)Y+Po(x), (2)
Pk(XIo'" ,Xn)yk+Pk_I(XIo'" ,Xn)yk-I+ ... + Pk(x) '" 0,
+PO(XI, ... ,xn) = 0, (i.e. of the polynomial for which F(x, f(x = 0), which
is obtained by eliminating y from the equations

where Pk(Xl, ... ,xn), ... ,PO(Xl, ... ,xn) are polyno-
mials in Xl, . . . ,Xn, and with Pk(Xl,'" ,xn);;eO. The
number k is the degree of F with respect to y, and is

F( x,y ) = , aF(x, y) =
ay
called the degree of the algebraic function. If k = 1, an to yield the equation
algebraic function may be represented as a quotient Pk(x)D(x) = 0,
PO(XI, ... ,xn) then the roots Xl, . . . ,Xm of this last equation are
y=-
PI(XI, ... ,xn) known as the critical values of y = f (x). The comple-
of polynomials, and is called a rational function of mentary set G=C\ {Xl, . . . ,xm } is known as the

91
ALGEBRAIC FUNCTION

non-critical set. For any point Xo EG equation (2) has k 00 00 .


/I (x) = ~a}"; = ~a}(x-xo)'la" (4)
different roots yA, ... ,y~ and the condition ;=0 ;=0

of(xo,Y~) .
oy =1= 0, J=l, ... ,k,
00 00

is satisfied. According to the implicit function theorem,


Ia,(x) = ~af'''; = ~af'(x_xo)'la,;
;=0 ;=0
in a neighbourhood of the point xo there exist k and similar expansions also take place for the elements
single-valued analytic functions f6 (x), ... ,f~ (x) which of other cycles. Such expansions of elements by frac-
satisfy the conditions tional degrees of the difference x - xo, where Xo is a
fd(xo) = y~, F(x,fd(x =0 critical point, are known as Puiseux series. The
and which can be decomposed into a convergent series transformation T~T" r = e 2'ITj / a, which corresponds to
one tum around x 0, converts the Puiseux series of ele-
fd(x) = y~ +a1(x - xO)+a1(x - xo)2 + . . .. (3)
ments in one cycle into each other in cyclic order, i.e.
Thus, for each point Xo EG one can construct k ele- there is a cyclic permutation of the series and of the
ments of an analytic function, known as the function corresponding elements. To turns around the critical
elements with centre at the point Xo. For any two points point correspond permutations of the elements with
X"X2EG, any elements fl (x) andf1(x) with centres at
x, and X2, respectively, are derived from each other by
centre at this point; these permutations consist of
cycles of the orders a" ... ,as, ~aj = k. The permuta-
analytic continuation along some curve in G; in partic- tions defined in this way constitute the monodromy
ular, any two elements with the same centres are also group of the algebraic function. If at least one element
connected in this way. If Xo is a critical point of an aj is greater than 1, the critical point Xo is called an
algebraic function, then two cases are possible: 1) Xo is algebraic branch point of the algebraic function; the
a root of the discriminant, i.e. D(xo)=O, but numbers aj (sometimes aj -I) are called the branch
Pk(xo)=F0; or 2) Pk(xo)=O. indices (or branch orders) of the algebraic function.
Case 1. Let Ko be a small disc with centre at Xo Case 2. If Pk(x)y is substituted for y, one returns to
which does not contain other critical points, and let
case 1; one obtains expansions similar to (4), which
/J. (x), ... ,hex) be a system of regular elements with may contain a finite number of terms with negative
centre at x' EK o, x' =FXO. These functions remain
indices:
bounded as x ~xo. Furthermore, let D be the circle 00 00
with centre Xo passing through x'; it is completely con- f(x) = ~ air ~ a;(x - xoY la. (5)
;=-P I=-P
tained inside Ko. The analytic continuation of some
given element, e.g. /J. (x), along D (in, say, the clock- If P >0, the point Xo is a pole of order p of the alge-
wise direction), yields an element / (x) which also braic function. An algebraic function is usually con-
belongs to the system of elements with centre x'. This sidered on the Riemann sphere S, i.e. on the complex
system consists of k elements, and a minimum required plane completed by the point at infinity x = 00. The
a,
finite number ~k of such turns yields the initial ele- introduction of the variable T= I / x reduces this case
ment /J. (x). One obtains a subsystem /J. (x), ... ,la, (x) to the previous case; in a neighbourhood of the point
of elements with centre x'; each one of these elements T= (x = 00) one has the expansion
may be obtained by analytic continuation of the other
y(x) = ~ ajTj la = ~ ajX- Jla . (6)
by a number of turns around the point Xo; such a j=-r j=-r

subsystem is known as a cycle. Any system If r>O, then the point x = 00 is called a pole of order r.
flex), ... ,jk(x) can be decomposed into a number of The parameter of the expansion in the series (3), (4),
non-intersecting cycles (5), (6) is called the local uniJormizing parameter for the
{/I (x), ... ,la, (x)}, algebraic function. If Xo is a non-critical point of the
{fa,+I(X), ... ,Ia,Ta,(X)}, ... ,
algebraic function, then T = x - x 0 can be taken as
parameter; if, on the other hand, Xo is a critical point,
{fa,t ... +a,,+I(X), ... ,Ia,+ ... +a,(X)}, the root (x - XO)I / a (where a is a positive integer) can
a, + ... +as=k. If a,>I, then the element /J.(x) is be taken as such a parameter. The population of all
not a single-valued function of x in the disc Ko, but is elements of an algebraic function described above
a single-valued analytic function of the parameter forms the complete algebraic function in the sense of
T=(X-XO)'/a, in a neighbourhood of T=O. In a cer- Weierstrass. Algebraic functions have no singularities
tain neighbourhood of this point the elements other than algebraic branch points and poles. The con-
/J. (x), ... ,la, (x) of the first cycle can be represented verse proposition is also true: A function y = f (x)
as convergent series which is analytic, is not more than s-valued at all

92
ALGEBRAIC FUNCTION

points of the Riemann sphere except for a finite of all rational functions R(y, x], ... , x n ) forms a field
number of points x I, . . . , Xm and x = 00, and has at Kf , coinciding with the field of rational functions on
such points only poles or algebraic branch points, is an the algebraic hypersurface in (n + I)-dimensional space
algebraic function of degree k~s. defined by the equation F(y, x], ... , x n ) = O. If the
The Riemann surface of a complete algebraic func- field of constants k is the field of complex numbers C
tion is compact and is a k-sheeted covering of the and if n = 1, then Kf is identical with the field of mero-
Riemann sphere, branch points can be the critical morphic functions on the Riemann surface of the alge-
points and the point x = 00. Algebraic functions are the braic function. The field Kf is an extension of finite
only class of functions with a compact Riemann sur- type of the field of constants k of transcendence degree
face. The genus of the Riemann surface of an algebraic n (cf. Extension of a field). In par~icular, any n + 1 ele-
function is important; it is called the genus of the alge- ments of this field are connected by an algebraic equa-
braic function. It can be calculated by the tion, so that each of them defines an algebraic function
Riemann - Hurwitz fonnula. The genus of a rational of the remaining elements. Any extension K of finite
function is zero, and its Riemann surface is the type of a field k of transcendence degree n is known as
Riemann sphere. The Riemann surface of an elliptic an algebraic function field in n variables (or, sometimes,
function that satisfies a third- or fourth-degree equation as a function field). Each such field contains a purely
is a torus; the genus of such a function is one. transcendence extension k (x I, . . . , x n ) of the field k
The universal covering Riemann surface of an alge- (called the field of rational functions in n variables). Any
braic function is a simply-connected two-dimensional element y EK satisfies some algebraic equation
manifold, i.e. it has a trivial fundamental group and is I>(y, x I, . . . ,xn ) = 0, and can be considered as an alge-
conform ally equivalent either to the Riemann sphere, braic function in the variables x I, . . . , x n . Each field K
the complex plane or the interior of the unit disc. In of algebraic functions in n variables is isomorphic to
the first case the algebraic function is a rational, in the the field of rational functions on some algebraic variety
second case it is an elliptic, while in the third case it is of dimension n, which is called a model of K. If the
a general function. field of constants k is algebraically closed and of
The unifonnization problem of algebraic functions is characteristic zero, then each algebraic function field
closely connected with the theory of Riemann surfaces has a non-singular projective model (cf. Resolution of
of algebraic functions. The function y = f (x) can be singularities). Let S be the set of all non-trivial valua-
uniformized if y and x are representable as single- tions (cf. Valuation) of an algebraic function field K
valued analytic functions which are non-negative on the field of constants. If
y = y(t), x = x(t) provided with the natural topology, it is known as the
abstract Riemann surface of the field K [1]. In the case
of a parameter t, which identically satisfy equation (2). of algebraic functions in one variable, the Riemann sur-
The uniformization problem is locally solved by a local face coincides with the set of non-singular projective
uniformizing parameter; however, it is the solution 'in models, which in this case is uniquely defined up to an
the large' that is of interest. If k = 1, i.e. if y (x) is a isomorphism. Many concepts and results in algebraic
rational function of x, this parameter may be the vari- geometry on the model of a field K can be restated in
able x - x 0; if k = 2, then uniformization is attained the language of the theory of valuations of fields [1],
with the aid of a rational or a trigonometric function.
[6]. A particularly close analogy holds for algebraic
For instance, if y (x) satisfies the equation
functions in one variable, the theory of which is practi-
y2_x 2 = 1, cally identical with the theory of algebraic curves.
one can take Each algebraic function field in one variable is the
field of fractions of a Dedekind ring, so that many
t2 + 1
Y = -2--1'
t -
x - results and concepts of the theory of divisibility in alge-
braic number fields can be applied to function fields
or [12]. Many problems and constructions in algebraic
y = sect, x = tgt.
number theory motivate similar problems and construc-
If k = 3, 4 in the case of an algebraic function of genus tions in fields of algebraic functions and vice versa. For
one, uniformization is achieved using elliptic functions. instance, the application of a Puiseux expansion to the
Finally, if k>4 and the genus of the algebraic function theory of algebraic numbers led to the genesis of the p-
is higher than one, uniformization is realized using adic method in number theory, due to Hensel. Class
automorphic functions (cf. Automorphic function). field theory, which had originally belonged to the
Algebraic functions of several variables. If f is an alge- domain of algebraic numbers, was subsequently applied
braic function in the variables x I, . . . , x n , then t} ~ set to functions [2]. An especially close analogy exists

93
ALGEBRAIC FUNCTION

between algebraic number fields and algebraic function ously a complex-analytic, differentiable and topological
fields over a finite field of constants. For instance, the space in the ordinary Hausdorff topology. This fact
concept of a zeta-function is defined for the latter and makes it possible to introduce a large number of classi-
the analogue of the Riemann hypothesis has been cal structures which induce such invariants of algebraic
demonstrated for algebraic function fields (cf. Zeta- varieties that can be obtained by purely algebraic
fun~on in algebraic geometry). means only with great difficulty or perhaps not all. The
References concepts and the results of algebraic geometry are
[I) ZARISKl, O. and SAMUEL, P.: Commutative algebra, I, Springer, extensively used in number theory (Diophantine equa-
1975. tions and the evaluation of trigonometric sums), in dif-
[2) SERRE, J.-P.: Groupes algehrique et corps des classes, Hermann,
1959.
ferential topology (both with respect to singularities
[3) SToiLOV, S.: The theory offunctions of a complex variable, 1-2, and differentiable structures), in group theory (alge-
Moscow, 1962 (in Russian and Rumanian). braic groups and simple finite groups connected with
[4) CHEBOTAREV, N.G.: The theory of algebraic functions,
Lie groups), in the theory of differential equations (K-
Moscow-Leningrad, 1948 (in Russian).
[5) SHAFAREVICH, I.R.: Basic algebraic geometry, Springer, 1977 theory and the index of elliptic operators), in the theory
(translated from the Russian). of complex spaces, in the theory of categories (topoi,
[6) CHEVALLEY, c.: Introduction to the theory of algebraic functions Abelian categories), and in functional analysis
of one variable, Amer. Math. Soc., 1951.
[7) ApPELL, P. and GOURSAT, E.: Theorie des fonctions algebriques, (representation theory). Conversely, the ideas and
1-2, Chelsea, reprint, 1976-1978. methods of these disciplines are utilized in algebraic
[8) DEDEKlND, R. and WEBER, H.F.: 'Theorie der algebraischen geometry.
Funktionen einer Veranderlichen', J. Reine Angew. Math. 92
(1882), 181-290.
The genesis of algebraic geometry dates back to the
[9) HENSEL, K. and LANDSBERG, G.: Theorie der algebraischen seventeenth century, with the introduction of the con-
Funktionen einer Variabeln und ihre Anwendung auf algebraische cept of coordinates into geometry. Nevertheless, its
Kurven und Abelsche Integrale, Teubner, 1902.
crystallization into an independent branch of science
[\0) PiCARD, E. and SIMART, G.: Theorie des fonctions algebriques
de deux variables independantes, 1-2, Chelsea, reprint, 1971. only began in the mid-nineteenth century. The utiliza-
[II) JUNG, H.W.E.: Einfohrung in die Theorir! der algebraischen tion of coordinates in projective geometry created a
Funktionen zweier Veriinderlicher, Akademie-Verlag, 1951. situation in which algebraic methods could compete
[12) HASSE, H.: Number theory, Springer, 1980 (translated from the
German).
with synthetic methods. However, the achievements of
[13) LANG, S.: A Igebraic functions, New York, 1965. this branch of algebraic geometry still are in the
A.B. Zhizhchenko domain proper of projective geometry, but as a result it
left to algebraic geometry the traditional study of pro-
AMS 1980 Subject Classification: 26A09, 30AXX
jective algebraic varieties. Modern algebraic geometry
ALGEBRAIC GEOMETRY The branch of arose as the theory of algebraic curves (cf. Algebraic
mathematics dealing with geometric objects connected curve). Historically, the first stage of development of
with commutative rings: algebraic varieties (cf. Alge- the theory of algebraic curves consisted in the clarifica-
braic variety) and their various generalizations tion of the fundamental concepts and ideas of this
(schemes, algebraic spaces, etc., cf. Scheme; Algebraic theory, using elliptic curves as examples. The system of
space). concepts and results which one now calls the theory of
Algebraic geometry may be 'naively' defined as the elliptic curves arose as a part of analysis (rather than
study of solutions of algebraic equations. The geometri- geometry) - the theory of integrals of rational func-
cal intuition appears when every 'set of solutions' is tions on an elliptic curve. At first, these were the
identified with a 'set of points in a coordinate space'. integrals that received the name 'elliptic', and it was
If the coordinates are real numbers, and if the space is only later that the meaning was extended to include
two- or three-dimensional, the situation may be clearly functions and curves (cf. Elliptic integral).
visualized; however, the language of geometry is also At the end of the seventeenth century, Jakob and
used in more general situations. This language implies Johann Bernoulli noted a new interesting property of
problems, constructions and considerations which do elliptic integrals. They studied integrals expressing the
not follow from the point of view of pure algebra. In arc lengths of certain curves. They found methods for
turn, algebra provides a flexible and powerful transforming one curve into another of the same arc
apparatus which is just as suitable for the conversion of length, even though the respective arcs could not be
a tentative reasoning into a proof as for the formula- brought into correspondence. From the analytic point
tion of such proofs in their most obvious and most gen- of view, this was tantamount to the conversion of an
eral form. integral into another integral, and in certain cases the
In the case of algebraic geometry over the field of result was a transformation of an integral into itself.
complex numbers, every algebraic variety is simultane- e.G. Fagnano, in the first half of the eighteenth cen-

94
ALGEBRAIC GEOMETRY

tury, gave numerous examples of such transformations. periods, and obtained several results in the problem of
L. Euler studied an arbitrary polynomial f (x) of the transformations. By transforming Abel's expressions of
fourth degree and posed the problem of the relations elliptic functions in series into the form of products,
between x and y that satisfy the equation Jacobi arrived at the concept of 8-functions (cf. Theta-
dx _ ~ function) and discovered their numerous applications
(I) not only in the theory of elliptic functions, but also in
Vj(x) - Vj(y)'
number theory and in mechanics.
He considered (1) as a differential equation, connecting
While studying the transformations of elliptic func-
x and y. The desired relation is the ordinary integral of
tions, Abel was the first to investigate the group of
this equation. The reason for the existence of the
homomorphisms of a one-dimensional Abelian variety.
integral of (1) and all its special cases, discovered by
Finally, as a result of the publication of the inheri-
Fagnano and the Bernoulli's, is the presence of a group
tance of e.F. Gauss and, in particular, of his diary, it
law on the elliptic curve s2 = f (t), and the invariance of
became clear that to some extent he possessed some of
the everywhere-regular differential form s -1 dt with
these ideas long before the work of Abel and Jacobi.
respect to shifts by elements of the group. Euler's rela-
The transition to the study of arbitrary algebraic curves
tions, which interconnect x with y by (1), may be writ-
still took place within an analytic framework: Abel had
ten as
shown the way in which the basic properties of elliptic
(x, Vj(xEB(e, Vj(e = (y, Vj(y, integrals can be generalized to include integrals of arbi-
where EB denotes the addition of points on the elliptic trary algebraic functions. Such integrals were subse-
curve. quently named Abelian integrals (cf. Abelian integral).
Thus it is seen that these results contain both the The work published by Abel in 1826 became the
group law on an elliptic curve and the existence of an starting point of the general theory of algebraic curves.
invariant differential form on this curve (cf. Elliptic It contains the concepts of the genus of an algebraic
curve). curve and of the equivalence of divisors, and presents
After the work of Euler, the theory of elliptic equivalence criteria in terms of integrals. It leads to the
integrals was mainly developed by A. Legendre. His theory of the Jacobi variety of an algebraic curve.
considerations, beginning in 1786, are collected in a In his dissertation, published in 1851, B. Riemann
three-volume treatise Traite des fonctions elliptiques et adopted a novel principle of studying functions of a
integrales Euleriennes ('A treatise on elliptic functions complex variable. He assumed that such a function is
and Euler integrals'). given not on the plane of the complex variable, but on
The studies carried out by N.H. Abel on the theory a 'many-sheeted' surface above this plane.
of elliptic functions appeared in 1827 - 1829. His start- The surfaces introduced by Riemann (cf. Riemann
ing point was the elliptic integral surface) are close to the modern concept of a one-
dimensional analytic manifold: A manifold on which
f
I-
0= dx analytic functions are defined. Riemann posed and
o V(l-e 2x 2)(I-e 2x 2)'
solved the question of the connection between his con-
where c and e are complex numbers. Abel considered cept and the concept of an algebraic curve; the
this integral as a function of the upper limit and intro- corresponding result is nowadays called Riemann's
duced the inverse function A(8) and the function existence theorem. After having studied the possible
!:l(O) = V(l-e 2A2)(I-e 2A2). position of branching points of the surface he proved
that the set of classes depends on 3p - 3 independent
Both these functions have two periods, 2w and 2w, in
parameters if p > 1, which he called the moduli (cf.
the complex domain:
Moduli problem).
I/e
w=2f dx The work of Riemann was the starting point for stu-
o V(l-c 2x2)(I-e 2x2) ' dies on the topology of algebraic curves; this study
I/e explains the topological meaning of the dimension p of
w=2f dx
the space QT[X] as one-half of the dimension of the
o V(l-c 2x2)(l-e 2x2) '
one-dimensional homology group of the space X (C).
so that the mapping x =A(8), y =t:.(8) defines a uru- Analytic methods led to the inequality
formization of the elliptic curve I(Dy;;:'deg(D)-p + 1. The Riemann - Roch equality
y2 =(1-c 2 x 2 )(l-e 2 x 2 ) by elliptic functions. was proved by E. Roch, a student of Riemann (cf.
Somewhat later than Abel, but independently, e.G.J. Riemann - Roch theorem). Finally, this study consti-
Jacobi studied the function inverse to the elliptic tutes the first appearance of the field k (X) as the pri-
integral; he showed that it has two independent mary object connected with a curve X, and of the con-

95
ALGEBRAIC GEOMETRY

cept of a birational isomorphism. The problem of as an independent branch. Thus, J. Plucker in 1834
inversion of integrals of arbitrary algebraic functions arrived at formulas interconnecting the class of the
had been posed much earlier by Abel. Another part of curve, its order and the number of its double points.
Riemann's study on Abelian functions deals with the He also showed that a plane curve of order three has
relations between O-functions and the inversion prob- nine points of inflection. However, such studies were
lem in general, in particular with the series (in p vari- only of secondary importance at that time.
ables) It was only after the studies of Riemann that the
O(V) = ~eF(m)+2(m,v), (2)
m
geometry of algebraic curves became an important field
in mathematics, together with the theory of Abelian
where m=(m), ... ,mp) runs through all integral p-
integrals and Abelian functions. This change was
dimensional vectors,
mainly due to the work of A. Clebsch. While Riemann
v = (VI> ' .. ,vp ), (rn, v) = ~rnjvj,
based his work on functions, Clebsch based his work
F(rn) = ~CXjkrnjrnb CXjk = CXkj' on algebraic curves. Clebsch and P. Gordan [10]
This series converges for all values of v if the real part deduced a formula for the number p of linearly
of the quadratic form F is negative definite. The princi- independent integrals of the first kind (i.e. of the same
pal properties of the function f) are the relations kind as the genus of the curve X), which expresses this
number in terms of the order of the curve and the
O(v+7Tir) = OCr), O(v+CXj) = eL/VlO(v), (3) number of singular points. They also showed that if
where r is an integer vector, aj is a column of the p = 0, the curve has a rational parametrization and, if
matrix (ajd and L/ v) is a linear function. p = 1, it becomes a plane curve of order three.
It was shown by Riemann that it is possible to select An error made by Riemann proved useful in the
cuts a I, . . . ,ap ' b), ... ,bp which convert the surface development of the algebraic-geometric aspect of the
he has introduced into a simply-connected surface, and theory of algebraic curves. In proving his existence
to select U I, . . . ,up of the everywhere-finite integrals theorems, Riemann considered it self-evident that a


on this surface such that the integrals Uj with respect to variational problem - the 'Dirichlet principle' - is
ak are if J=I=k, and are wi if J =k, while the integrals solvable. It was soon shown by Weierstrass that this is
Uj with respect to bk form a symmetric matrix (ajd not true in all cases. Accordingly, Riemann's results
which satisfies conditions that ensure the convergence remained unfounded for a certain period of time. One
of the series (2), He considered the function f) way in which this difficulty could be overcome was to
corresponding to these coefficients ajk. The periods of prove these theorems by an algebraic method; their for-
an arbitrary 2n-periodic function of n variables satisfy mulation was essentially algebraic. These studies, which
relations analogous to those necessary for the conver- were undertaken by Clebsch, greatly facilitated the
gence of the series which determine the f)-functions. understanding of the algebraic-geometric nature of the
These relations between the periods were explicitly results of Abel and Riemann, which had hitherto been
written out by G. Frobenius, who showed that they are masked by the analytic approach.
necessary and sufficient for the existence of non-trivial The studies initiated by Clebsch were continued and
functions which satisfy the functional equation (3) (cf. expanded by a student of his school - M. Noether.
Theta-function; Abelian function). These relations are Noether's ideas were presented very clearly in a study
necessary and sufficient for the existence of a mero- published by himself and A, Brill. They posed the
morphic function with given periods which cannot be problem of the development of geometry on an alge-
reduced to a function in a smaller number of variables braic curve lying in a projective plane as the set of
by linear transformations. This theorem was formulated results which are invariant with respect to one-to-one
by K. Weierstrass and proved by H. Poincare. It was (i,e. birational) transformations (cf. Birational
shown in 1921 by S. Lefschetz that, if the Frobenius geometry).
relations are satisfied, the f)-function defines an imbed- By the eighteen-fifties many special properties of
ding of the variety cn / ~ into a projective space (~ is algebraic varieties of dimension higher than one
the lattice corresponding to the given matrix of periods; (mostly surfaces) had been discovered. For instance, a
cf. Complex torus). detailed study was made of surfaces of the third degree;
The concepts and the results which now form the in particular, it was shown by G. Salmon and A. Cay-
base of the theory of algebraic curves were created ley in 1849 that any cubic surface without singular
under the influence and within the framework of the points contains 27 different straight lines. However,
theory of algebraic functions and their Integrals. The these results were not connected with any general prin-
purely geometrical theory of algebraic curves developed ciples for a long time, and remained unrelated to the

96
ALGEBRAIC GEOMETRY

fundamental ideas in the theory of algebraic curves transformations z~(az+b)/(ez+d), where a, b, e and
which were developed at the same time. d are integers, ad - be = 1, and
The development of algebraic geometry was strongly
affected by the Italian school, in particular by L. [:~) = [~ ~) (mod 7).
Cremona, C. Segre and E. Bertini. The principal
representatives of this school were G. Castelnuovo, F. He showed that these functions uniformize the curve
Enriques and F. Severi. One of the principal achieve- x6x 1+ XTx2 + x~xo =0 of genus three. It is possible in
ments of the Italian school was the classification of this manner to distort the fundamental polygon of this
algebraic surfaces (cf. Algebraic surface). The first result group and to obtain new groups which uniformize
in this direction was reported by Bertini in 1877; he curves of genus three. A similar reasoning underlies the
gave the classification of involutive plane transforma- studies of Klein and Poincare, the latter constructing
tions or, in modern terms, a classification up to conju- automorphic functions with the aid of series which are
gation in the group of birational automorphisms of the now named after him. They both correctly conjectured
plane of all elements of order two in this group. This that any algebraic curve can be uniformized by a
classification is very simple and, in particular, it can be corresponding group, and made considerable advances
readily deduced that the quotient of the plane by a in their attempts to prove this result. The complete
group of order two is a rational surface. In other proof was only obtained in 1907 by Poincare and,
words, if a surface X is unirational and a morphism independently, by P. Koebe. A major contribution to
1: p2~X is of degree two, then X is rational. Castel- this proof was made by the studies of Poincare on the
nuovo (1893) solved (positively) the general case of the concepts of the fundamental group and a universal cov-
Liiroth problem for algebraic surfaces. He also posed ering.
and solved the problem of the characterization of The topology of algebraic curves is very simple, and
rational surfaces by numerical invariants. A classifica- was exhaustively investigated by Riemann. E. Picard
tion of surfaces was achieved by Enriques in a series of studied the topology of algebraic surfaces by a method
studies, which ended only in the first decade of the based on the study of the fibres of a morphism
twentieth century. 1: X ~p I. He studied the variation of the topology of
The principal tool of the Italian school was the study the fibre 1- I (a) with the variation of the point a Epl
of families on a surface; the families were taken linear and, in particular, conditions under which this fibre
or algebraic (the latter also being known as 'continu- contains a singular point. For instance, it was proved in
ous'). This led to the concept of linear and algebraic this way that smooth surfaces in p3 [11] are simply
equivalence. The connection between these concepts connected. Poincare also made an important contribu-
was first studied by Castelnuovo. An important contri- tion to the topology of algebraic surfaces.
bution to the subsequent development of the problem In 1921 S. Lefschetz started to use a new science -
was made by Severi. A large portion of the concepts topology - in the study of algebraic varieties over the
was defined in an analytic form, and the algebraic sig- field of complex numbers. His first aim was to obtain a
nificance was clarified in the course of time. Neverthe- simplification of the results of Poincare and Picard and
less, there are still many concepts and results which are to obtain generalizations of these results to higher-
essentially analytic, at least from the modern point of dimensional cases. However, he did considerably more
VIew. and his works opened a new domain of research in
The studies of F. Klein and H. Poincare on the prob- algebraic geometry. A further application of Lefschetz
lem of unifonnization of algebraic curves by auto- to algebraic geometry is connected with the theory of
morphic functions appeared in the early eighteen- algebraic cycles on algebraic varieties. He proved that a
eighties. Their objective was the uniformization of all two-dimensional cycle on an algebraic variety is homo-
curves by functions now known as automorphic, analo- logous to a cycle representable by an algebraic curve if
gous to the uniformization of curves of order one by and only if the regular double integral
elliptic functions. Klein's starting point was the theory jjR(x,y, z)dxdy has a zero period over this cycle.
of modular functions. The field of modular functions is Lefschetz' studies laid the foundations of the modern
isomorphic to the field of rational functions, but it is theory of complex manifolds. Such manifolds were sub-
possible to consider functions that are invariant with sequently studied by powerful tools, including the
respect to various subgroups of the modular group and theory of harmonic integrals r.yv. Hodge, G. de Rham)
to obtain more complicated fields in this manner. In and the theory of sheaves (H. Cartan, J. Leray). It
particular, Klein considered functions that are auto- could be shown by using these techniques that non-
morphic with respect to the group consisting of all singular algebraic varieties form part of an important

97
ALGEBRAIC GEOMETRY

class of complex manifolds - Kahler manifolds (cf. ley and J.-P. Serre introduced powerful methods of
Kibler manifold). commutative and, in particular, local algebra into alge-
The theory of sheaves, and the related theory of vec- braic geometry in the early nineteen-fifties.
tor bundles on complex manifolds, supplied a new Serre gave a definition of varieties based on the con-
interpretation and considerable generalization of many cept of a sheaf. He also established the theory of
classical invariants of algebraic surfaces (of both the coherent algebraic sheaves, modelled on the theory of
arithmetic and geometric genus, and of the canonical coherent analytic sheaves, which had been introduced
system). One of the most important achievements of only a short time before (cf. Coherent algebraic sheaf;
this theory was the creation of the theory of Chern Coherent analytic sheaf).
classes (cf. Chern class) and a considerable In the late nineteen-fifties algebraic geometry
generalization of the classical Riemann - Roch theorem underwent a further radical transformation, due to
by F. Hirzebruch [7]. A. Grothendieck's work on the foundation of the con-
In the middle of the nineteen-twenties much work cept of a scheme. By using the language of category
was done on broadening the scope of algebraic theory, Grothendieck succeeded in generalizing and
geometry from the aspect of set theory and from the clarifying many important classical constructions in
axiomatic aspect. The scope of application of algebraic algebraic geometry, the abstract definitions of which
geometry was greatly extended to include complex had been until now only little geometric. He also
manifolds and algebraic varieties over arbitrary fields. founded many new branches of algebraic geometry (cf.
Interest in algebraic geometry over 'non-classical' fields Abstract algebraic geometry). The language of the
originated in the theory of congruences, interpreted as theory of schemes has now become a part of the rou-
equations over a finite field. It was claimed by Poincare tine of modern algebraic geometry, has united it natur-
in his address at the International Congress of ally with commutative algebra, and has made a signifi-
Mathematicians in 1908 that methods of the theory of cant contribution to the advances made in the study of
algebraic curves may be used for studying equations arithmetical problems of algebraic varieties (cf. Alge-
with two unknowns. The ground for a systematic con- braic varieties, arithmetic of). The theory of schemes
struction of algebraic geometry had been prepared in directly affected one of the most important recent
the first decade of the twentieth century by the general achievements of algebraic geometry - the solution (by
development of the theory of fields and the theory of H. Hironaka) of the problem on the existence of a
rings. non-singular birational model of an algebraic variety
In the nineteen-thirties H. Hasse and his school defined over a field of characteristic zero (cf. Resolution
attempted to prove the Riemann hypothesis (cf. of singularities).
Riemann hypotheses), which may be formulated for any References
algebraic curve over a finite field; this involved the [I] SHAFAREVICH, I.R.: Basic algebraic geometry, Springer, 1977
development of a theory of algebraic curves over an (translated from the Russian).
[2] HODGE, W.V.D. and !'EDOE, D.: Methods of algebraic geometry,
arbitrary field. The hypothesis itself was proved by
1-3, Cambridge Univ. Press, 1947-1954.
Hasse for elliptic curves. Advances in the construction [3] BALDASSARRI, M.: Algebraic varieties, Springer, 1956.
of algebraic geometry over arbitary fields are also due [4] DIEUDONNE, J.: 'The historical development of algebraic
to the studies of B.L. van der Waerden in the period geometry', Amer. Math. Monthly 79 (1972),827-866.
[5] WElL, A.: Foundations of algebraic geometry, Amer. Math. Soc.,
between 1931 and 1939. In particular, he developed 1946.
the theory of intersections in a smooth projective [6] GROTHENDIECK, A. and DIEUDONNE, J.: 'Elements de
variety. geometrie algebrique', Pub!. Math. IHES 4;8;11;17;20;24;28;32.
[7] HIRZEBRUCH, F.: Topological methods in algebraic geometry,
A. Weil, in 1940, succeeded in formulating a proof of Springer, 1978 (translated from the German).
the Riemann hypothesis for an arbitrary algebraic [8] ENRIQUEs, F.: Le superficie algebraiche, Bologna, 1949.
curve over a finite field. He found two ways of proving [9] ZARISKI, 0.: AlgebraiC surfaces, Springer, 1971.
[10] CLEBSCH, A. and GORDAN, P.: Theorie der Abelschen Funk-
the hypothesis: one based on the theory of correspon- tionen, Teubner, 1866.
dences of the curve X (i.e. of divisors on the surface [II] PICARD, E. and SIMART, G.: Theorie des fonctions algebriques
X X X), while the other was based on the study of its de deux variables independantes, 1-2, Chelsea, reprint, 1971.
Jacobi variety. Thus, higher-dimensional varieties are Based on material from the 'Historical sketch' of [I]
used in both cases. Accordingly, Weil's book [5] con- Editorial comments. For another good reference on
tains the construction of algebraic geometry over an Hirzebruch's work concerning a generalization of the classi-
arbitrary field: the theory of divisors, cycles and inter- cal Riemann - Roch theorem, see [A 1]. A recent historical
sections. For the first time 'abstract' (not necessarily account of algebraic geometry is [A2].
quasi-projective) varieties were defined by pasting References
together affine pieces. O. Zariski, P. Samuel, C. Cheval- [A 1] FULTON, W.: Intersection theory, Springer, 1984.

98
ALGEBRAIC GROUP OF TRANSFORMATIONS

[A2] DIEUDONNE, I.: History of algebraic geometry, Wadsworth, linear algebraic subgroup H such that the quotient
Monterey, 1985. group G / H is an Abelian variety [AI]. Numerous
[A3] HARTSHORNE, R.: Algebraic geometry, Springer, 1977.
[A4] GRIFFITHS, PH. and HARRIs, I.: Principles of algebraic
examples of algebraic groups which are neither linear
geometry, Wiley, 1978. algebraic groups nor Abelian varieties are given by the
theory of generalized Jacobi varieties for algebraic
AMS 1980 Subject Classification: 14-XX
curves with singular points [3] (cf. Jacobi variety). A
natural extension of the class of algebraic groups leads
ALGEBRAIC GROUP - A group G provided with the to the concept of a group scheme.
structure of an algebraic variety in which the multipli- References
cation p.: G X G~G and the inversion mapping p: G~G [I] BOREL, A.: Linear algebraic groups, Benjamin, 1969.
are regular mappings (morphisms) of algebraic varieties. [2] MUMFORD, D.: Abelian varieties, Oxford Uniy. Press, 1974.
[3] SERRE, I.-P.: Groupes algebrique et corps des classes, Hermann,
An algebraic group is said to be defined over a field k if
1959.
its underlying algebraic variety and the morphisms p. [4] CHEYALLEY, c.: 'La theorie des groupes algebriques', in Proc.
and p are defined over k. In such a case the set of k- Internat. Congress Mathematicians Edinburgh, 1958, Cambridge
Uniy. Press, 1960, pp. 53-68.
rational points of the variety G is an (abstract) group
[5] DEMAZURE, M. and GROTHENDIECK, A.: 'Schemas en groupes',
which is denoted by G(k). An algebraic group is called in Sem. Geom Alg. 1963-1964, Leet. Notes in Math., Vol. 151-
connected if its algebraic variety is connected. The 153, Springer, 1970. B B TJ" k
. . "en ov
dimension of an algebraic group is the dimension of its
v.P. Platonov
algebraic variety. In what follows, only connected alge- Editorial comments.
braic groups will be considered. A subgroup H of an
References
algebraic group G is called algebraic if it is a closed [A1] CHEVALLEY, c.: 'Une demonstration d'un theoreme sur les
subvariety of the algebraic variety G. For such sub- groupes algebriques', J. Math. Pures Appl. 39 (1960), 307-
groups the space of co sets (left or right) can be natur- 317.
ally provided with the structure of an algebraic variety, AMS 1980 Subject Classification: 14KXX, 20GXX
defined by a universal property (d. Quotient space of
an algebraic group). If the subgroup H is also normal, ALGEBRAIC GROUP OF TRANSFORMATIONS -
then the quotient group G / H is an algebraic group An algebraic group G acting regularly on an algebraic
with respect to this structure, and is calle~ an algebraic variety V. More precisely, it is a triplet (G, V, T) where
quotient group. A homomorphism <1>: G~G of algebraic T: G X V ~ V (r(g, x) = gx) is a morphism of algebraic
groups is called algebraic if <I> is a morphism of their varieties satisfying the conditions: ex =x, g(hx)=(gh)x
algebraic varieties; if <I> is defined over k, it is called a for all x E V and g, hE G (where e is the unit of G). If
k-homomorphism. A k-isomorphism of an algebraic G, V and T are defined over a field k, then (G, V, T) is
group is defined in a similar manner. called an algebraic group of k-transformations. For
Examples of algebraic groups: The general linear instance, (G, G, T), where T is the adjoint action or an
group GL(n, k) (the group of all invertible matrices of action by shifts, is an algebraic group of transforma-
order n, with coefficients in a fixed algebraically closed tions. If G is an algebraic subgroup in GL(n) and T is
field k); the group of triangular matrices; an eUiptic its natural action on the affine space V = k", then
curve. (G, V, T) is an algebraic group of transformations. For
There are two main types of algebraic groups, with each point x E V one denotes by G (x) = {gx: g E G} the
altogether different properties: Abelian varieties and orbit of x, and by Gx = {gEG: gx =x} the stabilizer of
linear algebraic groups (d. Abelian variety; Linear alge- x. The orbit G (x) need not necessarily be closed in V,
braic group). The type of a particular group is deter- but closed orbits exist always, e.g. orbits of minimal
mined exclusively by properties of its variety. An alge- dimension are closed. An algebraic group of transfor-
braic group is called an Abelian variety if its algebraic mations is sometimes understood to mean a group G
variety is complete. An algebraic group is called linear which is acting rationally (but not necessarily regularly)
if it is isomorphic to an algebraic subgroup of a general on an algebraic variety V (this means that T: G X V ~ V
linear group. An algebraic group is linear if and only if is a rational mapping, and the above properties of T are
its algebraic variety is affine. These two classes of alge- valid for ordinary points). It was shown by A. Well [3]
braic groups have a trivial intersection: If an algebraic that there always exists a variety VI, birationally iso-
group is both an Abelian variety and a linear group, morphic to V, and such that the action of G on VI
then it is the identity group. The study of arbitrary induced by the rational action of G on V is regular.
algebraic groups reduces to a great extent to the study The problem of describing the orbits, stabilizers, fields
of Abelian varieties and linear groups. In particular, an of invariant rational functions (cf. Invariants, theory of),
arbitrary algebraic group contains a unique normal and of constructing quotient varieties are fundamental

99
ALGEBRAIC GROUP OF TRANSFORMATIONS

in the theory of algebraic groups of transformations which satisfy linear differential equations with coeffi-
and have numerous applications. cients from the field of rational functions [2], [3]. Alge-
References braic independence has been proved for numbers aP
[1) BoREL, A.: Linear algebraic groups, Benjamin, 1969. and at, where a*O, I is an algebraic number and f3 is
[2A) DIEUDONNE, J.A. and CARREll, J.B.: Invariant theory, old and a cubic irrationality. A number of theorems have also
new, Acad. Press, 1971.
been proved which deal with the algebraic non-
[2B) MUMFORD, D.: Geometric invariant theory, Springer, 1965.
[2C) MUNFORD, D.: 'Projective invariants of projective structures expressibility of numbers; a concept close to that of
and applications', in Proc. Internat. Congress Mathematicians algebraic independence.
Stockholm, 1962, Inst. Mittag-Leffler, 1%3, pp. 526-530. It is possible to impart a quantitative facet to the
[2D) SESHADRI, C.S.: 'Quotient spaces modulo reductive groupes
and applications to moduli of vector bundles on algebraic
qualitative concept of algebraic independence if the
curves', in Actes du Congres Internat. Mathematiciens Nice, measure of algebraic independence (cf. Algebraic
197JJ, Yol. 1, Gauthier-Yillars, 1971, pp. 479-482. independence, measure of) of these numbers is con-
[3) WElL, A.: 'On algebraic groups and homogeneous spaces',
sidered. The analytic methods referred to above yield
Amer. J. Math. 77, no. 2 (1955), 355-391.
v.P. Platonov estimates from below for the measure of algebraic
Editorial comments. The notion in question is also independence of certain classes of numbers. General
called an algebraic transformation space. theorems treating the estimation of the measure of
AMS 1980 Subject Classification: 20GXX, 20HXX algebraic independence of the values of E-functions
have been established [3].
ALGEBRAIC INDEPENDENCE - A concept in the References
theory of field extensions (cf. Extension of a field). Let [I) LANG, S.: Algebra, Addison-Wesley, 1974.
K be some extension of a field k. The elements [2) FEL'DMAN, N.!. and SHIDLOvsKII, A.B.: 'The development and
present state of the theory of transcendental numbers', Russian
b 1> ,bn EK are called algebraically independent over Math. Surveys 22, no. 3 (1967), 1-79. (Uspekhi Mat. Nauk 22,
k if for each polynomial f(x) , ... ,xn ) with coefficients no. 3 (1967), 3-81)
from k which is not identically equal to zero, [3) SHIDLOVSKIi, A.B.: 'On arithmetic properties of values of ana-
lytic functions', Trudy Mat. Inst. Steklov. 132 (1973), 169-202
feb), ... ,bn)*O. Otherwise, the elements b), ... ,bn
(in Russian).
are called algebraically dependent. An infinite set of ele- A.B. ShidlovskiF
ments is called algebraically independent if each one of Editorial comments. The above-mentioned theorem on
its finite subsets is algebraically independent; otherwise the algebraic independence of the values of e Z is known as
it is called algebraically dependent. The definition of the Lindemann- Weierstrass theorem. Schanuel's
algebraic independence may be extended to the case conjecture, 'transcendence degree
where K is a ring, and k a sub ring [I]. (x 1, . . . ,Xn , eX' , ... ,eX,),;:, n for any Q-linear independent
X1, .. ,XnEC', is still (1986) unproved. An analogue of the
Algebraic independence of numbers. Complex numbers
Lindemann - Weierstrass theorem for values of the Weier-
a), ... ,an are called algebraically independent if they
strass Ie -function has been proved recently [A1], [A2] by
are algebraically independent over the field of algebraic
using Gel'fond - Schneider-type arguments and zero esti-
numbers, i.e. if for any polynomial P(x), ... ,xn ) with
mates for polynomials on algebraic group varieties. Another
algebraic coefficients, not all of which are zero, the
method in algebraic independence is Mahler's method for
relationship P(a), ... , an)*O is valid. Otherwise, values of functions satisfying functional equations [A3].
a), ... ,an are called algebraically dependent. The con-
References
cept of algebraic independence of numbers is a general- [A 1] WDSTHOLZ, G.: 'Ueber das abelsche Analogon des Lin-
ization of the concept of transcendency of a number demannsche Satzes I " Inv. Math. 72 (1983), 363-388.
(the case n = I). If several numbers are algebraically [A2] PHILIPPON, P.: 'Varietes abeliennes et independence
algebrique II', Inv. Math. 72 (1983),389-405.
independent, each one of them is transcendental. It is [A3] LOXTON, J. and POORTEN, A.1. VAN DER: 'Transcendence and
usually very difficult to prove that given numbers are algebraic independence by a method of Mahler', in A Baker
algebraically independent. The eXlstmg analytic and D. Masser (eds.): Transcendence theory, Acad. Press,
1977, Chap!. 15.
methods of the theory of transcendental numbers give a
solution of this problem for the values of certain classes AMS 1980 Subject Classification: 10F37
of analytic functions. For instance, it was found that
the values of the exponential function e Z , the argu- ALGEBRAIC INDEPENDENCE, MEASURE OF -
ments of which are algebraic and linearly independent The measure of algebraic independence of the numbers
over the field of rational numbers, are algebraically aI, ... ,am is the function
independent. A similar result was obtain~d for Bessel cIJ(al,'" ,am;n,H) = min 1 P(al,. ,am) I,
functions (cf. Siegel method). Several general theorems
were also established concerning algebraic indepen- where the minimum is taken over all polynomials of
dence of the values in algebraic points of E-functions degree at most n, with rational integer coefficients not

100
ALGEBRAIC K-THEORY

all of which are zero, and of height at most H. For same type (for example, an obstruction to the deforma-
more details see Transcendency, measure of. tion of a pseudo-isotopy of a closed manifold into an
A.B. Shidlovskit isotopy lies in some quotient group of the group
AMS 1980 Subject Classification: 12AXX K2(ZW. Algebraic studies of the Whitehead group
began in the nineteen-fourties. A related field is the
ALGEBRAIC IRRATIONALITY - An irrational alge- study of the structure of linear groups over arbitrary
braic number. An algebraic irrationality is the root of rings, in particular, the theory of determinants over a
an irreducible polynomial of a degree at least two, with skew-field [10].
rational coefficients. The second origin of algebraic K-theory was an alge-
A.I. Galochkin braic proof of the Riemann - Roch theorem [7] and its
AMS 1980 Subject Classification: 12AXX generalizations by A. Grothendieck in 1957. These con-
siderations involved the introduction of the K-functor
ALGEBRAIC K-THEORY - A branch of algebra, K(X) as the group of values of a universal additive
dealing mainly with the study of the so-called K- functor on coherent sheaves on a smooth algebraic
functors (Ko. K], etc., cf. K-functor); it is a part of variety. Moreover, the previously familiar representa-
general linear algebra. It deals with the structure theory tion rings, Witt rings (cf. Witt ring) of classes of qua-
of projective modules and their automorphism groups. dratic forms, etc., turned out to be related construc-
To put it more simply, it is a generalization of results tions. The K-functor was then transferred to topology,
obtained on the existence and uniqueness (up to an in which it found numerous applications, and with its
automorphism) of a basis of a vector space and other help several previously unsolved problems could be
group-theoretical facts concerning linear groups over dealt with.
fields. On passing from a field to an arbitrary ring R It became clear, moreover, that this construction
these theorems usually become invalid, and the reveals new perspectives in the understanding of old
Grothendieck group Ko(R) and the Whitehead group analytical problems (the index problem of elliptic
K] (R) are, in a certain sense, a measure of their devia- operators), topological problems (extraordinary homol-
tion from being true. Similar generalizations of the ogy theories), and the theory of group representations.
structure theorems of linear algebra appear in topology. However, the development of algebraic K-theory for
A vector space can be regarded as a special case of a rings (beginning with the establishment of the
vector bundle. These objects may be studied with the correspondence (analogy) between projective, finitely-
aid of the homotopy theory of vector bundles and of generated modules and vector bundles) was hindered
topological K-tbeory. It is important to note in this by the fact that an adequate concept, analogous to that
connection that a projective module can be regarded as of suspension in topology, was lacking in algebra.
the module of sections of a vector bundle. This explains The nineteen-fifties and nineteen-sixties saw the
the choice of the class of projective modules as the beginning of the systematic study of projective modules
object of the theory. Algebraic K-theory makes exten- over finite groups, and the development of one of the
sive use of the theory of rings, homological algebra, most important ideas on which algebraic K-theory is
category theory and the theory of linear groups. based - the idea of 'stabilization', the essence of
Algebraic K-theory has two different historical ori- which, roughly speaking, is that general relationships
gins, both in the field of geometry. The first is related are more clearly manifested on passing to the limit of
to certain topological obstructions. The starting point the dimension of the objects studied (e.g. linear groups
was the introduction of the concept of Whitehead tor- or projective modules). Connections were noted
sion, which is connected with the homotopy between algebraic K-theory and the reciprocity laws of
equivalence of finite complexes and is an element in the the theory of algebraic numbers and algebraic func-
Whitehead group, the latter being some quotient group tions; studies were made of problems connected with
of the group K] (R), where R = ZII is the integral group congruence subgroups (cf. Congruence subgroup) and
ring of the fundamental group II. The next step con- an algebraic analogue of the Bott periodicity theorem
cerned topological spaces X that are dominated by a - the theory of polynomial extensions - was
finite complex, and their generalized Euler characteris- obtained.
tic X(A), which is an element of group Ko(Zw). The For a ring R with a unit element, the Grothendieck
computation of the Whitehead group and L-groups group Ko(R) is defined as the Abelian group generated
(which is, strictly speaking, an algebraic problem by the isomorphism classes of finitely-generated projec-
concerning group rings), was in fact one of the first tive R-modules, with the defining relation:
objectives of algebraic K-theory. Both K2 and other
higher functors have topological applications of the

101
ALGEBRAIC K-THEORY

where [P] is the class of modules isomorphic to the If fz is also an epimorphism, then the sequence is sup-
module P. Let GL(n, R) be the general linear group plemented by the terms
over R, let A ~ II~ ?II K 2 (R) ~ K 2 (R 1)ffiK2 (R 2 ) ~ K 2 (R') ~ K1(R) ~ .. '.

be the imbedding of GL(n, R) in GL(n + 1, R), let If I is a two-sided ideal of R, then the Mayer- Vietoris
GL(R) be the direct limit of the groups GL(n, R), and sequence makes it possible [8] to define the relative
let E(R) be the subgroup in GL(R) generated by the functors K;(R, I), which yield an exact sequence
elementary matrices et, i.e. by the matrices that have K 2(R,J) ~ KiR) ~ K 2 (R/ J) ~ K1(R,J) ~ K1(R) ~
an element hER at the (i, j)-th place and agree with
~ K1(R / J) ~ Ko(R, J) ~ Ko(R) ~ Ko(R / J).
the unit matrix in all other places. E(R) then coincides
with the commutator of GL(R). The quotient group A fairly complete study has been made of the
GL(R)/ E(R) is denoted by KI(R), and is known as behaviour of K-functors on passing from a ring R to its
the Whitehead group. Finally, the Steinberg group localization with respect to a central, multiplicatively-
St(n, R) for n;;;.3 is defined by the generators XA.'J' hER , closed system. In particular, if certain conditions on R
are satisfied, then the following exact sequence has
l~i,j~n, i=/=j, and the relations
been found for the functor Go(R):
xAx"" = x A+I'.
'J'
lJ I)
U Go(R / (s ~ Go(R) ~ GO(S-I R) ~ O.
[xt, xjd = x>;t if i=/=I; seS

[xt,x~tl =1 if }=/=k, i=/=I. If R is commutative, Ko(R) becomes a ring with a unit


Passing to the direct limit, one obtains the group St(R) element by introducing the multiplication induced by
and a natural homomorphism the te~sor product of modules. There exists a split epi-
morphism of Ko(R) onto the ring H (R) of continuous
</>: St(R) ~ E(R),
integer-valued functions (the ring Z is given the discrete
with topology) on the spectrum of R (cf. Spectrum of a ring).
<f>(xt) = et
The kernel of this_homomorphism is denoted by Ko(R).
The kernel kercf> is denoted by K 2(R) (the Milnor
It is known that Ko(R) is the nil radical of Ko(R) and,
group). It coincides with the centre of St(R). Thus,
if R is Noetherian and if the dimension of its maximal
~o, KI and K2 are functors from the category of rings
spectrum is a, thel! Ko(Rt+ 1 =0. If this dimension is
mto the category of Abelian groups. Each of the func-
at most 1, then Ko(R) is isomorphic to the Picard
tors K 0 and K I can be characterized as a functor from
group Pic(K).
finitely-generated projective modules to Abelian groups
For arithmetical rings there are finiteness theorems
that satisfies certain properties and is universal with
for the functors K;(R) and G;(R). In fact, if A is the
respect to these properties. Such a 'universal' character-
ring of integers or the ring of polynomials over a finite
ization makes it possible to define analogues of the
field, and R is an R -order and at the same time an R-
functors Ko and KI on 'sufficiently good' categories. In
lattice in a semi-simple finite-dimensional algebra over
particular, for the category of Noetherian R-modules
the field of fractions of a ring A, then the groups K;(R)
functors G;(R) quite close to K;(R) can be defined.
and G;(R) are finitely generated (i =0, 1).
Examples oj the groups K;(R). Let R be a skew-field
The development of algebraic K-theory was stimu-
and let R* be its multiplicative group. Ko(R)=Z is
lated by studies carried out on the problem of
then the group of integers, KI(R)=R* /[R*,R*]; and
congruence subgroups: Do all subgroups of finite index
K 2 (Z) is the cyclic group of order two. If R is a finite
in an arithmetical group contain some congruence sub-
field, then K2(R)=0.
group? This question is closely connected with the
An important result in algebraic K-theory is the
problem of computing the group K I (R, J) for ideals J
exact Mayer- Vietoris sequence for a Cartesian square.
in R.
The diagram below represents a Cartesian square of
. ~f the results concerning the stable structure of pro-
ring homomorphisms in which JI is an epimorphism;
JectIve modules one can mention the following
h,
R ~ R2
theorem: If R is a commutative Noetherian ring whose
maximal spectrum has dimension d, and A is a
hd th,
R ~R'
module-finite R-algebra, then any finitely-generated
I /,
projective A-module P such that
then there is an exact sequence Pm ~ A~1+1 ffiQ
K1(R) ~ K 1(R 1)ffiK 1(R 2 ) ~ K1(R') ~
for all maximal ideals m of R is isomorphic to A EEl N
~ Ko(R) ~ K o(R 1)ffiK o(R 2 ) ~ Ko(R'). (here M m is the localization of the module M at m).

102
ALGEBRAIC LOGARITHMIC SINGULAR POINT

Another important theorem on the structure of projec- geometry', Russian Math. Surveys 24, no. 5 (1969), 1-89.
tive modules is the cancellation theorem: Let R, A and (Uspekhi Mat. Nauk 24, no. 5 (1969), 3-86)
[8] MiLNOR, J.: Introduction to algebraic K-theory, Princeton Univ.
the module P be as above. Let Q be a finitely- Press, 1971.
generated projective A-module, and let M and N be [9] BASS, H. (ED.): Algebraic K-theory (Battelle Inst. Conf), 1-3,
arbitrary A-modules. Then it follows from Springer, 1973.
QtBPfJJM-QfJJN that [10] ARTIN, E.: Geometric algebra, Interscience, 1957.
A. V. Mikhalev
PfBM ~ N.
A.I. Nemytov
The stable rank of a ring R is closely connected with Editorial comments. Algebraic K-theoretic ideas and
problems of the stable structure of projective modules. results have become most important in certain parts of func-
Thus, if R is a commutative ring of stable rank smaller tional analysis centering around C'-algebras (cf. C'-
than d, then algebra). Especially in the form of KK-theory (or Kasparov
K1(R)::: GL(d, R)/ E(d, R). K-theory). Cf. e.g. [A2].
In algebraic geometry there are important connections
In connection with the theory of induced representa-
with the Chow groups (cf. Chow ring).
tions of groups, the functors K; for group rings have
been studied. One of the results of these studies is that References
[A1] MAGURN, B. (ED.): Reviews in K-theory 1940-1984, Amer.
if G is a finite group of order n and C is the set of Math. Soc., 1985.
cyclic subgroups of G, then the index of the subgroup [A2] CuRTZ, J.: 'K-theory and C'-algebras', in A. 8ak (ed.): Alge-

:s Im(K;(Rc )--,>K;(RG
CEC
braic K-theory, number theory and analysis, Springer, 1984,
pp.55-79.

in K;(RG) is divisible by n if i =0, 1,2. AMS 1980 Subject Classification: 18F25


Regarding polynomial ring extensions it IS known
ALGEBRAIC LATIICE, compactly-generated lattice -
that if R is a regular ring, then
A lattice each element of which is the union (i.e. the
Ko(R[t)) ~ Ko(R[t, t -1)) ::: Ko(R), least upper bound) of some set of compact elements (d.
K 1(R[t)) ~ K1(R). Compact lattice element). A lattice is isomorphic to the
Moreover, the sequence lattice of all sub algebras of some universal algebra if
and only if it is both complete and algebraic. These
0----> K 1(R) ----> K 1(R[t))fBK 1(R[t- I ))---->
conditions are also necessary and sufficient for the lat-
----> KI(R[t, t-I)) --'> Ko(R) ----> 0 tice to be isomorphic to the congruence lattice of some
is exact for any ring R. universal algebra (the Gratzer-Schmidt theorem). In
One result in the computation of the functor K 2 (R) both cases it is assumed that the arity of the operations
is the theorem of Matsumoto: If R is a field, then K 2 (R) of the universal algebra is finite.
is given by the generators lea) (which are in one-to-one References
correspondence with all the non-zero elements a of R) [1] BIRKHOFF, G.: Lattice theory, Amer. Math. Soc., 1967.
[2] GRATZER, G.: Lattice theory, Birkhauser, 1978.
and the relations l(a)l(l-a)= 1 for a:;i=l.
In the nineteen-seventies there appeared numerous T.S. Fofanova
versions of the definitions of the functors K; for i ;;;;.2. Editorial comments. Algebraic lattices are an important
It has been shown [9] that these theories coincide and special case of continuous lattices, for which see [A1]. The
yield the classical functors Kn if n ~2. In several cases original reference to the Gratzer-Schmidt theorem is [A2].
effective methods of computation for higher K-groups References
[A1] GIERZ, G., HOFMANN, K.H., KEIMEL, K., LAWSON, J.D.,
were found. The development of unitary K-theory ([9],
MISLOVE, M.V. and SCOrf, D.S.: A compendium ot continu-
Vol. 3), which studies analogous problems for modules ous lattices, Springer, 1981.
on which quadratic and bilinear forms are defined, also [A2] GRATZER, G. and ScHMIDT, E.T.: 'Characterizations of
began in that decade. congruence latlices of abstract algebras', Acta. Sci. Math.
(Szeged) 24 (1963), 34-59.
References
[I] ATIYAH, M.F.: K-theory: lectures, Benjamin, 1967. AMS 1980 Subject Classification: 06A23
[2] BASS, H.: Lectures on topics in algebraic K-theory, Tata Inst.
Fundam. Res., 1966. ALGEBRAIC LOGARITHMIC SINGULAR POINT -
[3] BASS, H.: Algebraic K-theory, Benjamin, 1968.
An isolated singular point z 0 of an analytic function
[4] SWAN, R.G.: Algebraic K-theory, Springer, 1968.
[5] SWAN, R.G. and EVANS, E.G.: K-theory offinite groups and f (z) such that in a neighbourhood of it the function
orders, Springer, 1970. f (z) may be represented as the sum of a finite number
[6] THOMAS, c.B. and Moss, R.M.F. (EDS.): Algebraic K-theory of terms of the form
and its geometric applications, Springer, 1969.
[7] MANIN, Yu.!.: 'Lectures on the K -functor in algebraic (z - zo)-S[ln(z - zo)t g(z),

103
ALGEBRAIC LOGARITHMIC SINGULAR POINT

where s is a complex number, k is a non-negative the algebraic integers form a ring; on the other hand,
integer, and g(z) is a regular analytic function at the the real algebraic integers form an everywhere-dense set
point z 0 with g(z oh6 0. in R, while the rational integers form a discrete set. It
References was shown in 1872 by G. Cantor that the set of all
[ll BIEBERBACH, L.: Analytische Fortsetzung, Springer, 1955, algebraic numbers is denumerable, and this implied the
Sect. 3. existence of transcendental numbers (cf. Transcendental
E.D. Solomentsev
number).
AMS 1980 Subject Classification: 30B40, 30BXX
A root of any (not necessarily irreducible) polyno-
mial with rational integer coefficients and leading coef-
ALGEBRAIC NUMBER - A complex (sometimes,
ficient one is an algebraic integer. Moreover, a root of a
real) number that is a root of a polynomial
polynomial with algebraic integer coefficients and with
f(x) = a.x' + ... +alx +ao (1) leading coefficient one is an algebraic integer. In partic-
with rational coefficients, not all of which are zero. If a ular the k-th degree root of an algebraic integer is an
is an algebraic number, then, among all polynomials algebraic integer. For any algebraic number a there
with rational coefficients and a as a root, there exists a exists a positive integer r such that ra is an algebraic
unique polynomial <p(x) of lowest degree' with leading integer (in analogy with rational numbers). The smal-
coefficient equal to one, which is therefore irreducible lest possible number r is the modulus of the leading
(d. Irreducible polynomial). It is called the irreducible, coefficient of the irreducible primitive polynomial with
or minimal, polynomial of the algebraic number a. The rational integer coefficients that has a as a root. All
degree n of the minimal polynomial <p(x) is also called conjugates of an algebraic integer are also algebraic
the degree of the algebraic number a. The existence of integers.
irreducible polynomials of any degree n implies the One says that an algebraic integer f3 is divisible by
existence of algebraic numbers of degree n. All rational an algebraic integer a, a*O, if there exists an algebraic
numbers, and only such numbers, are algebraic integer y such that f3 = ya. Many divisibility characteris-
numbers of the first degree. The number i is an alge- tics of rational integers also hold for algebraic integers.
braic number of the second degree, since it is a root of An algebraic integer t: is called an algebraic unit (or
the polynomial x 2 + 1, while 21/ n, where n is any posi- unit for short) if it is a divisor of 1, that is, if 1 / t: is an
tive integer, is an algebraic number of degree n, being a algebraic integer. A unit is a divisor of any algebraic
root of the irreducible polynomial xn - 2. integer. The inverse of a unit is a unit; the numbers
The roots aI, ... ,an of the irreducible polynomial conjugate to a unit are units; all divisors of a unit are
are called the conjugates of a, and are also algebraic units; a product of a finite number of units is a unit.
numbers of degree n. All numbers conjugate with a are An algebraic integer is a unit if and only if the product
distinct. Apart from its degree, another important of all its conjugates is -+-1. The k-th roots of unity are
characteristic of an algebraic number is its height, units, each one having modulus 1. There exists an
which is the analogue of the denominator of a rational infinite set of other units which do not have modulus 1.
fraction. The height of an algebraic number a is the For instance, the numbers 2 - v'3 and 2 + v'3 are
greatest absolute value of the coefficients of the irredu- units, being the roots of the polynomial x 2 -4x + 1.
cible and primitive polynomial with integral rational Moreover, their powers constitute units which may be
coefficients that has a as a root (d. Primitive polyno- arbitrary large or small. The field of rational numbers
mial). The sum, the difference, the product and the contains only two units - + 1 and - 1.
quotient of two algebraic numbers (except for division Two algebraic integers are called associated if they
by zero) are algebraic numbers; this means that the set differ by a factor which is a unit. There is another
of all algebraic numbers is a field. A root of a polyno- important difference between the ring of algebraic
mial with algebraic coefficients is an algebraic number. integers and the ring of rational integers. The concept
An algebraic number is called an algebraic integer if of an irreducible integer (in analogy to a prime
all the coefficients of its minimal polynomial are number) cannot be introduced into the former. This
rational integers. For instance, i and 1+ V2 are alge- may be seen from the fact that a root of an algebraic
braic integers, being roots of the polynomials x 2 + 1 integer is an algebraic integer. The concept of an
and x 2 -2x -1 . irreducible number (apart from the class of associated
The concept of an algebraic integer is a generaliza- numbers) can be introduced in certain subfields of the
tion of the concept of a rational integer (a rational field of all algebraic numbers, the so-called algebraic
integer m is an algebraic integer, being the root of the number fields. It turns out, however, that the decompo-
polynomial x - m). Many properties of rational sition of an algebraic integer into irreducible factors is
integers are also displayed by algebraic integers. Thus, not always unique.

104
ALGEBRAIC NUMBER THEORY

Algebraic numbers cannot be very closely approxi- Sokhotskii (theory of ideals) and others - also made
mated by rational and algebraic numbers (Liouville's significant contributions.
theorem). It is this fact which led in 1844 to a proof of The concept of an algebraic number and the related
the existence of transcendental numbers. The problem concept of an algebraic number field are very impor-
of approximation of algebraic numbers by rational tant ideas in number theory and algebra. Algebraic
numbers is one of the more difficult problems in numbers, which are a generalization of rational
number theory; attempts to solve it yielded very impor- numbers, form subfields of algebraic numbers in the
tant results, including the Thue, Thue - Siegel and fields of real and complex numbers with special alge-
Thue - Siegel- Roth theorems, but its ultimate solution braic properties. The development of the theory of
is still nowhere in sight. Another very difficult problem algebraic numbers greatly influenced the creation and
is the expansion of algebraic numbers into continued development of the general theory of rings and fields.
fractions. Real algebraic numbers of the second degree Algebraic numbers have found numerous applica-
(quadratic irrationalities) can be represented as infinite tions in various branches of number theory, algebra
periodic continued fractions. Nothing is known so far and other branches of mathematics: the theory of
(the nineteen-seventies) about the expansion of real forms, Diophantine equations, Diophantine approxima-
algebraic numbers of degree at least three into ordinary tions, transcendental numbers, geometry of numbers,
continued fractions. algebraic geometry, Galois theory, etc.
A complex number is called an algebraic number over References
a field P C C if it is a root of a polynomial (1) with [1] TSCHEBOTAROW, N.G. [N.G. CHEBOTAREV): Grundzuge der
coefficients from P. The minimal polynomial, the Galois'schen theorie, Noordhoff, 1950 (translated from the Rus-
sian).
degree over P and the conjugate numbers over P for [2) HECKE, E.: Lectures on the theory of algebraic numbers,
algebraic numbers over P are defined in a similar Springer, 1981 (translated from the German).
manner. A root of a polynomial with coefficients that [3) LANDAU, E.: Einflihrung in die elementare und analytische
Theorie der algebraischen Zahlen und der Ideale, Chelsea,
are algebraic numbers over P is an algebraic number
reprint, 1949.
over P. [4) LANDAU, E.: Vorlesungen iiber Zahlentheorie, 3, Hirzel, 1927.
An algebraic number of arbitrary degree n does not [5) LANG, S.: AlgebraiC number theory, Addison-Wesley, 1970.
necessarily exist over any field P. For instance, only A.B. Shidlovski{
algebraic numbers of the first degree exist over the field Editorial comments.
of complex numbers: the numbers of the field them-
References
selves. A given algebraic number can be of different [A1] WEISS, E.: Algebraic number theory, McGraw-Hili, 1963.
degrees with respect to different fields; thus, the [A2] WElL, A.: Basic number theory, Springer, 1967.
number i is an algebraic number of the second degree, AMS 1980 Subject Classification: 12AXX
but it is of the first degree with respixt to the field of
complex numbers. The set of all algebraic numbers ALGEBRAIC NUMBER TIlEORY - The branch of
over a field P forms a number field. number theory with the basic aim of studying proper-
Algebraic numbers, and algebraic number fields, ties of algebraic integers in algebraic number fields K
were first systematically studied by e.F. Gauss (Gauss- of finite degree over the field Q of rational numbers
ian numbers of the form a + bi, where a and bare (cf. Algebraic number).
rational numbers). Gauss developed the arithmetic of The set of algebraic integers OK of a field K / Q -
Gaussian integers as a base for the theory of biqua- an extension K of Q of degree n (cf. Extension of a
dratic residues. In their study of the theory of cubic field) - can be obtained from an integral basis
residues, e.GJ. Jacobi and F.G. Eisenstein created the (WI, ... ,wn ); this means that each algebraic integer
arithmetic of numbers of the form a +bp, where (i.e. element of OK) can be written in the form
p=( -1 + v=3) / 2 is a cubic root of unity, while a XIWI + ... +xnwn where all the Xi run through the
and b are rational numbers. His attempts to produce a rational integers (i.e. Z). Moreover, such a representa-
proof of the Fennat theorem led E. Kummer to con- tion is unique for each algebraic integer in K.
duct a deep study of cyclotomic fields (cf. Cyclotomic However, properties o{rational integers often do not
field), to introduce the concept of an ideal, and to have obvious analogues for algebraic integers. The first
create the elements of the theory of algebraic numbers. such property is related to units, the invertible elements
The theory of algebraic numbers was further developed of OK (cf. Unit). The field of rational numbers has only
by P. Dirichlet, L. Kronecker, D. Hilbert and others. + 1 and - 1 as units, but a general algebraic number
Russian mathematicians - E.!. Zolotarev (theory of field may contain an infinite number of units. E.g. con-
ideals), G.F. Vorono! (cubic irrationalities, units of sider the real quadratic field Q( Vii), where D > 1 is a
cubic fields), A.A. Markov (cubic fields), Yu.V. rational integer not equal to a square. If, moreover,

105
ALGEBRAIC NUMBER THEORY

D;;E 1 (mod 4), then (1, YD) is an integral basis for it. fact that the number of subfields of an extension K / Q
=
The PeU equation x 2 - Dy2 I has an infinite number of degree n over Q is finite follows from the existence
of solutions2,Y)EZXZ. Any of them gives rise to a of a one-to-one correspondence (the fundamental Galois
unit x +Y YD of Q( YD). In fact, correspondence) between all subfields of the normal clo-
(x+yVDXx-yVD) = I, sure of K (cf. Nonnal extension) and the subgroups of
its Galois group, a finite group of finite order (number
and
IVD = x-yVD of elements) at most n!.
x+y D The structure of the group of units of a field was elu-
is also an algebraic integer in Q(YD). The units of cidated by P. Dirichlet. His basic idea can be given
Q(YD) form an infinite multiplicative group (the group by taking the group of Pell units (cf. above) as exam-
of Pell units). The question arises: What is the structure ple. Any power of such a unit (both positive or nega-
of this group? tive) is a unit. There is a fundamental unit
A second property of rational numbers without an 11=Xo +yo YD, and all other units are plus or minus
obvious analogue for algebraic numbers is related to integral powers of it; hence, the Pell units form the
the theorem on unique factorization of rational integers product of < - 1> with an infinite cyclic group. This
n in prime factors: fact is a special case of the general Dirichlet unit
theorem for algebraic number fields: If the degree of a
n = p~1 ... p%'.
field K / Q is n =r1 +2r2, where r1 is the number of
For algebraic numbers such a unique factorization need real imbeddings K~R and r2 the number of complex-
not hold. E.g. consider the field Q(v=5). In it, the conjugate pairs of complex irnbeddings K ~C, then the
number 6 has two essenti, different factorizations: group of units U of K is the product of a finite cyclic
6=23, 6=(l+v=5)(1- -5). For extensions of group C and r=r1 +r2 -I infinite cyclic groups:
higher degree the situation becomes still more compli-
U = ex <'II!> X ... X <'IIr>.
cated. The question arises: What becomes of the
unique factorization theorem, does it have a meaning at Here 111, ... ,11, are independent fundamental units
all in algebraic number fields? and C is the group of roots of unity contained in K.
A third property without an obvious analogue is The norm of any unit in a field, i.e. the product of the
related to prime numbers. An ordinary prime number unit by its conjugates, is equal to + I.
need not remain a prime number in an algebraic The problem of the non-unique factorization of alge-
number field. E.g. in the field Q(-v=--l) of Gaussian braic integers in algebraic number fields was solved by
numbers the prime number 5 has a factorization: E. Kummer, who started from a special case; he tried
5=(2+ -v=--l)(2- -v=--l). However, the prime to solve Fermat's last theorem on the impossibility of
number 7 remains prime in this field. The question solving the equation x P+yP =zP in non-zero integers
arises: Do there exist general laws governing the for any prime number p > 2. Kummer expanded the
behaviour of prime numbers in algebraic number fields left-hand side using p-th roots of unity, and hence the
of higher degree? In other words, is it possible to find problem was reduced to one about the algebraic
rules that would give a definite answer to the following integers of Q(t), f a primitive p-th root of unity. If
question: Does a given prime number remain prime in there would be unique factorization into prime factors
a field K / Q or does it split in it, and if it splits, in in Q(f). then it would have been sufficient to prove
how many factors? that not all prime factors at the left-hand side occur
Finally, a last (fourth) problem concerns the general with an exponent that is a multiple of p. This was
structure of algebraic number fields. The field Q is the Kummer's first point of view, but Dirichlet pointed out
minimal field of characteristic zero without proper sub- to him that unique factorization need not hold. In
fields. Any other algebraic number field does have sub- order to overcome this difficulty Kummer introduced
fields. E.g., Q is a subfield of any algebraic number ideal numbers, thus altering the entire structure of alge-
field. The question arises: How many subfields are con- braic number theory for the future. The concept of an
tained in a given extension K / Q, finitely or infinitely ideal number arises from the fact that if a field k does
many, and what is their structure? not contain prime numbers into which any algebraic
These are four main problems in algebraic number integer in k can be split, then there is a field K / k of
theory, and answering them constitutes the content of finite degree over k in which there does exist a collec-
algebraic number theory. It is quite natural to start tion of numbers that play the role of primes for k.
with the fourth problem, since its answer will shed light These numbers were called ideal by Kummer (since
on the other three. The problem was solved by E. they do not lie in the original field k). By introducing
Galois in the eighteen-twenties (d. Galois theory). The ideal numbers the theorem on unique factorization in k

106
ALGEBRAIC NUMBER THEORY

holds. Two numbers in a field that differ by a unit I(x) =Jil (x) .. .t:.~(x) (modp).
(so-called associated numbers) have one and the same The corresponding factorization is called Kummer's for-
ideal factors. (Note that ideal numbers are defined rela- mula (or Kummer's decomposition)
tive to the original field k; for another field k' one
(1)
must construct an extension K' / k' (of possibly dif-
ferent degree over k') in which all ideal numbers of k' where ~j are prime ideals in K=k(O).
are contained.) In principle, this equation solves the third problem in
Kummer also introduced the concept of the ideal algebraic number theory, but it is a local equation in
class number of a field k. Two ideal numbers are said to the sense that it is necessary to check it for each prime
belong to the same class if their quotient belongs to k. ideal separately. The problem of partitioning the set of
The number of classes thus obtained is called the ideal all prime ideals in classes such that the factorization
class number of k. He obtained the following important law is the same in any given class and such that, more-
result: The class number h of k is finite, and the classes over, it is possible to give a simple description of these
form an Abelian group under multiplication. Thus, any classes, is solved by class field theory for extensions
ideal number can be regarded as the h-th root of some K / k with Abelian Galois group Gal(K / k).
element of the original field k. The class number can be Equation (1) yields a preliminary concept of a class.
explicitly described in terms of other field constants Let n be the degree of the extension K / k and let f; be
(the regulator, the discriminant and the degree of the the relative degree of the ideal ~j. Computing the rela-
field). tive norms N K / k of both parts of (1) leads to

Subsequently, the concept of an ideal number was (2)


replaced by that of an ideal, equivalent to it, which can
in which aj and f; are natural numbers. For n fixed
be described in terms of the field k itself. Already in equation (2) has a finite number of solutions, so that
the nineteen-fifties the concept of an ideal was general-
the set of all prime ideals of k can be partitioned into a
ized to the more comprehensive concept of a divisor. finite number of classes and one can collect the prime
Therefore, the modern theory of Kummer may be ideals whose Kummer decomposition corresponds to
stated in terms of divisors. For algebraic number one solution (aj,f;) of (2) into one class. The number of
fields, however, the concepts of a divisor and an ideal prime ideals with the property that some aj > 2, is fin-
coincide, and such fields only will be considered in the ite, and they all are divisors of the discriminant ~ of
sequel. The concept of an ideal is related to that of K / k. Only infinite classes are of interest, so that
non-associated numbers, which helps in understanding classes for which aj;;;;.2 can be ignored.
the deep connections between Kummer's theory and In order to simplify the exposition, the field K / k is
Dirichlet's theory of units. Although Kummer did not considered to be normal from now on. In such fields
succeed in solving Fermat's problem, his ideas extended the condition
far beyond this problem and the concept of an ideal 11= ... = 1m
has now become fundamental in many branches of holds. Therefore, the set of all tJ not dividing ~ is parti-
mathematics. tioned into d(n) classes, where d(n) denotes the number
Since prime ideal numbers are defined relative to a of divisors. The class with m = n is of special interest, in
field or, in modern terminology, since they are prime it
ideals in a field, the third problem on the factorization 11==lm=l.
of ordinary prime numbers in an algebraic number A prime ideal tJ in it has maximal number of prime
field can be stated in the following general form. Sup- divisors in K / k:
pose one is given a field k and a prime ideal tJ of its P = PI ... Pn (3)
ring of algebraic integers. The question is: Does tJ
remain prime in an extension K=k(O) or does it split Such tJ are said to split completely (or to be totally split),
in a product of prime ideals of the ring of algebraic and their class is called the principal class of k relative
integers of K'? In the latter case: By which law does it to K / k. It is a principal object of study in class field
split? This question led to class field theory, a central theory. In order to be able to define the principal class
part of modern algebraic number theory. The first solu- via (3) it is necessary to give a proof of the facts that
tion of this problem was given by Kummer, who prime ideals satisfying (1) do exist in k and that there
proved that if 0 is a root of an irreducible polynomial are infinitely many such ideals. Therefore a basic prob-
f(x) and 1,0, ... ,fI' -) is an integral basis for OK over lem in class field theory is to define the principal class
Ob then tJ splits in k(O) 'by the same law as f(x), does in terms of the field k itself in such a way that its infin-
in the residue field (modtJ). In other words, the factori- ite nature would follow. This problem has been com-
zation of tJ in k(O) is determined by the congruence pletely solved for Abelian extensions K / k.

107
ALGEBRAIC NUMBER THEORY

In order to be able to understand the ideas of class morphic to Gal(K / k), only the prime ideals of Hf
field theory better, it is necessary to introduce the gen- split completely in K, and such that the conductor f
eral concept of an ideal class group. Kummer's defini- has the same prime divisors as the discriminant D of
tion given above then corresponds to the modem con- K / k. The 'duality' was stated by Hilbert in 1900 as a
cept of an absolute ideal class group. The general hypothesis (he proved it only in special cases). In its
concept of an ideal class group which is used nowadays general form it was proved by Takagi.
is due to H. Weber and T. Takagi, [5]. The next important stage of development of class
Weber introduced the concept of the conductor of a group theory is related to the name of E. Artin, who
class group. Let m be an integral ideal of k, let Lm be revealed the special role of the canonical isomorphism
the subgroup of principal ideals (a) of k given by a=l between the Galois group and the ideal class group. He
(mod m) and let A m be the subgroup of all ideals of k proved that this isomorphism is given by the Frobenius
that are relatively prime with m. A subgroup H m satis- automorphism op of an Abelian extension K / k,
fying Lm ~Hm ~Am can be regarded as a group of defined as
principal ideals, and a (generalized) class group
Am / H m is constructed in this way. For m = 1 and Here N p is the absolute norm of the ideal p, a runs
HI = L lone obtains Kummer's definition. In the gen- through all numbers of K and p is a prime ideal of k.
eral case H m consists of progressions (mod m) whose The automorphism op (nowadays denoted by

[-T]]
residue classes form a subgroup of the whole multipli-
cative group (mod m). The order h m of this generalized
class group satisfies the estimates 1:S;;;;h m :S;;;;h'cp(m),
where h is the order of the absolute class group and <I> depends (in an appropriate group) only on the ideal
is Euler's phi-function. For different conductors ml and class to which p belongs. It is multiplicative:
m2 the class groups may be equivalent if H m and H m,
l

consists in fact of the same progressions (mod/), where [ \: k] [\: k] = [~l ~: ],


f=l.c.m.(ml, m2)' If one agrees not to distinguish where the symbol at the right-hand side is understood
between equivalent class groups, then one obtains the as an automorphism of the class to which lJIlJ2 belongs.
concept of a class group according to Weber with a con- With this in mind, Artin introduced the symbol
ductor f that is the greatest common divisor of all
equivalent conductors. A class field according to Weber
is a field K / k in which only the prime ideals in its
[~]
principal class Hf , and only these, split completely. on the entire group Af of ideals 0 of k that are rela-
Dirichlet's theorem on prime ideals in progressions, tively prime with the conductor f It is called the Artin
which is valid in every field k, implies that there are symbol and realizes an isomorphism between the class
infinitely many prime ideals. Weber has shown that in group Af / Hf and the Galois group Gal(K / k), whose
certain particular cases the Galois group Gal(K / k) of explicit from is expressed in Artin's reciprocity law:
a class field and the class group Af / Hf of k are iso-
morphic. [~] = 1
A new point of view on class field theory, which is
still valid, originated with D. Hilbert. He understood if and only if oEHf (reciprocity as a correspondence
that between all relatively Abelian extensions of a field between the groups Gal(K / k) and Af / Hf ). From
k and all class fields for this field there must exist a this one can obtain the classical form of the reciprocity
one-to-one correspondence. This correspondence can be law in the language of Kummer's power reciprocity
stated as follows. If for some conductor f one con- symbol (one has to consider the field K=k(a 1 / n ), d.
structs the Weber class group, then there is only one Reciprocity laws). This form, in tum, implies the
normal extension K / k in which the prime ideals of reciprocity law for Hilbert's symbol. In all three forms
Weber's principal class, and only these, will split com- the reciprocity law is regarded as an isomorphism of
pletely. Moreover, the Galois group of K / k is iso- groups, and the symbols of Artin, Kummer and Hilbert
morphic to the Weber class group and the discriminant are regarded as group elements realizing this isomor-
D of K / k consists of the same prime ideals as the con- phism. However, each of them also has a numerical
ductor f The converse is also true: If an Abelian exten- value, which is equal to some n-th root of unity. There-
sion with Galois group Gal(K / k) is give.n, then there fore one can formulate the reciprocity law as follows.
exists a method (subsequently explicity formulated by Given the value of (a / {3)n, it is required to find the
Takagi) by which one can uniquely construct a princi- value of the symbol (a / f3)n reciprocal to it, i.e. to exhi-
pal class Hf such that the class group Af / Hf is iso- bit the explicit form of the function of (a, {3) defined by

108
ALGEBRAIC NUMBER THEORY

[5] CASSELS, J.W.S. and FROHLICH, A. (EDs.): Algebraic number


theory, Acad. Press, 1967.
[6] SHAFAREVICH, I.R: 'General reciprocity laws', Mat. Sb. 26
In this form the law first appeared in the work of CF. (68), no. 1(1950),113-146 (in Russian).
Gauss (cf. Gauss reciprocity law) for quadratic fields [7] 'Hilbert's problems', Bull. Amer. Math. Soc. 8 (1902), 437-479
(translated from the German).
and in the work of Kummer for cyclotomic fields of [8] ARTIN, E. and TATE, J.: Class field theory, Benjamin, 1967.
prime degree. The study of the reciprocity law in this [9] WElL, A.: Basic number theory, Springer, 1974.
form was subsequently continued by CG.J. Jacobi, [10] LANG, S.: AlgebraiC numbers, Addison-Wesley, 1964.
[II] BoURBAKI, N.: Commutative algebra, Addison- Wesley, 1966
F.G.M. Eisenstein, Hilbert, H. Hasse, and others, but (translated from the French). A.!, Vinogradov
they only obtained partial results. I.R. Shafarevich [6]
solved this problem in its general form in 1948, basing Editorial comments. An algebraic number field of
himself on the idea of establishing a connection degree n is a degree n extension of the field a of rational
numbers. An element aEK is an algebraiC integer if it is
between the arithmetic definition of the symbol
integral over Z, i.e. if it is a root of a monic polynomial in

[] Z[XJ . These form a free Abelian subgroup of rank n, called


the ring of integers OK of K and also the principal order of
and the analytic definition of an Abelian differential K; a basis of OK is an integral basis, also called a minimal
(ld/1 on a Riemann surface. He gave a construction of basis.

[]
For Kummer's approach to the Fermat problem cf. [A3].
The link (equivalence) between Kummer's ideal numbers
and ideals comes from the fact that for the prime ideals in
which precisely corresponds to the determination of the OK there is a field extension L / K in which they become
residue of (ld/1 in the .):J-adic field. For this purpose he principal ideals.
introduced E-functions, also called Shafarevich func- The relative degree 1; of an ideal \l3; is the degree of the
tions, in terms of which he also obtained an explicit residue field extension OL / \l3; over OK / p where
form of the reciprocity law in the general case. n
P=\l3; OK'
In the late nineteen-twenties Hasse introduced the For more details, definitions, newer results, and more pre-
idea of doing class field theory for one prime ideal of k cise statements of much of the above cf. various more
at a time, and reformulated and proved many theorems specified articles in this Encyclopaedia. In particular the arti-
cles Algebraic number; Unit; Extension of a field; Regula-
for an Abelian extension Kp of a local field kp- At first
tor of an algebraic number field; Discriminant; Frobenius
the idea was of secondary importance only (the results
automorphism (also for Artin symbo~, Kummer theorem
were obtained as a constX!uence of the local theory), on decomposition of ideals, and especially class field
but at the end of the nineteen-thirties C Chevalley theory.
proved the extraordinary importance of the local point The ideal class groups Am / Hm are also often known as
of view in the creation of class field theory. He intro- ray class groups mod m, and the corresponding fields as ray
duced the concept of an idele group and formulated class fields. The class field corresponding to the ray class
general class field theory from the local point of view. group A1 / L1 is the Hilbert class field. The definitions
Following this the local-global principle became esta- above ignore some problems with the Archimedean primes;
blished in class field theory. Subsequently it was in particular the prinCipal ideals (a) making up Lm must
extended and refined (cf. [5]), and as a result Abelian have a totally positive, i.e. such that all real conjugates of a
class field theory took a structured and completed are >0.
The discussion of class field theory above (up to the men-
form. The problem of creating non-Abelian class field
tion of ideles) is in terms of the older ideal-theoretic formula-
theory for normal extensions with non-Abelian Galois
tion as in e.g. Hasse's Zahlbericht [A1], which in fact can
group remains. be advantageously consulted for more details and precise
The exposition above relates mainly to the qualitative statements. For a discussion of the relations between the
aspects of algebraic number theory. In questions of ideal-theoretic formulations and the modern idele based
quantitative estimation and methods algebraic number class field theory [A2] is recommended.
theory is intimately connected with analytic number Meferences
theory. It is also based, to a large extent, on properties [A 1] HASSE, H.: Bericht Dber neuere Untersuchungen und Prob-
of r-functions and L-functions of algebraic fields. leme aus der Theorie der algebrai'sche Zahlkorper, 1.11, Phy-
sika Verlag, 1965.
References [A2] NEUKIRCH, J.: Class field theory, Springer, 1986, Chapt. 4,
[I] BOREVICH, Z.I. and SHAFAREVICH, I.R.: Number theory, Acad. Sect. 8.
Press, 1966 (translated from the Russian). [A3] POLLARD, H.: The theory of algebraic numbers, Math. Assoc.
[2] POSTNIKOV, M.M.: Galois theory, Moscow, 1962 (in Russian). Amer., 1950.
[3] WEYL, H.: AlgebraiC theory of numbers, Princeton Univ. Press, [A4) WEISS, E.: Algebraic number theory, McGraw-Hili, 1963.
1959.
[4] LANG, S.: Algebraic number theory, Addison-Wesley, 1970. AMS 1980 Subject Classificatiol"l: 12-XX

109
ALGEBRAIC OPERATION

ALGEBRAIC OPERATION, n-ary operation, on a set References


A - A mapping [1] CHEBYSHEV, P.L.: Complete collected works, Vol. 2, Moscow,
1947, pp. 478; 152-236 (in Russian).
of the n-th Cartesian power of the set A into the set A [2] BOREL, E.: Lefons SW' les fonctions de variables reelles et les
developpements en series de polynomes, Gauthier-Villars, 1905.
itself. The number n is known as the arity of the alge- [3] JACKSON, D.: 'A general class of problems in approximation',
braic operation. Historically, the concepts of binary Amer. J. Math. 46 (1924),215-234.
(n =2) and unary (n = 1) operations were the first to be [4] GARKAVI, A.L.: 'The theory of approximation in norrned linear
spaces', ftogi Nauk. Mat. Anal. 1967 (1969),75-132 (in Rus-
considered. Nullary (n =0) operations are fixed ele-
ments of the set A; they are also known as distinguished sian). Yu.N. Subbotin
elements or constants. In the twentieth century the con- Editorial comments. Instead of the long phrase 'alge-
cept of an infinitary operation appeared, i.e. a mapping braic polynomial of best approximation' one also uses the
w: A a ~A, where a is an arbitrary cardinal number. A shorter phrase 'best algebraic approximation', which is not
set with a system of algebraic operations defined on it to be confused with the phrase 'best approximation' for the
is called a universal algebra. least error En{ f)p.
T.M. Baranovich
References
Editorial comments. The study of infinitary operations [A 1] ACHIEZER, N.I. [N.I. AKmEZER]: Theory of approximation,
actually started in the late nineteen-fifties [A 1]. A nullary Ungar, 1956 (translated from the Russian).
operation is also called a noughtary operation [A2]. [A2] CHENEY, E.W.: Introduction to approximation theory, Chel-
sea, reprint, 1982.
References [A3] MEINARDus, G.: Approximation von Funktionen und ihre
[A1] STOMINSKl, J.: 'The theory of abstract algebras with infinitary numerische Behandlung, Springer, 1964.
operations', Rozprawy Mat. 18 (1959).
[A2] COHN, P.M.: Universal algebra, Reidel, 1981, 13-14. AMS 1980 Subject Classification: 41 A 10, 41 A50
AMS 1980 Subject Classification: 08A40
ALGEBRAIC SPACE - A generalization of the con-
ALGEBRAIC POLYNOMIAL OF BEST APPROXI- cepts of a scheme and an algebraic variety. This gen-
MATION - A polynomial deviating least from a given eralization is the result of certain constructions in alge-
function. More precisely, let a measurable function braic geometry: Hilbert schemes, Picard schemes,
f(x) be in lp[a, b] (p;;;:'l) and let Hn be the set of alge- moduli varieties, contractions, which are often not real-
braic polynomials of degree not exceeding n. The quan- izable in the category of schemes and require its exten-
tity sions. However, the category of algebraic spaces is
(*) closed under these constructions, which renders the
is called the best approximation, while a polynomial for algebraic space a natural object of algebr~c geometry.
which the infimum is attained is known as an algebraic Any scheme S defines some sheaf S in the etale
polynomial of best approximation in lp[a, b]. Polynomi- topology of the category of schemes, which in tum
als which deviate least from a given continuous func- uniquely defines the scheme S. An algebraic space is a
tion in the uniform metric (p = 00) were first encoun- sheaf of sets F in the etale topology of schemes satisfy-
tered in the studies of P.L. Chebyshev (1852), who con- ing the condition of local representability (in the etale
tinued to study them in 1856 [1]. The existence of alge- topology): There exists a scheme U and a sheaf mor-
braic polynomials of best approximation was esta- e.hism U~F such that for a~y sc~eme V and morphism
blished by E. Borel [2]. Chebyshev proved that P~(x) is V~F the fibred product uXFV is represented by a
an algebraic polynomial of best approximation in the scheme Z, and the induced morphism of schemes Z ~ V
uniform metric if and only if Chebyshev alternation is a surjective etale morphism. The scheme U is known
occurs in the difference f (x) - P~(x); in this case such as the etale covering of the sheaf F, which is a quotient
a polynomial is unique. If p> 1, the algebraic polyno- sheaf of t~e sh~af U according to the etale equivalence
mial of best approximation is unique due to the strict relation UXFU. This last proposition reveals the
convexity of the space Lp- If P = 1, it is not unique, but geometric sense of an algebraic space as a quotient
it has been shown by D. Jackson [3] to be unique for scheme according to the etale equivalence relation.
continuous functions. The rate of convergence of Morphisms of an algebraic space are defined as mor-
En{j)p to zero is given by Jackson's theorems (cf. phisms of sheaves; the category of schemes becomes
Jackson theorem). identical with a complete subcategory of the category
In a manner similar to (*) an algebraic polynomial of of algebraic spaces.
best approximation is defined for functions in a large Many concepts in the theory of schemes are applica-
number of unknowns, say m. If the number of variables ble to algebraic spaces: point, local ring, etale topology,
m;;;:' 2, an algebraic polynomial of best approximation Zariski topology, field of functions, structure sheaf and
in the uniform metric is, in general, not unique. coherent sheaves. Many results in the theory of schemes

110
ALGEBRAIC SURFACE

such as Serre's affinity criterion (cf. Affme scheme), and the theory of algebraic surfaces were laid by A. Clebsch
the theorem of finiteness and existence of a proper and M. Noether (ca. 1870), who defined the first
morphism, can be applied to algebraic spaces. important invariants of algebraic surfaces: the
Finer results include the representability of Picard geometric genus and the canonical class. The subse-
and Hilbert functors in the category of algebraic quent development of the theory of algebraic surfaces is
spaces. If a flat equivalence relation is given on an alge- due to the Italian school of algebraic geometers: C.
braic space, then the factorization by this relation Segre, L. Cremona, E. Bertini, G. Castelnuovo, F.
yields an algebraic space (such a situation occurs, for Enriques, F. Severi, and others. Topological and tran-
example, when there is a free action of a finite group scendental methods were introduced into the theory of
on the space). Finally, an algebraic space permits a algebraic surfaces by E. Picard, H. Poincare and S.
contraction of a subspace with an ample conormal Lefschetz. Many general facts arrived at in these studies
sheaf. proved to be valid for algebraic varieties and schemes
All algebraic spaces contain a subspace that is open of arbitrary dimension as well, while other results up
and dense in the Zariski topology and that is a scheme. till now have not been generalized to higher-
One-dimensional and non-singular two-dimensional dimensional cases. The latter include the classification
algebraic spaces are schemes, but this is not true of of algebraic surfaces by numerical invariants, criteria of
three-dimensional or singular two-dimensional algebraic rationality and ruledness of algebraic surfaces, and the
spaces; a group in the category of algebraic spaces over theory of minimal models. Most of these results were
a field is a scheme. Complete algebraic spaces of critically reviewed and demonstrated once again in the
dimension n over the field of complex numbers have nineteen-fifties and nineteen-sixties using modem coho-
the natural structure of a compact analytic space with n mological methods (1], [5], [8].
algebraically-independent meromorphic functions. General features of algebraic surfaces. Unless otherwise
References stated, an algebraic surface will be understood to mean
[1] ARTIN, M.: Algebraic spaces, Yale Univ., 1969. a projective algebraic surface over an algebraically
[2] KNurSON, D.: AlgebraiC spaces, Springer, 1971.
closed field k.
VI. Danilov a) Resolution of singularities. The first rigorous
Editorial comments. The notion of an algebraic space proofs of the existence of a non-singular model were
was introduced by M. Artin and, in a slightly different but given in the nineteen-thirties by R. Walker and O. Zar-
equivalent form, it was studied by B. Moishezon [A 1]. The iski. The previous geometrical proofs given by the
theorem cited at the end of the article (on subspaces) is Italian workers had contained lacunae [9]. The proof of
due to him. the existence of a non-singular model of an algebraic
References surface for the case of a ground field with positive
[A 1] MOISHEZON, B.: 'AlgebraiC varieties and compact complex
characteristic was given in 1956 (cf. Resolution of singu-
spaces', in Proc. internat. congress mathematicians Nice,
1970, Gauthiers-Villars, 1971, pp. 643-648. larities).
Any non-singular algebraic surface can be
AMS 1980 Subject Classification: 14JXX, 14A20 birationally projected in CP3 on a surface with ordi-
nary singularities - ordinary double curves with a fin-
ALGEBRAIC SURFACE - A two-dimensional alge- ite number of ordinary cusp points and triple points,
braic variety. Together with the class of algebraic curves which are triplanar singular points of the surface. Such
(cf. Algebraic curve), the class of algebraic surfaces is surfaces represented the principal object of studies dur-
the class of algebraic varieties which has been most ing the classical stage of development of the theory of
thoroughly studied. The richness of problems and the algebraic surfaces. Owing to the theorem on the resolu-
variety of ideas used to solve them makes the theory of tion of singularities of algebraic surfaces, global
algebraic surfaces one of the most interesting fields of methods of the theory of schemes may be applied to
algebraic geometry. the local study of singular points of algebraic surfaces.
Unlike algebraic curves, two birationally isomorphic The fundamental work on this subject was done by D.
algebraic surfaces need not be biregularly isomorphic. Mumford (1961), who deduced important invariants of
During the classical period of the development of alge- two-dimensional singularities and proved that the tOp<r
braic geometry (1868 - 1920) algebraic surfaces were logical space of a normal algebraic surface is a topolog-
understood to mean a class of birationally equivalent ical manifold if and only if the surface is non-singular.
surfaces, and its representations were considered to be b) Numerical invariants of algebraic surfaces. The
imbedded in complex three-dimensional projective concept of the geometric genus Pg(V) of an algebraic
space Cp3, in which they were defined by a single alge- surface is a generalization of the concept of the genus
braic equation f(xo, Xl> X2, X3)=0. The foundations of of an algebraic curve in the theory of algebraic sur-

111
ALGEBRAIC SURFACE

faces. Here, Pg( V) is the maximum number of linearly deg(c1 )+deg(c2)


independent regular two-dimensional differentials on V I +Pa(V) = 12 '
(cf. Geometric genus). The virtual arithmetic genus of
called Noether's formula. The number I=deg(c2)-4 is
the divisors in the canonical system I K y I of an alge-
known as the Zeuten - Segre invariant.
braic surface V is known as the linear genus and is
c) The Riemann - Roch theorem for algebraic surfaces.
denoted by p(l). For a non-singular algebraic surface
The generalization of the Riemann - Roch theorem for
the following equation is valid:
algebraic curves to algebraic surfaces is due to Castel-
p(1) = (~)+ I = deg(cD+ I, nuovo (1897). For a non-singular algebraic surface V
where CI = - Ky is the first Chern class of the tangent and a complete linear system I D I of divisors of vir-
bundle to the surface V. The multiple canonical sys- tual degree n =(D2) and virtual genus
tems I Ki I = I iKy I are called pluri-canonical systems '1T={(D 2 )+(DKy )} /2+1, the Riemann-Roch
of the algebraic surface V. The number dim I Ki I + 1 is theorem asserts the inequality
referred to as the i-th (pluri-) genus and is denoted by dim I D I ;;;;. n-?T+Pa(V)+I-i.
p.. it is identical with the dimension of the space of
" The number i = dim IKy - D I is known as the special-
everywhere-regular i-tuple two-dimensional differential
ity index of the divisor D. The cohomological proof of
forms on V.
this theorem (cf. Riemann- Roch theorem) permits the
According to the Cayley- Noether postulation for-
stipulation of a necessary and sufficient condition such
mula, the number N m of linearly independent forms of
that this inequality becomes an equality. This condition
a sufficiently high degree m passing through a space
is the vanishing of the cohomology group
curve of degree d, genus p and with t triple points and
HI(V, (!) y(D. It has been proved that, for any divisor
'T double points, is given by the formula
D and all sufficiently large numbers n, one has
N m = [mj3] -dm+2t+'T+p-1. Hi(V, (!) y(D +nH=O, where i>O and where H is a
hyperplane section of the surface V (J.P. Serre, 1955).
The concept of the arithmetic genus Pa(V), defined If the characteristic of the ground field is zero, then the
by the formula inequality becomes an equality for any ample divisor
Pa(V) = [nil] -d(n-I)+2t+'T+p-1 D. This result is a generalization of the classical
Picard - Severi theorem concerning the regularity of the
(where d is the degree, 'T is the number of double adjoint system [9].
points, p is the genus and t is the number of triple d) Systems of curves on an algebraic surface. The
points of the double curve V; for non-singular surfaces principal method used by Italian geometers was their
t =r =d=O and p = 1), was introduced in 1875 for alge- theory of linear and algebraic systems of curves on an
braic surfaces with ordinary singularities. Other algebraic surface. The concepts of algebraic and linear
equivalent definitions of pi V) were given subsequently equivalence of divisors are closely connected with this
(cf. Arithmetic genus). theory. These concepts only coincide for regular alge-
braic surfaces (Castelnuovo, 1896). Any sufficiently
The difference Pg(V)-Pa(V) is always non-negative; general curve on an algebraic surface V with irregular-
it is called the irregularity of the algebraic surface V and ity q is contained in a maximal algebraic family which
is denoted by q(V). An algebraic surface is called regu- fibrates into linear systems of equal dimension, the
lar if q(V)=O and irregular if q(VO. The first exam- base of the fibration being a variety of dimension q
ples of irregular surfaces date back to the end of the (Enriques, 1904). This variety is an Abelian variety, and
nineteenth century (A. Cayley, Castelnuovo). Irregular- is known as the Picard variety of the algebraic surface
ity can be described as the maximal insufficiency of the V (cf. Picard scheme). The proof of Enriques' result
linear system which is cut out by the system I C + K v I (known as the 'fundamental theorem in the theory of
on the curve C (Enriques, 1896, see [9]). A irregular surfaces', or the 'theorem of completeness of
cohomological definition of pa and q for non-singular characteristic series'), as well as subsequent prcofs by
algebraic surfaces is given by the formulas: Severi, contain lacunae. The first rigorous transcenden-
Pa(V) = -dimk H1(V, ~ v)+di..m;.; H2(V, ~ v) = tal proof of this theorem was given by Poincare in
= xCV, ~ v)-l, 1910, and the algebraic proof came 50 years later (A.
Grothendieck, see [5]). If the characteristic of the
q(V) = dimkHI(V, ~v).
ground field is positive, this theorem is not valid in the
The arithmetic genus Pa(V) of a non-singular alge- general case (J. Igusa, 1955). It has been shown that, in
braic surface V is expressed in terms of the Chern the general case, q is not less than the dimension of the
classes of the tangent bundle to V by the formula: Picard variety.

112
ALGEBRAIC SURF ACE

The non-identity of algebraic and linear equivalence quality b 2 (V);;;'p+2pg(V). It was proved, moreover,
on an algebraic surface implies that everywhere-regular that the number of positive squares in the quadratic
one-dimensional differentials exist on it. The dimension form which is defined on H 2(V, Z) by the intersection
of the space of such differentials in the case where index equals 2pg(V) + 1. The analogues of these results
k=C coincides with the irregularity (Castelnuovo, for fields of finite characteristic are not true. The
Severi, 1905). This fact is a special case of the equality development of the transcendental theory of algebraic
dimHP(V, gq)=dimHq(V, (l'), which is valid for an surfaces is due to the work of P.A. Griffiths.
arbitrary compact Kibler manifold V. If char k >0, this f) Projective immersions of an algebraiC surface. In
result is invalid. the context of the abstract concept of an algebraic
The group of divisor classes with respect to algebraic variety there arose the problem of the existence of
equivalence is finitely generated (see Neron-Severi (abstract) algebraic surfaces that are not immersible in
group). Its rank is denoted by p and is called the Picard a projective space. It was proved that a non-singular
number of the algebraic surface V. The order of the complete algebraic surface is always projective [8].
torsion subgroup of this group is denoted by (1 and is There exist complete singular non-projective algebraic
known as the Severi divisor of V [9]. surfaces. There are various numerical criteria for the
Severi also laid the foundations of the theory of projectivity of an algebraic surface (cf. Ample sheaf).
zero-dimensional cycles on an algebraic surface; this Any non-singular two-dimensional algebraic space is an
theory contains many interesting problems that have algebraic surface.
not yet been solved. Minimal models of algebraic surfaces. A non-singular
e) Topological properties of an algebraic surface. A
algebraic surface V is called a minimal model if each
non-singular projective algebraic surface over the field birational morphism V ~ V' onto a non-singular alge-
of complex numbers is a compact four-dimensional braic surface V' is an isomorphism. A minimal model
oriented real manifold; in particular, the integral exists in each class of birational equivalence. Except
homology groups Hi(V, Z) and the integral cohomology for classes of ruled surfaces, this minimal model is
groups Hi(V, Z), satisfying Poincare duality, have been unique up to an isomorphism [1], [2], [8]. For an alge-
defined for such an algebraic surface. Each divisor on braic surface to constitute a minimal model it is neces-
an algebraic surface V specifies a two-dimensional sary and sufficient for it not to contain exceptional
cycle, the divisors which are algebraically equivalent to curves of the first kind, i.e. irreducible curves which do
zero correspond to cycles homologous to zero. The contract to a non-singular point for some birational
intersection index of divisors is identical with the topo- morphism. Such curves were first studied by M.
logical intersection index of cycles. The following Noether in 1895. They are characterized by the follow-
equalities are valid: (1 is equal to the order of the tor- ing requirements: C must be a rational non-singular
sion group of the group H 2(V, Z) and b 1(V)=2q(V), curve and (C 2 )= -1 ([1], [2]). Minimal models of ruled
where b 1(V) is the first Betti number of the surface V surfaces have been completely classified.
[4], [9].
A method, based on the study of a linear pencil of Classification of algebraic surfaces. According to the
curves on a surface and on the study of the topology of classification of Enriques [2], each non-singular alge-
the curves which vary in this pencil, was developed for braic surface over a field of characteristic zero belongs
the investigation of the topology of algebraic surfaces. to one of the following types, up to a birational
It was proved in this way that non-singular hyperplanes equivalence: a) ruled surfaces (cf. Ruled surface); b)
in CP3 are simply connected [6]. This method was sub- two-dimensional Abelian varieties (cf.Abelian variety);
sequently applied to studies on the topology of mani- c) K3-surfaces (cf. K3-surface); d) elliptic surfaces (cf.
folds of higher dimension. Elliptic surface); or e) general-type algebraic surfaces
The study of etale cohomology of schemes makes it (cf. General-type algebraic surface). This classification
possible to formulate and prove algebraic analogues of is highly analogous to the classification of algebraic
many classical statements in the topology of algebraic curves. Moreover, rational curves correspond to ruled
surfaces for surfaces defined over a field of arbitrary surface, surfaces of the types b) - d) correspond to
characteristic. In particular, the l-adic cohomology elliptic curves, and surfaces of general type correspond
groups Hj( V), the fundamental group and the homotopy to curves of genus g> 1. The type of the minimal
type of an algebraic surface have all been defined [9]. model of an algebraic surface is determined by the
E. Picard [6] initiated the theory of integrals on an values of its numerical invariants. Ruled surfaces are
algebraic surface. His results were subsequently general- characterized by the condition p 12 = 0; type-b) algebraic
ized to manifolds of higher dimension (W.V.D. Hodge, surfaces are characterized by the conditions P12 =1,
ca. 1940). These results yielded, in particular, the ine- Pg = 1 and pa = - 1; type-c). algebraic surfaces are

113
ALGEBRAIC SURFACE

characterized by the conditions Kv=O and q=O; type- Also, the inequality
d) algebraic surfaces are characterized by the condi- ~H2(V,Ea);;;' 2pg-Pa-1
tions (Kf.,)=O andpl2>1 orp12=1 andPg=O. Finally,
is valid [9]. The local moduli scheme S a can be singu-
algebraic surfaces of general type are characterized by
lar, even if char(k)=O. This shows that the 'actual'
the conditions (Kf.,0 and p 12 > 1.
number of moduli, i.e. M' = dim Sa, can be less than
The class of ruled surfaces comprises rational sur-
M. The difference w = M - M' is called the number of
faces (cf. Rational surface), which are characterized by
obstructions to the deformation; the estimate
the conditions P2 =Pa =0 (this makes it possible to give
wE;;;dimH 2(V, Ea) is known. The existence of a global
a definite answer to the Liiroth problem in the two-
moduli variety of an algebraic surface has been proved
dimensional case).
only for certain cases. The moduli variety for general-
Many different results in the theory of the classifica-
type surfaces and K 3-surfaces exists as an analytic
tion of algebraic surfaces concern the problem of the
space or as an algebraic space.
construction of an algebraic surface with given numeri-
cal invariants. Algebraic surfaces are most frequently Automorphisms of algebraic surfaces. The automor-
constructed by representing them as a double covering phism group Aut(V) of a complete algebraic surface V
of the projective plane with a specially selected branch is the group of k-points of a certain group scheme, the
curve (cf. Double plane). The values that can be connected component Auto(V) of which is an algebraic
assumed by the invariants p(l) and Pa of a general-type group. If V is not a ruled surface and if
surface are not known (1986). Some of the results dim Auto(VO, then Pa = -1. If Pg=l=I or if Pg= I
obtained in connection with the classification have and P2 > I, then V is an elliptic surface and Auto(V) is
recently been extended to include fields of arbitrary a one-dimensional Abelian variety. In the other cases V
characteristic. K. Kodaira generalized the classification and Auto(V) are Abelian surfaces [2]. For general-type
results obtained by Enriques to include complex- surfaces Aut(V) is a finite subgroup of a projective
analytic surfaces (cf. Analytic surface (in algebraic group. The group Aut(V) for K3-surfaces over the field
geometry. of complex numbers and for ruled surfaces has been
thoroughly studied. If the surface V is not ruled, then
The moduli problem for algebraic surfaces. This is the
Aut(V) coincides with the group of birational
problem of the classification of algebraic surfaces up to
transformations of V. This group has no algebraic
an isomorphism. The first formula for the number of
structure for ruled surfaces and has not been
parameters (moduli) which determine an algebraic sur-
thoroughly studied (cf. Cremona group). The automor-
face with given invariants Pa' Pg and p(l) for one class
phism group of affine algebraic surfaces (cf. Algebraic
of algebraic surfaces was given by Noether in 1888.
variety, automorphism of an) is now under intensive
Non-ruled surfaces with given invariants Pa' Pg and p(l)
depend on M moduli, where study.

M = lOpa -Pg-2p(l) + 12+8, (*)


Algebraic surfaces over algebraically non-closed fields.
Number-theoretic problems in the theory of algebraic
where (J is a birational invariant of the algebraic surface surfaces involve Diophantine problems (cf. Diophantine
under consideration [2], [9]. geometry). The classification of rational surfaces over
The modem theory of deformations gives the follow- non-closed fields, begun by Enriques, Comessatti and
ing interpretation of this classical result. The number M Segre, has now been completed. The group of birational
of moduli is identical with the dimension of the Zariski automorphisms has been studied for a certain class of
space tangent to the local (or, in the algebraic case, the such stafaces.
formal) moduli scheme S a for the polarized surface The results of the theory of algebraic surfaces are
(V, a) of this class of algebraic surfaces. Moreover, if used in the study of algebraic curves over function
fields (d. MordeU conjecture). The generalization of
certain results of the theory of algebraic surfaces
is the canonical extension of the tangent sheaf Tv to (minimal models, intersection theory) to the wider class
the surface V by the structure sheaf (!) v, defined by the of regular two-dimensional schemes [7] permits the use
fundamental class of the polarization a, then of the geometrical language in the study of algebraic
M = dim Im(Hl(V, Ea)~Hl(V, Tv. curves over number fields.
Formula (*) is a consequence of this exact sequence References
and the Riemann - Roch theorem. The number (J turns [I] 'Algebraic surfaces', Trudy Mat. Inst. Steklov. 75 (1965) (in
out to be the sum Russian).
[2] ENRIQUES, F.: 1 superjicie algebraiche, Bologna, 1949.
dimk H2(V, Ea)+dimk H2(V, Tv). [3] lUNG, H.W.E.: Aigebraische Fiiichen, Hannover, 1925.

114
ALGEBRAIC SYSTEM

[4] LEFSCHETZ, S.: L'anolysis situs et la geometrie a1gebrique, Paris, The pair of families <{nj: iE/}; {m/ jEJ}> is
1924. called the type of the algebraiC system A. Two algebraic
[5] MUMFORD, D.: Lectures on curves on an algebraic surface,
Princeton Univ. Press, 1966.
systems A, A' have the same type if 1=]', J =f and
[6) PICARD, E. and SIMART, G.: Theorie desfonctions algebriques nj =n; , mj =mj for all i EI, j EJ. Basic operations OJ,
de deux variables indepentiantes, 1-2, Chelsea, reprint, 1971. 0; and basic relations rj' rj of two algebraic systems A,
[7] SHAFAREVICH, I.R.: Lectures on minimal models and birational
A' of the same type, with identical indices in I, J
transformtJtions of two-dimensional schemes, Tata Inst. Fundam.
Res., 1966. respectively, are called similar.
[8) ZAIusIa, 0.: Introduction to the problem of minimal models in An algebraic system A is called finite if the set A is
the theory of algebraic surfaces, Math. Soc. Japan, 1958. finite and it is called of finite type if the set I U J is
[9) ZAIuSKI, 0.: Algebraic surfaces, Springer, 1971.
[10] ZAIuSKI, 0.: An introduction to the theory of algebraiC surfaces, finite. An algebraic system A of finite type is written as
Springer, 1969. A = <A; OJ, ,os, rJ, ,rt >.
1. V. Dolgachev
An algebraic system A= <A, 0, R> is called a
Editorial comments. One of the more important achieve- universal algebra or an algebra if the set R of its basic
ments is the proof of the Miyaoka- Yao- Bogomolov ine- relations is empty, and it is called a model or a rela-
quality ~ o;;;;3C2, where Cj denotes the i-th Chern class of an
tional system if the set 0 of basic operations is empty.
algebraic surface [A1]. A recent reference is [A2].
Classical algebraic systems are groups, rings, linear
References spaces, linear algebras, totally ordered sets, totally
[A1] MIYAOKA, Y.: 'On the Chern numbers of surfaces of general
type', Invent. Math. 42 (1977), 225-237. ordered groups, lattices, etc.
[A2] BAllTH, W., PETERs, C. and YEN, A. VAN DE: Compact com- A non-empty subset B of the underlying set A of an
plex surfaces, Springer, 1984. algebraic system A = <A, 0, R> is called closed if for
[A3] BBAUVll.LB, A.: 'Surfaces algebrique complexes', Asterique
54 (1958). Translated into English as a London Math. Soc.
any elements b J, . ,bn, of B the value oj(b J, . ,bn,}
Lecture Notes. of each basic operation OJ EO also belongs to B. By
[A4] GllIFFITIIS, P.A. and HAIuus, J.E.: Principles of algebraic considering the operations of 0 and the relations of R
geometry, 1-2, Wiley, 1978.
on a closed subset B, one obtains an algebraic system
AMS 1980 Subject Classification: 14JXX B= <B, 0', R' > which is of the same type as the
given algebraic system and is called a subsystem of the
ALGEBRAIC SYSTEM - A set with operations and algebraic system A. Subsystems of algebras are called
relations defined on it. An algebraic system is one of subalgebras, while subsystems of models are called sub-
the basic mathematical concepts and its general theory models. The concept of a subalgebra strongly depends
has been developed in depth. This was done in the on the set of basic operations of the algebra under con-
nineteen-fifties, and the work took place on the inter- sideration. Thus, a groupoid G is an algebra of type
face between algebra and mathematical logic. <2>, i.e. an algebra with one basic operation
Fundamental concepts. An algebraic system is an object G X G..",G. A groupoid H with a distinguished unit e is
A= <A, 0, R> consisting of a non-empty set A, a an algebra of type <2,0>, the distinguished element
family 0 of algebraic operations (cf. Algebraic opera- of which has the property o(e, h)=o(h, e)=h for all
tion) oj:An'..",A (iEI) and a family R of relations (cf. h EH with respect to the basic operation 0: H X H..",H.
Relation) rj ~A (j EJ) defined on A. The indices
mj
For this reason any subgroupoid of a groupoid H with
nj, mj of the Cartesian powers of A under consideration a distinguished unit e contains e, while a subgroupoid
are assumed to be non-negative integers and are said to of a groupoid <H, 0> does not necessarily contain e.
be the arities of the respective operations and relations. Unlike algebras, any non-empty subset of a model may
The set A is called the carrier or the underlying set of be considered as a submodel.
the algebraic system A, while its elements are called the An algebraic system A is isomorphic to an algebraic
elements of this system. The cardinality I A I of A is system A' of the same type if there exists a one-ta-one
said to be the cardinality or order of the algebraic sys- mapping cp of the set A into the set A' such that
tem A. The image oj(al,'" ,an) of the element
q,(o;(at. ... ,an,) = O;(q,(al),' .. ,q,(an,), (1)
(aJ, ... ,an,)EA n, under the mapping oj:An'..",A is
riat. ... ,am) <=> rj(q,(ad, . .. ,q,(am)), (2)
called the value of the operation OJ at the point
(aJ, ... ,an,). Similarly, if (aI, ... ,am)Erj' then one holds for all a I, a2, ... , of A and for all i EI, j EJ. A
says that the elements a I, . . . ,a of A are in relation
mj mapping cp possessing these properties is called an iso-
rj' and one writes ria J, ,am). The operations OJ morphism.
(i EI) and the relations rj (j EJ), unlike other opera- In the following, a class of algebraic systems will be
tions and relations that may be defined on A, are called understood to mean only an abstract class, i.e. a class
basic or primitive. of algebraic systems of the same type which, whenever

115
ALGEBRAIC SYSTEM

it contains a system A, also contains all systems iso- 8(a], bd, . ; . ,8(an" bn) ~
morphic to A. In considering a class Sf of algebraic sys- ~ 8(Fj(a), ... ,an,),Fj(b), ... ,bn,))
tems, all systems in this class are usually written in a
definite signature, as follows. Let the type of the class for all a J, b], ... ,all" bll, of A and all F j EQj . For
Sf be <{nj: iEI}; {m/ JEJ}>. To each iEI is each homomorphism 4> of an algebraic system A the
assigned some symbol Fj, called a functional, while to binary relation fJ(a, b) which holds if and only if
each j EJ is assigned a symbol Pj' called a predicate. If 4>(a)=4>(b) is a congruence in A, called its kernel
an algebraic system A belongs to the class st and if congruence. For any congruence 0 of an Q-system A
OJ: A II, ~A is a basic operation in it, then the element and for each element a EA the set
oj(a], ... ,all) of A is written as F;(a], ... ,all). Simi- a / fJ = {x EA: fJ(x, a)} is called a coset of the algebraic
larly, if rj CA mj is a basic relation in A and the element system A by the congruence (). On the assumption that,
(a], ... ,am,)Erj' then one writes P/a], ... ,am)=T for each F; EQj' Pj EQp'
(true) or simply P/a], ... ,am). If, on the other hand, Fj(aj /fJ, ... ,an, /fJ) = Fj(a], ... ,an)/fJ
(a], ... ,am)flrj' one writes P/a], ... ,am)=F(false) and
or -,P/a] , ... ,am). Let Qj = {Fj: i EI}, P/b j /fJ, ... ,bmj /fJ) = T
Qp = {Pj : j EJ} and let v be the mapping of the union if and only if there exist elements c], ... ,cmj in A such
Qj U Qp into the set of natural numbers {a, 1,2, ... }, that fJ(b] , cd, ... ,fJ(bm , cm ) and Pj'(CI, ... ,cm ), one
j j ,

defined by the formulas v(Fi)=ni (iEI), v(Pj)=mj obtains the algebraic system A / fJ, which is of the same
(j EJ). The object Q= <Qj , Qp' v> is the signature of type as the given system; it is called the quotient system
the class st. A finite signature is written as a string of the algebraic system A by the congruence fJ. For
< 1'1
---'II,) ---'II,). p(m,)
, ... ,1'~ , ] , ... , p(m,)
lo > r , more bne . fly,
each congruence () of an algebraic system A the canoni-
<F], ... ,Fs; PI, ... ,PI>. An algebraic system A, cal mapping 4>( a) = a / () (a EA) is a homomorphism of
written in the signature Q, is known as an Q-system and A onto the quotient system A / fJ for which the given
is denoted by A= <A, Q>. congruence () is the kernel congruence. If 4> is a
The conditions (I) and (2) for an isomorphism of the homomorphism of an algebraic system A into an alge-
systems A and A' become simplified if the systems are braic system A' and () is the kernel congruence for 4>,
considered in one signature Q. Thus, if Fi (i EJ) and Pj then the mapping 1/;{a / fJ)=4>(a) is a homomorphism of
(j EJ) are the signature symbols, equations (1) and (2) the quotient system A / fJ into the algebraic system A'.
assume the form If the homomorphism 4> is strong, 1/; is an isomorphism.
cp(Fj(a j, ... ,an)) = Fj(cp(a j), ... ,CP(an)), (3) The Cartesian product of Q-systems Aa = <Aa, Q>,
aEA=I= 0, is the Q-system D= <D, Q> in which D is
Pia], ... ,am) ~ Picp(aj), ... ,CP(am)). (4)
the Cartesian product of the underlying sets A a (a E A)
A homomorphism of an Q-system A into an Q-system and the basic operations and the basic relations on D
A' is any mapping 4>: A ~A' which fulfils both condi- are given by the conditions: F;(dJ, ... ,dn)
tion (3) and the condition (d l , . . . ,dn, ED, FiEAf) is the element dED with coor-
Pia], ... ,am) ~ Picp(aj), ... ,cp(amj )) (5) dinates d(a)=Fi(dl(a), ... ,dn,(a)) (aEA),
for all Fi EQj, Pj EQp and for all a I, a2, ... , of A. A P/dJ, ... ,dm)=T (PjEQp) if and only if
homomorphism 4>: A~A' is called strong if, for any ele- P/dl(a), ... , dm,(a)) = Tfor all aEA.
ments b I, . . . ,bm, of A' and for any predicate symbol A language of the first order. The basic formal
Pj of Qp, the relation P/ b J, . . . ,bm) = T implies the language of the theory of algebraic systems is the first-
existence in A of pre-images a], ... , am, of the ele- order language L which is constructed as follows. The
ments b l , . . . ,bm, such that Pj(a], . .. ,am)=T.
, The alphabet of the language L in the given signature
concepts of a homomorphism and of a strong Q=<g,j,Qp'v>, Qj={Fi: iE/}, g,p={Pj:jEJ}, con-
homomorphism of algebras are identical. Models have sists of the object variables x I, X2, . . . , the function
homomorphisms that are not strong, and have one-to- symbols Fi (i EI), the predicate symbols Pj (j EJ), the
one homomorphisms that are not isomorphisms. Under logical connectives
a homomorphism 4>: A~A', images of subsystems of A
are subsystems in A'; and non-empty complete pre- the quantifiers:
images of subsystems of A' are subsystems in A. \;JXk - 'for each element Xk';
An equivalence relation () CA X A is called a 3Xk - 'there exists an element Xk';
congruence of the Q-system A if and the auxiliary symbols: brackets and commas.

116
ALGEBRAIC SYSTEM

The (first-order) properties of n-systems are expressed is an isomorphic mapping of an n-system A into an
<I>
by finite sequences of alphabet symbols, or words, con- n-system A', then
structed in accordance with specific rules and known as ffia J, ... ,ak) = fficp(a 1), ... ,CP(ak)),
terms and formulas. It is inductively assumed that each
word of the form Xk or F; is a term if p(Fj)=O; if for all a J, ,ak from A.
fl> ... ,In are terms and if n =p(Fj), then A formula ~ is called closed if it contains no free
Fj(fl> ... ,In) is also a term. object variables. For any closed formula ~ of the signa-
If A is an n-system and if f(x I, . . . ,xn ) is a term of ture n and any n-system A one may speak of ~ being
the signature n containing object variables x I, . . . ,Xk, true or false in A. A set S of closed formulas of a
then if one replaces x I> ,Xk by some elements given signature n is said to be realizable or consistent if
a I, . . . ,ak in A, and executes the operations in A on there exists an n-system in which all formulas from S
the latter elements corresponding to the symbols of f n are true.
forming part of this term, then one obtains an element The compactness theorem or the Godel- Mal'tsev
f(al> ... ,ak) from A, which is called the value of the theorem states: If each finite subset of an infinite set S
term f(xJ, . .. ,Xk) for XI =aJ, ... ,xk=ak. If <I> is a of closed formulas of a given signature 0 is realizable,
homomorphism of an n-system A into an n-system A', then the entire set S is realizable as well.
then Axiomatizable classes. Let S be some set of closed for-
qij(aJ, ... ,ad) = j(cp(ad, ... ,cp(ad) mulas of a signature O. The class of all O-systems in
The concept of a formula of the signature n, and of which all formulas of S are true will be denoted by KS.
free and bound object variables in it, is also defined The set Th ~ of all closed formulas of a signature 0
inductively: which are true in all O-systems of the given class ~, is
I) If P is any predicate symbol for n or the equality called the elementary theory of ~. In particular, if (A)
sign =, m = p(P) or 2 respectively, and if fl' ... ,fm is the class of n-systems isomorphic to the given 0-
are arbitrary terms of the signature n, then the word system A, then Th(A) is said to be the elementary
P(fJ, ... ,fm) is a formula in which all object variables theory of the n-system A and is simply denoted by
are free. Th A. A class ~ of O-systems is said to be axiomatizable
2) If ~ is a formula, so is ...,~. Free (bound) object if ~ = K Th~. A class ~ of O-systems is axiomatizable
variables in the formula ...,~ are those and only those if and only if there exists a set S of closed formulas of
variables which are free (bound) in ~. the signature 0 such that ~=KS.
3) If ~I' fu are formulas and if the object variables Along with the general concept ofaxiomatizability,
which form part of both formulas are free in both for- axiomatizability using first-order formulas of a special
mulas, then the words kind may also be considered. The special formulas of a
(6) given signature which are the most important in alge-
bra are:
are also formulas. Identities - the formulas
Object variables which are free (bound) in at least
one of the formulas ~I' ~2 are called free (bound) in (Vxd .. (Vxs)P(jJ, ... ,fm),
the formulas (6) as well. where P is some predicate symbol from 0 or the equal-
4) If an object variable Xk occurs free in a formula ~, ity sign =, and fl' ... ,fm are terms of the signature 0
then the words 0txkm, (3Xkm are again formulas in in XI, . . . ,XS.
which the variable Xk is bound, while all the other Quasi-identities - the formulas
object variables which occur free or bound in the for-
(VXl) (Vxs)[P(jJ, ... ,jk)& ... &Q(gJ, ... ,gm)~
mula ~ remain free or bound in the formulas 0txkm,
m
(3Xk as well. ~R(hJ, ... ,hn )],
Let an n-system A and a formula ~ of the signature where P, ... , Q, R are certain predicate symbols from
n be given. If one now assigns to all free object vari- Op or equality signs, while
ables x I, . . . ,Xk from ~ some values a I, . . . ,ak from fI. ,/k,gl> ... ,gm, hl> ... ,hn are terms of the sig-
A and if one interprets the function and the predicate nature 0 in x I> ,XS
symbols forming part of ~ as the respective basic Universal formulas - the formulas 0tx I) ... 0txsm,
operations and basic relations in A, then one obtains a where ~ is a formula of the signature 0 not containing
definite statement which may be true or false. One quantifiers.
accordingly assigns to the formula ~ the value T or F, If a set S of identities (quasi-identities or universal
for XI =aJ, ... ,xk=ak denoted by ffial, . . . ,ak). If formulas) of a signature n is given, then the class KS is

117
ALGEBRAIC SYSTEM

known as a variety (quasi-variety or universal class) of of the n-systems Aa (aEA) and if ffixJ, ... ,xd is an
Q-systems. arbitrary formula of the signature n in which
Birkhoffs theorem: A non-empty class ~ of Q- x I, . . . ,Xk are free object variables, then for any ele-
systems is a variety if and only if it is closed with ments d J, . . . ,dk ED:
respect to subsystems, Cartesian products and 'is{d j let>, ... ,dk let = T ~
homomorphic images.
~ {a: 'is{dj(a), ... ,dk(a=T} E et>.
If A= <A, Q> is some n-system, then, by exchang-
ing each function symbol F; from nf for a predicate In particular, a closed formula iY of the signature n
symbol F; of arity n; + I (higher by one) and putting, is true in the ultraproduct D / cI> of the n-systems Au
for elements a J, . . ,ani' a from A (aEA) if and only if the set of numbers of factors in
which the formula iY is true belongs to the ultrafilter cI>.
F;(a], ... ,ani' a) ~ Fi(a\> ... ,an) = a, Therefore, any axiomatizable class of n-systems is
one obtains a model A = <A, n > for which closed with respect to ultraproducts.
n;=np U {F;: F;Enf } Submodels of the model A A class L of n-systems is universally axiomatizable if
are called submodels of the n-system A. For any non- and only if it is closed with respect to subsystems and
empty finite subsets Aa CA, n,8 en, the model ultraproducts.
Aap = <A a , n,8 > is called a finite depletion of the finite An n-system E= < {e}, Q> is said to be a unit if its
submodel Aa = <A a, n > of the n-system A. An n- underlying set consists of a single element, e.g. an ele-
system A is said to be locally imbeddable in a class ~ of ment e, and if Pie, ... ,e)=T for all PjEnp-
n-systems if for each finite depletion Aap of any finite Mal'tsev's theorem: A class ~ of n-systems is a
submodel Aa of the n-system A there exists in the class quasi-variety if and only if it contains a unit n-system
~ an n-system B (depending on the chosen finite deple- and is closed with respect to subsystems and filtered
tion Aap) such that the model Aap = <A a, n,8 > is iso- products (through an arbitrary filter).
morphic to the model BaP=<Ba,n,8> for a suitable Completeness and categoricity. A non-empty class ~ of
subset Ba CB. n-systems is called categorical if all n-systems from ~
A subclass 53 of a class ~ of n-systems is called are mutually isomorphic. Any categorical axiomatizable
universal (or universally axiomatizable) in ~ if there class of Q-systems consists of one (up to an isomor-
exists a set S of universal formulas of the signature n phism) finite n-system.
such that 53 = KS n
Sl'. A class ~ of n-systems is called categorical in cardi-
The Tarski - Los theorem: A subclass 53 of a class ~ nality m if it contains an n-system of cardinality m and
of n-systems is universal in ~ if and only if 53 contains if all Q-systems from ~ of cardinality m are mutually
all systems from ~ which are locally imbeddable in 53. isomorphic. For example, the class of algebraically
Filtered products. Let D = ITAa be the Cartesian pro- closed fields of fixed characteristic is categorical in any
duct of the n-systems Aa (a E A, A=r'= 0) and let cI> be uncountable infinite cardinality. A non-empty class ~
some filter over A. The relation of n-systems is called complete if for any n-systems
A, B from ~ the equality ThA=ThB is valid.
d=lph ~ {a:EA: d(a:) =h(a:)} Eet> (d, h ED) Vaught's theorem. If an axiomatizable class Sl' of n-
is an equivalence relation on the underlying set D of systems is categorical in some cardinality
the n-system D. For each element d ED, let d / cI> be m;;;;' I Qf U np I and if all n-systems in Sl' are infinite,
the coset by this equivalence and let then ~ is a complete class.
D / cI> = {d / cI>: d ED}. Putting In particular, the class of all algebraically closed
F,(d j Iet>, .. ,dn, let = det> ~ fields of fixed characteristic is complete.
See also Algebraic system, automorphism of an; Alge-
{a: F;(dj(a), . .. ,dn,(a=d(a)} E et> (FiEfJj ),
$=>
braic systems, quasi-variety of; Algebraic systems, class
P/d j Iet>, ,dm} let ~ of; Algebraic systems, variety of.
~ {a: P/dj(a), ... ,dm/a} E et> (P;EfJp ), References
[I] MAL'TSEV, A.I.: Algebraic systems, Springer, 1973 (translated
one obtains the n-system D/cI>=<D /cI>,n>, which from the Russian).
is called the product of the n-systems Au (aEA)filtered [2] COHN, P.M.: Universal algebra, Reidel, 1981.
[3] GRATZER, G.: Universal algebra, Springer, 1979.
through the filter cI>. The n-systems Au (aEA) are said
[4] BELL, J.L. and SLOMSON, A.B.: Models and ultraproducts: an
to be the factors of this product. If cI> is. an ultrafilter introduction, N orth- Holland, 1971.
over A, the filtered product D / cI> is known as the [5] CHANG, c.c. and KEISLER, H.J.: Model theory, North-Holland,
ultraproduct of the Q-systems Au (aEA). 1973.

The ultraproduct theorem: If D / cI> is the ultraproduct D.M. Smirnov

118
ALGEBRAIC SYSTEM, AUTOMORPHISM OF AN

Editorial comments. The terminology in use for this sub- group Aut(A) / IC(A) is isomorphic to an automor-
ject in English-speaking countries differs in a number of phism group of the lattice of all congruences of the sys-
respects from the Russian terminology. What is here called tem A [1]. In particular, any inner automorphism
an algebraic system is usually called a (first-order) structure x ~a -I xa of a group defined by a fixed element a of
(a structure of type 0, if a given signature or similarity type this group is an IC-automorphism. However, the exam-
o has been specified); the term algebra is often used ple of a cyclic group of prime order shows that not all
instead of 'structure' in the case when 0 has no primitive
IC-automorphisms of a group are inner.
relations. The term model is reserved for a structure which
Let ~ be a non-trivial" variety of O-systems or any
satisfies a given set of first-order sentences (closed formu-
las); it is not normally used in the sense defined above. The other class of O-systems comprising free systems of any
underlying set of a structure is sometimes called its (non-zero) rank. An automorphism cp of a system A of
universe. the class ~ is called an I -automorphism if there exists a
Although the text is correct in saying that the subject was term f.p(xlo'" ,xn ) of the signature 0, in the
largely developed in the nineteen-fifties (principally through unknowns XI, . ,xn , for which: 1) in the system A
work of LA Henkin [L.A Khenkin], AI. Mal'tsev, A Robin- there exist elements a 2, . . . ,an such that for each ele-
son and A. Tarski), two important precursors dating from ment x EA the equality
1935 should be mentioned. They are Birkhoff's theorem
characterizing varieties of algebras [A1] and Tarski's defini- q,(x) = !.p(x,a2,'" ,an)
tion of validity of formulas in a first-order structure [A2]. is valid; and 2) for any system B of the class ~ the
References mapping
[A 1] BIRKHOFF, G.: 'On the structure of abstract algebras', Proc.
Cambridge Phi/os. Soc. 31 (1935), 433-454.
is an automorphism of this system for any arbitrary
[A2] TARSKI, A.: 'Der Wahrheitsbegriff in den formalisierten Spra-
chen', Studia Phi/os. 1 (1935),261-405. selection of elements X2, . . . ,Xn in the system B. The
set I(A) of all I-automorphisms for each system A of
AMS 1980 Subject Classification: 08AXX, 03CXX
the class ~ is a normal subgroup of the group Aut(A).
ALGEBIlAIC SYSTEM, AUTOMORPIDSM OF AN - In the class ~ of all groups the concept of an 1-
An isomorphic mapping of an algebraic system onto automorphism coincides with the concept of an inner
itself. An automorphism of an O-system A= <A, 0> is automorphism of the group [2]. For the more general
a one-to-one mapping cp of the set A onto itself having concept of a formula automorphism of O-systems, see [3].
the following properties: Let A be an algebraic system. By replacing each
basic operation F in A by the predicate
CP(F(XIo . .. ,xn = F(CP(XI), ... ,cp(xn , (1)
R(XIo .. . ,xn,y) ~ F(XIo ... ,xn)=y
P(XIo . .. ,xm ) ~ P(CP(Xl), ... ,q,(xm )), (2)
(X., ... ,xn,YEA),
for all x], x2, ... , from A and for all F, P from O. In
other words, an automorphism of an O-system A is an one obtains the so-called model A* which represents the
isomorphic mapping of the system A onto itself. Let G system A. The equality Aut(A*)=Aut(A) is valid. If
be the set of all automorphisms of the system A. If the systems A= <A, 0> and A' = <A, 0' > have a
cpEG, the inverse mapping cp-I also has the properties common carrier A, and if 0 CO', then Aut(A) d Aut(A').
(1) and (2), and for this reason cp-I EG. The product If the O-system A with a finite number of generators is
a=# of two automorphisms cp, t/I of the system A, finitely approximable, the group Aut(A) is also finitely
defined by the formula a(x)=t/I(cp(x, xEA, is again an approximable (cf. [1 D. Let ~ be a class of O-systems
automorphism of the system A. Since multiplication of and let Aut(~) be the class of all isomorphic copies of
mappings is associative, < G, " -I > is a group, known the groups Aut(A), AE~, and let SAut(~) be the class
as the group of all automorphisms of the system A; it is of subgroups of groups from the class Aut(~). The class
denoted by Aut(A). The subgroups of the group Aut(A) SAut(~) consists of groups which are isomorphically
are simply called automorphism groups of the system A. imbeddable into the groups Aut(A), AE~.
Let cp be an automorphism of the system A and let 8 The following two problems arose in the study of
be a congruence of this system. Putting automorphism groups of algebraic systems.
1) Given a class ~ of O-systems, what can one say
(x,Y)E8.p ~ (cp-l(X), cp-I(y))E8, x,YEA, about the classes Aut(~) and SAut(~)?
one again obtains a congruence 8.p of the system A. The 2) Let an (abstract) class K of groups be given. Does
automorphism cp is known as an Ie-automorphism if there exist a class ~ of O-systems with a given signature
8.p = () for any congruence () of the system A. The set o such that K=Aut(~) or even K=SAut(~)? It has
IC(A) of all IC-automorphisms of the system A is a been proved that for any axiomatizable class ~ of
normal subgroup of the group Aut(A), and the quotient models the class of groups SAut(~) is universally

119
ALGEBRAIC SYSTEM, AUTOMORPHISM OF AN

axiomatizable [1]. It has also been proved [1], [4] that if U a Aa=A) and which belong to~. A class ~ is called
~ is an axiomatizable class of models comprising infin- local if each ~-system A with a local set of ~
ite models, if <B, OS;;; > is a totally ordered set and if G subsystems belongs to the class ~. Theorems which
is an automorphism group of the model <B, OS;;; >, establish the local nature of given abstract classes are
then there exists a model AE~ such that A ~B, and for called local (cf. MaI'tsev local theorems).
each element g E G there exists an automorphism cp of An ~-system A is called ~-approximable (or ~
the system A such that g(x)=cp(x) for all XEB. The residual) if, for any predicate P E {~p, =} (i.e. for any
group G is called 1) universal if GESAut(~) for any basic predicate as well as for the predicate coinciding
axiomatizable class ~ of models comprising infinite with the equality relation in A) and for any elements
models; and 2) a group of ordered automorphisms of an aJ, ... ,an in A for which P(aJ, ... ,an)=F, there
ordered group H (cf. Totally ordered group) if G is iso- exists a homomorphism cp: A~B of the system A into
morphic to some automorphism group of the group H some system B of the class ~ for which, again,
which preserves the given total order OS;;; of this group P(cp(a)), ... ,CP(an=F. Any subsystem of a ~
(i.e. a os;;;b=>cp(a)os;;;CP(b) for all a, bEH, CPEG). approximable system is itself ~-approximable. If ~ is
Let I be the class of totally ordered sets <M, OS;;; >, the class of all finite ~-systems, a ~-approximable sys-
let U be the class of universal groups, let RO be the tem 2{ is called finitely apprOXimable (or residually fin-
class of right-ordered groups and let OA be the class of ite). If an abstract class ~ has a unit system
ordered automorphism groups of free Abelian groups. E = < {e }, ~ >, an ~-system A is ~-approximable if
Then [4], [51 [6]: and only if it is isomorphically imbeddable in a Carte-
SAut(l) = U = RO = ~A. sian product of systems from the class ~ [3]. A class ~
Each group is isomorphic to the group of all automor- is called residual if all ~-approximable systems belong
phisms of some ~-algebra. If ~ is the class of all rings, to the class ~. A class ~ is homomorphically closed if it
Aut(~) is the class of all groups [1]. However, if ~ is
contains, for each of its ~-systems A, also all ~-systems
the class of all groups, Aut(~)=F~; for example, the that are homomorphic images of A. All residual
cyclic groups C3 , C5 , C7 of the respective orders 3, 5 homomorphically-closed classes are local [5].
and 7 do not belong to the class Aut(~). There is also A class ~ of ~-systems is called (finitely) axiomatiz-
no topological group whose group of all topological able if there exists a (finite) set S of first-order closed
automorphisms is isomorphic to Cs [7]. formulas of the signature 12 such that ~ consists of
exactly those ~-systems in which all formulas of S are
References
[1] PLOTKIN, B.I.: Groups of automorphisms of algebraic systems,
true. Finitely-axiomatizable classes are also referred to
Wolters-Noordhoff, 1972. as elementary classes. It has been shown [5] with the aid
[2] CsAKANY, B.: 'Inner automorphisms of universal algebras', of the generalized continuum hypothesis that: 1) a class
Pub!. Math. Debrecen 12 (1965),331-333.
~ of algebraic systems is axiomatizable if and only if it
[3] GRANT, J.: 'Automorphisms definable by formulas', Pacific J.
Math. 44 (1973), 107-115. is closed with respect to ultra-products, and its comple-
[4] RABIN, M.O.: 'Universal groups of automorphisms of models', ment (in the class of all ~-systems) is closed with
in Theory of models, North-Holland, 1965, pp. 274-284. respect to ultra-powers; 2) a class ~ of algebraic sys-
[5] COHN, P.M.: 'Groups of order automorphisms of ordered sets',
Mathematika 4 (1957), 41-50. tems is elementary if and only if both it and its com-
[6] SMlRNOV, D.M.: 'Right-ordered groups', Algebra i Logika 5, plement are closed with respect to ultra-products. The
no. 6 (1966), 41-59 (in Russian). theory ofaxiomatizable classes of algebraic systems
[7] WILLE, R.J.: 'The existence of a topological group with auto-
morphism group C 7 ', Quart. J. Math. Oxford (2) 18 (1967),
deals with the connection between the structural pro-
53-57. D. MS' perties of these classes and the syntactic features of the
. mlrnov
formal language in which these classes may be
AMS 1980 Subject Classification: 08A35 specified. Axiomatizable classes which play an espe-
cially important role in algebra include varieties (cf.
ALGEBRAIC SYSTEMS, CLASS OF - A class of Algebraic systems, variety of) and quasi-varieties (cf.
algebraic systems of the same type. All systems of a Algebraic systems, quasi-variety of), which are local and
given type are assumed to be written in a given signa- residual.
ture ~ and are called ~-systems. A class ~ of 12-systems In addition to axiomatizability by first-order closed
is called abstract if, whenever it contains a system A, it formulas, axiomatizability by special second-order
also contains all 12-systems isomorphic to A. closed formulas is also considered. To the function and
Let ~ be an abstract class of ~-systems. One says predicate signature symbols F; (i E/), Pj (j EI) of a
that an 12-system A has a local set of ~-subsystems if fixed signature 12 are added predicate variables
there exists an inclusion-directed set {A,,: ilEA} of R 1 , R 2, . . . . Let 0: be a quantifier-free formula of the
subsystems A" of the system A which cover A (i.e. first order, consisting of function and predicate signa-

120
ALGEBRAIC SYSTEMS, QUASI-VARIETY OF

ture symbols, predicate variables R I, . . . , Rs and terms of the signature Q in the object variables
object variables x I, . . . , x,. A second-order formula x I, . . . , Xs' By virtue of Mal'tsev's theorem [1] a
Q 0:, where Q is some sequence of quantifiers of the quasi-variety Sl' of algebraic systems of signature n can
type ('VR i ), (3Ri) or ('Vxd is known as crypto-universal. also be defined as an abstract class of Q-systems con-
The second-order formulas formed from the crypto- taining the unit Q-system E, and which is closed with
universal formulas without free object variables, with respect to subsystems and filtered products [1], [2]. An
the aid of the logical connectives &, V,~, -, and with axiomatizable class of n-systems is a quasi-variety if
subsequent quantification by V on all free predicate and only if it contains the unit Q-system E and is
variables encountered in the representations of the closed with respect to subsystems and Cartesian pro-
crypto-universal formulas, are called Boolean-universal ducts. If Sl' is a quasi-variety of signature n, the sub-
formulas of the signature n. A class Sl' of n-systems is class Sl'l of systems of Sl' that are isomorphically
called quasi-universal if there exists a set S of Boolean- imbeddable into suitable systems of some quasi-variety
universal formulas of the signature n such that Sl' con- Sl" with signature Q':2 Q, is itself a quasi-variety. Thus,
sists of the n-systems in which all formulas of S are the class of semi-groups imbeddable into groups is a
true, and only of such systems. A quasi-universal class quasi-variety; the class of associative rings without zero
of Q-systems is local (Mal'tsev's theorem). A.I. Mal'tsev divisors imbeddable into associative skew-fields is also
[4] gave a more detailed definition of a quasi-universal a quasi-variety.
class. A quasi-variety Sl' of signature n is called finitely
References definable (or, is said to have a finite basis of quasi-
[I] MA!.'rsEv, A.I.: 'A general method for obtaining local identities) if there exists a finite set S of quasi-identities
theorems in group theory', Uchen. Zap. Ivanovsk. Gas. Ped. of Q such that Sl' consists of only those Q-systems in
Inst. 1, no. 1 (1941), 3-9 (in Russian).
[2] MAL'rsEv, A.I.: 'Model correspondences', Izv. Akad. Nauk which all the formulas from the set S are true. For
SSSR Ser. Mat. 23 (1959),313-336 (in Russian). instance, the quasi-variety of all semi-groups ,vith can-
[3] MA!.'rsEv, A.I.: Algebraic systems, Springer, 1973 (translated cellation is defined by the two quasi-identities
from the Russian).
[4] MAL'rsEv, A.I.: 'Some problems in the theory of classes of
zx=zy ~ x=y, xz=yz ~ x=y,
models', in Proc. 4-th AI/-Union Math. Congress 1961, Len-
ingrad, 1963 (in Russian). Trans!. in: Amer. Math. Soc. and is therefore finitely definable. On the other hand,
Trans!. (2) 83 (1969), 1-48.
[5] COHN, P.M.: Universal algebra, Reidel, 1981. the quasi-variety of semi-groups imbeddable into
[6] CLEAVE, J.P.: 'Local properties of systems', J. London Math. groups has no finite basis of quasi-identities [l], [2].
Soc. 44 (1969), 121-130. Let Sl' be an arbitrary (not necessarily abstract) class
D.M. Smirnov
of Q-systems; the smallest quasi-variety containing Sl' is
Editorial comments. The characterization (under the said to be the implicative closure of the class Sl'. It con-
generalized continuum hypothesis) ofaxiomatizable and ele- sists of subsystems of isomorphic copies of filtered pro-
mentary classes is due to H.J. Keisler [A1].
ducts of Q-systems of the class Sl' U {E}, where E is
Translations of the articles [1], [2] and [4] may (also) be
the unit Q-system. If Sl' is the implicative closure of a
found in [A2], Chapts. 2, 11 and 26, respectively.
The term inductive class is sometimes used instead of class ill: of n-systems, ill: is called a generating class of
'local class'. the quasi-variety Sl'. A quasi-variety Sl' is generated by
one system if and only if for any two systems A, B of Sl'
References
[A1] KEISLER, H.J.: 'Ultraproducts and elementary classes', Indag. there exists in the class Sl' a system C containing sub-
Math. 23 (1961), 477-495. systems isomorphic to A and B [l]. Any quasi-variety Sl'
[A2] MAL'CEV, A.1. [A.1. MAL'rsEv]: The metamathematics of alge- containing systems other than one-element systems has
braic systems. Collected papers: 1936 - 1967, North-Holland,
1971.
free systems of any rank, which are at the same time
free systems in the equational closure of the class Sl'.
AMS 1980 Subject Classification: 08C10, 03C52
The quasi-varieties of n-systems contained in some
given quasi-variety Sl' of signature n constitute a com-
ALGEBRAIC SYSTEMS, QUASI-VARIETY OF - A
plete lattice with respect to set-theoretic inclusion. The
class of algebraic systems (Q-systems) axiomatized by atoms of the lattice of all quasi-varieties of signature Q
special formulas of a first-order logical language, called are called minimal quasi-varieties of n. A minimal
quasi-identities or conditional identities, of the form quasi-variety we is generated by anyone of its non-unit
(\iXl)' .. (VxS> systems. Every quasi-variety with a non-unit system
[P1((fl), ... ,f,,!!)&'" & Pk((fk) , ... ,f'!:') ~ contains at least one minimal quasi-variety. If Sl' is a
quasi-variety of Q-systems of finite signature n, all its
~ Po((f) , ... ,f"?~)],
sub-quasi-varieties constitute a groupoid with respect to
where Po, ... ,PkEQp U{=}, and frO), ... ,f'!:') are the Mal'tsev Sl'-multiplication [3].

121
ALGEBRAIC SYSTEMS, QUASI-VARIETY OF

References has free systems FmOIR) of any rank m and


[1] MAL'TSEV,A.I.: Algebraic systems, Springer, 1973 (translated [n = var F Ko (IDe) [1], [2].
from the Russian).
[2] COHN, P.M.: Universal algebra, Reidel, 1981. Let S be a set of identities of the signature n and let
[3] MAL'TSEV, AI.: 'Multiplication of classes of algebraic systems', KS be the class of all n-systems in which all the identi-
Siberian Math. J. 8, no. 2 (1967), 254-267. (Sibirsk Mat. Zh. 8, ties of S are true. If the equality IDe = KS is satisfied for
no. 2 (1967), 346-365)
D.M. Smirnov a variety IDe of signature n, S is known as a basis for
[n. A variety IDe is known as finitely baseable if it has a
Editorial comments. In the Western literature, quasi-
identities are commonly called Horn sentences (cf. [A1]). finite basis S. For any system A, a basis of the variety
For a categorical treatment of quasi-varieties, see [A3]; for var A is also known as a basis of identities of the system
their finitary analogue, see [A2]. Mal'tsev's article [A3] may A. If [n is a finitely-baseable variety of algebras of a
also be found in [A4] as Chapt. 32. finite signature and if all algebras of IDe have distribu-
References tive congruence lattices, then each finite algebra A of
[A1] HORN, A: 'On sentences which are true of direct unions of [n has a finite basis of identities [10]. In particular, any
algebras', J. Symbolic Logic 16 (1951), 14-21. finite lattice <A, V, 1\ > has a finite basis of identi-
[A2] ISBELL, l.R.: 'General functional semantics, I', Amer. J. Math.
94 (1972),535-596.
ties. Any finite group has a finite basis of identities [3].
[A3] KEANE, 0.: 'Abstract Horn theories', in F.w. Lawvere, C. On the other hand, there exists a six-element semi-
Maurer and C. Wraith (eds.): Model theory and topoi, Lec!. group [5] and a three-element groupoid [6] without a
Notes in Math., Vol. 445, Springer, 1975, pp. 15-50.
finite basis of identities.
[A4] MAL'CEV, AI. [AI. MAL'TSEV): The metamathematics of alge-
braic systems. Collected papers: 1936 - 1967, North-Holland, The varieties of n-systems contained in some fixed
1971. variety IDe of signature n constitute under inclusion a
complete lattice L(IDe) with a zero and a unit, known as
AMS 1980 Subject Classification: 08C15
the lattice of subvarieties of the variety [)C. The zero of
ALGEBRAIC SYSTEMS, VARIEI'Y OF - A class of this lattice is the variety with the basis x =y,
algebraic systems (cf. Algebraic systems, class of) of a P(x\, ... ,xn ) (PEn), while its unit is the variety IDe. If
fixed signature n, axiomatizable by identities, i.e. by the variety IDe is non-trivial, the lattice L(IDe) is anti-
formulas of the type isomorphic to the lattice of all fully-characteristic
congruences (cf. Fully-characteristic congruence) of the
(Vx I) ... (VXs)P(jh ... ,f,,),
system F ~o (IDe) of countable rank which is free in IDe
where P is some predicate symbol from n or the equal- [1]. The lattice Lo of all varieties of signature n is
ity sign, while f\, ... ,f,. are terms of the signature n infinite, except for the case when the set n is finite and
in the object variables x \, ... ,Xs A variety of alge- consists of predicate symbols only. The exact value of
braic systems is also known as an equational class, or a the cardinality of the infinite lattice Lo is known [1].
primitive class. A variety of signature n can also be The lattice of all lattice varieties is distributive and has
defined (Birkhoffs theorem) as a non-empty class of n- the cardinality of the continuum [7], [8]. The lattice of
systems closed with respect to subsystems, all group varieties is modular, but it is not distributive
homomorphic images and Cartesian products. [3], [4]. The lattice of varieties of commutative semi-
The intersection of all varieties of signature n which groups is not modular [9].
contain a given (not necessarily abstract) class ~ of n- Atoms of the lattice Lo of all varieties of signature n
systems is called the equational closure of the class ~ are known as minimal varieties of signature n. Every
(or the variety generated by the class ~), and is variety with a non-unit system contains at least one
denoted by var~. In particular, if the class ~ consists minimal variety. If the n-system A is finite and of finite
of a single n-system A, its equational closure is denoted type, then the variety var A contains only a finite
by var A. If the system A is finite, all finitely-generated number of minimal subvarieties [1].
systems in the variety var A are also finite [1], [2]. Let 21:, IS be subvarieties of a fixed variety we of n-
Let !l' be a class of n-systems, let S!l' be the class systems. The Mal'tsev product 2!9J(01S denotes the class
of subsystems of systems of !l', let H!l' be the class of of those systems A of we with a congruence () such that
homomorphic images of the systems from !l', and let (A/()EIS, and all cosets a/() (aEA), which are sys-
II!l' be the class of isomorphic copies of Cartesian pro- tems in we, belong to 2!. If we is the variety of all
ducts of the systems of !l'. The following relation [1], groups and if 21: and IS are subvarieties of it, then the
[2] is valid for an arbitrary non-empty class ~ of n- product 21: IJJ/oSB is identical with the Neumann product
systems: [3]. The product of varieties of semi-groups need not be
var st = HS IIR
a variety. A variety we of n-systems is called polarized
A variety is said to be trivial if the identity x = y is true if there exists a term e(x) of the signature n such that
in each one of its systems. Any non-trivial variety we in each system from we the identities e (x) =e (y),

122
ALGEBRAIC TOPOLOGY

F(e(x), ... ,e(x=e(x) (FED) are true. If ID1 is a. dIes (fibrations, cf. Fibration) and their sections. The
polarized variety of algebras and the congruences in all principal types of mappings considered in algebraic
algebras in !In are commutable, then the Mal'tsev pro- topology are arbitrary continuous, piecewise-linear and
duct 2(ID/0!8 of any subvarieties 2( and !8 in !In is a smooth mappings, and their most important subclasses
variety. One may speak, in particular, of the groupoid are: homeomorphisms, in particular, continuous,
G[(!In) of subvarieties of an arbitrary variety !In of piecewise-linear or smooth (diffeomorphisms); imbed-
groups, dngs, etc. If ID1 is the variety of all groups or dings of one object into another and also immersions
all Lie algebras over a fixed field P of characteristic (local imbeddings). (cf. Homeomorphism; Diffeomor-
zero, then G[(!In) is a free semi-group [I]. phism.)
References A very important notion in algebraic topology is that
[I] MAL'TSEV, A.I.: Algebraic systems, Springer, 1973 (translated of a deformation. The principal types of deformations
from the Russian). include: a homotopy, i.e. an arbitrary continuous
[2] COHN, P.M.: Universal algebra, Reidel, 1981.
[3] NEUMANN, H.: Varieties of groups, Springer, 1967. (smooth, piecewise-linear) deformation of continuous
[4] BlRKHOFF, G.: Lattice theory, Colloq. Pub!., 25, Amer. Math. mappings; an isotopy (continuous, smooth, piecewise-
Soc., 1973. linear), i.e. a deformation of homeomorphisms, imbed-
[5] PERKINS, P.: 'Bases of equational theories of semigroups', J. of
Algebra 11, no. 2 (1968), 298-314.
dings or immersions where in the process of deforma-
[6] MURSKIi, V.L.: 'The existence in three-valued logic of a closed tion the mapping is a homomorphism, an imbedding or
class with finite basis, not having a finite system of identities', an immersion at any moment of time.
Soviet Math. Dokl. 6, no. 4 (1965), 1020-1024. (Dokl. Akad.
The principal internal problems of algebraic topology
Nauk SSSR 163, no. 4 (1965), 815-818)
[7] J6NSSON, B.: 'Algebras whose congruence lattices are distribLl- include the problem of the classification of manifolds
tive', Math. Scand 21 (1967), 110-121. by homeomorphisms (continuous, smooth, piecewise-
[8] BAKER, K.A.: 'Equational classes of modular lattices', Pacific linear), the classification of imbeddings (or immersions)
J. Math. 28 (1969),9-15.
[9] ScHWABAUER, R: 'A note on commutative semi-groups', Proc. with respect to isotopies (regular homotopies), and the
Amer. Math. Soc. 20 (1969), 503-504. classification of general continuous mappings up to
[10] BAKER, K.A.: 'Primitive satisfaction and equational problems homotopy. An important intermediate role in the solu-
for lattices and other algebras', Trans. Amer. Math. Soc. 190
(1974), 125-150.
tion of these problems is played by the problem of the
D.M. Smirnov classification of complexes or manifolds by the so-
Editorial comments. A categorical characterization of called homotopy equivalence or homotopy type.
varieties of algebraic systems was introduced by F.w. The following, somewhat special, problems played an
Lawvere [A1]; for a detailed account of this approach see important role in the development of algebraic topol-
[A2]. ogy.
References I) The problem of imbedding is usually considered
[A1] LAWVERE, F.W.: 'Functional semantics of algebraic theories', not in its general form, but rather as imbedding into a
Proc. Nat. Acad. Sci. USA 50 (1963), 869-873.
Euclidean space. A very important special case is knot
[A2] MANES, E.G.: Algebraic theories, Springer, 1976.
theory (and the theory of links) in three-dimensional
AMS 1980 Subject Classification: 08BXX space, which was one of the origins of algebraic
topology; braid theory is a related case.
ALGEBRAIC TOPOLOGY The branch of 2) An important role in the history of algebraic
mathematics in which one studies such properties of topology was played by the theory of homology invari-
geometrical figures (in a wider sense, of all objects for ants of the position of various sets in a Euclidean space
which one can speak of continuity), and their mappings and the duality laws (cf. Duality in algebraic topology),
into each other, which remain unchanged under con- connecting the homology of a set and its complement.
tinuous deformations (homotopies). In principle, the 3) A number of fundamental results concerned the
objective of algebraic topology is a complete listing of computation of the algebraic number of fixed points of
such properties. The very name of algebraic topology a mapping of a manifold into itself. Many fundamental
originates from the decisive role of algebraic notions facts were discovered for the fixed points of compact
and algebraic methods in solving problems in this field. smooth groups of transformations, including cyclic
The most important classes of objects whose properties groups of finite order.
are studied in algebraic topology include complexes 4) Of major technical importance in the development
(polyhedra, cf. Complex), which may be simplicial, cel- of algebraic topology were methods developed to solve
lular, etc.; manifolds (cf. Manifold), which may be the problems on so-called cobordism: Does a manifold
open, closed or with boundary, and may in tum be with boundary (cobordism) with a given closed mani-
divided into smooth (differentiable), analytic, complex- fold as boundary exist? Problems of this kind originally
analytic, piecewise-linear or topological; and fibre bun- arose in the context of the corp.putation of the homo-

123
ALGEBRAIC TOPOLOGY

topy groups of spheres (cf. Homotopy group). Impor- able in actual examples. and which assume some
tant cases of the cobordism problem are solved in the discrete set of values; the value of the invariant may
language of characteristic classes (cf. Characteristic not vary during deformations in the corresponding
class). class of mappings for the study of which the invariant
5) Many facts are now available concerning the has been constructed. A large number of essential
homology invariants of the singularities of vector fields, invariants, the abundance of algebraic links between
frame fields and tensor fields, as well as of the singular- them and the difficulty in their calculation have deter-
ities of smooth mappings of manifolds, in particular mined the features of modem algebraic topology.
into Euclidean space. The solution of this problem The calculation of algebraic-topological invariants of
leads, in particular, to characteristic classes. An espe- the simplest and most frequently encountered mani-
cially important case is that of stationary points of folds is not always a simple matter. Thus, the calcula-
smooth functions on manifolds or of various function- tion of homology invariants of Lie groups and of many
als on path spaces (extremals); their connection with homogeneous spaces required much effort and involved
homology theory is important in the clarification of the the use of complicated methods. The calculation of
geometrical structure of manifolds and in obtaining homotypy groups is even more difficult. A number of
lower bounds for the number of extremals. the most important homotypy groups for Lie groups
6) The study of the algebraic-topological properties was calculated, against expectation, using the varia-
of an important kind of special spaces - Lie groups - tional theory of geodesics; the knowledge of the table
is closely connected with their algebraic structure, their of these homotypy groups for Lie groups made the
representations and the calculus of variations on Lie classification of certain vector bundles possible.
groups (cf' Lie group). The results obtained for the Most algebraic-topological invariants are a so-called
topological structure of Lie groups form the base of functor on the category of topological spaces of the
numerous methods and facts of algebraic topology type studied. This means, roughly speaking, that the
which apply to arbitrary manifolds. Algebraic topology values of the invariant undergo natural transformations
of homogeneous manifolds is closely related to methods when the spaces are mapped into each other. For
of Lie groups. example, the fundamental groups of arbitrary spaces or
7) A major role in algebraic topology is played by their homology (cohomology) groups (rings) are con-
special invariants connected with various algebraic nected by homomorphisms induced by continuous
structures over the fundamental group. The simplest mappings; the characteristic classes (distinguished ele-
invariants of this type appeared in knot theory and in ments in (co)homology groups) are mapped into each
the theory of three-dimensional manifolds; the develop- other by the homomorphism corresponding to a mor-
ment of their algebraic theory subsequently made sub- phism of manifolds; the result of cohomology opera-
stantial progress, and eventually formed a separate dis- tions performed on an element of the homology (coho-
cipline - stable algebra or algebraic K-theory. mology) is converted, after a continuous mapping of
8) The analysis of the geometrical structure of a very the space, into the result of this operation performed
large number of examples of the simplest and most fre- on the image of this element, etc.
quently encountered manifolds (e.g. Lie groups, homo- One of the principal properties, on which the study
geneous spaces, manifolds of line elements and mani- and the application of almost-all algebraic-topological
folds with discrete groups of motions), in conjunction invariants are based, is the fact that their effective con-
with the fundamental principles of Riemannian struction is usually connected with an essential comple-
geometry, led to the notion of a fibre bundle, which mentary geometrical structure. For instance, the con-
consists of a space (the total or fibre space), a base struction of all the principal invariants up to
space, a projection of the total space onto the base homeomorphism for complexes involves a subdivision
space, a fibre (homeomorphic to a 'fibre' of the projec- into simplices or cells, whereas the result of such a con-
tion), and the structure group of fibre transformations. struction has to be invariant with respect to all continu-
In this context, one of the central problems in algebraic ous homeomorphisms, and even with respect to homo-
topology is the problem of the classification with topy equivalences; examples are the fundamental
respect to homotopy of fibre bundles and their sections. group, the Euler characteristic, the homology group
Principal fibre bundles and vector bundles are espe- (Betti group), the cohomology ring and the cohomology
cially important (cf' Principal fibre bundle; Vector bun- operations (cf. Cohomology operation). In a similar
dle). manner, the structure of all the basic homeomorphism
A universal method for the solution of all principal invariants for smooth manifolds involves their prelim-
problems in algebraic topology involves the construc- inary triangulation, i.e. reduction to complexes, or else
tion of algebraic invariants which are effectively calcul- to a significant extent the use of analytical tools, such

124
ALGEBRAIC TOPOLOGY

as the construction of a homology ring by way of dif- spaces, and products of mappings correspond to the
ferential forms (skew-symmetric tensors) and differen- products of these homomorphisms (functoriality).
tial operations on them, or the construction of charac- 3) A certain form of interconnection is postulated
teristic classes using the singularities of vector fields, between the cohomology of a complex, a subcomplex
frame fields or tensor fields. Moreover, it may also be and a quotient complex (exactness axiom).
necessary to use tools of Riemannian geometry in cer- 4) The so-called excision axiom.
tain cases, e.g. the definitions of the characteristic 5) Normalization: Only the homology groups for a
classes of a manifold or of a fibre bundle in terms of point need be known, and they must be known to van-
the Riemannian curvature play an important role, even ish in non-zero dimensions.
though the result is invariant wih respect to all continu- It was subsequently noted that objects of an alto-
ous homeomorphisms. In this context, the appearance gether different geometrical nature may display all of
of basic, effectively calculable invariants in the history the above properties, except for normalization. Such
of topology involved the difficult problem of proving objects were called generalized or extra-ordinary
the invariance of these quantities. This is another type homology theories. They were used to improve the cal-
of problem dealt with by algebraic topology. Thus, culation methods of algebraic topology. The most
rational characteristic classes (or integrals of classes important example is K-theory, which is based on vec-
over cycles) proved to be topologically invariant and tor bundles over the space studied instead of their
homotopically non-invariant. The totality of charac- cycles or differential forms which served for the con-
teristic classes over the integers proved to be non- struction of the ordinary homology groups. Another
invariant with respect to continuous or even piecewise- important example is cobordism (bordism) theory, in
linear homeomorphisms. which only the mappings of closed manifolds into the
On the contrary, if an invariant is so constructed that space under study are taken instead of arbitrary cycles,
it is invariant (by definition) with respect to continuous and only mappings of manifolds with boundary of
homeomorphisms (or some wider class of transforma- some classes are taken instead of arbitrary boundaries.
tions - homotopy equivalences), then the calculation The combination of the methods of algebraic topol-
of such an invariant is usually difficult. The most ogy with a small number of pure geometric facts
important invariants of this type are the homotopy applied to the structure of manifolds and their
groups of classes of homotopic mappings of a sphere homeomorphism groups eventually resulted in the solu-
into the space under study. The calculation of even the tion (which, from the modem point of view, mayor
simplest examples of homotopy groups involves major may not be considered complete) of classification prob-
difficulties. Thus, the homotopy groups of the spheres lems of manifolds with respect to all kinds of
themselves are only incompletely known, despite the homeomorphisms (smooth, piecewise-linear, continu-
large number of methods which were proposed for such ous), and in the solution of fundamental problems in
calculations in the course of development of algebraic the classification of imbeddings and immersions.
topology (cf. Homotopy group). Strangely enough, problems connected with three-
The method of classification of classes of homotopic dimensional and four-dimensional manifolds, where the
mappings is based on homology theory in conjunction fundamental problems are still (1977) unsolved,
with the theory of fibre bundles with its apparatus of represent an exception.
spectral sequences, as well as cohomology operations Since most of the fundamental internal problems
and their generalizations. The algebraic discipline which have now been solved, modem efforts in algebraic
arose on the basis of the complicated computational topology tend to concentrate on the application of
tools of algebraic topology is known as homological these ideas in other fields.
algebra.
All the basic primary constructions of homology References
theory for complexes and smooth manifolds by way of [1] ATIYAH, M.F.: K-theory: lectures, Benjamin, 1967.
[2] SEIFERT, H. and TmlELFALL, W.: Lehrbuch der Topologie, Chel-
triangulation or differential forms are effectively com- sea, reprint, 1980.
binatorial - algebraic or analytic. However, it was [3] MILNOR, J.: Morse theory, Princeton Univ. Press, 1963.
noted in the course of development of algebraic topol- [4] MILNOR, J.: Lectures on the h-cobordism theorem, Princeton
Univ. Press, 1965.
ogy that homology theory can be completely defined by [5] MILNOR, J.: Singular points of complex hypersurfaces, Princeton
a small number of formal properties (axioms) on which Univ. Press, 1968.
its computational tools are based: [6] MILNOR, J. and STASHEFF, J.: 'Characteristic classes', in Ann.
of Math. Studies, Princeton Univ. Press, 1974.
1) Homotopic invariance of homology groups. [7] SwITzER, R.M.: Algebraic topology - homotopy and homology,
2) The homology groups are homomorphically Springer, 1975.
mapped into each other under continuous mappings of [8] Hu, S.-T.: Homotopy theory, Academic Press, 1959.

125
ALGEBRAIC TOPOLOGY

[9] STEENROD, N.E.: The topology offibre bundles, Princeton Univ. between the category of algebraic tori over k and the
Press, 1951. category of Z-free G-modules of finite rank. An alge-
[10] NOMIZU, K.: Lie groups and differential geometry, Math. Soc.
Japan, 1956.
braic torus over k that is isomorphic to a product of
[11] LERAY, J.: 'Le calcule differentiel et integrale sur une variete groups Gm over its ground field k is called split over k;
analytique complexe. (Probleme de Cauchy III)" Bull. Soc. any algebraic torus over k splits over a finite separable
Math. France 87 (1959),81-180. extension of k. The role played by algebraic tori in the
[12] CHERN, S.S.: Complex manifolds without potential theory,
Springer, 1979. theory of algebraic groups greatly resembles the role
[13] SPRINGER, G.: Introduction to Riemann surfaces, Addison- played by tori in the theory of Lie groups. The study of
Wesley, 1957. algebraic tori defined over algebraic number fields and
[14] ALEKsANDROV, P.S.: Combinatorial topology, Graylock, Roches-
ter, 1956 (translated from the Russian). other fields, such as finite fields, occupies an important
[15] PONTRYAGIN, L.S.: Grundzuge der kombinatorischen Topologie, place in problems of arithmetic and in the classification
Deutsch. Verlag Wissenschaft., 1956 (translated from the Rus- of algebraic groups. Cf. Linear algebraic group;
sian).
[16] PONTRYAGIN, L.S.: Smooth manifolds and their applications in
Tamagawa number.
homology theory, Moscow, 1976 (in Russian). References
[17] SPANIER, E.H.: Algebraic topology, McGraw-Hill, 1966. [1] BOREL, A.: Linear algebraic groups, Benjamin, 1969.
[18] NOVIKOV, S.P.: 'Topology', in R.V. Gamkrelidze (ed.): Modem [2] ONO, T.: 'Arithmetic of algebraic tori', Ann. of Math. (2) 74,
mathematics,fundamental directions, Vol. I, To appear no. 1 (1961), 101-139.
(translated from the Russian). (Original: Moscow, 1986). [3] ONO, T.: 'On the Tamagawa number of algebraic tori', Ann. of
[19] NOVIKOV, S.P.: 'The Cartan-Serre theorem and intrinsic Math. (2) 78, no. 1 (1963),47-73. VE T7 k k'-
homology', Russian Math. Surveys 21, no. 5 (1966), 209-224. y os resens II
(Uspekhi Mat. Nauk 21, no. 5 (1966
AMS 1980 Subject Classification: 20G15
[20] HlRzEBRUCH, F.: TopolOgical methods in algebraic geometry,
Springer, 1978 (translated from the German).
[21] DUBROVIN, B.A., FOMENKO, A.T. and NOVIKOV, S.P.: ALGEBRAIC VARIETIES, ARITHMETIC OF, arith-
'Methods and applications', in Modem geometry, Vol. 1-2,
Springer, 1985 (translated from the Russian).
metical algebraic geometry - The branch of algebraic
[22] DUBROVIN, B.A., FOMENKO, A.T. and NOVIKOV, S.P.: geometry in which one studies properties of algebraic
'Methods of homology theory', in Modem geometry, Vol. 3, varieties defined over fields of so-called arithmetic type,
Springer, To appear (translated from the Russian). i.e. finite, local and global fields of algebraic numbers
S.P. Novikov or algebraic functions. In the case of a finite field, its
Editorial comments. The axioms 1) - 5) for a main subject is the study of the number of rational
(co)homology theory are often known as the points of the algebraic variety in this field and in finite
Eilenberg - Steenrod axioms (cf. Steenrod - Eilenberg extensions of it. The zeta-function of the variety which
axioms), [A3]. Some additional good references are [A1J, is used in these studies strongly influenced the develop-
[A2] , [A4] - [A6].
ment of the methods of algebraic geometry. Estimates
References from below of the number of (rational) points [1], [4]
[A1] ADAMS, J.F.: Stable homotopy and generalized homology,
are also important.
Univ. of Chicago Press, 1974.
[A2] BOIT, R. and Tu, L.W.: Differential forms in algebraic topol- If X is an algebraic variety (or scheme) over a local
ogy, Springer, 1982. field K with field of residues k, then the study of the
[A3] ElLENBERG, S. and STEENROD, N.: Foundations of algebraic set X (K) of rational points with values in K combines
topology, Princeton Univ. Press, 1952.
[A4] KAROUBI, M.: K-theory. An introduction, Springer, 1978. two different problems: To find solutions of
[AS] VAISMAN, J.: Cohomology and differential forms, M. Dekker, congruences (or points of the variety over a finite field)
1973. and to find integral or rational solutions of Diophan-
[A6] WHITEHEAD, G.W.: Elements of homotopy theory, Springer,
1978. tine equations (cf. Hasse principle). If the variety X is
defined by a set of equations with coefficients from the
AMS 1980 Subject Classification: 55-XX
ring A of integral elements of the field K, then it is pos-
sible to define the reduction of this variety by the same
ALGEBRAIC TORUS - An algebraic group that is set of equations, but with coefficients taken modulo the
isomorphic over some extension of the ground field to maximal ideal of A. A 'variety' X 0 over the field of
the direct product of a finite number of multiplicative residues k and a canonical mapping, or reduction,
groups Gm The group T of all algebraic homomor-
phisms of an algebraic torus T in Gm is known as the Red: X(K) __ Xo(k)
character group of T; it is a free Abelian group of a are obtained. This description of the reduction is diffi-
rank equal to the dimension of T. If the algebraic torus cult to explain in terms of classical algebraic geometry.
T is defined over a field k, then T has a G-module This was one of the reasons for the introduction of the
structure, where G is the Galois gr~oup of the separable concept of a scheme - a language suitable for a
closure of k. The functor T ~ T defines a duality rigorous description of the process. The main problem

126
ALGEBRAIC VARIETY

is to determine the iniage of the mapping Red, i.e. to algebraic variety was subjected to significant algebrai-
find the points x EXo(k) that come from the rational zation, which made it possible to consider algebraic
K-points of the variety. The Hensel lemma states that x varieties over arbitrary fields. A. Wei! [6] applied the
is such a point if x is a non-singular point. For more idea of the construction of differentiable manifolds by
general results on this subject see [4]. glueing to algebraic varieties. An abstract algebraic
Another type of problems concerned with the local variety is obtained in this way and is defined as a sys-
arithmetic of algebraic varieties is the study of forms tem (Va) of affine algebraic sets over a field k, in each
over such fields. Let F be a form in n variables of one of which open subsets WaP C Va, corresponding to
degree d over a local field; Artin's conjecture is that if the isomorphic open subsets W,Ba C V,B, are chosen. All
n>d2 , then the equation F=O has a non-trivial solu- basic concepts of classical algebraic geometry could be
tion. It is known that this statement is true in the case transferred to such varieties. Examples of abstract alge-
of function fields. It was proved for p-adic fields that braic varieties, non-isomorphic to algebraic subsets of a
for each d there exists a finite number A (d) of primes projective space, were subsequently constructed by M.
such that Artin's conjecture holds for forms of degree Nagata and H. Hironaka [2], [3]. They used complete
d, if P gA(d). It was shown in 1966 that the set A (4) is algebraic varieties (d. Complete algebraic variety) as the
non-empty, which proved Artin's conjecture to be false analogues of projective algebraic sets.
[4]. It is not yet (1977) known whether or not it is valid J.-P. Serre [5] has noted that the unified definition of
for forms of odd degree. differentiable manifolds and analytic spaces as ringed
The arithmetic of algebraic varieties over global topological spaces has its analogue in algebraic
fields is the largest and most diverse part of algebraic geometry as well. Accordingly, algebraic varieties were
geometry. It includes Diophantine geometry, class field defined as ringed spaces (cf. Ringed space), locally iso-
theory, the theory of zeta-functions of varieties, and morphic to an affine algebraic set over a field k with
complex multiplication of Abelian functions (or the Zariski topology and with a sheaf of germs of regu-
varieties). All these theories were developed in parallel lar functions on it. The supplementary structure of a
for number and function fields. Such a possibility was ringed space on an algebraic variety makes it possible
first demonstrated by the development of class field to simplify various constructions with abstract algebraic
theory in the nineteen-thirties; it is based on the rough varieties, and study them using methods of homological
analogy between these fields which is most clearly man- algebra which involve sheaf theory.
ifested in the construction of the theory of schemes. At the International Mathematical Congress in Edin-
References burgh in 1958, A. Grothendieck outlined the possibili-
[1) BOREVICH, Z.I. and SHAFAREVICH, I.R.: Number theory, Acad. ties of a further generalization of the concept of an
Press, 1966 (translated from the Russian).
algebraic variety by relating it to the theory of schemes.
[2) WElL, A.: 'Number theory and algebraic geometry', in Proc.
Internat. Math. Congress Cambridge, 1950, Vol. 2, 1950, pp. After the foundations of this theory had been esta-
90-100. blished [4] a new meaning was imparted to algebraic
[3) GROTHENDIECK, A. and DIEUDONNE, 1.: 'Elements de varieties - viz. that of reduced schemes of finite type
geometrie algebrique', Publ. Math. IHES 4;8;11;17;20;24;28;32.
[4) PARSHIN, A.N.: 'Arithmetic on algebraic varieties', J. Soviet over a field k, such affine (or projective) schemes
Math. 1, no. 5 (1973), 594-620. (Itogi Nauk. Algebra Topol. became known as affine (or projective) varieties (cf.
Geom 19701 (1970), 111-152) Scheme; Reduced scheme). The inclusion of algebraic
[5) SWINNERTON-DYER, H.P.F.: 'Applications of algebraic
geometry to number theory', in Proc. 1969 summer inst. number
varieties in the broader framework of schemes also
theory, Proc. of Symp. Pure Math., Vol. 20, Amer. Math. Soc., proved useful in a number of problems in algebraic
1971.
A.N. Parshm
. geometry (resolution of singularities; the moduli prob-
lem, etc.).
AMS 1980 Subject Classification: 14J20, 14K15, 14GXX Another generalization of the concept of an algebraic
variety is related to the concept of an algebraic space.
ALGEBRAIC VARIETY - One of the principal Any algebraic variety over the field of complex
objects of study in algebraic geometry. The modern numbers has the structure of a complex analytic space,
definition of an algebraic variety as a reduced scheme which makes it possible to use topological and tran-
of finite type over a field k is the result of a long evolu- scendental methods in its study (d. Kahler manifold).
tion. The classical definition of an algebraic variety was Many problems in number theory (the theory of
limited to affine and projective algebraic sets over the congruences, Diophantine equations, modular forms,
fields of real or complex numbers (d. Affine algebraic etc.) involve the study of algebraic varieties over finite
set; Projective algebraic set). As a result of the studies fields and over algebraic number fields (d. Algebraic
initiated in the late nineteen-twenties by B.L. van der varieties, arithmetic of; Diophantine geometry; Zeta-
Waerden, E. Noether and others, the concept of an function in algebraic geometry).

127
ALGEBRAIC VARIETY

References number of connected components; this is also true in


[1) BALDASSARRI, M.: Algebraic varieties, Springer, 1956. the general case [2].
(2) SHAFAREVICH, I.R.: Basic algebraic geometry, Springer, 1977
(translated from the Russian).
Families of automorphisms are considered in the
(3) DOLGACHEV, I.V.: 'Abstract algebraic geometry', J. Soviet modern approach to automorphism groups of algebraic
Math. 2, no. 3 (1974), 264-303. (Itogi Nauk. i Tekhn. Algebra varieties. A family of automorphisms of a variety X
Topol. Geom 10 (1972),47-112)
with parameter scheme T is a set of automorphisms of
(4) GROTIJENDIECK, A. and DIEUDONNE, J.: 'Elements de
geometrie algebrique', Publ. Math. IHES 4 (1960). the product XX T that commute with projection onto
(5) SERRE, J.-P.: 'Faiseaux algebriques coherentes', Ann. of Math. the second factor; the set of families of automorphisms
(2) 61, no. 2 (1955), 197-278. with parameter scheme T is denoted by AutT(XX T).
(6) WElL, A.: Foundations of algebraic geometry, Amer. Math. Soc.,
1946. One thus obtains a contravariant functor
I. V. Doigachev TI-+AutT(XX T). If the variety X is complete, then this
AMS 1980 Subject Classification: 14A10 functor is locally representable (cf. Representable
functor) by an algebraic group scheme with at most a
ALGEBRAIC VARIETY, AUTOMORPIDSM OF AN countable number of connected components [3]. A.
- An invertible morphism of an algebraic variety (or Grothendieck gave a proof of this fact for projective
scheme) into itself. The group of all automorphisms of varieties, and this theorem has been extended to the
an algebraic variety X, which is usually denoted by case of proper flat schemes of morphisms. The scheme
Aut X, is an important invariant of X. Studies of the representing this functor is not necessarily reduced even
action of the automorphism group of an algebraic if X is a smooth projective surface; however, if the
variety on objects functorially connected with X such as characteristic of the ground field is zero, or if X is a
the Picard group; the Chow ring; the K-functor, and the smooth curve or a smooth hypersurface, then the con-
cohomology group are a tool used in the study of the nected component of the unit of this scheme is a
varieties themselves. The automorphism group of an variety.
algebraic variety is important for the concept of forms For incomplete varieties the automorphism functor is
(cf. Fonn) of an algebraic variety. For complete alge- not always representable in the category of schemes.
braic varieties over the field of complex numbers, the For an affine variety, the automorphism functor is
automorphism group is identical with the group of representable in the category of inductive limits of
biholomorphic automorphisms. schemes.
The structure of the group Aut X is known for a Apart from the simple case of the affine straight line,
number of simple algebraic varieties. For instance, if X for the affine spaces only the automorphism group of
is projective n-dimensional space pn over a field k, the affine plane is known. It is a free product of two of
then anyone of its automorphisms is a linear projective its subgroups with as amalgamated subgroup their
transformation and Aut pn becomes identical with the intersection, viz. the subgroup of linear affine transfor-
projective linear group PLG(n + 1, k). The automor- mations and the subgroup of triangular automorphisms,
phism group of an elliptic curve and, in general, of any i.e. transformations of the form
Abelian variety A, is an extension of the group G of
x' = ax+b,
automorphisms which preserve the structure of the
Abelian variety, by the group A (k) of translations in y' = cy +f(x},
the points of A, i.e. the sequence of groups where a, b, cEk, a*O, c*O, while f(x) is an arbitrary
1 ~ A (k) ~ AutA ~ G ~ 1 polynomial in x [4], [5]. For the treatment of affine
algebraic surfaces transitively acted upon by the auto-
is exact. If X is a smooth, complete algebraic curve of morphism group, see [6].
genus g> 1, then the group Aut X is finite; an estimate
of its order as a function of g is known (cf. Algebraic References
curve). For automorphisms of surfaces, see Algebraic [II MATSUMURA, H. and MONSKY, P.: 'On the automorphisms of
hypersurfaces', J. Math. Kyoto Univ. 3 (1964),347-361.
surface. [2) MATSUSAKA, T.: 'Polarized varieties, fields of moduli and gen-
In the case of algebraic varieties with an ample eralized Kummer varieties of polarized Abelian varieties',
canonical or anti-canonical invertible sheaf the auto- Amer. J. Math. 80 (1958), 45-82.
[31 MATSUMURA, H. and OORT, F.: 'Representability of group
morphism group is an algebraic subgroup of the group
functors and automorphisms of algebraic schemes', Invent.
PLG(N, k) for some N. The automorphism group of a Math. 4 (1967), 1-25.
smooth hypersurface of dimension n;;;' 2: and degree [4] ENGEL, W.: 'Ganze Cremona-Transformationen von Prim-
d~3 is finite [1]. zahlgrad in der Ebene', Math. Ann. 136 (1958),319-325.
[5] SHAFAREVICH, I.R.: 'On some infinite-dimensional groups',
In the above examples, Aut X has the natural struc- Rend. di Mat. e Appl. 25 (1966),208-212.
ture of an algebraic group, perhaps with an infinite [6] GIZATULLlN, M.KH.: 'Quasi homogeneous affine surfaces',

128
ALGOL-68

Math. USSR-Izv. 5, no. 5 (1971), 1057-1082. (lzv. Akad. Nauk symbols of arithmetical and logical operations, other
SSSR Ser. Mat. 35 (1971), 1047-1071) special symbols and certain English words (e.g., 'begin',
[7] ROTH, L.: Algebraic threefolds, Springer, 1955.
'end', 'real', 'integer', 'array'). From the basic symbols
VI. Danilov
of the language further entities are formed according to
VA. Iskhovskikh
certain rules, such as numbers, identifiers (names), sim-
AMS 1980 Subject Classification: 14JXX, 14A 10 ple variables, subscripted variables, function designa-
tors, expressions, statements, and comments. There are
ALGEBRAICALLY CLOSED FIELD - A field k in several basic types of statements: an assignment state-
which any polynomial of non-zero degree over k has at ment, a goto statement, a conditional statement which,
least one root. In fact, it follows that for an algebrai- depending on the outcome of the logical expression
cally closed field k each polynomial of degree n over k contained in it, actuates one of its own inner state-
has exactly n roots in k, i.e. each irreducible polynomial ments, and a for statement (a loop). A group of state-
from the ring of polynomials k[x] is of degree one. A ments may be combined into a compound statement or
field k is algebraically closed if and only if it has no a block (which contains declarations). The Algol nota-
proper algebraic extension (d. Extension of a field). For tion of an algorithm may include definitiOns of pro-
any field k, there exists a unique (up to isomorphism) cedures. The definition of a procedure includes a head-
algebraic extension of k that is algebraically closed; it ing and a body. The body of a procedure may be a
is called th~ algebraic closure of k and is usually statement (most often a block) described in the usual
denoted by k. Any algebraically clo~ed field containing Algol notation. In order to render the Algol language
k contains a subfield isomorphic to k. more flexible, procedures expressed in some other
The field of complex numbers is the algebraic closure language (e.g. machine language) may be employed. A
of the field of real numbers. This is the fundamental procedure may be called with the aid of a procedure
theorem of algebra (d. Algebra, fundamental theorem statement consisting of the procedure identifier and a
of). list of actual parameters corresponding to the formal
References parameters provided in the heading of the procedure
[I] O. and SAMUEL, P.: Commutative algebra, I, Springer,
ZARISKI, with the given identifier. An expression may contain
1975. function designators, which call the procedure for the
[2] LANG, S.: Algebra, Addison-Wesley, 1974.
computation of a value. The procedure call may be
o.A. Ivanova recursive, i.e. a call which provides for the re-call of the
AMS 1980 Subject Classification: 12F05 same procedure while executing it. The language poten-
tialities of standard Algol are often restricted in its con-
ALGOL - A general name for a number of algo- crete implementations. The proposed successor to
rithmic languages (d. Algorithmic language) used for Algol-60 is the language A1gol-68, which has a signifi-
automatic programming and for the publication of algo- cantly different structure, contains many new concepts
rithms (it is an abbreviation of 'ALGOrithmic and possibilities, and is designed for more powerful
Language'). machines.
The first variant of Algol was developed in 1958 by References
an international scientific team. The 1960 International [lA] NAUR, P., ET AL.: 'Revised report on the algorithmic language
Conference in Paris adopted the language 'Algol-60', Algol 60', Num. Math. 4 (1963), 420-453.
[IB] NAUR, P., ET AL.: 'Revised report on the algorithmic language
which combined several useful features of the program-
Algol 60', Comm ACM 6 (1963), 1-17.
ming languages then known. Algol-60, and its descen- [lC] NAUR, P., ET AL.: 'Revised report on the algorithmic language
dent like Pascal, gained widespread use and the name Algol 60', The Computer Journal 5 (1963), 349-367.
Algol now usually refers to 'Algol-60'. It is especially [2] LAVROV, S.S.: A universal programming language (Algol-60),
Moscow, 1967 (in Russian).
suited for the description of algorithms of numerical [3A] WUNGAARDEN, A. VAN, ET AL.: 'Report on the algorithmic
analysis. The language is machine-independent and language Algol 68', Num. Math. 14 (1969), 79-218.
does not provide standard ways of performing input or [3B] WUNGAARDEN, A. VAN, ET AL.: 'Revised report on the algo-
rithmic language Algol 68', Acta Inform. 5 (1975), 1-236.
output operations. Special concrete versions of standard
V V Martynyuk
Algol may be developed for individual computers, and
each one is a language accepted by the compiler of the AMS 1980 Subject Classification: 68BXX
individual machine. As a rule, the transition from the
standard language to its concrete version is natural and ALGOL-68 - A universal algorithmic language. It
does not involve much labour. The basic symbols in was developed during 1964 - 1968 by a team of scien-
Algol are decimal numbers, the upper-case and lower- tists from 12 countries, in the framework of the work-
case letters of the English alphabet, punctuation marks, ing group on Algol of the International Federation for

129
ALGOL-68

Information Processing, for the exchange of algorithms, sible to give a description of the execution of a pro-
their effective realization by various computers, and as gram using of a small number of independent funda-
a means of their study. While similar in style to Algol- mental concepts. The objects may be external (with
60, Algol-68 differs substantially from it in having respect to the structure of the program) and internal
many more constructions, which are also of a more (data, including procedures and names). The relations
general nature. The basic data types, in addition to the between external (E) and internal (I) objects are intro-
Algol-60 'real', 'integer' and 'Boolean', are 'character' duced as axioms, e.g. 'E) contains ~', 'E) identifies
(for alpha-numeric information), 'format' (to describe ~', 'E has 1', 'I) names 12 ', ' I) is a component of 12 ',
the format of the external data), names and routines etc. The execution of a program is described in terms
(procedures). Thus, names and procedures can be 'com- of the relations thus introduced as a function of the
puted' when a program is executed in Algol-68, even analysis of the program.
though such a computation is limited by the necessity A typical syntactic feature of Algol-68 is that it is
of selecting the value of, for example, a procedure out given in the form of a two-level grammar, in which the
of a given finite population of 'routine texts'. The basic Algol-68 production rules are themselves well-formed
types may be used to construct new, composite types texts in some meta-language, defined by its own gen-
by induction; these types may be homogeneous index- erating grammar. For example, grammatical rules of
able sequences of data of the same type (multiple Algol-68 may have the following form:
values), or else ordered lists of data of arbitrary type chain of NOTIONs separated by SEPARATORs:
(structured values). NOTION;
In addition to the conventional approach for defining NOTION, SEPARATOR,
procedures, Algol-68 contains means for defining so- chain of NOTIONs separated by SEPARATORs.
called infix operators of the type x +y. The presence of reference to MODE assignation:
a priority definition makes it possible to establish prior- reference to MODE destination,
ity relations between the infix operators introduced. becomes symbol, MODE source.
The identity definition, which is typical of Algol-68, is a The capitalized words, called 'metanotions', are the
general construction for defining variables, for assign- grammatical units of the meta-language, whose produc-
ing initial values, for the transmission of actual parame- tion rules may have forms such as:
ters in procedures and for creating synonyms. NOTION: identifier; operator.
An expression in Algol-68 may contain an assign- SEPARATOR: comma; semi-colon.
ment statement, or any sequence of statements that MODE: integral; Boolean; reference to MODE.
produces a value. Together with the possibility of com- Some of the concepts of the meta-language, such as
puting names and procedures, and also by the intro- MODE, can have an infinite number of productions.
duction of parenthesis pairs for conditional expressions, The proper production rules of Algol-68 are obtained
this permits Algol-68 constructions such as illustrated by systematically replacing the metanotions of the
by the following example: meta-language in the syntax rules by some terminal
1) Algol-68: production of these metanotions. The resulting rules, in
if x >0 then u else z Ii the meta-linguistic notation of Algol-60, may have, for
: = a +(m<n Isin Icos)(t : = xj2), example, the following form:
2) Algol-60: < chain of identifiers separated by commas > :: =
t := xj2; < identifier > I < identifier >, < chain of identifiers
r : = a + if m <n then sin(t) else cos(t); separated by commas >
if x>O then u: =r else z: =r.
< reference to integer assignation> :: = < refer-
ence to integer destination > : = < integer source >.
An Algol-68 program consists of closed, serial, condi- The use of a two-level grammar makes it possible,
tional and collateral clauses. The first three types of first, to reduce the number of production rules of the
clauses generalize such Algol-60 concepts as block, same type and, secondly, to give a syntactic expression
compound statement and conditional expression and to the attributive information of the notions and to
statement. Collateral clauses signify non-ordered sets of ensure contextual relations which must otherwise be
component phrases and are used, in particular, to indi- formulated as context conditions (static semantics).
cate parallel branches in the course of execution of the
program. References
The description of the semantics of Algol-68 is [lA) WUNGAARDEN, A. VAN, ET AL.: 'Report on the algorithmic
language Algol 68', Num. Math. 14 (1969), 79-218.
characterized by a profound analysis of the principal [I B) WUNGAARDEN, A. VAN, ET AL.: 'Revised report on the algo-
concepts of algorithmic languages, which makes it pos- rithmic language Algol 68', Acta Inform. 5 (1975), 1-236.

130
ALGORITHM

[2) LINDSEY, C.H. and MEULEN, S.G. VAN DER: Informal introduc- (d. Constructive object). Thus, the concept of an algo-
tion to Algol 68, North-Holland, 1977. rithm is very general. One may speak of an algorithm
A.P. Ershov
for the translation from one language into another, of
AMS 1980 Subject Classification: 68BXX
an algorithm of an air traffic controller (processing
ALGORITHM - Detailed instructions defining a information on the motions of aircraft according to
computational process (which is then said to be algo- fixed rules), and other examples of algorithmically
rithmic), which begins with an arbitrary input (out of a described control processes. Consequently, the concept
certain number of inputs which are possible for the of an algorithm is one of the central ideas in cybernet-
given algorithm), and with instructions aimed at ics and computer science.
obtaining a result (or output) which is fully determined
Example of an algorithm. Let the possible inputs and
by the input. For instance, the rules taught in elemen-
the possible results be all possible words over the alpha-
tary schools for column-wise addition, subtraction, mul-
bet {a, b }. The transition from a word X to a word Y
tiplication and division are algorithms; in these algo-
is 'permissible' in the following two cases (where P
rithms the possible results are non-negative integers
denotes an arbitrary word): 1) X is of the form aP, and
written in the decimal system, while the possible inputs
Y is of the form Pb; or 2) X is of the form baP, while
are ordered pairs of such numbers. It is, in general, not
Y is of the form Paba. The instructions are formulated
assumed that the result is necessarily obtained: the pro-
as follows: 'designating some word as the input, exe-
cess of applying an algorithm to some possible input
cute the permitted transitions until a word of the form
(i.e. an algorithmic process which proceeds starting
aaP is obtained; then halt and let word P be the
from this input) may also terminate without a result
result'. These instructions constitute an algorithm,
being obtained (in such a case one has a no-result stop)
which will be denoted by m. Take the word babaa as
or may not terminate at all. If the process terminates
the input. After one transition one obtains baaaba;
(does not terminate), then one says that the input is
after the second, one obtains aabaaba. Here the algo-
suitable (is not suitable) for the algorithm in question.
rithm halts with baaba as result. Consider another
An important result in this area is the undecidability
input baaba. One obtains, in succession,
of the so-called halting problem. It is possible to con-
abaaba, baabab, abababa, bababab,babababa,.... It
struct an algorithm a such that there is no algorithm to
can be shown that this process will never end (i.e. none
decide whether or not some arbitrary input which is
of the generated words will ever begin with aa). Now
possible for a is also suitable for a. In particular, one
take abaab as input. One obtains
can find such an algorithm a for which the set of its
baabb, abbaba, bbabab; no further permissible transi-
possible inputs is the set of natural numbers.
tions are possible, and, at the same time, the termina-
The notion of an algorithm is one of the central con-
tion condition is not satisfied. One has a no-result stop.
cepts in modem mathematics, especially in the area of
Thus, input babaa is suitable for m, and inputs baaba
computations. An example is given below. Finding
and abaab are not.
the numerical solution of equations of a given type is
tantamount to constructing an algorithm which con- The significance of algorithms. In science, algorithms
verts an arbitrary equation of this type and an arbitrary are encountered everywhere: a 'general' solution to a
positive rational number t: into a number (or a set of problem requires an algorithm. The ability of man to
numbers) which differs from the root(s) of this equation add numbers is an example of this. Not the fact that he
by less than t:. However, the term 'computational pro- can find, sooner or later, the sum of any two numbers,
cess', as used in the context of an algorithm, must not but rather in the sense that he possesses a method for
be understood to mean merely numerical calculations: the unique addition of any two numbers given in writ-
even in elementary algebra computations with letters ten form, i.e. an addition algorithm such as the well-
are used. Standard arithmetic calculations involve some known column-wise addition method. 'Generally' solv-
symbols which are not numbers (parentheses, equality ing a problem is formalized as solving an algorithmic
signs, symbols for arithmetic operators). It is accord- problem. An algorithmic problem consists of separate
ingly expedient to consider algorithms involving arbi- problem instances, and a single algorithm needs to be
trary symbols and objects composed from symbols. The found for the solution of all of them. If there is no
simplest example of such an object is a linear sequence such algorithm, one says that the algorithmic problem
of symbols forming a word. It is also possible to con- is unsolvable. Thus, the problem of numerically solving
sider 'non-linear' objects - algebraic matrices, deriva- equations of a given type and the problem of automatic
tion trees of some formal language and general graphs. translation are algorithmic problems. Their problem
Intuitively, the least requirement on inputs and results instances are, respectively, to find the numerical solu-
of algorithms is that they must be constructive objects tion of individual equations of the given type, and the

131
ALGORITHM

translation of individual phrases. Other examples are 307 307 307 307
the verification of algebraic equalities in algebra; the - 49 - 49 - 49 - 49
algorithmic problem of recognition of the deducibility - - 8 ----s8 258'
of propositions from given axioms in mathematical It will be noted that, in a series of successive con-
logic. (In mathematical logic the concept of an algo- structive objects, each successive constructive object is
rithm is also important since it serves as the basis of fully determined (in the framework of the given algo-
the key concept of a calculus, which is a generalization rithm) by the constructive object immediately preceding
and a more precise form of the intuitive concepts of it. In the more rigorous approach, it is also assumed
'proof and 'demonstration'.) The establishment of the that the transition from any constructive object to the
unsolvability of a given algorithmic problem (e.g. the one immediately following it is sufficiently 'elementary'
problem of establishing the truth or demonstrability of - in the sense that the one-step transformation of the
sentences in some logical-mathematical language) is preceding into the immediately succeeding object is
important, because it shows that, in principle, specific local (it is not the entire constructive object which is
problem instances of that problem can only be solved transformed, but only its algorithm-limited part; more-
by methods which are specific to each individual prob- over, this transformation itself is not determined by the
lem instance. complete preceding object, but only by a limited part
The scientific importance of the concept of an algo- of it).
rithm has been recognized for a long time. Since the Accordingly, one has not only a set of possible
earliest times, people have looked for constructive inputs and a set of possible results, but also a set of
methods to solve mathematical problems. Such efforts possible intermediate results, representing the working
usually benefitted from the new availability of suitable medium in which the algorithmic process is evolving.
symbolic notations. This process has made a signifi- Regarding m, all these three sets coincide, but this is
cant contribution to scientific knowledge, especially so not true of the algorithm of column-wise subtraction -
after it had become clear that some problems are the possible inputs are pairs of numbers, the possible
inherently unsolvable (squaring the circle, etc.). After it results are numbers (all in the decimal system), while
had been recognized that certain problems cannot be the possible intermediate results are 'three-storey'
solved by direct calculations, the theoretical concept of records of the type
p
a set was born in the nineteenth century. It is only after
-q
a period of rapid development of this concept (which
r
did not involve problems of constructive methods in
their modem sense), that it was possible in the mid- where q is the written record of the number in the
twentieth century to return to constructive problems, decimal system, r is a similar record or the empty word,
but this was done on a different level, enriched by the while p is the written record of a number in the decimal
concept of an algorithm which had crystallized in the system, with dots over some of the digits. As a rule,
meantime. This concept forms the base of the con- several (not independent) parameters which are charac-
structive trend in mathematics. teristic of an algorithm may be distinguished: 1) the set
The word 'algorithm' comes from the word 'algor- of possible inputs; 2) the set of possible results; 3) the
itmi', the latter being the Latin transliteration of the set of possible intermediate results; 4) the starting rule;
name of the nineth century Arab mathematician AI- 5) the direct transition rule; 6) the termination rule;
Khwarizrni. An 'algorithm' in the Europe of the Middle and 7) the rule for the retrieval of the result.
Ages was 'the decimal-positional system and the art of Formalizing the concept of an algorithm. The concept
calculating with it', since it is through the Latin transla- of an algorithm in its general form is a fundamental
tion (twelfth century) of the treatise of AI-Khwarizmi mathematical concept which cannot be defined in terms
that the positional system was introduced in Europe. of simpler concepts. Strictly speaking, formalizations of
The structure of an algorithmic process. An algorithmic the concept of an algorithm actually restrict it some-
process is a process of consecutive conversion of con- what. In each such formalization an exact definition is
structive objects by discrete 'steps', each step consisting given of some class within which each one of the
of the replacement of one constructive object by parameters may vary. The various formalizations differ
another one. Thus, when applying the algorithm 2{ to from each other by the selection of such classes. Since
the word baaba, there follow, in succession, the words the parameters unambiguously define some algorithm. a
baaba, abaaba, baabab, etc. If one applies. say. the algo- choice of parameter domains determines some class of
rithm of column-wise subtraction to the pair of algorithms. However, such a choice may claim to be a
numbers <307, 49>, then one obtains, successively. formalization of the intuitive concept of algorithm. only
the following constructive objects: if one can be sure that for any 'intuitive' algorithm

132
ALGORITHM, COMPLEXITY OF DESCRIPTION OF AN

there exists an equivalent algorithm in the class of algo- of the form qJ~, where p EC, ~EB U D, is replaced by
rithms defined by the choice under consideration. This a word Q in accordance with the rule: a pair with the
requirement is formulated for each formalization as a first term p~ is sought in the program; let the second
fundamental hypothesis, which, in the present state of term of this pair be T/Tq ; if Tis -, Q =qfT/; if T=O,
knowledge, cannot be mathematically demonstrated. Q =qT/; if T is +, Q=fT/q. The word produced by this
The first formalizations of this type are due to E.L. exchange is A'. For references, see [3], [4], [5], [10],
Post [5] and to AM. Turing [3], [4], whose construc- [11], [12], [13] of Algorithms, theory of.
tions anticipated in many respects the ideas on which V.A. Uspenskii
modern computers are based. There are also versions Editorial comments. The fundamental hypothesis men-
formulated by AA Markov [10], [11] (cf. Nonnal algo- tioned above is usually called the Church- Turing thesis or
rithm) and by AN. Kolmogorov [12], [13], who based Turing's hypothesis. This thesis states that what can be
the approach on constructive topological complexes of effectively computed according to one's intuition about what
a certain type, which makes it possible to express more constitutes an effective process, can also be computed by a
precisely the property of 'being local' of a transforma- Turing machine, i.e. a formally defined notion of algorithm.
tion. In each of these formalizations the fundamental By its very nature of identifying an intuitive informal notion
hypothesis is in satisfactory agreement with practice. with a formal precise definition, the Church - Turing thesis
cannot be formally proved.
Another argument in favour of this hypothesis is the
Many formalizations have turned out to be equivalent, and
fact that, as can be demonstrated, all the formal ver-
no formalization has turned out to be strictly more powerful.
sions (of effective computability) which have been pro-
Apart from the formalizations mentioned above, the follow-
posed are equivalent. ing references are the most standard ones in the West:
An an example, consider the formalization proposed Church's formalization of numbers and computable func-
by Turing. In its original form, this was a description tions by lambda terms, Kleene's general recursive schemes,
of some theoretical computer consisting of 1) an infin- and the register machine model proposed by J.C.
ite tape, divided into consecutive cells, each cell capable Shepherdson and H.E. Sturgis, which was extended into a
of containing some symbol out of the 'tape alphabet' of more realistic computer model by SA Cook and RA Reck-
the machine; and 2) a 'finite control,' which at each how. Markov's and Kolmogorov's versions are less well
moment of time is in some 'state' (out of a given finite known. For completeness the original references to Turing
list of states). The finite control can move along the and Post are also given.
tape, one cell at a time, and change the contents of the References
[A1A] TUluNG, A.M.: 'On computable numbers, with an application
cells it passes. A program which monitors the motion
to the Entscheidungsproblem', Proc. London Math. Soc. Ser.
of the finite control constitutes the algorithm for the 242 (1936), 230-265.
computations conducted on such a machine (a 'Turing [A1B] TuIuNG, A.M.: 'Corrections to 'On computable numbers,
algorithm'). For a more exact and more detailed with an application to the Entscheidungsproblem", Proc.
London Math. Soc. Ser. 243 (1937), 544-546.
description see Turing machine. Here a modernized [A2] CHuRCH, A.: 'An unsolvable problem of elementary number
exposition of Turing's construction is given. To define theory', Amer. J. Math. 58 (1936), 345-363.
a Turing algorithm, one specifies a) pairwise disjoint [A3] KLEENE, S.c.: 'General recursive functions of natural
numbers', Math. Ann. 112 (1936),727-742.
alphabets B, D, C, with a distinguished letter A in D
[A4] POST, E.L.: 'Formal reductions of the general combinatorial
and distinguished letters IX and '" in C; and b) a set of decision problem', Amer. J. Math. 65 (1943), 197-268.
pairs of the form <p~, T/Tq> where p, qEC, [A5] SHEPHERDSON, J.C. and STURGIS, H.E.: 'Computability of
~,T/EB U D, while T is one of the three symbols recursive functions', J. Assoc. Compo Mach. 10 (1963),
217-255.
- , 0, +; in this set (which is called a program) there [A6] COOK, SA and RECKHOW, R.A.: 'Time-bounded random
are no two pairs with the same first coordinate. The access machines', J. Computer and System Sciences 7
possible inputs and the possible outputs are words over (1973),354-375.
B, while the possible intermediate results are words AMS 1980 Subject Classification: 03D03, 03D10,
over BUD U C which contain not more than one 68CXX, 68C05
letter of C. The starting rule is that the initial word P is
converted to the word AaPA. The termination rule is ALGORITHM, COMPLEXITY OF DESCRIPTION
that an intermediate result which contains '" is final. OF AN, program-size compleXity - A measure which
The rule for the retrieval of the output is that the out- characterizes the length of description of an algorithm.
put is the sequence of letters in the final intermediate The complexity of description of an algorithm is
result which follows '" and precedes the first letter not defined differently, depending on the specific defini-
belonging to B. The transformation rule which converts tion. At present (1987) there is as yet no generally
A into A' is as follows. The letter A is written to the left valid definition of the concept, and the most frequently
and to the right of A; in the word thus formed, the part occurring cases are reviewed below.

133
ALGORITHM, COMPLEXITY OF DESCRIPTION OF AN

The complexity of description of a nonnal algorithm considered the following problem: To construct, for
is usually understood to mean the length of its encod- any function of the algebra of logic in N variables, an
ing, i.e. the length of all its substitution formulas when encoding of a normal algorithm in the alphabet
written in one line (a special separating character is cP = {O, 1, a, b, c} which realizes the function and which
interposed between the individual formulas). The com- has least complexity among all such algorithms. It has
plexity of description of a Turing machine usually been shown [1], [4] that the complexity of a normal
means the number of its internal states and external algorithm which solves this problem is of the order 2N.
symbols. A Turing machine may also be characterized Another problem which has been studied is the com-
by the number of commands of the machine in ques- plexity of algorithms which solve, for the first n natural
tion. In the case of recursive functions (cf. Recursive numbers, the problem of belonging to a recursively
function) given by recursion schemes, the number of enumerable set (the complexity of n-segments of recur-
letters in these schemes usually serves as the measure of Sively enumerable sets). It was shown that in the case of
their complexity. normal algorithms, the order of such a complexity is
An axiomatic definition of the complexity of descrip- not more than 10& n, and that this estimate cannot be
tion of an algorithm has been proposed [2]. Below the reduced in the general case. There also exist sets
application of this definition to Turing machines will defined by simple diagonalization of which the n-
be discussed. Let (Mj ), i =0, 1, ... , be the natural segments have complexity order n. It was also shown
numbering of Turing machines characterized by the that if the time of operation of algorithms is general-
fact that the machine itself (i.e. its program) may be recursively bounded, then the complexity of n-segments
effectively reconstituted from its number, and the of recursively enumerable sets may increase exponen-
number of the machine may be effectively reconstituted tially, and the exponent may attain value n.
from the machine (i.e. from its program). A general Complexity of description of an algorithm is used in
recursive function s is a measure of the complexity of the formalizations of the problem of the minimum com-
machines (s(i) being the complexity of the machine M j ) plexity of an algorithm which constructs some finite
if and only if: a) for any y there exists not more than a object. This minimum complexity is often named sim-
finite number of machines with complexity y; and b) ply the complexity of the finite object (for this particular
there exists an effective procedure which makes it pos- specification of the complexity of description of an
sible to determine, for any y, all the machines with algorithm).
complexity y. A definition of the complexity of finite objects was
Let s be an arbitrary measure of the complexity of a first proposed by AN. Kolmogorov (cf. Algorithmic
Turing machine. If U is an arbitrary, effectively (i.e. information theory). It was found that the following
algorithmically) enumerable infinite subclass of Turing asymptotically exact relations exist between the com-
machines, then there exists a machine T belonging to plexity K(x) according to Kolmogorov, the complexity
V, and there exists a machine T' (which mayor may Mm(x) of the same objects when expressed through the
not belong to V) such that T' and T compute the same length of the encoding of a normal algorithm into an
function, and the complexity of T' is much lower than m-letter alphabet, and the complexity Tm(x) expressed
that of T. Hence it follows, in particular, that there by the number of internal states of a Turing machine
exist primitive recursive functions whose lowest com- with an external m-letter alphabet:
plexity in the primitive recursive form (i.e. through M () ...KJ& T ( ) K(x)
schemes of primitive recursion) is much higher than m x ~ lO&1 m ' m x ~ (m-l)log2 K (x)'

their lowest complexity in the general recursive form References


(i.e. through schemes of general recursion). Let the [I] MARKov, A.A.: 'Nonnal algorithms connected with the com-
complexity of normal algorithms and of Turing putation of Boolean functions', Izv. Akad. Nauk SSSR Ser.
Mat. 31 (1967), 161-208 (in Russian).
machines mean, respectively, the length of the encoding [2] BLUM, M.: 'A machine independent theory of the complexity
and the number of internal states. Then it will be pos- of recursive functions'. J. ACM 14 (1967),322-336.
sible to realize any function of the algebra of logic in N [3] KUZ'MIN, V.A.: 'Realization of functions of the algehra of
logic by automata, nonnal algorithms and Turing machines',
variables (cf. Boolean function) [3]: a) by a normal
Problemy Kibernet. 13 (1965). 75-96 (in Russian).
algorithm in an m-Ietter alphabet with complexity [4] PETRI, N.V.: 'Algorithms connected with predicates and
~2N /10& m; and b) by a Turing machine with an m- Boolean functions', Soviet Math. Dokl. 10, no. 2 (1969), 294-
letter external alphabet with complexity 297. (Dokl. Akad. Nauk SSSR 185. no. I (1969). 37-39)
[5] ZVONKIN, A.K. and LEVIN, L.A.: 'The complexity of finite
~2N / N(m -1). The study of the complexity of algo- objects and the developments of the concepts of information
rithms solving the finite restrictions of algorithmically and randomness hy means of the theory of algorithms'. Rus-
unsolvable problems (the so-called bounded algorithmic lian Math. Surveys 25, no. 6 (1970), 82-124. (Ulpekhi Mat.
Nauk 25. no. 6 (1970), 85-127)
problems) began in the nineteen-sixties. AA Markov Ya.M. Barzdin'

134
ALGORITHM, COMPUTATIONAL COMPLEXITY OF AN

Editorial comments. exemplified by the speed-up theorem (see below), such


References a formulation is not always well-posed. The real prob-
[A1] liARTMANIS. J. and HOPCROYf, J.E.: 'An overview of the lem may be the description of the rate of growth of the
theory of computational complexity', J. ACM 18 (1971), cost function R a , where a computes f For example,
444-475.
find upper and lower bounds for the computational
AMS 1980 Subject Classification: 68C25, 68C05 complexity of f, i.e. two functions G(x) and g(x) such
that there exists a computation a for the function f for
ALGORITHM, COMPUTATIONAL COMPLEXITY which Ra(x)~G(x), and such that for any algorithm /3
OF AN - A function giving a numerical estimate of the which computes f the function R /3 in a certain sense
difficulty (amount of time/storage involved) of the pro- majorizes g.
cess of application of an algorithm to the inputs. A Another important branch of the theory of computa-
more precise definition of the computational complex- tional complexity is the study of complexity classes -
ity of an algorithm is the concept of a cost function sets of functions for which there exist computations
(step-counting function) - defined as a decidable rela- with a complexity not exceeding some bound out of the
tion between the objects to which the algorithm is set of bounds which defines the class. This branch also
applicable and the natural numbers, and which has a includes problems of comparative computational com-
range of definition coinciding with the range of appli- plexity for various types of algorithms (automata): the
cability of the algorithm. transition to another algorithmic system and another
Usually, the time and space characteristics of algo- measure of complexity is usually equivalent to consider-
rithmic processes are considered. Thus, for a Turing ing a new measure for the first algorithmic system.
machine M the time cost function (duration of work) Below a few fundamental results are given which are
TM(P) is the number t of cycles of work of M required independent of the choice of measure of complexity
for the conversion of the initial configuration of Pinto (and of the formalization), as long as there exists, for
the final configuration; the memory cost function (or example, an effective mutual simulation of the algo-
space function) SM(P) is defined as the number of cells rithms of the type under consideration and ordinary
on the tape scanned by the head of the machine. Time Turing machines. (For the sake of simplicity, it is
and storage costs for normal algorithms (d. Nonnal assumed that the subject is the computation time of
algorithm), iterative arrays, multi-head and multi-tape Turing machines computing functions from non-
Turing machines, etc., are defined in a similar manner. negative integers to non-negative integers.) Let T and h
The common feature of such cost functions is the be computable integer-valued functions (d. Computable
fact that there exists an effective procedure establishing function) on the domain of the algorithms, and let f be
for any algorithm a (i.e. in particular for a Turing a function defined on this domain which can assume
machine or, rather, for its program), any input x and two values only - say, 0 and l.
any non-negative integer t, whether or not the process First, there exist functions whose computation may
of applying a to x will terminate with complexity t. be as complex as one pleases. More exactly, for any
This observation leads to the abstract theory of compu- function T there exists a computable function f such
tational complexity [1]. An effective procedure r is that for any algorithm a which computes f, the inequal-
called a computational measure if: 1) it always ends in ity
the result 0 or 1 when applied to triplets of the form Ra(x) > T(x)
<algorithm, input, natural number>; and 2) it has the is false only in a finite number of cases.
property that for any algorithm a and any input x the Secondly, there exist functions, any computation of
equality rea, x, t)= 1 is true for not more than one which may in principle be improved as much as one
natural number t, this t existing if and only if the pro- pleases. More precisely, the speed-up theorem is valid:
cess of application of a to x eventually terminates. The For any function h (e.g. h(t)=2/) it is possible to find a
cost function Ra with respect to the measure r for a is computable function f such that for any algorithm a
introduced if and only if rea, x, t)= 1, and then which computes f it will always be possible to find (no
Ra(x)=t. effective procedure is assumed) an algorithm /3 which
This last equality is equivalent to the statement 'the also computes f and such that for all x (except for a
computational complexity a on x is t (in the measure finite set)
r)'. h(Rf3(x)) < Ra(x).
Given some computational measure, one can con-
In case h (t) = 21 this yields
sider the complexity of computation of a given function
f - for example, that of finding an algorithm a which Rf3(x) < 10& Ra(x)
computes f 'better than other algorithms'. However, as for almost all x.

135
ALGORITHM, COMPUTATIONAL COMPLEXITY OF AN

For the computational measures of time and space deterministic automata and probabilistic automata (cf.
(for Turing machines), the result of the speed-up Automaton, probabilistic). In the non-deterministic case
theorem is valid for most computations in the following the computational device can choose both ways when
form: If f is computable with complexity R (n), it is presented with a choice of how to continue the compu-
also computable with complexity R(n ) / 2 (one says tation, by 'splitting' into two independent copies, each
that f is computable with complexity R (n) if, for some of which has the same capabilities as its parent. Its
a which computes it, Ra(P)~R(n) for all words P of computational complexity is understood to mean the
length n in the alphabet under consideration). For this complexity of the 'simplest' successful computation
reason time and space computational complexities of trace. In the probabilistic case, the actual course of the
concrete functions are often order of magnitude esti- computation is determined not only by the program
mates. Some specific results are quoted below. and the argument, but also by a random sequence of
The time to decide the equality of words (of length numbers, and the result should coincide, to a great
n) is of order n 2 for Turing machines and normal algo- degree of probability, with the value of the function
rithms; any property of words which may be decided being computed. In both cases the computational com-
on Turing machines in a time asymptotically less than plexity can sometimes be proved to be less than that of
n lo~ n can also be decided in time n; computations by deterministic computations for the same problem.
a Turing machine of time T(n) (with the order of The quality of algorithms is evaluated not only by
T(n)~~2) can be simulated by a Turing machine with their computational complexity, but also by the com-
space T(n). Furthermore, the multiplication of two plexity of description an algorithm (cf. Algorithm, com-
n-digit numbers on multi-tape Turing machines can be plexity of description of an). In some publications,
performed in time n I +(, but not in time order n, but both these approaches are combined.
iterative arrays can achieve this in real time, i.e. the i-th References
digit of the product is produced before the (i + l)-th [1] HARTMANIS, J. and HOPCROFT, J.E.: 'An overview of the
digits of the factors are inputted. When algorithmic theory of computational complexity', J. ACM 18 (1971), 444-
processes of one type are modeled by other processes, 475. N . V.. P etn.
the computational complexity usually changes. Thus, a Editorial comments. The area has considerably
reduction in the dimension of the iterative arrays expanded from the state described in the article above. Two
results in longer computation time. Multi-head Turing key references [A1], [A5] are listed below.
machines and multi-tape Turing machines can simulate The results on Turing machines referred to, up to the
each other in real time. Many types of algorithms can result on multiplication, hold for Turing machines in the ori-
be simulated on Turing machines, such that the compu- ginal version only (single input/work tape, no extra work
tation time increases polynomially (usually quadrati- tapes). In the West, the standard Turing machine is nowa-
cally) with an insignificant increase in space complexity days the (off-line) model with multiple work tapes. Turing
machines are useful, because of their simplicity, to obtain
(usually, an additional constant factor).
lower bounds on the computational complexity of problems.
The complexity approach is useful in studying cer-
Upper bounds, by exhibiting actual algorithms, are usually
tain hierarchies of classes of general recursive functions based on the RAM (random access machine), which more
(cf. General recursive function), in connection with their realistically models the 'von Neumann' computer. The two
logical and algebraic features. For instance, primitive alternatives can simulate each other with a constant multipli-
recursive functions coincide with the functions comput- cative factor in space and a polynomial time overhead. The
able by Turing machines in space bounded by a primi- latter property characterizes all 'reasonable' models (First
tive recursive function. The existence of arbitrarily Machine Class) for sequential computations (invariance
complex functions is established by diagonalization Thesis). The determination of such overheads is a subject of
methods. Natural analogues for such lower bounds on the theory of machine models. For instance, multi-tape Tur-
computational complexity were found for the validity ing machines with one-way input (no backup on the input
problem in certain decidable theories and in areas tape) can be simulated in square time by a Turing machine
with one-way input and a single work tape in an obvious
related to mathematical linguistics. However, for many
way, and it has been shown that less than square time is
problems of practical importance the problem of
insufficient [A2], [A3]. One can simulate k work tapes by 2
obtaining non-trivial lower bounds presents consider- work tapes in time order n log n; it is not known whether this
able difficulties. This is the case, in particular, for cer- is optimal, but the lower bound is non-linear. In contrast,
tain combinatorial problems which are formalized (in multicounter machines can be simulated in real time (step-
one variant) as problems of finding, out of all words of for-step) by a Turing machine with a single work tape [A4].
a given word length, those which satisfy a predicate (A counter is a storage device which can hold any integer
computable in polynomial time. One can also mention and supports unit increments, unit decrements, and check
the complexity of the computations performed by non- for zero.) The Invariance Thesis Yields a justification for the

136
ALGORITHM, LOCAL

identification of the class of problems which can be solved alphabet A - 'An exact, generally understandable set of
deterministically in polynomial time (denoted by P) as the instructions describing a potentially realizable process
class of 'tractable' problems. Adding the feature of non- of successive transformation of abstract words over the
determinism gives the class NP (non-deterministic polyno- alphabet A, in which any word over A may serve as the
mial time), which is known to contain a large variety of initial word' [1].
problems which one likes to solve in daily life. The best
An algorithm over an alphabet is a special case of
known deterministic algorithms for solving NP-problems
the general concept of an algorithm. The input and the
require exponential time. On the other hand, it is unknown
whether P=NP. I.e., it is unknown whether these exponen- possible outputs of an algorithm over an alphabet are
tial time algorithms can be replaced by deterministic polyno- constructive objects (cf. Constructive object) of suffi-
mial time ones. This question is generally considered the ciently general type - words (cf. Word). Various for-
most important open problem in the theory of computation. malizations of an algorithm over an alphabet were
It is known that the complexity of NP is exemplified in developed in the nineteen-seventies; the best known
specific NP problems. These NP-complete problems have ones are Markov's normal algorithms (d. Normal algo-
the property that P=NP if and only if such a problem can rithm) and algorithms defined using the Turing
be solved in deterministic polynomial time (is in P). Similar machine. When constructing specific algorithms over
questions arise with respect to the relations between dif- alphabets, it is often useful to extend the starting
ferent space complexities, and the relations between time alphabets by introducing additional letters and consid-
and space. The fundamental complexity classes related to
ering the algorithms over the alphabets thus extended.
space are PSPACE (NPSPACE), which consist of those
In the case of two algorithms over the same alphabet,
problems which can be solved in polynomial space by
deterministic (non-deterministic) machines from the First
the concepts of equivalence and complete equivalence
Machine Class. It is known (Savitch's theorem) that of these algorithms with respect to it can be naturally
PSPACE=NPSPACE. Clearly, PSPACE contains NP but it is introduced.
unknown whether the inclusion is strict. References
There exists a large variety of models for parallel compu- [1] MARKov, A.A.: Theory of algorithms, Israel Progr. Sci. Trans!.,
tation. Many of those models (Second Machine Class) have Jerusalem, 1961 (translated from the Russian). Also: Trudy
the property that time on the parallel model is equivalent to Mat. Inst. Steklov. 42 (1954).
N.M. Nagornyf
space on the sequential model, up to a polynomial overhead
Editorial comments. The connection between algo-
(Parallel Computation Thesis). Each problem in NP can be
rithmic descriptions of languages (on the basis of algorithms
solved in deterministic polynomial time on a machine of the
over an alphabet) and generative descriptions using formal
Second Machine Class. Other models seem to be even
grammars in the style of N. Chomsky, constitutes the
more powerfull, albeit totally unrealistic. The power of the
essence of the theory of abstract automata and formal
parallel models is rooted in the possible activation of at least
languages. The classic text is J.E. Hopcroft and J.D.
an exponential amount of hardware in a polynomial number
Ullman's book [A1].
of computation cycles (called 'time'). As soon as one
enforces realistic bounds (e.g., because of the bounded References
[A1] HOPCROFT, J.E. and ULLMAN, J.D.: Formal languages and
speed of light) on the amount of hardware which can be their relation to automata, Addison-Wesley, 1969. Revised
activated by a computation in a given timespan, the Parallel version: 'Introduction to automata, languages and computa-
Computation Thesis becomes invalid. For many specific tion' Addison-Wesley, 1979.
problems it has been shown that they can be solved on a
AMS 1980 Subject Classification: 68C05, 03003,
parallel device with a polynomial amount of hardware in
03010
time order log n k (NC complexity class).
References ALGORITHM, LOCAL - An algorithm which speci-
[A 1] GARY, M.R. and JOHNSON, D.S.: Computers and intractabil-
fies the attributes of elements of a structured set and
ity; A guide to the theory of NP-completeness, Freeman,
1978. uses, at each step, only information about neighbour-
[A2] MAASS, W.: 'Combinatorial lower bound arguments for deter- hoods of the element. Problems of the existence or
ministic and nondeterministic Turing machines', Trans. Amer. non-existence of effective algorithms for various
Math. Soc. 292 (1985), 675-693.
[A3] VITANYl, P.M.B. and LI, M.: 'Tape versus stack and queue:
discrete optimization problems are naturally formulated
the lower bounds', Information and Computation (To in terms of local algorithms.
appear). Let there be given a family {WC: WCEM} of sets, and
[A4] VITANYl, P.M.B.: 'An optimal simulation of counter
let there be assigned to each pair (~, WC), where ~ E WC,
machines', SIAM J. on Computing 14 (1985),1-33.
[A5] WAGNER, K. and WECHSUNG, G.: Complexity theory, Reidel, a set S(~, WC) such that: 1) ~ES(~, WC); 2)
1986. S(~, WC)~WC; and 3) if ~EWCl' ~EWC2 and
AMS 1980 Subject Classification: 68C05, 68C25 S(~, WCl)~WC2 ~WCl' then S(~, WCl)=S(~, WC 2). The
set S(~, WC) will then be denoted as a neighbourhood of
ALGORITHM IN AN ALPHABET, algorithm over an ~ in WC. Examples of neighbourhoods are conjunction

137
ALGORITHM, LOCAL

neighbourhoods in a contracted normal form (cf. J(9R) and, secondly, if (al> ... ,al)~(fJ}, ... ,PI) fol-
Boolean functions, minimization of). lows from ~, '" a, E9Ri and ~P, ... P, E9Ri; 5) on the
Let f be a finite undirected graph, f =M 1 U M 2, set of neighbourhoods of the form S(~' ... a" 9R*) one
where M 1 = {a I> ,aq } is the set of vertices of f, takes SI ~S2' where SI =S(~, .. 'a" 9Ri) and
and let M 2 ={(aj" aj), ... ,(a", a,,)} be the set of S2 =S(~p, ... P" 9Ri) if and only if
edges of f. A neighbourhood SI(R, f) of an edge S(~, 9R])=S(~, 9R2) and if it follows from
R =(aj, aj) of the graph f consists of all vertices of the ?BY' ... y, ES 1 and ?B6, ... 6, ES 2 that
edges incident with the edge R, and also of all edges (y}, ... 'Y/)~(8}, ... ,81),
whose vertices are included in the neighbourhood Let A and B be elements of one of the sets 1) - 5). If
S 1(R, f). Let Sn -1 (R, f) be a defined neighbourhood; A ~B and A ~B, then the elements A ap.d B are called
the neighbourhood Sn(R, f) then consists of all vertices equi-informative, which is denoted by A B. A function A

of the edges incident with the vertices of the neighbour- cp(~, aI, ... ,ai, S, 9R*) is called monotone if it follows
hood Sn -1 (R, f) and all edges, the vertices of which from SI ~S2 that, for i= 1, ... ,I,
belong to the neighbourhood Sn(R, f). The neighbour- cM~, IXl. .. ,IXl. S l. 9nj) .;;;; cMI}!, f3l. . ,f3l. S 2, '1n 2).
hoods SI(aj, f), ... ,Sn(aj' f) of an arbitrary vertex aj
The determination of a local algorithm also necessi-
of the graph f are introduced in a similar manner.
tates the introduction of an ordering algorithm A '/I' Let
Let a system of two-place predicates
M be an arbitrary set of elements with informatio!l vec-
P 1 (~, 9R), ... ,PI(~' 9R), which is subdivided into two
tors, and let N ={I, ... ,I}. Consider the set M X N of
non-intersecting subsets <P" ... ,P, > and
all pairs (~I'" 0/, ,j) such that ~" .. 0/, E, j EN,
<Pr+1> ... ,PI>, be defined on pairs (~, 9R), ~E9R.
aj=6.. The algorithm A'/I orders the set MXN. The
The elements of the first subset are called the main
local algortihm A is completely defined by the system
predicates, while those of the second are called the auxi-
of predicates PI>' .. ,PI, by the subdivision of this sys-
liary predicates. A vector a=(ab ... ,al) is called an
tem into main predicates P" ... , Pr and auxiliary
information vector if aj E {O, 1, 6. }, i = 1, ... ,I. A vector
predicates Pr +" ... ,PI, by the system of monotone
a is called permissible for ~ in 9R if for all aj=l=A the
equality aj = Pj(~, 9R) is satisfied. The set J (~, 9R) of functions cP" ... ,CP/, cPj =CPj(\}J, a}, ... ,ai> S, 9R*), and
all information vectors permissible for ~ in 9R is called by the algorithm A '/I'
the information set of ~ in 9R. Let [)C* = U~]~' ... ai" 9R* EJ(9R). The first step
Let 9R= {~b ... ,~q} and (ail, ... ,ai/)EJ (~j, 9R), of the local algorithm may be described ..as follows.
.I -- 1, , q. The set on* - {ora" ... all
;U~ - '?t}
oraq' ... a" }
, ,u q Algorithm A'/I is applied to the set M X N, where
is called permissible for 9R. The class J(9R) of all sets M = 9R*; for the first pair (~' ... a, ,j) of the ordered
9R* permissible for 9R is called the information class of sequence one computes
the set 9R over the system of predicates P" ... ,PI' cpi~, a], ... ,ai, S, 9R*)=(fJI' ... ,PI) and the element
Clearly, a neighbourhood S(~, 9R) defines a neighbour- ~, 0/, is then replaced by \}JP' ... .8, ; if
hood S(~a, ... a" Wt). (ab ... ,al)=(fJb ... ,PI)' then one takes the second
Introduce a system of functions cp], ... ,CPI such that pair, etc.; if for all elements (?BY' ... Y', i) the equality
cp(?B, yJ, ... ,YI, S, [)C*)=(y], ... ,YI) is true, then the
cM~a, ... a" S(~a, ... a" [If) = (f3l. ... ,f3I)' local algorithm A is terminated after reviewing all the
The functions cPj will be defined over all pairs pairs from M X N. Otherwise, after replacement of the
(~a, ... a" S(~a, ... a" 9R* such that ~I' a, E9R*,
vector (ab'" ,al) by the new vector (/3], ... ,/31)'
9R* EJ(9R), and satisfies the fo~~wing conditions: 1) algorithm A is terminated if there are no more elements
aj =/3j if j=l=i; and 2) the set 9R , which is obtained with at least one symbol A in the first r places of the
from 9R* by replacing the e~etpent ~a, ... a, by ~.81 .. '.8', information vectors; if there are such elements, the first
is permissible for 9R, i.e. 9R EJ(9R). For the sake of step of the local algorithm terminates. Let the number
brevity, the pairs (~I' .. a" S(~a, ... a" 9R*) are of steps of the local algorithm A which have been per-
denoted by(~,a], ... ,ai'S, 9R*). formed be n. The (n + 1)-st step is carried out exactly as
A partial order is introduced on some of the above the first step, except that instead of the set [)C* one now
sets: 1) on the set M] ={O, 1, A} one takes A<O, considers the set [)C~ into which the set [)C* has been
A< 1; 2) on the set M 2 of information vectors of length converted after the first n steps of the local algorithm
lone takes (a], ... ,aa~(/3], ... ,/31) if a.i~/3i' A. Owing to the monotone nature of the functions CPi,
i = 1, ... ,I; 3) on the set of elements wi~h marks one i = 1, ... ,I, the local algorithm A will be terminated
takes ~a ... a'~~.8 ... .8'1'f (a], ... , al )~(/3
I I """ ], ... , /3I;
) after a finite number of steps.
4) on the set M* = U J(9R) one takes 9R~ ~9R; if, The starting theorems for local algorithms are the
* * IJJIcM
first, [)C] and [)C2 belong to the same information class uniqueness theorem and the theorem on the existence of a

138
ALGORITHM, REPRESENTATION OF AN

best algorithm. The first theorem states that the result such that in A(9JC') the predicate PI will not be com-
of the calculation of the main predicates is independent putable for all elements.
of the algorithm A" (the order in which the elements of Let f(x I, . . . ,xn ) be the set of all Boolean functions
the set 9JC' are taken). The second theorem stipulates (cf. Boolean function) in the variables x I, . . . ,Xn and
the existence, under very general conditions, of a best let P I (~,f (x I, . . . ,xn be the predicate ('the conjunc-
local algorithm, i.e. of a local algorithm which uses a tion ~ is included in at least one minimal disjunctive
given system of neighbourhoods to compute given main normal form'). With natural limitations imposed on the
predicates with fixed auxiliary predicates, whenever this class of predicates (JJ it was found that the predicate
is done by any other local algorithm with an identical P I (~,f (x I, . . . ,xn is not (k, I)-computable if
system of neighbourhoods, and main and auxiliary kl<const2n It is interesting to note that the predicate
predicates. P(m:,f(XI, ... ,xn ('the conjunction ~ is included in
The problem of finding a best local algorithm in at least one dead-end disjunctive normal form of 1')
explicit form has been solved for algorithms of syn- (cf. Boolean functions, nonnal fonns of) is (2, 1)-
thesis of minimal coverings. Let there be given a tuple computable, but is not (1, I)-computable. Similar
<~I"" ,~q,/1.I"" ,}1.q,M>, where m: i is a set, Jl.i is results were obtained for predicates describing the entry
the complexity of m:i, }1.i>O, i=l, ... ,q, of an edge into a shortest path between given vertices
M c:; U r= I m:i . The complexity of the tuple of a graph.
(m:il , . . . , ~i) is ~~ = I Jl.i The problem is how to con-
j References
struct a covering of M by sets from a number of [IA] ZHURAVLEV, YU.I.: 'Local algorithms for the calculation of
infonnation Part 1', Cybernetics 1, no. I (1965), 10-16. (Kiber.
m: 1, ,~q with minimum complexity. A system of netika (Kiev) 1, no. I (1965), 12-19)
neighbourhoods SI, ... ,Sn, ... , for each ~i is intro- [lB] ZHURAVLEV, Yu.I.: 'Local algorithms for the calculation of
duced in a manner similar to the introduction of a sys- infonnation Part II', Cybernetics 2, no. 2 (1966), 1-8. (Kiber
netika (Kiev) 2, no. 2 (1966), I-II)
tem of neighbourhoods for conjunctions. A best local [2] KHUTORYANSKAYA, LV.: 'Aspects of the theory of local algo-
algorithm is constructed for a system of neighbour- rithms on graphs', Cybernetics 7, no. I (1971),37-44. (Kiber
hoods Sk at fixed predicates: the set of auxiliary predi- netika (Kiev), no. I (1971), 29-33)
cates is empty; the main predicates are considered to [3] ZHVRAVLEV, Yu.I.: 'Set-theoretical methods in the algebra of
logic', Problemy Kibernet. 8 (1962), 5-44 (in Russian).
be the predicates PI (m:, M) (m: is not included in any
See also the references to Boolean functions, minimiza-
minimal covering of 9JC) and P 2 (m:, M) (m: is included
tion of.
in all the minimal coverings of 9JC). Best local algo- Yu.I. Zhuravlev
rithms have also been constructed to calculate simple Editorial comments. As a specific area (or in this form),
properties of edges of a graph. Various local algorithms the subject seems not to have been studied in the Western
have been proposed to solve problems of minimization literature. It may be related to the study of Systolic Array
of Boolean functions, of discrete linear programming, algorithms [A1], Chapt. 8.
etc. References
Computability of best local algorithms in a class of [A 1] MEAD, C. and CONWAY, L.: Introduction to VLSI systems,
local algorithms is an important problem in the theory Addison-Wesley, 1980, pp. 263-330.
of local algorithms. If a system of imbedded neighbour- AMS 1980 Subject Classification: 68C05
hoods S I (m:, 9JC) c:; ... c:; Sn(m:, 9JC) c:; . .. is given on
the pairs (~, 9JC), ~E9JC, 9JCEM, and if the local algo- ALGORITHM, REPRESENTATION OF AN, coding
rithm operates with respect to the system S(~, 9JC) of of an algorithm - A constructive object of a certain kind
neighbourhoods where (as a rule, a natural number or a word) containing
Sk-I(m:, 9JC)C:;S(m:, 9JC)C:;Sk(m:, 9JC), then one says that complete information about the algorithm, coded in
the local algorithm has index k. It is natural to consider accordance with the rules laid down for this type of
the number of predicates PI, ... ,PI in the definition algorithm. A coding of an algorithm is usually so
of a local algorithm as the quantity of memory of the defined that the procedures for obtaining the coding of
local algorithm. Let there be given a set (JJ of predi- the original algorithm and obtaining the original algo-
cates P(m:, 9JC). It is considered that all the predicates rithm from its coding are as simple as possible.
participating in the definition of the local algorithm The coding ~I of a nonnal algorithm ~ over an
belong to (JJ. Let A(9JC') be the result of applying the alphabet A (not containing the letters a, {3 and y) is
local algorithm A to 9JC', 9JC' EJ(9JC), 9JCEM. One says defined as a word over the alphabet A a{3y which is
that the predicate P I (~, 9JC) is not (k, I)-computable if obtained as follows [1]. In each substitution formula of
for any local algorithm of index k with memory quan- the scheme of ~ the arrow is replaced by the letter a,
tity I for the main predicate P I and for the auxiliary the point (if any) by the letter {3, and the letter y is
predicates P 2 , . ,PI from (JJ there exists a set 9JC' written at the end of the word thus formed. The result-

139
ALGORITHM, REPRESENT AnON OF AN

ing words are then written out, one after the other, in recourse to probability theory, so that the concepts of
the same sequence as that of their respective substitu- entropy and quantity of information might be applica-
tion formulas in the scheme of 2{. In actual fact, 2{I is ble to individual objects. It also gives rise to its own
merely a scheme of the normal algorithm 2{, written in problems, which are related to the study of the entropy
a somewhat different form - viz. as a word. For rea- of specific individual objects.
sons connected with the technical details of the defini- The key concept of algorithmic information theory is
tion of a normal algorithm, another type of coding of a that of the entropy of an individual object, also called the
normal algorithm is somewhat more convenient - the (Kolmogorov) complexity of the object. The intuitive
so-called record of a normal algorithm [1], which is the meaning of this concept is the minimum amount of
result of translating the words of 2{I into some two- information needed to reconstruct a given object. An
letter alphabet (usually {O, I}). The coding of the pro- exact definition of the concept of complexity of an
gram of a Turing machine can be carried out in a simi- individual object, and (on the basis of this concept) of
lar manner. The role of the coding of a recursive func- the concept of the quantity of information in an indivi-
tion is played by the GOdel number of the system of dual object y relative to an individual object x, was
equations which define this function [2]. given by A.N. Kolmogorov in 1962 - 1965, after which
In any formalization of the general concept of an the development of algorithmic information theory
algorithm, its coding is so defined that it (in fact, the began.
algorithm coded by it) might be included in the inputs Without loss of generality, only binary words (linear
for the algorithms of this type. This makes it possible strings of O's and l's) are considered. The complexity of
to demonstrate the so-called 'theorems on universal a word x is understood to mean the length I(p) of the
algorithms', dealing with the realizability of algorithms shortest binary word p which contains all the informa-
whose codings model the operation of any algorithm of tion required to reconstruct x with the aid of some
this type. definite decoding procedure. A precise definition of this
If a coding of an algorithm is a permissible input for concept follows.
the algorithm, this algorithm is called self-applicable if it Let F(s) be an arbitrary universal partial recursive
is applicable to its own coding, and not self-applicable function. Then the complexity of the word x with
otherwise. It is possible to show, with the aid of a vari- respect to F will be
ant of the Russell paradox (cf. Antinomy), which is min/(p) if F(P)=x,
applicable to this situation, that the algorithmic prob- KFeX) = { 00 if there is no p such that F(P)=x.
lem which then arises - to wit, how to recognize self-
applicable algorithms among other algorithms of this The word p such that F(P)=x is known as the code,
type - is not solvable by an algorithm of this type. or the program, by which F reconstructs the word x.
This result forms the base of numerous theorems on the The following theorem is valid: There exists a partial
un solvability of algorithmic problems. recursive function F 0 (an optimal function), such that
The length of the code of an (encoding of an) algo- for any partial recursive function F the following ine-
rithm is a natural measure of the complexity of an quality holds:
algorithm (the amount of memory which is required to
remember the program of the algorithm) (cf. Algorithm, where CF is a constant which is independent of x. This
computational complexity of an; Algorithm, complexity theorem yields an invariant (up to an additive constant)
of description of an). definition of complexity: The complexity K(x) of the
References word x is the complexity KFo (x) with respect to some
[1] MARKov, A.A.: Theory of algorithms, Israel Progr. Sci. Trans!., partial recursive function F 0, which has been fixed
1961 (translated from the Russian). Also: Trudy Mat. Inst. once and for all. Clearly, K(x)~/(x)+C, where C is a
Steklov. 42 (1954).
[2] KLEENE, S.c.: Introduction to metamathematics, North-Holland
constant independent of x.
& Noordhoff, 1951. Using K(x) one may also determine the complexity
N.M. Nagornyr K (x, y) of pairs of words (x, y); to do this, the pairs
AMS 1980 Subject Classification: 03D05, 68C05 (x, y) are effectively coded as one word c(x, y), and the
complexity K(x,y) is understood to mean K(c(x,y.
ALGORITHMIC INFORMATION THEORY - The P. Martin-LOf studied the behaviour of the complex-
branch of mathematical logic which gives a more exact ity of the initial segments of infinite binary sequences.
formulation of the basic concepts of the theory of Let Wn denote the initial segment of a sequence w, con-
information, based on the concepts of an algorithm and sisting of the n initial characters. Then almost all
a computable function. Algorithmic information theory (according to the Lebesgue measure) infinite binary
attempts to give a base to these concepts without sequences W have the following properties: a) for infin-

140
ALGORITHMIC INFORMATION THEORY

itely many natural numbers n the inequality holds. At the same time there exists a recursively enu-
K(wn)';;;:'n -c, where c is some constant, is valid; and b) merable set M such that if the time t of computation of
for all natural numbers n larger than some value n., an arbitrary general recursive function is bounded, the
K(wn);;;:'n -(1 +)lo~ n, where is an arbitrary positive estimate Kt(M, n);;;:.n / c" where Ct does not depend on
number (the Martin-LOf theorem). Conversely, for any n, is valid.
sequence W there exist infinitely many n such that Sets which are universal with respect to a certain
K(wn)~n -lo~ n. type of reducibility may also be defined in these terms
The concept of complexity is also employed in the (cf. Universal set): A set M is weakly tabularly univer-
study of algorithmic problems. Here, a more natural sal if and only if there exists an unbounded general
concept is the so-called complexity of solution of the recursive function f (n) such that the inequality
word x. The intuitive meaning of this concept is the K(M, n);;;:. f (n) is valid for all values of n.
minimum quantity of information required to find, for In studying the complexity of bounded algorithmic
each number i~l(x), the i-th character of word x (the problems other measures of complexity may also be
length of the word x need not necessarily be known). employed, e.g. the minimum of the length of the coding
More exactly, the complexity of solution of the word of the nonnal algorithm which solves the bounded prob-
x =x} ... Xl(x) by a partial recursive function G(s, t) lem in question. It was found, however, that asymptoti-
means that cally exact relations (d. Algorithm, complexity of
min{I(P): Vi""-/(x), G(p, i)=x;}, description of an) exist between the complexities intro-
KRG(x) = {
ereIS no sueh p.
00 if t h duced above and the analogues of these complexities
expressed as the minimum length of the coding of an
The following theorem is valid: There exists a partial algorithm (or as the minimum number of internal states
recursive function Go (an optimal function) such that of a Turing machine).
any other partial recursive function G satisfies the ine- Another concept which is introduced in constructing
quality KRG.(x )~KRG(x) + CG, where CG is a constant algorithmic information theory is that of conditional
independent of x. The complexity of solution, KR(x), compleXity of a word x, for a known y, using a partial
of the word x is the complexity KR G, (x) of some recursive function G(s, t):
optimal partial recursive function Go which has been min {l(P): G(p,y)=x},
fixed once and for all. Clearly, KR(x)~K(x)+C and KG(x Iy) = { 00 if there is no p such that G(p,y)=x.
K(x)~KR(x)+21(x)+C. It is possible, by using
KR(x), to determine, for any set M of natural numbers, This concept is related to the theorem on the existence
the complexity K(M, n) for the n-segment of the set M: of an optimal function Go, and the conditional com-
K(M, n)=KR(wn), where w=x} Xi . .. is the plexity K(x Iy) of a word x, if y is known, is defined as
characteristic sequence of the set M (x; = 1 if i EM, and the complexity KG, (x Iy) using an optimal function Go
Xi =0 if HiM). fixed once and for all. The intuitive meaning of
Algorithmic problems are usually represented as K(x Iy) is the minimum quantity of information which
problems of belonging to some recursively enumerable must be added to the information contained in the
set M. If some value n is given, and the problem of word y for the reconstitution of the word x to be possi-
belonging to M is restricted to the first n natural ble. Clearly, K(x ly)~K(x)+C.
numbers only, then one obtains a bounded algorithmic The next central concept of algorithmic information
problem. The magnitude K(M, n) may be intuitively theory is the concept of the amount of information in an
understood as expressing the quantity of information individual object y relative to an individual object x:
required to solve this bounded problem. In particular, I(y:x) = K(x)-K(x Iy).
the set M is recursive if and only if K (M, n) is
The quantity /(y :x) is called the algorithmic amount of
bounded from above by some constant.
information in y about x. The quantities K(x) and
It follows from the Martin-Lof theorem that there
K(x Iy) are sometimes called the algorithmic entropy of
exist sets for which K(M, n)-n. It was also found that
x and the algorithmic conditional entropy of x for a
sets with maximum degree of complexity exist even
given y. The formulas for the decomposition of the
among sets defined by arithmetic predicates with two
entropy H(X, y)=H(X)+H(YIX) and of the commu-
quantifiers. However, the following theorem applies to
tation of information /(X:Y)=/(Y:X) are valid in the
recursively enumerable sets: 1) for any recursively enu-
algorithmic concept up to terms of order
merable set M and any value n, the inequality
O(1ogH(X, Y:
K(M, n )~lo~ n + C, where C does not depend on n, is
valid; and 2) there exists a recursively enumerable set I K(x,y)-[K(x)+ K(y Ix)ll ""- o (log K(x, y),
M such that for any n the inequality K(M,nlo~ n I I(x:y)-I(y:x) I ""- Q(logK(x,y)).

141
ALGORITHMIC INFORMATION THEORY

There exists a certain relation (due to Kolmogorov) posium on Foundations of Computer SCience, 1983, pp.
439-445.
between the algorithmic and the classical definitions of [A2] SoLOMONOFF, R.J.: 'A formal theory of inductive inference,
the amount of information (or, more exactly, between Part I and Part II', Information and Control 7 (1964), 1-22.
the complexity of a word according to Kolmogorov and [A3] MARTIN-LOF, P.: 'The definition of random sequences', Infor-
mation and Control 9 (1966), 602-619.
the entropy of the Shannon distribution of frequen- [A4] CIiArrIN, G.J.: 'Randomness and mathematical proof', Scien-
cies): Let a number r be given, let the word x of length tific American May (1975),47-52.
i'r consist of i words of length r, and let the k-th word [A5] CHAiTIN, G.J.: 'Algorithmic information theory', IBM J. Res.
of length r appear in x with a frequency qk Oev. 21 (1977), 350-359.
(k = 1, ... ,2'); one then has AMS 1980 Subject Classification: 68C25, 94A 17
K(.x) ..;; H +a:(i),
I ALGORITHMIC LANGUAGE, formal programming
where language - A fonnal language intended for describing
t
H = - L qk 10& qb computational processes or, equivalently, for writing
k=l down algorithms to be executed by computers. One dis-
and a(i)~O as i~oo. In the general case no closer con- tinguishes between problem-oriented algorithmic
nection can be established between entropy and com- languages (high-level languages), which are not related
plexity. This is because entropy is intended to study to any specific machine, and machine-oriented algo-
sequences subject to frequency laws only. Conse- rithmic languages (low-level languages), which take the
quently, one may establish the following relation for specific features of a given machine into account
random sequences w in measure between the magni- (instruction set, addressing modes, etc.). The term
tudes under study, which corresponds to independent 'algorithmic language' usually refers to a problem-
trials: oriented language, as opposed to machine code, which is
lim K(~ir) = H. a notation that is directly interpreted by a machine.
1---"00 I
For the well-formed texts of an algorithmic language
Similar considerations apply to an arbitrary ergodic
(programs, cf. Program) a general algorithm defines
stationary stochastic process.
their execution in a unique way, which is the distinc-
References tion between algorithmic languages and non-algorithmic
[1] KOLMOGOROV, A.N.: 'Three approaches to the quantitative programming languages, for which the execution process
definition of information', Problems Inform. Transmis-
sion 1 (1965), 1-7. (Problemy Peredachi Informatsii 1, no. I for a text is fully undetermined or the text merely
(1965),3-11) serves as material for the synthesis of an algorithm.
[2] KOLMOGOROV, A.N.: 'On the logical foundations of informa- As in natural languages, an algorithmic language is
tion theory and probability theory', Problems Inform.
Transmission 5, no. 3 (1969), 1-4. (Problemy Peredachi Infor-
constructed over an alphabet of basic symbols (in
matsij 5, no. 3 (1969), 3-7) which the program is written down) in the form of a
[3] ZVONKIN, A.K. and LEVIN, L.A.: 'The complexity of finite hierarchical system of grammatical elements, between
objects and the developments of the concepts of information
which relations are given (similarly to the words,
and randomness by means of the theory of algorithms', Rus-
sian Math. Surveys 25, no. 6 (1970), 82-124. (Uspekhi Mat. phrases and sentences in a natural language, whose
Nauk 25, no. 6 (1970),85-127) connections are given by syntactic rules). The lowest
[4] BARDZIN', YA.M.: 'Complexity of programs to determine level elements, formed by chains of basic symbols, are
whether natural numbers not greater than n belong to a recur-
sively enumerable set', Soviet Math. Dokl. 9, no. 5 (1968), called lexemes, or lexical units. For the lexemes occur-
1251-1254. (Dokl. Akad. Nauk SSSR 182, no. 6 (1968),1249- ring in a program the class to which they belong is
1252) defined, and for certain classes of lexemes (e.g. identif-
[5] KANOVICH, M.l.: 'Complexity of resolution of a recursively
enumerable set as a criterion of its universality', Soviet Math.
iers) also their scope - some uniquely identifiable part
Dokl. 11, no. 5 (1970), 1224-1228. (Dokl. Akad. Nauk SSSR of the program to which all occurrences of a given lex-
194, no. 3 (1970), 500-503) v M B d" erne belong (a block). Exactly one occurrence of such a
1a. . arz In
lexerne is said to be defining; the other occurrences of
Editorial comments. The theory of Kolmogorov com-
the lexeme in its scope are called applied.
plexity, or 'algorithmic information theory', originated in the
The further levels of elements of an algorithmic
independent work of R.J. Solomonoff, Kolmogorov, L. Levin,
Martin-Lof and G.J. Chaitin. Recently, J. Hartmanis has language are formed by notions, or non-terminals. The
refined the theory by also considering the amount of work relation that may hold between the notions of an algo-
(time complexity) involved in reconstructing the original data rithmic language is that of being a (direct) constituent
from its description. of (i.e. an immediate constituing part), while the indivi-
dual constituents of a given notion are related to each
References
[A 1] HARTMANIS, J.: 'Generalized Kolmogorov complexity and the other by concatenation (textual sequence). The transitive
structure of feasible computations', in Proc. 24th IEEE Sym- closure of the constituent relation uniquely assigns to

142
ALGORITHMIC LANGUAGE

each notion some subword of the text of the program, and symbols for certain operations. The principal
which is said to be the (terminal) production of this classes of lexemes are numerals to represent numbers,
notion. There is one initial notion, the production of literals to represent texts, and identifiers to denote the
which is the entire program text. A tree whose root is various objects of the program that are defined in the
the initial notion, whose terminal vertices (leafs) are program itself. The principal such objects are variables,
lexemes and basic symbols, whose internal vertices are labels (which name different parts of the program) and
concepts and whose branches are constituent relations, procedures (which denote functions). The meanings and
is called a production or syntax tree of a program. The the names of certain identifiers are defined by the
construction of such a tree is known as the syntactic description of an algorithmic language (reserved
analysis or parsing of a program. words).
Notions and lexemes have attributes, i.e. certain sets Among the concepts of an algorithmic language are
fixed by the description of the algorithmic language. basic structures - definitions, expressions and opera-
Determining the attributes of the elements occurring in tors. Descriptions are sources of attribute information
a program is called its semantic analysis. Finding the which is assigned to the defining occurrence of the lex-
attributes of lexemes begins with the analysis of their erne. Essentially, attributes characterize the type of
defining occurrences, which usually contain explicit values calculated by executing the program, their
information about the attributes. The attribute informa- representation and the way in which they are stored in
tion is transferred to all the applied occurrences of the the memory of the computer. For composite values like
lexeme within the corresponding scope (identification). arrays (vectors, matrices) and structured values
The attributes of a certain notion are found, by induc- (records) the mode of access to their elementary com-
tion over the production tree, as a function of the attri- ponents is also specified. Expressions are the source of
butes of its constituents. A substantial part of the values; operators are units of finite operations in the
semantic analysis, which is very valuable for checking program; basic operators are the operators of assignment
the correctness of a program, is a check on the compa- of the value of an expression to a variable, the operator
tibility of the attributes. of control transfer (unconditional or conditional jump,
The rules of syntactic analysis are laid down by the or goto), the operator of procedure call, and the opera-
generative grammar of the algorithmic language (d. tor of a loop.
Grammar, generative) or by an analyzing automaton Loops and procedures are the most characteristic
(more precisely, by its different generalizations) for tools of an algorithmic language, providing as they do a
converting the program text into a production tree. The concise notation for very long computations. The loop
rules of semantic analysis are usually described infor- operator prescribes a repetitive execution of some part
mally, but attempts have been made to formalize the of the program (the body of the loop), the number of
definition of attribute information and to take the con- repetitions being determined by some given condition.
text into consideration with the aid of the mechanism The definition of a procedure introduces an abbrevi-
of two-level grammars (cf. AJgol-68). ated, parametrized notation for the function of some
The execution algorithm of the programs of an algo- part of the program (the procedure body). The execu-
rithmic language is usually given in the form of a tion of the procedure body may be subsequently
correspondence between the vertices of the production treated as an elementary operation triggered by a call
tree and various execution procedures (also called of the procedure, at which time the actual values are
transducers or semantic procedures). Each procedure assigned to the parameters included in the procedure.
performs certain operations as a function of the attri- An important property of an algorithmic language is
butes and of some context of the given notion (or lex- its capacity to structure a program text, which makes it
erne), and then determines the next procedure. The easier to write, to learn and to check. The principal
operations of the procedure may specify the computa- means of structuring are limitations on the structure of
tion process directly in terms of some abstract machine scopes (block structure), on procedure and loop, bodies
(semantics of the interpretation type), or may transform and rules imposed on the use of goto operators, which
the production of a given notion into a fragment of the limits the number of branches on the execution paths.
program in some high-level language (translational Programming a computer in an algorithmic language
type). necessitates the availability of special software in the
In specific algorithmic languages, the alphabet of form of programming processors (compilers, interpreters)
basic symbols usually consists of the letters of the - which are intermediaries between the program in the
Roman alphabet (which may be replaced or augmented algorithmic language and the machine. A complete pro-
by characters from a national set), digits, pairs of de- cessor executes the following functions: input of the
limiters (parentheses), separators (punctuation marks) program, lexical analysis (isolation and classification of

143
ALGORITHMIC LANGUAGE

lexemes), syntactic and semantic analysis, with indica- nology and concepts of. the theory of programming
tions of formal errors, synthesis of an intermediate languages are not standardized; the article above leans
form (representation of the program in the internal heavily on the somewhat idiomatic approach employed in
language of some abstract computer, which is con- the definition of Algol-68.
venient for the subsequent processing or execution of A language that has gained widespread use since the
the program), optimization (systematic transformations nineteen-seventies is Pascal, which is a close relative of
Algol-60 offering some additional facilities. Next to the class
of the intermediate forms which improves certain
of procedural (or operational) languages, which basically
characteristics of the program such as its size, speed
prescribe the sequence of operations to be performed, there
and demands on memory), code generation (construc- are functional (or applicative) languages, in which a pro-
tion of the machine program) and execution of the pro- gram is basically a collection of recursive function defini-
gram. In processors of the translational type (transla- tions, and deductive (or logic-programming) languages, for
tors, compilers, program generators) the execution of the which a program consists of a collection of recursive defini-
program takes place after the completion of the con- tions of predicates. Most existing algorithmic langages are
struction of the machine program. In processors of the procedural. An example of a functional language is Lisp,
interpretative type (incremental translators, interactive whereas Prolog is a deductive language.
translators) the program is executed with the aid of References
some mechanism of interpretation of either its inter- [A1] PRArr, T.W.: Programming languages: design and implemen-
mediate form, or of the production tree or even of the tation, Prentice Hall, 1975.

initial text. AMS 1980 Subject Classification: 68BXX


One distinguishes between one-phase, two-phase and
multi-phase translation schemes of programs. In the ALGORITIlMIC PROBLEM - The problem of find-
one-phase scheme all functions of the translator are ing a (unique) method (an algorithm) to solve an infin-
executed during a single pass of the text of the pro- ite series of individual problems of the same type.
gram. There is no optimization and no intermediate Algorithmic problems arose and were solved in various
form, while the isolated vertices of the production tree branches of mathematics throughout its history; how-
immediately generate machine commands of the pro- ever, some of them could not be solved for a long time.
gram. In two-phase translators, the production tree is The reason for this only became apparent in the
usually built during the first pass of the program, while nineteen-thirties, when an exact definition of an algo-
the semantic analysis, combined with the generation, is rithm was given in mathematical logic. It was found
effected during the second pass. Multi-phase schemes, that algorithmic problems can be unsolvable, i.e. that
involving the utilization of an intermediate form, are the algorithm sought need not exist at all. As a result,
employed in optimizing translators or in systems with the approach to algorithms on the part of mathemati-
mixed generation for various types of machines; in cer- cians underwent a radical change, and new algorithmic
tain very complex algorithmic languages syntactic and problems began to be formulated as problems of
semantic analysis is also required. existence of an algorithm for the solution of a given
The number of algorithmic languages which may be infinite series of problems of a given type, and of find-
employed with computers is very large (more than one ing such an algorithm if it exists.
thousand), but only a few of them are extensively used. Several more exact definitions of algorithm appeared
These include Algol; Algol-68; Cobol; Lisp; PL/I; in the theory of algorithms (cf. Algorithms, theory of)
Simula; Fortran; and in the USSR also Algams; Al'fa practically all at the same time, and they all proved to
and Refal. be essentially equivalent. In each such definition a suf-
References ficiently broad class of specific algorithms, closed with
[1] INGERMAN,P.: A syntax-oriented translator, Academic Press, respect to the natural operation of combination of algo-
1966. rithms, is singled out. Each statement to the effect that
[2] Programming languages, Moscow, 1972 (in Russian; translated
from the English).
some algorithmic problem is unsolvable is a precise and
[3] HOPGOOD, F.R.A.: Compilation techniques, American Elsevier, proved mathematical theorem on the unsolvability of
New York, 1969. the algorithmic problem under consideration by an
[4] HiGMAN, B.: A comparitive study of programming languages,
American Elsevier, New York, 1968.
algorithm of the given class. In this form these
A.P. Ershov theorems may be regarded as specific, i.e. related to a
Editorial comments. In English-speaking computer sci- given class of algorithms. However, all results of this
ence community, usually no specific distinction is made kind may be expressed in terms of the intuitive idea of
between the concepts of 'progamming language' and an algorithm as understood by all mathematicians. This
'algorithmic language', although the latter term historically 'translation' is based on the so-called Church thesis
refers mainly to languages of the Algol family. The termi- (depending on the exact specification of the given algo-

144
ALGORITHMIC PROBLEM

rithm, it may also be called Turing's thesis or the nor- long time: Are unsolvable algorithmic problems specific
malization principle), which says that the class of algo- to the theory of algorithms and mathematical logic? In
rithms under consideration is universal, i.e. that the other words, do there exist unsolvable algorithmic
essential function of any intuitive algorithm may be problems in traditional branches of mathematics far
performed by algorithms of this class. This thesis is a removed from mathematical logic? The first result of
scientific fact confirmed by the history of mathematics: this kind was obtained in 1949 and 1947 by A.A. Mar-
all algorithms known in mathematics satisfy it and all kov and E.L. Post, independently of one another. They
attempts to construct algorithms not satisfying it have proved that the problem of equality (identity) of words
failed. Finally, the thesis is also confirmed by the for semi-groups - posed by A. Thue as early as 1914
equivalence of the various proposed definitions of algo- - was unsolvable. In this problem one considers semi-
rithm. Church' thesis cannot be proved, since it deals groups defined by a finite number of generators (an
with the mathematically vague concept of an intuitive alphabet)
algorithm, but it is highly important nevertheless, since (1)
it makes it possible to speak of unsolvability of given and defining relations
algorithms in a wide sense, which is generally under-
A, = B;, i=l, ... ,k. (2)
stood by mathematicians.
The first examples of unsolvable algorithmic prob- An elementary transformation of the semi-group II here
lems were noted in the theory of algorithms itself. considered is a transition from the word PAiQ to the
These include the problem of recognition of belonging word PBiQ or vice versa, where P and Q are arbitrary
to a given non-recursive set of natural numbers, the words in the alphabet (1). Two words X and Y in the
halting problem of a universal Turing machine, the alphabet (1) are said to be equivalent in II if they coin-
problem of the recognition of self-applicability of a cide, or if one can be obtained from the other through
normal algorithm, etc. a finite sequence of elementary transformations. The
As concerns algorithmic problems outside the theory set of all equivalence classes with a multiplication
of algorithms itself, one must mention, first of all, the operation which is naturally defined by the operation
problem of recognition of validity of formulas of first- of concatenation of words is just the semi-group II
order predicate calculus, the unsolvability of which was defined by the generators (1) and relations (2). The
first proved by A. Church in 1936. Numerous studies of problem of word equality (identity) (the word problem) in
algorithmic problems in model theory are related to the semi-group II consists in finding an algorithm for
these results. Model theory, which deals with non- recognizing if any pair of words X and Y of II are
empty sets with relations defined on them using predi- equivalent in II or not, i.e. if the elements of II defined
cate calculus, was created in the nineteen-thirties by by them are identical or not. Markov and Post con-
A.I. Mal'tsev and A. Tarski. They must also be credited structed specific semi-groups for which there is no algo-
with exact formulations of many algorithmic problems rithm for solving the equality problem [1].
in model theory and with several fundamental results in The most noteworthy result in this context was
this field. The elementary theory of a given class K of obtained by P.S. Novikov [4], [5]. He showed that the
models is the set of all closed formulas of first-order classical word problem in group theory (the equality or
predicate calculus that are true for all models of K. The identity of words problem) posed by M. Dehn in 1912,
class K may consist of one model. An elementary which was studied by many experts in algebra
theory T is solvable or unsolvable, depending on whether throughout the world, was unsolvable. It may be for-
or not an algorithm for the recognition of whether or mulated in the same way as the word problem for
not any given formula belongs to the theory T exists. semi-groups, except that only such systems of relations
For a partial review of the methods of studying algo- (2) are considered for which inverse elements for all
rithmic problems in model theory and the results generators (1) exist in the given semi-group. Novikov
obtained, see [3]. The problem of the solvability of the also showed that the isomorphism problem, which is very
elementary theory of free groups is the most interesting important in group theory - to wit, how to find an
one mentioned in [3], and one which has not yet (1977) algorithm to tell whether or not any two groups are iso-
been solved. morphic - is unsolvable. It was subsequently shown
Numerous and varied algorithmic problems also arise that for any given group G it is impossible to find an
during the constructive interpretation of various algorithm that would tell whether or not an arbitrary
branches of mathematics. The principal results con- group is isomorphic to G [6].
cerning algorithmic problems in traditional branches of Markov studied the problem of recognition of invariant
mathematics are given below. properties of semi-groups, i.e. of properties that are
A naturally arising question remained open for a preserved on passing to isomorphic semi-groups [2]. He

145
ALGORITHMIC PROBLEM

showed that if for an invariant semi-group property a (1977) is the proof of the solvability of the word prob-
there exists a semi-group with property a and another lem in the class of groups defined by systems of defin-
semi-group which is not imbeddable in any semi-group ing relations in which the defining words may overlap
with this property, then there is no algorithm by which by less than 115 of their length [14]. If the overlap
it is possible to tell whether a given semi-group has the between the words may be as large as 113 of their
property a or not. This actually means that almost-all length, an example of a group with unsolvable word
invariant semi-group properties are algorithmically problem becomes quite simple. In the range between
unrecognizable in the class of semi-groups. An analo- 115 and 1/3 the problem remains open. Similar studies
gue of this fundamental theorem has also been were also conducted for semi-groups. It was found that
obtained in group theory [7], [8], [9]. In particular, the for semi-groups with maximum measure of overlap of
conclusion in this case is as follows. Let Ka be the class the defining words less than 1/2 the word problem is
of all groups with an invariant property a. Two algo- solvable, but examples are known of semi-groups with
rithmic problems are related to each such class: the an unsolvable word problem in which the maximum
word problem for groups from Ka, and the problem of overlap of the defining words is 1/2.
recognizing whether or not an arbitrary group belongs The most prominent topological algorithmic problem
to Ka. It was found that for any non-empty class Ka at is the classical decision problem for homeomorphism of
least one of these algorithmic problems is unsolvable. n-dimensional manifolds. Markov constructed an exam-
The same applies to semi-groups as well. ple of a four-dimensional manifold we for which it is
Another problem studied was the specification of the impossible to construct an algorithm to tell whether or
simplest groups and semi-groups in which the word not an arbitrary four-dimensional manifold is
problem is unsolvable. Of the numerous studies on this homeomorphic to we [17]. It is known that for n = 2 the
subject one may mention the proof of the unsolvability solution of this problem is positive; for n = 3 the prob-
of the word problem for the semi-group specified by lem remains open; for four-dimensional and higher-
the following seven simple defining relations [10]: dimensional manifolds the problems of recognition of
ac = ca, ad = da, be = cb, bd = db, combinatorial and homotopy equivalence of manifolds
are also unsolvable. One may also mention that if P is
eca = ce, edb = de, cca = ccae.
a polyhedron for which the word identity problem of
At a later date [11] an example was given of a semi- its fundamental group is unsolvable, no algorithm can
group with three defining relations and an unsolvable be constructed for P that recognizes the bounded
equality problem. No algorithm for solving the word homotopy (or the free homotopy) of two arbitrary
problem in the general case has yet (1977) been found, paths on P passing through a common point. A posi-
not even for semi-groups with a single defining rela- tive solution was obtained for the problem of conjuga-
tion; it was only constructed for irreducible defining tion in braid groups; this problem is equivalent to the
relations [12]. The algorithm solving the word problem topological problem of recognizing the equivalence of
for groups with one defining relation dates a long time braids [16].
back [13], but even for two such relations the question One of the most famous algorithmic problems in
still remains open. The minimum number of defining mathematics is Hilbert's lO-th problem: To find an algo-
relations for which examples of groups with unsolvable rithm by which to tell whether or not a system of
word problem are known is 12. The problem of word Diophantine equations with integer coefficients has a
equality for commutative semi-groups has been shown solution in integers. After many examples of unsolv-
to be solvable. For commutative groups the isomor- able algorithmic problems had become known, it was
phism problem has been solved as well. The problem suggested that this problem was unsolvable as well.
of word equality is solvable for finite groups, for Moreover, a team of American mathematicians suc-
finitely-defined simple groups, for nilpotent groups and ceeded in proving that the un solvability of Hilbert's
for solvable groups of the second order. However, even tenth problem follows from the existence of a two-place
in the class of solvable groups of the fifth order one Diophantine predicate of exponential growth [18], [19].
can indicate a group given by a corresponding identity However, they were unsuccessful in constructing such a
of the fifth order and a finite number of defining rela- predicate; they merely demonstrated that the recogni-
tions with unsolvable word problem [15]. tion of the existence of integer solutions of exponential
The study of the word problem for groups with a equations was an unsolvable problem. The desired
limited measure of overlap of the defining words, i.e. Diophantine predicate of exponential growth was first
with overlap of the left-hand sides of the defining rela- constructed in 1970 [20], [21]; this was the first proof
tions, had begun before the unsolvability of the word ever given of the unsolvability of Hilbert's tenth prob-
problem in its general form was demonstrated. The lem.
strongest result on this subject at the time of writing In the theory of algorithms the existence of unsolv-

146
ALGORITHMIC REDUCIBILITY

able algorithmic problems of various degrees of unsolva- concept of effective computability. Church' thesis is now
bility was proved. Many studies of algorithmic prob- almost universally accepted.
lems arising in mathematics showed that each such The halting problem asks whether there exists an effective
procedure to decide, given a Turing machine M and input x,
problem had, in the general case, a maximum degree of
whether the calculation ever terminates. A subset of Nk is
unsolvability, while their special instances (e.g. the
recursive if its characteristic function is recursive. Let all
word identity problem in specific semi-groups or Turing machines be enumerated and let Mx be the machine
groups) may have any pre-given degree of un solvability. with index x. Then the theorem that {(x, y): Mx with input y
References halts} is not a recursive set is a precise statement of the
[I] MAL'TSEV, A.I.: AlgOrithms and recursive functions, Wolters- unsolvability of the halting problem.
Noordhoff, 1970 (translated from the Russian). The theorem to the effect that there exists a group with a
[2] MARKov, A.A.: Theory of algorithms, Israel Progr. Sci. Trans!., presentation (i.e. given by generators and relations) for
1961 (translated from the Russian). Also: Trudy Mat. Inst.
which the word problem is unsolvable is often known as the
Steklov. 42 (1954).
[3] ERsHOV, Yu.L., Ef AL.: 'Elementary theories', Russian Math. Boone- Novikov theorem.
Surveys 20, no. 4 (1965), 35-105. (Uspekhi Mat. Nauk 20, no. 4 A class of groups @ is invariant if GE@ and H isomorphic
(1965),37-108) to G imply HE@. It is non-trivial if there exist finitely-
[4] NOVIIWV, P.S.: Dokl. Akad. Nauk SSSR 85, no. 4 (1952), presented groups both inside and outside @ and it is
709-712.
hereditary if a subgroup of an element of @ is an element of
[5] NOVIKOV, P.S.: 'On algorithmical unsolvability of the word
problem', Trudy Mat. Inst. Akad. Nauk SSSR 44 (1955) (in @. The Adyan- Rabin theorem now says that if @ is an
Russian). invariant, non-trivial and hereditary class, then there is no
[6] ADYAN, S.I.: 'Unsolvability of some algorithmic problems in algorithm that determines whether the group given by a fin-
group theory', Trudy Moskov. Mat. Obshch. 6 (1957),231-298 ite problem belongs to that class or not. Properties to which
(in Russian).
the Adyan - Rabin theorem applies include being cyclic,
[7] ADYAN, S.I.: 'Finitely defined groups and automorphisms',
Dokl. Akad. Nauk SSSR 117, no. I (1957),9-12 (in Russian). commutative, free, solvable, finite and equal to the identity
[8] ADYAN, S.l.: 'On algorithmic problems in effectively complete group.
classes of groups', Dokl. Akad. Nauk SSSR 123, no. I (1958), The original papers in which Church' thesis was
13-16 (in Russian). advanced and defended are [A2] , [A3]. Together [A4] and
[9] RABIN, M.O.: 'Recursive unsolvability of group theoretic prob-
[A5] give an excellent introduction to algorithmic unsolvabil-
lems', Ann. of Math. (2) 67, no. 1(1958),172-194.
[IO] TSEiTIN, G.S.: 'An associative calculus with unsolvable ity. Reference [A 1] contains a wealth of interesting material
equivalence problem', Trudy Mat. Inst. Steklov. 52 (1958), 172- and references on the word problem in groups.
189 (in Russian).
[II] MATIYASEVICH, YU.V.: 'Simple examples of undecidable asso-
References
[A1] BOONE, W.W., CANNONITO, F.B. and LYNDON, R.C.: Word
ciative calculi', Soviet Math. Dokl. 8, no. 2 (1967), 555-557.
problems, North-Holland, 1973.
(Dokl. Akad. Nauk SSSR 173, no. 6 (1967), 1264-1266)
[A2] CHURCH, A.: 'An unsolvable problem of elementary number
[I2] ADJAN, S.I. [S.I. ADYAN]: 'Defining relations and algorithmic
theory (abstract)', Bull. Amer. Math. Soc. 41 (1935),332-
problems for groups and semigroups', Proc. Steklov Inst. Math.
333.
85 (1967). (Trudy Mat. Inst. Steklov. 85 (1966
[A3] CHURCH, A.: 'An unsolvable problem of elementary number
[I3] MAGNUS, W.: Uspekhi Mat. Nauk 8 (1941), 365-376.
theory', Amer. J. Math. 58 (1936), 345-363.
[14] LYNDON, R.C.: 'On Dehn's algorithm', Math. Ann. 166 (1966),
[A4] DAVIS, M.: 'Unsolvable problems', in J. Barwise (ed.): Hand-
208-228.
book of mathematical logic, North-Holland, 1977, pp. 567-
[15] REMESLENNIKOV, V.N.: 'Example of a finitely presented group
594.
in the variety j5 with the unsolvable word problem', Alge-
[A5] ENDERTON, H.B.: 'Elements of recursion theory', in J. Bar-
bra and Logic 12, no. 5 (1973), 327-345. (Algebra i Logika 12,
wise (ed.): Handbook of mathematical logic, North-Holland,
no. 5 (1973), 577-602)
1977, pp. 527-566.
[16] GARSAID, F.A.: 'The braid group and other groups', Quart. J.
Math. Oxford Ser. (2) 20 (1969), 235-254. AMS 1980 Subject Classification: 68020, 68025,
[17] MARKov, A.A.: 'On the unsolvability of certain problems in
03DXX
topology', Dokl. Akad. Nauk SSSR 123, no. 6 (1958), 978-980
(in Russian).
[18] DAVIS, M. and MASCHLER, M.: 'Reductions of Hilbert's tenth ALGORITHMIC REDUCIBILITY - One of the basic
problem', J. Symbolic Logic 23 (1958), 183-187. concepts in the theory of algorithms (cf. Algorithms,
[19] ROBINSON, 1.: 'Existential definability in arithmetic', Trans.
theory of) and its applications. Its background was the
Amer. Math. Soc. 72 (1952), 437-449.
[20] MATIYASEVICH, YU.V.: 'Enumerable sets are Diophantine', fact that solvability (and un solvability) of many algo-
Soviet Math. Dokl. 11, no. 2 (1970), 354-357. (Dokl. Akad rithmic problems (d. Algorithmic problem) is not esta-
Nauk SSSR 191, no. 2 (1970), 279-282) blished directly, but rather by reducing some algo-
[21] MATIYASEVICH, Yu.V.: 'Diophantine representation of enumer-
able predicates', Math. USSR-Izv. 5, no. I (1971), 1-28. (Izv. rithmic problem, already demonstrated to be unsolv-
Akad. Nauk SSSR Ser. Mat. 35 (1971),3-30) able, to the problem in question, or by reducing the
S1. Adyan latter problem to some other one which has already
Editorial comments. Church' thesis is often formulated been solved. Thus, the problem of homotopy of paths
as stating that the idea of a recursive function (as defined in a polyhedron is shown to be unsolvable by reducing
in various equivalent ways, e.g. via Turing machines, cf. the problem of equality of words in the corresponding
Turing machine), is the correct formalization of our intuitive fundamental group to this problem.

147
ALGORITHMIC REDUCIBILITY

In what follows, the simplest examples of algorit1ullic ferent kinds of algorithmic reducibility) of enumerable
reducibility of number-theoretical predicates and func- sets and the rate of growth of the complexity of initial
tions (i.e. those defined on natural numbers) will be fragments of enumerable sets. A special kind of reduci-
presented: bility - polynomial reducibility or p-reducibility (reduci-
P(x) = Q(2x + 1), bility with some limitation imposed on time) - is
where P and Q are predicates. The problem of solving employed in proving the universal character of com-
the predicate P, that is, of establishing the truth or binatorial optimization problems from various branches
falsehood of P for different values of x, is reduced to of mathematics [5]; for more details on this subject see
the problem of solving Q - or, 'P is reduced to Q', for [1]. The methods of measure theory [1], and the forcing
short. One may also say that the set of truths of P, i.e. method [4] have in turn been used in the study of
{x: P(x)}=Mp , is reduced to MQ={x: Q(x)}. degrees.
Let xl u References
f(x) = ~ IIg(u,y), [I) ROGERS, JR., H.: Theory of recursive functions and effective
u=ly=1 computability, McGraw-Hill, 1967.
[2) POST, E.L.: 'Recursively enumerable sets of positive integers
where f(x) and g(u,y) are number-theoretic functions.
and their decision problems', Bull. Amer. Math. Soc. 50 (1944),
The problem of computation of the function f is reduced 284-316.
to the computation of g or, for short, f is reduced to g. [3) KLEENE, S.c. and POST, E.L.: 'The upper semi-lattice of
The concept of algorithmic reducibility was given a degrees of recursive unsolvability', Ann. of Math. (2) 59, no. 3
(1954), 379-407.
more exact form by A.M. Turing: If, roughly speaking, [4) SELMAN, A.L.: 'Applications of forcing to the degree-theory of
some Turing machine transforms a sequence of encoded the arithmetical hierarchy', Proc. London Math. Soc. (3) 2S
values of the function g into a sequence of encoded (1972), 586-602.
[5) KARP, R.M.: 'Reducibility among combinatorial problems', in
values of the function f, then f is reduced to g. S.c. R.E. Miller and l.W. Thatcher (eds.): Complexity of computer
Kleene [1] formulated the related concept of relative computations, Plenum Press, 1972, pp. 85-103. A A M h 'k
.. uc m
computability with the aid of recursive systems of equa-
tions. Mter arithmetization, each algorithmic problem Editorial comments.
is reduced to the problem of computing some number- References
theoretic function f If f is reduced to g according to [A 1] R.I.: Recursively enumerable sets and degrees,
SOARE,
Springer, 1986.
Turing, f";;;;' Tg, and g is reduced to f, g,,;;, Tf, then one [A2] SIMPSON, S.G.: 'Degrees of unsolvability: a survey of results',
says that f and g have the same degree of unsolvability in J. Barwise (ed.): Handbook of mathematical logic, North-
or that f Tg The relation ,,;;, T is both reflexive and Holland, 1977, pp. 631-652.
transitive. Thus, all functions (and sets of natural AMS 1980 Subject Classification: 03DXX, 03FXX,
numbers or their characteristic predicates) are subdi- 68CXX
vided into equivalence classes, known as Turing degrees
or T-degrees [3]. Most algorithmic problems considered ALGORITHMIC THEORY OF SETS, algorithmic set
in logic and in mathematics are expressible as problems theory - See Recursive set theory.
of membership in enumerable sets of constructive AMS 1980 Subject Classification: 03E45
objects. In this context the study of enumerable sets
was initiated in the nineteen-fourties by E.L. Post [2]; ALGORITHMS, COMBINATIONS OF - The name of
he introduced certain special kinds of algorithmic redu- several methods for constructing new algorithms (cf.
cibility, in addition to Turing's kind, and formulated Algorithm) from given algorithms.
the problem of reducibility as follows: Can various enu- As applied to normal algorithms (cf. Normal algo-
merable unsolvable sets be reduced (according to Tur- rithm), the following combinations (compositions) are
ing) to each other? It was subsequently established that best known: normal composition (IB o 2r) of two normal
the enumerable sets form an infinite, very rich system algorithms 2r and IB, normal union (2r" IB) of two nor-
of T-degrees. This was supplemented by discovering the mal algorithms 2r and IB, normal branching (2rYIBI G) of
so-called priority method, which is extensively employed two normal algorithms 2r and IB controlled by a normal
in the theory of algorithms. algorithm G, and normal repetition 9! OG of a normal
Subsequently, degrees of unsolvability found applica- algorithm 9! controlled by a normal algorithm G. If
tions in other branches of mathematics as well. For 9!, IB and G are normal algorithms in some alphabet A,
instance, for all T-degrees a and b of enumerable sets, the above combinations are normal algorithms in a cer-
if a";;' Tb, then there exists a finitely-determined group tain given extension of A and satisfy the following con-
in which the problem of equality of words has degree a, ditions: a) for any word P in A, IBo9!)lP J~I.B l2r lP JJ
while the conjugation problem has degree b. There is a is true (the composition theorem); b) for any word P in
close connection between degrees (concerning the dif- A, ( 2r "IS lP J~2r lP JIBlP J is true (the union theorem);

148
ALGORITHMS. EQUIVALENCE OF

c) for any word P in A tion, if-then-else and while, from simple assignment state-
ments [A1].
(ID'IS IC) lP J ~ { ill:lPJ
IS lP J
ifClPJoF,
if C lP J$:F, References
[A1] BOHM, C. and JACOPINI, G.: 'Flow diagrams, Turing machines
moreover, if (~?B I~ lP J has been defined, then ~ lP J and languages with only two formation rules', Comm. ACM 9
is defined as well (the branching theorem); d) for any (1966), 366-371.

words P and Q in A, the graphic equality AMS 1980 Subject Classification: 68C05
(~O~HP J Q is true if and only if it is possible to
indicate a sequence of words Po, ... ,Pk (k ~ I) over ALGORITHMS, EQUIVALENCE OF - A binary rela-
the alphabet A such that tion connecting algorithms of a fixed type and express-
Po P, Pk ::g:: Q, ing the fact that any two algorithms thus connected
yield the same results if fed with the same given type of
Pi ~ ill:lPi-d (i=l, ... ,k),
inputs (and may also yield some additional information
ClPkJ $: F (i=l, ... ,k-l), as regards the computations thus performed - the so-
ClPkJ ::g:: F called history of the computation). A few typical exam-
(the repetition theorem). Similar theorems may also be ples of this connection are given below.
obtained for Turing machines (cf. Turing machine). In a) Consider all possible recursive schemes - i.e. sys-
the theory of recursive functions (cf. Recursive func- tems of equations which define n-place partial recursive
functions. Two schemes which define functions <I> and
tion), the combinations of these functions which are
most frequently employed are those supplied by the
1/; are called functionally eqUivalent if for any natural
numbers Xl, . . . ,Xn the conditional equality
substitution operator, the primitive recursion operator
and the /L-operator. <P(X1,'" ,xn ) ~ 1/-{X 1, . . ,xn )
Theorems on combinations of algorithms reveal an is true (it is considered to be true if both its sides are
important feature of the existing formalizations of the meaningful at the same time and, if meaningful, assume
general concept of an algorithm - to wit, their 'clo- the same value). It was shown by S.c. Kleene (see, for
sure' under the natural ways of combination of algo- example, [ID that for any everywhere-defined comput-
rithms. This fact is one of the principal arguments in able operator lR over recursive schemes it is possible to
favour of the basic assumption about algorithms (the find a scheme S such that S and lR(S) are functionally
Church thesis). Theorems on compositions of algo- equivalent (the so-called fixed-point theorem). The
rithms are an important part of the general theory of consequence of this is, in particular, the
algorithms. Having been demonstrated once, they make Uspenskii - Rice theorem on the un solvability of any
it possible to determine the realizability of complicated property of recursive schemes which is invariant with
and cumbersome algorithms, without actually writing respect to functional equivalence if there exist schemes
down their schemes. S 1 and S 2 such that S 1 has this property, while
Of major interest in the general theory of algorithms scheme S 2 does not have it. The un solvability of the
is the problem of synthesizing an arbitrary algorithm relation of functional equivalence itself follows from
within some class of interest, using a pre-determined set this theorem.
of composition operators. b) Consider normal algorithms (cf. Normal algorithm)
References over a fixed alphabet A. Two such algorithms ~ and ?B
[1] MARKov, A.A.: Theory of algorithms, Israel Progr. Sci. Trans!., are called equivalent with respect to A [2] if for any
Jerusalem, 1961 (translated from the Russian). Also: Trudy
Mat. Inst. Steklov. 42 (1954).
word P in A the following condition is satisfied: If one
[2] KLEENE, S.c.: Introduction to metamathematics, North-Holland, of these algorithms converts P to some word Q which is
1951. also contained in the alphabet A, then the second algo-
[3] OUSPENSKl, V.A. [VA USPENSKlIl: Le~ons sur les fonctions cal-
rithm also converts P to Q. These algorithms are called
culable, Hermann, 1966 (translated from the Russian).
[4] MAL'TSEV, A.I.: Algorithms and recursive functions, Wolters- completely equivalent with respect to A if for any P in A
Noordhoff, 1970 (translated from the Russian). the conditional equality
N.M. Nagornyr ill:(P) ~ IS(P)
Editorial comments. The standard terminology for the
above composition operators is in the West: 'sequential
is true. Both these relations are unsolvable.
composition', 'non-deterministic choice', 'if-then-else' and c) Consider logical schemes of algorithnls (Yanov
the 'while loop', in order of appearance. A representative schemes, [3D. The realization of such a scheme is a
result of the type indicated is the characterization of the sequence of operators involved in its execution, from
power of structured programming: Every flowchart can be beginning to end. Two schemes are called equivalent if
expressed as a flowchart obtained by sequential composi- the sets of their realizations coincide. The equivalence

149
ALGORITHMS, EQUIVALENCE OF

relation of Yanov schemes proved to be solvable, and a [10], [11], and others. In particular, Markov rendered
complete system of equivalent transformations has been the concept of an algorithm more precise by the intro-
constructed for it [3], [4]. duction of the concept of a normal algorithm [10]. A.N.
A detailed study of the relations of algorithmic Kolmogorov [12], [13] suggested the most general
equivalence is of major importance in the context of a approach to the concept.
number of tasks of current interest (especially in the Since algorithms exclusively deal with the so-called
theory of programming schemes), which can only be constructive objects (cf. Constructive object), the ideas
solved by advanced techniques of equivalent transfor- and methods of the theory of algorithms can be applied
mations of algorithms. Such problems include the to non-constructive objects only if these are encoded,
translation of algorithms from one language to another or re-named, as constructive objects. The study of the
and their optimization (transformations aimed at general properties of such encodings (mainly of situa-
improving their 'working characteristics'). Special atten- tions in which such encodings, or names, are natural
tion is given in this connection [5] to finding solvable numbers) is the subject of the theory of enumeration
relations of algorithmic equivalence which would per- [14], which is an important part of the theory of algo-
mit complete systems of equivalent transformations rithms.
that are convienient with regard to their applications. Fundamental concepts in the theory of algorithms. The
The concept outlined in [3] to a large extent determined domain of applicability of an algorithm is the set of the
the general approach to the study of algorithmic objects to which it is applicable. It is said that an algo-
equivalence. rithm ~ 1) 'computes a function l' if its domain coin-
References cides with the range of definition of f, and if ~ con-
[I] KLEENE, S.c.: Introduction to metamathematics, North-Holland, verts any x from its domain into f (x); 2) 'solves a set
I95l. A with respect to a set X' if it is applicable to any x
[2] MARKov, A.A.: Theory of algorithms, Israel Progr. Sci. Trans!.,
1961 (translated from the Russian). Also: Trudy Mat. Inst.
from X and converts all x from X n A to the word
Steklov. 42 (1954). 'yes', and all x from X\A to the word 'no'; and 3)
[3] YANOV, Yu.!.: 'On logical schemes of algorithms', Problemy 'counts (enumerates) a set B' if its domain is the
Kibernet. 1 (1958), 75-127 (in Russian).
[4] ERSHOV, A.P.: 'Operator algorithms III. (On Yanov operator
sequence of natural numbers, and the set of results is
schemes)" Problemy Kibernet. 20 (1968), 181-200 (in Russian). B. A function is called computable if an algorithm
[5] ERSHOV, A.P.: 'The contemporary state of the theory of pro- which evaluates the function exists. A set is said to be
gramming schemes', Problemy Kibernet. 27 (1973),87-110 (in
solvable with respect to X if there exists an algorithm
Russian). NM N -
. . agornYI which solves it with respect to X. A set is said to be
AMS 1980 Subject Classification: 68C05 enumerable if it is empty or if there exists an algorithm
enumerating it.
ALGORITHMS, THEORY OF - The branch of A detailed analysis of the concept of an algorithm
mathematics dealing with the general properties of reveals that: 1) the range of possible initial data and
algorithms (cf. Algorithm). Preliminary notions about the range of applicability of an algorithm are enumer-
algorithms have played a role in mathematics able sets. In tum, 2) for any pair of enumerable sets,
throughout history. However, the concept of an algo- one of which is included in the other, it is possible to
rithm itself was only formulated in the twentieth cen- find an algorithm in which the larger set is the range of
tury and became the object of independent study (at possible inputs, while the smaller set is the range of
first only in its rather vaguely defined form) in the outputs. The following basic theorems hold: 3) a func-
nineteen-twenties by the intuitionistic school of L.E.J. tion f is computable if and only if its graph, i.e. the set
Brouwer and H. Weyl [1] (cf. Intuitionism). Systematic of all pairs of the form <x, f(x, is enumerable; 4) a
studies began in 1936, with the publication by A. subset A of an enumerable set X is solvable with
Church [2] of the first formalization of the concept of a respect to X if and only if A and X \ A are enumer-
computable function (he suggested that the concept of able; 5) if A and B are enumerable, A U B and A n B
an everywhere-defined computable function with are also enumerable; 6) in each infinite enumerable set
natural arguments and natural values be identified with X there exists an enumerable subset with a non-
the concept of a general recursive function) and of the enumerable complement (in view of 4) this enumerable
first example of a non-computable function. A.M. Tur- subset will not be solvable with respect to X); and 7)
ing [3], [4] and E.L. Post [5] gave the first formaliza- for each infinite enumerable set X there exists a com-
tions of the concept of an algorithm (in terms of ideal- putable function, defined on a subset of this set, which
ized computers, cf. Turing machine). Subsequent cannot be extended to a computable function defined
development of the theory of algorithms is due to the on all of X. Statements 6) and 2) in combination yield
studies of Kleene, Post [6], [7], [8], A.A. Markov [9], an example (d. Algorithm) of an algorithm with an

150
ALGORITHMS, THEORY OF

unsolvable domain of applicability. Solvable and enu- of description may be defined in various ways, and it
merable sets represent the simplest (and the most may well be that A will tum out to be more complex
important) examples of sets, the structure of which is than B when one definition is adopted, while B will
defined by certain algorithmic procedures. A systematic tum out to be more complex than A according to
study of sets of constructive objects from the point of another definition. In order to be able to speak about
view of the properties of these sets as related to the the complexity of algorithms, one must first specify
availability of some of algorithms, constitutes the so- some exact language in which the algorithms are to be
called algorithmic theory of sets; certain concepts, written, and define the complexity of an algorithm as
methods and results of this theory find analogues in the complexity of its notation; the complexity of nota-
descriptive set theory. tion can in tum be determined in various ways (e.g. as
Algorithmic problems. The problem of constructing an the number of symbols of a given type which are
algorithm with certain given properties is known as the included in the notation, or as a selection of such
algorithmic problem. As a rule, the property of the numbers computed for the various types of symbols).
algorithm which is sought is formulated in terms of the In order to speak about computational complexity, one
correspondence which should hold between the inputs must specify the exact form of the computation as a
and the results of the algorithm. Some important chain of constructive objects successively replacing each
examples of algorithmic problems are: the problem of other, as well as some criterion of the complexity of
computing a given function (i.e. finding an algorithm such a chain - the number of 'links' (or steps) in the
which computes the function); the problem of solving a chain alone or in combination with the 'dimension' of
given set (i.e. finding an algorithm which solves this the links, etc. In any event, the computational complex-
set with respect to some other set); and the problem of ity will depend on the input with which the computa-
enumeration of a given set (i.e. to find an algorithm tion is begun; for this reason, the computational com-
enumerating the given set). All the examples of algo- plexity is a function which assigns to each object within
rithmic problems in various branches of mathematics, the domain of the algorithm the complexity of the
given in the section 'Applications' below, are problems corresponding chain. The development of the methods
of solving. That an algorithmic problem is unsolvable of evaluation of the complexity of algorithms and of
means that there is no corresponding algorithm; computations has a high theoretical and practical
theorems which establish the un solvability of such importance; however, as distinct from the descriptive
problems belong to the most important theorems in the theory of algorithms, which has now crystallized into
theory of algorithms. For instance, if an algorithm has an integral mathematical discipline [11], [15], [16], the
an unsolvable domain, the algorithmic problem of solv- metric theory of algorithms is only in the process of
ing this domain with respect to the set of all possible being created [17], [18], [19], [20].
inputs is unsolvable. By reducing to this problem unsol- Applications of the theory of algorithms. These are
vability, most other solvability problems were found to encountered in all branches of mathematics in which
be unsolvable; this applies, in particular, to all the algorithmic problems are encountered. Such problems
problems listed in the section 'Applications' below. The may arise in mathematical logic and in model theory;
question as to whether or not the unsolvability of any for each theory, the problem which arises is to solve the
unsolvable problem can be established in this way, set of all true or demonstrable statements of this theory
forms the so-called algorithmic reducibility problem. with respect to the set of all its statements (the theories
The metric theory of algorithms. The theory of algo- divide into solvable or unsolvable theories, depending
rithms comprises the descriptive (qualitative) and the on the solvability or unsolvability of the problem in
metric (quantitative) theory. The former deals with question). Church [21], [22] proved in 1936 that the
algorithms from the point of view of the correspon- problem of solving the set of all true assumptions of
dence they establish between the initial data and the predicate logic was unsolvable; other important results
results; in particular, all algorithmic problems discussed on the subject are due to A. Tarski, A.I. Mal'tsev and
in the preceding section belong to the qualitative to others [23]. Unsolvable algorithmic problems are
theory. The latter studies algorithms from the point of encountered in algebra (the word (identity) problem for
view of the complexity both of the algorithms them- semi-groups and, in particular, for groups). The first
selves (cf. Algorithm, complexity of description of an) examples of semi-groups with unsolvable word problem
and of the 'computations' defined by the algorithms, were found, independently of each other, by Markov [9]
i.e. the processes of successive transformations of con- and by Post [8] in 1947, while an example of a group
structive objects (cf. Algorithm, computational complex- with unsolvable word problem was found by P.S. Novi-
ity of an). It is important to note that both the compu- kov [24], [25] in 1952. Markov [26] in 1958 proved that
tational complexity of the algorithm and the complexity the problem of homeomorphy in topology was unsolv-

151
ALGORITHMS, THEORY OF

able for an important class of cases. In number theory, [17] MARKov, A.A.: 'Normal algorithms connected with the com-
putation of Boolean functions', Izv. Akad. Nauk SSSR Ser.
Yu.V. Matiyasevich proved in 1970 that the problem of Mat. 31 (1967), 161-208 (in Russian).
solvability of Diophantine equations was unsolvable [18] 'fRAKHTENBROT, B.A.: Complexity of algorithms and calcula-
[27], [28]. Similar examples can also be quoted from tions, Novosibirssk, 1967 (in Russian).
many other branches of mathematics. [19] Problems of mathematical lOgic. Complexity of algorithms and of
classes of computable functions, Moscow, 1970 (in Russian;
The theory of algorithms is closely connected with translated from the English).
mathematical logic, since the concept of an algorithm [20] Complexity of computation of algorithms, Moscow, 1974 (in
forms the base of one of the central concepts of Russian; translated from the English).
[21] CHURCH, A.: 'A note on the Entscheidungsproblem', J. Sym-
mathematical logic - the concept of a calculus, as a bolic Logie 1, no. I (1936), 40-41.
result of which the GOdel incompleteness theorem of [22] CHURCH, A.: 'Correction to 'a note on the Entscheidungsprob-
formal systems may be obtained from theorems of the lem", J. Symbolic Logic 1, no. 3 (1936), 101-102.
[23] ERsHOV, Yu.L., ET AL.: 'Elementary theories', Russian Math.
theory of algorithms [29]. Finally, the theory of algo- Surveys 20, no. 4 (1965),35-105. (Uspekhi Mat. Nauk 20, no. 4
rithms is closely connected with the foundations of (1965),37-108)
mathematics, where one of the key problems is the rela- [24] NOVIKOV, P.S.: 'On algorithmic unsolvability of the word
tion between the constructive and the non-constructive. problem', Dokl. Akad. Nauk SSSR 85, no. 4 (1952),709-712
(in Russian).
In particular, the theory of algorithms provides the [25] NOVIKOV, P.S.: On algorithmically unsolvable problems of word
apparatus for the development of the constructive identity in groups, Moscow, 1955 (in Russian).
direction in mathematics. It was suggested by KoImo- [26] MARKov, A.A.: 'Unsolvability of the homeomorphy problem',
Dokl. Akad. Nauk SSSR 121, no. 2 (1958), 218-220 (in Rus-
gorov [30] in 1965 that the theory of algorithms be used sian).
as the foundation of information theory (cf. Algo- [27] MATIYASEVICH, Yu.V.: 'The Diophantiness of enumerable
rithmic information theory). The theory of algorithms is sets', Dokl. Akad. Nauk SSSR 191, no. 2 (1970), 279-282 (in
Russian).
the theoretical foundation for a number of problems in
[28] MATIYASEVICH, YU.V.: 'Diophantine sets', Russian Math. Sur-
computational mathematics, and is closely related to veys 27, no. 5 (1972), 124-164. (Uspekhi Mat. Nauk 27, no. 5
cybernetics, in which an important subject is the study (1972), 185-222)
of control algorithms. [29] USPENSKIi, V.A.: 'An elementary exposition of GOdel's incom-
pleteness theorem', Russian Math. Surveys 29, no. I (1974),63-
References 106. (Uspekhi Mat. Nauk 29, no. I (1974), 3-47)
[I] WEYL, H.: Philosophie der Mathematik und NaturwissenschaJ- [30] KOLMOGOROV, A.N.: 'Three approaches to the definition of
ten, Leibniz Verlag, Miinchen, 1950. the concept of 'quantity of information", Problemy Peredachi
[2] CHURCH, A.: 'An unsolvable problem of elementary number Informatsii 1, no. 1 (1965), 3-11 (in Russian). VA U. k
.. spens II
theory', Amer. J. Math. 58 (1936),345-363.
[3] TuRING, A.M.: 'On computable numbers, with an application Editorial comments. Since the above text was prepared
to the Entscheidungsproblem', Proc. London Math. Soc. (2) 42 also the theory of complexity of algorithms has matured into
(1937), 230-265. a well-developed branch of mathematics.
[4] TuRING, A.M.: 'On computable numbers with an application
to the Entscheidungsproblem, a correction', Proc. London See also the comments to Algorithm, computational
Math. Soc. (2) 43 (1937),544-546. complexity of an.
[5] POST, E.L.: 'Finite combinatory processes - formulation I', J. In addition to [27], [28] one may consult [A1J, [A2]. A
Symbolic Logic 1, no. 3 (1936), 103-105. thorough treatise on computational complexity is [A3].
[6] POST, E.L.: 'Formal reductions of the general combinatorial
decision problem', Amer. J. Math. 65 (1943), 197-215. References
[7] POST, E.L.: 'Recursively enumerable sets of positive integers [A1] DAVIS, M.: 'Hilbert's tenth problem is unsolvable', Amer.
and their decision problems', Bull. A mer. Math. Soc. 50 (1944), Math. Monthly 80 (1973), 233-269.
284-316. [A2] MAmAsEvIc, J.V.: 'A new proof of the theorem of exponen-
[8] POST, E.L.: 'Recursive unsolvability of a problem of Thue', J. tial Diophantine representation of enumerable sets', J. Soviet
Symbolic Logic 12, no. I (1947), I-II. Math. 14 (1980),1475-1486.
[9] MARKov, A.A.: Dokl. Akad. Nauk SSSR 55, no. 7 (1947), [A3] WAGNER, K. and WECHSUNG, G.: Computational complexity,
587-590. Reidel, 1986.
[10] MARKov, A.A.: 'Theory of algorithms', Trudy Mat. Inst. Stek-
AMS 1980 Subject Classification: 68CXX, 03DXX
lov. 38 (1951), 176-189 (in Russian).
[11] MARKov, A.A.: Theory of algorithms, Israel Progr. Sci. Trans!., ALIQUOT RATIO, aliquot fraction - A fraction of
1961 (translated from the Russian). Also: Trudy Mat. Inst.
Steklov. 42 (1954). the type I / n, where n is a natural number. It is essen-
[12] KOLMOGOROV, A.N.: 'On the concept of an algorithm', tial, in solving several physical and mathematical prob-
Uspekhi Mat. Nauk 8, no. 4 (1953), 175-176 (in Russian). lems, that each positive rational number is represent-
[13] KOLMOGOROV, A.N. and USPENSKIi, V.A.: 'On the definition
of an algorithm', Uspekhi Mat. Nauk 13, no. 4 (1958),3-28 (in able as a sum of a finite number of aliquot ratios, with
Russian). different denominators. Thus. 3/ II = 1/6 + 1/11 +
[14] ERSHOV, YU.L.: The theory of enumeration. 1-2 .. Novosibirssk, 1/66. Aliquot ratios were extensively employed III
1969-1973 (in Russian).
[15] MAL'TSEV, A.I.: Algorithms and recursive functions, Wolters-
Ancient Egypt and, as a result, received the name of
Noordhoff, 1970 (translated from the Russian). Egyptian fractions.
B. M. Bredikhin
[16] ROGERS, JR., H.: Theory of recursive functions and effective
computability, McGraw-Hili. 1967. AMS 1980 Subject Classification: 10A99

152
ALMOST-PERIOD

ALMOST-COMPLEX STRUCTURE - A tensor field uniquely defined by its value 1p at a fixed point p EM.
I of linear transformations of the tangent spaces on a This represents an invariant complex structure in the
manifold M satisfying the condition tangent space TpM relative to the isotropic representa-
12 = -id, tion (see Invariant object on a homogeneous space).
Methods of the theory of Lie groups allow one to con-
i.e. a field of complex structures in the tangent spaces
struct a wide class of homogeneous spaces having an
TpM, p EM. An almost-complex structure I dete~es
invariant almost-complex structure (both integrable and
a decomposition T C M = V + + V_of the complexiflca-
tion rc M of the tangent bundle in a direct sum of two
non-integrable) and to classify invariant almost-
complex structures under different assumptions (see
complex mutually-conjugate subbundles V + and V_
[2]). For instance, any quotient space G / H of a Lie
consisting of eigen vectors of the affinor I (extended by
linearity on rc M) with eigen values i and - i, respec-
group G by the subgroup H consisting of fixed points
of an automorphism of even order of G has an invari-
tively. Conversely, a decomposition of TC M in a direc.,!
ant almost-complex structure. An example is the 6-
sum of mutually-conjugate vector subbundles S, S dimensional sphere S6, considered as the homogeneous
defines an almost-complex structure on M for which space G2 / SU(3); none of the invariant almost-
V+=S. complex structures on S6 is integrable.
An almost-complex structure I is called integrable if The existence of an almost-complex structure on a
it is induced by a complex structure on M, i.e. if there manifold imposes certain restrictions on its topology -
exists an atlas of admissible charts of the manifold M it must be of even dimension, oriented, and in the com-
in which the field I has constant coordinates Ii. A pact case all its odd-dimensional Stiefel- Whitney
necessary and sufficient condition for the integrability
classes must vanish. Among the spheres only the
of an almost-complex structure is that the subbundle
spheres of dimensions 2 and 6 admit an almost-
V + is involutive, i.e. that the space of its sections is
complex structure.
closed with respect to commutation of (complex) vector
fields. The condition for the subbundle V + to be invo- References
[1] KOBAYASHI, S.: TransfoT7lUJtion groups in differential geometry,
lutive is equivalent to the vanishing of the vector- Springer, 1972.
valued 2-form N(I, I) associated with I and given by [2] KOMRAKOY, B.P.: Structure on manifolds and homogeneous

the formula spaces, Minsk, 1978 (in Russian).


[3] LICHNEROWICZ, A.: Global theory of connections and holonomy
N(l, lXX, Y) = [IX, Iy]-I[X, IY]-I[l, XYj-[X, y], groups, Noordhoff, 1976 (translated from the French).
[4] WELLS, JR., R.O.: Differential analysis on complex manifolds,
where X and Y vector fields. This form is called the tor- Springer, 1980.
sion tensor, or the Nijenhuis tensor, of the almost- [5] HORMANDER, L.: An introduction to complex analysis in several

complex structure. The torsion tensor N(I, I) can be variables, North-Holland, 1973. D. V. Alekseevskii
considered as first-order differentiation on the algebra
Editorial comments. The theorem that an almost-
of differential forms on M of the form
complex structure is integrable, i.e. comes from a complex
N(I, I) = [1,[1, dll+d, structure, if and only if its Nijenhuis tensor vanishes, is due
where d is the exterior differential and I is considered to A. Newlander and L. Nirenberg [A1].
as a differentiation of order zero. References
From the point of view of the theory of G-structures [A1] NEWLANDER, A. and NIRENBERG, L.: 'Complex analytic coor-
dinates in almost complex manifolds', Ann. of Math. 65
an almost-complex structure is a GL(m, C)-structure, (1957), 391-404.
where m = (1 / 2)dim M, and the torsion tensor N (I, I)
AMS 1980 Subject Classification: 53C15, 58812
is the tensor defined by the first structure function of
this structure. A GL(m, C)-structure is a structure of
ALMOST-EVERYWHERE, for almost all x (with
elliptic type, therefore the Lie algebra of infinitesimal respect to a measure JL) - An expression indicating that
automorphisms of an almost-complex structure satisfies one speaks about all x of a measure space X with the
a second-order system of elliptic differential equations possible exception of some set A ~X of measure zero:
[1]. In particular, the Lie algebra of infinitesimal auto-
morphisms of an almost-complex structure on a com- JL{A)=O. V.I. Bityutskov
pact manifold is finite-dimensional, and the group G of AMS 1980 Subject Classification: 28-XX
all automorphisms of a compact manifold with an
almost-complex structure is a Lie group. For non- ALMOST-PERIOD - A concept from the theory of
compact manifolds this statement is, in general, not almost-periodic functions (cf. Almost-periodic func-
true. tion); a generalization of the notion of a period. For a
If the automorphism group G acts transitively on the uniformly almost-periodic function f(x), - oo<x<oo,
manifold M, then the almost-complex structure I is a number T=Tff.) is called an f.-almost-period of f(x) if

153
ALMOST-PERIOD

for all x, almost-periodic analytic functions turns out to be a


If(X +r)-f(x) I < (. theory analogous to that of almost-periodic functions of
For generalized almost-periodic functions the concept a real variable (cf. Almost-periodic function). Therefore,
of an almost-period is more complicated. For example, many important results of the latter theory can be
in the space SIf an (-almost-period T is defined by the easily carried over to almost-periodic analytic func-
inequality D.1,[f(x+'T),f(x)) < (, tions: the uniqueness theorem, Parseval's equality, rules
of operation with Dirichlet series, the approximation
where DSj [f, cf>] is the distance between f(x) and cf>(x) in theorem, and several other theorems.
the metric of SIf. References
A set of almost-periods of a function f(x) is said to [1) BOHR, H.: Almost-periodicfunctions, Chelsea, reprint, 1947
be relatively dense if there is a number L=L(,jO (translated from the German).
[2) LEVITAN, B.M.: Almost-periodic functions, Moscow, 1953,
such that every interval (a, a+ L) of the real line con-
Chapt. 7 (in Russian).
tains at least one number from this set. The concepts of E.A. Bredikhina
uniformly almost-periodic functions and that of Editorial comments. The hyphen between almost and
Stepanov almost-periodic functions may be defined by periodic is sometimes dropped.
requiring the existence of relatively-dense sets of (- References
almost-periods for these functions. [A1] CORDUNEANU, c.: Almost periodic functions, Interscience,
1961, Chap!. 3.
References
[I) LEVITAN, B.M.: Almost-periodic functions, Moscow, 1953 (in AMS 1980 Subject Classification: 42A75, 43A60,
Russian).
E.A. Bredikhina 30H05, 30EXX
Editorial comments. For the definition of Sf and its
ALMOST-PERIODIC FUNCTION - A function
metric Os: see Almost-periodic function. The Weyl,
representable as a generalized Fourier series. There are
Besicovitch and Levitan almost-periodic functions can also
be characterized in terms of Sf (-periods. These characteri-
several ways of defining classes of almost-periodic func-
zations are more complicated. A good additional reference tions, based respectively on notions of closure, of an
is [A1), especially Chapt. II. almost-period and of translation. Each of these classes
can be obtained as a closure, with respect to some
References
[A1) BESICOVITCH, A.S.: Almost periodic functions, Cambridge metric, of the set of all finite trigonometric sums.
Univ. Press, 1932. Let DG[f(x), cf>(x)] be the distance of two functions
AMS 1980 Subject Classification: 42A75, 43A60 f(x) and cf>(x) in a metric space G of real- or complex-
valued functions on R. In the following, G will be one
ALMOST-PERIODIC ANALYTIC FUNCTION - An of the spaces U, S, WP, or BP. Here U is the set of
analytic function f(s), S=(J+iT, regular in a strip continuous bounded functions on the real line with the
- 00 ';;;;'a<(J<f3.;;;;, + 00, and expandable into a series metric
Du[f(x), <j>(x) = sup
-oo<x<oo
If(x)-<j>(x) I;
and S, WP and BP for p;;:.1 are the sets of functions
where the an are complex and the An are real numbers.
A real number T is called an (-almost-period of f(s) if that are measurable and whose p-th powers are integr-
able on every finite interval of the real line, the metrics
for all points of the strip (a,f3) the inequality
being
If(s+i'T)-f(s) I < Ds~[f(x), <j>(x) =
holds. An almost-periodic analytic function is an ana- IX+/ lilP
_oos~p<oo [ I [ I f(x)-<j>(x) IP dx ,
lytic function that is regular in a strip (a,f3) and
possesses a relatively-dense set of (-almost-periods for Dwp[f(x, <j>(x)] = lim D\~[f(x). <j>(x)],
every (>0. An almost-periodic analytic function on a !----x

closed strip a';;;;'(J.;;;;,f3 is defined similarly. An almost-


periodic analytic function on a strip [lX, 13] is a uni-
Dsp[f(x), <t>(x)] = [

T_X
1
lim -2
7T
f I j(x)-<j>(x) IP dxlilP.
T

--7

formly almost-periodic function of the real variable 'T


Let T be the set of trigonometric polynomials
on every straight line in the strip and it is bounded in
v
[lX, 13], i.e. on any interior strip. If a function f(s), regu- ~
L." ak e
IA~ ,-
.
lar in a strip (a, 13), is a uniformly almost-periodic func- k - I

tion on at least one line 0 = 00 in the strip, then bound- where the Ak are arbitrary real numbers and the Uk are
edness of J(s) in [a,f3] implies its almost-periodicity on complex coefficients, and let the symbol Hc(T) denote
the entire strip [a, /3]. Consequently, the theory of the closure of T in G. The classes H veT) = U-a.p.,

154
ALMOST-PERIODIC FUNCTION

HSj(T)=SI/-a.p., HW"(T) = wP-a.p. and HRP = BP-a.p. Stepanov almost-periodic functions are, respectively,
denote, respectively, the classes of uniformly almost- non-trivial extensions of the class of continuous 2'17-
periodic functions, or Bohr almost-periodic functions, of periodic functions on R and the class of 2'17-periodic
Stepanov almost-periodic functions, of Weyl a1most- integrable functions on the interval [0, 2'17]. For these
periodic functions and of Besicovitch almost-periodic classes of almost-periodic functions the uniqueness
functions. These classes of almost-periodic functions are theorem remains valid.
invariant under addition. Together with j(x), each class A consequence of the definition of the classes of
also contained the functions f(x), I j(x) I and j(x )e iAx , almost-periodic functions through the concept of clo-
where A is a real number. The metrics DSj[f(x), cp(x)] sure is the approximation theorem: For every almost-
are topologically equivalent for all values of I; therefore periodic function j(x) from U (or SP or WJ') and every
it may be assumed that 1= 1. Let SI( -a.p. = SP -a.p., S 1_ >0 there is a finite trigonometric polynomial P(x) in
T, satisfying the inequality
a.p.=S-a.p., and BI-a.p.=B-a.p .. Then
U-a.p. C SP-a.p.cWp-a.p. C BP-a.p., p~l.
Dulf(x),p(x)] <

(Dsplf(x), P(x)] < , DW"lf(x), P(x)] < f).


If PI <P2 and p 1 ;;;'1, then
Sp, -a.p. C SPI -a.p., Wi" -a.p. C WPI -a.p., The approximation theorem may serve as a starting
BP' - a.p. C BPI - a.p .. point of the definition of various classes of almost-
periodic functions. The approximating polynomials
For every j(x)EB-a.p., the mean value P(x) may contain 'extraneous' exponents, i.e.
1 T exponents different from the Fourier exponents of j(x).
M{f(x)} = lim -T jf(x)dx
0
T-HXl However, important for some applications of the
approximation theorem is the fact that the exponents
exists. The function a(A,j)=M{f(x)e- iAx }, where A is different from the Fourier exponents of j(x) can be
a real number, differs from zero only on a countable avoided in P(x).
set of values of A; any enumeration of this set is called In connection with the representability of almost-
the sequence {Ad, k = 1, 2, ... , of Fourier exponents periodic functions by generalized Fourier series, the
of j(x). problem of convergence criteria for these series arises
The numbers AA. =a(Abj) are called the Fourier and various summation methods for generalized
coefficients of j(x). With a function j(x) in any of the Fourier series (the Bochner- Fejer method, etc.)
classes defined above one can associate its Fourier become meaningful. Thus, the following criteria have
series
f(x) '" b4;\,/A..x. been obtained: absolute convergence of a generalized
k Fourier series if the Fourier exponents are linearly
For j(x)EB 2 -a.p. one has the Parseval equality independent; uniform convergence of a Fourier series
M{lf(x) 12} = ~IAA.. 12.
when I Ak I~oo as k~oo or when Ak~O as k~oo.
k The importance of criteria for uniform convergence
The Riesz - Fischer theorem can be generalized to the in the theory of almost-periodic functions is
class BP-a.p.: Let {Ad, k = 1,2, ... , be arbitrary real emphasized by the following theorem: If a tri-
numbers, and let {Ad, k = 1, 2, ... , be complex gonometric series ~kakeiAkX is uniformly convergent on
numbers for which ~:=I I Ak I <00. Then there is an the entire real line, then it is the Fourier series of its
j(x)EB 2 -a.p. which has the trigonometric series sum S(X)E U-a.p .. Corollary: There exists uniformly
~kAkeiA.X as its Fourier series. almost-periodic functions with an arbitrary countable
There is also a uniqueness theorem: If two functions set of Fourier exponents. If particular, the Fourier
j(x)EHG(T) and cp(x)EHG(T) have the same Fourier exponents of a uniformly almost-periodic function may
series, then have finite limit points or may even be everywhere
DGlf(x), cjl(x)] = o. dense.
In particular, for uniformly almost-periodic functions Other definitions of almost-periodic functions of the
the uniqueness theorem states that j(x)=cp(x) (for above classes rely on the concept of an almost-period
Stepanov almost-periodic periodic functions: almost- and generalizations thereof.
everywhere). A uniqueness theorem in the same sense Besides the concept of closure or that of an almost-
as for Fourier- Lebesgue series of 2'17-periodic func- period, the concept of a translation can also be used for
tions does not hold for Weyl or Besicovitch almost- the definition of almost-periodic functions. Thus, a
periodic functions. function j(x) is uniformly almost-periodic if and only if
The classes of uniformly almost-periodic and of every infinite sequence of functions

155
ALMOST-PERIODIC FUNCTION

f(x + h \ ),f(x + h 2 ), .. , where the translation References


numbers h \, h2' ... , are arbitrary real numbers, con- [A1] BOCHNER, S.: 'A new approach to almost periodicity', Proc.
Nat. Acad. Sci. USA 48 (1962), 2039-2043.
tains a uniformly convergent subsequence. This defini-
[A2] BRONSTETN, I.U.: Extensions of minimal transformation
tion serves as a starting point in considering almost- groups, Sijthoff & Noordhoff, 1979 (translated from the Rus-
periodic functions on groups. sian).
The main results in the theory of almost-periodic [A3] GoTTSCHALK, W.H. and HEDLUND, G.A.: Topological dynam-
ics, Amer. Math. Soc., 1955.
functions remain valid if one considers the concept of a [A4] KATZNELSON, Y.: An introduction to harmonic analysis,
generalized translation. Other generalizations are possi- Dover, reprint, 1968.
ble and useful: almost-periodic functions with values in [A5] LEVITAN, B.M. and ZHIKOV, V.V.: Almost periodiC functions
and differential equations, Cambridge Univ. Press, 1968
an n-dimensional space or in a Banach or metric space, (translated from the Russian).
and analytic or harmonic almost-periodic functions. [A6] NEMYTSKJT, V.V. and STEPANOV, V.V.: Quantitative theory of
differential equations, Princeton Univ. Press, 1960 (translated
References
from the Russian).
[I] BOHR, H.: Almost-periodicfunctions, Chelsea, reprint, 1947
[A7] REICH, A.: 'Prakompakte Gruppen und Fastperiodicitat',
(translated from the German).
Math. Z. 116 (1970), 216-234.
[2] BESICOVITCH, A.S.: Almost periodic functions, Cambridge Univ.
[A8] SELL, G.R.: Topological dynamics and ordinary differential
Press, 1932.
equations, v. Nostrand Reinhold, 1971.
[3] LEVITAN, B.M.: Almost-periodic functions, Moscow, 1953 (in
[A9] VEECH, W.A.: 'Almost automorphic functions on groups',
Russian).
Amer. J. Math. 87 (1965),719-751.
[4] KUPTSOV, N.P.: 'Direct and converse theorems of approxima-
[A 10] CORDUNEANU, c.: Almost periodic functions, Interscience,
tion theory and semigroups of operators', Russian Math. Sur-
veys 32, no. 4 (1968), liS-In (Uspekhi Mat. Nauk 23, no. 4 1961.
[A 11] LEVITAN, B.M.: 'The application of generalized displacement
(1968), 117-178)
[5] RUDIN, W.: Fourier analySiS on groups, Benjamin, 1962. operators to linear differential equations of the second order',
Amer. Math. Soc. Transl. Series 1 (1950),408-541.
[6] LEVITAN, B.M.: Generalized shift operators and some of their
(Uspekhi Mat. Nauk 4, no. 1 (29) (1949), 3-112)
applications, Moscow, 1962 (in Russian).
[7] KRASNOSEL'SKJi, A.M., BURD, V.SR. and KOLESOV, Yu.S.: AMS 1980 Subject Classification: 42A 75, 43A60
Non-linear almost-periodic oscillations, New York, 1973
(translated from the Russian).
ALMOST-PERIODIC FUNCTION ON A GROUP - A
E.A. Bredikhina generalization of almost-periodic functions defined on
Editorial comments. The theory of almost-periodic func- R. Let G be an (abstract) group. A bounded complex-
tions was initiated by H. Bohr, who developed the notion of valued function f (x), X E G, is called a right almost-
a uniformly almost-periodic function in his study of Dirichlet periodic function if the family f(xa), where a runs
series. This original definition was the one using (-almost- through the entire group G, is (relatively) compact in
periods and the approximation theorem was one of the main the topology of uniform convergence on G, i.e. if every
achievements of the theory. For a modern approach see sequence of functions f(ax\),f(xa2), ... , contains a
[A4], Sect. 5 and [A10], Chapt. 1,6. As to the equivalence
subsequence which is uniformly convergent on G. A left
of the approach starting from a certain structural property
almost-periodic function on G is defined similarly. It
that is a generalization of pure periodicity, and on the other
turns out that every right (left) almost-periodic function
hand, the approach starting from approximation by tri-
gonometric polynomials (also in the definition of the various
f is also left (right) almost-periodic, and the family
classes of generalized almost-periodic functions), see [2],
f(axb), where a and b independently run through G, is
Chapt. 2. An interesting generalization not mentioned above (relatively) compact. The latter property is often taken
are the Levitan almost-periodic functions (see [A5], where as a definition of almost-periodic functions on G. The
they are called N-almost periodiC functions). A new set of all almost-periodic functions on G is a Banach
approach to the theory of uniformly almost-periodic func- space with a norm II f II = SUPXEG I f(x) I
tions is given in [A 1); this has lead to the study of so-called The theory of almost-periodic functions on a group
almost-automorphic functions [A9), a class of functions depends essentially on the mean-value theorem (cf. [5],
closely related to the Levitan almost-periodic functions men- [8]). A linear functional Mx {f(x)} defined on the space
tioned above [A7).
of almost-periodic functions is called a mean value if
Outside the realm of harmonic analysis an important
I) Mx {1}=I, Mx{f(x)}~O for f(x)~O and
application of (uniformly) almost-periodic functions lies In
the theory of differential equations; see e.g. [7], [A8], [AS],
Mx{f(x)} >0 for j(x)~O,f=O;
[A2], Chapt. 4, and [A10], Chapt. 4, S. In this context the
2) Mx {f(xa)} = Mx {f(ax)} =Mx{f(x -I)} =Mx {f(x)} ,
theory of dynamical systems (cf. Dynamical system) IS for all a EG.
relevant and, in particular, the study of various types of A unitary matrix function g(x)={glj(X)}i'J=I'
almost-penodic and (or) recurrent motions, see [A2), [A3) defined on G, is called a unitary representation of G if
and [A6) (beware of conflicting terminology In the literature). g(e)=Ir (e is the identity element of G and Ir is the
As a replacement for [6) , [A 11) may be used, which IS of identity matrix of order r) and if for all x,y E G,
the same flavour as [6]. g(xy) = g(x )g(y). The number r is called the dimension

156
ALMOST-PERIODIC FUNCTION ON A GROUP

of the representation g. The matrix entries gij(x) are functions x(x) = exp(iA'x), where AERn,
almost-periodic functions on G. In the theory of A'X =A1Xl + ... +AnXn. Theorems I and 2 imply the
almost-periodic functions on a group they play the main results in the theory of almost-periodic functions
same role as the functions exp(iA(x)) in the theory of of a single or of several variables.
almost-periodic functions on R. The proof of the main statements in the theory of
Two representations g(x) and g' (x) are said to be almost-periodic functions is based on the consideration
equivalent if a constant matrix A exists such that of integral equations on a group (cf. [2]). The existence
g'(x)=A -lg(x)A. A representation g is said to be of sufficiently many linear representations of compact
irreducible if the family of the matrices g(x), x E G, does Lie groups has been proved [3]. In this case, invariant
not admit a common non-trivial subspace in R'. The integration (and consequently, the mean) can be esta-
set of all irreducible unitary representations is parti- blished directly. Invariant integration on an abstract
tioned into classes of mutually-equivalent representa- compact group has been constructed [4] depending on
tions. Let one representation be chosen from each an extension of the Peter - Weyl theory to this case.
equivalence class and let the set thus obtained be The theory of almost-periodic functions on a group
denoted by S. Then the set of almost-periodic functions can be deduced (d. [3]) from the Peter- Weyl theory
in the following way. Let f be an almost-periodic func-
H = {</>x(x)} = {</>x: </>x =gt, gES} tion on a group G and let
on G turns out to be an orthogonal (though, in general,
p{x,y) = a,sup I f(axb)- f(ayb) I.
uncountable) system with respect to the mean value. bEG

Theorem 1 (the Parseval equality). For an almost- Then the set E={tEG: p(t,e)=O} is a normal sub-
periodic function f(x) the following equality holds: group of G, p is an invariant metric on the quotient
group G / E and f is uniformly continuous on G / E.
M (If(x) 12} = ~ I Mx{f(x~(x)} 12 The almost-periodicity of f implies that the comple-
x Mx{ I </>x(x) 12}
tion of G / E in the metric p is a compact group and
Thus, for only countably many A, Mx{f(x)<I>;\(x)} Theorems I and 2 follow from the Peter- Weyl theory.
differs from zero; the series References
[I] LEVITAN, B.M.: Almost-periodicfunctions, Moscow, 1953,
Chapt. 6 (in Russian).
[2] WEYL, H.: 'Integralgleichungen und fastperiodische Funk-
tionen', Math. Ann. en (1927),338-356.
is called the Fourier series of f [3] PETER, F. and WEYL, H.: 'Die Vollstiindigkeit der primitiven
A representation gES is said to occur in the Fourier Darstellungen einer geschIossener kontinuierlichen Gruppe',
series of an almost-periodic function f if Math. Ann. en (1927),737-755.
[4] NEUMANN, 1. VON: 'Zurn Haarschen Mass in topologischen
Mx{f(x)gij(x)}=t=O for some i,j, l";;;;i,j";;;;r. Gruppen', Compositio Math. 1 (1934), 106-114.
Theorem 2 (the approximation theorem). The set H is [5] NEUMANN, 1. VON: 'Almost periodic functions in a group 1',
Trans. Amer. Math. Soc. 36 (1934),445-492.
dense in the space of almost-periodic functions
[6] WElL, A.: c.R. A cad. Sci. Paris Ser. I Math. 200 (1935), 38-40.
equipped with the norm [7] YOSIDA, K.: Functional analysis, Springer, 1980, Chapt. 8,
Sect. 4; 5.
II !II = ~~g I f(x) I, [8] MAAK, W.: Fastperiodische Funktionen, Berlin, 1950.

and every almost-periodic function can be arbitrarily V. V. Zhikov


well approximated by a finite linear combination of B.M. Levitan
matrix entries of representations occurring in its Editorial comments. Instead of the term 'mean value'
Fourier series. one often uses the term 'invariant mean' (cf. [Al], Sect. 18).
If G is a topological group, then to the definition of For an Abelian group G the uniformly almost-periodic
an almost-periodic function should be added the functions are precisely those that can be continuously
requirement of its continuity. In this case, the represen- extended to the Bohr compactification of G.
tations occurring in its Fourier series are also continu- A unified account of the theory of almost-periodic func-
tions on groups can also be found in [A2] and [A3] , Sect.
ous.
41. The basic observation is that the Banach algebra of
If G is an Abelian group, then the continuous unitary
(continuous) almost-periodic functions on a (topological)
representations are one-dimensional. They are called
group G is isomorphic to the Banach algebra of all continu-
the characters of G. The characters of G are denoted by ous functions on the so-called Bohr compactification Gc of
X and Parseval's equality reads as follows: G. In this way the theory is reduced to the theory of con-
Mx{ I f(x) 12} = ~ I an 12, an = Mx{f(x)x,,(x)}. tinuous functions on a compact group (e.g., the mean-value
theorem corresponds to the normalized Haar measure on
In the case G = Rn the continuous characters are the Gc the approximation theorem is nothing else than the

157
ALMOST-PERIODIC FUNCTION ON A GROUP

well-known Peter-Weyl theorem for compact groups, etc.). A(): R -+ Hom(Rn, Rn),
The Bohr compactification of G can be characterized as the
having the following property: There exist a system
reflection of G in the subcategory of all compact groups. By
considering reflections in other subcategories of the
j=By, YERn , with constant coefficients and, for every
category of all topological groups (or even of all semi- >0, a Lyapunov transformation L((t) such that by the
topological semi-groups) one can define other classes of change of variables x =L((t)y, the system (*) is
almost-periodic functions on groups (or semi-groups), see transformed into the system
[A4]. Weakly almost-periodic functions are of particular j = (B+C.(ty,
interest in functional-analytic applications (semi-groups of
operators). See also [7] and [AS]. where
~~f II C.(t) " < (.
References
[A1] HEWITT, E. and Ross, K.A.: Abstract harmonic analysis, 1, Every reducible linear system is almost reducible.
Springer, 1979.
References
[A2] WElL, A.: I'lntegration dans les groupes topologiques et ses
[1) lzoBov, N.A.: 'Linear systems of ordinary differential equa-
applications, Hermann, 1940.
tions', J. Soviet Math. S, no. 1 (1976),46-96. (Itogi Nauk. i
[A3] LooMIs, L.H.: An introduction to abstract harmonic analysis,
Tekhn. Mat. Anal. 12 (1974),71-146) VM M'll' h h'k
v. Nostrand, 1953. llOnSClOV
[A4] BERGLUND, J.F., JUNGREN, H.D. and MILNES, P.: Compact
right topological semigroups and generalizations of almost AMS 1980 Subject Classification: 34A34 , 34A30
periodicity, Lect. Notes in Math., Vol. 663, Springer, 1978.
[A5] BURCKEL, R.B.: Weakly almost periodic functions on semi- ALMOST-SYMPLECTIC STRUCfURE - A non-
groups, Gordon and Breach, 1970. degenerate differential 2-form on a manifold. An
[A6] CORDUNEANU, c.: Almost periodic functions, Interscience,
almost-symplectic structure 1'2 can exist only on an
1961, Chapt. 7.
[A7] GUCKSBERG, I. and LEEUW, K. DE: 'Almost periodic functions even-dimensional manifold M (dimM=2m) and
on semigroups', Acta Math. 105 (1961),99-140. defines an Sp(m, R)-structure BSp(m,R), namely the prin-
[A8] AMERIo, L. and PROUSE, G.: Almost-periodic functions and
functional equations, v. Nostrand, 1971.
cipal fibre bundle of frames on M with structure group
[A9] DIXMIER, J.: C' algebras, North-Holland, 1977 (translated Sp(m, R), consisting of all frames r = {ei' Ji:
from the French). i = 1, ... ,m} for which
AMS 1980 Subject Classification: 43A60, 42A75 Q(e;, ej) = Q(j;,jj) = 0, Q(e;,jj) = 81)'
ALMOST-PRIME NUMBER - A natural number n A necessary and sufficient condition for the existence
of the form of an almost-symplectic structure (or of an almost-
n=PI"'Pk,
complex structure, as well) on a manifold M is the pos-
where the Pi are prime numbers and k ~ I is a constant. sibility of reducing the structure group of the tangent
Prime numbers are the special case of almost-prime bundle to the unitary group U(m). For this, in particu-
numbers for k = 1. There are theorems for almost-prime lar, it is necessary that all odd-dimensional
numbers that generalize theorems on the distribution of Stiefel- Whitney classes of M vanish (cf. [I D.
prime numbers in the set of natural numbers. Several An almost-complex structure J and a Riemannian
additive problems that have not yet been solved for metric g on a manifold M define an almost-symplectic
prime numbers have been solved for almost-prime structure Q by the formula
numbers.
B.M. Bredikhin Q(X, Y) = g(JX, Y)-g(X,JY),
Editorial comments. See also Sieve method. where X and Yare vectors. Any almost-symplectic
An example of a result on the distribution of almost-prime structure can be obtained in this manner. An almost-
numbers generalizing the corresponding one on prime symplectic structure is said to be integrable or, in other
numbers is the following. Let 'TI'k(X) be the number of words, a symplectic structure, if it can be brought to the
square-free almost-prime numbers .;;;x. Then (cf. [A 1], Sect.
form Q = ~dxi I\dyi in some local coordinates Xi ,yi,
22.18) x(log log xt- 1 i = 1, ... , m, in a neighbourhood of any point. Accord-
'TI'k(X) - (k-2)! logx
ing to Darboux's theorem, for this it is necessary and
References sufficient that Q be closed. An example of an integrable
[A1] HARDY, G.H. and WRIGHT, E.M.: An introduction to the almost-symplectic structure is the canonical symplectic
theory of numbers, Clarendon Press, 1965.
structure Q = ~dpi I\dqi on the cotangent bundle T* M
AMS 1980 Subject Classification: 10A25 of an arbitrary manifold M (here the ql are local coor-
dinates on M and the pi are the associated coordinates
ALMOST-REDUOBLE LINEAR SYSTEM of ordi- in the fibres). An example of a non-integrable almost-
nary differential equations - A system symplectic structure is a left-invariant 2-form on a
x = A(t)x, xERn, (*) semi-simple Lie group G, obtained by extending an

158
ALTERNATING KNOTS AND LINKS

arbitrary non-degenerate exterior 2-form on the Lie structure is always infinite-dimensional and is a k-
algebra TeG of G by left translation to G. As a transitive group for any k >0.
Riemannian metric, an almost-symplectic structure also References
defines an isomorphism of the tangent and cotangent [I) LIBERMAN, P.: 'Sur les structures presque complexe et autres
spaces (and by the same method, of the spaces of con- structures infinitesimales regu!ieres', Bull. Soc. Math. France 83
(1955), 195-224.
travariant and covariant tensors); it further defines a (2) Itogi Nauk i Tekhn. Algebra Topol. Geom. 11 (1974), 153-207.
canonical 2m-form 1/ = om / m!, called its volume form, (3) LYCHAGIN, V.V.: 'Contact geometry and second-order non-
and several operators in the space /\(M) of differential linear differential equations', Russian Math. Surveys 34, no. I
(1979), 149-180. (Uspekhi Mat. Nauk 34, no. I (1979), 137-
forms: the operator n of exterior multiplication by 0;
165)
the operator in of interior multiplication by 0; the (4) KOBAYASHI, S.: Transformation groups in differential geometry,
Hodge star operator *: /\p(M)-,>/\2m -P(M), w-'>iw1/, Springer, 1972.
where the operator i w of interior multiplication is (5) HURT, N.E.: Geometric quantization in action, Reidel, 1983.
(6) ARNOL'D, V.1. and GIVENTAL, A.B.: Itogi Nauk. i Tekhn.
defined as the contraction of the given form with the Sovrem. Probl. Mat. 4, 5-139.
p-vector corresponding to the p-form w; the operator of D. V. Alekseevskir
codifferentiation 0 = *d*. In contrast with the Riemann- Editorial comments.
ian case, the operator !:l=do+od turns out to be skew- References
symmetric with respect to the global scalar product [A 1] LIBERMANN, P. and MARLE, C.-M.: Symplectic geometry and
analytical mechaniCS, Reidel, 1987.
<a, [3> = fMa/\ *[3 in the space of p-forms on a com-
AMS 1980 Subject Classification: 53C15
pact manifold M. For an arbitrary p-form one has the
Hodge - Lepage decomposition ALPHABET - A list (finite set) of letters (cf. Letter).
W=Wo +nWl +bW2 + .. " where the Wi EN -2i(M) An alphabet usually contains the letters which are
are uniquely determined effective forms (i.e. they are required to develop a certain system of symbols or, as
annihilated by in) [3]. is sometimes said, a language. For two alphabets A and
An almost-symplectic structure is said to be confor- B, their union A U B, their intersection A n
B, their
mally flat if there is a function A>O such that difference A \ B, as well as the inclusion relation A CB
d(AO) = O. This is equivalent to the representability of 0 are defined in the obvious manner. For convenience,
in the form
Q = v 1 ~ dx' /\dyl the concept of an empty alphabet, i.e. an alphabet
, - 1 without any letters, may also be considered.
For m =2, a necessary and sufficient condition in order . NM. Nagornyr
that the almost-symplectic structure 0 be conformally AMS 1980 Subject Classification: 03C07
flat is the closedness of the I-form 00 = indO, and for ~
m>2 the equality dO=(l/m-l)oO/\O should hold ALTERNATING GROUP of degree n - A The sub-
(cf. [1]). group An of the symmetric group Sn consisting of all
The tensor T of type (1,2) corresponding to the 3- even permutations. An is a normal subgroup in Sn of
form dO and defined by the equality index 2 and order n! / 2. The permutations of An, con-
O(TxY, Z)=dO(X, Y, Z), where X, Yand Z are vectors, sidered as permutations of the indices of variables
is called the torsion tensor of the almost-symplectic struc- x I, . . . ,Xn , leave the alternating polynomial
ture O. The (degenerate) metric g(X, Y)=trTxT y can II(xi-Xj) invariant, hence the term 'alternating
be associated with it. An almost-symplectic structure group'. The group Am may also be defined for infinite
determines the class of linear connections V for which cardinal numbers m, as the subgroup of Sn consisting
o is parallel and which have T as their torsion tensor. of all even permutations. If n > 3, the group An is
Two such connections differ by a tensor field of the (n - 2)-fold transitive. For any n, finite or infinite,
form Oij Sjkl, where Sjkl is an arbitrary symmetric tensor except n =4, this group is simple; this fact plays an
field. The connections under consideration correspond important role in the theory of solvability of algebraic
in a one-to-one manner to the sections of the first equations by radicals.
extension B l-'>B for the Sp(m, R)-structure References
B =B Sp(m. R), which is the principal bundle of frames on [I) HALL, M.: The theory of groups, Macmillan, 1959.
B with structure group s3(R2m) (the vector group of NN Vi/'yams
homogeneous polynomials in 2m variables of degree 3). Editorial comments. Note that A5 is the non-Abelian
The Sp(m, R)-structure is a G-structure of infinite type. simple group of smallest possible order.
Therefore, the group of automorphisms of an almost- AMS 1980 Subject Classification: 20835
symplectic structure can be infinite-dimensional. In
particular, the group of automorphisms of a symplectic ALTERNATING KNOTS AND LINKS - Knots and

159
ALTERNATING KNOTS AND LINKS

links with an alternating diagram (cf. Knot and link Editorial comments. A Seifert surface for a knot or link
diagrams), i.e. a projection in general position onto a K n c sn+2 is a connected, bicoliared compact manifold
plane such that, when successively traversing all com- M n + 1 CS n + 2 such that aM n + 1 =Kn. A subset Xc Yis bicof-
ponents, the overpasses and underpasses succeed each fared (in Y) if there exists an imbedding b: XX [ -1, 1]~ Y
other in alternation. Any diagram can be converted to such that b(x, O)=x for ali XEX. The mapping b or its image
is the bicoliar itself.
an alternating diagram by changing the upper and
lower branches through its double points. References
[A 1] ROLFSEN, D.: Knots and links, Publish or Perish, 1976.
Let F be a Seifert surface. As distinct from the ordi-
nary case, the inequality d.,:;;;2h+p.-l, where d is the AMS 1980 Subject Classification: 57M25, 57Q45
degree of the Alexander polynomial (d. Alexander
invariants), h is the genus of the Seifert surface and p. is ALTERNATING SERIES - An infinite series whose
the number of components of the link k, becomes an terms are alternately positive and negative:
equality in the case of alternating knots and links. U]-U2+ ... +(-lr-]un + " ' , Uk> O.
Accordingly, the genus of an alternating link can be
If the terms of an alternating series are monotone
computed from anyone of its alternating diagrams, the
decreasing (un + 1 <Un) and tend to zero (limn->oo Un =0),
Seifert surface being the surface of smallest genus. This
then the series is convergent (Leibnitz' theorem). The
also shows that if the diagram is normalized, i.e. if the
remainder term of an alternating series,
projection plane does not contain a simple closed con-
tour that intersects the diagram at one double point, 'n = (-ltun +]+"',
the link is trivial (cf. Knot theory) if and only if the has the same sign as its first term and is less then the
diagram contains no double points. If such a contour latter in absolute value. The simplest examples of alter-
exists, it is possible, by rotating an internal part of the nating series are
diagram through 180 degrees, to reduce the number of
1-1-+1--1-+ ... +(_l)n-]1-+ ...
double points while preserving the alternating nature of 2 3 4 n'
the diagram. This yields an algorithm for solving the 1-1-+1--1-+ ... +(-lr-]_I_+ ....
problem of triviality of alternating knots and links. 2 5 7 2n-l
Moreover, if the diagram is connected, the links do not The sum of the first of these series is log 2; that of the
become separated, since d;;;;.l, and the reduced Alex- second is 'IT / 4.
ander polynomial of seperated links is zero. The Alex- v,l. Bityutskov
ander matrix is computed as the incidence matrix of AMS 1980 Subject Classification: 40A05
some graph, which implies [1], [2] that Ll(t) is an alter-
nating polynomial, i.e. its coefficients are non-zero and ALTERNATION, skew symmetry, anti-symmetry,
their signs alternate. If Ll(O) = I, alternating knots and alternance - One of the operations of tensor algebra,
links are known as Neuwirth knots and links (d. yielding a tensor that is skew-symmetric (over a group
Neuwirth knot). The number of double points of a nor- of indices) from a given tensor. Alternation is always
malized diagram of an alternating knot or link is not effected over a few superscripts or over a few sub-
larger than its determinant. The groups of alternating scripts. A tensor A with components {aj',... J" I ~i p,
knots and links (d. Knot and link groups) are j p..,:;;;n} is the result of alternation of a tensor T with
represented as free products with amalgamation of two components {tj', ...... J,' I":;;;i p , jp..,:;;;n}, for example, over
free groups of rank q by subgroups of rank 2q - 1. This superscripts, over a group of indices 1= (i 1, . . . ,im ) if
representation is obtained with the aid of the van Kam-
pen theorem, if the space of the link k is subdivided by (*)
the boundaries of regular neighbourhoods of the Seifert
surface with respect to k, constructed over the alternat- The summation is conducted over all m! rearrange-
ing diagram. All knots of the standard table (cf. Knot ments (permutations) a=(a], ... ,am) of I, the number
table) with non-alternating diagrams are non- a(I, a) being + 1 or -1, depending on whether the
alternating knots. Most parallel knots, windings, etc., respective rearrangement is even or odd. Alternation
do not alternate. over a group of subscripts is defined in a similar
manner.
References Alternation over a group of indices is denoted by
[I] MURASUGI, K.: 'On the Alexander polynomial of the alternat-
ing knot', Osaka J. Math. 10 (1958), 181-189. enclosing the indices between square brackets. Secon-
[2] CROWELL, R.H.: 'Genus of alternating link types', Ann. of dary indices inside the square brackets are separated by
Math. (2) 69 (1959), 258-275. vertical strokes. For instance:
[3] MURASUGI, K.: 'On alternating knots', Osaka 1. Math. 12
(1960),277-303.
A. V Chernavskir

160
ALTERNATIVE RINGS AND ALGEBRAS

Successive alternation over groups of indices I I and h, max If(x)-Pn(x) I,


XEQ
I I Ch, coincides with alternation over the group of
indices 1 2 : then the points {xd8 + I are called Chebyshev points of
alternation. Points of alternation play an important role
If n is the dimension of the vector space on which in the theory of approximation of functions. E.g., the
the tensor is defined, alternation by a group of indices de la Vallee-Poussin theorem (the alternation theorem)
the number of which is larger than n will always pro- and the Chebyshev criterion (d. Chebyshev alternation)
duce the zero tensor. Alternation over a given group of are formulated in terms of points of alternation. Points
indices of a tensor which is symmetric with respect to of alternation are also employed in constructing poly-
this group (d. Symmetrization (of tensors also yields nomials of best approximation.
Yu.N. Subbotin
the zero tensor. A tensor that remains unchanged under
Editorial comments. A sequence of Chebyshev points of
alternation over a given group of indices I is called
alternation is also called an alternating set [A 1], Chapt. 1.
skew-symmetric or alternating over I. Interchanging any
pair of such indices changes the sign of the component References
[A 1] T.1.: An introduction to the approximation of func-
RlvLIN,
of the tensor. tions, Dover, reprint, 1981.
The operation of tensor alternation, together with the [A2] MOLLER, M.W.: Approximationstheorie, Akad. Ver-
lagsgesellschaft, Wiesbaden, 1978.
operation of symmetrization, is employed to decompose
[A3] MEiNARDus, G.W.: Approximation von Funktionen und ihre
a tensor into simpler tensors. numerische Behandlung, Springer, 1964.
The product of two tensors with subsequent alterna- [A4] CHENEY, E.W.: Introduction to approximation theory, Chel-
tion over all indices is called an alternated product sea, reprint, 1982.

(exterior product). AMS 1980 Subject Classification: 41 A 10


Alternation is also employed to produce sign-
alternating sums of the form (*) with multi-indexed ALTERNATIVE (in game theory) - Each of the posi-
terms. For instance, a determinant with elements which tions to which, according to the rules of the game, it is
commute under multiplication can be computed by the possible to pass from a given position in one move. The
formulas move itself, and the index of the move, are also known
al as alternatives.
I.N. Vrublevskaya
AMS 1980 Subject Classification: 90DXX

a7 ALTERNATIVE RINGS AND ALGEBRAS - An alter-


native ring is a ring in which every two elements gen-
erate an associative subring; an alternative algebra is a
References
[1] SHIROKOY, P.A.: Tensor calculus. Tensor algebra, Kazan', 1961
(linear) algebra that is an alternative ring. By a theorem
(in Russian). of E. Artin the class of all alternative rings is defined
[2] BEKLEMISHEY, D.V.: A course of analytical geometry and linear by the system of identities:
algebra, Moscow, 1971 (in Russian).
[3] SCHOlITEN, 1.A.: Tensor analysis for physicists, Cambridge (xy)y = x(yy) (right altemativeness);
Uniy. Press, 1951.
(xx)y = x(xy) (left altemativeness).
[4] EFIMOY, N.V. and ROZENDORN, E.R.: Linear algebra and
multi-dimensional geometry, Moscow, 1970 (in Russian). Thus, the alternative rings form a variety. The termi-
L.P. Kuptsov nology 'alternative ring' is justified by the fact that in
AMS 1980 Subject Classification: 15A69, 15A72, any such ring the associator (the defect of associativity)
53A45 (x,y,z) = (xy)z-x(yz)

ALTERNATION, POINTS OF - A sequence of points


IS a skew-symmetric (alternative) function of its argu-
ments.
{x 1 }3+ 1 C Q, a";;xO<xl<'" <xn +l.,;;b, The first example of alternative rings were the Cayley
at which the difference (Xi = f(x;)- Pn(x;) assumes non- numbers, which form an alternative skew-field, i.e. an
zero values of alternating signs. Here f (x) is a con- alternative ring with a unit element, in which the equa-
tinuous function on the closed set Q qa, bland Pn(x) tions ax = band ya = b are uniquely solvable for all b
is an algebraic polynomial of degree not exceeding n. and all a*O. Alternative skew-fields playa substantial
In a similar manner the concept of points of alternation part in the theory of projective planes, since a projec-
is introduced for polynomials in a Chebyshev system of tive plane is a Moufang plane (i.e. a translation plane
functions {sk(x)}8 (which satisfy the Haar condition). with respect to some straight line) if and only if any
If, in this situation, all absolute values of (Xi are equal coordinatization of its ternary ring is an alternative
to skew-field. If in a ring R with a unit element each

161
ALTERNATIVE RINGS AND ALGEBRAS

non-zero element is invertible, and if the identity distinct from local, nilpotency the situation in alterna-
a-l(ab) =b tive rings differs from that in associative rings. Thus, an
alternative ring with the identity x 3 =0 is not neces-
(or the identity (ba)a -) =b) is valid for any a, b ER, sarily nilpotent (even if its additive group is torsion-
then R is an alternative skew-field. Any alternative free). However, an alternative ring with an identity
skew-field is either associative or has the structure of a xn =0 and without elements of order k, O<k~n, in the
Cayley - Dickson algebra over its centre. additive group is solvable of index n (n + I) / 2.
Each simple alternative ring is also either associative If R is an algebraic alternative algebra with an iden-
or a Cayley- Dickson algebra over its centre (in this tity relation that is not a consequence of the associa-
case the algebra need not necessarily be a skew-field). tivity (or if the degrees of algebraicity of the elements
Both the associative and the primitive alternative rings of R are uniformly bounded), then R is locally finite-
are exhausted by the Cayley - Dickson algebras. All dimensional.
primary alternative rings R (if 3R:~0) are either associ- In a alternative ring there is an analogue of the
ative or are Cayley- Dickson rings. Jacobson radical: In each alternative ring there exists a
Many properties of alternative rings differ substan- maximal quasi-regular ideal J (R), which is equal to the
tially from those of an associative ring in a similar intersection of all modular maximal right ideals. The
situation. Thus, if R is an alternative ring and A and B quotient ring R / J (R) is J -semi-simple, l.e.
are right ideals in it, their product AB need not neces- J(R / J(R=O; if I is an ideal of R, then
sarily be a right ideal, even if A is a two-sided ideal in J(I)=J(R)nI, and any J-semi-simple ring can be
R; however, the product of two two-sided ideals of an approximated by primitive alternative rings (i.e. by
alternative ring is a two-sided ideal in it. The differ- primitive associative rings and by Cayley - Dickson
ence between associative rings and alternative rings is algebras). There also exist analogues of all other associ-
also strongly manifested by the fact that alternative ative radicals (lower nil-radical, locally nilpotent radi-
rings contain various kinds of nilpotency, since the pro- cal, etc.), which display the same basic properties as in
duct of elements may be zero or non-zero, depending associative rings.
on the placement of the parentheses. It is customary to In an Artinian alternative ring R (i.e. an alternative
use the following kind of nilpotency in an alternative ring that satisfies the descending chain condition
ring: solvability (a ring R is called a solvable ring of (minimum condition) for right ideals) the radical R (J)
index m if there exists a number m such that Rm = 0, is nilpotent, and so is any nil-sub groupoid of the multi-
where R; + I =R; R;, R) = R), right nilpotency (there plicative groupoid of R. A ring R is an Artinian alter-
exists a number n such that R(n) =0, where native ring without nilpotent ideals if and only if R
R(i+l)=R(i)R(l), R(I)=R), and nilpotency (there exists splits into the direct sum of finitely many complete
a number k such that Rk =0, i.e. the product of any k matrix algebras (over certain associative skew-fields)
elements of R is zero, no matter how the parentheses and Cayley- Dickson algebras; this decomposition is
are placed). There are solvable alternative rings of unique for each R, apart from a rearrangement of the
index 3, but they are not nilpotent. Right nilpotency in terms. If R is an alternative ring, lone of its ideals,
an alternative ring is equivalent to nilpotency (alterna- and R satisfies the minimum condition for the two-
tive rings that are right nilpotent of index n, are also sided ideals contained in I, then I is nilpotent if and
nilpotent of index ~(n + 1)2). Locally, i.e. in finitely only if there are no simple ideals in the ring Ref> for any
generated rings, all kind of nil potency are equivalent. arbitrary homomorphism ef> from R into Ief>.
The theory that establishes sufficient criteria of local In an alternative ring R one distinguishes between
nilpotency of an alternative ring is completely parallel the associative centre
to the corresponding theory for associative rings. This
N(R) = {nER: (n,a,b)=Oforalla,bER},
is a consequence of the following fact: Let R be an
alternative ring in which one can to select a system of the commutative centre
generators S such that any two elements of S generate C(R) = {cER: [c,a]=ca-ac=OforallaER},
a nil ring, suppose also that all assocIative and the centre
homomorphic images of R are locally nilpotent. then R
is locally nilpotent. Therefore. if R is an alternative Z(R) = N(R) n C(R).
ring with an identity xn =0. then R is locally nilpotent; If in the additive group of R there are no elements of
if R is an alternative algebra with an identity relation order three. then C(R)CN(R). However. commutative
that is not a consequence of the associativity. and if non-associative alternative algebras exist over a field of
each element is the sum of a finite numher of nil ele- characteristic three. In a primary alternative ring R one
ments. then R is locally nilpotent. As regards global. as always has C (R) ~ N (R). In any alternative ring R one

162
AL TERNA TIVE RINGS AND ALGEBRAS

always has [N (R), R] C;N (R). Suppose that in an alter- [7) SKORNYAKOY, L.A.: 'Alternative rings', Rend di Mat. e Appl.
native ring R there are no non-trivial ideals. Then 1) 24 (1965), 360-372.
[8) SLATER, M.: 'Ideals in semiprime alternative rings', J. of Alge-
either 3R =0 or N(R)=!=O; 2) either 3R C;N(R) or bra 8 (1968), 60-76.
Z (R)=!=O; 3) if A is a right ideal in R, then [9) DoROFEEV, G.V.: 'Alternative rings with three generators',
N(A)=A nN(R) and Z(A)=A nZ(R). However, Sibirsk. Mat. Zh. 4, no. 5 (1963), 1029-1048 (in Russian).

over any field F it is possible to construct an alterna- KA. Zhevlakov


tive ring K with no non-trivial ideals such that Editorial comments. An element z in a ring R is right
N(K)=C(K)=O, but K2, which is an ideal of K, is an quasi-regular if {a - za: a E R} = R. An element is quasi-
associative-commutative ring, i.e. regular if it is both left and right quasi-regular. An ideal in R

N(K2) = C(K2) = K2 * O.
is quasi-regular if all its elements are quasi-regular [A2). The
phrase 'approximated by ... ' means 'representable as a
Of the identities that are valid in an associative ring, subdirect sum of ... '. For example 'a J-semi-simple alter-
the following are the best known: native algebra R can be approximated by primitive alterna-
tive rings' means that R is isomorphic to a subdirect sum of
[(xy)z]y = x[(yz)y], primitive alternative rings.
[(xy)x]z = x [y(xz)], References
(xy)(zx) = [x(yz)]x [A 1] JACOBSON, N.: Structure and representation of Jordan alge-
bras, Amer. Math. Soc., 1968.
(the Moufang identities); [A2] JACOBSON, N.: Structure of rings, Amer. Math. Soc., 1964.

(xy, z, I)-y(x, z, I)-(y, z, I)x = [A3] SCHAFER, R.D.: An introduction to non-associative algebras,
Acad. Press, 1966.
= ([x,y), z, 1)+(X,y, [z, Ij),
AMS 1980 Subject Classification: 16-XX, 17D05
([x,y]", z, I) = [x,y]([x,yf, z, I) =

= ([x,yf, z, 1)[X,y] = o. ALTERNION - A hypercomplex number. Alternions


An alternative ring with three generators also satisfies may be considered as a generalization of the complex
the identity numbers, double numbers (cf. Double and dual
([x,y][z, I]+[z, I][X,y], u, v) = o. (*) numbers) and quaternions. The algebra IAn of altern ions
of order n and of index I is an algebra of dimension
In an alternative ring with more than three generators 2n -lover the field of real numbers, with unit element
the identity (*) is usually not satisfied; moreover, in
1 and a system of generators II, ... ,In -I, in which the
these alternative rings, as a rule, ([x, y f, z, t)=!=O. In multiplication satisfies the formula
any alternative ring without locally nilpotent ideals, the
identity (*) is satisfied, so that such an alternative ring li~ = -~/i' If = -(i,

can be approximated by primary associative rings and where fj = + 1, the value - 1 occurs I times and + 1
by Cayley - Dickson rings. occurs n -1-1 times, respectively. A base of the alge-
Any free alternative ring R has a non-zero ideal bra is formed by the unit element and by elements of
U(R) contained in the associative centre N(R). A free the form
alternative ring with three or more generators not only
contains divisors of zero, but is not primary. Free alter- where h < ... <A. In this base any alternion a can
native rings with four or more generators even contain be written as
trivial ideals and for this reason cannot be approxi- a = a+ ~ai/i+ ~~aiJlij+ ... +
mated by primary rings. i ; j

References +a I '(n-I)l I, ... n-I>


[1) DoROFEEY, G.V.: 'An instance of a solvable, though non-
nilpotent alternative ring', Russian Math. Surveys 15, no. 3
where a, ai, ... ,a I ... (n -I) are real numbers. The
(1960), 147-150. (Uspekhi Mat. Nauk 15, no. 3 (1960), 147- alternion a conjugate to the alternion a is defined by
150) the formula
[2) ZHEVLAKOY, K.A.: 'Alternative Artinian rings', Algebra i
Logika 5, no. 3 (1966), 11-36 (in Russian). a = ~(_1)k(k+I)/2ail "'i'/il ... Ii,.
[3) ZHEVLAKOY, K.A.: 'On radicals and von Neumann ideals', k
Algebra and LogiC 8, no. 4 (1969), 244-252. (Algebra i Logika The following equalities hold
8, no. 4 (1969), 425-439)
[4) SHIRSHOY, A.I.: 'On certain non-associative nil-rings and alge- a+f3 = a+P, ~ = a, af3 = pa.
braic algebras', Mat. Sb. 41 (83), no. 3 (1957), 381-394 (in Rus-
sian). The product aa is always a positive real number; the
[5) KLEINFELD, E.: 'Simple alternative rings', Ann. of Math. (2) 58,
no. 3 (1953), 544-547.
quantity I a I = y:;;;;
is called the modulus of the
[6) KLEINFELD, E.: 'Alternative nil rings', Ann. of Math. (2) 66, no. altern ion a. If the number I P- a I is taken as the dis-
3 (1957), 395-399. tance between two alternions a and p, then the alge-

163
ALTERNATIVE RINGS AND ALGEBRAS

bras 0 An and 'An, 1>0, are isometric to the Euclidean [4] COHN, P.M.: 'The embedding of firs in skew fields', Proc. Lon-
space R 2"-' and the pseudo-Euclidean spaces 'R2"-', don Math. Soc. (3) 23 (1971), 193-213.
L.A. Bokut'
respectively. The algebra 0A 1 is isomorphic to the field D.M. Smirnov
of real numbers; 0 A 2 is isomorphic to the field of com-
Editorial comments. The reference to [1] in the article
plex numbers; 1A 2 is isomorphic to the algebra of dou-
above should have been to H. Neumann's original papers
ble numbers; 0 A 3 is isomorphic to the skew-field of [A 1], [A2].
quaternions; and 1A 3 and 2A 3 are isomorphic to the The amalgamation problem for a class of algebraic sys-
so-called algebras of anti-quaternions. The elements of tems is generally understood to mean the problem of imbed-
oAn are the so-called Clifford numbers. The algebra ding an amalgam of two systems, as defined above, in a
4A 5 was studied by P. Dirac in the context of the spin system of the class. As mentioned above, the amalgamation
of an electron. problem for groups is solvable, but for other classes it can
The algebras of alternions are special cases of Clif- be an interesting and important question. E.g., it has been
ford algebras (d. Gifford algebra). shown [A3] , [A4] that the amalgamation problem for various
classes of lattices is closely related to the interpolation prob-
References lem in logic.
[1) ROZENFEL'D, B.A.: Non-Euclidean geometry, Moscow, 1955 (in
Russian). References
N. N. Vi/'yams [A1] NEUMANN, H.: 'Generalized free products with amalgamated
subgroups', Amer. J. Math. 70 (1948), 590-625.
AMS 1980 Subject Classification: 16AXX [A2] NEUMANN, H.: 'Generalized free products with amalgamated
subgroups', Amer. J. Math. 71 (1949), 491-540.
AMALGAM - A set M represented as the set- [A3] MAKSIMOYA, L.L.: 'Craig's interpolation problem and amal-
gamable varieties', Soviet Math. Oak/. 18 (1977), 1550-1553.
theoretic union of a family {Ai: i EI} of algebraic sys- [A4] PrITs, A.M.: 'Amalgamation and interpolation in the category
tems (cf. Algebraic system) Ai of a given class ~ with of Heyting algebras', J. Pure App/. A/g. 29 (1983), 155-165.
intersections Vi}' where for all i, j the intersection AMS 1980 Subject Classification: 08825
Ai nAj = Ui] = Uji
is non-empty and is a subsystem of each of the systems AMALGAM OF GROUPS - A family of groups
G a, a EI, that satisfies the condition that the intersec-
Ai, A j . If there exists a system B in the class Sf contain-
ing all Ai (i EI) as subsystems, then says one that the tion Ga n G p is a subgroup in G a and G p for any a, f3
amalgam is imbeddable in the system B. from I. An example of an amalgam of groups is an
An amalgam of two groups and, in general, any arbitrary family of subgroups of a given group. An
amalgam of groups {Ai: i EI}, in which all intersec- imbedding of an amalgam of groups A = {G a: a EI} into
tions Vi} (i=l=j) coincide and are equal to V, is always a group G is a one-to-one mapping of the union
imbeddable in a group, e.g. in the free product of the U aE I Ga into G whose restriction to each Ga is an iso-
groups Ai (i EI) with the common subgroup U. There morphism. An amalgam of groups in which all intersec-
are, however, amalgams of groups which are not tions Ga n Gp are identical (and equal to, say, a sub-
imbeddable in a group. (For conditions for imbeddabil- group H) is imbeddable in the group that is the free
ity of amalgams of groups in a group see [l]; for product of the groups Ga with the amalgamated sub-
imbeddability of amalgams of semi-groups in a semi- group H. On the other hand, there exists an amalgam
group see [2]). See also Amalgam of groups. of four Abelian groups that is not imbeddable in a
Let ~ be the class of all algebras over a given field F group. The principal problem concerning amalgams of
or the class of commutative, anti-commutative or Lie groups is, generally speaking, as follows. Let a, T be
algebras over a field F. An amalgam {A;: i EI} of Sl- possible properties of groups. The question to be
algebras Ai with identical intersections Vi} = V (for all answered is the nature of the conditions under which
i =1=j) is imbeddable in the S'f-free product of these alge- an amalgam of groups with the property a is imbedd-
bras with the common subalgebra V [3]. It has been able in a group with the property T. It was found that
shown [4] that an amalgam {Ti: i EI} of associative all amalgams of two finite groups are imbeddable in a
skew-fields Ti with identical intersections Tij = T (i=l=j) finite group. An amalgam of three Abelian groups is
is imbeddable in an associative skew-field. imbeddable in an Abelian group. An amalgam of four
References Abelian groups imbedded in a group is contained in an
[1) KUROSH, A.G.: The theory of groups. 2. Chelsea. reprint. 1960. Abelian group. There exists an amalgam of five Abelian
33 (translated from the Russian). groups which is imbeddable in a group, but not in an
[2) CLIFFORD. A.H. and PRESTON. G.B.: The algefTraic theory of
Abelian group. Another problem that has been studied
semigroups. 2. Amer. Math. Soc., 1967.
[3) SHIRSHOY, A.I.: 'On a hypothesis in the theory of Lie alge- is the imbeddability of an amalgam of groups if a, T
bras'. Sibirsk. Mat. Zh. 3, no. 2 (1962). 297-30 I (in Russian). denote solvability, nilpotency, periodicity, local finite-

164
AMPLE SHEAF

ness, etc. (in different combinations). respectively, to


Yu.I. Merzlyakov 1+2+4+5+10+11+20+22+44+55+110 = 284,
N.S. Romanovskii
1+2+4+71 + 142 = 220.
Editorial comments. In the definition of amalgam above,
think of the Ga as all being subsets of some large set X. L. Euler discovered about 60 pairs of amicable
The amalgamated product of groups G; 'over a common numbers, while the use of electronic computers yielded
subgroup U' is constructed as follows. Let G; be a set of a few hundreds of such numbers. It is not known, how-
groups indexed by the set I. For each i let U; be a sub- ever, if there exists a pair of amicable numbers one of
group of G; and for each i let there be an isomorphism which is even while the other is odd.
A.!, Galochkin
q,;: U;~U identifying U; and U. Consider the set G of all
words Editorial comments. For some very large pairs see [A1].
a, ... at
References
with each a; from some Gj , and consider the following ele- [A1] RmLE, H.J.J. TE: 'New very large amicable pairs', in Proc.
mentary equivalences Number Theory Noordwijkerhout, 1983, Lect. Notes in Math.,
1) if a;=1 then a, ... a;_,a;a;+, ... at is equivalent to Vol. 1068, Springer, 1983, pp. 210-215.
a, ... a;_, a;+, ... at; AMS 1980 Subject Classification: 10A40
2) if a; and a; +' belong to the same group Gj and
aja;+, =a; in Gj then a, ... aj_, a;aj+, ... at is equivalent
to a, ... aj_, a;aj+2 ... at; AMPLE SHEAF - A generalization of the concept of
3) if aj=UjEUjCGj and bj=UkEUkCGk and an ample invertible sheaf. Let X be a Noetherian
q,k(Uk)=U=q,j(U;)EU, then a, aj_,aja;+, ... at is scheme over a field k, and let C be a locally free sheaf
equivalent to a, ... aj_, bja;+, ... at. on X (that is, the sheaf of sections of some algebraic
Let - be the equivalence ~elation generated by these ele- vector bundle E~X). The sheaf C is called ample if for
mentary equivalences, then G / - is the amalgamated pro- each coherent sheaf on X there exists an integer no,
duct of the G;, more precisely of the (Gj, Uj) (Le. the free depending on , such that the sheaf S n C for
product of the G; with amalgamated subgroup ll); the group n ~no is generated by its global sections (here sn C
law is induced by concatenation. denotes the n-th symmetric power of C).
Amalgamated products are non-trivial. This follows from A locally free sheaf C on X is ample if and only if
the following canonical form theorem. For each i select a the invertible tautological sheaf !l'( C) on the projectiv-
set R; of left coset representatives of U in G;. Then each
ization P(E) of the bundle E is ample. Another cri-
word is equivalent to precisely one of the form UZ, ... Zt
terion of ampleness is that for each coherent sheaf
with each Z; in some Rj , UEU and z, and Zj+, belonging to
different G/s for i=1, ... ,t-1. If U={e} one obtains of
on X there must exist an integer no, depending on ,
course the free product of the Gi . A subgroup of a free pro- such that the cohomology group Hi(X, sn C) is
duct is itself a free product (Kurosh' theorem). Subgroups of zero for n~no and i>O. If the sheaves C and are
a product with an amalgamated subgroup need not be ample then C is an ample sheaf [1]. If X is a non-
themselves of this type. The reason is that if U is the amal- singular projective curve, then a sheaf C on X is ample
gamated subgroup, then one can take subgroups Hj of the if and only if C and all its quotient sheaves have posi-
Gj with different intersections with U so that the Hj will amal- tive degree [2]. The tangent sheaf on pN is ample for
gamate in various different ways. This leads to generalized any N (see [1]). The converse also holds: Any non-
amalgamated products and the notion of amalgam as singular N-dimensional algebraic variety with an ample
defined above. The theory of these is still incomplete. tangent sheaf is isomorphic to pN (see [1], [3]).
References
[A1] HALL, M, JR.: The theory of groups, Macmillan, 1959. References
[A2] NEUMANN, H.: 'Generalized free products with amalgamated [1] HARTSHORNE, R.: 'Ample vector bundles', Publ. Math. IRES

subgroups I', Amer. J. Math. 70 (1948),590-625. 29 (1966),319-350.


[A3] NEUMANN, H.: 'Generalized free products with amalgamated [2] HARTSHORNE, R.: 'Ample vector bundles on curves', Nagoya
subgroups II', Amer. J. Math. 71 (1949), 491-540. Math. J. 43 (1971), 73-89.
[3] DEMAZURE, M.: 'Characterisations de l'espace projectif (conjec-
AMS 1980 Subject Classification: 08B25, 20AXX tures de Hartshorne et de Frankel)', in Sem. Bourbaki 1979/80,
Lect. Notes in Mathematics, Vol. 842, Springer, 1981, pp. 11-

AMICABLE NUMBERS - A pair of natural numbers


1~ V.A. Iskovskikh
each one of which is equal to the sum of the proper Editorial comments. The theorem stated in the last line
divisors of the other, i.e. of the divisors other than the of the text is due to S. Mori [A 1].
number itself. This definition is found already in References
Euclid's Elements and in the works of Plato. One pair [A1] MORl, S.: 'Positive manifolds with ample tangent bundles',
of such numbers only - 220 and 284 - was known to Ann. of Math. 110 (1979), 593-606.
the ancient Greeks; the sums of their divisors are equal, AMS 1980 Subject Classification: 14F05

165
AMPLE VECTOR BUNDLE

AMPLE VECI'OR BUNDLE - An algebraic or ana- capacity which are suited. to the metrics of various
lytic vector bundle with an ample sheaf of regular other spaces of analytic functions (see, for example, [2]
(resp. analytic) sections (see Ample sheaf; Positive vec- and [3]).
tor bundle). The concept of analytic capacity proved to be well
o.A. Ivanova
suited to certain problems in approximation theory,
AMS 1980 Subject Classification: 14F05 where the solution of a number of fundamental prob-
lems is formulated in terms of analytic capacity. Thus,
AMPLITUDE OF AN ELLIPTIC INTEGRAL - The
[4], any continuous function on a bounded closed set E
variable 4>, considered as a function of z, in an elliptic in the plane can be uniformly approximated by rational
integral of the first kind functions to any desired degree of accuracy, if and only
z = F(.p,k) = j dt
if the equality
o VI-k 2 sin2 t y(J~ \ E) = y(J~) = 8
ill the normal Legendre form. The concept of the holds for any disc 118 of radius 8.
amplitude of an elliptic integral and the notation
References
4>=amz were introduced by e.G.J. Jacobi in 1829. The [l] AHLFORS,L.V.: 'Bounded analytic functions', Duke Math. J.
amplitude of an elliptic integral is an infinite-valued 14(1947), I-II.
periodic function of z. The basic elliptic Jacobi func- [2] GARABEDIAN, P.R.: 'The classes 4 and conformal mapping',
Trans. Amer. Math. Soc. 69 (1950), 392-415.
tions sinamz=snz, cosarnz=cnz, 6.amz=dnz are all [3] SINANYAN, S.O.: 'Approximation by analytic functions and
single-valued. It is convenient, however (e.g. for pur- polynomials in a real mean', Dokl. Akad. Nauk ArmSSR 35,
poses of tabulation), to consider an elliptic integral as a no. 3 (1962), 107-112 (in Russian).
[4] VlTUSHKIN, A.G.: 'Analytic capacity of sets in problems of
function F(4), k) of the amplitude 4> and the modulus k.
approximation theory', Russian Math. Surveys 22, no. 6 (1967),
See also Jacobi elliptic functions. 139-200. (Uspekhi Mat. Nauk 22, no. 6 (1967), 141-199)
E.D. Solomentsev
A. G. Vitushkin
AMS 1980 Subject Classification: 33A25
Editorial comments. A good reference to the general
area is [A1].
ANALLAGMATIC GEOMETRY - A geometry that
studies the properties of figures which are invariant References
[A 1] GAMELIN, T.W.: Uniform algebras, Prentice-Hall, 1969.
with respect to circle transformations (d. Circle
transfonnation) of the extended plane, i.e. of the AMS 1980 Subject Classification: 31 A 15, 31 C15
Euclidean plane completed by a point at infinite dis-
tance. ANALYTIC COMPLEMENT - An obsolete term
A.B. Ivanov denoting a set which is complementary to an s1' -set (on
AMS 1980 Subject Classification: 51 M05 the number axis).
P.S. Aleksandrov
ANALYTIC CAPACITY, analytic measure, Ahlfors AMS 1980 Subject Classification: 04A 15
analytic measure - A set function in the plane, intro-
duced by L. Ahlfors [1], which is an analogue of the ANALYTIC CONTINUATION of a function - An
logarithmic capacity, and which is suited for the charac- extension of a function fa, already defined on a certain
terization of sets of removable singularities of bounded subset E of a complex manifold M, to a function f
analytic functions. Let E be a bounded closed set in the which is holomorphic on a certain domain D eM con-
plane, let A (E) be the set of functions which are ana- taining E such that the restriction fl E =fo of f to E
lytic outside E, vanish at infinity and are bounded coincides with fa. The starting point in the theory of
everywhere outside E by the constant 1. Let analytic continuation is the concept of an (analytic) ele-
y(E ;f)=limzj(z) as z....,.oo. The number ment, i.e. a pair (D,f), where D is a domain in M and
y(E)=sup{ I y(E;f) I: fEA(E)}, is called the analytic f is a holomorphic function on D. One says that the
capacity of the set E. The analytic capacity of an arbi- elements (Do,fo) and (D l,fd are direct analytic con-
trary set is usually defined as the supermum of the ana- tinuations of each other through a connected com-
lytic capacities of its bounded closed subsets. ponent 6. of the set Do n
D 1 if fa It. =fl It.. By defini-
If E is a bounded closed set, then a necessary and tion, an element (Do,fo) continues analytically to a
sufficient condition in order that each function which is boundary point ~EaD eM if there exists a direct ana-
analytic and bounded outside E be extendable to E, is lytic continuation (D 1 ,ij) of the element (Do,fo)
that the analytic capacity of E be zero (the Ahlfors through 6. such that ~ E 6. n
D 1. A maximal analytic
theorem). continuation of CDo,fo) (in /'vf) is an element CD, f)
There also exist several concepts related to analytic which continues h analytically to the domain D ~ Do,

166
ANALYTIC CONTINUATION

but which cannot analytically be continued to any parameter-dependent integrals such as Cauchy-type
boundary point of D. The maximal analytic continua- integrals (cf. Cauchy integral); the Laplace integral; the
tion of (Do,fo) in M is unique, but does not always Borel transform; change of variable in a power series,
exist. In order to overcome this drawback one intro- special methods of summation of power series (the
duces the concept of a covering domain over M (a Borel expansion into a polynomial series converging in
Riemann surface in the case M = C), which is con- a maximal polygon (cf. Borel summation method), the
structed from the elements that are analytic continua- Mittag-Leffler series converging in a maximal star (cf.
tions of (Do,fo). An element (D, f) is called an ana- Star of a function element; Mittag-Leffler summation
lytic continuation of an element (Do,fo) if there exists a method), etc., the Riemann - Schwarz reflection method
finite chain of elements (Di , J;), i =0, ... , n, and (cf. Riemann - Schwarz principle), functional and dif-
corresponding connected components !1i in Di n Di + I ferential equations satisfied by a function (e.g. the
such that (Dn,fn)=(D,j) and (Di,J;), (Di+I,J;+I) are equation f(z + 1) =z f(z) for the gamma-function, con-
direct analytic continuations of each other through !1i. ditions of periodicity, evenness, symmetry, etc.), and
One says that the holomorphic function fa, initially analytic expressions in terms of known functions.
defined in the domain Do, continues analytically to a The subject of analytic continuation also comprises
point z EM if there exists an analytic continuation studies on the relation between the initial element of an
(D, f) of (Do,fo) such that z ED. One introduces an analytic function (a Taylor series) and the properties of
equivalence relation among the elements which are con- the complete analytic function generated by this element
tinuations of fo to the point z: (D',f}-(D",f") if [1]. Results have been obtained on singular points (cri-
zED' n D" and / =/' in a neighbourhood of z. On teria for singular points, the Hadamard theorem on pro-
the set of equivalence classes Df (for all possible z) ducts, the Fabry theorem on quotients) and on singular
there is a natural way of introducing the topology and curves (theorems on gaps and non-extend ability beyond
complex structure of a covering domain over M. The the boundary of the disc of convergence, e.g. the
function fo is lifted to Df in a natural manner (its value Hadamard theorem on gaps, the Fabry theorem on
on the equivalence class at z containing (Do,fo) is set gaps, etc.), theorems on over-convergence and on rela-
equal to fo(z ; it continues analytically to all of Df tions between analytic continuation of a power series
and, in the sense defined above, it does not continue to and properties of an entire function which defines its
any boundary point of Df over M. coefficients, problems of meromorphic continuation,
If M is the complex plane C or, more generally, the meromorphic continuation by Pade approximation, etc.
complex space en, n ~ 1, this process of analytic con- The field of analytic continuation also includes
tinuation can be described more simply. A canonical theorems on removable singularities (removability of an
element is a pair (Da,fa) where a Ecn and fa is a power . isolated singularity of a bounded holomorphic function,
series centred at the point a with a non-empty domain removability of a rectifiable singular curve under condi-
of convergence Da. A canonical element (Db,fb) is an tions of continuity, etc.), as well as a large class of
analytic continuation of (Da,fa) along a path theorems on the simultaneous continuation of holo-
y: [0, l]~en if there exists a family of canonical ele- morphic functions of several complex variables. The
ments (D,,!t), O~t~l, with centres a,=z(t), such that space en, n> 1, comprises domains for which every
(Do,fo)=(Da,fa), (D1,f1)=(Db,fb), and for any toEY holomorphic function can be extended to a larger
the elements (D,,!t) are direct analytic continuations of domain (this phenomenon is absent in the one-
(D'o,!to) for all t sufficiently near to to. The family dimensional case). It is therefore an important problem
(D,,!t) is in fact uniquely determined. If y" O~'T~ 1, is in the theory of analytic continuation of functions of
a continuous family of paths in Cn with common end several complex variables to describe these larger
points a and b, and if (Da,fa) continues analytically domains - the so-called envelopes of holomorphy. Thus,
along each y" then the result (Db,fb) does not depend there are descriptions of the hulls (envelopes) of holo-
on 'T (the monodromy theorem). In the case of en the morphy for Hartogs domains, n-circular and tube
canonical elements (Da,fa) obtained by analytic con- domains, theorems on the removability of compact
tinuation along all possible paths in en become the singularity sets and of singularity sets of codimension
points of Df ; fa is lifted to Df and continues analyti- ~2, the Bogolyubov theorem on the 'edge-of-the-wedge'
cally throughout Df to a holomorphic function f, while and Vladirnirov's theorem on C-convex hulls [3] (cf.
Df is the domain of holomorphy of f Holomorphic envelope; Hartogs domain; Tube domain).
This general process of analytic continuation is not A number of effective methods for the construction of
very effective in practice, and for this reason many spe- envelopes of holomorphy are available [3].
cial methods of analytic continuation are employed. The problem of analytic continuation of functions of
These include various analytic representations: a real variable can be reduced to that of holomorphic

167
ANALYTIC CONTINUATION

functions, since for any domain G eRn and any func- [2) SHABAT, B.V.: Introduction to complex analySiS, Moscow, 1969,

tion f an~lytic in G there are a domain D c en and a Chapt. 3 (in Russian).

~unction f holomorphic in D such that D Rn = G andn Editorial comments.


E.D. Solomentsev

flc=j
References
References [A1] GUNNING, R.C. and ROSSI, H.: Analytic functions of several
[I) BIEBERBACH, L.: Analytische Fortsetzung, Springer, 1955. complex variables, Prentice-Hall, 1965.
[2] MARKUSHEVICH, A.I.: Theory offunctions of a complex variable,
1-3, Chelsea, 1977 (translated from the Russian). AMS 1980 Subject Classification: 53A04, 32C25
[3] VLADIMIROV, V.S.: Methods of the theory offunctions of several
complex variables, M.LT., 1966 (translated from the Russian). ANALYTIC DIFFERENTIAL - See Differential on a
E.M. Chirka Riemann surface.
Editorial comments.
AMS 1980 Subject Classification: 14H05, 30F30
References
[A 1] HORMANDER, L.: Introduction to complex analysis in several
variables, v. Nostrand, 1966. ANALYTIC EXPRESSION (formula) - The totality of
[A2) GRAUERT, H. and FRlTZSCHE, K.: Several complex variables, operations to be performed in a certain sequence on the
Springer, 1976 (translated from the German). value of an argument and on the constants in order to
AMS 1980 Subject Classification: 30840, 32A30 obtain the value of the function. Every function in one
unknown x with not more than a countable number of
ANALYTIC CURVE, analytic arc - A curve K in an discontinuities has an analytic expression A (x) involv-
n-dimensional Euclidean space Rn, n;;:;' 2, which has an ing only three operations (addition, multiplication,
analytic parametrization. This means that the coordi- passing to the limit by rational numbers), performed
nates of its points x = (x I, . . . , x n ) can be expressed as not more than a countable number of times, starting
analytic functions of a real parameter x, =x,(t), from an argument x and from the constants, e.g.
i = I, ... , n, a ~ t ~ /3, i.e. in a certain neighbourhood
_ 00 n x2n+1
of each point to, a~to~f3, the functions x,(t) can be SIllX - n~o( -I) (2n + I)!'
represented as convergent power series in t-to, and
the derivatives x;(t 0), i = I, ... , n, do not simultane- If there is at least one analytic expression describing a
ously vanish at any point of the segment [a,f3l. This given function, there are infinitely many such expres-
last condition is sometimes treated separately, and an sions. Thus, the function which is identically equal to
analytic curve which satisfies it is called a regular ana- zero is expressed by the series
lytic curve. An analytic curve is called closed if 0=
00
~xx-n+1
n I( )
x;(a)=x;(f3), i=l, ... ,n. n=1 n!
An analytic curve in the plane C=C 1 of the complex and from any analytic expression A (x) it is always pos-
variable z = x I + ix 2 can be represented as a complex- sible to obtain another one which is identically equal to
analytic function of a real parameter z = j(t), a~t~/3, the first:
j'(t)=F0 on [a, 13], If the analytic curve is located in a
domain DeC, then a conformal mapping of D into A(x)+B(x) [n~1 xn-I~~-n) +11,
any domain will also yield an analytic curve. If the set
of intersection points of two analytic curves is infinite, where B (x) is an arbitrary analytic expression.
these analytic curves coincide. References
In general, in a complex space en, n;;:;' I, the complex [I] LUZIN, N.N.: Theory of functions 0/ a real variahle, Moscow,
coordinates Z; of the points of an analytic curve can be 1948 (in Russian).
B. V Kutuzov
represented as analytic functions of a real parameter
AMS 1980 Subject Classification: 26A09
z,=z,(t), a~t~/3, i=l, ... ,n. It should be noted,
however, that if n> 1, the term 'analytic curve' may
sometimes denote an analytic surface of complex ANALYTIC FUNCfION - A function that can be
dimension one. locally represented by power series. The exceptional
On a Riemann surface S an analytic curve K can be importance of the class of analytic functions is due to
represented as j(t)=If;(<P(t)), where Z=If;(P) is a local the following reasons. First, the class is sufficiently
uniformizing parameter of the points P on Sand j (t) large; it includes the majority of functions which are
is an analytic function of a real parameter in a neigh- encountered in the principal problems of mathematics
bourhood of any point to E[a, /3]. and its applications to science and technology.
Secondly, the class of analytic functions is closed with
References
[1] MARKUSHEVICH, A. I.: Theory of/unctions of a complex variah/e.
respect to the fundamental operations of arithmetic,
1. Chelsea, 1977 (translated from the Russian). algebra and analysis. Finally, an important property of

168
ANAL YTIC FUNCTION

an analytic function is its uniqueness: Each analytic sense of complex analysis or in the sense of C) at z;
function is an 'organically connected whole', which A =/ (z) is the derivative of f at z, and
represents a 'unique' function throughout its natural Afj,z = j'(z)dz = df(z)
domain of existence. This property, which in the 18-th
century was considered as inseparable from the very is its differential at that point. A function f which is
notion of a function, became of fundamental signifi- differentiable at every point of D is called differentiable
cance after a function had come to be regarded, in the in the domain D.
first half of the 19-th century, as an arbitrary One may compare the concepts of differentiability of
correspondence. The theory of analytic functions ori- f considered as a function of two variables (in the sense
ginated in the 19-th century, mainly due to the work of of R2) and in the sense of C. In the former case the dif-
A.L. Cauchy, B. Riemann and K. Weierstrass. The ferential dfhas the form
'transition to the complex domain' had a decisive effect 11
ax dx+ 11
ay dy,
on this theory. The theory of analytic functions was
constructed as the theory of functions of a complex where
variable; at present (the 1970's) the theory of analytic J!...
ax = ~+i~ lL
ax ax 'ay = ~+i~
ay ay ,
functions forms the main subject of the general theory are the partial derivatives of f Passing from the
of functions of a complex variable. independent variables x, y to the variables z, Z, which
There are different approaches to the concept of may formally be considered as new independent vari-
analyticity. One definition, which was originally pro- ables, related to the old ones by the equations
posed by Cauchy, and was considerably advanced by z=x+ry, z=x-ry (from this point of view, the func-
Riemann, is based on a structural property of the func- tion f may also be written as f (z, Z)) and expressing dx
tion - the existence of a derivative with respect to the
and dy in terms of dz and di according to the usual
complex variable, i.e. its complex differentiability. This
rules of differential calculus, one can write df in its
approach is closely connected with geometric ideas.
complex form:
Another approach, which was systematically developed
by Weierstrass, is based on the possibility of represent- dif = Jf.. dz + lL di
ing functions by power series; it is thus connected with az az'
the analytic apparatus by means of which a function where
can be expressed. A basic fact of the theory of analytic
functions is the identity of the corresponding classes of Jf.. = 1. [J!... -
az 2 ax ilL] lLaz = 1. [J!...ax + ilL]
ay' 2 ay'
functions in an arbitrary domain of the complex plane.
Exact definitions are given below. Let D be a domain are the (formal) derivatives of f with respect to Z and Z,
in the complex plane C. If to each point Z ED there has respectively. It is seen, accordingly, that f is differenti-
able in the sense of C if and only if it is differentiable in
been assigned some complex number w, then one says
the sense of R2 and if the equation af / az=o is satis-
that on D a (single-valued) function f of the complex
variable Z has been defined and one writes: w = f (z), fied, which in expanded form may be written as
ZED (or f:D~C). The function w=f(z)=f(x+iy) ~-~ ~--~
may be regarded as a complex function of two real ax - ay' ay - ax .
variables x and y, defined in the domain D CR2 (where If f is differentiable in the sense of C in D, the latter
R2 is the Euclidean plane). To define such a function relations are satisfied at all point of the domain; they
is tantamount to defining two real functions are called the Cauchy- Riemann equations. These equa-
tions occurred already in the 18-th century in J.L.
U = 4>(x,y), v = l/J(x,y), {x,y)ED (w = u+iv).
d'Alembert's and L. Euler's studies on functions of a
Having fixed a point Z ED, one gives Z the increment complex variable. The initial definition may now be
Az =.ix + i Ay (such that Z + Az E D) and considers the rendered more precise as follows. A function f, defined
corresponding increment of the function f in a domain D, is said to be holomorphic (analytic) at a
!!.f{z) = fez + fj,z) - f(z). point Zo ED if there exists a neighbourhood of this
point in which f may be represented by a power series:
If
!!.f{z) = A fj,z + o(fj,z)
fez) = ao +a1 (z - Zo)+ ... +an{z - ZoY + ....
as Az ~o, or in other words, if If this requirement is satisfied at every point Z 0 of D,
lim MEl =A the function f is said to be holomorphic (analytic) in the
&~o fj,z domain D.
exists, the function f is said to be differentiable (in the A function f which is holomorphic at a point Zo ED

169
ANALYTIC FUNCTION

is differentiable at that point. In addition, the sum of a formula, which expresses the values of an analytic func-
convergent power series has derivatives of all orders (is tion inside a domain in terms of its values on the boun-
infinitely differentiable) with respect to the complex
variable z; the coefficients of the series may be
dary:
1
fez) - -2' J- 1niill!R
,zED.
7TI D ~-z
expressed in terms of the derivatives of f at Zo by the
formulas an = j<n)(zo) / n!. The power series, written in Here, D is a domain whose boundary aD consists of a
the form finite number of non-intersecting rectifiable curves (the
l(zo) j<n)(zo);, orientation of aD is assumed to be positive with respect
f(zo)+-l-! -(z-zo)+ ... + n! (z-zo)+"', to D), and L. is a function which is analytic in some
domain G-::JD=D UaD. This formula makes it possi-
is known as the Taylor series of f at zoo Thus, holomor-
ble, in particular, to reduce the study of many prob-
phy of a function f in a domain D means that it is
lems connected with analytic functions to the
infinitely differentiable at any point in D and that its
corresponding problems for a very simple function -
Taylor series converges to it in some neighbourhood of
the Cauchy kernell/(t-z), tEaD, ZED. For more
this point.
details see Integral representation of an analytic func-
On the other hand, the following noteworthy fact is
tion.
established in the theory of analytic functions: If a
A very important property of analytic functions is
function f is differentiable in a domain D, it is holo-
expressed by the following uniqueness theorem: Two
morphic in this domain (for a single point this state-
functions which are analytic in a domain D and which
ment is not true: f (z) = 1 Z 12 = zz is differentiable at
coincide on some set with a limit point in D, coincide
Zo =0, but is nowhere holomorphic). Thus, the concepts
throu.ghout D (are identical). In particular, an analytic
of complex differentiability and holomorphy of a func-
functIOn f (z), z ED, which is not identically zero can
tion in a domain are identical; each one of the follow-
only have isolated zeros in D. If, in addition, z 0 is a
ing properties of a function f in a domain D - dif-
zero of f, then one has, in some neighbourhood U(zo)
ferentiability in the sense of C, differentiability in the
of zo, f(z)=(z-zo)"g(z), where p~l is a natural
sense of R2 together with satisfaction of the
number (called the multiplicity of the zero of f at zo),
Cauchy - Riemann equations, holomorphy - may
while g(z) is a analytic function in U(zo).
serve as definition of analyticity of f in this domain.
An important role in the theory of analytic functions
One other characteristic of an analytic function is
is played by the points at which the function is not
connected with the notion of an integral. The integral
analytic - the so-called singular points of the analytic
of a function f=cf>+il[; along an (oriented rectifiable)
function. Here, only isolated singular points of (single-
curve f: z=z(t), tE[a,p), may be defined by the for-
valued) analytic functions are considered; for more
mula:
f3 details cf. Singular point. If f is an analytic function in
[fez )dz = [fIz(t)]z' (t) dt an annulus of the form 0< 1 z -Zo 1 <p, it may be
expanded there in a Laurent series
or by means of a curvilinear integral: +oc

[f(z)dz = [[cf>dx -""dy]+i [[""dx +cf>dy].


fez) = ~ an{z - Zo)",
n= -00

which contains, as a rule, not only positive but also


A key result in the theory of analytic functions is
negative powers of z - zoo If there are no terms with
Cauchy's integral theorem: If f is an analytic function in
negative powers in the series (an =0 for
a domain D, then Jrf (z) dz = 0 for any closed curve
n = -1, -2, ... ), Zo is called a regular point of f (a
reD bounding a domain inside D. The converse
removable singular point). At a regular point there also
result (Morera's theorem) is also true: If f is continuous
exists a finite limit
in a domain D and if frf (z) dz = 0 for any such curve
r, then f is an analytic function in D. In particular, in limf(z)
.?~:o
= ao;
a simply-connected domain, those and only those con-
tinuous functions f are analytic, whose integral along Putting f(zo)=aQ, one obtains an analytic function in
any closed curve reD is zero (or. which is the same the whole disc z - Zo <po If the Laurent series of the
1 1

function contains only a finite number of terms with


thing, the integral along any curve r connecting two
negative powers of z - z 0:
arbitrary points z 1,22 ED does depend only on the
points z I and Z2 themselves and not on the shape of
00

fez) = ~an(z -zo)". /.I. < 0, a~ =1= 0.


the curve). This characterization of analy'tic functions
forms the basis of many of their applications. the point Zo is called a pole of f (of multiplicity fL); a
Cauchy's integral theorem yields Cauchy's integral pole z 0 is characterized by

170
ANAL YTIC FUNCTION

limf(z) = 00. The simplest class of analytic functions consists of


Z---+-Zo
the functions which are holomorphic in the whole
The function f has a pole of multiplicity j.t at the point
plane; such functions are called entire functions. Entire
Z 0 if and only if the function 1 / f has a zero of multi-
functions are represented by series
plicity j.t at this point. If the Laurent series contains an ao +alz + ... +anz n + ... which are convergent in
infinite number of negative powers of Z - Zo (an:;;t=O for
the whole plane. This class includes the polynomials in
an infinite set of negative indices n), then Zo is called
z, the functions
an essential singular point; at such points there is no
finite and no infinite limit for f The coefficient a ~ I in eZ = 1+-+-+
Z
...
Z2 zn
+ - + ...
the Laurent series for f with centre at the isolated I! 2! n!
singular point Z 0 is called the residue of f at z 0 : . Z3 z2n+1
SlllZ = z--+ ... +(-1)" + ...
3! (2n+l)!
a_I = res[f(z);z=zoJ. z2n
It can be defined by the formula
cosz = Z2
1 - - + ... +(-1)"--+ ...
2! (2n)!'

res[f(z);z=zol = ~ jf(z)dz, etc.


7Tl r
Weierstrass' theorem states that, for any sequence of
where y={z: I Z-Zo I =p} and p>o is sufficiently complex numbers an, n = 1,2, ... , without limit points
small (so that the disc I Z-Zo I,,;;;p does not contain in C, there exists an entire function F that vanishes at
singular points of f other than zo). The important role the points an and only at these points (among the an
of residues is made clear by the following theorem: If f there may be coincident points, to which correspond
is an analytic function in a domain G, except for some zeros of F of corresponding multiplicity). Here, the
set of isolated singular points, if reG is a contour function F may be represented as a (generally infinite)
bounding a domain D C G and not passing through any product of entire functions each one of which has only
singular points of J, and if z I, . . . ,Zn are all the singu- one zero.
lar points of f inside D, then Functions that are meromorphic in the plane (i.e.
that may be represented as quotients of entire func-
/f(z )dz = 2'7Tik~/es[f(z); z =zd tions) are called meromorphic functions. These include
rational functions, tgz=sinz /cosz,
This theorem is an effective tool in calculating
ctgz =cosz / sinz, elliptic functions, etc.
integrals. See also Residue of an analytic function.
According to Mittag-Lejjler's theorem, for any
The sum of the terms of the Laurent series for f at
sequence fin EC, n = 1,2, ... , without limit points in
z 0 corresponding to the negative indices n,
C, there exists a meromorphic function G with poles at
-I
~ an(z -zo)", the points fin and only at those points, such that its
n= -00
principal parts at the points fin coincide with pre-
is known as the principal part of the Laurent series (or assigned polynomials in 1/ (z - fin). The function G
of the function 1> at the point zo0 This principal part may be represented as a (usually infinite) sum of mero-
determines the nature of the singularity of f at z o. morphic functions, each one with a pole at a single
Functions which are representable as a quotient of point only.
two functions that are holomorphic in a domain Dare Theorems on the existence of a holomorphic function
called meromorphic in the domain D. A function which with pre-assigned zeros and of meromorphic functions
is meromorphic in a domain is holomorphic in that with pre-assigned poles and principal parts are also
domain, except possibly at a finite or countable set of valid for an arbitrary domain DeC.
poles; at the poles the values of a merom orphic func- In the study of analytic functions the related
tion are considered to be infinite. If such values are geometric notions are also of importance. If f: D~C is
allowed, then meromorphic functions in a domain D an analytic function, the image feD) of the domain D
may be defined as functions that in a neighbourhood of is also a domain (principle of preservation of domains).
each point zoED can be represented by a Laurent If / (z 0 ):;;t=0, the mapping f preserves angles at z 0 both
series in z - z 0 with a finite number of terms involving in value and in sign, i.e. it is conformal. Thus, there
negative powers of Z - z 0 (depending on z 0) in a neigh- exists a close connection between analyticity and the
bourhood of each point Zo ED. important geometric notion of conformal mapping. If f
Both holomorphic and merom orphic functions in a is an analytic function in D and f(z'):;;t=f(z") for
domain D are often designated as analytic in the domain z' :;;t=z" (such functions are called univalent), then
D. In this a case holomorphic functions are also said to / (z):;;t=O in D and f defines a one-to-one and conformal
be regular analytic or simply regular functions. mapping of the domain D onto the domain feD).

171
ANALYTIC FUNCTION

Riemann's theorem, which is the fundamental theorem primitive and the construction of an analytic function
in the theory of conformal mappings, says that on any with a given real part in multiply-connected domains
simply-connected domain whose boundary contains (cf. Multiply-connected domain), the solution of alge-
more than one point there exist univalent analytic func- braic equations with analytic coefficients, etc.) require
tions which conformally map this domain onto a disc the study of multi-valued functions; such functions
or a half-plane (cf. Conformal mapping; Univalent func- include Z I / n, In z, arctg z, arcsin z, algebraic functions,
tion). etc. (cf. Algebraic function).
The real and imaginary parts of a function f = </> + i1P A regular process which yields the complete analytic
which is holomorphic in a domain D satisfy the function, considered throughout its natural domain of
Laplace equation in that domain: existence, was proposed by Weierstrass; it is known as
Weierstrass' method of analytic continuation.
ft
2+
f t -- 0, f i+f i - 0, The initial concept is that of an element of an ana-
ax 2
2 -
ax ay2 ay
lytic function, viz. a power series with a non-zero
i.e. they are harmonic functions (cf. Harmonic func- radius of convergence. Such an element Wo:
tion). Two harmonic functions which are connected by
aO+al(z-zO)+ ... +an(z-zo)"+ ... ,
the Cauchy - Riemann equations are called conjugate.
In a simply-connected domain any harmonic function </> defines a certain analytic function f on its disc of con-
has a conjugate function 0/ and is thus the real part of vergence Ko. Let z\ be a point of Ko different from zo0
some holomorphic function finD. Expanding f in a series with centre at z I, one obtains a
The connections with conformal mappings and har- new element WI:
monic functions form the basis of many applications of bo+bl(z-Zl)+ ... +bn(Z-ZI)n+ ... ,
the theory of analytic functions. whose disc of convergence will be denoted by K I. In
A functionf(z), ZEE (ECC being an arbitrary set), the intersection of Ko and KI the series WI converges
is called analytic at a point ZOE E if there exists a neigh- to the same function as the series W o. If K I extends
bourhood of this point such that f may be represented beyond the boundary of Ko, the series WI defines the
by a convergent power series on the intersection of this function determined by Wo on some set outside Ko
neighbourhood with E. The function f is called analytic (where the series Wo is divergent). In such a case the
on the set E if it is analytic on some open set which element W \ is called a direct analytic continuation of
contains E (or, more exactly, if there exist both an the element W 0 0 Let W o, . .. , WN be a chain of ele-
open set containing E and an analytic function F on ments in which Wn + I is a direct analytic continuation
this set which coincides with f on E). For open sets the of Wn (n =0, ... , N -1); the element W N is then said
notion to analyticity coincides with the notion of dif- to be an analytic continuation of the element Wo (by
ferentiability with respect to the set. However, this is means of the given chain of elements). When the centre
not the case in general; in particular, on the real line of the disc KN belongs to Ko it may happen that the
there exist functions which not only have a derivative, element WN is not a direct analytic continuation of the
but which are infinitely differentiable at every point element W 0 0 In such a case the sums of the series Wo
and are not analytic even at a single point of this line. and WN will have different values in the intersection of
The property of connectedness of the set E is necessary Ko and K N; thus analytic continuation may lead to
in order that the uniqueness theorem for analytic func- new values of the function inside K o.
tions holds. This is why analytic functions are usually The totality of all elements which may be obtained
considered in domains, i.e. on connected open sets. by analytic continuation of an element Wo forms the
All the preceding refers to Single-valued analytic func- complete analytic function (in the sense of Weierstrass)
tions f, considered in a given domain D (or on a given generated by Wo; the union of their discs of
set E) of the complex plane. In considering the possible convergence represents the (Weierstrass) domain of
extension of a function f, as an analytic function, to a existence of this function. It follows from the unique-
larger domain, one arrives at the concept of the ana- ness theorem for analytic functions that an analytic
lytic function considered as a whole, i.e. throughout its function in the sense of Weierstrass is completely deter-
whole natural domain of existence. If the function is mined by the given element W 0 The initial element
0

thus extended, its domain of analyticity becomes larger, may be any other element belonging to this function;
and may overlap itself, supplying new values of the the complete analytic function will not be affected.
function at points in the plane where it already was A complete analytic function f, considered as a func-
defined. Accordingly, an analytic function considered tion of the points in the plane belonging to its domain
as a whole is generally multi-valued. Many problems in of existence D, is generally multi-valued. In order to
analysis (inversion of a function, the determination of a eliminate this feature, f is considered not as a function

172
ANALYTIC FUNCTION

of the points in the plane domain D, but rather as a bois af / aZk and af / aZk are defined as in the case of
function of the points on some multi-sheeted surface R the plane. If df is of the form
(lying above D) such that to each point of D there
correspond as many points of the surface R (projecting df = ~
nxdZ b
k=1 aZk
onto the given point of D) as there are different ele-
i.e. is a complex linear function in dz \, ... , dz n , the
ments with centre at this point in the complete analytic
function f is said to differentiable in the sense of en or
function f; on the surface R the function f becomes
holomorphic or analytic in the domain D.
single-valued. The idea of passing to such surfaces is
Thus, the condition of holomorphy of f in a domain
due to Riemann, and the surfaces themselves are
known as Riemann surfaces. The abstract definition of
D c en consists of the condition that it be differentiable
in the sense of R2n and that it satisfy the system of n
the notion of a Riemann surface has made it possible
to replace the theory of multi-valued analytic functions complex equations af /azk=o (k=l, ... ,n), which is
by the theory of single-valued analytic functions on equivalent to the system of 2n first-order partial dif-
Riemann surfaces (cf. Riemann surface). ferential equations
Now fix a domain d belonging to the domain of
existence D of the complete analytic function f, and fix
aXk --a'
~ Jf.. ~
a --
!Yk!Yk
_~ - ... ,n)
a (k-l,
Xk

some element W of f with centre at a point in d. The (Cauchy- Riemann system).


totality of all elements which may be obtained by ana- In the case of space (n > 1) as distinct from that of
lytic continuation of W by means of chains with cen- the plane (n = 1) this system is overdetermined, since
tres belonging to d is called a branch of the analytic the number of equations is larger than that of the
function f A branch of a multi-valued analytic function unknown functions. It remains overdetermined on pass-
may tum out to be a single-valued analytic function in ing to the (geometrically more natural) spatial analogue
the domain d. Thus, arbitrary branches of the functions of a holomorphic function of one complex variable - a
Z I / n and In Z which correspond to an arbitrary simply- holomorphic mapping f, which is realized by a system
connected domain not containing the point 0, are f =ifl, . . . ,In) of n functions fk which are holo-
single-valued functions. The function Z I / n has exactly morphic in a domain D C en . The mapping
n different branches in such a domain, while Inz has an f: D~G =f (D) is called biholomorphic if it is one-to-
infinite set of such branches. The selection of single- one and if it is holomorphic together with its inverse
valued branches (using some cuts in the domain of f- I : G~D. The conditions for holomorphy of a map-
existence) and their study by the theory of single- ping f: D~en are expressed by a system of 2n 2 real
valued analytic functions constitute one of the princi- equations involving 2n real functions. The overdeter-
pal methods of studying specific multi-valued analytic minacy of the conditions of holomorphy for n > 1 is the
functions. cause of a number of effects typical of the spatial case
A.A. Gonchar
- such as the absence of a spatial analogue of the
Analytic functions of several complex variables. The Riemann theorem on the existence of conformal map-
complex space en, conslstmg of the points pings. According to Riemann's theorem, if n = 1, any
Z=(Zb ... ,zn), Zk=Xk+ryb is a vector space over the two simply-connected domains whose boundaries do
field of complex numbers with the Euclidean metric not reduce to a single point are isomorphic. However, if
n > 1, even such simple simply-connected domains as
Iz =
1 [k~IIZk 12)1/2, C = C. the ball {I Z 1< I} and the product of discs (polydisc)
It differs from the 2n-dimensional Euclidean space R2n
{ I z. I < 1: p= 1, ... , n} are non-isomorphic. The
non-isomorphism is brought to light on comparing the
by a certain asymmetry: on passing from R2n to en (i.e.
groups of automorphisms of these domains (i.e. their
on introducing a complex structure in R2n), the coordi-
biholomorphic mappings onto themselves, cf. Biholo-
nates are subdivided into pairs which appear in the
morphic mapping) - the groups prove to be algebrai-
complex combinations Zk =Xk + iYk.
cally non-isomorphic, whereas a biho10morphic map-
If a complex function f=cp+j,f is defined in a
ping of one domain onto another, if it existed, would
domain D ce n and is differentiable at all points ZED
establish an isomorphism of these groups. Owing to
in the sense of R2n (i.e. as a function of the 2n real
this, the theory of biholomorphic mappings of domains
variables Xk and Yk), its differential may be represented
in complex space is substantially different from the
in the form
theory of conformal mappings in the plane.
A function lis called holomorphic at a point a E en if
it is holomorphic in some neighbourhood of this point.
According to the Cauchy - Riemann criterion, a func-

173
ANALYTIC FUNCTION

tion of several variables which is holomorphic at a may be locally described as sets of common zeros of
point a is holomorphic with respect to each variable (if polynomials in the variable Zn, with coefficients from
the values of the other variables are fixed). The con- the ring of holomorphic functions in the other variables
verse proposition is also true: If, in a neighbourhood of 'z. This circumstance permits extensive use of algebraic
some point, a function f is holomorphic with respect to methods in the local study of analytic sets.
each variable separately, then it is holomorphic at this Cauchy's integral theorem must also be slightly
point (Hartogs' fundamental theorem). modified in the spatial case, and is then known as the
In analogy with the case of the plane the holomor- Cauchy-Poincare theorem: Let a function f be holo-
phy of a function f at a point a = (a I, . . . ,an) is morphic in a domain D cen; then, for any (n + 1)-
equivalent to its expandability in a multiple power series dimensional surface G compactly imbedded in D, with
in a neighbourhood of this point piecewise-smooth boundary aG,
I(z) = ~Ckldz,-a,/l "'(zn-an/", /!dz = o.
k,;;'O

or, in abbreviated notation, As in the planar case, this integral is defined by a


parametric representation of the given set: If aG has
I(z) = ~ ck(z-ai, the equation Z = Z (t), where the parameter
jk j;;.o
t=(tl, ... ,tn } varies over an n-dimensional cell
where k=(k l , ,kn } is a multi-index of integers
r={ak"$:;;t~f3k: k=I, ... ,n}, then, by definition,
k.-;;:.O, I k I =kl + ... +kn and
(z-ai = (z,-a,)k l (zn-an)k". lI
G
dz = f.
"
a(z" ... ,zn)
JIz(t)] a(
t" ... ,tn
) dt, ... dtn

A holomorphic function is infinitely differentiable, and The difference between the spatial and the planar cases
the above series is its Taylor series, i.e. consists in the fact that in the former case the dimen-
I akl + ... H" [ sion of the surface G is less than that of the domain D,
=
az7' ... aZ~"'
Ck
k,!'" kn ! while in the planar case the dimensions are the same
the derivatives being taken at the point a.
(n + I =2n).
The spatial analogue of the Cauchy integral formula
The fundamental facts of the theory of holomorphic
can be written in a particularly simple form for poly-
functions of one variable extend to holomorphic func-
cylinder domains, i.e. for products of plane domains.
tions of several variables, sometimes in an altered form.
Let D=DI X ... XDn be a domain in which Dk is a
An instance of this is the Weierstrass preparation
domain in the complex plane with a piecewise-smooth
theorem (d. Weierstrass theorem), which extends the
boundary aDk (k = I, ... ,n), while the function f is
property of holomorphic functions of one variable to
holomorphic in a domain which compactly contains D.
become zero as an integral power of z -a to the spatial
Successive application of Cauchy'S formula for one
case. The theorem is formulated as follows: If a func-
variable then yields, for any point zED,
tion /=1=0 which is holomorphic at a point a Een is
equal to zero at that point, then it may be represented,
in a certain neighbourhood U (a) (possibly after a non-
I(z) f
= (2~i)" [f~2:~,
degenerate linear transformation of the independent where r = aD I X ... X aDn is an n-dimensional surface
variables) in the form in the boundary aD, f=(fl, ... ,fn), and
d~ _ d~, ... d~n
I(z) = {(zn -ant +c,('z)(zn -an)k-' + ... +Ck('Z)}CP(Z),
~-z - (~-z,) (~n-zn)'
where k;;.l is an integer, c v (1' = I, ... ,k) are func-
However, polycylinder domains are only a very special
tions of 'z = (z j, . . . ,Zn -I) which are holomorphic in a
class, and in general domains such a separation of vari-
neighbourhood of the point
ables is not possible. The role of Cauchy's integral for
'a = (al,'" ,an-dEen-1 (a 'prime' preceding a
arbitrary domains D C en with a piecewise-smooth
letter denotes the projection on the space of the first
boundary is played by the Martinelli - Bochner integral
n -1 coordinates) and which are equal to zero at a,
formula: For any f~ction f which is holomorphic in a
while <p is holomorphic and zero-free in U(a).
domain containing D, and for any point zED,
This theorem is of fundamental importance in the
study of analytic sets (cf. Analytic set), which are fez)
.
= (n -I)!
(27Ti)n
Jim 8(t-z)~~.
I ~-z I ~n
described locally, in a neighbourhood of each one of elf)

their points, as sets of common zeros of a certain where df=d~1 ... dL, and
number of functions which are holomorphic at this n

point. By Weierstrass' preparation theorem such sets 8(w) = ~(- I)' t 'w, dWI . dw, 1 dW"1 ... dw".
, 1

174
ANAL YTIC FUNCTION

This is Green's formula for a pair of functions, one of dimensional residues. This theory deals with the prob-
which is holomorphic in D, while the other is a funda- lem of computing the integral of a function f, which is
mental solution of the Laplace equation in the space holomorphic everywhere in a domain D C en except for
R2n with singular point r=z. If n = 1, this is the ordi- an analytic set M, over a closed n-dimensional surface
nary Cauchy integral. If n> 1, the formula differs from (J not intersecting M. Since the dimension of the singu-

Cauchy's multiple integral for a product of plane lar set M is lower than the dimension of D by at least
domains in that, first, the integration is not over an n- two, M does not subdivide D. If the surface (J is not
dimensional part of the boundary, but over the whole linked with M, i.e. bounds an (n + I)-dimensional sur-
(2n - 1)-dimensional boundary of the domain, and, face G compactly belonging to D \ M, then by the
secondly, its kernel (the factor multiplying f under the Cauchy - Poincare theorem f"f dz = O. In order to cal-
integral sign) does not depend analytically on the culate this integral in the general case, it is necessary to
parameter z. An analytic kernel, however, is essential in clarify how (J is linked with the singular set M, and to
a number of problems, and it is therefore desirable to calculate the integrals over special n-dimensional sur-
construct an integral formula with such a kernel for as faces associated with separate portions of the set M
large a class of domains as possible. An ample supply (residues).
of integral formulas, including formulas with an ana- The solution of this problem involves considerable
lytic kernel for many domains, is contained in the gen- topological and analytic difficulties. These may often be
eral Leray formula. This formula is overcome by the methods proposed by E. Martinelli
and J. Leray. The Martinelli method is based on the
fez) = (n -l)! (fm 8[wmld~ use of the topological Aleksander- Pontryagin duality
(27Ti)" tD <~-z, wm>n '
principle, and reduces the study of the n-dimensional
where W=(W1' . . . ,Wn ) is a smooth vector function homologies of the set D \ M to the study of the
depending also on z, dr
and 8 are defined above, and (n -I)-dimensional homologies of the singular set M.
<r-z,w>=:LZ=l(rk-zdwk; it is assumed that The Leray method is more general: it is based on the
<r-z,wm>=I=O for any fixed ZED and r running examination of special homology classes and on the
over aD. The value of the integral in this formula does calculation of certain differential forms (residue forms).
not depend on the choice of the vector function wen The multi-dimensional theory of residues has also
(provided that_<r-z,wm>=I=O for all ZED, rEaD), found applications in theoretical physics (cf. Feynman
and if wm = r- z,
this integral coincides with the integral).
Martinelli - Bochner integral. By varying the choice of The Osgood - Brown theorem reveals an important
w for different classes of domains, the Leray formula fundamental difference between the spatial and the
will yield various integral formulas. In the theory of planar theories. In the plane one can, for any domain
analytic functions of several variables other integral D, construct a function f which is holomorphic in D
representations, which are valid only for certain classes but which cannot be extended analytically beyond its
of domains, are also considered. An important class of boundary, i.e. D is the natural domain of existence. In
this kind consists of the so-called Wei! domains, which space the situation is different: thus, the spherical shell
are a generalization of the product of plane domains. { 1 / 2 < I z I < I} cannot be the domain of existence of
For such domains one has the Bergman - Weil any holomorphic function, since by the
representation with a kernel which also depends analyt- Osgood - Brown theorem, any function which is holo-
ically on the parameter. morphic in it will certainly extend analytically to the
As in the planar case, the study of the singularities of entire ball { I z I < I}.
analytic functions is of fundamental interest; the main Thus arises the problem of the characterization of the
difference between the planar and the spatial cases is natural domains of existence of holomorphic functions
expressed by the Osgood - Brown theorem on the - the so-called domains of holomorphy. A simple suffi-
removability of compact singularities, according to cient condition may be formulated with the aid of the
which any function f which is holomorphic in D \ K, concept of a barrier at a boundary point of the domain,
where D is a domain in cn (n > 1) and K is compact i.e. a function fl(z) which is holomorphic in this
subset of D which does not subdivide D, extends holo- domain and which increases without limit as z tends to
morphically to the whole domain D. By this theorem, r D C en will be a domain of holomorphy if it is possi-
holomorphic functions of several variables cannot have ble to construct a barrier for an everywhere-dense set
isolated singular points. These are replaced in en of points in its boundary. This condition is satisfied, in
(n > I) by singular sets which are analytic if their particular, by any convex domain; for any point rEaD
dimension is lower than 2n - 1. it is sufficient to select, in the (2n - 1)- dimensional
This fact is essential in the theory of multi- supporting plane to D at the point r, a (2n - 2)-

175
ANAL YTIC FUNCTION

dimensional plane of the form pseudo-convexity of a domain. .


n As regards domains D which are not dom~s. of
L\(z) = ~ak(Zk -Kk) = 0; holomorphy, there arises the problem of desc~bmg
k=l
their envelope of holomorphy, i.e. the smallest domam of
the function h = I / L r will then be a barrier. Conse- holomorphy to which any function holomorphic in D
quently, every convex domain in cn is a domain of
extends analytically. For domains of the simplest types
holomorphy. However, convexity is not a necessary
envelopes of holomorphy can be effectively constru~te?,
condition for holomorphy: A product of plane domains
but in the general case the problem is unsolvable W1t~
is always a domain of holomorphy, and such a product
the class of single-sheeted domains. Under analytIc
need not be convex. Nevertheless, if the notion of con-
continuation of functions beyond the boundary of a
vexity is suitably generalized, it is possible to arrive ~t
given domain D, multi-valuedness may result, which
a necessary and sufficient condition. One such generali-
can be avoided by introducing multi-sheeted covering
zation is based on the observation that the convex hull
domains over D, analogous to Riemann surfaces (cf.
of a set K eRn may be described as the set of points at
Riemann surface). In the class of covering domains, the
which the value of any linear function does not exceed
problem of constructing envelopes of holo~or:rhy .is
the supremum of the values of this function on K. By
always solvable. This problem also has applicatIOns m
analogy, the holomorphically convex hull of a set
theoretical physics, to wit in quantum field theory.
KeD e en is defined by
The transition from the plane to a complex space

Kc: = {ZED: If(z) I ~~~F Ifm 1,fE<'?(D+ substantially increases the variety of geometrical prob-
lems related to holomorphic functions. In particular,
such functions are naturally considered not only in
where @ (D) denotes the set of all functions which are
domains, but also on complex manifolds - smooth
holomorphic in D. A domain DeC" is called holo-
manifolds of even real dimension, the neighbourhood
morphicallJ: convex if, for every compact subset K of D,
relations of which are biholomorphic. Among these,
the hull K ('J also is a compact subset of D. Holo-
Stein manifolds (cf. Stein manifold) - natural generali-
morphic convexity is a necessary and sufficient ~ond~
tion for a domain of holomorphy. However, this cn- zations of domains of holomorphy - play a special
terion is not very effective, since holomorphic convexity role
Several problems in analysis may be reduced to the
is difficult to verify.
problem of constructing, in a given domain, a hol~
Another generalization is connected with the notion
morphic function with given zeros or a meromorphic
of a plurisubhannonic function, which is the complex
function with given poles and principal parts of the
analogue of a convex function. A convex function in a
Laurent series. In the plane case, these problems have
domain D of Rn may be defined as a function for
been solved for arbitrary domains by the theorems of
which the restrictions to the segments in D of the
Weierstrass and Mittag-Leffler and their generaliza-
straight lines x=xo+wt (where xO,wERn and t is a
tions. The spatial case is different - the solvability of
real parameter) are convex functions of t. ~ real fun~
the corresponding problems, the so-called Cousin prob-
tion cf>, defined and upper semi-continuous m a domam
lems, depends on certain topological and analytic pro-
D e en, is said to be plurisubharmonic in D if for each
perties of the complex manifolds considered:
complex line Z = zO + wt (zo, wEC", t is .a .co~plex
The key step in the solution of the Cousm problems
parameter), its restriction to the parts of th!s line I~ D
is a sub harmonic function of r
If cf> is tWIce contmu-
is to construct - starting from locally-defined func-
tions with given properties - a global function,
ously differentiable, then the condition of 'pluris~b~ar
defined on the whole manifold under consideration and
monicity, in accordance with the rules of dIfferentIatIOn
having the same local properties. Such kinds of con-
of composite functions, is that the Hermitian form
structions are very conveniently effected using the
n ~_
H(w, w) = ~ _ WjWk theory of sheaves, which arose from the algebrai~
j,k = 1 azJaZ k topological treatment of the concept of an analytIc
- the so-called Levi form - be non-negative. function, and which has found important applications
A domain D e en is called pseudo-convex if the func- in various branches of mathematics. The solution of the
tion In d(z, aD), where d(z, aD) denotes the Euclidean Cousin problems by methods of the theory of sheaves
distance from the point z ED to the boundary aD, is was realized by H. Cartan and J.-P. Serre.
plurisubharmonic in this domain. Pseudo-convexity .is B. V Shabat
also a necessary and sufficient condition for a domam The contemporary theory of analytic functions and their
to be a domain of holomorphy. generalizations. This is one of the most important
In some cases it is possible to verify effectively the branches of analysis. it is closely connected with quite

176
ANALYTIC FUNCTION

diverse branches of mathematics and it has numerous [8] FUKS, B.A.: Theory of anarytic functions of several complex vari-
applications in theoretical physics, mechanics and tech- ables, 1-2, Arner. Math. Soc., 1963-1965 (translated from the
Russian).
nology. [9] VLADIMIROV, V.S.: Methods of the theory offunctions of several
Fundamental investigations on the theory of analytic complex variables, M.I.T, 1966 (translated from the Russian).
functions have been carried out by Soviet mathemati- [lO] MARKUSHEVICH, A.I.: Notes on the history of the theory of ana-
rytic functiOns, Moscow-Leningrad, 1951 (in Russian).
cians. Extensive interest in the theory of functions of a [II] Mathematics in the USSR during 30 years, 1917-1947,
complex variable emerged in the Soviet Union at the Moscow-Leningrad, 1948, pp. 319-414 (in Russian).
begining of the 20-th century. This was in connection [12] Mathematics in the USSR during 40 years, 1917-1957, Vol. I,
Moscow, 1959, pp. 381-510 (in Russian).
with noteworthy investigations by Soviet scientists on
[l3] SHABAT, B.V.: Introduction to complex anarysis, 1-2, Moscow,
applications of the theory of analytic functions to vari- 1976 (in Russian).
ous problems in the mechanics of continuous media. [l4] BITSADZE, A.V.: Foundations of the theory of anarytic functions
N.E. ZhukovskiI and S.A. Chaplygin solved very of a complex variable, Moscow, 1972 (in Russian).

important problems in hydrodynamics and aerodynam- A.A. Gonchar


ics by using methods of the theory of analytic func- B. V Shabat
Editorial comments. Complex analysis, the theory of
tions. In the works of G.V. Kolosov and N.l. Muskhel-
analytic functions, has been an active field of research up to
ishvili these methods were applied to fundamental
the present time also in the West. Some of the history may
problems in the theory of elasticity. In subsequent
be found in the extensive article in the Encyclopaedia Bri-
years the theory of functions of a complex variable tannica entitled Analysis, complex.
underwent extensive development. The development of There is a large number of textbooks on complex analysis
various aspects of the theory of analytic functions was besides those listed above. Some good current books are
determined by the fundamental research of, among oth- listed below: [A1J - [A10J for the case of one variable, [A11J
ers, V.V. Golubev, N.N. Luzin, 1.1. Privalov and V.l. - [A 18J for the case of several variables.
Smirnov (boundary properties), M.A. Lavrent'ev Some final comments on complex analysis of several vari-
(geometric theory, quasi-conformal mappings and their ables are in order. The 'Osgood - Brown theorem on the
applications to gas dynamics), M.V. Keldysh, M.A. removability of compact singularities' is usually called the
Lavrent'ev and L.L Sedov (applications to problems in Hartogs extension theorem in the West. The modern proof,
due to L. Ehrenpreis, makes use of the inhomogeneous
the mechanics of continuous media), D.E. Men'shov
Cauchy - Riemann or a-equations. The so-called a-method,
(theory of monogeneity), M.V. Keldysh, M.A.
extensively developed by J.J. Kohn and L. Hbrmander, has
Lavrent'ev and S.N. Mergelyan (approximation theory),
become one of the three most powerful techniques available
LN. Vekua (theory of generalized analytic functions in complex analysis today. Among other things, it can be
and their applications), A.O. Gel'fond (theory of inter- used to obtain solutions of the Levi problem (are domains of
polation), N.N. Bogolyubov and V.S. Vladimirov holomorphy the same as pseudo-convex domains?) and the
(theory of analytic functions of several variables and its Cousin problems, cf. [A 15]. These fundamental problems
application to quantum field theory). The development had been settled previously by methods now belonging to
of the theory of analytic functions of one and several sheaf theory, and which go back to K. Oka (1936 - 1954),
complex variables and their generalizations is continu- and have culminated in the sheaf-cohomology theory of H.
ing. See Boundary properties of analytic functions; Cartan, J.-P. Serre, H. Grauert and others, cf. the elegant
Quasi-conformal mapping; Boundary value problems of treatment in [A12J. The third and most recent technique
analytic function theory; Approximation of functions of consists in the use of suitable integral representations as
developed by G.M. Henkin [G.M. KhenkinJ and E. Ramirez
a complex variable.
de Arellano, cf. [A14].
References References
[1] PRIWALOW. 1.1. [1.1. PRIVALOV]: Einfuhrung in die Funk- [A1] AHLFORS, L.V.: Complex analysis, McGraw-Hill, 1966.
tionentheorie, 1-3, Teubner, 1958-1959 (translated from the [A2] CONWAY, J.B.: Functions of one complex variable, Springer,
Russian). 1973.
[2] SMIRNOV, V.l.: A course of higher mathematics, 111-2, Addison- [A3] DINGHAS, A.: Vorlesungen uber Funktionentheorie, Springer,
Wesley, 1964 (translated from the Russian). 1961.
[3] MARKuSHEVICH, A. I. : Theory offunctions of a complex variable, [A4] HENRICI, P.: Applied and computational complex analysiS, 1-
1-3, Chelsea, 1977 (translated from the Russian). 3, Wiley, 1974-1986.
[4] LAVRENT'EV, M.A. and SHABAT, B.V.: Methoden der komplexen [AS] HILLE, E.: Analytic function theory, 1-2, Chelsea, reprint,
Funktionentheorie, Deutsch. Verlag Wissenschaft., 1967 1974.
(translated from the Russian). [A6] NEHARI, Z.: Conformal mapping, Dover, reprint, 1975.
[5] GoLUZIN, G.M.: Geometric theory offunctions of a complex [A7] NEVANILINNA, R.: Analytic functions, Springer, 1970
variable, Arner. Math. Soc., 1969 (translated from the Russian). (translated from the German).
[6] EVGRAFov, M.A.: A nary tic functions, Saunders, Philadelphia, [A8] NEvANLINNA, R. and PAATERO, V.: Introduction to complex
1966 (translated from the Russian). analysis, Addison-Wesley, 1969 (translated from the Ger-
[7] SVESHNIKOV, A.G. and TnrnONOV, A.N.: The theory offunc- man).
tions of a complex variables, Moscow, 1967 (in Russian). [A9] RUDIN, W.: Real and complex analysis, McGraw-Hili, 1974.

177
ANAL YTIC FUNCTION

!
00
[A 10] lITCHMARSH, E.C.: The theory offunctions, Oxford Univ. fez) = CkZ- k /",
Press, 1939. k=m
[A11] GRAUERT, H. and FRITZSCHE, K.: Several complex variables,
Springer, 1976 (translated from the German). where v> I; the number v-I is called the branching
[A 12] GRAUERT, H. and REMMERT, R.: Theory of Stein spaces, order. The branched elements generalize the concept of
Springer, 1979 (translated from the German). an element of an analytic function, which in this con-
[A13] GUNNING, R. and ROSSI, H.: Analytic functions of several
complex variables, Prentice-Hall, 1965.
nection is also called an unramified (for v= 1) regular
[A14] HENKIN, G.M. [G.M. KHENKIN) and LEITERER, J.: Theory of (for m ~O) element.
functions on complex manifolds, Birkhauser, 1984. As the simplest element (D, /) of an analytic function
[A 15] HORMANDER, L.: An introduction to complex analysis in
fez) of several complex variables Z=(Zb ... ,Zn), n>l,
several variables, North-Holland, 1973.
[A 16) KRANTZ, S.G.: Function theory of several complex variables, one can take a multiple power series
Wiley (Interscience), 1982. OC!

[A17] NARASIMHAN, R.: Several complex variables, Univ. of Chi- fez) = ~ ck(z-a)k = (2)
cago Press, 1971. Ik 1=0
[A18] RUDIN, W.: Function theory in the unit ball ofC n , Springer,
1980.
[A19] RANGE, R.M.: Holomorphic functions and integral represen-
tations in several complex variables, Springer, 1986. where a=(aJ, . . , ,an) is the centre,
AMS 1980 Subject Classification: 30-XX, 32-XX I k I = k I + ... + k n , Ck =Ck, . . . Ck.,

(z -al =(z -ad' ... (z -anl', and D IS some


ANALYfIC FUNCfION, ELEMENT OF AN - The
polydisc
collection (D, /) of domains D in the plane C of a com-
plex variable z and analytic functions f(z) given on D D = {ZEC": IZj-ajl<Rl'j=I, ... ,n}
by a certain analytic apparatus that allows one to effec- in which the series (2) converges absolutely. However,
tively realize the analytic continuation of f(z) to its for n > lone has to bear in mind that a polydisc is not
whole domain of existence as a complete analytic func- the exact domain of absolute convergence of a power
tion. The simplest and most frequently used form of an series.
element of an analytic function is the circular element The concept of an element of an analytic function is
in the form of a power series close to that of the germ of an analytic function.

fez) = ~ck(z-a)k (I) References


k =0
[I] MARKUSHEVICH, A.I.: Theory offunctions of a complex variable,
1-2, Chelsea, 1977 (translated from the Russian).
and its disc of convergence D={zEC: I z-a I <R} [2] VLADIMIROV, V.S.: Methods of the theory offunctions of several
with centre at a (the centre of the element) and radius of complex variables, M.I.T, 1966 (translated from the Russian).
convergence R >0. The analytic continuation here is E.D. Solomentsev
achieved by a (possibly repeated) re-expansion of the Editorial comments. For n> 1 the domain of absolute
series (1) for various centres b, I b-a I ~R, by formu- convergence of a power series is a so-called Reinhardt
las like OC! domain, cf. [A 1].
fez) = ~dn(z-b)" = Co+
n =0 References
[AI] HORMANDER, L.: An introduction to complex analysis in several
+[Cj(b -a)+Cj (z -b)]+ variables, North-Holland, 1973, Chapt. 2.4.
+[c2(b -a)2 +2C2(b -a)(z -b)+C2(Z -b)2]+ .... AMS 1980 Subject Classification: 30A05, 30610,
Anyone of the elements (D,/) of a complete analytic 30640, 32A30
function determines it uniquely and can be represented
by means of circular elements with centres a ED. In the ANALYTIC FUNCflONAL - An element f of the
case of the centre at infinity, a = 00, the circular ele- space H' (Q), the dual of the space H(Q) of analytic
ment takes the form functions defined on an open subset Q of en, i.e. a
functional on H(Q). Thus, a distribution with compact
f(Z) = ~CkZ-k support is an analytic functional. There exists a com-
k =0
pact set K C Q, said to be the support of the analytic
with domain of convergence D = {z EC: I z I > R}.
functional f, on which f is concentrated: For any open
In the process of the analytic continuation, f(z) may
set w ~ K the functional f can be extended to H (w) so
turn out to be multiple-valued and there may appear
that for all u EH (Q) the following inequality is valid:
corresponding algebraic branch points (Gf. Algebraic
branch point), that is, branched elements of the form If(u) I ~ Cwsup I u I,
-Yo where C w is a constant depending on w. There exists a
f(Z) = ~ck(z-a>""
measure p, with support in K such that
k =m

178
ANALYTIC GEOMETRY

f(u)=judp.. be the Cartesian orthogonal coordinates of the point M


OJ in the system Oxy. A point M with abscissa x and ordi-
An analytic functional is defined in a similar manner nate y is denoted by the symbol M(x,y). The coordi-
on a space of real-valued functions.
. -
nates of a Doint M clearlY determine its location with
M.l. Vottsekhovskit respect to the system Oxy.
Editorial comments. For applications to partial differen- Let a line L be given on a plane 'fT with a given
tial equations, see [A 1]. Cartesian orthogonal coordinate system Oxy. Using the
concept of coordinates of a point it is possible to intro-
References
[A 1] EHRENPREIS, L.: Fourier analysis in several complex vari- duce the concept of the equation of the line L with
ables, Wiley (Interscience), 1970. respect to system Oxy as an equation of the type
[A2] HORMANDER, L.: An introduction to complex analysis in F(x, y) = 0, which will be satisfied by the coordinates x
several variables, North-Holland, 1973, Sect. 4.5.
and y of any point M on L, but not by those of any
AMS 1980 Subject Classification: 46F15 point which does not lie on L.
The basic idea of the method of coordinates on a
ANALYTIC GEOMETRY - A branch of geometry. plane is that the geometric properties of the line L can
The fundamental concepts of analytic geometry are the be clarified by studying its equation F (x, y) = 0 by ana-
simplest geometric elements (points, straight lines, lytic and algebraic tools. Thus, for instance, the prob-
planes, second-order curves and surfaces). The principal lem of the number of intersection points between a
means of study in analytic geometry are the method of straight line and a circle is reduced to the analytic
coordinates and the methods of elementary algebra. problem of the number of solutions of the set of equa-
The genesis of the method of coordinates is closely tions of the line and of the circle.
linked with the intense development of astronomy, This was in fact the method of study of the proper-
mechanics and technology in the seventeenth century. ties of an ellipse, a hyperbola and a parabola, which are
In his Geometrie, R. Descartes (1637) gave a clear and the lines of intersection between a circular cone and
exhaustive account of this method and of the founda- planes which do not pass through its vertex (cf. Conic
tions of analytic geometry. P. Fermat, a contemporary sections).
of Descartes, was also familiar wih the principles of this The so-called algebraic curves of the first and second
method. Subsequent development of analytic geometry orders are a subject of systematic study in plane ana-
is due to the studies of G. Leibniz, I. Newton and, par- lytic geometry; in Cartesian orthogonal coordinates
ticularly, L. Euler. The tools of analytic geometry were these are described by algebraic equations of the first
utilized by J.L. Lagrange in his construction of analytic and second degree, respectively. The curves of the first
mechanics and by G. Monge in differential geometry. order are rectilinear and, conversely, any straight line is
In our own days, analytic geometry has no significance described by an algebraic equation Ax + By + C = 0 of
as an independent branch of science, but its methods the first degree. The curves of the second order are
are extensively employed in various fields of mathemat- described by equations of the form
ics, mechanics, physics and other sciences. AX2 + Bxy + Cy2 + Dx + Ey + F=O. The principal tech-
The principle of the method of coordinates is as fol- nique in the study and classification of such curves is
lows. Consider, for example, two mutually- to select a Cartesian orthogonal coordinate system in
perpendicular straight lines Ox and 0' in a plane 'fT. which the equation assumes its simplest form, after
These lines, including their direction, the coordinate which the simple equation is studied. Cf. Second-order
origin 0 and the selected scale unit e, form a so-called curve.
Cartesian orthogonal system Oxy of coordinates on the In the analytic geometry of space, the Cartesian
plane. The straight lines Ox and 0' are called the orthogonal coordinates are x, y and z (the abscissa, the
abscissa (axis) and the ordinate (axis) respectively. The ordinate and the applicate), and points M are described
location of any point M in the plane relative to this exactly as in plane analytic geometry. Any given sur-
system Oxy can be determined as follows. Let Mx and face S in space has its own equation F(x,y, z)=O in
My be the projections of M onto Ox and Oy, and let the coordinate system Oxyz, and its geometric proper-
the numbers x and y be the magnitudes of the segments ties are studied by studying its equation by analytic
OMx and OMy (the magnitude x of the segment OMx , and algebraic means. A curve L in space is given as the
for example, is equal to the length of this segment, intersection of two surfaces S \ and S 2. If the respective
taken with the plus sign if the direction from 0 to Mx equations of S \ and S 2 are F\ (x, y, z)=O and
is the same as the direction of the straight line Ox, and F 2 (x, y, z) = 0, these equations, considered together, are
taken with the minus sign in the opposite case). The the equation of the curve L. Thus, a straight line L in
numbers x (the abscissa) and y (the ordinate) are said to space may be considered as the intersection of two

179
ANALYTIC GEOMETRY

planes. In analytic geometry of space a systematic complex or p-adic analytic group, respectively. An
study is made of the so-called algebraic surfaces of the example of an analytic group is the general linear
first and second orders. It was fQund that only planes group GL(n, k) of the vector space k n over k (cf.
are algebraic surfaces of the first order. Surfaces of the Linear classical group) or, more generally, the group of
second order have equations of the type invertible elements of an arbitrary finite-dimensional
Ax2 +By2 +Cz 2+ Dxy + Eyz +Fxz + associative algebra with a unit over k. In general, the
group of k-rational points of an algebraic group,
+Gx+Hy+Mz+N = O. defined over k, is an analytic group. A subgroup of an
The principal method of study and classification of analytic group G which is a submanifold in G is called
such surfaces is to select a Cartesian coordinate system an analytic subgroup; such a subgroup must be closed
in which the equation of the surface assumes its sim- in G. For example, the orthogonal group
plest form, the latter being then studied. Cf. Surface of O(n, k)={gEGL(n, k): tgg=I} is an analytic sub-
the second order. group in GL(n, k). All closed subgroups of a real or p-
adic analytic group are analytic, and each continuous
References
[1] DESCARTES, R.: La geometrie, Leiden, 1637. homomorphism of such groups is analytic (Carlan's
[2] WIELEITNER, H.: Die Geschichte der Mathematik von Descartes theorems, [1]).
bis zum Hiilfte des 19. Jahrhunderts, de Gruyter, 1923. An analytic group is sometimes referred to as a Lie
[3] EFIMOV, N.V.: A short course of analytic geometry, Moscow,
1967 (in Russian).
group [1], but a Lie group is usually understood in the
[4] IL'IN, V.A. and POZNYAK, E.G.: Analytical geometry, Mir, 1984 narrower sense of a real analytic group [2], [3] (cf. Lie
(translated from the Russian). group). Complex and p-adic analytic groups are called,
[5] ALEKSANDROV, P.S.: Lectures on analytical geometry, Moscow,
respectively, complex and p-adic Lie groups.
1968 (in Russian).
[6] POSTNIKOV, M.M.: AnalytiC geometry, Moscow, 1973 (in Rus- The Cartan theorems formulated above signify that
sian). the category of real or p-adic analytic groups is a com-
[7] BAKHVALOV, S.V., MODENOV, P.S. and PARKHOMENKO, A.S.: plete subcategory in the category of locally compact
Collection of problems on analytiC geometry, Moscow, 1964 (in
Russian).
topological groups. The question of the extent to which
E. G. Poznyak these categories differ, i.e. as to when a locally compact
Editorial comments. group G is a real analytic or a p-adic analytic group,
References
can be exhaustively answered: If G is real analytic, it
[A 1] BOL, G.: Elemente der analytischen Geometrie, Vandenhoeck must contain a neighbourhood of the unit without
& Ruprecht, 1948. non-trivial subgroups [5] - [9]; if it is p-adic, it must
[A2] BORSUK, K.: Analytic geometry, PWN, 1969.
[A3] STRUIK, D.J.: Lectures on analytic and projective geometry,
contain a finitely generated open subgroup U which is
Addison-Wesley, 1953. a pro-p-group and whose commutator subgroup is con-
[A4] TODD, I.A.: Projective and analytical geometry, Pitman and tained in the set uP' of p2-th powers of elements in U
Sons, 1947. [10]. In particular, any topological group with a neigh-
AMS 1980 Subject Classification: 51 NXX bourhood of the unit that is homeomorphic to a
Euclidean space (a so-called locally Euclidean topological
ANALYTle GEOMETRY - The theory of analytic group, [4]) is a real analytic group. In other words, if
spaces (cf. Analytic space). The term was introduced continuous local coordinates exist in a topological
by J.P. Serre [I] in analogy with algebraic geometry. group, it follows that analytic local coordinates exist;
References this result is the positive solution of Hilbert's fifih prob-
[I] SERRE, I.-P.: 'Geometrie algebrique et geometrie analytique', lem [5], [11].
Ann. Inst. Fourier (Grenoble) 6 (1955-1956), 1-42. If the characteristic of the field k is zero, the most
A.L. Onishchik important method in the study of analytic groups is the
AMS 1980 Subject Classification: 32BXX, 32CXX study of their Lie algebras (cf. Lie algebra of an
analytic group).
ANALYTIC GROUP - A set G which possesses at the
For infinite-dimensional analytic groups cf. Lie
same time the structure of a topological group and that group, Banach.
of a finite-dimensional analytic manifold (over a field k References
that is complete in some non-trivial norm, cf. Nonn on [I] SERRE, J.-P.: Lie algebras and Lie groups, Benjamin, 1965
(translated from the French).
a field) so that the mapping G X G~G defined by the [2] PONTRYAGIN, L.S.: Topological groups, Princeton Univ. Press,
rule (x, y )~xy -I is analytic. An analytic group is 1958 (translated from the Russian).
always Hausdorff; if k is locally compact, then G is [3] CHEVALLEY, c.: Theory of Lie groups, I, Princeton Univ. Press,
1946 (translated from the French).
locally compact. If k is, respectively, the field of real, [4] HELGASON, S.: Differential geometry and symmetriC spaces,
complex or p-adic numbers, then G is called a real, Academic Press, 1962.

180
ANALYTIC MANIFOLD

[5) Hilbert's problems, Moscow, 1969, pp. 101-115 (in Russian). principal functions see [2].
[6) GLEASON, A.M.: 'Groups without small subgroups', Ann. of
Math. (2) 56, no. 2 (1952), 193-212. References
[7) MONTGOMERY, D. and ZIPPIN, L.: 'Small subgroups for finite [I) MARKUSHEVICH, A.I.: Theory offunctions of a complex variable,
dimensional groups', Ann. of Math. (2) 56, no. 2 (1952), 213- 1, Chelsea, 1977, Chapt. 2 (translated from the Russian).
241. [2) JAHNKE, E. and EMoE, F.: Tables offunctions with formulae and
[8) Y AMABE, H.: 'On the conjecture of Iwasawa and Gleason', curves, Dover, reprint, 1945 (translated from the German).
Ann. of Math. (2) 58, no. 1 (1953),48-54. ED. Solomentsev
[9) YAMABE, H.: 'A generalization of a theorem of Gleason', Ann.
of Math. (2) 58, no. 2 (1953),351-365.
Editorial comments. Instead of the German word
[10) LAZARD, M.: 'Groupes analytiques p-adiques', Publ. Math. 'Landschaft', the English 'landscape' is also used.
IHES 26 (1965). AMS 1980 Subject Classification: 30AXX
[II) KAPLANSKY, I.: Lie algebras and locally compact groups, Chi-
cago Univ. Press, 1971.
VL. Popov ANALYTIC MANIFOLD - A manifold with an ana-
Editorial comments. In Western literature a connected lytic atlas, The structure of an n-dimensional manifold
Lie group is often called an analytic group. M over a complete non-discretely normed field k on a
Cartan's theorems usually go back to J. von Neumann topological space is defined by specifying an analytic
(cf. [A1], [A2]). atlas over k on M, i.e, a collection of charts (cf. Chart)
References with values in k n covering M, any two charts of which
[A1) NEUMANN, J. VON: Collected works, Vol. 1, Pergamon Press, are analytically related. Two atlases are said to define
1961, pp. 134-148. the same structure if their union is an analytic atlas.
[A2) NEUMANN, J. VON: Collected works, Vol. 1, Pergamon Press,
1961, pp. 509-548.
The sheaf ~ of germs of k-valued analytic functions is
defined on an analytic manifold. The class of ringed
AMS 1980 Subject Classification: 10K40, 22A05, spaces (M, ~) which results in this way is identical
22E05, 57SXX, 58A05 with the class of smooth analytic spaces over k,
If k is the field of real numbers R one speaks of
ANALYTIC IMAGE - An extension of the concept of
real-analytic manifolds; if k is the field of complex
a complete analytic function, obtained on considering
numbers C, of complex-analytic or simply complex man-
all possible elements of an analytic function in the form
ifolds; if k is the field of p-adic numbers Qp' of p-adic
of generalized power series (Puiseux series)
analytiC manifolds. Examples of analytic manifolds
00 00
~a
~P
(z-z 0 ),/n '~P
~a z-,/n

(*) include the n-dimensional Euclidean spaces k n , the n-
J)=m
dimensional projective spaces over k, the affine and
Here z is a complex variable, m is an integer and n is a projective algebraic varieties without singular points
natural number. The series converge in the domains over k, and Lie groups and their homogeneous spaces,
1 z -Zo 1 <r and 1 z 1 >r>O, respectively. An analytic The concept of an analytic manifold goes back to B.
image can be identified with the class of all elements of Riemann and F. Klein, but was precisely formulated
the form (*) which are obtained from each other by for the first time by H. Weyl [4] for the case of
analytic continuation. The analytic image differs from Riemann surfaces, i.e. one-dimensional complex mani-
the complete analytic function by the addition of all folds. At present (the 1970's) it is natural to regard
ramified elements of the form (*) with n > 1, which are analytic manifolds as a special case of analytic spaces
obtained by analytic continuation of its regular ele- (cf. Analytic space), which may be roughly described as
ments with n = 1. After the introduction of a suitable 'varieties with singular points', The concept of an ana-
topology, the analytic image is converted to the lytic space was introduced in the 1950's and became
Riemann surface of the given function. the principal subject of the theory of analytic functions;
References many fundamental results obtained for analytic mani-
[I) MARKUSHEVICH, A. I. : Theory of functions of a complex variable, folds could be successfully applied to the non-smooth
3, Chelsea, 1977, Chapt. 8 (translated from the Russian). case. For an account of the general properties of ana-
E.D. Solomentsev lytic manifolds over an arbitrary field see [3].
AMS 1980 Subject Classification: 30840 There is a close relationship between the theories of
real-analytic and differentiable manifolds (cf, Differen-
ANALYTIC 'LANDSCHAFr', analytic relief, modulus tiable manifold), and also between the theories of real-
surface - The geometric image of the modulus f (z) 1 1 analytic and complex-analytic manifolds, Clearly, the
of an analytic function f(z), z=x+ry. The analytic natural structure of a manifold of class Coo is defined
'landschaft' of a function f (z) is the surface over the on each real-analytic manifold. It was shown by H.
(x, y)-plane with z-coordinate 1f (z) I. An analytic Whitney in 1936 that the converse proposition is also
'landschaft' sometimes provides a good illustration of true: It is possible to define on any paracompact mani-
the behaviour of specific functions. For the relief of the fold of class Coo an analytic structure over R which

181
ANAL YTIC MANIFOLD

induces the initial smooth structure. It follows from [A3] WELLS, R.O.: Differential analysis on complex manifolds,
Grauert's theorem on the imbeddability of a paracom- Springer, 1980.
[A4] lIIRzEBRUCH, F. and KODAIRA, K.: 'On the complex projec-
pact analytic manifold over R in. a Euclidean space that tive spaces', J. Math. Pures Appl. 36 (1957), 201-216.
this analytic structure is unambiguously defined up to [A5] YAU, S.T.: 'Calabi's conjecture and some new results in
an isomorphism (not necessarily the identity) [2]. algebraic geometry', Proc. Nat. Acad. Sci. USA 74 (1977),
1789-1799.
A natural structure of a real-analytic manifold (of
[A61 I'ETERNELL, T.: 'A rigidity theorem for P3 (C)', Manuscripta
double dimension) is defined on all complex manifolds Math. 50 (1985),397-428.
M. The answer to the converse problem - viz. whether
AMS 1980 Subject Classification: 32C05, 58A07
a complex structure on a given real-analytic manifold
exists and whether it is unique - has been given in the
ANALYTIC MAPPING, analytic morphism - A mor-
simplest cases only. Thus, if M is a connected two-
phism of analytic spaces considered as ringed spaces
dimensional real-analytic manifold, then a necessary
(cf. Analytic space; Ringed space). An analytic map-
and sufficient condition for the existence of a complex
ping of a space (X, (!) x) into a space (X, (!) y) is a pair
structure on M is paracompactness and orientability,
(jO,fl), where
while the problem of classification of these structures is Io:X ~ Y
identical with the classical moduli problem for is a continuous mapping, while
Riemann surfaces (cf. Moduli of a Riemann surface).
II : 10 I ( (!) y) ~ (!) x
There is a classification of compact analytic surfaces
(i.e. of two-dimensional complex manifolds, cf. Analytic is a homomorphism of sheaves of rings on X. If the
surface), which gives a partial answer to the above spaces are complex, an analytic mapping is also called
problem for four-dimensional real-analytic manifolds. a holomorphic mapping.
On the other hand it is possible, using topological If (X, (!) x) and (Y, (!) y) are reduced analytic spaces,
methods, to identify classes of real manifolds that do the homomorphism f1 is completely determined by the
not permit almost-complex or, a fortiori, complex struc- mapping fo and is the inverse mapping of the germs of
tures. Such manifolds include the spheres S2k for functions corresponding to fo. Thus, in this case an
k=f:l, 3. A description of complex structures which are analytic mapping is a mapping f: X ~ Y such that for
sufficiently near to a given complex structure is given any x EX and for any CPE (!) f(x) one has cpof E (!) x.
by the theory of deformations of analytic structures (cf. A fibre of an analytic mapping
Defonnation), in which an important role is played by 1= VoJd: (X, (!) x) ~ (Y, (!) y)
Banach analytic manifolds - infinite-dimensional
at a point y E Y is the analytic subspace
analogues of analytic manifolds.
References
F'(y) = Vol (y), (!) x / II(my )(!) x 1!~I(y)
[11 BOURBAKI, N.: Elements of mathematics. Differentiable and of the space (X, (!) x), where my E (!) y is the sheaf of
analytic manifolds, Addison-Wesley, 1966 (translated from the germs of functions that vanish at the point y. Putting
French).
[21 NARASIMHAN, R.: Analysis on real and complex manifolds, d(x) = dimxF'Vo(x)), XEX,
Springer, 1971.
[31 SERRE, J.-P.: Lie algebras and Lie groups, Benjamin, 1965 one obtains the inequality
(translated from the French).
dimx X ~ dimfo(x) Y +d(x). (*)
[41 WEYL, H.: Die Idee der Riemannschen Fiache, Teubner, 1955.
A.L. Onishchik If X and Yare reduced complex spaces, then the set
Editorial comments. A much related basic problem in Xl = {XEX: d(x)~l}
complex analysis is the question whether there are any
complex structures on projective space besides the usual is analytic in X for any r~o.
one (and inducing the same topology). For n = 1 this is An analytic mapping f=(jo,fd is called flat at a
very classical (all Riemann surfaces of genus zero are iso- point x EX if (!} X.x is a flat module over the ring
morphic to Pb). For n =2, uniqueness of the complex (!} Y,fo(x)' In such a case (*) becomes an equality. An
structure follows from combined work of F. Hirzebruch, K. analytic mapping is called flat if it is flat at all points
Kodaira [A4], and S.T. Yau [AS]. For n =3 one has that a x EX. A flat analytic mapping of complex spaces is
compact manifold that is bimeromorphically equivalent to a open. Conversely, if fo is open, Y is smooth and all
Kahler manifold and that is also topologically q is analyti- fibres are reduced, then f is a flat analytic mapping.
cally isomorphic to q [A6].
The set of points of a complex or a rigid analytic space
References X at which an analytic mapping f is not flat is analytic
[A 11 WHITNEY, H.: Complex analytic varieties, Addison-Wesley,
1972.
in X. If X and Yare reduced complex spaces, while X
[A2] BARTH, W., PETERS, C. and YEN. A. VAN DER: Compact com- has a countable base, then Y contains a dense
plex surfaces, Springer, 1984. everywhere-open set over which f is a flat analytic map-

182
ANALYTIC MODEL OF A LANGUAGE

ping. If an analytic mapping In the most fully developed analytic models of


j:(X, ll1x)~(Y, ll1 y) languages, the set of initial data is usually an object
which models a set of grammatical sentences of a
of complex spaces is flat, then the set of y E Y at which
the fibre r
1 (y) is not reduced or normal is analytic in
natural language, viz. some fonnal language in a given
alphabet (vocabulary) V. If L is a language in the voca-
(X, ~ x). bulary V and 'f/u, v E V [uxv EL=*uyv EL], then one
Let f: X ~ Y be an analytic mapping of reduced says that the string x can be substituted by the string y
complex spaces. If dim X < 00, then there exists a with respect to V and L; if each one the strings x and y
stratification can be substituted by the other with respect to V and
o = X( -1) C; X(O) C; ... C; X(r;) C; ... , L, then one says that x and y can be mutually substi-
where X(r) are analytic sets and X (r) = X for large r, tuted with respect to V and L. The concept of mutual
with the following property: Any point substitutability has a simple linguistic meaning: If L is
xEX(r)\X(r-l) has a neighbourhood U in X such understood to be the set of grammatical sentences of
that f(U nX(r is a local analytic set in Y, all irredu- some natural language, the mutually substitutable sets
are 'syntactically equivalent' (i.e. fulfilling the same
cible components of germs of which have dimension r
at f(x). If fis proper, then f(X) is an analytic set in syntactic functions) word combinations. In particular,
X. This is a particular case of the finiteness theorem for if a single-symbol string a (in linguistic interpretation:
analytic mappings. a word) is mutually substitutable with a string x of
Let X, Y be complex spaces and let X be compact. length> 1, x is a 'potential constituent', i.e. may be a
Then it is possible to endow the set Mor (X, Y) of all constituent of the linguistically natural constituent
analytic mappings f: X ~ Y with the structure of a structures of grammatical sentences of this language (cf.
complex space such that the mapping Syntactic structure); in such a case the string x is called
a first rank configuration of the language L with resultant
Mor(X, Y)XX ~ Y,
a. Thus, in the English language, the string 'uniformly
which maps the pair (j, x) into f (x), is analytic. In continuous' can be regarded as a first rank configura-
particular, the group of automorphisms of a compact tion with resultant 'continuous'. However, first rank
complex space X is a complex Lie group, acting analyt- configurations do not exhaust all 'potential consti-
ically on X. tuents'; for example, the word combination 'continuous
References function' is not a first rank configuration, since only
[I] REMMERT, R.: 'Projektionen analytischer Mengen', Math. Ann. words such as 'function', 'derivative', etc., can be sub-
130 (1956), 410-441. stituted by it, but it cannot itself be substituted' by any
[2] REMMERT, R.: 'Holomorphe und meromorphe Abbildungen
of these words ('f (x) is a uniformly continuous func-
komplexer Rliume', Math. Ann. 133 (1957), 328-370.
[3] STEIN, K.: Colloquium for topology, Strasbourg, 1954. tion' is a correct statement, but 'f (x) is a uniform
[4] FRISCH, J.: 'Points de plattitude d'une morphisme d'espaces function' is not). Therefore one introduces the following
analytiques complexes', Invent. Math. 4 (1967), 118-138. definition: If r> 1 is a natural number and if the con-
[5] FISHER, G.: Complex analytic geometry, Springer, 1976.
cept of a rank i configuration of the language L has
D.A. Ponomarev
been defined for each i = 1, ... , r -1, then a string x
AMS 1980 Subject Classification: 58A07, 32HXX of length > 1 is called a rank r configuration of the
language L with resultant a, where a E V, if the following
ANALYTIC MODEL OF A LANGUAGE, analyzing condition holds: If a can be substituted by x with
model of a language - A type of mathematical construc- respect to V and L, if z 1XZ 2 ELand if z 1XZ 2 does not
tion used in mathematical linguistics to describe the contain occurances of configurations of ranks lower
structure of natural languages. These constructions are than r overlapping the considered x but not entirely
used in the formal modelling of basic linguistic contained in it, then z I az 2 EL. In English, the word
categories and of the process of linguistic research combination 'continuous function' can be regarded as a
itself; in other words, certain sets of 'disordered' data second rank configuration, the word 'function' being
about the language (or, more exactly, about speech) are the resultant (the word 'derivative', for example, is also
used to deduce certain information about the structural a resultant). It may be shown that, in a certain sense, a
mechanism of the language, i.e. about its grammar in language is fully determined by the set of its configura-
the wider sense of this word. The 'functioning' of such tions.
a model does not always have the character of an effec- The configurational model belongs to the so-called
tive construction, since the set of initial data need not syntagmatic analytic models of a language, which are
be a constructive object; in principle, this does not intended to describe the relations between the elements
render such models inferior. of segments of speech (in linguistics such relations are

183
ANALYTIC MODEL OF A LANGUAGE

called syntagmatic). Another class of analytic models struction of analytic models of a language is usually
of a language are paradigmatic models, intended to fairly simple. In some models it merely involves the
describe paradigmatic relations, i.e. relations between simplest concepts of set theory (such models are fre-
the elements of a language in its system. In such quently called set-theoretic, and this term is sometimes
models the usual procedure is to construct certain rela- extended to cover all analytic models of a language). In
tions on the vocabulary, which are often (but not other cases algebraic concepts are employed, in particu-
always) equivalence relations. In particular, paradig- lar those of the theory of semi-groups and of the alge-
matic analytic models of a language serve to construct bra of binary relations; this is why the theory of ana-
formal analogues of traditional linguistic categories lytic models of a language is sometimes referred to as
such as parts of speech, case, gender, phoneme, etc. A algebraiC linguistics.
set of grammatical sentences also serves as the 'starting
References
material' in many paradigmatic analytic models of a [1] KULAGINA, O.S.: 'On a means of defining grammatical con-
language; the concept of substitutability is employed in cepts on the base of set theory', Problemy Kibemet. 1 (1958),
several models of this type. The simplest equivalence 203-214 (in Russian).
[2] USPENSKIi, V.A.: 'On the Kolmogorov definition of case',
relation obtained in this way is the mutual substituta- Byull. Obshch. Problemy Mashinnogo Perevoria, no. 5 (1957),
bility relation on the vocabulary (i.e. on the set of all 11-18 (in Russian).
one-element strings); the equivalence classes induced by [3] USPENSKIi, V.A.: 'A model for the concept of phoneme',
Voprosy Yazykoznaniya, no. 6 (1964), 39-53 (in Russian).
this relation are called families. If one introduces an [4] MARKus, S.: Algebraic linguistics: analytical models, Acad.
additional relation on the vocabulary, consisting of 'to Press, 1967.
be different forms of the same word' (more precisely, of [5] REvzIN, I.I.: Modelling and typologizing Slavic languages, Mos-
cow, 1967 (in Russian).
one lexeme; this is, e.g., the relation between 'to limit'
[6] GLADICU, A.V. [A.V. GLADIaij and MEL'i':UK, LA. [IA
and 'limiting' or between 'number' and 'numbers'; it MEL'CHUK]: Elements of mathematical linguistics, Mouton
involves a certain idealization in that it is assumed that Pub!., 1983.
a word may be the form of one lexeme only; the [7] GLADKIi, A.V.: Formal grammars and languages, Moscow, 1973
(in Russian).
respective equivalence classes are called neighbour- A. V. Gladkit
hoods), then these two relations may be used to intro- AMS 1980 Subject Classification: 68FXX
duce certain other classifications, which may be
regarded as approximations to the formal analogues of ANALYTIC NUMBER THEORY - A branch of
parts of speech and other traditional grammatical con- number theory. Analytic number theory deals with the
cepts, in particular the concepts of case and gender of problems of distribution of primes, studies the
nouns. These last two concepts are particularly inten- behaviour of number-theoretic functions, and the
sively studied; a number of models based on the con- theory of algebraic and transcendental numbers.
cept of grammaticality as well as other types of models
have been suggested for their formalization. In one Distribution of prime numbers. a) The problem of the
model, in particular, each case of nouns is treated as a distribution of prime numbers (primes) is one of the
set of 'uniformly directed' forms of nouns, while each most interesting and most difficult problems in analytic
gender is treated as a set of 'uniformly directing' nouns. number theory. The first result on the problem of the
The initial data in this model, which serve to determine distribution of primes was obtained by Euclid: There
the gender, are the vocabulary, the set of neighbour- are infinitely many primes. Let ?T(x) be the number of
hoods, the set of nouns and the binary relation of primes not exceeding x. Euclid's theorem can then be
'potential subordination' in the vocabulary: a poten- formulated as follows: ?T(x)~+ 00 as x~oo. The next
tially subordinates b if, in some 'linguistically natural' step was due to P.L. Chebyshev (1850), who proved
dependency structure (cf. Syntactic structure) of a that:
grammatical sentence in this language, some occurrence 1) The quantity ?T(x) satisfies the inequalities
of b (immediately) depends on some occurrence of a; ax bx
-I-
nx
< 7T(X) < -Inx
-,
additional initial data of the same nature are added to
determine the gender. Models based on 'potential where a~(ln2)j2; b~2In2.
subordination' or other concepts related to dependency 2) If there exists a limit of ?T(x)ln(x)j x as x~+oo,
structures apparently permit a better formalization of this limit is one.
traditional grammatical categories than do models The problem of the existence of this limit was solved
involving strings only, since in the former .models syn- in 1896 by 1. Hadamard and by Ch.1. de la Vallee-
tactic and linear relations are separated (cf. Grammar, Poussin, who established thereby that
transformational). X
7T(X) -(I + 0(1)).
= -Inx
The mathematical apparatus employed in the con-

184
ANALYTIC NUMBER THEORY

De la Vallee-Poussin demonstrated a much more gen- with the function N(a, T), which is the number of zeros
eral assertion, viz. if of t(s) in the rectangle 1/2";;a..;;0'..;;1, It I..;;T. The
x d function N(a, T) is in turn closely connected with the
7T(X) = [ Inll + R(x), function r(1 /2 + it). The existing hypotheses are:
N(a, T)=0(T 2(1-a)+<) (the density hypothesis) and
then
R(x) = O(xe-aVm-;), r(1 /2 + it)= O(t<) (the Lindelof hypothesis), where >0
may be as small as one pleases. Lindelofs hypothesis
where a>O is an absolute constant (cf. de la Vallee-
follows from the Riemann hypothesis on t(s); the den-
Poussin theorem). This problem was solved by methods
sity hypothesis follows from Lindelofs hypothesis, and
of the theory of functions of a complex variable. The
it follows from the density hypothesis that
problem of estimating R (x) is closely connected with
I:1n = O(n 1/2+<). It was proved that I:1n = O(nY), where
the problem on the behaviour of a certain function of a
y<1.
complex variable, which was first (1859) studied by B.
c) The problem of the distribution of primes in arith-
Riemann and which is now known as Riemann's zeta-
metical progressions nk + I, (k, 1)= 1, n =0, 1, ... ,
function. This function is defined by
leads to the problem on the zeros of special zeta-
res) = 1- 5+2- S + .,. +n- S + "', s=o+it, 0>1. functions - the so-called Dirichlet L-series, of the
Even earlier (1737, 1749) the zeta-function with real s- form
all-s+a22-5+ ... +ann- s + ...
values was studied by L. Euler, who demonstrated the
identity specifying the connection between r(s) and for 0'> 1, where an depends on n and on the difference
prime numbers: k of the series (Dirichlet characters modulo k).
res) = (1-2- )-I(1-3- )-1
S S '" (l_p-s)-I " ' , 0>1, The problem on the distribution of zeros of Dirichlet
L-series and the distribution of primes in arithmetical
where the product is taken over all prime numbers. The progressions have their own specific features. One of
function t(s), which is given in the form of a series for the principal achievements in this field, due to c.L.
0'> 1, can be analytically extended to the entire plane Siegel (1935), is as follows: Let 7T(X, k, I) be the number
of the complex variable; the resulting function is of primes not exceeding x in the series nk + I,
analytic in the entire complex plane except at the point n =0, 1, .... Then
s= 1, where it has a simple pole with residue one. The
problem of estimating the remainder R (x) in the 7T(X, k, /) -- I (dt
CP(kH TrV + O(x In -A x,)
asymptotic formula for the distribution of primes is
closely connected with the problem of the distribution where <P(k) is the Euler function and l..;;k..;;In B x,
of the zeros of t(s) in the 'critical' strip 0..;;0''';; 1. where A and B are arbitrary fixed numbers. Informa-
According to the Riemann hypothesis, all the zeros of tion about the distribution of prime numbers in arith-
t(s) in the critical strip lie on the straight line 0' = 1 / 2. metical progressions is extensively used in solving addi-
It follows from this hypothesis that R (x) = O( Vx In x). tive problems involving prime numbers. See also
Conversely, the Riemann hypothesis on the zeros of Distribution of prime numbers.
t(s) follows from the relation R (x) = O(x 1/2+<), where
Additive problems. Additive problems in analytic
>0 may be as small as one pleases (cf. Riemann
number theory include problems involving a special
hypotheses). Hadamard and de la Vallee-Poussin
type of integer equations. The main problems of this
obtained the asymptotic law for the distribution of
kind are: To prove the solvability of a given equation
primes by proving that t(s) cannot have any zeros if
and to find an asymptotic formula for the number of
O'~ 1. So-called ~-theorems were proved for R (x):
solutions of a given equation. The latter problem is
There exist two sequences X ~ + 00, y ~ + 00 such that
much more difficult and, in a certain sense, its positive
R(X) < _XI/2 InlnlnX, solution constitutes an answer to the first problem.
hllnX
Classical examples of additive problems are the Waring
R(Y) > yl/2 I~ln~n/. problem, the Goldbach problem and the
Hardy - Littlewood problem.
b) A second problem on the distribution of primes is Waring's problem (1770) can be formulated as fol-
to estimate the difference between two successive lows: Let J k n(N) be the number of solutions in positive
primes, i.e. the number I:1n = Pn + 1 - Pn, where pn is the integers XI, . , Xn of the equation
n-th prime. Here too, the first general result is due to
Chebyshev, who showed that at least one prime number x7 + ... +xZ = N, (1)
is contained between Nand 2N, where N~ 1 (the Ber- where N ~ 1 is an integer. It is required to prove that
trand postulate). The estimate of I:1n is closely connected there exists a number ko =ko(n), where ko depends

185
ANAL YTIC NUMBER THEORY

only on n, such that Jk,n(N)~l if k~ko. In other Behaviour of number-theor~tic functions. The theory of
words, it is required to prove that any number N ~ 1 numbers comprises several classical functions: .p(n) -
can be represented by a sum of n-th powers of positive the number of numbers not exceeding n and mutually
integers, the number of terms in this representation prime with n (the Euler function); T(n) - the number
depending only on n. J.L. Lagrange (1770) solved this of divisors of n; /LCn) - the Mobius function; A(n) -
problem for n = 2 by showing that any positive integer the Mangoldt function, etc. Even though all these func-
is the sum of four squares of integers. D. Hilbert (1909) tions behave fairly 'irregularly', their average values can
was the first to give a general solution to Waring's properly be studied. The average value of a function
problem. G.H. Hardy and J.E. Littlewood (1924) used f (n) is understood to mean the value of L.n,,;;,J
(n) / x.
their circle method to prove that for hn(N), if The problem of estimating the average value of J.L(n) is
k~n2n + 1, an asymptotic formula of the form
equivalent to the problem of bounds on the zeros of the
Jk,n(N) = ANk/n-l +O(Nk/n-1-y) (2) Riemann zeta-function. The problem of the asymptotic
behaviour of the average value of A(n) is equivalent to
is true, where y>O and A =A (N)~c>O; c is an abso-
the problem of the asymptotic formula for '/T(x), i.e.
lute constant. And since there is an infinitely large
again to the problem of bounds on the zeros of the
number of numbers N which, for k = n, are not the sum
Riemann zeta-function. The results obtained on all
of n-th powers, i.e. h,n(N)=O, there arose the problem
these problems are the same as those obtained in the
of establishing the true order of k, as a function of n,
problem of the distribution of prime numbers. This
for which equation (1) is solvable and formula (2) is
does not apply to the problem of the asymptotic
valid. The strongest results on this problem were
behaviour of the average value of 'T(n) or, to put it in a
obtained by I.M. Vinogradov (1934) who showed that:
somewhat different manner, the problem of the asymp-
a) h,n(N)~1 for N~No(n) if k~3n Inn + lln;
totic formula for the sum of the values of 'T(n). Let
b) formula (2) is true for k~4n2lnn.
A second classical additive problem - the <II(N) = 7'(1)+ .. , +7'{N),
Goldbach - Euler problem (1742) - is as follows: Let
P(N) is then the number of integer points below the
J(N) be the number of solutions in primesPI,P2,P3 of
hyperbola y = N / x. Thus, to find the asymptotic
the equation P 1 +P2 +P3 =N; prove that for odd N~7,
behaviour of P(N) amounts to finding the asymptotic
J (N)~ 1. It was shown in 1937 by Vinogradov that (an
behaviour of the number of integer points in expanding
asymptotic formula for J(N:
domains. This class of problems also includes the prob-
J(N) = BN 2 1n- 3 N +O(N 2 In- 35h N), lem of the integer points in a disc, i.e. the problem of
where B = B(N)~0.6. It follows, in particular, that determining the number
J (N) ~ 1 for N ~ No, i.e. a solution of the G(N) = 2 ~ 1,
Goldbach - Euler problem for sufficiently large N. x +y .;;;N

Additive problems also include the where x and yare integers, and of extending these
Hardy- Littlewood problem (1923): Each N~2 can be problems to arbitrary domains, both in a plane and in
represented in the form N=p+x 2 +y2, where p is a space. It was shown by P. Dirichlet in 1849 that
prime number and x and yare positive integers. Yu.V. <II(N) = N In N +(2y+ I)N + R 1(N),
Linnik proved in 1958 that if WeN) is the number of
where R 1(N) = OeVN); it was also shown by C.F.
solutions of this equation, the asymptotic formula

I:: +
Gauss in 1863 that
WeN) = O(N In -1028 N), G(N) = 'TTN + RiN),
where a=a(N)~cI >0 and CI is an absolute constant, where R 2 (N) = ocvii). The problems of finding the
is true, It follows that W(N)~ 1 if N~N 0, which is the best possible estimates for R I (N) and R2(N) are
solution of the Hardy- Littlewood problem for suffi- known as divisor problems and the circle problem.
ciently large N. There are many additive problems respectively, G.F. VoronoI (\ 903) obtained the formula
which have remained unsolved for hundreds or even Rl(N) = O(NI3InN),
thousands of years, These include, for example, the
while the formula found by W. Sierpiilski (1906) was
problem on the infinite number of twin primes, i,e.
pairs of primes P and q such that I p - q I = 2, the R 2 (N) = O(N1 1).

binary problem of Goldbach - Euler, viz. that any even In addition, the Q-theorems:
number ~4 is a sum of two primes; and "the problem
on the existence of an infinite number of primes in the RI(N) = [2(NI/4) and R 2 (N) = [2(NI/4)
sequence of the form n 2 + 1. See also Additive problems, have also been demonstrated. The estimates of R I (N)

186
ANAL YTiC NUMBER THEORY

and R2(N) obtained in our own days (1976) are some- where ,,=n, c =c(o:O is a constant which depends
what better than those of Vorono! and Sierpiilski. only on a, while p and q are arbitrary integers. The
A problem related to those just discussed is the next decisive step in this problem was made by A.
asymptotic behaviour of the sum of the fractional parts Thue (1909) whose ideas exerted a major influence on
of various types of functions or an equivalent problem, the theory of transcendental numbers. He showed that
viz. the problem of the distribution of the fractional ,,=n /2+, >0. Subsequently, the value of " was
parts of various types of functions. Let {a} be the frac- reduced by several workers, and it was shown by K.F.
tional part of the number a. Then, if F(x) is a real Roth in 1955 that ,,=2+ (it is known that ,,;;;;'2). A
function, the problem concerns the asymptotic drawback of all these theorems (except for Liouville's
behaviour of the two functions theorem) consists in the fact that they are ineffective,
T](N) = ~ {F(n)}, i.e. c .;annot be computed from known 0: and .
n<O;,N Approximation problems are connected with a cer-
~ I. tain class of problems in the theory of indefinite equa-
tions. E.g., Thue employed his approximation theorem
If for any O<y';;;; I to prove that the number of integral solutions of the
equation Fn(x,y)=a, where Fn(x,y) is a form with
T 2 (y; N) = "IN +o(N), integral coefficients of degree n;;;;.3 and a is a non-zero
one says that the fractional parts of F(x) are uniformly integer, is finite (this theorem is also ineffective, i.e. no
distributed. The uniform distribution of the fractional bound on the number of solutions of the equation can
parts of F(n) can also be expressed in terms of the be given).
asymptotic behaviour of T I (N). The first results on the Another approach employed in this theory are tran-
uniform distribution of the fractional parts of polyno- scendence proofs of numbers. The first results on this
mials, and criteria for uniform distribution, were found subject were obtained towards the end of the
by H. Weyl in 1916. The most accurate results in this nineteenth century by Ch. Hermite (1873), who proved
field were obtained by Vinogradov, who also found that the number e is transcendental, and by F. Lin-
asymptotic formulas for Tl (N) and T2(N) in cases in demann (1882), who proved that the number 7T is tran-
which n runs through a part of the set of integers not scendental and thus gave a negative solution to the
larger than N, in particular through the set of prime problem of squaring the circle. According to the
numbers. Not much is known about the distribution of theorem of Gel'fond - Schneider (1934), a P is a tran-
the fractional parts of functions which grow faster than scendental number if 0: is an algebraic number, 0:=f=0, 1
polynomials. For instance, nothing is known about the and f3 is an algebraic number of degree ;;;;.2 (Hilbert's
distribution of the fractional parts of the function seventh problem). Beginning in 1967, A. Baker
(3/2Y obtained a number of effective theorems on the estima-
Algebraic and transcendental numbers. The theory of tion of linear forms in logarithms of algebraic numbers.
algebraic and transcendental numbers includes prob- As a result, Thue's theorem on the number of represen-
lems connected with the arithmetical nature of given tations of an integer by a form was effectively demon-
numbers or classes of numbers. Consider polynomials strated. There are still many unsolved problems in the
with integer coefficients and with leading coefficient theory of transcendental numbers; these include the
one; if a is a root of such a polynomial of degree nand transcendental nature of Euler's constant
is not a root of a polynomial of a lower degree, it is
said to be an algebraic number of degree n; if n = 1, the "I = lim
n--+oo
[Inn -1 _1--
2
... _1-],
n
number a is said to be rational. If, on the other hand, 0:
is not an algebraic number, it is said to be transcenden- the problem of algebraic independence of the numbers
tal. There are 'much fewer' algebraic than transcenden- e and 7T, etc.
tal numbers; 'almost any' number is transcendental, Some methods used in analytic number theory. a) The
but determining whether a specific number is transcen- method of complex integration. This method grew out of
dental or algebraic is very difficult. The principal Euler's method of generating functions which is often
'characteristic' of algebraic numbers is the fact that used for the solution of problems in elementary
they are very 'poorly' approximated to by rational mathematics. The method is based on the following for-
numbers. This assertion (Liouville's theorem, cf. mula (a discontinuous multiplier):
Liouville theorems, 1844) is formulated as follows: If 0:
if x>l,
is an algebraic number of degree n, then
_1 f E.ds = if X = 1,
Ia-~ I >
27Ti Res =u>O S 2
cq-., o ifO<x<l,

187
ANAL YTIC NUMBER THEORY

where the integral is taken over the straight line S = ~ e 27r;F(x" ... x,), (3)
Res =a>O, s =a+it. E.g., for Res> 1, XIo'" ,x,.

f(s) = -tf};) = n~IA(n)n-s, where


x], . .
is a real function, while
F(x], ... , x n )
run through a set of P integers. Thus, the
,Xn
essence of many problems is the study of such sums, in
and for x =N + 1 /2 one obtains particular the problem of finding the most acurate pos-
~ A(n) = _1_. f tiSil ds. sible estimate of the modulus of such a sum. A trivial
n';;;x 2Wl Res=2 S
estimate of the sum (3) is P. The problem is to obtain
The left-hand side of this equation is the Chebyshev an estimate of the form
function, finding the asymptotic behaviour of which is I S I ".; ; tu>,
equivalent to the problem of prime numbers. The where O",;;;;~",;;;; 1 is called the reducing factor. The first
right-hand side, after the separation of the main factor, non-trivial estimates of trigonometric sums for the case
will be the smaller, the farther to the left the integra- when F=F(x) is a polynomial, while x=l, ... ,P,
tion contour can be shifted. (Cf. Complex integration, were obtained in 1919 by Weyl, who at the same time
method of.) deduced a criterion for the uniform distribution of the
b) The circle method (of fractional parts of functions in terms of trigonometric
Hardy - Litlewood - Ramanujan). This method is used sums. The method of trigonometric sums was created
mostly in additive problems. Below the application and by Vinogradov who, by utilizing intrinsic arithmetical
the nature of the circle method is considered in the properties of such sums, obtained very strong estimates
example of applying Vinogradov's trigonometric sums of the modulus for a broad class of these sums. He was
to the ternary problem of Goldbach - Euler. Let m be able to obtain in this way several fundamental results,
an integer. One then has (a discontinuous multiplier):
I
f e27r;am da = {I if m=O,
if m=f=O.
close to the limit of the possible, in several problems of
number theory (the Waring
Hilbert- Kamke problem, and Weyl sums, cf. Weyl
problem, the
o
Therefore sum). Another consequence of Vinogradov's method
I (1937) was the solution of several additive problems on
leN) = fS3(a)e2'1TtaN da, Sea) = ~ e'1T;XP, prime numbers, in particular the Euler - Goldbach
o p,;;;N problem. The fundamental idea in Vinogradov's
and I (N) is the number of solutions in primes of the method was 'smoothing' (exponentiation of the tri-
equation PI +P2 +P 3 = N. The integration interval is gonometric sum and reduction of the estimate to the
then subdivided into two parts - the principal and the theorem of averages for the estimation of Weyl sums,
complementary interval; the principal interval contains the introduction of double trigonometric sums for the
all the intervals of the form estimation of sums of primes, etc.). (See also Tri-
gonometric sums, method of.)
!a-E-!
q
< _I,
q' d) The dispersion method and the method oj the large
sieve. Linnik in 1958 - 1960 proposed a dispersion
where (a, q)= 1, O",;;;;a",;;;;q, q",;;;;ln lO N, T=N In- 2o N; all method for the solution of several additive problems in
other intervals are within the complementary interval. number theory. He solved the Hardy- Littlewood
The principal intervals do not intersect. Moreover, for problem, the Titchmarsh problem on divisors and the
a in the principal interval, the sum Sea) is 'close' to the
additive divisor problem. The basic concept in the
rational sum S(a / q). However, for 'small' q, q",;;;;lnN, method is the dispersion of the number of solutions of
the distribution law of the prime numbers in a series a given auxiliary equation related to the principal equa-
with difference q is known (e.g. Siegel's theorem), i.e. tion (see also Dispersion method). Fundamental results
the asymptotic behaviour of the sums S(a / q) is were recently obtained by the method of the large sieve
known. In this way the principal term of the problem is proposed by Linnik in 1940 for the solution of the
separated, which is the fundamental idea of the circle problem of the smallest quadratic residue.
method. If one now makes a non-trivial estimate of e) Methods in algebraic and transcendental number
I S(a) I on the complementary intervals (d. Vinogradov theory. In order to prove his theorem on the approxi-
method), the asymptotic formula in the mation of an algebraic number by a rational fraction,
Goldbach - Euler problem is obtained. (See also Circle Thue (cf. Thue method) constructed the polynomial
method.)
c) The method oj trigonometric sums. Most problems /(x.y) = (y -a)/l (x.y)+(x -a)"h(x, a)
in analytic number theory can be expressed by tri- with integer coefficients, where JI and 12 are also poly-
gonometric sums, i.e. finite sums of the form nomials. On the assumption that a is 'well' approxi-

188
ANALYTIC POLYHEDRON

mated by fractions PI / ql and P2 / q2 with sufficiently References


large ql and q2, putting m~lnql /lnq2, and proving [A1] VAUGHAN, R.c.: The Hardy-Littlewood method, Cambridge
Univ. Press, 1981.
that f(x, y) does not vanish if x =p I / ql, Y =P2 / q2, a [A2] HALBERSTAMM, H. and RICHERT, H.E.: Sieve methods, Acad.
contradiction is obtained. Press, 1974.
A.O. Gel'fond's proof of the transcendental nature of [A3] IVIt, A.: The Riemann zeta-function, Wiley, 1985.
numbers involves the function AMS 1980 Subject Classification: 10-XX
Ina
f(z) = qo ~
~ 2,. ck./e(>.k+I)z, A= --. ANALYTIC OPERATOR at a point Xo - An operator
k=OI=O Inf3
A, acting from one Banach space into another, that
On the assumption that a is an algebraic number and admits a representation of the form
using Dirichlet's box principle, non-zero integers Ck,l 00

are chosen so that f (z) and 'many' of its derivatives A(xo +h)-Axo = ~ Ckhk,
k=l
have 'many' zeros. A 'large' number of zeros will yield
where Ck is a form of degree k and the series converges
'good' estimates from above for a 'large' number of
uniformly in some ball II h II <r. An operator is called
derivatives and points, whence, using the estimates
analytic in a domain G if it is an analytic operator at all
from below obtained by Liouville's theorem, it follows
points of this domain. An analytic operator is infinitely
that f (z) and 'many' of its derivatives have more zeros
differentiable. In the case of complex spaces, analyti-
than before. A repetition of this process results either
in A being a rational number or in zero values for ck,l, city of an operator in a domain is a consequence of its
which is in contradiction with their choice. See also differentiability (according to Gateaux) at each point of
Algebraic number; Transcendental number. this domain. Examples of analytic operators are
Lyapunov's integro-power series, and the Hammerstein
References and Urysohn operators with smooth kernels on the
[1] VINOGRADOV, I.M.: Selected works, Springer, 1985 (translated
from the Russian). space C of continuous functions.
[2] VINOGRADOV, I.M.: Elements of number theory, Dover, reprint, References
1954 (translated from the Russian). [I] HILLE, E. and PHILLIPS, R.: Functional analysis and semi-
[3] VINOGRADOV, I.M.: The method of trigonometric sums in the groups, Arner. Math. Soc., 1957.
theory of numbers, Interscience, 1954 (translated from the Rus- [2] KRASNOSEL'sKIi, M.A., ET. AL: Approximate solution of operator
sian). equations, Woiters-Noordhoff, 1972 (translated from the Rus-
[4] GEL'FOND, A.O.: Transcendental and algebraic numbers, Dover, sian).
reprint, 1960 (translated from the Russian). P. P. Zabrelko
[5] DELONE, B.N.: The Peterburger school in number theory,
Moscow-Leningrad, 1947 (in Russian). AMS 1980 Subject Classification: 46G20, 58C10,
[6] KARATSUBA, A.A.: Fundamentals of analytic number theory, 58C20, 47HXX
Moscow, 1975 (in Russian).
[7] LINNIK, YU.V.: The dispersion method in binary additive prob- ANALYTIC PLANE, complex-analytic plane - A non-
lems, Amer. Math. Soc., 1963 (translated from the Russian).
[8] CHUDAKOV, N.G.: Introductions to the theory of Dirichlet L- empty set of points z = (z I, . . . , zn) in the complex vec-
functions, Moscow-Leningrad, 1947 (in Russian). tor space en that satisfies a system of equations
[9] PRACHAR, K.: Primzahlverteilung, Springer, 1957. n
[10] DAVENPORT, H.: Multiplicative number theory, Springer, 1980. ~aijzi = bJ , j= I, ... ,k; aij, bjEC;
[II] TnCHMARSH, E.C.: The theory of the Riemann zetajunction, i=1
Clarendon, 1951.
[12] HUA, L.-K.: 'Abschatzungen von Exponentialsummen und ihre rank II aij II = k < n.
Anwendung in der Zahlentheorie', in Enzyklopaedie der The number k is called the complex codimension, while
Mathematischen WissenschaJten mit Einschluss ihrer Anwen-
dungen, Vol. I, 1959. Heft 13, Teil I. n - k is called the complex dimension of the analytic
[13] BAKER, A.: 'Linear forms in the logarithms of algebraic plane. The real dimension of the analytic plane equals
numbers (II)', Mathematika 14 (1967), 102-107. 2(n -k) and is even, but not all even-dimensional real
[14] VINOGRADOV, A.I.: 'The density hypothesis for Dirichlet L-
series', Izv. Akad. Nauk SSSR Ser. Mat. 29, no. 4 (1965),903-
planes in R2n = en are analytic planes. Complex one-
924 (in Russian). dimensional analytic planes are sometimes called com-
[15] BOMBIERI, E.: 'On the large sieve', Mathematika 12 (1965), plex, or analytic, straight lines. See also Analytic surface.
201-225.
A.A. Karatsuba E.D. Solomentsev
E.M. Chirka
Editorial comments. Liouville's theorem is closely
related with the theory of continued fractions (cf. Continued AMS 1980 Subject Classification: 15A06, 51 N1 0,
fraction). 32AXX
For a description of various results obtained on additive
problems and the distribution of prime numbers in arithmetic ANALYflC POLYHEDRON - A domain II of the
progressions see Large sieve; Distribution of prime complex space en, n ~ 1, which can be represented by
numbers; Sieve method. inequalities 1J;(z) 1<1, where the functions J;(z),

189
ANALYTIC POLYHEDRON

i = I, ... ,m, are holomorphic in some domain D c en more accurate definition. Let k be a field with a non-
containing II, i.e. II={zED: 1J;(z) 1<1, trivial norm (cf. Nonn on a field) (which is usually
i = I, ... ,m}. It is also assumed that II is compact in assumed to be complete), and let k { { X I, . . . ,Xn }} be
D. If J;(z) are polynomials, the analytic polyhedron is the k-algebra of power series in XI, ... ,Xn with coef-
said to be a polynomial polyhedron. If m = nand ficients in k which converge on some polycylinder with
J;(z)=a;z;, the analytic polyhedron is called a polydisc. centre (0, ... ,0), each series converging on its own
The sets O';={zED: 1J;(z) I =1; 1Jj(z) I <l,j:t=i} are polycylinder. A quotient ring of the ring
called the faces of the analytic polyhedron. The intersec- k {{X I, . . . ,Xn }} is called an analytic ring over k, or
tion of any k different faces (2",;;;;k";;;'n) is said to be an an analytic k-algebra; usually, k is the field of real
edge of the analytic polyhedron. If m;;:'n and all faces numbers R or the field of complex numbers C. Any
have dimension 2n - I, while no edge has dimension analytic ring is a local, Noetherian, Hensel ring; its
exceeding 2n - k, the analytic polyhedron is a Weil field of residues is isomorphic to k. An analytic ring
domain. The set of n-dimensional edges k {{ X I, . . . ,Xn }} is a regular (and a factorial) ring,
n ... n
0';, "';, =0';, 0';, forms the skeleton of the and its completion in the topology defined by the maxi-
analytic polyhedron. The concept of an analytic mal ideal (X \, ... ,Xn ) coincides with the ring of for-
polyhedron is important in problems of integral mal power series k[[X\, ... ,Xn ]]. The normalization
representations of analytic functions of several vari- lemma is true: An integral analytic ring is a finite
ables. extension of an analytic ring k { { X \, ... ,Xn }}. Alge-
References bras that are finite over k { { X \, ... ,Xn }} are generally
[I] SHABAT, B.V.: Introduction to complex analysis, 2, Moscow, called quasi-analytic k-algebras. If k is a perfect field,
1976 (in Russian). an analytic ring is an excellent ring.
E.D. Solomentsev
References
Editorial comments. The analytic polyhedron II defined [I] DIEUDONNE,1. and GROTHENDIECK, A.: 'Criteres differentiels
above is sometimes said to be an analytic polyhedron of de regularite pour les localises des algebres analytiques', J. of
order m (cf. [A1]). Algebra 5 (1967), 305-324.
[2] MALGRANGE, B.: Ideals of differentiable functions, Tata Inst.
References Fundam. Res., 1966.
[A 1] HORMANDER, L.: An introduction to complex analysis in [3] ABHYANKAR, S.S.: Lacal analytic geometry, Academic Press,
several variables, North-Holland, 1973. 1964.
v.I. Danilov
AMS 1980 Subject Classification: 32A07 Editorial comments.
References
ANALYTIC REPRESENTATION, holomorphic [A 1] GRAUERT, H. and REMMERT, R.: Coherent analytic sheaves,
Springer, 1984 (translated from the German).
representation - A representation of a complex Lie
group G in a topological vector space E in which all AMS 1980 Subject Classification: 32CXX, 32810
matrix elements (cp(g)~, 1/), ~EE, 1/EE', where E' is the
dual topological vector space, are holomorphic on G. A ANALYTIC SET - A subset of a complete separable
representation <p is called an anti-analytic representation metric space that is a continuous image of the space of
if its matrix elements become holomorphic after com- irrational numbers. The concept of an analytic set was
plex conjugation. An analytic (anti-analytic) representa- introduced by N.N. Luzin [1]. His classical definition
tion of a connected Lie group is uniquely determined has been generalized to general metric and topological
by a corresponding Lie algebra representation of this spaces.
group (cf. Representation of a Lie algebra). If G is a 1) An analytic set in an arbitrary topological space X
semi-simple complex Lie group, then all its topologi- is a subset of that space that is the image of a closed
cally irreducible analytic (anti-analytic) representations subset of the space of irrational numbers under an
are finite-dimensional. upper semi-continuous multi-valued mapping with com-
pact images of points and a closed graph [2]. If X is a
References
[I] NAiMARK, M.A.: Theory of group representations, Springer, Hausdorff space, the last-mentioned condition is
1982 (translated from the Russian). automatically satisfied. If X is metrizable, this defini-
[2] ZHELOBENKO, D.P.: Compact Lie groups and their representa- tion is equivalent to the classical one.
tion, Amer. Math. Soc., 1973 (translated from the Russian).
2) In a complete separable metric space the classical
D.P. Zhelobenko
analytic sets are identical with sf-sets (cf. .sd-set).
AMS 1980 Subject Classification: 22E10 This fact forms the base of another definition of ana-
lytic sets (in their capacity as .sd-sets) in general metric
ANALYTIC RING - A ring of germs of analytic func- and topological spaces [3], [4], [5]. In the class of
tions at a point in an analytic space. The following is a completely-regular spaces, analytic sets in the sense of

190
ANAL YTIC SET

l) are absolute analytic sets in the sense of 2). In the By defmition,


class of non-separable metrizable spaces definition 2) is dim" S = linplimz S, a E S;
zeS.
employed, since definition l) yields separable analytic z~a

sets. the dimension of the analytic set S is the number


3) An analytic set in a Hausdorff space [6], [7], is a dim S = sup dim" S.
ae~
continuous image of a subset of a compact space of An analytic set S is called purely k-dimensional if
type F08. <lima S =k for all a ES. For each O~k~dimS, the set
4) An analytic set is a continuous image of a set Sk = {a ES: ditna S =k} is a purely k-dimensional ana-
belonging to the family K08, where K is the family of lytic set in U\ U j>kSj' Thus, any analytic set in U
all closed compact subsets of some topological space
can be represented as a finite union of pure analytic
[8]. Definitions l), 3) and 4) equivalent in the class of
sets, S = U Sk' At the singular points
Hausdorff spaces.
5) For a generalization in another direction see [4]; <lima(S \ S)<dima s, so that the dimension of the
k-analytic sets are obtained from closed sets of a topo- analytic set of singular points of a purely k-dimensional
logical space by generalized d -operations (cf. d - analytic set in U is smaller than k. The connected
operation; the Baire space of countable weight is components of S are complex manifolds. Since this is
replaced by a Baire space of weight k) and are a gen- also true for the analytic set S \ S, one obtains the
eralization of analytic sets in the sense of 2). decomposition:
S = S U(S\S)* U ...
References
[I] LUZIN, N.N.: 'Sur la classification de M. Baire', C.R. Acad. of the analytic set into complex manifolds. The decom-
Sci. Paris Ser. I Math. 164 (1917),91-94. position
[2] FROLiK, Z.: 'Respectability of the graphs of composites', S = s; U(S\S;);-l U ...
Mathematika 16 (1969), 153-157.
[3] SIERPINSKI, W.: General topology, Univ. Toronto Press, 1956 is more convenient (the dimensions of the summands
(translated from the Polish). strictly diminish, d=dimS); it is called the stratification
[4] STONE, A.H.: 'Non-separable Borel sets II', Gen. Topol. and of S; the connected components of the k-th summand
Appl. 2 (1972),249-270.
[5] KURATOWSKI, K. and MOSTOWSKI, A.: Set theory, North- of this sum are called the k-dimensional strata of the
Holland, 1968. analytic set S.
[6] SCHNEIDER, V.E.: 'Descriptive theory of sets in topological An analytic set S is called reducible (in U) if it is the
spaces', Uchen. Zap. Moskov. Gos. Univ. 135 (1948), 37-85.
[7] CROQUET, G.: 'Theory of capacities', Ann. Inst. Fourier (Greno-
union of two analytic sets in U other than itself; other-
ble) 5 (1953-1954), 131-295. wise it is called irreducible (in U). All irreducible ana-
[8] SION, M.: 'On analytic sets in topological spaces', Trans. A mer. lytic sets in U are connected and pure. An analytic set
Math. Soc. 96 (1960), 341-354. S in U is irreducible if and only if the set S of its reg-
[9] JAYNE, J.E.: 'Structure of analytic Hausdorff spaces',
Mathematika 23 (1976), 208-211. ular points is connected. The closure of each connected
[10] ROGERS, c.A., JAYNE, J.E. and DELLACHERIE, c., ET AL. (EDS.): component of S is an irreducible analytic set in U;
Analytic sets, Acad. Press, 1980. k such analytic sets are called the irreducible components
A.G. El' in
of S. All analytic sets in U are locally finite unions of
6) An analytic set in the theory of analytic functions their irreducible components. If two analytic sets have
is a set that can locally be defined as the set of com- no common irreducible components, the dimension of
mon zeros of a finite number of holomorphic functions. their intersection is strictly smaller than the dimension
If S is an analytic set in an open subset U of the com- of each set. If the intersection of two irreducible sets in
plex n-dimensional space C', this means that for each U contains a set that is open in each one, these analytic
point a E U there exist a neighbourhood V C U and a sets are identical (the identity theorem).
finite tuple of functions fl' ... ,fr, holomorphic in V, An analytic set S in U is called irreducible at a point
such that S n V={ZEV: fl(z)= ... =fr(z)=O}. If a ES if there exists a fundamental system of neighbour-
the functions J; can be selected (in some neighbour- hoods Vi of the point a in U such that all analytic sets
hood V) so that the rank of the Jacobi matrix S n Vi in Vi are irreducible; a is then called an irredu-
I 3J; / 3zj I at the point a is r, a is called a regular point cibility point of the analytic set S. In a neighbourhood
of the analytic set S; the number n - r is called the of each irreducibility point the analytic set has the
(complex) dimension of S at a and is denoted by structure of an analytic covering, i.e. for each such point
<lima s. The set S of all regular points of an analytic a ES, dima S =k, there exist a connected neighbour-
set S is an open everywhere-dense subset of S (in the hood V C U, a linear mapping A: C' ~Ck and an ana-
induced topology on S as a subset of U). Its comple- lytic set a CA(V) such that the restriction of A to S Vn
ment S \ S - the set of singular points of S - is an is a proper mapping into A(V), while the restriction of
analytic set in U that is nowhere dense in S. A to (S n V)\A-I(a) is a finite-to-one locally biholo-

191
ANALYTIC SET

morphic covering over A(V) \ (1. For irreducible one- AMS 1980 Subject Classification: 54H05, 32810,
dimensional analytic sets there exists thus (after a suit- 32C25
able linear change in coordinates) a local parametric
representation of the form ANALYTIC SHEAF - A sheaf F on an analytic space
ZI = ~m, Z2 = /z<n, ... , Zn = f,,<n, X such that for any point x EX the set Fx is a module
over the ring (1J x of germs of holomorphic functions at
where tEe, I t I <r, m is a positive integer and the the point x, and such that the mapping if, a)-'?fa,
functions fi are holomorphic in the disc I t I <r. Thus, defined on the set of pairs (j, a) where fE (1J x' aEFx ,
in a neighbourhood of each irreducibility point, a one- is a continuous mapping of (1J XF into F for x EX.
dimensional analytic set is a topological manifold. For M.I. Vnttsekhovskit
an analytic set of higher dimension this is usually not
Editorial comments.
true.
The union of a finite number and the intersection of References
[A1] GRAUERT, H. and REMMERT, R.: Theory of Stein spaces,
any family of analytic sets in V are analytic sets in U. Springer, 1979 (translated from the German).
Any analytic set in V is closed in V. Any compact ana- [A2] GRAUERT, H. and REMMERT, R.: Coherent analytic sheaves,
lytic set in V cC' consists of a finite number of points. Springer, 1984 (translated from the German)
If V is connected and the analytic set S =1= V, then AMS 1980 Subject Classification: 32L 10
V \ S is open, everywhere-dense in V and is also con-
nected. The set of all isolated points of an analytic set ANALYTIC SPACE - A generalization of the concept
in V has no limit points in V. Moreover, all analytic of an analytic manifold. A local model (and, at the
sets are locally connected. A connected analytic set is same time, the most important example) of an analytic
pathwise connected. space over a complete non-discretely normed field k is
Any analytic set in V of dimension d has locally in V an analytic set X in a domain V of the n-dimensional
finite 2d-dimensional Hausdorff measure mes2d. If space kn over k, defined by equations
diII1a S =k, there exist positive constants c and C fl = ... = /p =0, where fi are analytic functions in V,
(which depend on a and S) such that which is provided with the sheaf (1J obtained by res-
cr 2k ,.;;; meSzk(S n {I Z -a I <r}) ,.;;; Cr 2k, tricting the sheaf (!) u / I on X; here (!) u is the sheaf of
for all sufficiently small r<O. germs of analytic functions in V, while I is the subsheaf
The family of analytic sets is invariant under biholo- of ideals generated by fl' ... ,/p. An analytic space
morphic mappings (cf. Biholomorphic mapping). More- over k is a ringed space that is locally isomorphic to a
over, if S is an analytic set in V and if f: V-,?V I is a ringed space (X, (1J) of the above type. If k is the field
proper holomorphic mapping, then f (S) is an analytic of real numbers R, one speaks of real-analytic spaces; if
set in VI' k is the field of complex numbers C, of complex-
The definition of analytic sets on complex manifolds analytiC spaces or simply of complex spaces; if k is a
is similar to the definition for C', and all the properties field of p-adic numbers Qp' of p-adic analytic spaces.
listed above are preserved except for one: in the general An analytic (holomorphic) mapping of one analytic
case there exist compact non-discrete analytic sets. In space (X, (!) x) into another (Y, @ y) is a morphism
special manifolds analytic sets can have certain addi- (X, @ X )-'?( Y, @ y) in the sense of the theory of ringed
tional properties. For example, in the complex n- spaces, i.e. a pair (</>0, <PI), where </>0: X -'? Y is a continu-
dimensional projective space all analytic sets are alge- ous mapping, while <PI: <Po I @ y-'? @ x is a sheaf
braic, i.e. coincide with the set of common zeros of homomorphism. A point x of an analytic space (X, @)
some finite tuple of homogeneous polynomials (Chow's is called simple (or regular, or non-singular) if x has a
theorem). neighbourhood over which (X, @) is isomorphic to a
Real-analytic sets in open subsets of Rn are defined space of the type (V, @ u), where V is a domain in kn.
in the same manner, except that real-analytic functions Otherwise x is known as a Singular point. A space is
must be taken instead of holomorphic functions. Any called smooth if all of its points are simple. A smooth
real-analytic set is the intersection of some analytic set analytic space is identical with an analytic manifold.
(in some open subset of en) with the real subspace The dimension dimx X of an analytic space X at a
Rn cC'. point x EX is defined as the dimension of the
References corresponding analytic set in a local model (cf. Analytic
[1] GUNNING, R.C. and ROSSI, H.: Analyticfunctions of several set). The global dimension is defined by the formula:
complex variables, Prentice-Han, 1965.
[2] HERVE, M.: Several complex variables: local theory, Oxford
dim X = sUP dim, X.
XEX

Univ. Press, 1963.


E.M. Chirka Let mx be the maximal ideal of the local nng @~

192
ANALYTIC SPACE

(x EX). The vector space TxCX)=(mx / m;)* over k is (1, cJ>\): (Y, rrJ y )~(X, rrJ). An example of an analytic
called the tangent space to X at the point x, while subspace of (X, rrJ) is the reduction of this space.
r;(X)=m x / m; is the cotangent space. The number The concept of an analytic space originated as a gen-
emdimx X = dim TAX) eralization of the concept of an analytic manifold. Such
a generalization had been suggested mainly by alge-
is called the tangent dimension or the embedding dimen- braic geometry, in which spaces with singular points
sion at the point x (the last name is connected with the had been under study for a long time. The effect of the
fact that em dim X is the smallest number n such that ideas of algebraic geometry was immediately reflected
(X, rrJ) is, in a neighbourhood of x, isomorphic to a in the ultimate formulation of the concept of an ana-
local model in kn). One has dimxX~emdimxX, and lytic space (for reduced complex spaces see [9]; for the
the two are equal if and only if x is a simple point. One general case, see [6]). In particular, any scheme of finite
also defines the dimension type over a complete normed field k naturally deter-
em dim X = sUP em dimx X mines an analytic space over k. This correspondence
XEX
between schemes and analytic spaces over k for
Each analytic mapping of analytic spaces reduced complex spaces was studied in [9], in which the
cJ>=(<Po, cJ>\): (X, rrJ x)~(Y, rrJ y) defines a linear map- theory of analytic spaces was named 'analytic
pmg dcJ>x: TxCX)~Tcp,,(x)(Y), which IS called its geometry'. Subsequently, the two geometries developed
differential at the point x EX. in parallel, and the exchange of ideas between the two
An analytic space (X, rrJ) is said to be reduced if its made a substantial contribution to the results achieved
local model in a neighbourhood of an arbitrary point in both these fields.
has the following property: I consists of all germs of In the theory of functions of several complex vari-
holomorphic functions that vanish on Xc U. In the ables, spaces with singular points appeared, in the first
case of an algebraically closed field k, this statement is place, as Riemannian domains (cf. Riemannian
equivalent to saying that the fibres rrJ x (x EX) of the domain), which are analogues of Riemann surfaces for
sheaf rrJ contain no nilpotent elements. All smooth functions of one variable. Using these as local models,
spaces are reduced. If (X, rrJ) is reduced, it can be said H. Behnke and K. Stein in 1951 defined a class of
that rrJ consists of the germs of certain continuous ringed spaces which, as was shown in [5], coincides
functions on X. The sections of the sheaf rrJ on the with the class of reduced normal analytic spaces (cf.
reduced space (X, rrJ) are identical with analytic func- Normal analytic space). The local geometry of analytic
tions on X, i.e. with analytic mappings X~k (cf. Ana- sets inen had been studied by W. RUckert as early as
lytic mapping). For any analytic space (X, rrJ) there 1932. Finally, non-smooth analytic spaces are a natural
exists a natural sheaf epimorphism red: rrJ ~ rrJ \ (where product of the theory of automorphic functions as quo-
(X, rrJ \) is a reduced analytic space), which is called the tient spaces of analytic manifolds by properly discrete
reduction. If jEr(X, rrJ) is a section of the sheaf rrJ, groups of automorphisms (cf. Discrete group of
one can speak of the value of j at a point x EX (which transformations). p-adic analytic sets were first intro-
coincides with the value of the analytic function red j at duced in 1935 by I. Skolem in connection with certain
x). For this reason the algebra rex,
rrJ), even in the problems in number theory.
non-reduced case, is often referred to as the algebra of The theory of analytic spaces has two aspects: the
analytic (holomorphic) functions on (X, rrJ). Sheaves of local and the global aspect. Local analytic geometry is
rrJ -modules on an analytic space (X, rrJ) are also called concerned with germs of analytic sets in the space P
orzall'tic shoaves. provided with sheaves of the above type. Principal
If (X, rrJ) is an analytic space, then each open U ex stress is laid on the study of the properties of the alge-
defines an open subspace (U, rrJ Iu). On the other bra of convergent power series in n variables over k
hand, one can introduce the concept of an analytic sub- and its quotient algebras - the so-called analytic alge-
space of (X, rrJ), which is necessarily closed. A set bras, the foundations of which were laid by K. Weier-
Y C X is called analytic if it is defined by a finite strass. The local theory comprises the theory of normal-
number of analytic equations in a neighbourhood of ization, the study of singular points, local properties of
each point x E X. The sheaf of ideals lye rrJ consisting analytic functions and mappings, etc. The most impor-
of the germs of all analytic functions that vanish on Y, tant results obtained in this field refer to the case of
is connected wi~h such a set. Conversely, each analytic algebraically closed fields k [1], [4], [7]. There appears
sheaf of ideals of finite type Ie rrJ defines an analytic the important concept of a coherent analytic sheaf,
set Y C X. If rrJ y = rrJ / I I y, one obtains an analytic which continues to play a leading part in the global
space (Y, rrJ y), which is called an analytiC subspace of theory. In particular, the structure sheaf rrJ of the ana-
(X, rrJ); there exists a natural morphism lytic space (X, rrJ) and the sheaf of ideals ly of any

193
ANALYTIC SPACE

analytic set Y e X are coherent for any algebraically morphic mappings, the first and most important one of
closed k. The case k = R has also been thoroughly stu- which (for proper mappings) was demonstrated by H.
died. Grauert [6A).
Global analytic geometry studies the properties of An important role in the theory of complex spaces is
analytic functions, mappings and other analytic objects, played by holomorphic mappings of a special kind -
defined 'globally' on the entire analytic space, as well the so-called modifications (cf. Modification), i.e.
as the geometrical properties of these spaces. In the mappings f: X ~ Y inducing an isomorphism of open
process of studying complex-analytic spaces natural subspaces X\XI~Y\ YJ, where XI ex, YI eY are
classes of them were isolated. These include, first, the certain analytic sets. One says that Y is obtained from
class of Stein spaces (cf. Stein space), which can be X by 'contracting' the subset XI on YJ, while X is
roughly described as the class of spaces with a suffi- obtained from Y by 'blowing up' the subset YI into
ciently large amount of global holomorphic functions. X I. Of special interest are analytic subsets that can be
Stein spaces are the most natural multi-dimensional contracted into a point (exceptional analytiC sets); these
generalizations of the domains of the complex plane were characterized by H. Grauert [6B). A natural prob-
considered in the classical theory of functions of one lem in analytic geometry is the following problem of
complex variable. This class of spaces in fact coincides resolution of singularities: Is it possible to 'blow up' an
with the class of analytic subspaces of the spaces en. analytic space so that the entire space becomes smooth?
Its algebraic analogue is the class of affine algebraic It should be noted that modifications in algebraic
varieties (cf. Affine variety). geometry were studied as early as the 19-th century,
For a domain D ecn holomorphic completeness is while modifications in analytic geometry were intro-
equivalent with the fact that D is a domain of holomor- duced by Behnke and Stein in 1951 in the context of
phy, i.e. that there exists a holomorphic function in D the concept of a Riemannian domain.
that does not extend into a larger domain. The boun- Another natural object of study, which is also closely
dary of a domain of holomorphy is pseudo-convex, i.e. connected with the ideas of algebraic geometry, are
it behaves with respect to local analytic sub manifolds meromorphic functions on complex spaces and their
as would a convex surface with respect to real linear generalizations - meromorphic mappings (a mapping
submanifolds. The problem of the validity of the con- which yields an operation inverse to a modification
verse theorem (cf. Levi problem) gave rise to a number may serve as an example; cf. Meromorphic function;
of investigations and yielded a new characterization of Meromorphic mapping). Meromorphic functions on a
Stein spaces. reduced compact complex space X form a field of tran-
The class of compact spaces is, in a certain sense, the scendence degree t(X),;;;;dimX (this was first demon-
opposite case. The following generalization of the clas- strated by c.L. Siegel in 1955 for the smooth case).
sical theorem of Liouville is valid: Functions which are Spaces X for which t(X)=dimX (Moishezon spaces)
holomorphic on a reduced compact space are constant form a class which is very close to the class of projec-
on each connected component of this space and tive algebraic varieties; they are characterized by the
therefore form a finite-dimensional vector space. A gen- fact that they are modifications of smooth projective
eralization of this theorem are the finiteness theorems, algebraic varieties. Another class of analytic spaces
which confirm the finite dimensionality of the homol- which is very close to algebraic varieties, are Kahler
ogy groups with values in a coherent analytic sheaf. manifolds (cf. Kiihler manifold). A number of criteria
Holomorphically-convex complex spaces, q-complete, for the projectivity of a compact complex space are
q-pseudo-convex, q-pseudo-concave spaces, which are known [3], [6B], [13]. Studies of automorphic functions
generalizations of Stein spaces, and compact spaces are in several complex variables have made a major contri-
also considered (cf. Holomorphically-convex complex bution to the development of this subject.
space). The theory of deformations of analytic structures (cf.
These classes of complex spaces have their analogues Deformation) is concerned with the problem of classifi-
in the theory of holomorphic mappings. Thus, to com- cation of analytic objects of a given type (e.g. all com-
pact spaces correspond proper holomorphic mappings; plex structures on a given real-analytic variety, all ana-
to holomorphically complete spaces correspond Stein lytic subspaces in a given complex space, etc.), with the
mappings, etc. 'Relative' analogues were found for purpose of introducing the 'natural' structure of a com-
many theorems, and the 'absolute' variant of a theorem plex space on the set of these objects, and in order to
is obtained from its relative variant if the entire space describe all analytic objects 'sufficiently near' to the
is mapped into a point. The corresponding generaliza- given object. In the former case one speaks of the glo-
tion of finiteness theorems are theorems of coherence of bal moduli problem, while in the latter one speaks of
direct images of coherent analytic sheaves under holo- the local moduli problem. An example of the global

194
ANALYTIC SURFACE

moduli problem is the problem of the classification of [12] lIIRzEBRUCH, F.: TopolOgical methods in algebraic geometry,
all complex structures on a compact Riemann surface Springer, 1978 (translated from the German).
[13] CHERN, S.S.: Complex manifolds without potential theory,
(cf. Moduli of a Riemann surface). Springer, 1979.
The principal apparatus of global analytic geometry A.L. Onishchik
is formed by coherent analytic sheaves and their coho- Editorial comments. Instead of [8] one may also consult
mology spaces. The first successful result of the coho- [A2].
mological method was Cartan's solution of the additive The original papers in which what are now called
Cousin problem and of problems of prolongation of a Moishezon spaces were introduced are [A3]. They are a
holomorphic function from a closed Y C X for Stein natural setting to examine the question of projectivity of
manifolds (X, l!) (cf. Cousin problems; [8]); it was compact complex spaces. In particular, part I contains the
found that the solution of these problems is obstructed result that a Moishezon space is algebraiC projective if and
by the cohomology groups HJ(X, l!) and HJ(X, ly), only if it carries a Kahler metric.
respectively. References
Most of the results of the global theory were initially [A 1] GRAUERT, H. and REMMERT, R.: Coherent analytic sheaves.
demonstrated for complex manifolds, after which they Springer. 1984 (translated from the German).
[A2] CARTAN. H.: Sem. ENS. 1951-1952. Ecole Norm. Super ..
were generalized to analytic spaces. The difficulties
[A3] MOISHEZON, B.: 'On n-dimensional compact complex
involved in this procedure often necessitated the varieties with n algebraically independent meromorphic func-
development of completely new methods. Cohomology tions'. Amer. Math. Soc. Translations Ser. 263 (1967). 51-
spaces of a locally free analytic sheaf on a complex 177.
manifold may be expressed in terms of differential AMS 1980 Subject Classification: 32CXX
forms (the Dolbeault - Serre theorem, cf. also Differen-
tial fonn), which makes it possible to study them by ANALYfIC SURFACE in a Euclidean space - An
methods of the theory of elliptic differential equations arbitrary two-dimensional analytic submanifold X in
and by other analytic methods. In the non-smooth case the space Rn , n > 2. However, the term 'analytic surface
this approach involves major difficulties, and it is often in Rn ' is often employed in a wider sense as a manifold
necessary to define cohomology classes in other ways, which is (locally) analytically parametrizable. This
e.g. using Cech cochains in a suitable covering. The means that the coordinates of the points
technique of Banach analytic spaces, applied to moduli X=(Xb'" ,Xn)EX can be represented by analytic
problems, proved useful in this context (d. Banach ana- functions Xj =Xj(t) of a real parameter t =(tj, ... ,tk)
lytic space). which varies in a certain range aCRk , l.;;;;k<n. If the
See also Real-analytic space; Rigid analytic space. rank of the Jacobi matrix II ax / at II, which for an
References analytic manifold is maximal everywhere in a, is equal
[1] ABHYANKAR, S.S.: Local analytic geometry, Academic Press, to k, then the dimension of the analytic surface X is k.
1964.
[2] BANICA, C. and STANASILA, 0.: Algebraic methods in the global
In the complex space en the term 'analytic surface' is
theory of complex spaces, Wiley, 1976 (translated from the also employed to denote a complex-analytic surface X in
Rumanian).
[3] GUNNING, R.C. and ROSSI, H.: Analytic functions of several
en, i.e. a manifold which allows a holomorphic
(complex-analytic) parametrization. This means that
complex variables, Prentice-Hall, 1965.
[4] GRAUERT, H. and REMMERT, R.: Analytische Stellenalgebren, the complex coordinates of points z=(zJ, ... ,Zn)EX
Springer, 1971. can be expressed by holomorphic functions Zj =Zj(T) of
[5] GRAUERT, H. and REMMERT, R.: 'Komplexe Rliume', Math a parameter T = (TJ, . . . ,Tk) which varies within a cer-
Ann. 136 (1958),245-318.
[6A] GRAUERT, H.: 'Ein Theorem der analytischen Garbentheorie tain range acck , l.;;;;k<n (it is also usually assumed
und die Modulrliume komplexer Strukturen', Publ. Math that rank II az / aT II =k). If a =c" and all the func-
IHES 5 (1960), 233-292. tions Zj(T) are linear, one obtains a complex-analytic
[6B] GRAUERT, H.: 'Ueber Modifikationen und exzeptionelle
analytische Mengen', Math Ann. 146, no. 4 (1962), 331-368.
plane (cf. Analytic plane). If k = 1, the term which is
[7] NARASIMHAN, R.: Introduction to the theory of analytic spaces, sometimes employed is holomorphic curve (complex-
Springer, 1966. analytic curve); if all functions Zj(T) are linear, one
[8] CARTAN, H.: 'Varietes analytiques complexes et cohomologie',
in Colloque sur les fonctions de plusieurs variables. Bruxelles, 11
speaks of a complex straight line in the parametric
au 14 mars 1953, G. Thone and Masson, Liege-Paris, 1953. representation:
[9] SERRE, J.-P.: 'Geometrie algebrique et geometrie analytique',
Ann. Inst. Fourier (Grenoble) 6 (1955-1956), 1-42. Zi = aiT+bi; ai,biEC, i=l, ... ,n, (al> ... ,an) =1= O.
[10] FUKS, B.A.: Special chapters in the theory of analytic functions
of several complex variables, Amer. Math. Soc., 1965 (translated References
from the Russian). [I] SHABAT, B.V.: Introduction to complex analySiS, 1-2, Moscow,
[II] HORMANDER, L.: An introduction to complex analysis in several 1976 (in Russian).
variables, North-Holland, 1973. [2] VLADIMIROV, V.S.: Methods of the theory offunctions of several

195
ANALYTIC SURFACE

complex variables, M.I.T, 1966, Chapt. 2 (translated from the The group Z acts discretely and without fixed points on
Russian). Y, while the quotient space X = Y / Z is diffeomorphic
E.D. Solomentsev
E.M. Chirka to SI XS3. The quotient space X has a natural struc-
ture of an analytic surface, and is called a Hopf surface.
AMS 1980 Subject Classification: 32C25, 32B15
Oassification of analytic surfaces. The principal
ANALYTIC SURFACE (IN ALGEBRAIC invariant in the classification of analytic surfaces is the
GEOMETRY) - A two-dimensional (complex) analytic transcendence degree of the field of meromorphic func-
manifold, i.e. a smooth four-dimensional manifold with tions C(X) on the analytic surface X. According to
a complex structure. While the theory of analytic sur- Siegel'S theorem, for any compact connected manifold
faces forms part of the general theory of complex mani- X the field C(X) is finitely generated, and its transcen-
folds, the two-dimensional case is treated separately, dence degree is not larger than the complex dimension
since much more is known about analytic surfaces than of X. Thus, for an analytic surface X, the field C(X)
about n-dimensional manifolds if n ;;;;'3. Moreover, cer- contains two independent meromorphic algebraic func-
tain facts are specific to the two-dimensional case tions, or one such function or constants only. These
alone. These results concern the classification of ana- possibilities lead to the following theorems.
lytic surfaces, which is analogous to that of algebraic For any analytic surface X to be an algebraic surface
surfaces (cf. Algebraic surface) - a fact which largely it is necessary and sufficient that there exist two alge-
reduces the theory of analytic surfaces to that of alge- braically independent meromorphic functions on X.
braic surfaces. The principal results on the classification If an analytic surface X has a field of meromorphic
of analytic surfaces were obtained by K. Kodaira [l], functions of transcendence degree I, then X is an ellip-
[2], [3], but his work is based on the results of the clas- tic surface, i.e. there is a holomorphic mapping onto an
sical Italian school of algebraic geometry on the classif- algebraic curve Y, P : X ~ Y, such that
ication of algebraic surfaces. P'C(Y) = C(X)
All analytic surfaces discussed below are assumed to
and all fibres of P, except for a finite number, are ellip-
be compact and connected.
tic curves (the singular fibres may only have a very spe-
Examples. I) Algebraic surfaces. Let cial form, which has been thoroughly studied).
j;(xo, ... ,XN), i=l, ... ,m, If no meromorphic functions other than constants
exist on an analytic surface X, and no exceptional
be a set of homogeneous polynomials with complex
curves (cf. Exceptional subvariety) exist on X, then the
coefficients. The closed subset of the complex projective
first Betti number b I of X assumes only three values:
space pN (C) specified by the equations ii(x) = 0 is an
4, I or O. If b I = 4, X is a complex torus, and if b I = 0,
analytic surface if it is non-singular, connected and has
X has a trivial canonical fibration. These analytic sur-
complex dimension two. This is the basic example of an
faces are called K3-surfaces. They are all mutually
analytic surface.
homeomorphic. The case b I = 1 has not been studied in
2) Complex tori. Let C2 be the two-dimensional vec-
detail, but certain examples of analytic surfaces with
tor space over the field of complex numbers (as a vec-
b I = I are obtained by a generalization of the construc-
tor space over the field of real numbers it is isomorphic
tion of Hopf surfaces.
to R4) and let r"-'z4 be a lattice in C2. The quotient
Analytic Kahler surfaces are not always algebraic.
space X = C2 / r is an analytic surface. Being a smooth
They are, however, algebraic if the square of their first
manifold, X is diffeomorphic to a four-dimensional
Chern class is positive. All analytic Kahler surfaces
torus, but the complex structure on X depends on the
with b I >0 are deformations of algebraic surfaces.
lattice r. Complex tori X = C2 / r play an important
role in analysis, since meromorphic functions on such References
[I] KODAIRA, K.: Matematika 6, no. 6 (1962), 3-17.
tori are meromorphic functions on C2 and are periodic
[2A] KODAIRA, K.: 'On compact (complex) analytic surfaces, 1',
with period lattice r. Analytic surfaces of the type Ann. of Math. 71, no. 1(1960), 111-152.
C2 / r are not always algebraic. There also exist lattices [2B] KODAIRA, K.: 'On compact (complex) analytic surfaces. II',

r such that there are no meromorphic functions at all [2C]


Ann. of Math. 77, no. 3 (1963), 563-626.
KODAIRA, K.: 'On compact (complex) analytic surfaces, Ill',
(except for constants) on the corresponding torus Ann. of Math. 78, no. 1(1963), 1-40.
C2 / r. For specific examples of such tori see [5]. [3A] KODAIRA, K.: 'On the structure of compact (complex) analytic
3) Hopf surfaces. Let Y=C 2 -{0} and let c be a surfaces 1', Amer. 1. Math. 86 (1964), 751-798.
[3B] KODAIRA, K.: 'On the structure of compact (complex) analytic
positive number. Consider the action of the group Z on surfaces II', A mer. 1. Math. 88 (1966),682-721.
Y given by [3C] KODAIRA, K.: 'On the structure of compact (complex) analytic
surfaces III', A mer. 1. Math. 90 (1968), 55-83.
[3D] KODAIRA, K.: 'On the structure of compact (complex) analytic

196
ANALYTIC THEORY OF DIFFERENTIAL EQUATlO'JS

surfaces IV', Amer. J. Math. 90 (1968), 1048-1066. where A _j are constant matrices, A -P7'~:0, and the
[4) 'Algebraic surfaces', Trudy Mat. Inst. Steklov. 75 (1965) (in matrix B(t) is holomorphic at to. A pole 10*,00 of
Russian).
[5) SHAFAREVICH, I.R.: Basic algebraic geometry, Springer, 1977
order p is called a regular Singular point if p = 1 and an
(translated from the Russian). irregular singular poinl if p~2. The case 10 = 00 is
B.B. Venkov reduced to the case 10 = 0 by the change of variables
Editorial comments. The notion defined above is also I~/-I. In what follows, 10*,00.
called a complex-analytic surface, since one considers com- 2) Let 10 be a pole of A (I). Then there exists a fun-
plex structures and the field of complex numbers C. If damental matrix X(/) for the system (2) of the form
instead one considers real structures and the field of real
numbers, one speaks of real-analytic surfaces. However, an X(t) = 1I>(t)(t-tof, (3)
analytic surface is always understood in the sense explained where D is a constant matrix, 4>(/) is holomorphic if
above. I t - 10 I < p where to is a regular singular point, and
References 4>(/) is holomorphic if 0< I t - to I <p where to is an
[A1] BARTH, W., PETERS, C. and VEN, A. VAN DER: Compact com- irregular singular point, for some p>O. (Here,
plex surfaces, Springer, 1984.
(I - (0)D = exp(ln(t - (0)D), by definition.) For a regular
AMS 1980 Subject Classification: 14JXX singular point the matrix D can be expressed in terms
of A (t) in an explicit form [1], [2]; this is not the case
ANALYTIC THEORY OF DIFFERENTIAL EQUA-
for irregular singular points.
TIONS - The branch of the theory of ordinary differen-
A similar classification of singular points is intro-
tial equations in which the solutions are studied from
duced for differential equations of order n with mero-
the point of view of the theory of analytic functions. A
morphic coefficients. Differential equations and dif-
typical formulation of a problem in the analytic theory
ferential systems all singular points of which are regular
of differential equations is this: Given a certain class of
are known as Fuchsian differential equations (systems).
differential equations, the solutions of which are all
The general form of A (t) for such a system is:
analytic functions of one variable, find the specific pro-
k
perties of the analytic functions that are solutions of A(t) = ~(t-tj)-IAj, Aj = const, k<oo.
this class of equations. In this wide sense, the analytic j=l

theory of differential equations includes the theory of An example of a Fuchsian differential equation is the
algebraic functions, the theory of Abelian integrals, the hypergeometric equation.
theory of special functions, etc. Special functions - 3) Let A (t)=tqB(t) where q~O is an integer and let
Bessel functions, Airy function", Legendre functions, B(/) be holomorphic at 1=00 (00 is an irregular singu-
Laguerre functions, Hermite functions (cf. Hermite lar point if B( 00 )*,0). If S is a sufficiently narrow sec-
function), Chebyshev functions (cf. Chebyshev function), tor of the form I t I >R, a<argt<{3, then there exists
Whittaker functions, Weber functions (cf. Weber func- a fundamental matrix of the form
tion), Mathieu functions, hypergeometric functions (cf. X(t) = P(t)tQexp(R(t, (4)
Hypergeometric function), Sonin functions and many
other functions - are solutions of linear differential where Q is a constant matrix, R (t) is a diagonal matrix
equations with analytic coefficients. whose elements are polynomials in t 1/ p, P ~ 1 is an
integer and
Linear theory. Consider a system of n equations in 00
pet) ~ L,Pjt- Jlp
matrix notation: j=O
x= A (t)x +f(t). as I t I ~OO, tES. The plane C(/) is subdivided into a
(I)
1) Let the matrices A (t),f(t) be holomorphic in a finite number of sectors, and in of them there exists a
region G cC(t), where C(t) is the complex t-plane. Any fundamental matrix of the form (4) ([3], [4]; see also [1],
solution of the system (1) will then be analytic in G [2]).
(but will not, in general, be single-valued if G is not 4) As a result of analytic continuation along a closed
simply-connected). It is assumed that A (t) is mero- path y the fundamental matrix X(/) is multiplied by
morphic in G, and one considers the homogeneous sys- By: X(t)~X(t)By, where By is a constant matrix; one
tem obtains the monodromy group of the differential equa-
x = A (t)x. (2)
tion. LA. Lappo-Danilevskii [5] has studied the prob-
(The matrix A (t) is called holomorphic (meromorphic) in lem of Riemann: Let A (t) be a rational function of t
G if all its elements are holomorphic (meromorphic) in and let the singularities of the fundamental matrix X(t)
G.) A point to EG is called a pole of the matrix A (t) of be known, find A (I).
order p~ 1 if, in a given neighbourhood of this point, 5) Let the function z =CP(/) be a conformal mapping
A(t) = A_.(t-to)-+ +A_1(t-to)-I+B(t), of the upper half-plane Imt>O onto the interior of a

197
ANALYTIC THEORY OF DIFFERENTIAL EQUATIONS

polygon, the boundary of which consists of a finite the numbers Al / A2, A2 / Al be either a non-negative
number of segments of straight lines and circular arcs. integer or a real negative number. Then there exists a
The function cf>(t) will then satisfy the Schwarz equa- neighbourhood U of the p.,oint (to, xo), a neighbour-
tion: Z '" 3 ["
z ]2
hood V of the point t =0, =0, and functions x
{z, t} i-"2 i = R(t). (5) (=(t, x) and x=x(t, x) such that the mapping U~V
defined by these functions is biholomorphic, and the
where R (t) is a rational function, and the equation differential equation (8) in the new variables assumes
" 1 the form [6]:
w +"2R(t)w =0 (6) dx AIX
is Fuchsian. Any solution of equation (5) may be dt A2t
represented in the form z = W I / W 2, where W I and W 2 All solutions of equation (8) in the new variables are
are linearly independent solutions of equation (6). Let written in the form x=C?,,/lo., and x=O. Thus, a
G be an infinite discrete group and let cf>(t) be an auto- singular point of the equation is a branching point of
morphic function of G, then cf>(t) can be represented as infinite order for all solutions of equation (8) (except
</>=WI/W2' where WI, W2 are linearly independent for the trivial solutions). The singular points of the
solutions of equation (6) and R (t) is some algebraic solution which coincide with the singular points of the
function. equation are called stationary. As distinct from the
Non-linear theory. 1) Consider the Cauchy problem: linear case, the solution of a non-linear equation may
have singular points not only at the singular points of
x = f(t, x), x(to) = xo, (7)
the equation; such singular points of the solution are
where tEC(t), x=(xJ, ... ,Xn)Ecn(X), called movable. Painleve's theorem is valid: The solu-
j=(jJ, ... ,J,,). tions of the equation
Cauchy's theorem: Let the function j(t, x) be holo-
morphic in (t, x) in a region GCC(t)xcn(x) and let
P(t, x, x) = 0,
the point (to, XO)EG. Then there exists a 8>0 such where P is a polynomial in x and x with holomorphic
that in the domain I t - to I <8 there exists a solution coefficients in t, has no movable transcendental singu-
x(t; to, xo) of the Cauchy problem (7), which is unique lar points [7].
and holomorphic. If, in equation (8), P and Q are polynomials in t, x,
An analytic continuation of the solution x(t;to,xo) then, in view of Painleve's theorem, all movable singu-
will also be a solution of the system (7), but the func- lar points are algebraic. On substituting t = 1 / t',
tion obtained as a result of the continuation may have x =x' / t', equation (8) assumes the form
singularities and, in the general case, is a many-valued dx' PI(t', x')
function of t. The problems which arise are: What dt' = -Q-'-I-'-(t7"",,-x-:'-,),
singularities may this function have and how can one
where P I and QI are polynomials. Let Xj be the roots
construct the general solution? In the linear case these
of the equation PI (0, x') =0. The points (0, xj) are
questions have been conclusively answered. In the
called infinitely-remote Singular points of equation (8);
non-linear case the situation is much more complicated
the structure of the solutions in a neighbourhood of
and has not been fully clarified even when the !jet, x)
these points is described by the theorem quoted above
are rational functions of t, x.
[6].
2) Consider the differential equation:
Let P and Q be polynomials of degree n. Since P, Q
dx_~
(8) are defined by their coefficients and the pair (AP, AQ)
dt - Q(t, x)'
defines the same equation, one obtains a one-to-one
where t E C, X E C, and P and Q are holomorphic func- correspondence between equations (8) and the points of
tions of (t, x) in a certain region G. A point (to, xo) is the complex projective space CpN,
called an (essentially) Singular point of equation (8) if N=(n + l)(n -2)-1. The following theorem is valid: If
P(to, xo)=O, Q(to, xo)=O. Below the structure of the some set of measure zero is removed from CpN, the
solutions in a neighbourhood of a singular point of the remaining equations (8) will have the following pro-
equation is clarified. Develop P and Q into Taylor perty: All solutions x=x(t) are everywhere dense in
senes: C2 =C(t)XC(x) [8].
P(t,x) = all(x-xO)+a12(t-tO)+ ... , 3) Consider the autonomous system
Q(t,x) = a2l(x-xo)+adt-t o)+ : .. , x = f(x), (9)
and let AI, A2 be the eigen values of the matrix I au II tEC(t),xEcn(X),j=(j], ... ,J,,). A point xU will
The following theorem holds. Let Aj=l=O and let none of then be a singular point oj the system (9) if j(xo) = o.

198
A:--:AL YTIC VECTOR

Poincare's theorem is valid: Let xO be a singular point References


of the autonomous system (9). Also let a) the elemen- [A 1] INCE, E.L.: Ordinary differential equations, Dover, reprint,
1956.
tary divisors of the Jacobi matrix / (xo) be prime divi-
[A2] HILLE, E.: Ordinary differential equations in the complex
sors; and b) the eigen values Aj of this matrix lie on domain, Wiley (I nterscience) , 1976.
one side of some straight line in C(A) passing through
the coordinate origin. Then there exists neighbourhoods AMS 1980 Subject Classification: 34A25
U, V of the points x = x ,x
= 0 and a biholomorphic ANALYTIC VECTOR in the space V of a representa-
mapping x =x(x): V ~U such that the system (9)
x
expressed in the variable assumes the form [9]:
tion T of a Lie group G - A vector ~ E V for which the
mapping g~T(g)~ is a real-analytic vector function on
iXJ - G with values in V (cf. Representation theory). If T is a
dt = AjXj, 1~j ~n.
weakly-continuous representation of a Lie group G in a
If only condition a) is satisfied, it is possible, by using Banach space V, then the set VW of analytic vectors is
a transformation x =cp(x), where q,(x) is a formal power dense in V [I], [2], [3]. This theorem has been general-
series, to convert system (9) in a neighbourhood of a ized to a wide class of representations in locally convex
singular point into a system which can be integrated in spaces (5). It has been proved (6) that a representation
quadratures [9], [10]. However, the convergence of these of a connected Lie group G in a Banach space V is
series has been proved on assumptions close to a) and uniquely determined by the corresponding representa-
b). If the function f(x) and the transformation x =q,(x) tion of the Lie algebra of the Lie group G. in the space
are real for real x, x,
a theorem similar to the theorem VW.
of Poincare has been proved [11]. The structure of the An analytic vector for an unbounded operator A on a
solutions of the autonomous system (9) in general, Banach space V, defined on a domain D (A), is defined
where jj(x) are polynomials and n;;;' 3, has not yet as a vector 00
~ E nD(An)
(nineteen-seventies) been studied. n=1

References for which the series


[1) CODDINGTON, E.A. and LEVINSON, N.: Theory of ordinary dif-
ferential equations, McGraw-Hill, 1955.
(2) WAZOV, W.: Asymptotic expansions for ordinary differential
equations, Interscience, 1965. has a positive radius of convergence. This notion, which
[3) BIRKHOFF, G.D.: 'Singular points of ordinary linear differential
equations', Trans. A mer. Math. Soc. 10 (1909),436-470. was introduced in [2], is a special case of the general
(4) 'fIuITZINSKY, W.J.: 'Analytic theory of linear differential equa- concept of an analytic vector; here the set of points on
tions', Acta Math. 62 (1934), 167-226. the real line with the addition operation plays the role
(5) LAPpo-DANILEVSKY, J.A.: Memoire sur la tMorie des systemes
des equations differentielles lineaires, Chelsea, reprint, 1953.
of the Lie group G. It was found useful in the theory of
(6) BIEBERBACH, L.: Theorie der gewohnlichen Differential- operators on Banach spaces and in the theory of elliptic
gleichungen auffunktionentheoretischer Grundlage dargestellt, differential operators.
Springer, 1965.
[7) GOLUBEV, V.V.: Vorlesungen Uber Differentialgleichungen im References
Komplexen, Deutsch. Verlag Wissenschaft., 1958 (translated [I) CARTIER, P. and DIXMIER, J.: 'Vecteurs analytiques dans les
from the Russian). representations de groupes de Lie', A mer. J. Math. 80 (1958),
(8) KHuoAi-VERENOV, M.G.: 'On a property of solutions of a dif- 131-145.
ferential equation', Mat. Sb. 56 (98), no. 3 (1962), 301-308 (in (2) NELSON, E.: 'Analytical vectors', Ann. of Math. 70 (1969),572-
Russian). 615.
(9) NEMYTSKII, V.V. and STEPANOV, V.V.: Quantitative theory of [3) GARoING, L.: 'Vecteurs analytiques dans les representations
differential equations, Princeton Univ. Press, 1960 (translated des groups', Bull. Soc. Math. France 88 (1960),73-93.
from the Russian). [4] CARTIER, P.: 'Vecteurs analytiques', in Sem. Bourbaki
(10) BRYUNO, A.D.: Local methods of non-linear analysis of differen- 1958/1959, Vol. 181, 1959, pp. 12-27.
tial equations, Moscow, 1979 (in Russian). (5) MOORE, R.T.: 'Measurable, continuous and smooth vectors for
[11) SIEGEL, c.L.: 'Ueber die Normalform analytischer Differential- semigroup and group representations', Mem. A mer. Math. Soc.
gleichungen in der Nilhe einer Gleichgewichtslosung', Nachri- 78 (1968).
chten Akad Wissenschaft. Gottingen (1952), 2130. [6) HARISH-CHANDRA: 'Representations of a semisimple Lie group
(12) POINCARE, H.: Oeuvres de Henri Poincare, Vol. 3, Gauthier- on a Banach space 1', Trans. A mer. Math. Soc. 75 (1953), 185-
Villars, 1916-1965. 243.
(13) FORD, L.R.: Automorphicfunctions, Chelsea, reprint, 1951. A.A. Kirillov
M. V. Fedoryuk
Editorial comments.
Editorial comments. The Riemann monodromy problem References
mentioned above is of great importance in the modern [A1] WARNER, G.: HarmOniC analysis on semi-simple Lie groups,
1, Springer, 1972.
theory of completely integrable or soliton equations. Cf.
Soliton. AMS 1980 Subject Classification: 22E45

199
ANDRONOV - WITT THEOREM

ANDRONOV - Wrrr THEOREM - A modification


of Lyapunov's theorem (on the stability of a periodic
J

(z) = sin /!7T
V7T
[1- _Z_2_
22 -,?-
+ --::---:=-z_4-:---::-_
(22 - '?-)(42 -'?-)
solution of a non-autonomous system of differential
equations) for the autonomous system - (22_'?-)(42~'?-)(62_,?-) + ... ] +
dx,
dt = X;(xJ, ... ,xn), i=I, ... ,n. (1) + sin /!7T Z_ _
[ __ Z3 +
7T 12-,?- (12-'?-)(3 2 -,?-)
Let
= eMt) (2)
(l2-'?-)(32~'?-)(52-,?-)
X,

be a periodic solution of the system (1), and let + ... }

. ~aX'(</>I""'</>n) . The asymptotic expansion


~, = ..::.. a. ~)' 1= 1, ... , n, (3)
j= I x)
Jp(z) ~ J.(z)+
be the corresponding system of variational equations
+ sinll7T [1- 12-,?- + (12-'?-)(3 2 -,?-) _ ... ]_
which always has, in the case here considered, one zero 7TZ Z2 z4
characteristic exponent. The Andronov- Witt theorem
is then valid: If n - 1 characteristic exponents of the _ sinV7T [.!:..._ v(2 2 -,?-) + v(22 -,?-)W-'?-) _ ... ]
7TZ Z Z3 Z5
system (3) have negative real parts, a periodic solution
(2) of the system (1) is stable according to Lyapunov is valid for I z I-HXJ and I argz I <7T.
(cf. Lyapunov characteristic exponent; Lyapunov stabil- The functions have been named after CT. Anger [1],
ity). who studied functions of the type (*), but with 27T as
The Andronov- Witt theorem was first formulated the upper limit of the integraL
by A.A. Andronov and A.A. Witt in 1930 and was References
proved by them in 1933 [1]. [1] ANGER, C.T.: Neueste Schr. d. Naturf d. Ges. i. Danzig 5
(1855), 1-29.
References [2] WATSON, G.N.: A treatise on the theory oj Bessel junctions,
[1] ANDRONOV. A.A.: Collected works, Moscow. 1976 (in Russian).
Cambridge Univ. Press, 1952.
[2] PONTRYAGIN, L.S.: Ordinary differential equations, Addison- A.P. Prudnikov
Wesley, 1962, 264 (translated from the Russian).
AMS 1980 Subject Classification: 33A40
E.A. Leontovich-Andronova
Ed ito rial comments. The Andronov - Witt theorem is ANGLE - A geometrical figure consisting of two dis-
usually found in the Western literature under some heading tinct rays issuing from the same point. The rays are
like' hyperbolic periodic attractor'. called the sides of the angle and their common original
Good additional general references are [A 1], [A2], [A3]. is called the vertex of the angle. Let [BA), [BC) be the
In [A2] the theorem under discussion occurs as a statement
sides of an angle, B its vertex and a the plane defined
about periodic attractors, cf. pp. 277-278. The original
by its sides. The figure r=[BA) U [BC) divides the
Andronov-Witt paper is [A4].
plane a into two adjoint regions a;, i = 1, 2:
References
a'l U a~ =a \ r. The region ai =a; U r, i = 1,2, is also
[A1] HAHN, W.: Stability of motion, Springer, 1967, 422.
[A2] HIRSCH, M. and SMALE, S.: Differential equations, dynamiC called an angle or a plane angle, a; is called the interior
systems and linear algebra, Acad. Press, 1974. domain of the plane angle ai' The regions (angles)
[A3] CODDINGTON. E.A. and LEVINSON, M.: Theory of ordinary
aI, a2 are called supplementary.
differential equations, McGraw-Hili, 1955, 323.
[A4] ANDRONOV. A.A. and WITT, A. 'Zur Stabilitat nach Two angles are called equal (or congruent) if they can
Liapounoy', Physikal. Z. Sowjetunion 4 (1933),606-608. be superposed so that corresponding sides and vertices
AMS 1980 Subject Classification: 34020, 34C25 coincides. For every ray in the plane and a given side
of it one can construct a unique angle equal to a given
ANGER FUNCfION - The function one. The comparison of two angles can be realized in

f
two ways. If the angle is considered as a pair of rays
Jp(z) = 1- cos(v</> - z sin </ d</>, (*) with a common origin then to answer the question
7T 0
which of the two angles is the greater it is necessary to
which satisfies the inhomogeneous Bessel equation: superpose in the given plane the vertices and one of
2 I . each pair of sides (see Fig. I). If the second side of an
rv- "+ZV
7" ,
+(z -,r-)"
7
= -(z-v)smV7T.
. ~ 11
angle comes to lie inside the second angle, one says
For integers v=n, Jv(z)=Jn(z) is the Bessel function of that the first angle is smaller than the second.
order n (cf. Bessel functions). For non-integer v the fol- A second method of comparing angles consists in
lowing expansion is valid: assigning to each angle a certain number. Equal angles

200
ANGULAR BOUNDARY VALUE

In practical problems it is convenient to consider


angles as the measure of rotation of a fixed ray round
its origin to a given position. Depending on the direc-
tion of the rotation, the angle may in this case be con-
sidered as positive or negative. Thus, an angle in this
Fig. I.
sense can have as value any real number. The angle as
will have the same number of degrees or radians (see a measure of the rotation of a ray is considered in the
below), greater angles a greater and smaller angles a theory of trigonometric functions: For any value of the
smaller number. argument (an angle) one can define a value for the tri-
Two angles are called supplementary if they have gonometric functions. The concept of an angle in
their vertex and one side is common, while the other geometrical systems founded on the axiomatics of posi-
two sides form a straight line (see Fig. 2). tions of point vectors is radically different from the
definition of angles as figures - in such axiomatics by

L
an angle one understands a well-defined metric quan-
tity connected with two vectors by means of their scalar
product. Thus, each pair of vectors a, b defines a cer-
---
[ 0 8 tain angle cp - the number given by the vector formula
Fig. 2. _ (a, b)
cos</> - I a 1'1 b I '
Generally, angles having a common vertex and one where (a, b) is the scalar product of the vectors.
common side are called a4joining. Two angles are called The concept of an angle as a plane figure and as a
vertical if the sides of one are the prolongations through certain numerical quantity is applied in different
the vertex of the sides of the other. Vertical angles are geometrical problems in which the angle plays a special
equal to each other. Angles the sides of which form a role. Thus, by the angle between two intersecting curves
straight line are called straight. Half a straight angle is having a well-defined tangent in their point of intersec-
called a right angle. A right angle may equivalently be tion one understands the angle formed by these
defined as follows: An angle equal to its supplement is tangents.
called right. The interior domain of a plane angle that By the angle between a straight line and a plane one
is not greater then a straight angle is called a convex understands the angle formed by the straight line and
domain in the plane. its orthogonal projection on the plane; its value lies
As the unit of measurement of an angle one uses the between 0 and 90. By the angle between two crossing
90-th part of a right angle, called a degree. The cyclic lines one understands the angle between the directions
or radian measure of an angle is also used. The numeri- of these lines, i.e. between two lines parallel to the
cal value of the radian measure of an angle is the given crossing lines and passing through a given point.
length of the arc cut out by the sides of the angle on A solid angle is a part of space bounding a certain
the unit circle. One radian is assigned to the angle the conical surface; a particular case of a solid angle is a
arc of which has length equal to the radius. A straight polyhedral angle.
angle is equal to 1T radians. In higher-dimensional geometry one defines the angle
By intersecting two straight lines lying in a plane between multi-dimensional planes, between straight
with a third there are formed angles (see Fig. 3): 1 and lines and planes, etc. Angles between straight lines,
5, 2 and 6, 4 and 8, 3 and 7; these are called between planes and straight lines, and between higher-
corresponding angles; 2 and 5, 3 and 8 are called interior dimensional planes are also defined in non-Euclidean
angles on the same side; 1 and 6, 4 and 7 are called spaces.
exterior angles on the same side; 3 and 5, 2 and 8 are LA. Sidorov
called interior opposite angles; and I and 7, 4 and 6 are Editorial comments.
called exterior opposite angles. References
[A1] GREENBERG, M.J.: Euclidean and non-Euclidean geometries,
W.H. Freeman and Co., 1974.

AMS 1980 Subject Classification: 51 NXX

ANGULAR BOUNDARY VALUE, boundary value


along a non-tangential path - The value associated to a
Fig. 3. complex function !(x) defined in the unit disc

201
ANGULAR BOUNDARY VALUE

D={ZEC: I Z I <1} at a boundary point ~=eill, equal which is the commutator subgroup of the centralizer of
to the limit limf(z) =
zeS
rm a maximal k-split torus SCG; D=[Zc(S),Zc(S)]. The
anisotropic kernel D is a semi-simple anisotropic group
z--+\
defined over k; rank D = rank G - rankk G. The con-
of f(z) on the set of points of the angular domain cept of the anisotropic kernel plays an important role
in the study of the k-structure of G [I]. If D = G, i.e. if
S(r,t:) = {z=re;<PED: I arg(eiO-z) 1<; -t:}
rankk G = 0, then G is anisotropic over k; if D = (e), the
under the condition that this limit exists for all t:, group G is called quasi-split over k.
0<t:<'IT /2, and hence does not depend on t:. The term References
is sometimes applied in a more general sense to func- [I] TITS, J.: 'Classification of algebraic simple groups', Proc.
Symp. Pure Math., Vol. 9, Arner. Math. Soc., 1966, 33-62.
tions f(z) given in an arbitrary (including a higher- [2] BOREL, A. and TITS, J.: 'Groupes roouctifs', Publ. Math. IHES
dimensional) domain D; for S(~, t:) one takes the inter- 27 (1965),55-150.
VP. Platonov
section with D of an angular (or conical) domain with
vertex ~EaD, with axis normal to the boundary at ~ aD AMS 1980 Subject Classification: 20GXX
and with angle 'IT /2-t:, 0<t:<'IT /2.
ANNIHILATION OPERATORS - A family of closed
References
[I] MARKUSHEVICH, A.I.: Theory offunctions of a complex variable,
linear operators {a(j): fEH}, where H is some Hilbert
1-2, Chelsea, 1977 (translated from the Russian). space, acting on a Fock space constructed from H (i.e.
[2) PIuwALOW, I.I. [1.1. PRIVALov): Randeigenschaften analytischer on the symmetrization P(H) or anti-symmetrization
Funktionen, Deutsch. Verlag Wissenschaft., 1956 (translated
ra(H) of the space of tensors over H) such that on the
from the Russian).
E.D. Solomentsev vector (j1 ... f,,)a Ef"(H), O'.=S, a, consisting of
Editorial comments. An angular boundary value is also the symmetrized (O'.=s) or anti-symmetrized (O'.=a) ten-
called a non-tangential boundary value. Cf. Boundary pro- sor product of a sequence of elements fl' ... ,f" EH,
perties of analytic functions. n = I, 2, ... , in H, they are given by the formulas:
AMS 1980 Subject Classification: 30D40 a(j)(fl 0 ... 0 j,,)s = (1)

ANHARMONIC RATIO - A synonym for double (f,;;)(j10 ... 0;;-1 0 ;;+1 0 ... 0f,,),.
ratio or cross-ratio.
in the symmetric case, and
AMS 1980 Subject Classification: 51 N15
a (j)(jl 0 ... 0 f"L = (2)
ANISOTROPIC GROUP over a field k - A linear
algebraic group G defined over k and of k-rank zero, i=i
i.e. not containig non-trivial k-split tori (cf. Splittable in the anti-symmetric case; the empty vector ~Era(H),
group). Classical examples of anisotropic groups include O'.=s, a (i.e. the unit vector in the subspace of constants
the orthogonal groups of quadratic forms that do not in ra(H is mapped to zero by a(j). In these formulas
vanish over k; and algebraic groups of elements of (".) is the inner product in H. The operators
reduced norm one in division algebras over k. If G is
{a"(j): fEH} dual to the operators a(j) are called
semi-simple, and if the characteristic of k is zero, then
creation operators; their action on the vectors
G is anisotropic over k if and only if Gk contains non-
(jl'" f,,)a, O'.=s,a, n=1,2, ... , is given by the
trivial unipotent elements. (For the field of real
formulas
numbers or the field of p-adic numbers this is
equivalent to saying that Gk is compact.) The classifica- a'(f)(f10'" 0f,,)a = (f0f I 0 0fn)a (3)
tion of arbitrary semi-simple groups over the field k and
a'(j)fJ. =f
reduces essentially to the classification of anisotropic
groups. As a consequence of these definitions, for each n >0
the subspace r~(H);fn, O'.=S, a, the symmetrized or
References
[IJ BOREL. A.: Linear algehraic r,roups, Benjamin. 1969. anti-symmetrized n-th tensor power of H, is mapped by
[2J TITS, 1.: 'Classification of algebraic simple groups'. Pmc. a(j) into r~ _I and by a" (j) into r~ + 1(H).
Symp. Pure Math .. Vol. 9. Amer. Math. Soc .. 1966. 33-62. In quantum physics, the Fock space ra(H), 0'.=.1', a,
V. P. Platonov is interpreted as the state space of a system consisting
AMS 1980 Subject Classification 20GXX . of an arbitrary (finite) number of identical quantum
particles, the space H is the state space of a single par-
ANISOTROPIC KERNEL - The subgroup D of a ticle. the subspace r~(H) corresponds to the states of
semi-simple algebraic group G, defined over a fieJd k, the system with n particles, i.e. states in which there are

202
Al':Tl-COMMCTATIVE ALGEBRA

just n particles. A state with n particles is mapped by where


a(j) to a state with n - 1 particles ('annihilation' of a
particle), and by a * (j) to a state with n + 1 particles References
(,creation' of a particle). [I] BEREZIN, F.A.: The method of second quantization, Moscow,
1965 (in Russian).
The operators a(j) and a * (j) form irreducible fami-
[2] DOBRUSHIN, R.L. and MINLOS, R.A.: Uspekhi Mat. Nauk 32,
lies of operators satisfying the following permutation no. 2 (1977), 67-122.
relations: In the symmetric case (the commutation rela- [3] GARDING, L. and WIGHTMAN, A.: Proc. Nat. A cad. Sci. U.S.A.
tions) 40, no. 7 (1954), 617-626.
R.A. Minlos
a(jl)a(!2)-a(j2)a(jl) = (4) Editorial comments.
= a'(jI)a'(j2)-a'(j2)a'(jI) = 0, References
[A1] GLIMM, J. and JAFFE, A.: Quantum physics, Springer, 1981.
and in the anti-symmetric case (the anti-commutation [A2] BOGOLYUBov, N.N., LOGUNOV, A.A. and TODOROV, I.T.:
relations) Introduction to axiomatic quantum field theory, Benjamin,
1975 (translated from the Russian).
a(jl)a(j2)+a(j2)a(jl) = (5)
[A3] BOER, J. DE: 'Construction operator formalism in many parti-
= a'(jI)a'(j2)+a'(j2)a'(jI) = 0, cle systems', In J. de Boer and G.E. Uhlenbeck (eds.): Stu-
dies in statistical mechanics, North-Holland, 1965.
where E is the identity operator in P(H) or rU(H).
AMS 1980 Subject Classification: 81 XX, 81 D05
Besides the families of operators a(j) and a * (j), f E H,
described here, there exist in the case of an infinite- ANNIHILATOR, left, of a set X in R - The set B,(X)
dimensional space H also other irreducible representa- of all elementsy in R such thatyX=O. Here R is a ring
tions of the commutation and anti-commutation rela- or a semi-group (or, generally, a groupoid) with a zero.
tions (4) and (5), not equivalent to those given above. The right annihilator of a set X in R is defined in a
Sometimes they are also called creation and annihila- similar manner as the set
tion operators. In the case of a finite-dimensional space
3rCx) = {zER: Xz=O}.
H, all irreducible representations of the commutation
The set
or anti-commutation relations are unitarily equivalent.
The operators {alf),a*(j):fEH} are in many con-
3(X) = 3,(X) n ~:lr<X)
nections convenient 'generators' in the set of all linear is the two-sided annihilator of X. In an associative ring
operators acting in the space ra(H), o:=s, a, and the (or semi-group) R the left annihilator of an arbitrary
representation of such operators as the sum of arbitrary set X is a left ideal, and if X is a left ideal of R, then
creation and annihilation operators (the normal form of 8,(X) is a two-sided ideal of R; in the non-associative
an operator) is very useful in applications. The connec- case these statements are usually not true.
tion with this formalism bears the name method of K.A. Zhevlakov
second quantization, cf. [1 J. AMS 1980 Subject Classification: 20MXX, 13-XX, 16-
In the particular, but for applications important, case XX
in which H=L 2(R", d"x), 1'= 1,2, ... (or in a more
general case H=L 2 (M, Q), where (M, Q) is a measure ANNULAR DOMAIN - I) A doubly-connected
space), the family of operators planar domain between two closed Jordan curves
{a(j), a'lf): fEL2(R", d"x)} defines two operator- without common points, one of each encloses the other.
valued generalized functions a(x) and a'(x) such that 2) For an annular domain with respect to a quadratic
differential, see Global structure of trajectories of a qua-
a(j) = /a(x)f(x)d"x, a'(j) = /a'(x)f(x)dPx. dratic differential.
E.D. Solomentsev
The introduction of a(x) and a * (x) turns out to be
AMS 1980 Subject Classification: 54FXX, 53A04, 30-
convenient for the formalism of second quantization
XX
(e.g. it allows one directly to consider operators of the
form ANTI-COMMUTATIVE ALGEBRA - A linear algebra
over a field in which the identity
(*)
Xa(YI) ... a(Yn)d"xl ... d"xn d"yl ... d"Ym, n, m = 1, 2, ... ,
is valid. If the characteristic of the field differs from 2,
where K(xJ, ... ,xn;YJ, ... ,Ym) is a certain the identity (*) is equivalent with the identity
'sufficiently-good' function), without having to recourse xy = - yx. All subalgebras of a free anti-commutative
to their decomposition as a series in the monomials algebra are free. The most important varieties of anti-
a'(jd' .. a'(j,,)a(gl) ... a(gm), commutative algebras are Lie algebras, Mal'tsev alge-

203
ANTI -COMMUTATIVE ALGEBRA

bras and binary Lie algebras (cf. Lie algebra; Binary a ring A into a ring B that is an isomorphism of the
Lie algebra; MaI'tsevalgebra). additive group of A onto the additive group of B and
References for which (ab)<p=b<p'a<p (a, bEA).
O.A. Ivanova
[I] SHIRSHOV, A.I.: 'Subalgebras of free commutative and free
anti-commutative algebras', Mat. Sb. 34 (76), no. 1 (1954), 81- AMS 1980 Subject Classification: 13-01, 16AXX,
88 (in Russian). AT G 17-01
.. amov
W

AMS 1980 Subject Classification: 17-01, 15A69 ANTI-MOTION - A transformation of a pseudo-


Euclidean space which converts points at a real dis-
ANTI-CONFORMAL MAPPING, conformal mapping tance a from each other into points at a purely-
of the second kind - A continuous mapping w =f (z) of imaginary distance a. The motions and anti-motions of
a neighbourhood of a point Zo of the complex z-plane a pseudo-Euclidean space together form a group.
onto a neighbourhood of a point Wo of the complex w- A.B. Ivanov
plane which preserves the angles between the curves
AMS 1980 Subject Classification: 51 N25
passing through z 0 but changes the orientation. A func-
tion f (z) which produces an anti-conformal mapping is
ANTI-PARALLEL STRAIGHT LINES with respect to
an anti-holomorphic function. See also Conformal map-
two given lines m I and m2 - Two straight lines II and
ping.
12 which intersect m I and m 2 so that L I = L 2 (cf.
References Fig.).
[1] MARKUSHEVICH, A.I.: Theory offunctions of a complex variable,
1, Chelsea, 1977, Chapt. 2 (translated from the Russian).
E.D. Solomentsev
AMS 1980 Subject Classification: 30C35

ANTI-DISCRETE SPACE - A topological space in


which only the empty set and the entire space are open.
If II and 12 are anti-parallel with respect to ml and
A.A. Mal'tsev
m2, then m I and m2 are also anti-parallel with respect
Editorial comments. Other frequently occurring names to II and 12 , In any quadrilateral inscribed in a circle,
for this topological space are indiscrete space and trivial any two opposite sides are anti-parallel with respect to
topological space.
the other two sides. If the lines m I and m 2 intersect at
AMS 1980 Subject Classification: 54AXX a point 0, one also says that II and 12 are anti-parallel
with respect to the angle m I Om 2. If the lines m I and
ANTI-DISCRETE TOPOLOGY on a set - The topol-
m 2 coincide, II and 12 are said to be anti-parallel with
ogy defined by the open base consisting of the empty
set and of the entire space. respect to a straight line. A.B. Ivanov
A.A. Mal'tsev
AMS 1980 Subject Classification: 51 N05
AMS 1980 Subject Classification: 54AXX
ANTI-PARALLELOGRAM - A plane quadrangle in
ANTI-HOLOMORPHIC FUNCTION, anti-analytic
which each pair of opposite sides is equal and anti-
function - A function f (z) = u + iv of one or more com- parallel to the other two sides (cf. Anti-parallel straight
plex variables z = (z 1> ,zn) that is the complex con- lines).
jugate of a holomorphic function f (z) = u - iv (cf. Ana- A.B. Ivanov
lytic function). AMS 1980 Subject Classification: 51 M15, 51 M20
E.D. Solomentsev
AMS 1980 Subject Classification: 30-01 ANTI-PRISM - A semi-regular polyhedron (cf.
Semi-regular polyhedra) in which two parallel faces are
ANTI-ISOMORPHISM OF PARTIALLY ORDERED congruent regular n-gons, while the remaining 2n faces
SETS - A bijective anti tone mapping of a partially are regular triangles (d. Fig.).
ordered set A into a partially ordered set B, i.e. a one-
to-one mapping <p: A --')B such that a <b (a, b EA)
implies a<p>b<p in B.
o.A. Ivanova
AMS 1980 Subject Classification: 06AXX
A.B. Ivanov
ANTI-ISOMORPHISM OF RINGS - A mapping <p of AMS 1980 Subject Classification: 51 M20

204
ANTINOMY

ANTI-SYMMETRIC TENSOR - See Tensor calculus. arrive at the point B and will never overtake the tor-
AMS 1980 Subject Classification: 15A69, 53A45 toise.
Paradoxes of this type are easily solved by methods
ANTILOGARITHM of a number n - The number N, of the modem mathematical theory of motion. A
denoted by ant log" n, the logarithm of which to the detailed analysis shows that such solutions materially
base a is equal to n. Thus, depend on the truth of the so-called Archimedean axiom
in the field of real numbers: For any real numbers
ant log" n = N = an,
a, b >0 there exists a natural number n such that
or an >b. Nevertheless, the problem illustrated by the
log" N = n.
paradox is very real. It is possible to deny the conveni-
Antilogarithms are also called inverse logarithms.
ence or the adequacy of the ordinary mathematical
AMS 1980 Subject Classification: 26A09 model as a description of real motion. In order to
describe the concepts of infinitely small and infinitely
ANTINOMY, paradox - A situation in which two large physical magnitudes, it was repeatedly attempted
mutual contradictory statements are demonstrated, to construct a theory of real numbers in which the
each one having been deduced by means that are con- Archimedean axiom would not be valid. In any case,
vincing from the point of view of the same theory. the theory of non-Archimedean ordered fields forms an
As distinct from a sophism, which is a deliberately essential part of modem algebra. Such non-
presented erroneous conclusion containing a masked Archimedean ordered fields - the non-standard real
error, an antinomy is usually an indication of deeper straight line - play a determining role in non-standard
deficiencies of the theory in question. The discovery of analysis.
an antinomy often leads to a thorough revision of the The 'sandpile paradox' may be stated as follows.
theory as a whole, attracts attention to new effects, and Clearly, one grain of sand does not form a sandpile. If
ultimately stimulates further studies. This feature of n grains of sand still do not form a sandpile, it follows
antinomies attracted the interest of philosophers ever that they will not form a pile after another grain of
since ancient times. One may recall, for example, the sand has been added. Accordingly, no number of grains
important role played by antinomies in the philosophy of sand can form a sandpile.
of E. Kant. A number of antinomies were studied in This paradox can be solved in modern terms by say-
antiquity under the name aporium. Here, two famous ing that the method of complete mathematical induc-
antinomies, named after Zeno of Elea (5-th century tion cannot be applied to undefined volumes such as a
B.C.) will be quoted. 'pile of sand grains'. In modern mathematics (second
'Achilles and the tortoise'. This aporium deals with the half of the 20-th century) indefinite volumes are
contradiction involved in certain attributes of motion, employed in the foundations of mathematics to estab-
and may be stated as follows. A runner (Achilles) is lish the consistency of classical theories, and the pro-
standing at a point A, and a tortoise is standing at a perties of such concepts are studied by exact methods.
point B, 100 meters away from A. At the same moment It is possible to use methods of mathematical logic to
of time Achilles begins to run from A towards B, while realize a situation in which mathematical induction in
the tortoise begins to move away from B in the direc- its general form is not applicable to any natural series,
tion away from A at a speed which is, say, 100 times even though very many ordinary properties of natural
less than that of Achilles. Experience teaches us that in numbers remain valid in the process (the so-called
such a situation Achilles will overtake the tortoise after non-standard model of arithmetic).
a fairly short period of time. On the other hand, it is However, it is the antinomies involving unusual for-
also possible to conclude that Achilles will never over- mulations of concepts which are undoubtedly of
take the tortoise, and will not even reach the point B. greatest interest to mathematics. These are the so-called
In fact, at the moment when Achilles has reached the logical and semantic antinomies (see also Skolem para-
midpoint C I of the distance AB, the tortoise will have dox).
moved away from B, though only by a short distance. Below three examples of logical antinomies are
Next, Achilles will reach the midpoint C 2 of C I B, then presented.
the midpoint C 3 of C 2B, etc. Throughout that time the Russell's paradox (B. Russel, 1902), also indepen-
tortoise keeps moving away from B. In order to reach dently discovered by E. Zermelo. Consider the follow-
B, Achilles must have been present at each point in the ing property D of sets. Assume that a set X has the
infinite sequence of points C I, C 2 , .... However, it is property D if and only if it is not an element of itself.
impossible to be present at an infinite number of points Property D is valid for a very large majority of specific
in a finite period of time; therefore, Achilles will never sets used in mathematical considerations. E.g., neither

205
ANTINOMY

the set of natural numbers nor the set of all real paradox is known as the Gonseth paradox.) It is the
numbers are elements of themselves. Consider the set T case of a village barber who shaves those and only
such that its elements are exactly those sets X that have those inhabitants of his village who do not shave them-
the property D. Now consider whether T E T or T f T is selves. Does the barber shave himself? A reasoning
true. If TET then, by definition, T has the property D, similar to that given in the context of Russell's paradox
i.e. T f T. Thus, one must postulate T f T. But this shows that he both shaves and does not shave himself.
again results in a contradiction, since if T f T, T has This difficulty is easily overcome if one concludes that
the property D, so that TET. there cannot be such a barber. The condition to be
The paradox can be avoided by postulating that all satisfied by the barber is self-contradictory and there-
that has been proved in this way is that there is no fore cannot be satisfied. Admittedly, the problem raises
such set T, i.e. that the property D does not character- the question of criteria of self-consistent properties but,
ize any set at all. However, this cannot be accepted, unlike in Russell's paradox, this is by no means a burn-
since it is natural to suppose, from the point of view of ing problem. It refers to a real-life situation, and most
'naive' set theory, that any exactly-described property B such situations are never exactly formulated or reliably
of objects defines a set C of objects which possess this defined. Moreover, self-consistency is not the only cri-
property. Russell's paradox is a strong blow to this terion (nor probably even the most important one) of
natural concept. One must conclude that certain very acceptability of real-life reasoning. The situation is alto-
simple properties must either be considered as impre- gether different in mathematical reasoning, which is
cisely described, or else that there exist exactly- supposed to be much more precisely defined and more
described properties which do not define a set. This in reliable; self-consistency forms an important part of
tum gives rise to several difficult questions. What are such a reasoning.
'exactly' and what are 'inexactly' described properties? Cantor's paradox (G. Cantor, 1899). Let M be the set
Which properties define a set and which do not? May of all sets and let P (M) be the set of all its subsets. It
not other properties which are extensively used in set is clear from the definition of M that P (M) is included
theory also lead to contradictions and have to be dis- in M. On the other hand, in accordance with a well-
carded? Is it at least possible to reliably specify a known theorem of Cantor, the cardinality of P (M) is
domain which is adequately protected from paradoxes, larger than that of M, and for this reason P(M) cannot
and which is nevertheless sufficiently large to include be a subset of M. The conclusions which follow from
conventional mathematical practice? Cantor's paradox are roughly the same as those follow-
With the construction of exact logico-mathematical ing from Russell's paradox. In particular, it may be
languages, the problem of the exact description of pro- thought that Cantor's paradox proves that the set M of
perties may be considered as being satisfactory solved. all sets does not exist. It is interesting to note in this
However, the problem of description of criteria estab- connection that there exist axiomatic systems of set
lishing the class of properties defining a set is still a theory, such as the well-known 'New Foundations' sys-
question that is far from solved. Even worse, modem tem of W. Quine, which establish the existence of M.
results in axiomatic set theory indicate that there is no Cantor's paradox can be avoided in the 'New Founda-
final solution to this problem. Russell's paradox made tions' owing to the fact that Cantor's cardinality
a great impression on his contemporaries since it con- theorem can be proved in this system only in a specific
cerned the very first stage in the study of set theory. form, which is insufficient to 'prove' the paradox. In
Nevertheless, there exist various ways of avoiding para- general, the concepts of set theory required to construct
doxes which, though hardly ultimate or natural, are Cantor's paradox are much more involved than those
very convenient in practice and throw light both on the needed to construct Russell's paradox (such as the con-
nature of the paradox itself and on logical connections cept of a subset, the idea of a one-to-one correspon-
of other principles of set theory. The axiomatic dence, etc.).
approach to the fundamental principles of set theory A paradox which resembles Cantor's paradox is the
(cf. Axiomatic set theory) proved particularly success- so-called Burali - Forti paradox, which deals with the
ful. Thus, the formal axiomatic system of class of all ordinal numbers. The order type of this
Zermelo - Fraenkel is at the time of writing (1970's) a class must be larger than all the ordinals contained in
most useful axiomatic theory, which gives the most ade- it.
quate description of the 'non-paradoxical' part of the Antinomies of a somewhat different type, the so-
practice of classical set theory. called semantic antinomies, are also of major interest.
The paradox of the 'village barber'. This is a variant As distinct from logical antinomies, semantic
of Russell's paradox, formulated in the guise of an antinomies include terms such as 'truth', 'falsehood',
everyday-life situation. (A slightly-different form of this 'denotes', 'defines' and others. Otherwise, the difference

206
APOLAR ]\;ETS

between the two types of paradoxes, which was pro- For any continuous mapping of the sphere sn into the
posed by F. Ramsey in 1926, is largely arbitrary. Euclidean space En there exist antipodes with a com-
Many semantic antinomies may be stated in a logical mon imag~; 2) Any mapping of the sphere sn into
form and vice versa. It was noted by Ramsey that itself in which the images of antipodes are antipodes is
semantic antinomies cannot be constructed in ordinary an essential mapping.
logico-mathematical theories for the simple reason that References
such theories do not contain the semantic concepts [II BORSUK, K.: 'Drei Siitze tiber die n -dimensionale euklidische
which are required in the formulation of semantic Sphare', Fund. Math. 20 (1933), 177-190. A V Ch k'-
.. ernavs II
antinomies. In this sense, semantic antinomies are
Editorial comments. The first result mentioned above is
'safer' than logical antinomies. It should be noted, how-
known as the Borsuk-Ulam theorem (on antipodes). The
ever, that there are modern theories containing con- following result also goes by the name of Borsuk's antipodal
cepts for the formulation of certain semantic paradoxes, theorem: There is no continuous mapping f of the (n + 1)-
in particular Richard's paradoxes (but such paradoxes ball B n +1 into the n-sphere sn such that f(x) = - f( - x), cf.
can always be avoided by using special devices). Below [A1],p.131.
two well-known semantic paradoxes are stated.
References
Richard's paradox in the formulation of G.D.W. [A 1] ISTRATEscu, V.l.: Fixed point theory, Reidel, 1981.
Berry (1906). Consider the set of natural numbers each
AMS 1980 Subject Classification: 55M20
one of which can be uniquely defined by a meaningful
text containing not more than 1000 syllables. Obvi- ANTITONE MAPPING of partially ordered sets - A
ously, the number of such texts is finite, since the col- mapping cf> of a partially ordered set A into a partially
lection of all texts with 1000 syllables is finite. Consider ordered set B such that if a ~b (a, b EA), then acf>?3bcf>.
the smallest natural number that is not a member of The dual concept to an anti tone mapping is an isotone
the set defined above. mapping.
The above paragraph is a meaningful text, containing o.A. Ivanova
not more than 1000 syllables, which uniquely deter- AMS 1980 Subject Classification: 06A 10
mines some natural number which, by definition, can-
not be characterized by a set of this type. Obviously, APERIODIC AUTOMORPHISM of a measure space -
this paradox can be avoided if this text is declared to An automorphism T of a measure space such that its
be not meaningful (or to be a text which does not periodic points, i.e. the points x for which Tk x =x for
define a natural number), but in such a case, as before, some k >0, form a set of measure zero. The introduc-
there arise difficult problems concerning the criteria for tion of a special name for such transformations is due
a text to be meaningful, etc. to the fact that in certain theorems of ergodic theory
Eubulides' paradox (4-th century B.C.). A person automorphisms with 'too many' periodic points are
says: 'What I am saying now is a lie'. Is this statement considered as trivial exceptions (see [1 D.
true or not? If it is true, its very meaning indicates that References
it is false. If it is false, it immediately follows that it is [1] ROKHLIN, V.A.: 'Selected topics from the metric theory of
true. Many variants of this paradox are known - the dynamical systems', A mer. Math. Soc. Transl. Series 2 49, 171-
240 (Uspekhi Mat. Nauk 4, no.2 (30) (1949), 57-128)
liar's paradox, Epimenides' paradox, etc. The idea of
this paradox forms the base of the proof of the well- D. V Anosov
known COde! incompleteness theorem of formal Editorial comments. The so-called Rokhlin- Halmos
axiomatic theories. lemma for periodic automorphisms is important tor the
In general, analysis of such paradoxes stimulated a approximation of an automorphism of a Lebesgue space by
fundamental review of the views on the foundations of periodic transformations (ct. Approximation by periodiC
mathematics and the development of many modern transformations), see [A 1], p. 75, or [A2], p. 390.
ideas and methods of mathematical logic. References
[A 1] CORNFEL'D, LP., FOMIN, S.V. and SINAl, YA.G.: Ergodic
References theory, Springer, 1982.
[1] FRAENKEL, A.A. and BAR-HILLEL, Y.: Foundations of set
[A2] HALMOS, P.R.: Lectures on ergodic theory, Math. Soc. of
theory, North-Holland, 1958.
Japan, 1956.
[2] KLEENE. S.c.: Introduction to metamathematics, North-Holland,
1951. AMS 1980 Subject Classification: 28-01, 28DXX
A. G. Dragalin
AMS 1980 Subject Classification: OOA25, 03A05 APOLAR NETS - Two nets given in the same
domain G of a two-dimensional manifold, where at
ANTIPODES - Diametrically-opposite points on a each point x E G the tangent directions of one net har-
sphere. Borsuk's antipodal-point theorems apply [1]: 1) monically subdivide the tangent directions of the other.

207
APOLAR NETS

Thus, for instance, the asymptotic net on a surface in a equations, since they were first proposed by l.W. Gibbs
Euclidean space is apolar with respect to the net of cur- [3] for holonomic systems. The Appell equations in
vature lines (cf. Curvature lines, net of). independent Lagrange coordinates qs (s = 1, ... ,n)
VT Bazylev have the form of second-order equations
AMS 1980 Subject Classification: 53A05, 53820 ~ - Q* i= I, ... ,k~n. (1)
3qi - i,

ApOLLONIUS PROBLEM - The problem of con- Here I N


structing, in a given plane, a circle tangent to three S = 2' ~m.w;
given circles. The problem is solved by the method of ,,=)

inversion. The circle representing the solution of this (m. and w. are the masses and the accelerations of the
problem is known as the Apollonius circle. The problem N points of the system) is the energy of acceleration of
was named after Apollonius of Perga (3-rd century the system, which is so expressed that it contains the
B.c.). second derivatives of the coordinates qi' i = 1, ... ,k,
References only, the variations of which are considered as
[I) Encyclopaedia of elementary mathematics, Vol. I, Moscow independent; Q; are the generalized forces correspond-
Leningrad, 19511963 (in Russian).
A.B. Ivanov
ing to the coordinates q;, obtained as coefficients in
front of the independent variations oq; in the expres-
Editorial comments. sion for the work of the given active forces F.
References corresponding to virtual displacements or.:
[A 1) BAKER, H.F.: Principles of geometry, 4, F. Ungar, 1963.
N k
AMS 1980 Subject Classification: 51 M15 ~F.~r. = ~Q; Oqi'
p:::: 1 i=1

APOLLONIUS THEOREM - I) The sum of the In evaluating Sand Q; the dependent variables qj
squares of the lengths of conjugate semi-diameters of (oqj) U= k + 1, ... ,n) are expressed in terms of the
an ellipse is constant and equal to the sum of the independent velocities (variations) by solving the n - k
squares of the lengths of its semi-axes. non-holonomic constraint equations (cf. Non-holonomic
2) The area of a parallelogram circumscribed around systems), expressed in the generalized coordinates qs
an ellipse, with its sides oriented in opposite directions, (and by solving the equations for oqs obtained from
is constant and equal to the product of the lengths of them). Differentiation with respect to the time t of the
its diameters. expressions found for qj yields expressions for qj In
A.B. Ivanov
terms of qi'
Editorial comments. Cf. [A 1] for a higher-dimensional Equations (1), together with the n - k equations of
generalization. the non-integrable constraints, form a system (of order
References n + k) of n differential equations involving the n
[A1) BORSUK, K.: AnalytiC geometry, PWN, 1969.
unknowns qs'
AMS 1980 Subject Classification: 51 N1 0 For a holonomic system k = n, all velocities qs and
variations oqs are independent, Q; = Q;, and equations
ApOTHEM of a regular polygon - The segment (and (1) are a different notation for the Lagrange equations
its length) of a perpendicular dropped from the centre (in mechanics) of the second kind.
of the regular polygon onto any of its sides. The Appell's equations in quasi-coordinates 1'" where
apothem of a regular n-gon is equal to the radius of the
circle inscribed in it and is connected with the side of irr = ~ar,q"
" r=l, ... ,k, (2)
i=1
the polygon, an' and with its surface area Sn by the
relations: have the form

an = 2r" tan!!"', S" = nr;, tan!!.... ~=rr


air ro r = 1, ... , k ~11. (3)
n 11

The apothem of a regular pyramid is the height of its Here S is the energy of acceleration, expressed in
(side) face. terms of the second 'derivatives' 7Tr (with respect to the
AMS 1980 Subject Classification: 51 M15 time) of the quasi-coordinates, and I1r are the
generalized forces corresponding to the quasi-
ApPELL EQUATIONS - Ordinary differentia) equa- coordinates. Equations (3), together with the n - k
tions which describe the motions of both holonomic equations of the non-integrable constraints and the k
and non-holonomic systems, established by P.E. Appell equations (2), form a system of n + k differential equa-
[1]. They are sometimes referred to as Gibbs - Appell tions of the first order with the same number of

20~
APPELL POLYNOMIALS

unknowns q" s=l, ... ,n, and ir" r=l, ... ,k. defined by "
oc A (z)
Appell's equations are the most general equations of A (t)e Zl = ~ _n_,_t n,
n=O n.
motion of mechanical systems.
References An(z) = nAn_\(z), n =],2, ... ,
[I] ApPELL,P.E.: 'Sur une forme generale des equations de la which, apart from normalization, are equivalent to
dynamique', C.R. A cad. Sci. Paris Ser. I Math. 129 (1899).
[2] ApPELL, P.E.: 'Sur une forme generale des equations de la those given above.
dynamique et sur Ie principe de Gauss', J. Reine Angew. Math. Appell polynomials of class A (I) are used to solve
122 (1900), 205-208. equations of the form:
[3] GIBBS, l.W.: 'On the fundamental formula of dynamics', A mer.
A(D)y(z) = f(z). (2)
1. Math. 2 (1879), 49-64.
vV Rumyantsev
The formal equality y(z)=f(z)/ A(D) for
Editorial comments.
OC Ck zk
References fez) = k~Ok!
[A1] WHIITAKER, E.T.: Analytical dynamics, Cambridge Univ.
Press, 1927, 258.
makes it possible to write the solution of (2) III the
AMS 1980 Subject Classification: 70GXX form

APPELL POLYNOMIALS - A class of polynomials


y(z) = ~ckA;(z),
k=O
over the field of complex numbers which contains where {AZ(z)} are Appell polynomials with the gen-
many classical polynomial systems. The Appell polyno- erating function e Zl / A (t). In this connection the
mials were introduced by P.E. Appell [l]. The series of expansion of analytic functions into Appell polynomi-
Appell polynomials {An(z)};'=o is defined by the for- als is of special interest. Appell polynomials also find
mal equality use in various problems connected with functional
equations, including differential equations other than
(I)
n=O (2), in interpolation problems, in approximation theory,
in summation methods, etc. (cf. [1] - [6]). For a more
where A (t)= 2:.'/:=oaktk is a formal power series with
general account of the theory of Appell polynomials of
complex coefficients ak> k =0, 1, ... , and ao==O. The
class A (I), and a number of applications, see [6].
Appell polynomials An(z) have an explicit expression in
The class A (1) con tains, as special cases, a large
terms of the numbers ak as follows:
number of classical sequences of polynomials. Exam-
ples, up to a normalization, are the Bernoulli polynomi-
als
te zt Bn(z)
The condition ao==O is tantamount to saying that the - t- = ~
x;
__ ,_tn;
e -] n =0 n.
degree of the polynomial An(z) is n.
There is another, equivalent, definition of Appell the Hermite polynomials
H (z)
polynomials. Let ezt-t /2
2
= ~
00
_n_ tn .
n ==0 n! '
the Laguerre polynomials
be a differential operator, generally of infinite order,
defined over the algebra P of complex polynomials in n=O
the variable Z = x + ry. Then etc. For many other examples, see [2] and [3], Vol. 3.
There exist various generalizations of Appell polyno-
A ( ) = A(D)zn n =0, I, ... ,
n Z n." mials, which are also known as systems of Appell polyno-
i.e. An(z) is the image of the function zn / n! under the mials. These include the Appell polynomials with gen-
mappingp =A (D)q, p, q EP. erating functions of the form
The class A (1) of Appell polynomials is defined as A(w)ezU(w) =
00

~Pn(z)wn,
the set of all possible systems of polynomials {An(z)} n=O
(3)
with generating functions of the form (1). To say that a
U(w) = ~Ckwk, c\*O,
system {Pn(z)} of polynomials (of degree n) belongs to k=\
the class A (1) amounts to saying that the relationships
and the Appell polynomials with the more general gen-
P~(z)=Pn-\(z), n=],2, ... ,
erating functions:
are valid. OC

The Appell polynomials of class A (I) are sometimes A(w)1/;(zU(w = ~qn(z)wn (4)
n=O

209
APPELL POLYNOMIALS

(see, for example, [2], [3], Vol. 3). If w(U) is the inverse x >0, y>O, x + y < 1; however, they do not form a sys-
function to the function U(w), then the fact that the tem of orthogonal functions with the weight t (x, y) in
system of polynomials {Pn(z)} ~=0 belongs to the class T (see, for example, [3], Vol. 2).
of sequences of Appell polynomials with a generating References
function of type (3) is equivalent to the validity of the [I] ApPELL, P.E.: Ann. Sci. Ecole Norm. Sup. 9 (1880), 119-144.
relations [2] BOAS, R.P. and BUCK, R.C.: Polynomial expansions of analytiC
functions, Springer & Acad. Press (U.S.A. & Canada), 1958.
w(D)Pn(Z)=Pn-I(Z), n=I,2, ... [D=~]. [3] BATEMAN, H. and ERDELYI, A.: Higher transcendental functions,
Vol. 1-3, McGraw-Hill, 1953-1955.
[4] WOOD, B.: 'Generalized Szitsz operators for the approximation
There are only five weighted orthogonal systems of in the complex domain', SIAM J. Appl. Math. 17 (1969), 790-
sequences of Appell polynomials on the real axis with 801.
[5] SZEGO, G.: Orthogonal polynomials, Arner. Math. Soc., 1975.
generating functions of the type (3); these include only [6] BOURBAKI, N.: Elements of mathematics. Functions of a real
one orthogonal system with generating functions of the variable, Addison-Wesley, 1976 (translated from the French).
type (I), which consists of Hermite polynomials with [7] MEIXNER, J.: 'Orthogonale Polynomsysteme mit einer beson-
the weight e - x' /2 on the real axis (cf. [7]). deren Gestalt der erzeugenden Funktion', J. London Math. Soc.
(1) 9 (1934),6-13.
For the expansion in series by Appell polynomials [8] ANDERSON, CH.A.: 'Some properties of Appell-like polynomi-
with generating functions of the types (3) and (4), and als', J. Math. Anal. Appl. 19 (1967),475-491.
interconnections of these polynomials by various func- [9A] KAZ'MIN, YU.A.: 'Expansions in series of Appell polynomials',
Math. Notes 5, no. 5 (1969), 304-311. (Mat. Zametki 5, no. 5
tional equations see [2], [7], [8]. (1969), 509-520)
The class A (P), where p;;,.1 is an integer, of Appell [9B] KAZ'MIN, YU.A.: 'On Appell polynomials', Math. Notes 6, no.
polynomials is defined as follows: It is the set of all 2 (1969), 556-562. (Mat. Zametki 6, no. 2 (1969), 161-172)
[10] ApPELL, P.E.: Arch. Math. Phys. (1) 66 (1881),238-245.
systems of polynomials {An(z)} for each of which the
(formal) representation Yu.A. Kaz'min
Editorial comments. For Appell polynomials in two vari-
ables an explicit biorthogonal system is known. There is
also an explicit system for the weight function (5), consist-
is valid. Here, wp = e 2'1Ti / P, and Ak(t), k = 0, ... ,p - I, ing of products of two Jacobi polynomials and a power. See
are formal power series, the free terms of which are [A1], p. 454, for references.
such that the degree of the polynomial An(z) is n. To References
say that a sequence {Qn(z)} of polynomials of degree n [A 1) KOORNWINDER, T.H.: 'Two-variable analogues of the classi-
belongs to A (P) amounts to saying that the relations cal orthogonal polynomials', in R. Askey (ed.): Theory and
applications of special functions, Acad. Press, 1975, pp.
DPQn(z) = QW)(z) = Qn -p(z), n =P,P + I, ... , 435-495.

are valid. For problems on the expansion of analytic AMS 1980 Subject Classification: 33A70
functions in series by Appell polynomials of class A (P),
see [9]. They are closely connected with the problem of APPELL TRANSFORMATION - A transformation of
finding analytic solutions of functional equations of the the equations of plane motion of a material point under
type p-I the action of a force that varies with the coordinates
~Ak(D)y(zw~) = fez). x, y of the point only. The Appell transformation is a
k =0
homographic transformation of the coordinates x, y into
Appell polynomials in two variables were introduced
the coordinates x I , Y 1 :
by P. Appell [10]. They are defined by the equations:
I-y I-i . ax+by+c a'x+b'y+c'
J m n (lX, Y, Y, , x, Y ) -- x y, (1- X)-
- ,)y+y -a X XI = a"x+b"y+c YI = a"x+b"y+c'"
, (Y)nh)11
which involves the substitution of the time tI for t in
accordance with the formula
dt
m, n =0. I, ... , k dtl = " " "2
(a x+b y+c )
.

III which it IS assumed that (Y)o = I,


As a result of the Appell transformation, the motion of
(Y)n=y(y+I)" '(y+n-l) for n;;,.l; these Appell
a point (x I, Y d, acted upon by a force F I (x 1, Y d that
polynomials are analogues of the Jacobi polynomials.
depends only on the coordinates x l,y I, corresponds to
The Appell polynomials J m.n are orthogonal with the
the motion of the point (x, y) acted upon by the force
weight
(5 ) F(x, y). A transformation of the equations of motion of
to any polynomials in two variables of degree lower a point (x, y) with the aid of formulas of general form
than m + n, over the domain T, where T is the triangle: XI = cp(x.y). YI = l/;(x.y), dt l = ?I.(x.y)dt. (*)

210
APPROXIMATE DERIVATIVE

yields the equations of motion of the point (x 1 ,y 1) In the simplest case, f (x) is a real-valued function of
under the effect of a force that depends only on the the points of an n-dimensional Euclidean space (in gen-
coordinates Xl, Y 1 only if (*) is a homographic eral it is a vector-valued function). The following
transformation. The Appell transformation is used to theorems apply. 1) A real-valued function f (x) is
solve Bertrand's problem on the determination of the Lebesgue-measurable on a set E if and only if it is
forces under which the trajectories of motion of approximately continuous almost-everywhere on E (the
material points become conic sections. The Appell Stepanov- Denjoy theorem). 2) For any bounded
transformation is named after P.E. Appell. Lebesgue-measurable function f (x) one has, at each
References point xo,
[1) ApPELL, P.E.: 'De I'homographie en mecanique', Amer. J. ~~ M(k) [f(x)d JL = f(xo),
Math. 12 (1890), 103-114.
(2) ApPELL, P.E.: 'Sur les lois de forces centrales faisant decrire a where JL is the n-dimensional Lebesgue measure, R is an
leur point d'application une conique queUes que soient les con- n-dimensional non-degenerate segment containing xo,
ditions initiales', Amer. J. Math. 13 (1891), 153-158.
and p is its diameter.
L.N. Sretenskir
References
AMS 1980 Subject Classification: 34AXX [1) SAKS, S.: Theory of the integral, Hafner, 1952 (translated from
the Polish).
G.P. To/stov
APPLICATE, z-AXIS - One of the Cartesian coordi-
Editorial comments. For other references see Approxi-
nates of a point in the three-dimensional space.
mate limit.
AMS 1980 Subject Classification: 51 N20
AMS 1980 Subject Classification: 26A 15

APPROXIMATE COMPACfNESS - A property of a APPROXIMATE DERIVATIVE - A generalization of


set M in a metric space X requiring that for any x EX, the concept of a derivative, where the ordinary limit is
every minimizing sequence Yn EM (i.e. a sequence with replaced by an approximate limit. If, for a function
the property p(x, Yn)~P(x, M)) has a limit point Y EM. f (x) of a real variable x, the limit
Approximate compactness of a given set ensures the . f(x)- f(xo)
existence of an element of best approximation for any lim ap "-'--'---=---
X-Xo x-xu
x EX. The concept of approximate compactness was
exists, then it is called the approximate derivative of the
introduced [I] in connection with the study of Che-
functionf(x) at Xo and is denoted by f:p (xo). In the
byshev sets (d. Chebyshev set) in a Banach space,
simplest case f (x) is a real-valued function; in general
which made it possible to describe convex Chebyshev
it is a vector-valued function. An approximate deriva-
sets in certain spaces. In fact, let X be a uniformly-
tive may be finite or infinite. The classical rules for the
convex smooth Banach space. For a Chebyshev set differentiation of a sum, a difference, a product, and a
M eX to be convex, it is necessary and sufficient that quotient of functions apply to finite approximate
it be approximately compact. It follows, in particular, derivatives; the theorem on the differentiation of a
that the set of rational fractions with fixed degrees of
composite function does not apply in general. The con-
the numerator and the denominator is not a Chebyshev
cept of an approximate derivative was introduced by
set in the space Lp (1 <p < 00) if the degree of the
A.Ya. Khinchin in 1916.
denominator is not smaller than one [I].
Approximate Dini derivatives are defined by analogy
For subsequent studies on this subject see [2].
with ordinary Dini derivatives (d. Dini derivative):
References Ad(XO) - lim sup from the right; Ad(XO) - lim inf
[Il EFIMOV, N.V. and STECHKIN, S.B.: 'Approximative compact-
ness and Cebysev sets', Soviet Math. Dok/. 2, no. 5 (1961),
from the right; Ag(xo) - lim sup from the left; Ag(XO)
522-524. (Dok/. Akad. Nauk SSSR 140, no. 3 (1961), 522-534) - lim inf from the left; for example,
[2) GARKAVI, A.L.: 'The theory of best approximation in normed
A d (Xo ) -- lim f(x)- f(xo)
linear spaces', Itogi Nauk. Mat. Anal. 1967 (1969), 75-132 (in ap .
X-+Xo. X-Xo
Russian). v
.lu. N. SUbbotm
. X>Xo

AMS 1980 Subject Classification: 41 A65, 54D99 The following Denjoy- Khinchin theorems apply. If a
real-valued function f (x) is finite and Lebesgue-
APPROXIMATE CONTINUITY - A generalization of measurable on a set E, then, at almost any point of this
the concept of continuity in which the ordinary limit is set, either f (x) has a finite approximate derivative, or
replaced by an approximate limit. A function f (x) is Ad(X) = Ag(x) = + 00, AAx) = Ag(X) = - 00.
called approximately continuous at a point Xo if
If x
lim apf(x)
X---JoXo
= f(xo}. F(x) = jf(t)dt
a

211
APPROXIMATE DERIVATIVE

is a Denjoy- Khinchin integral, then F~p (x)= f(x) Editorial comments. For other references see Approxi-
almost-everywhere in the interval under consideration mate limit.
(an ordinary derivative need not exist on a set of posi- AMS 1980 Subject Classification: 26A24
tive measure). This theorem explains the role played by
approximate derivatives in the theory of integrals. APPROXIMATE LIMIT - A limit of a function f (x)
There exist continuous functions without an ordinary as X-Ho over a set E for which Xo is a density point.
or an approximate derivative at any point of a given In the simplest case f (x) is a real-valued function of
interval. the points of an n-dimensional Euclidean space; in the
Approximate partial derivatives of functions of more general case it is a vector function. The approxi-
several real variables are considered as well. mate limit is denoted by
References
[II SAKS, S.: Theory of the integral, Hafner, 1952 (translated from
lim apf(x).
X-Xo
the Polish).
G.P. Tolstov In general, the existence of an ordinary limit does not
Editorial comments. For other references see Approxi- follow from the existence of an approximate limit. An
mate limit. approximate limit displays the elementary properties of
AMS 1980 Subject Classification: 26A24 limits - uniqueness, and theorems on the limit of a
sum, difference, product and quotient of two functions.
APPROXIMATE DIFFERENTIABILITY - A general- Let Xo be a density point of the domain of definition
ization of the concept of differentiability obtained by of a real-valued function f (x). If the ordinary limit
replacing the ordinary limit by an approximate limit. A limx - Ho f (x) exists, the approximate limit also exists
real-valued function f (x) of a real variable is called and is equal to it. The approximate upper limit of a
approximately differentiable at a point x 0 if there exists function f(x) at a point Xo is the lower bound of the
a number A such that set of numbers y (including y = + (0) for which x 0 is a
. f(x)-f(xo)-A(x-xo) point of dispersion of the set {x:f(xy}. Similarly,
hm ap = O.
X--Ho X - Xo the approximate lower limit of a function f (x) at a
The magnitude A (x - xo) is called the approximate dif- point Xo is the upper bound of the set of points y
ferential of f(x) at Xo. A function f(x) is approxi- (including y = - (0) for which x 0 is a point of disper-
mately differentiable at a point x 0 if and only if it has sion of the set {x: f (x )<y}. These approximate limits
an approximate derivative lap (x 0) = A at this point. are denoted, respectively, by
Approximate differentiability of real functions of n real lim ap f(x) and lim ap f(x).
variables is defined in a similar manner. For example, X-Xo X-Xo

for n = 2, f (x, y) is called approximately differentiable at An approximate limit exists if and only if the approxi-
a point (xo,Yo) if mate upper and lower limits are equal; their common
f(x,y)-f(xo,yo)-A(x-xo)-B(y-yo) - 0 value is equal to the approximate limit.
lim ap - , If x is real, one-sided (right and left) approximate
x-xu'
y----I'O
P
upper and lower limits are also used (xo must then be,
where A and B are certain given numbers and
p= vex - xO)2 +(y _YO)2 . The expression
respectively, a right-hand or left-hand density point in
the domain of definition of the function). For the
A (x - xo) + B(y -Yo) is called the approximate differen- approximate right upper limit the following notation is
tial off(x,y) at (xo.Yo). used:
Stepanov's theorem: A real-valued measurable func- lim apf(x),
x-xu,
tion f (x, y) on a set E is approximately differentiable X>Xo

almost-everywhere on E if and only if it has finite with corresponding notations for the other cases. If the
approximate partial derivatives with respect to x and to approximate right upper and lower limits coincide, one
y almost-everywhere on E; these partial derivatives obtains the right approximate limit; if the approximate
almost-everywhere on E coincide with the coefficients A left upper and lower limits coincide, one obtains the left
and B, respectively, of the approximate differential. approximate limit.
The concept of approximate differentiability can also Approximate limits were first utilized by A. Denjoy
be extended to vector functions of one or more real and A.Ya. Khinchin in the study of the differential
variables. connections between an indefinite integral (in the sense
References of Lebesgue and in the sense of Denjoy- Khinchin)
[I] SAKS, S.: Theory of the integral, Hafner. 1952 (translated from and the integrand (cf. Approximate continuity; Approxi-
the Polish).
G.P. To/stov mate derivative).

212
APPROXIMA TIo,", BY PERIODIC TRA'-JSFORMATIO'-JS

References died in geometry and in topology. Certain branches of


[I] SAKS, S.: Theory of the integral, Hafner, 1952 (translated from mathematics deal almost exclusively with approxima-
the Polish). G P 7' I
. . 10 stov tions - e.g. the theory of approximation of functions
Editorial comments. A point of dispersion is defined as and the theory of numerical methods in analysis.
follows: Let A be a set in W. Let 0A be the completely- BSE-3
additive set function defined for measurable E by AMS 1980 Subject Classification: OOA25
0A(E) =1-1' (E n A), the outer measure of En A. Let x be
any pOint in Rn. The upper strong derivative and lower APPROXIMATION BY PERIODIC TRANSFORMA-
strong derivative DoACx) and OOA(X) are called, respectively, TIONS - One of the methods of ergodic theory. Any
the upper outer density and lower outer density of A at x. automorphism T of a Lebesgue space X with measure J.l
The point x is a point of density for a set A if the outer den-
can be obtained as the limit of periodic automorphisms
sity of A at x is 1 and it is a point of dispersion if the outer
Tn in the natural weak or uniform topology of the
density of A at x is zero. If A is measurable, almost-all points
space 2! of all automorphisms [1]. To characterize the
of A are pOints of density and almost-all points of its com-
plement are points of dispersion. The latter condition is also rate of approximation quantitatively one considers not
sufficient for A to be measurable. only the automorphisms Tn, but also finite measurable
decompositions of X which are invariant under to Tn, i.e.
References
[A 1] BRUCKNER, A.M.: Differentiation of real functions, Springer, decompositions of X into a finite number of non-
1978. intersecting measurable sets Cn, I, . . . ,Cn,q,' which are
[A2] MUNROE, M.E.: Introduction to measure and integration, mapped into each other by Tn. The number
Addison-Wesley, 1953.
[A3] THOMSON, B.S.: Real functions, Springer, 1985. q,
d(T, Tn; ~n) = ~p.(TCn.i t1 TnC,,)
AMS 1980 Subject Classification: 26A 15 i=l

is an estimate of the proximity of Tn to T with respect to


APPROXIMATELY-COMPACf SET - A set having
~n; here A is the symmetric difference
the property of approximate compactness. A metric pro-
jection on any approximately-compact Chebyshev set is At1B = (A \B) U (B \A).
continuous. Examples of approximately-compact sets If qn is given, it is possible to choose ~n and Tn (with
include boundedly-compact sets and closed convex sets the above properties) such that d(T, Tn; ~n) is arbi-
in the spaces Lp (0 <p < 00), and the set of rational trarily small [1]. The metric invariants of the automor-
fractions in which the degrees of the numerator and phism T become apparent on considering infinite
denominator are constant. In approximation theory and sequences Tn and ~n such that for any measurable set A
in the theory of ill-posed problems frequent use is there exists a sequence of sets An, each being the union
made of spaces in which all closed sets are approxi- of some of the Cn,i, which approximates A in the sense
mately compact. that
lim p.(A t1A n) =0
References n~oo

[I] EFIMOV, N.V. and STECHKIN, S.B.: 'Approximative compact-


('the decompositions ~n converge to a decomposition
ness and Cebysev sets', Soviet Math. Dokl. 2, no. 5 (1961),
1226-1228. (Dokl. Akad. Nauk SSSR 140, no. 3 (1961), 522- into points'). If, in addition, d(T, Tn; ~n)<f(qn)' where
524) f (n) is a given monotone sequence tending to zero,
[2] VLASOV, L.P.: 'Approximative properties of sets in normed then one says that T admits an approximation of the
linear spaces', Russian Math. Surveys 28, no. 6 (1973), 1-66.
(Uspekhi Mat. Nauk 28, no. 6 (1973),3-66) Y N S bb . first type by periodic transformations with rate f (n); if,
u. . u otm in addition, Tn permutes the sets Cn,i cyclically, then
AMS 1980 Subject Classification: 41 A65, 54099 one speaks of cyclic approximation by periodic transfor-
mations. For other variants see [2], [6], [7].
APPROXIMATION - The replacement of mathemati- At a certain rate of approximation certain properties
cal objects by others which resemble them in certain of the periodic automorphisms Tn affect the properties
respects. It is possible to study in this way numerical of the limit automorphism T. Thus, if T has a cyclic
characteristics and qualitative properties of an object approximation by periodic transformations with a rate
by reducing the problem to the study of simpler or c / n, then, if c<4, T will be ergodic; if c<2, Twill
more convenient objects (e.g. of objects with readily not be mixing; and if c < 1, the spectrum of the
calculable characteristics or with properties which are corresponding unitary shift operator is simple. Certain
known). Number theory deals with Diophantine properties of T may be described in terms of the rate of
approximations, in particular with approximations of approximation. Thus, its entropy is equal to the lower
irrational numbers by rational numbers. Approxima- bound of the c's for which T admits an approximation
tions of curves, surfaces, spaces and mappings are stu- by periodic transformations of the first kind with a rate

213
APPROXIMATION BY PERIODIC TRANSFORMATIONS

of 2c /log2 n [2], [7]. Approximations by periodic the following inequality for the sequences {~n}, {Tn}, where
transformations were used in the study of a number of Tn is periodic of order qn,
simple examples [2], including smooth flows on two-
dimensional surfaces [8]. They served in the construc- ;=1
tion of a number of dynamical systems with unexpected and Urn ~Ur in the strong topology for operators on
metric properties [2], [6], [7], or with an unexpected L 2 (X, p.), then one says that T admits an approximation of
combination of metric and differential properties [3], the second type by periodic transformations with speed
[4]. fen). Reference [A1] is a basic and well-known one.
The statement on the density of periodic automor- In this context one also speaks of partitions instead of
phisms in 2.(, provided with the weak topology, may be decompositions.
considerably strengthened: For any monotone sequence References
j (n 0, the automorphisms which allow cyclic approx- [A1) CORNFEL'D, I.P., FOMIN, S.V. and SINAT, YA.G.: Ergodic
imations at the rate j(n) form a set of the second theory, Springer, 1982, Chap!. 15; 16.
[A2) ROKHLIN, V.A.: 'Selected topics from the metric theory of
category in 2.( [2]. Accordingly, approximations by dynamical systems', Amer. Math. Soc. Trans!. Series 249,
periodic transformations yield so-called category 171-240. (Uspekhi Mat. Nauk4, no. 2 (30) (1949),57-128)
theorems, which state that in 2.( (with the weak topol-
AMS 1980 Subject Classification: 28DXX
ogy) the automorphisms with a given property form a
set of the first or second category (e.g. ergodic sets are APPROXIMATION IN THE MEAN - Approximation
of the second category, while mixing sets are of the first of a given function jet), integrable on an interval [a, b],
category [I D. by a function <l>(t), where the quantity
Let X be a topological or smooth manifold, and let b
the measure Jl be compatible with the topology or with pIJ, </ = j I f(t)-cf>(t) I dt
a
the differential structure. In the class of homeomor-
phisms or diffeomorphisms preserving Jl it is not the is taken as the measure of approximation.
weak topology, but other topologies that are natural. The more general case, when
Category theorems analogous to those for 2.( are valid b

for homeomorphisms; for the history of the problem pIJ,</ = jlf(t)-CP(t) Iqda(t) (q>O),
a
and its present 'state-of-the-art' see [5].
where aCt) is a non-decreasing function different from a
References constant on [a, b], is called mean-power approximation
[I) HALMOS, P.R.: Lectures on ergodiC theory, Math. Soc. of lapan, (with exponent q) with respect to the distribution da(t).
1956.
[2) KAroK, A.B. and SrEPIN, A.M.: 'Approximations in ergodic
If a(t) is absolutely continuous and <l>(t)=a(t), then one
theory', Russian Math. Surveys 22, no. 5 (1967), 77-102. obtains mean-power approximation with weight <l>(t), and
(Uspekhi Mat. Nauk 22, no. 5 (1967), 81-106) if aCt) is a step function with jumps Ck at points tk in
[3) ANosov, D.V. and KAroK, A.B.: 'New examples in smooth
[a, b], one has weighted mean-power approximation with
ergodic theory. Ergodic diffeomorphisms', Trans. Moscow
Math. Soc. 23 (1970),3-36. (Trudy Moskov. Mat. Obshch. 23 respect to the system of points {td with measure of
(1970), 1-35) approximation
[4) KAroK, A.B.: 'Ergodic perturbations of degenerate integrable
Hamiltonian systems', Math. USSR-Jzv. 7, no. 3 (1973), 535- pIJ, </ = ~Ck I f(td -CP(td Iq.
k
571. (Izv. Akad. Nauk SSSR Ser. Mat. 37 (1973),539-576)
(5) KAroK, A.B. and SrEPIN, A.M.: 'Metric properties of measure These concepts are extended in a natural way to the
preserving homeomorphisms', Russian Math. Surveys 25, no. 2
case of functions of several variables.
(1970), 191-220. (Uspekhi Mat. Nauk 25, no. 2 (1970), 193-
220) References
[6) AKCOGLu, M.A., CHACON, R.V. and SCHWARTZBAUER, T.: [I) GONCHAROV, V.L.: The theory of interpolation and approxima-
'Commuting transformations and mixing', Proc. A mer. Math. tion offunctions, Moscow, 1954 (in Russian).
Soc. 24, 637-642. (2) NIKOL'SKli, S.M.: Approximation offunctions of several variables
[7) SCHwARrzBAuER, T.: 'Entropy and approximation of measure and imbedding theorems, Springer, 1975 (translated from the
preserving transformations', Pacific J. Math. 43 (1972), 753- Russian).
764. [3) RICE, l.R.: The approximation offunctions, I. Linear theory,
[8) KOCHERGIN, A.V.: 'On mixing in special flows over a shifting Addison-Wesley, 1964.
of segments and in smooth flows on surfaces', Math. USSR-Sh. N.I. Korndchuk
25, no. 3 (1975),441-469. (Mat. Sh. 96, no. 3 (1975), 472-502) VP. Motornyt
D. V Anosov Editorial comments.

Editorial comments. Contributions to the .foundation of References


[A 1] TIMAN, A.F.: Theory of approximation of functions of a rea!
the theory of approximations were also made by VA Rokh- vanable, Pergamon, 1963 (translated from the Russian).
lin (cf. [A2]). [A2] RIVLIN, TJ.: An introduction to the approXimation of func-
If in an approximation by periodiC transformations one has tions, Dover, reprint, 1981.

214
APPROXIMATION OF A DIFFERENTIAL BOUNDARY VALUE PROBLEM BY DIFFERENCE BOUNDARY VALUE PROBLEMS

[A3] CHENEY, E.W.: Introduction to approximation theory, Chel- is valid, no matter how the norms in Uh and cI>h have
sea, reprint, 1982. been chosen.
AMS 1980 Subject Classification: 41 A25 The additional condition, the fulfilment of which in
fact ensures convergence, is stability [3], [5] - [8], which
APPROXIMATION OF A DIFFERENTIAL BOUN- must be displayed by the finite-difference problem
DARY VALUE PROBLEM BY DIFFERENCE BOUN- !l'hUh =O. This problem is called stable if there exist
DARY VALUE PROBLEMS - An approximation of a numbers ~>o and O<ho such that the equation
differential equation and its boundary conditions by a !l'hzh =CPh has a unique solution Zh E Uh for any CPh EcI>h,
system of finite (usually algebraic) equations giving the II cf>h II <~, h <h o, and if this solution satisfies the ine-
values of the unknown function on some grid, which is quality
subsequently made more exact by making the parame- II Zh - Uh II u, ..; C II <Ph II <1>"
ter of the finite-difference problem (the step of the grid, where C is a constant not depending on h or on the
the mesh) tend to zero. perturbation CPh of the right-hand side, while Uh is a
Consider the computation of a function U which solution of the unperturbed problem !l'huh =0. If a
belongs to a linear normed space U of functions solution U of the differential problem exists, while the
defined in a given domain Du with boundary r, and finite-difference problem !l'hUh approximates the dif-


which is the solution of the differential boundary value
problem Lu = 0, Iu Ir = 0, where Lu = is a differential
equation, while Iu Ir =0 is the set of boundary condi-
ferential problem on solutions U of order p and is
stable, then one has convergence of the same order, i.e.
II [U1,- Uh Ilu, = O(hP).
tions. Let D hU be a grid (cf. Approximation of a dif-
ferential operator by difference operators) and let Uh be For instance, the problem
the normed linear space of functions Uh defined on this
grid. Let [V]h be a table of values of the function v at
L(u) - ~~ - ~~ = 0, t>O, - OO<X<OO,}
(1)
the points of D hU A norm is introduced into Uh so IU/ r = u(O,x)-l/;{x) = 0, -oo<x<oo,
that the equality
where #.x) is a given function with a bounded second-
!iTo II [vlh II u, = II v II u order derivative, can be approximated, for a natural
is valid for any function v E U. The problem of comput- definition of the norms, by the finite-difference prob-
ing the solution U is replaced by a certain problem lem
!l'huh =0 for the approximate computation of the table
[u lh of values of U at the points of D hU Here,!l'hUh is
a certain set of (non-differential) equations for the
values of the grid function Uh E Uh
Let Vh be an arbitrary function of Uh , let !l'hvh =CPh, = E cI>h,
and let cI>h be the normed linear space to which CPh where ui!z is the value of Uh at (xm' In) = (mh, m), r=rh,
belongs for any Vh E Uh One says that the problem r =const. If the norm of CPn is taken to be the upper
!l'hUh is a finite-difference approximation of order p of bound of the moduli of the right-hand sides of the
the differential boundary value problem Lu =0, equations which constitute the system !l'h Vh =CPh,
Iu Ir =0, on the space of solutions U of the latter if Vh E Uh , then the approximation of problem (1) by

II.Ph [u1, 11<1>, = O(hP). problem (2) on solutions U is of the first order. If r> 1,
there is no convergence, whatever the norm. If r';;;;; 1
The actual construction of the system !l'huh involves a
and the norm is
separate construction of its two subsystems LhUh =0
and IhUh Ir, =0. For LhUh =0 one uses the difference II Uh II U. = sup
m.n
I u;:' I,
approximations of a differential equation (cf. Approxi- the problem is stable, so that there is convergence [2],
mation of a differential equation by difference equa- [3]:
tions). The complementary equations IhUh Ir, =0 are II [u1, -Uh II U. = O(h).
constructed using the boundary conditions 1Uh Ir =0. The replacement of differential problems by differ-
An approximation such as has just been described ence problems is one of the most universal methods for
never ensures [2] that the solution Uh of the finite- the approximate computation of solutions of differen-
difference problem converges to the exact solution u, tial boundary value problems on a computer [7].
i.e. that the equality The replacement of differential problems by their
lim II [ulh -Uh II u,
h-+O
= difference analogues started in the works [1], [2] and
[4], and is sometimes employed to prove that the dif-

215
APPROXIMATION OF A DIFFERENTIAL BOUNDARY VALUE PROBLEM BY DIFFERENCE BOUNDARY VALUE PROBLEMS

ferential problem is in fact solvable. This is done as fol- For instance, the equation
lows. It is proved that the set of solutions Uh of the _ d 2u du
difference analogue of the differential boundary value Lu = dx2 +p(x) dx +q(x)u =0
problem is compact with respect to h, after which a is approximated with second-order accuracy by the
proof is given that a solution U of the differential boun- finite-difference equation
dary value problem is the limit of a subsequence Uh,
um+I-2um+Um-1
which converges as hk~O. If this solution is known to LhUh h2 +
be unique, then not only the subsequence, but also the
Um+I-Um-1
entire set of Uh converges to the solution U as h~O. +p(xrn) 2h +q(xm)um = 0,
References
[I] LYUSTERNIK, L.A.: 'Dirichlet's problem', Uspekhi Mat. Nauk 8
where the grids DhU and DhF consist of points Xm = mh,
(1940), 125-124 (in Russian). m is an integer, and Um is the value of the function Uh
[2] COURANT, R., FRIEDRICHS, K. and LEWY, H.: 'Ueber die par- at X m Again, the equation
tiellen Differenzengleichungen der mathematischen Physik',
Math. Ann. 100 (1928). L =
U-
~-
at
a2 u = 0'
ax2
[3] GODUNOV, S.K. and RYABEN'KIl, V.S.: The theory of difference
schemes, North-Holland, 1964.
[4] PETROVSKIl, I.G.: 'New existence proofs for the solution of the is approximated by two different difference approxima-
Dirichlet problem by the method of finite differences', Uspekhi tions on smooth solutions:
Mat. Nauk 8 (1940), 161-170 (in Russian).
Unn+1 -Umn Unm+1 -
2n+n
Um Urn -I
[5] RYABEN'KIl, V.S.: 'On the application of the method of finite
differences to the solution of the Cauchy problem', Dokl. Akad.
L hIl) Uh =
-

T
- h2
=0
Nauk SSSR 86, no. 6 (1952), 1071-1073 (in Russian). (explicit scheme), and
[6] RJABENKI, V.S. [V.S. RYABEN'KIi] and FILlPOV, A.F. [A.F.
Un + 1 - un un + 1 _ 2un + 1 + un + 1
FILlPPOV]: Uber die stabilitiit von DifJerenzgleichungen,
Deutsch. Verlag Wissenschaft., 1960 (translated from the Rus-
L~2)Uh m rn _ m + 1 rn m -I =0
T h2
sian).
[7] SAMARSKIl, A.A.: Theorie der DifJerenzverfahren, Akad. VerI. (implicit scheme), in which the grids DhU and DhF con-
Geest u. Portig K.-D., 1984 (translated from the Russian). sist of the points (xm, tn)=(mh, n7), T=rh2, r=const,
[8] FILlPPOV, A.F.: 'On stability of difference equations', Dokl.
Akad. Nauk SSSR 100, no. 6 (1955), 1045-1048 (in Russian).
m and n are integers, and U:;, is the value of the func-
VS. Ryaben'kit tion Uh at the point (xm, tn) of the grid. There exist
finite-difference operators Lh which represent an
Editorial comments. For additional references, see the approximation to the differential operator L especially
additional references to Approximation of a differential well only on solutions U of the equation Lu = 0 and less
operator by difference operators. well on other functions. For instance, the operator L h ,
AMS 1980 Subject Classification: 65LXX, 65MXX
_ Urn + 1 - Urn [ h Urn +U] [ h]
Lhun = h -</> xrn + 2 ' - 2 - = Ji, xm+2'
APPROXIMATION OF A DIFFERENTIAL EQUA-
TION BY DIFFERENCE EQUATIONS - An approxi- where U=Um +hcj>(xm ), is a first-order approximation
mation of a differential equation by a system of alge- (with respect to h) of the operator L,
braic equations for the values of the unknown functions Lu ~~ -</>Cx, u) = j(x),
on some grid, which is made more exact by making the
parameter (mesh, step) of the grid tend to zero. on arbitrary smooth functions u(x) and it is a second-
Let L, Lu = f, be some differential operator and let order approximation on solutions of the equation
L h, LhUh = Ji" Uh E Uh, Ji, EFh, be some finite-difference Lu = 0 (it is assumed that the function u is sufficiently
operator (cf. Approximation of a differential operator by smooth). When finding numerical solutions of boun-
difference operators). One says that the finite-difference dary value problems for the differential equation Lu = 0
expression LhUh =0, OEFh, is an approximation of with the aid of the finite-difference equation LhUh =0, it
order p to the differential equation Lu =0 on solutions is the approximation property of Lh when applied to
U if the operator Lh approximates the operator L on solutions U of Lu =0 that is significant and not its
solutions U of order p, i.e. if approximation property when applied to arbitrary

II f, = O(hP).
smooth functions. For a large class of differential equa-
I Lh[u lh
tions and systems of equations there exist methods for
The simplest example of the construction of a finite- constructing difference equations approximating them,
difference equation Lh Un = 0 which approximates the which also meet various additional conditions: stability
differential equation Lu = 0 on solutions u, consists in of the solution Uh with respect to rounding-off errors
replacing each derivative in the expression Lu by its such as are permitted in the calculations; the validity of
finite-difference analogue. certain integral relations for Uh which hold for the solu-

216
APPROXIYlATIO:--; OF A DIFFERENTIAL OPERATOR BY DIFFERENCE OPERATORS

tion u of the differential equation; the permissibility of function Lu from the table of values [u 1r of the func-
using arbitrary grids DhU and DhF (this is important in tion u and for the approximation of a differential equa-
calculating the motion of a continuous medium); a lim- tion by difference equations.
itation on the number of arithmetical operations There are two principal methods for constructing
required to compute the solutions; etc. operators Lh approximating L.
The approximation of a differential equation by In the first method Lh[u 1h is defined as the result of
difference equations is an element of the approximation applying the differential operator L to a function in U,
of a differential boundary value problem by difference obtained by some interpolation formula from the grid
boundary value problems in order to approximately cal- function [u 1h'
culate a solution of the former. The second method is as follows. In the domain DF
References of definition of a function fin F one introduces a grid
[I) GoDUNOV. S.K. and RYABEN'KIi, V.S.: The theory of difference DhF , and considers the linear space Fh of grid functions
schemes, North-Holland, 1964 (in Russian). defined on DhF . The operator Lh[u 1r is constructed as
[2) SAMARsKIi, AA: Theorie der Differenzverfahren, Akad. VerI.
Geest u. Portig K.-D., 1984 (translated from the Russian). the product of two operators: an operator which con-
[3) GoDUNOV, S.K., ET AL.: Numerical solution of multi-dimensional verts the function [u 1r into the grid function fh in Fh ,
problems of gas dynamics, Moscow, 1976 (in Russian). i.e. into a table of approximate values of f, and an
[4) SAMARSKIi, AA and PoPov, Yu.P.: Difference schemes of gas
dynamics, Moscow, 1975 (in Russian).
operator which extends fh from DhF to the entire
VS. Ryaben'kif domain Df . For instance, in order to approximate the
Editorial comments. For additional references, see the differential operator
additional references to Approximation of a differential
L -- ~
dx' du -- ji()
dx x, 0,;:::: ,;:::: 1
""'Z""',
operator by difference operators.
AMS 1980 Subject Classification: 65L99, 34A99 one constructs the grid D hU consisting of points Xb
k=O, ... ,N,
APPROXIMATION OF A DIFFERENTIAL OPERA-
TOR BY DIFFERENCE OPERATORS - An approxima-
= Xo < ... < Xk < Xk+l < ... < XN = 1,
tion of the differential operator by parameter- max(Xk+l -Xk)
k
= h,
dependent operators such that the result of their appli-
and a grid DhF consisting of the points
cation to a function is determined by the values of this
function on some discrete set of points - a grid - x; =Xk+8(Xk+l-Xk), k=0, ... ,N-1,
which become more exact as its parameter (mesh, step .; ; 8 .;;;; 1, 8 = canst
of the grid) tends to zero. The values of the operator Lhlu 1h at the points x; are
Let L, Lu = f, be a differential operator which con- defined by the equations:
verts any function u of a class of functions U into a
- fi( ') -
function f of a linear normed space F. Let Du be the L hUh
[ 11 U(Xk+l)-U(Xk)
. - h Xk - ,
x=x, Xk+l- Xk
domain of definition of the functions in U, and let
there be some discrete subset in Du - a grid DhU - k=O, ... ,N-I.
which 'becomes more dense' as h~O. Consider the set Thereafter, the definition of Lh[u 1h is piecewise linearly
Uh of all functions [u 1r defined on the grid only and extended outside DhF with possible breaks at the points
coinciding with u in the points of the grid. A difference Xk, k= 1, ... ,N -2, only.
operator is defined as any operator Lh that converts the Let the norm in F be defined by the formula
grid functions in Uh into functions fh in F. One says
that the operator L h, Lhlu 1h = fh, represents an order p
II <I> II F = sup
x
I </>(x) I
approximation to the differential operator L on U if for Then, on the class of functions U with a bounded third
any function u E U derivative, for (}=O and (}=h /2 the operator Lh
I Lu - Lh[u1h IIF ~ 0, represents an order 1, respectively 2, approximation to
L =d / dx. On the class U of functions with bounded
I Lu-Lh[u1. IIF .;;;; ch P , c=c(u)=const, second derivatives, the representation is of order 1
as h~O. Occasionally, an approximation is understood only, for any (}E[O, 11.
to be the equality The task of approximating a differential operator by
finite-difference operators is sometimes conditionally
considered as solved if a method is found for the con-
in the sense of some weak convergence. The approxi- struction of the grid function
mation of a differential operator by difference opera-
tors is used for an approximate computation of the Lh[U1.1 = Ji, E Fh,
D"

217
APPROXIMATION OF A DIFFERENTIAL OPERATOR BY DIFFERENCE OPERATORS

determined at the points of DhF only, while the task of + u""m + 1 - 2um,/I + Um,n - 1 f,
completing the function fh everywhere on DF is
h2 = mn,

ignored. In such a case the approximation is defined by where um,n and fm,n are the values of the functions [u lit
considering the space Fh as normed, and by assuming, and fh at the point (mh, nh).
for the grid and for the norm, that for any function There are also other methods of constructing opera-
fEF, the function {f}h EFh , which coincides with f at tors Lh which are approximations to the operator L on
the points of DhF , satisfies the equation the space of solutions u of the differential equation
Lu = 0, and which satisfy additional conditions.
~ II {fh IIF. = IIfIiF' References
The operator Lh is understood to be an operator from [I) FILIPPOV, A.F.: 'On stability of difference equations', Dokl.
Akad. Nauk SSSR 100, no. 6 (1955), 1045-\048 (in Russian).
Uh in Fh , and one says that Lh represents an order p (2) BEREZIN, I.S. and ZHIDKOV, N.P.: Computing methods, Per-
approximation to L on U if, for h~O, gamon Press, 1973 (translated from the Russian).
II {Lu h- Lh[u1.IIF. ~ 0, V.S. Ryaben'kit
Editorial comments. The approximation of a differential
I {Luh - Lh[U1. IIF. ~ ch P
operator by difference operators is an ingredient for both
In order to construct an operator Lh which is an the approximation of a differential equation by difference
approximation to L of given order on sufficiently equations and for the approximation of a differential
smooth functions, one often replaces each derivative boundary value problem by difference boundary value
contained in the expression L by its finite-difference problems, and is therefore extensively treated in the litera-
approximation, basing oneself on the following fact. ture on (finite-) difference methods for ordinary and partial
For any integers i, j and for any ko, 2ko + l-;;;:;'i + j, in differential equations. The references listed below not only
provide discretizations of differential equations and boun-
the equation
dary value problems, but also the solution of these prob-
ko
h-j L cku(x+kh) = uV)(x)+l(x,h,Ck o" " ,Ck o )' lems. References [A1] - [A3] , [A5], [A6] are introducing
k= -k o textbooks, whereas [M], [Al], [AB] and [A9] also present
more advanced material.
it is possible, by using the method of undetermined
References
coefficients and Taylor's formula, to select numbers Ck
[A1] AMEs, W.F.: Numerical methods for partial differential equa-
not depending on h, so that for any function u (x) with tions, Nelson, London, 1969.
j + r (r ~ i) bounded derivatives, an inequality of the [A2] FORSYTHE, G.E. and WASOW, W.R.: Finite difference methods
for partial differential equations, Wiley, 1960.
type
[A3] GARABEDIAN, P.R.: Partial differential equations, Wiley, 1964.
I l(x,h,Ck o" " ,Ck o ) I ~ Aijsup
t
I uv+r)(t) Ih r , [A4] GODUNOV, S.K. and RYABEN'KIT, V.S.: The theory of differ-
ence schemes, North-Holland, 1964 (translated from the
where Ai) depends only on i and j, is valid. As an Russian).
[A5] LAMBERT, J.D.: Computational methods in ordinary differen-
example, suppose one constructs an approximating tial equations, Wiley, 1973.
operator for the Laplace operator Ll, [A6] MITCHELL, A.R. and GRIFFITHS, D.F.: The finite difference

!::"u -
a2 u
-2 +-2
a2 u = f(x,y), method in partial differential equations, Wiley, 1980.
[A7] RrCHTMEYER, R.D. and MORTON, K.W.: Difference methods
ax ay for initial value problems, Wiley, 1967.
if Du is the closed square I x I ~ 1, Iy I ~ 1, and DF is [A8] SAMARSKIT, A.A.: Theorie der Differenzverfahren, Akad. VerI.
its interior I x 1<1, Iy 1< l. Assume that h = 1/ N
Geest u. Portig K.-D., 1984 (translated from the Russian).
[A9] STETIER, H.J.: Analysis of discretization methods for ordinary
where N is a natural number, and construct the grid differential equations, Springer, 1973.
with points AMS 1980 Subject Classification: 65D25, 65LXX,
(x,y) = (mh, nh), I mh I ~ 1, I nh I ~ 1, 65MXX
which belong to DhU ' The points APPROXIMATION OF FUNCfIONS - The replace-
(x,y) = (mh,nh), I mh I ~ 1, I nh I ~ 1, ment, according to a definite rule, of a function by a
then belong to D hF , for integers m and n. Since function <p close to it in some sense and belonging to a
set 9C (the approximating set) that is prescribed in
1 h
J;2[r(X+ )-2y(x)+y(x-h)] = y "{x)+U?
h 2 (4)
(~), advance. It is assumed that f is defined on a set Q in
m-dimensional Euclidean space (the real axis being a
Ll can be approximated with second-order accuracy on particular case) in which the approximation is carried
a space of sufficiently smooth functions by the finite- out; f can either be given explicity in terms of elemen-
difference operator Lh if one puts, at the points of tary functions or f can be the solution of some equa-
tion. If only incomplete information about f is avail-
able, the approximation problem is in fact a matter of

218
APPROXIMA nON OF FUNCTIONS

approximating the whole class of functions defined by with respect to tlN is a function s composed of alge-
the information available. braic polynomials of degree r 'tied together' at the
In practice, the necessity to approximate functions points TI, . . . ,TN -I in such a way that on the whole
arises in various situations when it is needed to replace interval [a, b) the spline s is continuous together with
a function by a smoother function or by one which is its derivatives up to and including order r - k. Thus
simpler and more convenient in computations, or when SEC -k[a, b) and on each interval (Tj -I> Tj)
it is required to establish a functional dependence on (i = 1, ... ,N) s is an algebraic polynomial of degree at
the basis of experimental data, etc. most r. For instance, a polygonal line with corners at
In the general problem of the approximation of func- the nodes (knots, joints) Tj is a spline of degree one and
tions it is usually possible to single out the following defect one; a function s that is continuously differenti-
more particular problems: the choice of the approxi- able on [a, b) and that coincides on [Tj - I> Til,
mating set 9C; the choice of the measure of the error of i = I, ... ,N, with a cubic polynomial is a cubic spline
approximation; the choice of the method of approxima- of defect 2, etc. Splines in two or more variables are
tion, e.g. the rule according to which a function cP in 9C similarly defined. Having finite smoothness properties,
is made to correspond to f; and the investigation and splines have a greater local flexibility than polynomials;
estimation of the error of the approximation. a change in the values of a spline in an interval (a, f3)
With respect to the choice of the approximating set 9C, does not much affect (if at all) its behaviour outside
apart from the necessity of ensuring the required accu- (a, f3). Apart from the simplicity of their implementa-
racy of the approximation, one is guided by the aim to tion in computers, the advantages of splines show up,
deal with structurally simple functions cp that are con- in particular, when the information about the function
venient in computations and that can be submitted to a f to be approximated has a discrete character, for
priori specified conditions, e.g. related to smoothness. instance, when the values of f and, possibly, of some of
The classical tools for approximation consist of alge- its derivatives, are given at some points.
braic polynomials (if Q is a bounded closed set) and If f has singularities or if the approximation is car-
trigonometric polynomials (in the periodic case) in one ried out on an unbounded domain, a convenient tool
or more variables. Their widespread application as for approximation consists of rational fractions p / q,
approximating sets is due, in particular, to the fact that where p and q are algebraic polynomials (rational
it is in principle possible to approximate any continu- approximation, Pade approximation). Non-periodic
ous function by means of algebraic of trigonometric functions defined on the whole real axis can also be
polynomials with any degree of accuracy specified in approximated by entire functions of exponential type.
advance. The accuracy of the approximation can be The measure of the error of approximation, pIJ, cp), is
increased at the expense of increasing the degree of the usually chosen by taking into account the condition of
polynomial; however, this complicates the approximat- the concrete problem and the information available
ing process and increases the difficulties of applying it. about the function to be approximated. Most fre-
In practice one takes subspaces of algebraic or tri- quently it is a matter of choosing a function space con-
gonometric polynomials of a fixed degree as approxi- taining f in whose metric it is expedient to estimate the
mating sets and one aims at obtaining the accuracy error of approximation. If
required while keeping the degrees of the polynomials p.(j, f[ = sup I f(t)-!f>(t) I,
used as low as possible. A more general, and at the teQ

same time more flexible, approximation tool is obtained then one deals with uniform, or Chebyshev, approxima-
by considering generalized polynomials tion, while if
!f>(t) = Clf[>l(t)+ ... +CNf[>N(t), p.(j,f[ = jlf(t)-!f>(t) IPdt, p>O,
where {CPI, ... ,CPN} is a system of linearly independent Q

functions which should be chosen such that the condi- then one has mean-power approximation, or 4-
tions of the problem at hand and the a priori condi- approximation, which in the case of p = I is called
tions on cP are taking into account. approximation in the mean. A particular important case
In many problems splines (cf. Spline) have been is that of p =2, mean-square approximation: the error of
found to be more natural and more convenient from the best approximation of the function f by a finite-
the viewpoint of computation than classical polynomi- dimensional subspace can be expressed exactly in terms
als. If of certain determinants. In some problems the
requirements of proximity (closeness) of the functions f
fjw = {a =7'0<7'1 < ... <7'N=b}, N~2, (1)
and cP are different at different points; in order to take
is a fixed subdivision of the interval [a, b), then a (poly- into account this lack of homogeneity a weight function
nomial) spline of degree r and defect k (k = 1, . . . ,r) p(t);;a.O is introduced and weighted approximation is

219
APPROXIMATION OF FUNCTIONS

considered, where the measure of the error of approxi- If W is a linear manifold and if it contains a system of
mation is given by functions cp], ... ,CPN such that cp;(1;) = 1 and CP;(tk)=O
pif, </ = ~~8 I f(t)-<J>(t) I p(t) when ki=i, then the function
N
or
pif, </ = f I f(t)-<J>(t) IPp(t)dt, p>O.
</>(f, t) = ~f(tk}</>k(t)
k=l
Q
The weight function allows one also to see to it that the belongs to ~ and satisfies (2); it yields an interpolation
error is finite when, for instance, f is unbounded. If the method of approximation and is obviously linear. It may
error of approximation has to take into account the be required to choose the function <P(f, t) in such a way
closeness of f and </> only at some individual points tko that at the points tk it coincides not only with f, but
k = 1, ... ,N, in Q, then as JL(f, </ one may take one of also with some its derivatives; this case is called inter-
the quantities polation with multiple nodes. If Q = [a, b) and
a.,;;;;;tl < ... <tN.,;;;;;b, then there is a unique algebraic
polynomial of degree N - 1, and in the periodic case
or N
~ If(td-<J>(tk) IP, p>O,
(b-a=27T, tN =t2m-l<b) - a unique trigonometric
k=l polynomial of degree n -1, coinciding with f at the
into which weight coefficients can also be introduced. points tk' Multiple interpolation is achieved with Her-
When deciding according to which rule the approxi- mite polynomials, a particular case of which is the Tay-
mating function cp(t)=<P(f, t) should be chosen from the lor polynomial; in this case the values of a function
set W (when chosing the method of approximation) it is and of its first n consecutive derivatives at a point are
evident that one should strive for the highest possible interpolated by an algebraic polynomial of degree n.
accuracy of approximation and at the same time for Interpolation with splines has its peculiarities con-
simplicity in the construction of <P(f, t), taking account nected with the choice of the interpolation nodes and
of the information available about f The first require- with the boundary conditions ensuring the existence
ment leads to a function </>j in W that is 'nearest' to f, and uniqueness of the interpolating spline. For
i.e. </>j is such that instance, a spline s of degree r;:.2 and defect 1 with
pif, </>f) = inf pif, </. respect to the subdivision (1) taking prescribed values
,/>E!fI
at N different points t; of the interval (a, b) such that
Here immediately questions arise about the existence 'T;-I~ti~'T;, i=l, ... ,N, exists and is unique if the
and the uniqueness of such a function </>j (the function boundary conditions are given in terms of ma condi-
of best approximation), and also about its characteristic tions s(p)(a), O";;;;;p~r-l, and mb conditions s{Jt)(b),
properties (see [5]). Existence is ensured if W is a closed O";;;;;JJ.~r -1, with ma +mb =r. Functions
locally compact set, and in particular if W is a finite- fEG2r-k-l[a, b) can be put in one-to-one correspon-
dimensional subspace. Uniqueness depends both on the dence with splines s(j, t) of degree 2r - 1 and defect k,
properties of the approximating set W (see Haar condi- l.,;;;;;k ~r, with respect to the subdivision (1) by requir-
tion; Chebyshev system of functions) and on the metric ing that S(P)(j,'Ti)=/P)('Ti), p=O, ... ,k-l;
which defines piJ, </. There is a number of well-known i=l, ... ,N, where, if k<r, one also has to impose
necessary and sufficient conditions which should be some boundary conditions. When k = r, this spline is
satisfied by the function </>j of best approximation in a called a Hermite or local spline, because the behaviour
variety of cases (see Polynomial of best approximation; of the spline in an interval ('Ti -I, 'T;) is determined by
Chebyshev theorem; Markov criterion). However, as a the values of the function f and of its derivatives f(p) ,
rule, these criteria do not yield effective algorithms to p= 1, ... ,k -1, at the points 'T; -I and'Ti.
construct </>/t). Hence it is of great importance to have In the approximation of functions an important role
methods available which, on the basis of the informa- is also played by linear methods constructed on the
tion about the function f to be approximated, allow basis of an expansion of the function in a Fourier series
one to construct effectively some function <P(f, t) in W with respect to an orthogonal system. In particular, in
yielding an acceptable approximation. Here, in the the periodic case a widely used approximation tool con-
first place, one ought to mention linear methods (these sists of Fourier sums, and their various means (the case
satisfy cp(aJ/1 + a2/2 , t)=al</>(fh t)+a2</>(f2, t, to of the trigonometric orthogonal system) (see Approxi-
which, in particular, interpolation methods belong: hav- mation of functions, linear methods).
ing prescribed the points tl, ... ,tN in Q, one chooses The investigation and estimation of the error of
<P(f, t) from those functions cp in ~ that satisfy the approximation is important from the practical point of
interpolation condition view and at the same time is the branch of the approxi-
(2) mation of functions that is richest in ideas. More pre-

220
APPROXIMATION OF FUNCTIONS

cisely, the development of methods dealing with Nikol'skll which initiated a new direction of research
estimating the error, the investigation of this depen- connected with the approximation of classes of func-
dence on the smoothness of the approximated function, tions by finite-dimensional subspaces and with obtain-
as well as the study and comparison of the approxima- ing accurate estimates of the error, using difference-
tion properties of various approximation methods, have differential properties defining the class. The objective
led to the creation of the approximation theory of func- is to obtain the quantities
tions, one of the most rapidly developing branches of sup p.(j, PN(f)),
mathematical analysis.
The foundation of the approximation theory of func- where p.(g, cp) is the chosen measure of the approxima-
tions was laid in the papers of P.L. Chebyshev in 1854 tion error, un is a class of functions and Pn(j, t) is the
- 1859 (see [1)) on best uniform approximation of con- approximating (generalized) polynomial whose coeffi-
tinuous functions by polynomials and rational fractions cients are determined by the method of approximation.
and in the work of K. Weierstrass [2], who proved in Results of this kind make it possible to compare
1885 that with any continuous function on an interval approximation methods from the point of view of their
[a, b], or on the whole real axis and having period 277, approximating capabilities and to formulate the prob-
there can be associated a sequence of algebraic (respec- lem, important in applications, of determining for a
tively, trigonometric) polynomials Pn(j, t) of degree given class of functions the optimal (best) approximat-
n = 1, 2, ... , such that, as n-'HX), ing set (of fixed dimension N). Research in this direc-
tion, based both on the properties of specific approxi-
mation methods and on advanced results of functional
where Q is [a, b], respectively the whole real axis. Simi- analysis, turned out to be also very fertile in ideas,
lar assertions hold in the case when the measure of the because, by proceeding in this way, new facts about the
error of approximation is defined by an integral metric, mutual connections between many extremal problems
as well as in the case of functions of several variables. of various types were discovered, making it possible to
Particular importance has been given to the investiga- obtain deep and delicate relations in the theory of func-
tion of the speed with which the sequence pf.j, Pn(j)) tions. Owing to that it was possible to conclusively
decreases in dependence on the properties of the solve a number of extremal problems concerning the
approximated function and the choice of the approxi- best approximation of important classes of functions
mating polynomials Pn(j, t). The most attention has (see [5] and [7], and also Approximation of functions,
been paid to the study of best approximation problems, extremal problems in function classes).
and also to that of linear methods of approximation Some other aspects of the approximation of functions.
with the property that for a given f the polynomial Additional restrictions can be put on the approximat-
Pn(j, t) can be constructed effectively. An important ing function cp(t)=cp(j, t) in 91. If they are not con-
stage in the development of the approximation theory nected with the function f (for instance, restrictions on,
of functions is connected with the names of Ch.J. de la or relations between, the coefficients of the approximat-
Vallee-Poussin, D. Jackson, and S.N. Bernstein [S.N. ing polynomial), then it is really a matter of determin-
Bernshteln], who initiated the investigation of the rela- ing 91 more precise. A new situation arises if the restric-
tion between the speed of decrease of the approxima- tions on cp(j, t) are connected with the approximated
tion when approximating the function f by means of function f; one of the interesting cases is that of one-
suitably chosen polynomials Pn(j, t) of degree n (as sided approximation. Here cp(j, t) in 91 is required to
n ~ (0), and the difference-differential properties of f It satisfy the inequality cp(j, I)"';;'f(t) (or ~) and the error
turns out that in many cases these properties of f, e.g., is estimated in an integral metric (see [19)).
f having derivatives, smoothness of those derivatives, In applied problems, besides the case of explicity
etc., can be characterized by the sequence of approxi- given functions, one may have to approximate curves
mating polynomials and by the behaviour of the and surfaces which are given only in parametric form;
corresponding error of approximation (see Approxima- as tools of approximation one can use, for instance,
tion of functions, direct and inverse theorems). This parametric splines. It is most natural to measure the
yields a new, constructive characterization of continu- error of approximation by means of the Hausdorff dis-
ous and differentiable functions. In the first third of the tance, which properly takes into account the geometric
twentieth century this kind of problem was dominating proximity of such objects. For instance, for two curves
approximation theory; for this reason this field was I} and 12 , the Hausdorff distance is defined by
also thought of as the constructive theory of functions.
In the nineteen-thirties and -fourties there appeared r(/l, [2) = max {max min P(P, Q), maxminp(P, Q)},
Pe/l Qe/, Pel, Qe/l
papers by A.N. Kolmogorov, J. Favard and S.M.

221
APPROXIMATION OF FUNCTIONS

where p(P, Q) is the Euclidean (or any other) distance [5] KORNEiCHUK, I.P.: ExtTem(ll problems in approximation theory,
between the points P and Q. Here the Hausdorff dis- Moscow, 1976 (in Russian).
[6] DZYADYK, V.K.: Introduction to the theory of uniform approxi-
tance is preferable when choosing the measure of the mation offunctiOns by polynomials, Moscow, 1977 (in Russian).
error of approximation; this is also the case in some [7] TnrnOMIROV, V.M.: Some questions in approximation theory,
cases of the approximation of functions, for instance Moscow, 1976 (in Russian).
[8] NIKOL'sKIi, S.M.: Approximation offunctions of several variables
when a discontinuous function has to be approximated
and imbedding theorems, Springer, 1975 (translated from the
by a smooth function (see [16]). Russian).
The solution of a number of problems in the theory [9] ACHIEZER, N.I. [N.1. AKHIEzER]: Theory of approximation, F.
of approximation of functions is closely connected with Ungar, 1956 (translated from the Russian).
[10] TIMAN, AF.: Theory of approximation offunctions of a real
the study of extremal properties of polynomials in one variable, Pergamon, 1963 (translated from the Russian).
or another concrete system of functions (inequalities for [11] KOROVKIN, P.P.: Linear operators and approximation theory,
the derivatives of polynomials, polynomials least deviat- Hindushtan Publ. Comp., Delhi, 1960 (translated from the
Russian).
ing from zero, etc.). In particular, the proofs of inverse [12] AHLBERG, I.H., NILSON, liN. and WALSH, I.F.: The theory of
theorems in the theory of approximation of functions splines and their applications, Acad. Press, 1967.
strongly rely on inequalities giving estimates of the [13] STECHIUN, S.B. and SUBBOTIN, Yu.N.: Splines in numerical
mathematics, Moscow, 1976 (in Russian).
norm (or value at a fixed point) of some derivatives of
[14] LAURENT, P.I.: Approximation et optimisation, Herman, 1972.
an algebraic or trigonometric polynomial, using certain [15] CoLLATZ, L. and KRAss, W.: Approximationstheorie: Tscheby-
characteristics of the polynomial itself. In this direction scheffiche Approximationen mit Anwendungen, Teubner, 1973.
a number of precise results is known, and these have [16] SENDOV, B.: Hausdorff approximations, Sofia, 1979.
[17] NIKOL'sKIi, S.M.: Quadratureformulas, Moscow, 1979 (in Rus-
generalizations to entire functions (see [6] - [10]). sian).
The problem concerning an algebraic polynomial [18] ZAv'YADov, Yu.S., KVASOV, B.1. and MlROSHNICHENKO, V.I.:
(with prescribed leading coefficient) least deviating Methods of spline functiOns, Moscow, 1980.
from zero in the metric of C or 4 on the interval [a, b],
[19] KoRNEicHUK, N.P., LIGUN, AA and DoRONIN, V.G.: Approx-
imation with constraints, Kiev, 1982 (in Russian).
and the equivalent problem concerning the best [20] DAVIS, P.J.: Interpolation and approximation, Dover, reprint,
approximation of the function t n by a polynomial of 1963.
[21] LoRENTZ, G.G.: Approximation offunctions, Holt, Rinehart
degree at most n -1, was investigated by Chebyshev and Winston, 1966.
(for C) and by his students (for Ld. The solution is [22] KORNEicHUK, N.P.: Splines in approximation theory, Moscow,
given by the Chebyshev polynomials of the first kind 1984 (in Russian).
N.P. Korneichuk
(C) and the second kind (L 1) and by the Legendre
polynomials (L 2 ), which have wide applications both in Editorial comments.
theoretical and applied investigations. A number of
References
results is known for the more general case when all [A1] TODD, 1.: 'Applications of transformation theory; a legacy
coefficients of the polynomial are subjected to some from Zolotarev (1847-1878)" in S.P. Singh, J.H.w. Burry and
restrictions (see [6]). The problem concerning a mono- B. Watson (eels.): Approximation theory and spline functions,
Reidel, 1984, pp. 207-245.
spline of minimal norm, equivalent to that of finding [A2] SZEG<), G.: 'The contributions of L. Fejer to the constructive
the best approximation of the function t n by splines of function theory', in G. Alexits and S.B. Stechkin (eels.): Con-
degree n - 1 with free knots, is of special importance in structive theory of functions, Akad. Kiadn6, Budapest, 1972,
pp. 19-26.
view of the fact that in a number of cases the problem
[A3] IBRAGIMOV, 1.1.: 'On S.N. Bernstein's publications in the con-
of obtaining the best quadrature formula reduces to it. structive theory of functions', in G. Alexits and S.B. Stech kin
For the approximation of functions in the complex (eels.): Constructive theory of functions, Akad. Kiadn6,
plane see Approximation of functions of a complex vari- Budapest, 1972, pp. 27-40 (in Russian).
[A4] MICHELLI, C.A, PAl, D.V. and LIMAYE, B.V. (EDS.): Methods
able. of functional analYSis in approximation theory, Birkhauser,
References 1986.
[I] CHEBYSHEV, P.L.: 'Problems about smallest quantities con- [A5] CHENEY, E.W.: Introduction to approximation theory, Chel-
nected with an approximate representation of a function sea, reprint, 1982.
(1859)', in Complete collected works, Moscow-Leningrad, 1947, [A6] SCHUMAKER, L.L.: Spline functions, basic theory, Wiley,
pp. 151-235 (in Russian). 1981.
[2A] WEIERSTRASS, K.: 'Ueber die analytische Darstellbarkeit [A7] RlvLIN, T.J.: The Chebyshev polynomials, Wiley, 1974.
sogenannter willkiirlicher Funktionen einer reellen [A8] DEVORE, R.A.: 'Freud's work in constructive function theory'
Veranderlichen', Sitzungsberichte Akad. Berlin (1885), 633-639. J. Approximation theory 46 (1986), 32-37.
[2B] WEIERSTRASS, K.: 'Ueber die analytische Darstellbarkeit [A9] TuRAN, P.: 'On some open problems of approximation
sogenannter willkiirlicher Funktionen einer reellen theory'. J. Approximation theory 29 (1980), 23-85. (Mat.
Veranderlichen', Sitzungsberichte Akad. Berlin (1885), 789-805. Lapok25 (1974),21-75)
[3] GoNCHAROV, V.L.: The theory of interpolation and approxima [A 10] BRAESS, D.: Nonlinear approximation theory, Springer, 1986.
tion offunctions, Moscow, 1954 (in Russian).
[4] NATANSON, I.P.: Constructive function theory, 1-3, F. Ungar, AMS 1980 Subject Classification: 41-XX, 41 A65,
1964-1965 (translated from the Russian). 41A10, 41A15

222
APPROXIMA nOl\i Of IT;-";CTIOI\S, DIRECT Al\iD Il\iVERSE THEOREMS

APPROXIMATION OF FUNCTIONS, DIRECT AND integer part of 1 / c5. And if for some positive integer r
INVERSE THEOREMS - Theorems and equalities the series
establishing a connection between the difference-
n =1
differential properties of the function to be approxi-
mated and the magnitude (and behaviour) of the error converges, then it follows that fEcr and that, analo-
of approximation, by various methods. Direct theorems gous to (2), w(f(r), 1 / n) can be estimated in terms of
give an estimate of the error of approximation of a E(f, Tn -I), n = 1, 2, . .. (see [4], [8] and [12]). These
function in terms of its smoothness properties (the estimates imply, in particular, that if
existence of derivatives of a given order, the modulus of E(f, Tn-I) = O(n- r - a ), r=O, I, ... , O<a<l,
continuity of f or of some of its derivatives, etc.). In then f E Cr and the fr)(t) satisfy for O<a< 1 the
the case of best approximation by polynomials, direct
Holder inequality
theorems are also known as Jackson-type theorems [1],
together with their many generalizations and refine- IJ(r)(t')-J(r)(t'') I,;;;; K I t'-t" 1\ (3)
ments (see Jackson inequality and Jackson theorem). whereas for a = 1 they satisfy the Zygmund inequality
Inverse theorems characterize difference-differential
properties of functions depending on the rapidity with /j<r V )-2J (r) [t' ~t"] +J(r)(t'') I ,; ; Kit' -t" I. (4)
which the errors of best, or any other, approximations
tend to zero. The problem of obtaining inverse Denoting this class of functions by Kir+ a , one obtains
theorems in the approximation of functions was first a constructive characteristic for it: fEKHr+a if and
stated, and in some cases solved, by S.N. Bernstein only if
[S.N. Bernshtei'n] [2]. A comparison of direct and
inverse theorems allows one sometimes to characterize A 2'17"-periodic function is infinitely differentiable on the
completely the class of functions having specific whole real axis if and only if
smoothness properties using, for instance, sequences of
best approximations.
lim n rE(f, Tn -d =
n--+co

The connection between direct and inverse theorems for all r= 1,2, ....
is most simple in the periodic case. Let C be the space Similar properties hold for the approximation of
of 2'17"-periodic continuous functions on the whole real periodic functions in the metric of Lp(0,2'17")
axis with norm (l";;;;p < 00), and also for (not necessarily periodic) func-
tions defined on the whole real axis when being
I J I c= max
t
IJ(t) I; approximated by entire functions of finite order (see [7]
let and_ [8]). Direct and inverse theorems are known for
fEer; as a difference-differential characteristic the
modulus of smoothness Wk(f, c5) of order k = 1,2, ... ,
be the best approximation of a function f in C by the
of f (or of some of its derivatives) is used (see [4] and
subspace Tn of trigonometric polynomials of degree at
[8]).
most n, !et w(f, c5) be the modulus of continuity of f:
The situation is different in the case of approxima-
and let Cr , r = 1, 2, ... , be the set of functions in C
tion on a finite interval. Let C = C[a, b) be the space
(;0 = C) that are r times continuously differentiable .sm
of continuous functions on [a, b), with norm
the whole real axis. A direct theorem states: If f E er,
then II J lie = max IJ(t) I
a.;;;t.;;;b

(1)
Let er = er[a, b) be the set of the functions that are r
times continuously differentiable on [a, b), CO = C, and
n = 1,2, . , ., r = 0, I, ' .. ,
let KHr+a[a, b) be the class of functions defined by the
where the constant M does not depend on n. A inequalities (3) and (4) with t', t" E[a, b). For the best
stronger assertion is: It is possible to indicate a approximation,
sequence of linear methods Un, n =0, 1, ... , associat-
E(f,An-l;a,b) = pEA,,_1
inf IIJ-p lie, n=I,2, ... ,
ing a polynomial Un(f, DE Tn to a function f(t) in C
and such that for fEcr the error Ilf-Un(f) lie is of a function f E er by the subspace An - 1 of algebraic
bounded by the right-hapd side of (1). An inverse polynomials of degree at most n - lone has an esti-
theorem states that for f E C mate of the form (1) in terms of the modulus of con-
1IJ..61 tinuity of f r ) on [a, b), but the converse, analogous to
w(f, 8) ,;;;; M8 Z E(f, Tn -d, 8>0, (2)
n =1 that of the periodic case (with an inequality of the form
where M is an absolute constant and [1/ c5] IS the (2)), now only applies for intervals contained in (a, b).

223
APPROXIMATION OF FUNCTIONS, DIRECT AND INVERSE THEOREMS

For instance, if [2] BERNSHTEiN, S.N.: 'On the best approximation of continuous
functions by polynomials of a given degree (1912)" in Col-
E(f,A n - 1 ;a,b) 0;;;; Mn- r - a, (5)
lected works, Vol. 1, Moscow, 1952, pp. 11-104 (in Russian).
r = 0, 1, ... , 0 < a < 1, [3] NIKOL'SKII, S.M.: Izv. Akad. Nauk SSSR Ser. Mat. 10, no. 4
(1946),295-317.
then one can only assert that f belongs to [4] DzYADYK, V.K.: Introduction to the theory of uniform approxi-
KHr+"[a I, b d defined by the inequalities (3) (with mation offunctions by polynOmials, Moscow, 1977 (in Russian).
[5] KORNEicHUK, N.P.: Extremal problems in approximation theory,
O<u<l) merely on the interval [aJ,bdC(a,b); the Moscow, 1976 (in Russian).
constant K depends on a, ai, b 1 ,b, and can increase [6] 'flxHOMIROV, V.M.: Some questions in approximation theory,
unboundedly as a 1 ~a and b 1 ~b. There exist functions Moscow, 1976 (in Russian).
[7] ACHIEZER, N.J. [N.J. AKHIEZER]: Theory of approximation, F.
not belonging to KHr+"[a, b] for which nevertheless Ungar, 1956 (translated from the Russian).
E(f,A n - 1 ;a,b) = O(n- r - a ). [8] TIMAN, A.F.: Theory of approximation offunctions of a real
variable, Pergamon, 1963 (translated from the Russian).
For instance [9] KOROVKIN, P.P.: Linear operators and approximation theory,
Hindushtan Publ. Comp., Delhi, 1960 (translated from the
E(VI=t2,A n - 1; -1, 1) <~, n=I,2, ... , Russian).
'/Tn [10] STECHKIN, S.B. and SUBBOTIN, Yu.N.: Splines in numerical
although VI=f2 fKH" on [-1, 1] for every u> 1/2. mathematics, Moscow, 1976 (in Russian).
[II] DAUGAVET, I.K.: Introduction to the theory of approximation of
It turned out that algebraic polynomials, retaining on functiOns, Leningrad, 1977 (in Russian).
the whole interval [a, b] the best order of approxima- [I2] STECHKIN, S.A.: Izv. Akad. Nauk SSSR Ser., no. 3 (1951),
tion of a function fEC, can yield at the end points of 217-242.
[13] SENDOV, B.: Hausdorff approximations, Sofia, 1979.
the interval a substantially better approximation. This
[14] LORENTZ, G.G.: Approximation offunctions, Holt, Rinehart
phenomenon was first discovered by S.M. Nikol'skii and Winston, 1966.
(see [3]). If, in particular, fEKHr+"[a, b], then for any N.P. Kornelchuk
n >r there is a polynomial Pn(t)EAn -I such that Editorial comments.

M[~v'(t-aXb-t)+ n\ r+
References
If(t)-Pn(t) I 0;;;; a
, (6) [A 1] NATANsON, I.P.: Constructive theory of functions, 1-3, F.
Ungar, 1964-1965 (translated from the Russian).
[A2] DEVORE, R.A.: The approximation of continuous functions by
a 0;;;; t 0;;;; b, positive linear operators, Springer, 1972.
where the constant M depends neither on n nor on t. AMS 1980 Subject Classification: 41 A27, 41 A25
Contrary to (5), this assertion has a converse: If for an
fEC there exists a sequence of polynomials Pn(t)EA n - 1 APPROXIMATION OF FUNCTIONS, EXTREMAL
such that for some r =0, 1, ... , and O<u< 1 (6) is PROBLEMS IN FUNCTION CLASSES - Problems con-
satisfied, then fEKHr+"[a, b]. Direct and inverse nected with the search for upper bounds of approxima-
theorems for fEC[a, b] are known, stated in terms of tion errors with respect to a fixed class of functions and
the modulus of continuity and moduli of smoothness with the choice of an approximation tool that is best in
(see [4] and [8]). some sense or other. The foundations of the research
Direct theorems giving order estimates for the error concerning this kind of problems were laid by A.N.
of approximation in terms of difference-differential pro- Kolmogorov ([1], [2]), J. Favard ([3], [4]) and S.M.
perties of the approximated function were established Nikol'skii ([5], [6]). These researches were greatly
for many concrete approximation methods (see [6], [8] extended in the nineteen-fifties. They stimulated the
and [9], in particular for splines, splines of best approx- requirements of numerical mathematics, penetrating
imation and interpolation [10], cf. Spline). ever deeper into problems of an optimization character.
Direct and inverse theorems for approximation in a If in a normed function space X one considers the
Hausdorff metric are known (see [13]). Some peculiari- approximation of functions from a class we by func-
ties arise here; in particular, the characterization of tions from a fixed set 91cX, then problems on deter-
classes of functions by their best Hausdorff approxima- mining the quantity
tion is connected not only with the order of this E(I]](,9C)x = supE(j, 9C)x, (1)
approximation but also with the magnitude of the con- jE'lJI

stant in the relevant inequality. For direct and inverse where


E(j, \}c)x inf II f -<I> I x
= .pE'l/
theorems in the multi-dimensional case, see Approxima-
tion of functions of several real variables. is the best approximation of a function f(t) by 91, as
References well as on determining the quantity
[I] JACKSON, D.: Ueber die Genauigkeit der Anniiherung stetiger
Funktionen durch ganze rationale Funktionen gegebenen Grades 6' (I]]e, \}c, U)x sup II f -
= jdJl Uf II x, (2)
und trigonometrische Summen gegebener Ordnung, Giittingen,
1911. Thesis. where U is a specific approximation method, given by

224
APPROXIMATION OF FUNCTIONS, EXTREMAL PROBLEMS IN FUNCTION CLASSES

some operator acting from X into 91, are of interest. E(Mwr sr-I),
P' 2n L" = C(MwrP' sr-I),
2n L"
= MKn- r
r'
Geometrically, the supremum in (1) characterizes the
deviation of the set we from 91 in the metric of X. In p = 1,00; n, r = 1,2, ... ,
practice, E(we,91)x can be looked upon, first, as the are valid. The splines (Jr-I (j, t) in S~; 1 interpolating a
minimal-possible bound from above for the best function I at k'IT / n if r is even, and at (2k + I)'IT / 2n if
approximation of 91 by functions that are only known r is odd, provide the best linear methods in this case.
to belong to we, and secondly, it gives a tool for Relative to MW~ these splines have exceptional
measuring and comparing the approximative properties approximative properties, _ since they_ give the best
of concrete approximation methods on we. In (2), the approximation of an IEMW~ in any Lp (cf. [7], [15]).
most important case is when 91 = 91N is an N- When (1) and (3) coincide and when it is possible to
dimensional space and U is a linear approximation construct a concrete linear method solving both prob-
method. A whole series of exact and asymptotically- lems, the cases considered are, in a well-known sense,
exact results is known for the approximation of func- optimal. In other cases it proved expedient in solving
tion classes by concrete linear methods (in particular, (1) to adopt an approach based on general duality
by polynomial or spline methods), see [l] - [12], [19]. theorems, reflecting fundamental relations in geometric
However, special interest attaches to methods attaining convex analysis (see [7], [8]). If, e.g. X is an arbitrary
the infimum linear normed space, X is its dual and 91 is a convex
com, inNh = UXC'1I
inf C([Q, inN, U)x, (3)
set in X, then for any x EX
N

over all linear bounded operators U from X into 91x , inf


ue91
II x-u II = sun
Fet,
[F(X)-SUPF(u)];
UE~
(4)
i.e. to all linear methods that are optimal for we. Obvi- II F II..;I
ously, in particular, if 91 is a subspace,
inf II x-u I = SUp {F(x): FEX, II F 11';;;;1, (5)
and there naturally arises the question on whether the ue'1l

equality sign can be attained. Apart from the trivial F(u)=O for all uEin}.
case when X is a Hilbert function space and the best In a number of cases (4), or (5), allows one to reduce
approximation of a function is given by its Fourier the calculation of the supremum (1) to the more tran-
sums with respect to an orthogonal basis of 91 x , some sparent problem of determining the extremum of an
results when linear methods realize the best explicitly defined functional on some set of functions,
approximation on the entire class we are also known for which, if 91 is a subspace, is related to orthogonality
non-Hilbert spaces. conditions. For example, by (5) the estimation of
Thus, if X is the space L = Lp[O, 2'17'] (I';;;;;p,;;;;; 00) of E(Wq, 91fn-I)i" I~p, q';;;;;oo, can be reduced, using
2'IT-periodic functions, 91~ _I is the subspace of tri-
known inequalities, to the calculation of the supremum
gonometric polynomials of order n- 1
of the norms I g Ilq', q' =q / (q -1), on the set of func-
(dim 91fn-1 =2n-l),_and if MW;, r=I,2, ... , is the tions gE W;', p' =p / (p -1), for which
class of functions IELp for which I(r-I)(t) is absolutely
continuous on [0,2'17'] and for which 11/(r) IIi, ,;;;;;M, then 2.. coskt
[g(t)Sinktdt = 0, k=O, ... ,n-l.
E(MW;, inf,,-I)i, = C(MW;, inf,,-di, = MKrn- r,
More refined situations occur when (1) has to be solved
p = 1,00; n, r = 1,2, ... ,
on classes not given by restrictions on the norm of the
where Kr is_ the Favar~ constant. The best approxima- r-th derivative j<r)(t), but on its modulus of continuity
tion on MWl and MW~ is obtained, moreover, by a w(j(r) .z.~)x. Particularly int~resting the case where is
linear method U~ -I, constructed on the basis of we=W'H'" (r=O,I, ... ; WOH"'=H"') is the class of
Fourier sums (cf. Approximation of functions, linear c
2'IT-periodic functions IE r (CO = C) for which
methods, formula (3 for a specific choice of the multi-
plication coefficients A~n -I) =A~n -I)(r). Linear opera- w(j(r),~)c = w(j(r),~) .;;;; w(~),
tors, with values in 91~ -I, yielding the supremum of where w(~) is a given modulus of continuity, e.g.
the best approximatiol! on convolu!ion classes, includ- w(~)=M~ (0 < a';;;;; 1, O';;;;;~';;;;;'IT). Here the application of
ing, in particular, MW~ and MWl with a rational (5) requires the use of fine results on differentiable
number r>O, as well as on classes of conjugate func- periodic functions of the type of the comparison
tions, have been established (cf. [10], [11]). theorem and the Kolmogorov inequality (for the norm
For approximation by the subspace Sq;, of 2'17'- of derivatives), here described using the tool of rear-
periodic splines of order m and defect 1 with knots rangements (of equi-measurable functions). If w(~) is
(nodes, joints) at k'IT / n (dimS~ =2n), the equalities convex from above, the following equalities ([7], [15])

225
APPROXIMA T10N OF FUNCTIONS, EXTREMAL PROBLEMS IN FUNCTION CLASSES

are valid: that uniformly in tEl -1,1], as n~oo, one has


E(WrH"', ~L-I)x = E(WrH"', S5,,)X = lIf"r(CAl)ilx,
n = 1,2, ... ; r = 0, 1, ....
I j(t)-Pn(j, t) I .;;; ~ [[ : v'l=t2r +o(n- a)]
- -
Here X=C or LJ, thef"r(w,t) are functions in W"Hw
- = E(MHQ, ~:)d(l-t2)a/2 +0(1)].
of period 2'1T / n, with zero average over a period, for Analogous results hold for MW" HI, r= 1, 2, ... , see
which fi,~) is even, equal to [w('1T /n-2t)]/2 on [11 ]. In spline-approximation problems (both of best
[0, '1T / 2n], and equal to - [.>(2t -'1T / n)1/ 2 on and of interpolation type) certain exact and
['1T / 2n, '1T / n]. The norms II f"r(w) II x have an explicit asymptotically-exact results are known for function
expression, e.g. classes on intervals (cf. [15]).
In the case of one-sided approximation (in an
II f" 0 (CAl) lie = ~ CAl [: J. integral metric) a series of exact results concerning the
estimation of the error of best approximation by poly-
IIf"r(CAl) lie = 2!r [~r('/T-t)CAl [~) dt, r=I,2, ... , nomials and splines regarding the above-mentioned
function classes is known (cf. [14]). In deriving them,
where the functions ~k(t) on [0, '1T] are given recurrently essential use is made of duality relations for the best
by ,,_/ approximation under restrictions given by cones.
~I(t) = t, ~k(t) = ~ I ~k-I(U)du. The search for a best approximation tool (of fixed
dimension), leads, for a fixed function class W1, to the
The best linea.,r method from Cinto 91~-1' or into S~n' problem of widths: Determine (cf. (1), (3:
for the class WH W is known for w(8)=m8, 0~8~'1T, i.e. dN(IJR, X) = infE(IJR, ~N)x'
when Wr H W =MW"oo+ 1 . The interpolation splines !liN

or(j, t) in S~'IT yield the supremum E(W" H W , S~'IT)c (for


any convex w(8 only if r = 1.
For solutions to extremal problems for function where the infima are taken over all subspaces 91N of X
classes on a finite interval and not restricted by rigid (as well as over their translations) of dimension N, and
boundary conditions, it is not possible to give such also to determine extremal (best) subspaces yielding
complete results as in the periodic case; the extremal these infima. An upper bound for dN and d~ is given
end points of the interval have a perturbing influence by E(W1,91)x and C(W1,91N )x, which have been esta-
on the extremal functions, which require an increase in blished for concrete subspaces 91N . The fundamental
the order of differentiability. Here, some exact asymp- difficulty in the problem of widths, here, is to obtain
totic results are known. If MWr Ha, r =0, 1, ... , exact lower bounds. In a number of cases it is possible
O<a<l, MWOHa=MH a, is the class of functions to obtain such bounds by appealing to topological con-
fEel -1, 1] for which siderations, in particular, Borsuk's antipodal theorem
(cf. [8]). In practically all cases of exactly solving best
IprV)-j<r)(t'') I.;;; M I t'-( la (t',t"E[-I,I]),
approximation problems in the classes MW; and
then for the best uniform approximation on [- 1, 1] by W" H W of periodic functions by the subspaces 91fn-1
the subspace 91~ of algebraic polynomials of degree (trigonometric polynomials of order n - 1) or S~n
n - 1, the relations (splines of order m and defect 1 with respect to the
partition of knots k'1T / n), the known exact upper
(6)
bounds E(W1, 91N )x give also the values of dN for these
1T classes. Moreover, for periodic classes d 2n - 1 =d2n In
lim nr+aE(MwrH a, ~~)c
n_oo
= M2 f~r('IT-t)tadt,
0
(7) particular (cf. [7], [8], [15], [16])
r = 1,2, ... , d 2n - UV';", C) =
1 d 2n (J.v';", C) = d 2n - UV}, Ld =
1

hold. It is useful to compare these results with the = d 2n (W},L 1) = Krn- r, n,r=I,2, ... ,
corresponding ones in the periodic case. For a = 1, the and for an upper convex w(8) and for X=C or LIane
right-hand sides of (6), (7) are equal, respectively, to has
MK I and MKr + I. Dropping the requirement of a
d2n -
-
1(W'Hw,X)
-
= d2n (W'HW,X) = 11f",(CAl)llx,
polynomial of best approximation, one obtains stronger
results, essentially improving the approximation at the n=I,2, ... ; r=O, 1, ....
end points of [-1, 1] without losing the best asymptot- It should be observed that \)(~ -I is extremal for the
ics on the whole interval. E.g., for any fEMHa there is classes considered, for all r, and that no subspace of
a sequence of algebraic polynomials Pn(j, t)E91: such dimension 2n gives a better approximation for these

226
APPROXIMATION OF FUNCTIONS, EXTREMAL PROBLEMS IN FUNCTION CLASSES

classes than 9Cf" -I (which is of dimension 2n - 1). The For


g:Q = M1JV;,
s~bspace ~Tn of splines is ~xtremal for W~+ I and
W' H W in C if m =r ~d for _W;+I ~ LI if_ m~r. The g:Ql = M2W~ (O<r<k)
linear widths d~ of W~ in C and W~ in LI coincide relations have been obtained between inequalities for
with dN , they are attained on 9Crn-I and Ss.;1 by the norms of derivatives and best approximations of a dif-
best linear methods that were referred to above. The ferentiation operator by linear bounded operators (cf.
widths dN and d~ of Ws. in L2 for N=2n -1 and [13]).
N = 2n are equal and are obtained by Fourier tri- Many extremal problems in the approximation of
gonometric sums. Exact asymptotic results for widths functions may be interpreted as problems of optimal
of function classes on an interval are known in a recovery (cf. [15]). Let the information on a function
number of cases: The widths dN and d~ of W'oo in JEXbe given by a vector T(f, A)={Al(f), ... ,An(f)},
C[ - 1, 1] coincide and are obtained for interpolation where Ak are given functionals on X (e.g. values of J
splines of order r - 1 with respect to non-uniform parti- and/or some derivatives in fixed points). Given that J
tions (cf. [8]). belongs to a class we, it is required to reconstruct J or
The problem of estimating the approximation error some linear functional L(f) of it (e.g. L(f)= AI},
on the set xr of r-fold integrals of functions from X f b
L(t)= J(t)dt, etc.) on the basis of T(f, A) with least
(which is not locally compact) can be stated properly if possible error. The minimization need not only take
for JEX r and fixed y>O one estimates into account methods S juxtaposing T(f, A) with a
E(j, IflNh function q:,(t)~J(t) (or a functional 1(f)~L(f, but
w(j(r) , 'Yh also involves the choice of functionals Ab k = 1, ... ,N.
Depending on the choice of the error measure and the
(w(g, o)x is the modulus of continuity of g in X) or (in class of methods S, the problem of optimal recovery of
the case of a concrete approximation method) if one a function may sometimes be reduced to determining
estimates the widths dN , d~, the Chebyshev centre, or other
Ilj- UN/llx
w(j(r), 'Yh . characteristics of we. The problem of optimal recovery

Determining the suprema of these quantities on x r is of f b


J(t) dt on the basis of information of the form
equivalent to the search for the least constant in the {{(tk)} or {{(v)(tk)} leads to the problem of best qua-
corresponding Jackson inequality; subsequently one drature formulas for we. The best tool of recovering is
may deal with the minimization over all subspaces of achieved, in a number of cases, by splines; e.g. the
dimension N. In a number of cases, these problems splines (Jr-l (f, t) in ss.; 1 that inte_rpolate at equally-
have been solved. E.g. in the inequality distant points tk reconstruct an JE W~ on the basis of

E(j, SS,)c ~ Mrn-rw ~(r), : t, jEer,


information Ak(f) = J(td in each_ point t=/=tk with
minimum error on the whole class W~.
In spaces of functions of two or more variables,
excluding the trivial case of approximation in Hilbert
the least constant is Mr =Kr /2, and it cannot be
spaces, exact solutions of extremal problems have not
improved if sS.n is replaced by any other subspace of
yet been obtained (1983). In a few cases an asymptotic
the same dimension (d. [15]). Exact constants in the
relation for the error of uniform approximation in func-
Jackson inequalities for trigonometric approximation in
tion classes by Fourier sums or some average Fourier
the uniform and in an integral metric are known (cf.
sums is known (cf. [12]).
[7]). In the non-periodic case there are asymptotic
results available. References
Among approximation problems in which the [I] KOLMOGOROY, A.N.: Ann. of Math. 36, no. 2 (1935), 521526.
[2] KOLMOGOROFF, A.N. [A.N. KOLMOGOROY]: 'Ueber die beste
approximating set is not a linear manifold but a convex Annliherung yon Funktionen einer gegebenen Funktionen-
set, the problem of approximating one function class we klasse', Ann. of Math. 37, no. 1 (1936), 107-110.
by another weI with best, in some sense or other, [3] FAYARD, J.: C.R Acad. Sci. 203 (1936), 1122-1124.
[4A] FAVARD, 1.: 'Sur les meilleurs procedes d'approximation de
smoothness properties, is of interest. Such a problem certaines classes de fonctions par des polynomes
arose initially at an intermediate stage in obtaining trigonometriques', Bull. Sci. Math. 61 (1937), 209-224.
exact bounds for [4B] FAYARD, 1.: 'Sur les meilleurs procedes d'approximation de
certaines classes de fonctions par des polynomes
E(H"', Iflfn-l)C (g:Q = H"', g:Ql = MW),,) trigonometriques', Bull. Sci. Math. 61 (1937), 243-256.
[5] NIKoL'sKIi, S.M.: 'Approximations of periodic functions by
(d. [7]); later it became to be considered as an indepen- trigonometric polynomials', Trudy Mat. [nst. Steklov. 15 (1945),
dent problem, and in a number of cases it was possible, 1-76 (in Russian). English abstract.
using (4), to obtain exact results.

227
APPROXIMATION OF FUNCTIONS, EXTREMAL PROBLEMS IN FUNCTION CLASSES

[6] NIKOL'sKIi, S.M.: 'Mean approximation of functions by tri- {tdf is a fixed system of points in the domain of
gonometric polynomials', Izv. Akad. Nauk SSSR Ser. Mat. 10
(1946), 207-256 (in Russian).
definition of f, and if q:,k(ti)=O when i=/=k, while
[7] KORNEicHUK, N.P.: Extremal problems in approximation theory, q:,k(tk) = 1, then UN(j, tk)= f(tk), k= 1, ... ,N, and one
Moscow, 1976 (in Russian). has an interpolation method (for instance, Lagrange's
[8] TnrnOMIROV, V.M.: Some questions in approximation theory, interpolation polynomial, or an interpolation spline). If
Moscow, 1976 (in Russian).
[9] DZYADYK, V.K.: Introduction to the theory of uniform approxi.
X = H is a Hilbert space of functions and Ck(f) are the
mation by polynomials, Moscow, 1977 (in Russian). Fourier coefficients of a function f with respect to an
[10] ACHIEZER, N.!. [N.I. AKHIEZER]: Theory of approximation, F. orthonormal system {q:,k(t)}, then the sum at the right-
Ungar, 1956 (translated from the Russian).
[ II] nMAN, AF.: Theory of approximation offunctions of a real
hand side of (1) yields a linear method of orthogonal
variable, Pergamon Press, 1963 (translated from the Russian). projection of X onto 'inN; in this case
[12] STEPANETS, AI.: Uniform approximation by trigonometric poly

[13]
nomials. Linear methods, Kiev, 1984 (in Russian).
ARESTOV, V.V.: 'Some extremal problems for differentiable Ilf- UN(j)IIH = ~IJt-k~lak~kt
functions of one variable', Trudy Mat. Inst. Steklov. 138 (1975),
3-28 (in Russian). Thus, this method yields the best approximation of f by
(14] KORNEicHUK, N.P., LIGUN, A.A and DORONIN, V.G.: Approx linear combinations of the functions q:,k.
imation with contraints, Kiev, 1982 (in Russian).
(15] KORNEicHUK, N.P.: Splines in approximation theory, Moscow, In the theory of linear approximation methods, spe-
1984 (in Russian). cial attention is given to problems of convergence. Let
N.P. Korneichuk
X be a Banach space, let {q:,1 (t), !f>z (t), . . .} be a
Editorial comments. The recent book by A. Pinkus [A1] linearly independent system of functions on X, let 'inN
contains a comprehensive and valuable bibliography. be the subspace generated by the first N functions of
References this system, N = 1, 2, ... , and let UN be bounded
[A1] PINKUS, A: nwidths in approximation theory, Springer, linear operators from X into 'inN, N= 1,2, . ... The
1985.
[A2] MICHELLI, C.A and RIVLIN, TJ.: 'A survey of optimal
convergence UN(f, t)-'?f(t) (in the sense that
recovery', in CA Michelli and T.J. Rivlin (eds.): Optimal esti- I UN - f II x-'?O as N -'? 00) for any function f in X
mation in approximation theory, Plenum Press, 1977, pp. 1- holds if and only if: 1) the sequence of norms I UN I
54.
of the operators UN is bounded (see Banach-Steinhaus
AMS 1980 Subject Classification: 41 A65, 41 A50, theorem); and 2) UN(j, t)-'?f(t) for all functions fin a
41A46 set A that is everywhere dense in X. These conditions
are satisfie_d, ~. particular, in the space of 2'IT-periodic
APPROXIMATION OF FUNCTIONS, LINEAR
functions Lp = Lp[O, 2'17], with 1<p < 00, and the opera-
MEfHODS - Approximation methods defined by linear
tors Sn defined by the trigonometric Fourier sums
operators. If in a linear normed function space X a
linear manifold (linear subspace) 'in is chosen as
approximating set, then any linear operator U which
transforms a function fEX into a function n = 1,2, ... ,
U(j, t)=(Uj)(t)E'in so that of f As the set A on which 2) is satisfied one can take
U(adl +ad2' t) = al U(jJ, t)+a2U(h, t) the set _of ~ trigonometric polynomials. If X is the
(where lXI and lX2 are arbitrary numbers), defines a space C = qo, 2'17] (of continuous functions _on the
linear approximation method for functions in X by whole real axis with period 2'17) or if X equals L I, then
means of functions in 'in. A linear approximation II Sn I ~lnn as n-'?ooJsee Lebt:sgue constants). Conse-
method is called projective if U(j, t) = f(t) for all f in 'in; quently, there are in C and in LI functions f to which
it is called positive if U(j, t);;.O for non-negative func- the sequence {Sn(j, t)} does not converge in the
tions f appropriate metric. Since in the Banach space of func-
The finite-dimensional case is the most interesting tions X with a norm that is invariant with respect to
one. Here 'in = 'inN is an N-dimensional subspace, and translations the linear operator U;, projecting X onto
then the subspace Tn of trigonometric polynomials of degree
at most n satisfies the i~equa1itx. I U;, I ;;.11 Sn I (see
N

U(f, t) = UN(f, t) = ~ Ck(j)ct>k(t), (I)


k =1 [3]), the divergence in C and L I also holds for the
where {q:,k(t)}f is a basis of 'inN and Ck are certain sequence {U;'}. In particular, this is true in C for the
linear functionals defined on X. The choice of the sequence of Lagrange interpolation operators with any
linearly independent system {q:,k(t)}f and the set of triangular matrix of interpolation nodes. Similar facts
functionals {Ck} f is determined by the information of a negative character also hold in the non-periodic
about the function that one intends to use while con- case for the linear projection operators of the spaces
structing the linear method. If Ck(f) = f(tk), where qa, b] and Lda, b] onto the subspaces An,

228
APPROXI~IATION OF FUNCTIONS. LINEAR METHODS

n = 1,2, ... , of algebraic polynomials of degree at mates of the error in terms of the difference-differential
most n. properties of the approximated function, and to
The diyergence of the sequence (2) for some func- answering the question of how the linear approximation
tions in C made it necessary to introduce and examine method behaves when the smoothness properties (of the
various averages of Fourier sums that do not have this approximated function) are improved.
disadvantage. Such are, for instance, the Fejer swnma- In the examination of approximating properties of
tion method and the Bernstein - Rogosinski swnmation the linear approximation method (1), a natural role is
method, which are particular cases (with a specific played by the best approximation Elf, ~N) of fby ele-
choice of the numerical coefficients Ar) of polynomials ments of the subspace ~N' The Lebesgue inequality
of the form IIf-Sn(f) lie';;; Elf, Tn)C(lnn+3) (5)
ao n
(3) shows, in connection with the Jackson inequality
U~lf,t) = 2+ ~>W)(akcoskt+bksinkt),
v(r) , ! ]
k=1

n = 1,2, .... Elf, Tn)e .;;; Mnw

Since (where wJg, 8) is the modulus of continuity of a func-


U~lf, t) =-
'Tr 0
f
1 2.". [ 1
-2 + ~>W) cosk(t -u) f(u)du,
n

k =1
]
(4)
tion g E C), that, although the order of approximation
by Fourier sums is slightly worse than the best approxi-
the averages (3) belong to the very wide class of linear mation (In n in (5) cannot be replaced by a constant),
approximation methods which can be represented in these sums are affected by any increase of the order of
the form of a convolution of fwith some (singular) ker- differentiability of the function to be approximated.
nel whose properties (in this case, the properties of the For some linear approximation methods the order of
triangular matrix of numbers {Ar)}) determine the approximation cannot be higher than a certain quan-
answer to the problem of convergence. If tity, no matter how many derivatives the function f
may have (the phenomenon of saturation). Thus, the
lim A\,") = 1, k=I,2, ... , order of approximation by positive linear polynomial
n~oo

then the sums (3) converge uniformly when f is a tri- operators U;; cannot be higher then O(n -2); for Fejer
gonometric polynomial, and so 2) is satisfied. For the sums the saturation order is O(n -I); and for
Bernstein - Rogosinski sums it is O(n -2). Interpolation
boun9edness of the norm of U~ as an operator from C
splines with a specific choice of the knots (nodes,
into C, it is necessary that
joints) ensure the best order of approximation not only
Aln) A~n)
-n-+ ... +-1- = 0(1) for the function itself, but also for some of its deriva-
tives, depending on the degree of the polynomials out
and Ar) = 0(1), uniformly in n and k = 1, ... ,n. of which the spline is composed (see [7], [8]).
These conditions are sufficient as well for the bounded-
In some particular cases, for specific linear approxi-
ness of the sequence {II U~ II} if the matrix {A~n)} is
mation methods exact or asymptotically-exact estimates
subject to some additional restrictions (for instance,
for the errors of approximation were obtained for some
convexity or concavity of the columns). Similar to (3),
classes of functions. Strictly speaking, the first non-
using the matrix of coefficients {A~n)} one can also con-
trivial result of this kind was established by A.N. Kol-
struct averages by means of Fourier - Chebyshev sums
mogorov, who proved in 1935 that
(for fEC[ -1, I]) and also by means of the Lagrange
interpolation polynomials with nodes 2k7T / (2n + I) (in
the periodic case) or with nodes cos[(2k -1)7T / (2n + 2)]
on [ - I, 1] (see, for instance, [6]). where W r K, r = 1, 2, ... , is the class of functions fEe
The question of convergence of positive linear opera- for which f(r-I)(t) is absolutely continuous on [0,27T],
tors U;; acting from C[a, b] into An or from Cinto Tn and for which almost everywhere I f(r)(t) I,.;;;,K.
(in particular, of operators of the form (4) with a posi- Subsequently, results of similar character were obtained
tive kernel) is solved in terms of three test functions for Fourier sums (and some of their averages) with
(see [1]): For the uniform convergence of the sequence respect to other important classes of functions, for
U;;(f,t) to f(t)EC[a,b] or to f(t)EC it is necessary instance, in terms of upper bounds on the modulus of
and sufficient that this occurs for the functions 1, t and continuity of the r-th derivative (see [2]), [6] and [9]).
t 2 , or, respectively, for the functions 1, sint, cost. Special interest is given to linear approximation
The investigation of the error of approximation by a methods (I) yielding, on some class of functions, the
linear approximation method leads, mostly, to studies least upper bound of the best approximation by the
on the rate of convergence of UN(f, t) to f(t), to esti- subspace ~N' Such a property for the classes W r K,

229
APPROXIMATION OF FUNCTIONS, LINEAR METHODS

r = I, 2, ... , with a specific choice of the A~n), applies the metric in a function space containing both f and <1>.
to sums of the form (3). For instance, for r = lone For example, if f and cp are continuous functions on a
should take segment [a, b], the uniform metric of C[a, b] is com-
A(n) hr k'TT
k = 2n+2 ctg 2n+2; monly used, i.e. one puts
the property also applies to interpolation splines of p.(j, q,) max I /(t)-<f>(t) I
= a.;;/.;;b
order r - I and defect I with knots hr / n, k = 0, I, ...
If continuity of the approximated function is not
(see [4] and also Approximation of functions, extremal
guaranteed or if the conditions of the problem imply
problems in function classes; Best linear method).
that it is important that f and <I> are close on [a, b] in
References
[IJ KOROYKIN, P.P.: Linear operators and approximation theory, an average sense, the integral metric of a space Ip[a, b]
Hindushtan Pub!. Comp., Delhi, 1960 (translated from the may be used, putting
Russian). b
[2J DZYADYK, V.K.: Introduction to the theory of uniform approxi.
mation offUnctions by polynomials, Moscow, 1977 (in Russian).
p.(j, q,) = f q(t) I/(t)-<f>(t) IP dt,
a
P >0,
[3J TnrnOMIROY, V.M.: Some questions in approximation theory,
Moscow, 1976 (in Russian). where q(t) is a weight function. The case p =2 is most
[4J KoltNEicHUK, N.P.: Extremal problems in approximation theory, often used and is most convenient from a practical
Moscow (in Russian). point of view (cf. Mean-square approximation of a
[5J GoNCHAROY, V.L.: The theory of interpolation and approxima
tion offUnctions, Moscow, 1954 (in Russian).
function).
[6J TIMAN, AF.: Theory of approximation offUnctions of a real The measure of approximation may take into
variable, Pergamon, 1963 (translated from the Russian). account only values of f and cp in discrete points tk,
[7J AHLBERG, l.R., NILSON, E. and WALSH, l.L.: The theory of
k=l, ... ,n, of [a, b], e.g.
splines and their applications, Acad. Press, 1967.
[8J STECHKIN, S.B. and SUBBOTIN, Yu.N.: Splines in numerical p.(j, q,) max I /(tk)-<f>(tk) I,
= l.;;k';;n
mathematics, Moscow, 1976 (in Russian).
[9J STEPANETS, AI.: Uniform approximation by trigonometric poly n
nomials. Linear methods, Kiev, 1981 (in Russian). p.(j, q,) = ~ qk I /(td-<f>(tk) IP,
[IOJ KORNEicHUK, N.P.: Splines in approximation theory, Moscow, k=l

1976 (in Russian). where qk are certain positive coefficients.


[11J RICE, l.R.: The approximation offunctions, I. Linear theory,
Addison-Wesley, 1964.
One defines in an analogous way the measure of
[12J ScHUMAKER, L.L.: Spline functions, basic theory, Wiley, 1981. approximation of functions in two or more variables.
[13J DAVIS, P.l.: Interpolation and approximation, Dover, reprint, The measure of approximation of a function f by a
1963.
N.P. Korneichuk family F of functions is usually taken to be the best
Editorial comments. The fact that three test functions approximation:
are enough for uniform convergence of a sequenc~ of posi- E(f, F) = p.(j, F) = .pEF
inf p.(j, q,) .
tive linear operators from C[a, b] into An (or from C into Tn)
can be found in, e.g., [A 1], Chapt. 3, Sect. 3. The theorem The quantity
is a generalization of the Bernstein theorem, and was
obtained by H. Bohman [A2]. For integral operators it was
E(Wl, F) = p.(Wl, F) = sup inf /L(f, cp)
fE~IcF

obtained by P.P. Korovkin (see also [1]). is usually taken as the measure of approximation of a
References class we of functions f by functions cp from a certain
[A1J CHENEY, E.W.: Introduction to approximation theory, Chel-
sea, reprint, 1982.
fixed set F. It characterizes the maximal deviation of
[A2J BOHMAN, R.: 'On approximation of continuous and of ana- functions in we from functions in F that are closest to
lytic functions', Arkiv Mat. 2 (1952), 43-56. them.
[A3] KrnSEWETIER, H.: Vorlesungen Dber lineare Approximation,
In general, when approximation in an arbitrary
Deutsch. Verlag Wissenschaft., 1973.
[A4] DEVORE, R.A: The approximation of continuous functions by metric space X is considered, the measure of approxi-
positive linear operators, Springer, 1972. mation JL{x, u) of an element x by an element u (a set
[A5] CHENEY, E.W., HOBBY, G.R., MORRlS, P.D., ScHURER, F. and F) is the distance p(x, u) (or p(x, F) between x and u
WUBERT, D.E.: 'On the minimal property of the Fourier pro-
jection', Trans. Amer. Math. Soc. 143 (1969), 249-258.
(or F) in the metric of X.

AMS 1980 Subject Classification: 41 A36, 41 A45,


References
[I] GONCHAROY, V.L.: The theory of interpolation and approxima
41A40 tion offunctions, Moscow, 1954 (in Russian).
[2] NIKOL'SKIi, S.M.: Approximation offunctions of several variables
APPROXIMATION OF FUNCOONS, MEASURE and imbedding theorems, Springer, 1975 (translated from the
OF - A quantitative expression for the error of an Russian).
approximation. When the discussion is about the [3] RICE, J.R.: The approximation of functions, 1-2. Addison-
Wesley, 1964-1968.
approximation of a function f by a function <1>, the N.P. Kornetchuk
measure of approximation J1Ij, <1 is usually defined by v.P. Motornyt

230
APPROXIMATION OF FUNCTIONS OF A COMPLEX VARIABLE

Editorial comments. The measure of approximation is del


also called the error measure. p(j, Pn(z;j, E, p == En(j, E, p) == inf{p(j, P): degP.;;;n},
del
References p(j, Rn(z ;/, E, p == Rn(j, E, p) == inf{p(j, R): degR';;;n},
[A1] LORENTZ, G.G.: Approximation of functions, Holt, Rinehart
and Winston, 1966. where the infima are over the set of polynomials P(z)
[A2] PINKUS, A.: n-widths in approximation theory, Springer, 1985
of degree degP:s;;;;n or rational functions R(z) of degree
(translated from the Russian).
deg R :s;;;; n, respectively (or over parts of these sets, dis-
AMS 1980 Subject Classification: 41 A50 tinguished by addition requirements). In fact, one deals
here with properties of solutions for a certain class of
APPROXIMATION OF FUNCTIONS OF A COM- extremal problems. In this context one may also con-
PLEX VARIABLE - The branch of complex analysis sider the study of other extremal problems on sets of
studying problems regarding the approximate represen- polynomials, rational functions and on certain classes
tation (approximation) of functions of a complex vari- of analytic functions, as well as the study of analytic
able by means of analytic functions of a specific class. properties of polynomials and rational functions (in
The fundamental problems in the theory of approxima- particular, obtaining inequalities between various norms
tion of functions of a complex variable are: the possi- of these functions and their derivatives).
bility of approximating; the rate of approximation; and 3) The study of the dependence of the rate of
the approximation properties of various methods of decrease (to zero) of En(f, E, p) and Rn(f, E, p), as
representing functions (interpolation sequences and n ~ 00, on the properties of f, E and p (the so-called
series, series in orthogonal polynomials or Faber poly- direct theorems of approximation theory) and the
nomials, expansion in continued fractions and Pade dependence of the properties of f on the rate of
approximation, sequences of polynomials in exponen- decrease of En(f, E, p) and Rn(f, E, p) to zero as n~oo
tial functions, Dirichlet series, etc.). The theory of and on the properties of E and p (inverse theorems).
approximation of functions of a complex variable is The study of approximation properties of well-known
intimately connected with other branches of complex methods in approximation theory (e.g., series in Faber
analysis, and with mathematics in general. Methods polynomials, various interpolation processes (cf. Inter-
and results on conformal mapping, integral representa- polation process, as well as the search for new effec-
tion, potential theory, the theory of function algebras, tive approximation methods are related to this direc-
etc., play an important role in approximation theory. tion.
The central problems in the theory of approximation 4) The approximation of functions of several com-
of functions of a complex variable relate to approxima- plex variables. Here, basically, the same problems are
tion by polynomials or rational functions, in particular solved as in the case of one complex variable, but the
by polynomials and rational functions of best approxi- results and the methods for obtaining them differ, as a
mation (existence, characteristic properties, uniqueness), rule, sharply from those used in the case of one vari-
as well as to extremal problems and various estimates able.
for polynomials and rational functions (growth esti- In the following some basic results have been listed.
mates, inequalities for derivatives, polynomials and I) The problem of the existence of a uniform approx-
rational functions, least deviation from zero, etc.). imation by polynomials that is as good as one pleases
A.A. Gonchar is solved by the Runge theorem (if f is analytic on E),
The approximation of functions of a complex variable by the Lavrent'ev theorem (if f is continuous on E), the
polynomials and rational functions. In this branch of Keldysh theorem (if E is a closed domain, f is continu-
approximation theory one may distinguish several ous on E and analytic within E) and the Mergelyan
directions. theorem (in the general case: E is a compact set, f is
I) The study of the possibility of approximating a continuous on E and analytic at interior points of E).
function f(z) of a complex variable z with given accu- 2) The problem whether it is possible to approximate
racy by polynomials or rational functions in z, in holomorphic fu~ctions on closed sets E in the extended
dependence on the properties of the set E on which f is complex plane C is solved by Runge'S theorem. In the
given and on which f is to be approximated, on the study of approximating a function f in various spaces,
properties of the metric p of deviation and on the pro- using the metric of these spaces, by rational functions,
perties of f itself. an important role is played by characteristics of the set
2) The study of the properties of polynomials and e ::J C analogous to the analytic capacity y( e). In terms
rational functions of best approximation, i.e. polynomials of y(e) the problem of the description of all compact
Pn(z;j, E, p) and rational functions Rn(z;j, E, p) of sets E on which any continuous function can be
degree not exceeding n, n =0, I, ... , for which approximated with arbitrary accuracy by rational func-

231
APPROXIMATION OF FUNCTIONS OF A COMPLEX VARIABLE

tions is solved as follows: It is necessary and sufficient 8) In a number of cases for the approximation of
that either analytic functions various interpolation processes prove
a) y(o(r, a)\E) = y(o(r, a = r effective, including Pade approximation and its generali-
zations.
for any disc o(r, a)={z: I z-a I <r}, aEC, r>O; or 9) For n ;;;'2, in en
there exist both non-closed Jor-
that
b) lim y(o(r, a)\E) = 00 dan curves on which not every continuous function can
,--+0 r2 be uniformly approximated by polynomials in
for any a EE (the equivalence of a) and b) expresses the (Zl, ... ,zn) to any degree of accuracy, and closed Jor-
so-called 'instability' of the analytic capacity). dan curves on which polynomials uniformly approxi-
3) If E is bounded and Lebesgue-measurable and if mate any continuous function. In CI this is impossible.
1~p <2, then the set of all rational functions is dense 10) Up till now (1983) there are comparatively few
in ~(E). direct theorems on the approximation by rational func-
4) If P >0 and if G is a simply-connected domain tions with free poles (i.e. without any condition on the
with a rectifiable Jordan boundary, then the family of position of the poles of the approximating functions)
all polynomials in z is dense in the Smirnov class Ep(G) and a considerable amount of inverse theorems.
if and only if G is a Smimov domain. References
5) Let the complex-valued functions [I] GoNCHAROV, V.L.: The theory of interpolation and approxima-
tion offunctions, Moscow, 1954 (in Russian).
I(Z),I/>I(Z), ... ,I/>n(z), n;;;'l, be continuous on a com- [2] WALSH, J.L.: Interpolation and approximation by rationalfunc-
pact set E c C. Among all generalized polynomials tions in the complex domain, Arner. Math. Soc., 1969.
[3] SMIRNOV, V.I. and LEBEDEV, N.A.: Functions of a complex vari-
P(z) = Ctq>t(z)+ ... +cnq>n(z) able: constructive theory, M.I.T., 1968 (translated from the Rus-
(CI, ... ,Cn are arbitrary complex numbers) a general- sian).
ized polynomial P o(z) deviates least from 1 in the
[4] DZYADYK, V.K.: Introduction to the theory of uniform approxi-
mation offunctions by polynomials, Moscow, 1977 (in Russian).
metric [5] RUSAK, V.N.: Rational functions as approximating tool, Minsk,
1979 (in Russian).
Pe(j, P) = max{ I j(z)-P(z) I: zEE} [6] GAMELIN, T.W.: Uniform algebras, Prentice-Hall, 1969.
if and only if [7] LEONT'EV, A.F.: Exponential series, Moscow, 1976 (in
Russian).
min{Re[P(z)(Po(z)- j(z))]: [8] Some questions in approximation theory, Moscow, 1963 (in Rus-
sian; translated from the English).
ZEE, Ij(z)-Po(z) I =Pe(j, Po)} ,;;; 0 [9] KEWYSH, M.V.: Dokl. Akad. Nauk SSSR 4 (1936), 163-166.
for each P(z). [10] LAFRIENTIEFF, M. [M.A. LAVRENT'EV]: 'Zur Theorie der kon-
formen Abbildung', Trudy Mat. Inst. Steklov. 5 (1934), 159-
6) If E is a compact set with connected complement 245.
G and if G has a Green function (for the first boundary [11] KOLMOGOROV, A.N.: 'Remark on the polynomials of P.L. Che-
value problem for the Laplace equation) g(z, (0) with a byshev deviating least from a given function', Uspekhi Mat.
Nauk 3, no. 1 (1948),216-221 (in Russian).
pole at 00, then for each Z E G and each polynomial
[12] MERGELYAN, S.N.: 'Uniform approximation to functions of a
P(z) of degree n, the inequality complex variable', Series and approximation 3 (1962), 294-391,
I P(z) I ,;;; M exp{ ng(z, oo)}
Arner. Math. Soc.. (Uspekhi Mat. Nauk 7, no. 2 (1952),31-
122)
with M=max{ I P(z) I: Z EE} holds. [13] VlTUSHKIN, A.G.: 'The analytic capacity of sets in problems of
approximation theory', Russian Math. Surveys 22 (1967), 167-
7) If E is a bounded non-degenerate continuum with
200. (Uspekhi Mat. Nauk 22, no. 6 (1967), 141-199)
connected complement G and if I(z) is analytic at inte- [14] DZHRBASYAN, M.M.: 'Some questions of the theory of
rior points of E and continuous on E with modulus of weighted polynomials in a complex domain', Mat. Sb. 36
(1955),353-440 (in Russian).

l l1~ JJ.
continuity w(~), then
[15] GONCHAR, A.A.: 'The rate of approximation of functions by
rational numbers and properties of numbers', in Proc. Internat.
En(j, E, pc) ,;;; C(j)w d n Congress Mathematicians Moscow, 1966, 1968, pp. 329-356 (in
Russian).
where [16] DOLZHENKO, E.P. and UL'YANOV, P.L.: Vestn. Moskov. Univ.
Ser. Mat. Mekh. 1 (1980), 3-13.
d(t) = max{min{ I ~-z I: ~EG, g(~, 00)=1n(1 +t)}: Z EaG}. [17J MERGELYAN, S.N.: 'On the approximation of functions of a
complex variable', in Mathematics in the USSR during 40
If the closed domain G is bounded by an analytic years: 1917-1957, Vol. I, Moscow, 1959, pp. 383-398 (in Rus-
curve T, then the condition sian).
[18] GONCHAR, A.A. and MERGELYAN, S.N.: History of paternal
En(j, E, pc) = O(n -p Q) mathematics, Vol. 1, Kiev, 1970, pp. 112-193 (in Russian).
is equivalent to the condition that f(P)(z) is Holder [19J TAMRAZOV, P.M.: Smoothness and polynomial approximation,
Kiev, 1975 (in Russian).
continuous of order <x, O<<x< 1, in G. The case when T [20J MEL'NIKOV, M.S. and SINANYAN, S.O.: Contemporary problems
is a piecewise-smooth curve with corners has been stu- in mathematics, Vol. 4, Moscow, 1975, pp. 143-250 (in Rus-
died. sian).
E. P. Dolzhenko

232
APPROXIMATION OF FUNCTIONS OF SEVERAL REAL VARIABLES

Editorial comments. Let EC C be a compact set. Let be approximated depends on two or more variables:
A(E) denote the set of functions f that are continuous on E f(t) = f(t], ... ,tm ), m ;;;'2.
and analytic at interior points (if any) of E. Denote by peE)
(respectively, R(E) the set of functions fEA(E) that can be (See Approximation of functions.) In comparison with
uniformly approximated on E by polynomials (respectively, the one-dimensional case, the study of problems on the
rational functions). Mergelyan's theorem states: 1) approximation of functions of m (m ;;;;'2) variables is
A(E)=P(E) if and only if CE, the complement of E, is con- highly complicated because of essentially new cir-
nected; 2) A(E) = R(E) if CE is finitely-connected. Examples cumstances that arise related to dimension. First of all
of compact sets E for which A(E)=I=R(E) are known (e.g. this applies to the domain on which the approximation
the Schweizer Kase of A. Roth, cf. [A 1]). The problem is carried out. Simply-connected compact sets (in the
arises of characterizing those E for which A(E) = R(E). The one-dimensional case intervals) in Rm (even in the
solution was given by E. Bishop and A.G. Vitushkin
plane) can have a large variety of shapes; therefore it
(independently) for compact sets without interior pOints, and
becomes necessary to classify the domains according to,
by Vitushkin for general compact E. Vitushkin's theorem can
for instance, the smoothness properties of their boun-
be found in [A1].
The sets A(E), peE) and R(E) are uniform algebras, e.g. daries. Complications also arise in the description of
function algebras endowed with the sup-norm (cf. Algebra difference-differential properties of functions of m vari-
of functions; Uniform algebra). For studies stressing this ables. Generally speaking, these properties can be dif-
aspect see [6] and [M]. ferent in different directions, and their characterization
Another direction of research is to study not approxima- ought to take into account both the geometry of the
tion by linear combinations of 1, Z, Z2, . . . , or domain on which the function is defined and the
... ,Z-1, 1, z, .. (polynomials or rational functions) but by behaviour of the function while approaching the boun-
linear combinations of powers ZA" or of exponentials eiA,Z, dary, so that the study of boundary properties of the
where {An} cR, on sets ECC, mostly on curves satisfying function is of great importance. If one attempts to sim-
some 'oscillation condition'. Results of this type related to plify the solution of an approximation problem by
the Muntz theorem; Lacunary power series; the
passing to a domain with a simpler structure, the prob-
Paley-Wiener theorem, zero sets of analytic functions,
lem that arises is to extend a function f from a domain
etc. (cf. [AS]).
In cn approximation problems depend essentially on the Q cRm into a canonical domain Ql (for instance a
geometry of the domain under consideration (this is related parallelepiped or the whole space Rm) containing Q
to the so-called Levi problem of characterizing domains of while preserving certain smoothness properties (see
holomorphy). One way to approach approximation problems Extension theorems). This kind of problems is closely
a
is via H6rmander's mechanism (cf. [A6]). An example of connected with imbedding theorems, and also with
an approximation theorem is Kerzman's theorem: Let nc C2 questions arising in the solution of the boundary value
be a strongly pseudo-convex domain with sufficiently problems of mathematical physics.
smooth boundary (C 5 suffices). Then every function fez) An increase in the number of independent variables
holomorphic on n and continuous on Q can be uniformly complicates, of course, also the tools of approximation,
approximated on Q by functions f} that are holomorphic ~
A
because when the dimension is increased, with it, e.g.,
some (strongly pseudo-convex) domain n containing n, the degrees of the polynomials increase. An algebraic
[A7].
polynomial of degrees n I, . . . ,nm in the variables
References t I, . . . ,tm , respectively, has the form
[A 1] GAIER, D.: Vorlesungen iiber Approximation im Komplexen,
Birkhauser, 1980. Pn" ... ,nm(t I , " " - ~ak"
tm ) -.c... l' .. lmm,
... ,km 1 (1)
[A2] BUCK, R.c.: 'Survey of recent Russian literature on approxi-
mation', in R.E. Langer (ed.): On numerical approximation, where ak" ... ,km are real coefficients. The summation is
Univ. of Wisconsin Press, 1959, pp. 341-359.
carried out with respect to k P = I, . . . ,m, from 0 to
pO
[A3] KOREVAAR, J.: 'Polynomial and rational approximation in the
complex domain', in J.G. Clunie (ed.): Aspects of contem- n so that, e.g., the subspace of polynomials of degree
pO

porary complex analysis, Acad. Press, 1980, pp. 251-292. 3 in each of the m variables has dimension 4m. Some-
[A4] STOUT, E.L.: The theory of uniform algebras, Bogden &
times the overall or total degree n of the polynomial is
Quigley, 1971.
[A5] REDHEFFER, R.M.: 'Completeness of sets of complex
fixed; then the summation in (1) is carried out over all
exponentials', Adv. in Math. 24 (1977), 1-62. indices satisfying the inequality O';;;;k 1 + ... +km.;;;;n.
[A6] HORMANDER, L.: An introduction to complex analysis in A real trigonometric polynomial of degrees n I, . . . ,nm
several variables, North-HOiland, 1973.
[A7] KERZMAN, N.: 'Holder and LP estimates for solutions of
in the variables t I, . . . ,tm can be written in the form
au= f on strongly pseudo-convex domains', Commun. Pure
Appl. Math. 24 (1971), 301-380. Tn" ... ,nJt l , .. , ,tm) = ~ak, ..... k,exp [(~kptp],
p=1

AMS 1980 Subject Classification: 30E10, 41 A25


where complex coefficients ak, ... ,k, with indices of
APPROXIMATION OF FUNCflONS OF SEVERAL opposite sign are complex conjugate, and the summa-
REAL VARIABLES - The case when the function f to tion is carried out with respect to k P = I, ... ,m, pO

233
APPROXIMATION OF FUNCTIONS OF SEVERAL REAL VARIABLES

from - n" to n". Such a polynomial can be Let Q be an arbitrary open set in Rm (in particular,
represented in the form of a linear combination of all Q= Rm), let e be a unit vector in Rm, let h >0 and let
possible products of the form <Pk,(tl) ... <Pk.(tm), where Qhe be the set of points t E Q such that the segment
<Pk.(t,,) is either sink"t" (OE;;k,,=E;;;n,,) or cosk"t" [t, t+he]EQ. If fELp(Q) and l=e;;;p=E;;;oo, then the quan-
(O=E;;;k,,=E;;;n,,). Multi-dimensional splines have a wide tity
field of applications; they consist of 'pieces' of alge- weif; 8)~(Q) = ~~ II f(t +he)- f(t) II ~(Q.,.)
braic polynomials in m variables 'tied' together in order
is called the modulus of continuity of the function
to obtain specified smoothness properties. In the case
f(t I, . . . ,tm) in the direction e with respect to the metric
m = 2, splines of the simplest form are formed by poly-
of Lp(Q). The quantity
nomials 'tied' together along straight lines parallel to
the coordinate axes. As a tool of approximation func- wif; 8)~(Q) = sup weif; 8)~(Q)
e
tions g(tI. ... ,tm ) are used which are polynomials or
is called the modulus of continuity of f in Lp(Q).
splines only in one of the variables. For the approxi- _ In the periodic case, the best _approximation
mation of non-periodic functions given on the whole
En ..... ,n.(fi,(Rm of a function fELp(Rm) having
space Rm (or on an unbounded subset of Rm) one may
(Sobolev generalized) partial derivatives
use entire functions of exponential type. These can be
represented in the form of sums of absolutely conver- D~f = L f E i,,(Rm)
gent power series at:
Gn.. ... .nJt;, ... ,tm) = ~ ak .. ... k. tt, ... t~ (2)
(where the r,,;;;;'O are integers and DOf=f,
k."O. p= 1, ... ,m) by trigonometric polynomials Tn" ... ,n.,
p=l ... m
satisfies the inequality
under the condition that for any >0 and all complex _ m
tI. ... ,tm one has E....... n.if)"i,(R") .;;;;; M~n;rwe.(D:f;n;I)i,(R.), (3)
m .=1
I Gn .......Jt\, ... ,tm) I .;;;; M,exp ~(n.+()1 t.l,
where e" is the unit vector directed along t", and where
where the constant M. depends only on (see [1]). It the constant}1 depends neither on f nor on n". For a
should be noted that, unlike polynomials, the function function fELp(Rm) having generalized partial deriva-
(2) is determined by an infinite number of parameters. tives
In the multi-dimensional case one has Weierstrass'
theorem on the possibility of approximating a function
fEC(Q), i.e. continuous on_abounded and closed set of order r=rl + ... +rm (where r=(rI. ... ,rm, the
Q CRm, or a function fEC(Rm), continuous on the
following inequalities hold:
whole space Rm with period 2'1T in each variable, with E..... n(f)i,(R) .;;;;;~ ~ w(Drf;n-I)i,(R)' (4)
arbitrary accuracy by means of algebraic (respectively, n'I+"'+,,,,;::::7
trigonometric) polynomi~s. A similar result holds in If
w(Drf; 8)i.,(Rm ) Kif', 0<0:< 1,
the spaces Lp(Q) and Lp(Rm) (in the periodic case) .;;;;;

(1 =E;;;p < 00). General results and theorems concerning i.e. if the function Drfsatisfies a Holder condition, then
properties of the best approximation, the existence and
ff)-.";;:' ~
uniqueness, the characteristic properties of functions of
E- n... nV ~(R) '"'" r+a' r -0
- , 1, ... ., 0< a..."
.,;;:,1 . (5)
n
best approximation, and general relations of duality In the last case the inverse theorem asserts that if for a
when approximating by (means of) a convex set (of function fELp(Rm) inequality (5) hol,?s for all
functions) and, in particular, by a subspace, can be n = 1,2, ... , then the derivatives DrfELp(Rm) exist
extended to normed linear spaces of functions of m and satisfy for any bERm the inequalities
variables (see [3] and [4]). However, to obtain explicit
formulations of these theorems, taking into account a II Drf(t+h)-Drf(t) 1Ii.,(R m) .;;;;; K I h la (6)
specific metric and specific properties of the approxi- for 0<0:< 1; whereas
mating subspace in the multi-dimensional case, involves II Drf(t+h)-2Drf(t)-Drf(t-h) 1Ii.,(Rm
) .; ; ; K Ih I (7)
great difficulties.
Quite thorough investigations were carried out on for 0: = 1. Here K does not depend on the length
questions concerning the connection between the I b I =(hr + ... +hm )I/2 of the vector
smoothness properties of a function of sev.eral variables b=(hl' ... ,hm ).
and the rate of decrease of its best approximation by Theorems analogous to those given above also hold
algebraic and trigonometric polynomials, as well as by for non-periodic functions fELp(Rm) if entire functions
entire functions. of exponential type are used as tools of approximation.

234
APPROXIMATION OF FUNCTIONS OF SEVERAL REAL VARIABLES

The above-mentioned results can be extended also to .;;; A In(2+min{nl, n2})(En"oo(f)x+Eoo,n,(f)x),


classes of functions whose smoothness is described in
where A is an absolute constant, and the factor
terms of moduli of continuity (moduli of smoothness)
1n(2 + min {n 1 , n 2 }) in (9) cannot be replaced by
of a higher order (see [1]).
In the case of approximating functions /ELP(Q) by another factor tending slower to infinity as
algebraic polynomials P n" ... ,n~ on a bounded paral- min{nl' n2}~00 (see [15]).
Fundamental peculiarities arise in problems of inter-
lelepiped (as well on some other bounded sets) direct
polation for functions of several variables. For instance,
theorems similar to (3), (4) and (5) have been proved.
contrary to the one-dimensional case, the existence of
Inverses of these theorems, like in the case of functions
an algebraic interpolation polynomial depends substan-
defined on a bounded interval, are possible only on a
tially on the position of the interpolation nodes.
proper subset Q 1 of Q. Inverse theorems are known
Nevertheless, effective methods of constructing polyno-
assuming a better order of approximation in the neigh-
mials and splines interpolating a function /(/1, ... ,1m)
bourhood of boundary of Q (see [13]); there are also
on a distribution of nodes chosen in a definite way,
direct theorems asserting the possibility of achieving
have been established (see Interpolation). For multi-
such an improvement in a neighbourhood of angular
dimensional interpolation by splines, in a number of
points (see [14]). Necessary and sufficient conditions
cases order estimates of the approximation errors, for
for a function / to belong to a class H'c+a(Q) (defined
the function / as well as for its partial derivatives, have
in the metric of C(Q)) by conditions similar to the con-
been found; in this direction, two-dimensional splines
ditions (6) and (7)) at the cost of raising the order of
of low degree, as well as local (Hermitian) splines of
approximation in a neighbourhood of the boundary
arbitrary degree, have been studied in more detail (see
(like in the one-dimensional case) are unknown (1983).
[7], [10] - [12]). Among the other linear methods for
However, one has the following results of a negative
approximating functions of several variables, multiple
character (see [13]). Let Q={t: tER2 , I t I ~1}. There
Fourier sums and their various means have been stu-
does not exist a sequence A~a)( I t I), n = 1, 2, ... ;
died most extensively. Here, order estimates of the
0<0:< 1, of functions in Q having the two following
approximation with respect to classes of functions are
properties:
known, and in some case asymptotically exact results
1) For any function /EH'C(Q) there are a constant
were obtained (see [5], [6] and [8]).
M and a sequence of polynomials
References
n=l,2, ... ,
[I] NIKOL'sKIi, S.M.: Approximation offunctions of seJieral variables
and imbedding theorems, Springer, 1975 (translated from the
such that Russian).
I I(t)- Pn(t) I .;;; MA~")( I t I), tEQ. (8) [2] GUTER, R.S., KUDRYAVTSEV, L.D. and LEVITAN, B.M.: Ele-
ments of the theory offunctions, Moscow, 1963, 106-198 (in
2) The existence of a constant M>O and of a Russian).
sequence of polynomials Pn(t) satisfying (8) implies that [3] KORNEicHUK, N.P.: Extremal problems in approximation theory,
Moscow, 1976 (in Russian).
/EH'C(Q) for any function/defined on Q. [4] TIKHOMIROV, V.M.: Some questions in approximation theory,
To illustrate the specific nature of approximation of Moscow, 1976 (in Russian).
functions of several variables, one can mention the fol- [5] DZYADYK, V.K.: Introduction to the theory of uniform approxi-
mation offunctions by polynomials, Moscow, 1977 (in Russian).
lowing ~esults. [6] TIMAN, A.F.: Theory of approximation offunctions of a real
Let En"n, (j)x be the best approximation of a 2'lT- variable, Pergamon, 1963 (translated from the Russian).
periodic function / of two variables by trigonometric [7] STECHKlN, S.B. and SUBBOTIN, YU.N.: Splines in numerical
mathematics, Moscow, 1976 (in Russian).
I:0ly~omials Tn"n, in the metric of X (X=C(R2) or [8] STEPANETS, A.I.: Uniform approximation by trigonometric
X=Lp (R 2 )), _and let En"oo(j)X be the best approxima- polynomials. Linear methods, Kiev, 1980 (in Russian).
[9] LAURENT, P.l.: Approximation et optimisation, Hermann, 1972.
tion of / in X by functions Tn" 00' i.e. by trigonometric [10] AHLBERG, l.H., NILSON, E.N. and WALSH, l.F.: Theory of
polynomials
.
of degree at most n 1 in the variable I 1 and splines and their applications, Acad. Press, 1967.
~th coefficients that are functions of (2' The quantity [II] ZAV'YALOV, Yu.S., KVASOV, B.1. and MIROSHICHENKO V L .
Methods of spline functions, Moscow, 1980 (in Russian). . ..
E oo,n, (j)x is defined similarly. [12] VARGA, R.S.: Functional analySiS and approximation theory in
If I <p < 00, one has the inequalities numerical analysis, Reg. Conf. Series in App!. Math., 3, SIAM,
1971.
En,.n, (f)L,(R') .;;; Ap {En,.oo(f)L,(R') + E oo.n, (f)L,(R')}, [13] NIKOL'sKIi, S.M.: 'Approximation of functions of several vari-
ables by polynomials', Siberian Math. 1. 10, no. 4 (1969),792-
wher~ Ap _depends only on p. 799. (Sibirsk. Mat. Zh. 10 (1969), 1075-1083)
If X=L 1(R 2) or X=C(R 2), then [14] BRUDNYi, YU.A.: 'Approximation of functions defined in a
convex polyhedron', Soviet Math. Doklady 11, no. 6 (1970),
(9) 1587-1590. (Dokl. Akad Nauk SSSR 195 (1970), 1007-1009)

235
APPROXIMATION OF FUNCTIONS OF SEVERAL REAL VARIABLES

[15] TEML'YAKov, V.N.: 'Best approximations for functions of two Editorial comments.
variables', Soviet Math. Doklady 16, no. 4 (1975), 1051-1055.
References
(Dokl. Akad. Nauk SSSR 223 (1975), 1079-1082)
[A1] MOllER, M.W.: Approximationstheorie, Akad. Ver-
V. N. Konovalov lagsgesellschaft, Wiesbaden, 1978.
N.P. Korneichuk [A2] LORENTZ, G.G.: Approximation of functions, Holt, Rinehart
References and Winston, 1966.
[A1] CHENEY, E.W.: 'Four lectures on multivariate approximation', AMS 1980 Subject Classification: 41 A25
in S.P. Singh, J.HW. Burry and B. Watson (eds.): Approxi-
mation theory and spline functions, Reidel, 1984, pp. 65-87.
[A2] SCHEMPP, W. and ZELLER, K. (EDS.): Multivariate approxima- APPROXIMATION THEORY - The branch of
tion theory, Birkhauser, 1979. mathematical analysis studying methods for approxi-
[A3] SCHEMPP, W. and ZELLER, K. (EDS.): Multivariate approxima-
tion theory /I, Birkhauser, 1982. mating some mathematical objects by others and also
[A4] SCHEMPP, W. and ZELLER, K. (EDS.): Multivariate approxima- studying questions related to the research and estima-
tion theory /II, Birkhauser, 1985. tion of the error that arises here.
[A5] ScHEMPP, W. and ZELLER, K. (EDS.): Constructive theory of
functions of several variables, Springer, 1977.
The main contents of approximation theory concerns
[A6] LORENTZ, G.G.: Approximation of functions, Holt, Rinehart the approximation of functions. Its foundations are laid
and Winston, 1966. by the work of P.L. Chebyshev (1854 - 1859) on best
[A7] GOLOMB, M.: 'Approximation by functions of fewer variables'
in R.E. Langer (ed.): On numerical approximation, Univ. of
uniform approximation of functions by polynomials
Wisconsin Press, 1959, pp. 275-327. and by K. Weierstrass, who in 1885 established that in
[A8] SCHUMAKER, L.L.: Spline functions. Basic theory, Wiley, principle it is possible to approximate a continuous
1981. function on a finite interval by polynomials with arbi-
AMS 19S0 Subject Classification: 41-XX, 41 A03, trary pre-given error. The development of approxima-
41A25,41A27 tion theory was to a large extent determined by the
fundamental work of H. Lebesgue, Ch.J. de la Vallee-
APPROXIMATION ORDER - The order of the error Poussin, S.N. Bernstein [S.N. Bernshtein], D. Jackson,
of approximation as a variable quantity, depending on J. Favard, A.N. Kolmogorov and S.M. Nikol'skii on
a continuous or discrete argument 'T, relative to another the approximation of functions and function classes.
variable 1f>('T) whose behaviour, as a rule, is assumed to With the development of functional analysis, many
be known. In general, 'T is a parameter that is a numeri- problems in approximation theory were considered in
cal characteristic of the approximating set (e.g. its the most general setting, e.g. as the approximation of
dimension) or of the method of approximation (e.g. the elements of an arbitrary linear normed space X. Three
interpolation step). The set of values of 'T may, more- classes of problems arose here, corresponding more or
over, have an infinite or finite limit point. The function less to the main chronological stages of development of
1f>('T) is most often a power, an exponential or a loga- research in approximation theory.
rithmic function. The modulus of continuity (cf. Con- 1. The approximation of a fixed element x E X by ele-
tinuity, modulus of) of the approximated function (or ments of a fixed set 9C c X. If
that of some derivative of it) or a majorant of it may E(x, 91) = inf II x-u I
figure as If>('T). UE9(

The approximation order is characterized both by the is taken as the approximation measure, i.e. the best
properties of the approximation method, as well as by a approximation of x by 91, then, along with the investi-
definite property of the approximated object, e.g. the gation and estimation of E(x, 9C), questions on the
differential-difference properties of the approximated existence of an element of best approximation Uo E9C
function (cf. Approximation of functions, direct and (for which I x - Uo I = E(x, 9C, its uniqueness and
inverse theorems). characteristic features arise. Any operator A mapping X
In numerical analysis, the approximation order of a into 9C gives rise to an approximation method with
numerical method having error O(h m ), where h is the error I x - Ax II. If 9C is a linear manifold, linear
step of the method, is the exponent m. operators are of particular importance. For sequences
References {An} of such operators, the question of the conditions
[I] GONCHAROV, V.L.: The theory of interpolation and approxima of convergence Anx--n for any x EX arises.
tion of functions, Moscow, 1954 (in Russian). 2. The approximation of a fixed set we C X by ele-
[2] TIMAN, A.F.: Theory of approximation offunctions of a real
variable, Pergamon, 1963 (translated from the Russian).
ments of another fixed set 91 C X. The best approxima-
[3] BAKHVALOV, N.S.: Numerical methods: analysis,. algebra, ordi tion here is expressed by
nary differential equations. Mir. 1977 (translated from the Rus-
sian). E(IJJI,91) = .suo
XE::!1,
E(x, 91),
N.P. Korneichuk
v.P. Motornyi which glves the minimal possible error estimate when

236
ARBITRA nON SCHEME

approximating an arbitrary x EX by elements from 91. to the history of theorems and methods), as well as a useful
In concrete cases, the problem consists of estimating, or bibliography. The more recent book [A2] also contains an
expressing exactly, E('iJR,91) by characteristic properties extensive bi bl iog raphy .
of the given sets 'iJR and 91. If the approximation is References
established by an operator A, the supremum [A1] CHENEY, E.W.: Introduction to approximation theory, Chel-
sea, reprint, 1982.
~~~ II x-Ax II [A2] HOLLAND, A.S.B. and SAHNEY, B.N.: The general problem of
approximation and spline functions, R.E. Krieger, 1979.
is investigated, as well as (if 91 is a linear manifold) [A3] NATANSON, !'P.: Constructive theory of functions, F. Ungar,
1964-1965 (translated from the Russian).
6'(IJR,9C) inf sup I x -
= AXe91XE~ Ax II, [A4] SINGER, I.: Best approximation in normed linear spaces by
elements of linear subspaces, Springer, 1970 (translated
where the infimum is taken over all linear operators A from the Rumanian).
mapping X into 91. A linear operator realizing this [AS] PINKUS, A.: n-widths in approximation theory, Springer, 1985
(translated from the Russian).
infimum (if it exists) gives rise to a best linear method
of approximation. The case lS'('iJR, 91)=E('iJR, 91) is of AMS 1980 Subject Classification: 41 A50, 41 A46,
particular interest. 41A65
3. Best approximation of a fixed set 'iJR C X by a
ARABIC NUMERALS - The traditional name given
given class {91} of approximating sets in X. It is
to the mathematical symbols: 0, 1, 2, 3, 4, 5, 6, 7, 8, 9.
assumed that in {91} there are, in a definite sense,
Using these symbols, every number can be expressed in
'equally-expensive' classes, e.g. containing the same
the decimal system (d. Numbers, representations of).
amount of elements or having the same dimension. The
first case leads to the (-entropy of 'iJR (relative to X), These numerals arose in India (not later than the fifth
the second - to the problem of calculating the width of century), in Europe they became known in the lO-th to
'iJR (in X), in particular, 13-th centuries through Arabic works (hence the name).
BSE-3
dN(IJR, X) = inf
'liN
(IJR, 9C N ), (1) AMS 1980 Subject Classification: 01-XX
and ARBITRATION SCHEME - A rule by which each
d~(IJR, X) = inf 6' (IJR, 9C N ), (2)
9lN arbitration game (d. Cooperative game) is put into
where the infimum is taken over all subspaces 91N in X correspondence with a unique outcome of the game is
of fixed dimension N (or over all possible translations called an arbitration solution. The first arbitration
91N +a of it). Thus in (1), (2) the problem is to deter- scheme was considered by J. Nash [1] for the case of a
mine the best (respectively the best linear) approxima- two-person game. Let R = { u = (u \, ... ,un)} be the set
tion tool of dimension N for 'iJR. of outcomes, let d = (d \, ... ,dn ) be the status quo
point, i.e. the point corresponding to the situation in
References
[I] ACHIEZER, N.!. [N.!. AxHIEZER]: Theory of approximation, F. which no cooperative outcome is realized, let [R, d] be
Ungar, 1956 (translated from the Russian). the corresponding arbitration game and let u be an
[2] GONCHAROV, V.L.: The theory of interpolation and approxima- arbitration solution of it. An outcome u is called a
tion offunctions, Moscow, 1954 (in Russian).
[3] DZYADYK, V.K.: Introduction to the theory of uniform approxi- Nash solution if
mation offunctions by polynomials, Moscow, 1977 (in Russian). II(u;-d;) = maxII(u;-d;),
[4] NIKOL'SKIi, S.M.: Approximation offunctions of several variables i UER i
and imbedding theorems, Springer, 1975 (translated from the
Russian). Only a Nash solution satisfies the following axioms: 1)
[5] KORNEicHUK, N.P.: Extremal problems in approximation theory, if f is a linear non-decreasing mapping then fo is an
Moscow, 1976 (in Russian).
arbitration solution of the game [jR, fd] (invariance
[6] TIKHOMIROV, V.M.: Some questions in approximation theory,
Moscow, 1976 (in Russian). with respect to utility transformations); 2) u~d, u ER
[7] TIMAN, A.F.: Theory of approximation offunctions of a real and there is no u ER such that u;:. u (Pareto optimal-
variable, Pergamon, 1963 (translated from the Russian).
[8] RICE, J.R.: The approximation offunctions, 1-2, Addison-
u
ity); 3) if R' CR, d' =d, u ER', then =u (indepen-
Wesley, 1964-1968. dence of irrelevant alternatives); and 4) if d; =dj ,
[9] DAVIS, PJ.: Interpolation and approximation, Blaisdell, 1965. i, j = 1, ... ,n, and R is symmetric, then u; = Uj'
[10] LORENTZ, G.G.: Approximation offunctions, Holt, Rinehart
and Winston, 1966.
i,j=l, ... ,n (symmetry).
N.P. Kornefchuk Another arbitration scheme for an n-person game
VP. Motornyi with characteristic function v and player set
N = {1, ... ,n} was given by L.S. Shapley [2]. The
Editorial comments. There are many good books on
approximation (and interpolation) theory. Below some of Shapley solution cj>( v) = (q,\ (v), ... ,q,n (v)), where
them are mentioned. [A1] contains extensive notes (related q,;(v) = ~ Yn(s)[v(S)-v(S \ {i})],
seN

237
ARBITRA TION SCHEME

'Yn(S)=(S -l)!(n -S)! / n! and S is the number of ele- ARc, simple arc, Jordan arc - A part of a curve
ments of the set S, also satisfies the axiom of sym- comprised between two of its points (and containing no
metry, but, moreover, ~.(Mv)=v(N), and for any two multiple points). An arc in the plane is often defined
I

games u and v the equality <p(u +v)=<p(u)+<p(v) holds. by specifying the coordinates of its points as continu-
Arbitration schemes with interpersonal utility comparis- ous functions x = <p(t), y =#..1) of some parameter t,
ons have also been considered [3]. a";;;;'t ";;;;'b; it is assumed in so doing that different values
The arbitration schemes of Nash and Shapley were of t correspond to different points.
generalized by J.e. Harsanyi [4]. A Harsanyi solution Editorial comments. In general any homeomorphic
satisfies, apart from the four axioms of Nash, the two image of the unit interval is called an arc, not to be con-
axioms: I) the solution depends monotonically on the fused with a path: A continuous function with as domain the
initial demands of the players; and 2) if u' and u" are unit interval.
two solutions, then Ii, defined by A related concept is that of a Hausdorff arc (or sometimes
also arc): A continuum (a compact and connected topologi-
u ~ min(u;,
ieN
u;*), cal space) that is irreducibly connected between two of its
pOints (any closed and connected subset containing these
is also a solution if and only if Ii belongs to the boun-
two points must be equal to the whole space).
dary of the set R.
Under suitable conditions an arbitration scheme References
[A1] KURATOWSKI, K.: Topology, 2, Acad. Press, 1968 (translated
depends continuously on the parameters of the game. from the French).
References AMS 1980 Subject Classification: 30A99, 51 N99,
[l] NASH, J.: 'The bargaining problem', Econometrica 18, no. 2
(1950), 155-162.
53A04, 54FXX
[2] SHAPLEY, L.S.: 'A value for n-person games', in H.W. Kuhn,
A.W. Tucker and M. Dresher (eds.): Contributions to the theory
ARC, CONTACfLESS (FREE) - A smooth curve
of games, Vol. 2, Princeton Univ. Press, 1953, pp. 307-317.
[3] RAiFFA, H.: 'Arbitration schemes for generalized two-person without self-intersections in the phase plane of a two-
games', in H.W. Kuhn, A.W. Tucker and M. Dresher (eds.): dimensional autonomous system of differential equa-
Contributions to the theory of games, Vol. 2, Princeton Univ. tions
Press, 1953, pp. 361-387. x= P(x,y), y= Q(x,y), (*)
[4] HARsANYI, J.e.: 'A bargaining model for the cooperative n-
person game', in H.W. Kuhn, A.W. Tucker and M. Dresher at each point of which the phase velocity vector of the
(eds.): Contributions to the theory of games, Vol. 4, Princeton system (cf. Phase velocity vector) is defined, is non-
Univ. Press, 1959, pp. 325-355. zero and is not a vector tangent to the curve. The con-
E.I. Vilkas
cept was introduced by H. Poincare [1), and is exten-
Editorial comments. Arbitration schemes are also called
bargaining schemes and a Nash solution is also called a
sively employed in the qualitative theory of differential
bargaining solution. The Shapley solution vector <p is also equations (2). Thus, it is possible to draw a contact less
called the Shapley value. For other, more recent, bargain- segment of sufficiently small length through an arbi-
ing schemes, such as the Kalai- Smorodinsky solution and trary ordinary point of a trajectory of the system (*). A
Szidarovsky's generalization of the concept of a Nash solu- con tactless arc is characterized by the fact that all the
tion, the reader is referred to [A 1], [A2] , respectively [A6]. trajectories of system (*) intersecting the curve intersect
For further developments concerning Harsanyi solutions, cf. it in the same direction. If the derivative along the flow
[A3]. Some authors distinguish between bargaining of the system (*) (cf. Differentiation along the flow of a
schemes and arbitration schemes. Then the Nash scheme is dynamical system) at each point of the smooth curve
a bargaining scheme and the Shapley one an arbitration does not vanish, this curve is a contactless arc. A closed
scheme, [A5]. contactless arc is said to be a contactless cycle.
References
[A 1] KALAl, E. and SMORODINSKY, M.: 'Other solutions to Nash's
References
[I] POINCARE, H.: 'Memoire sur les courbes definies par une
bargaining problems', Econometrica 43 (1975), 513-518.
equation differentielle I - IV', in Oeuvres de H. Poincare, Vol.
[A2] ROTH, A.E.: Axiomatic models of bargaining, Lect. Notes in
I, Gauthier-Villars, 1916, pp. 3-222.
Econom. and Math. Systems, 170, Springer, 1979.
[2] LEFSCHFTZ, S.: Differential equations: geometric theory, Intersci
[A3] HARSANYI, J.e.: Papers in game theory, Reidel, 1982.
ence, 1962.
[A4] AUMANN, R.I. and SHAPLEY, L.S.: Values of non-atomic N.Kh. Rozov
games, Princeton Univ. Press, 1974.
[A5] RApOPORT, A.: N-person game theory: Concepts and appli- AMS 1980 Subject Classification: 34C35, 58F99
cations, Univ. of Michigan Press, 1970, p. 168.
[A6] SZEP, 1. and FORG6, F.: Introduction to the theory of games,
Reidel, 1985. ARC FUNCTION, arcus junction - A function inverse
[A7] VOROB'EV, N.N.: Game theory, Springer, 1977 (translated to a trigonometric function, i.e. one of the functions:
from the Russian). arcsine, arccosine, arctangent, arccotangent, arc secant,
AMS 1980 Subject Classification: 90012 arccosecant. Cf. Inverse trigonometric functions.
ARCHIMEDEAN SEMI-GROUP

Editorial comments. positive cone: It is considered that a,..."b if there exist


References positive integers m and n such that
[1] ABRAMOWITZ, M. and STEGUN, J.A.: Handbook of mathemati- I a I < I b Im and I b I < I a In,
cal functions, National Bureau of Standards, 1964. Dover
reprint, 1972. where
Ixl = max{x, X-I}.
AMS 1980 Subject Classification: 26A09
The positive cone of an Archimedean group consists of
ARCHIMEDEAN AXIOM - An axiom, originally for- a single Archimedean class.
O.A. Ivanova
mulated for segments, which states that if the smaller
AMS 1980 Subject Classification: 06F05
one of two given segments is marked off a sufficient
number of times, it will always produce a segment ARCIDMEDEAN GROUP - A partially ordered group
larger than the larger one of the original two segments. within which the Archimedean axiom is valid: If an <b
This axiom can be formulated in an analogous manner for all integers n (a and b are elements of the
for surfaces, volumes, positive numbers, etc. In general, Archimedean group), then a is the unit of the group (in
the Archimedean axiom applies to a given quantity if additive notation: If na <b for all integers n, then
for any two values A and B of this quantity such that a =0). Totally ordered Archimedean groups may be
A <B it is always possible to find an integer m such described as follows (Holder's theorem): A totally
that Am> B. The axiom forms the base of the process ordered group is Archimedean if and only if it is iso-
of successive division in arithmetic and in geometry (cf. morphic to some subgroup of the additive group of real
Euclidean algorithm). The importance of the numbers with the natural order. Thus, the additive
Archimedean axiom only became fully apparent after group of all real numbers is in a certain sense the larg-
the discovery, in the 19-th century, of magnitudes to est totally ordered Archimedean group. All lattice
which it does not apply - so-called non-Archimedean ordered Archimedean groups are commutative. Totally
quantities (cf. Quantity; see also Archimedean group; ordered groups without non-trivial convex subgroups
Archimedean ring; Archimedean class). (cf. Convex subgroup) are Archimedean.
The axiom was explicitly formulated by Archimedes
References
(third century B.C.) in his work The sphere and the [1] BIRKHOFF, G.: Lattice theory" Amer. Math. Soc., 1967.
cylinder; it had been previously utilized by Eudoxus of [2] KOKORIN, A.1. and KoPYTov, A.M.: Fully ordered groups,
Cnides, and is for this reason also called Eudoxus' Israel Progr. Sci. Trans!., 1974.
[3] fuCHS, L.: Partially ordered algebraic systems, Pergamon, 1963.
axiom.
BSE-3 A.!, Kokorin
AMS 1980 Subject Classification: 01 A20, 06F05 V.M. Kopytov
AMS 1980 Subject Classification: 06F15
ARCHIMEDEAN BODIES - A synonym for Semi-
regular polyhedra. ARCIDMEDEAN RING - A partially ordered ring the
additive group of which is an Archimedean group with
AMS 1980 Subject Classification: 52A25, 51 M20
respect to the given order. An Archimedean totally
ARCHIMEDEAN CLASS - A class resulting from the ordered ring R is either a ring with zero multiplication
subdivision induced by the Archimedean equivalence (i.e. xy =0 for all x and y in R) over an additive group
relation on a totally ordered semi-group. This which is isomorphic to some subgroup of the group of
equivalence is defined as follows. Two elements a and b real numbers, or else is isomorphic to a unique subring
of a semi-group S are called Archimedean equivalent if of the field of real numbers, taken with the usual order.
one of the following four relations is satisfied: An Archimedean totally ordered ring is always associa-
a"';;;;b"';;;;a n , b"';;;;a",;;;;b n,
tive and commutative.
an",;;;;b"';;;;a, bn"';;;;a"';;;;b; References
[I) FUCHS, L.: Partially ordered algebraic systems, Pergamon Press,
which amounts to saying that a and b generate the 1963.
same convex sub-semi-group in S. Thus, the subdivi- o.A. Ivanova
sion into Archimedean classes is a subdivision into AMS 1980 Subject Classification: 06F25
pairwise non-intersecting convex sub-semi-groups.
Moreover, each subdivision into pairwise non- ARCIDMEDEAN SEMI-GROUP - 1) A totally
intersecting convex sub-semi-groups, can be extended ordered semi-group all strictly-positive (strictly-
to a subdivision into Archimedean classes. negative) elements of which belong to the same
The Archimedean equivalence on a totally ordered Archimedean class. All naturally ordered Archimedean
group is induced by the Archimedean equivalence of its semi-groups (cf. Natura1ly ordered groupoid) are iso-

239
ARCHIMEDEAN SEMI-GROUP

morphic to some sub-semi-group of one of the follow- It is described by a point M moving at a constant rate
ing semi-groups: the additive semi-group of all non- along a straight line d that rotates around a point a
negative real numbers; the semi-group of all real lying on that straight line. At the starting point of the
numbers in the interval (0, 1) with the usual order and motion, M coincides with the centre of rotation a of
with the operation ab = min {a + b, I}; the semi-group the straight line (see Fig.). The length of the arc
consisting of all real numbers in the interval (0, 1) and between the points M I (PI, <1>1) and M 2(P2, <1>2) is
the symbol 00 with the usual order and with the opera-
tions: a+b if a+b.;;;l, 1= f [cp~ +In(cp+ ~)]:.
ab ={
00 ifa+b>l. The area of the sector bounded by an arc of the
The former case occurs if and only if S is a semi-group Archimedean spiral and two radius vectors PI and P2,
with cancellation. corresponding to angles <1>1 and <1>2, is
3_ 3
References S=~
[1] FUCHS, L.: Partially ordered algebraic systems, Pergamon, 1963. a
a.A. Ivanova An Archimedean spiral is a so-called algebraic spiral
2) A semi-group S which satisfies the following con- (cf. Spirals). The generalization of the Archimedean
dition: For any a, bES there exists a natural number n spiral is called a neoid, the equation of which in polar
such that an ESbS. If an ESb (an EbS), the semi-group coordinates is
S is called left (right) Archimedean. For commutative p = acp+l.
semi-groups all these concepts are equivalent. Any The spiral was studied by Archimedes (third century
commutative semi-group S is uniquely decomposable B.c.) and was named after him.
into a band of Archimedean semi-groups (and such a
decomposition coincides with the finest decomposition References
[1] SAYELOV, A.A.: Plane curves, Moscow, 1960 (in Russian).
of S into a band of semi-groups). This result may be D.D. Sokolov
generalized in a different manner to non-commutative
semi-groups [1]. A semi-group S with an idempotent is Editorial comments.
Archimedean (right Archiroedean) if and only if it has References
a kernel K and if K contains an idempotent (K is a [A1] LOCKWOOD, E.H.: A book of curves, Cambridge Univ. Press,
1961.
right group, cf. also Kernel of a semi-group) while the
Rees quotient semi-group (cf. Semi-group) is a nil AMS 1980 Subject Classification: 53A04, 14H45,
semi-group. Archimedean semi-groups without idempo- 51N10
tents are harder to study. A complete description in
terms of certain constructions, which is especially clear ARCSINE DISTRIBUTION - A probability measure
for semi-groups with a cancellation law [2], [3], was on the real line whose density is zero outside the inter-
given for the commutative case only. val (0,1) and is Cy'x(l-x-I /'IT if O<x<1. The
corresponding distribution function is equal to
References
[1] PurCHA, M.S.: 'Band of t-Archimedean semigroups', Semi (2/ 'IT) arcsin \Ix for O';;;;;x';;;;; 1.
group Forum 6 (1973), 232-239. The generalized arcsine distribution is employed
[2] CLIFFORD, A.H. and PRESTON, G.B.: The algebraiC theory of together with the arcsine distribution. To the general-
semigroups, 1-2, Amer. Math. Soc., 1961-1967.

l
[3] TAMURA, T.: 'Construction of trees and commutative ized arcsine distribution corresponds the distribution
Archimedean semigroups', Math. Nachr. 36, no. 5-6 (1968), function Fa(x) with density
255-287.
L.N. Shevrin Sin'lT'lTa x-u(l-x)u-I ifO<x<l,
AMS 1980 Subject Classification: 06F05, 20MXX fu(x) =
ARCHIMEDEAN SPIRAL - A plane transcendental
o ifx';;;O,x~l,

curve the equation of which in polar coordinates has if 0<0:<1. The density !1/2(X) coincides with the den-
the form: p = acp.
sity of the arcsine distribution. The generalized arcsine
distribution is a special case of the beta-distribution.
The first-order moment of the generalized arcsine dis-
tribution is 1-0:, and its variance is (1-0:)0:/2. The
arcsine distribution and the generalized arcsine distri-
bution occur in the study of the fluctuations of random
walks, in renewal theory (cf. Arcsine law), and are used
in mathematical statistics as special cases of the beta-
distribution.

240
AREA

References lim !11-


L() - l for 0 <y<oo.
[1) FELLER, W.: An introduction to probability theory and its appli x-+oo x
cations, 1-2, Wiley, 1957-1971.
There exists a close connection between the arcsine law
[2) KENDALL, M.G. and STUART, A.: The advanced theory of statis
tics. Distribution theory, Griffin, 1969. in renewal theory and the arcsine law governing a ran-
B.A. Rogozin dom walk [3].
AMS 1980 Subject Classification: 60EXX
References
[1) FELLER, W.: An introduction to probability theory and its appli-
ARCSINE LAW - A limit theorem describing the cations, 2, Wiley, 1957-1971.
fluctuations of a random walk on the real line, which [2) SPITZER, F.: Principles of random walk, Springer, 1976.
[3) ROGOZIN, B.A.: 'The distribution of the first ladder moment
results in an arcsine distribution or a generalized and height and fluctuation of a random walk', Theory Probab.
arcsine distribution. The following feature of a Appl. 16, no. 4 (1971),575-595. (Teor. Veroyatnost. i Primenen.
Brownian motion {~I: t~O, ~o=O} was noted in 1939 16, no. 4 (1971), 593-613)
B.A. Rogozin
by P. Levy. Let TI be the Lebesgue measure of the set
{u: ~u >0, O",;:;u"';:;t} or, in other words, the time spent Editorial comments. The function L in the article above
is called a slowly varying function, cf. [1], p. 269.
by a Brownian particle on the positive semi-axis during
the interval of time [0, t]. The ratio TI / t will then have AMS 1980 Subject Classification: 60FXX
the arcsine distribution:
AREA - A numerical characteristic assigned to
p{ ~ <x} = F I/ 2(X) = ; arcsin Yx, O";;;x,,;;;l, t>O. planar figures of a certain class (for example, polygons)
and having the following properties: 1) the area is
It was subsequently noted [2] that a random walk with non-negative; 2) the area is additive (in the case of
discrete time obeys the following arcsine law: Let polygons this means that if a figure P U Q is com-
So =0, ... ,Sn, ... , be the successive locations in the posed of two figures P and Q not having common inte-
random walk, n rior points, then area(P U Q) = area P + area Q); 3) the
Sn = ~~u, area is preserved under displacements; and 4) the area
u=1
of the unit square is 1. The term 'area' is used also in a
where ~l, b ... , are independent and identically dis-
more general sense as a numerical characteristic of
tributed, let Tn be equal to the number of indices k
two-dimensional surfaces in three-dimensional space,
among 0, ... ,n for which Sk >0, and let
for k-dimensional surfaces in an n-dimensional
Kn = min {k: Sk = max Sm}, Euclidean or Riemannian space (2",;:;k"';:;n), and for the
O~m~n
boundaries of sets and other objects, see below.
then the relationships The area of a planar figure. Historically, the area was
determined initially on the class of polygons (figures
limp{~<x}
n
n ______ oc
= lim
n~oo
p{Knn <x} = Fa(x), having a decomposition into a finite number of trian-
gles without common interior points). It is important
lim P{SI <O}+ ... +P{Sn<O} =a that an area with the properties 1) - 4) exists and is
n ______ oo n
unique in the class of polygons [1], [2]. One of the
are all satisfied or not satisfied at the same time; here, consequences of 1) - 4) is that the area of an entire fig-
Fa(x) for 0<0:< 1 is the generalized arcsine distribu- ure is not less than the area of its parts.
tion, In Antiquity, the existence and uniqueness of the
FI(x) = E(x) and Fo(x) = E(x-l), area with properties 1) - 4) was assumed without an
where E(x)=O if x",;:;o and E(x)= 1 if x>O. explicit description of the class of figures; attention was
The arcsine law in renewal theory states that for concentrated on ways of calculating the area. The for-
0<0:< 1 the following equalities are valid: mula for the area of a rectangle, including one with
P{~I;:;;'O} = 1 irrational sides, was based on the method of exhaustion
(d. Exhaustion, method of). The area of a triangle or
and for
y, = t-S~" of any polygon was calculated as the area of the rectan-
where 1/1 is defined by the relation S.,." <t",;:;S"",+I' gle such that the given triangle or polygon and the rec-
tangle are both built up from the same congruent parts.
limp{Y; <x} = Fa(x) It has been shown [2] that any polygons of equal area
are decomposable into the same congruent parts.
if and only if Later, the class of squarable (Jordan-measurable) fig-
ures was distinguished. A figure M in the plane is said
for x >0, where L (x) is a function which is defined for to be squarable if for any (>0 there exist polygonal fig-
x >0 and which has the property ures P and Q such that P eM e Q and

241
AREA

(areaQ-areaP)<t:. The class of squarable figures is smooth boundaries (or without boundary). Usually,
extremely rich. It includes, in particular, all bounded this is based on the following construction. The surface
domains in the plane with piecewise-smooth boun- is split up into small parts with piecewise-smooth boun-
daries. However, there are also non-squarable planar daries. In each part one selects a point at which the
figures. In the class of squarable figures, an area with tangent plane exists and projects the part orthogonally
the properties I) - 4) exists and is unique [2]. on the tangent plane to the surface at that point; the
Historically, before the class of squarable figures was areas of the resulting planar projections are summed,
considered, people knew how to calculate the areas of and finally one passes to the limit for increasingly fine
some of them: the disc, a circular sector and segments, subdivisions (such that the largest of the diameters in
various kinds of choral figures, and curvilinear trapezia. the parts of the subdivision tends to zero). This limit
These calculations were based on the method of always exists in the class of surfaces, and if the surface
exhaustion by polygons. In some cases, Cavalieri's prin- is given parametrically by a piecewise C I-smooth func-
ciple (d. Cavalieri principle) was used as the basis for tion r( u, v), where the parameters u and v vary in a
such calculations. It states that if two planar figures of domain D in the (u, v)-plane, then the area F is
this type intersect every line parallel to a given one in expressed by the double integral
segments of identical length, then these figures are
(1)
equal in area. Techniques in integral calculus (see, for
example, [3]) give convenient methods of calculating the where gil =r~, gI2 =ru r.. g22 =r;, while ru and rv are
area of any planar domain with a piecewise-smooth the partial derivatives with respect to u and v. The
boundary. The integral calculus justifies Cavalieri's proof is given, for example, in [3], [5]. In particular, if
principle. the surface is the graph of a C I-smooth function
Attempts to extend the concept of area to more gen- z = f(x,y) over a domain D in the (x,y)-plane, then
eral planar sets while retaining the properties 1) - 4) led
to measure theory and to the distinction of the class of F = \l'1+.fx +.Ii dx dy. (2)
planar Lebesgue-measurable sets. The transition to even
Here (1 +f; +/})I /2 = cos a., where a. is the (acute)
more general classes of sets in the plane leads to non-
angle between the normal to the surface and the z-axis.
unique measures with the properties 1) - 4).
From (1) and (2) one gets the standard formulas for the
Oriented area. If there is a directed closed curve I in
area of a sphere or parts of it, as well as methods for
the oriented plane, possibly with self-intersections and
calculating the areas of surfaces of rotation, etc.
overlaps, then for each point in the plane not lying on I
For two-dimensional piecewise-smooth surfaces in
there is an integer function (positive, negative or zero),
Riemannian manifolds [6], formula (1) serves as a
called the degree of the point with respect on I. It indi-
definition of the area, where the roles of gil, gI2 and
cates how many times and in what direction the con-
g22 are played by the components of the metric tensor
tour I encircles that point. The integral over the entire
restricted to the surface itself.
plane of this function, if it exists, is called the oriented
It is important that even in the case of a two-
area enclosed by I. The latter differs from the ordinary
dimensional surface, the area is assigned not to the set
area in having a sign. See [4] for simple properties of
of points but to the equivalence class of mappings
an oriented area.
representing the two-dimensional manifold into a space,
The area of surfaces. The areas of polyhedral sur-
and thus differs from a measure.
faces are first and most simply defined as the sums of
k-dimensional area. For a piecewise-smooth immer-
the areas of the planar faces. There are difficulties in
sion f: M ~Rn of a k-dimensional manifold (with or
attempts to introduce the concept of area for curved
without boundary) into n-dimensional Euclidean space,
surfaces as the limit of the areas of inscribed polyhedral
surfaces (in a similar was as the length of a curve is 2:s;:;. k < n, the area is defined by means of a construc-
defined as the limit of inscribed polygonal lines). Even tion that is completely analogous to that described
above for piecewise-smooth surfaces. The only differ-
for a very simple curved surface, the areas of inscribed
ence is that the projection now involves k-dimensional
polyhedra with successively smaller faces may have dif-
tangent planes and that the summation is over the k-
ferent limits, depending on the choice of the sequence
dimensional projection volumes. If one has introduced
of polyhedra. This is clearly demonstrated by Schwartz'
coordinates u I, ... , uk in a domain D eM, then the
example, in which sequences of inscribed polyhedra
area F of the immersion (D'!I D) is expressed by the
with different limits of the area are constructed for the
integral
lateral surface of a right circular cylinder [2]. F = ydet(g'j) du l ... du k , (3)
Most frequently, the area of surfaces is defined for
the class of piecewise-smooth surfaces with piecewise- where gij are the scalar products (af / au i , af / au}). If

242
AREA

k =n -
1 and D is a domain in the coordinate hyper- useful properties of the surface (for example, the possi-
plane (x I, ... ,Xn -I), while f has the explicit form bility of introducing isothermal parameters); in that
Xn =z(xJ, ... ,Xn -d, (3) becomes case, the Lebesgue area is a convenient tool sufficient
to solve the Plateau problem and more general two-
az ]2 dx
F=[ 1+ ~
n -I [

;=1
-a.
X,
l dx n - I (4) dimensional problems in variational calculus. There
arise difficulties in the investigation of variational prob-
If f is a smooth mapping, then the gij are the coeffi- lems for the class of continuous mappings for k>2 (the
cients of the metric tensor induced by the immersion, compactness problem), which has led to the search for
and it follows from (3) that the area defined by the other objects (currents and varifolds) and related
external construction belongs to the intrinsic geometry characteristics of area type. The use of the Lebesgue
of the immersed manifold. In general, (3) is taken as area is restricted also by the complexity in establishing
the definition of area in the case of an immersion not its relationships to other concepts of area and certain
only into Rn but also into a Riemannian manifold. of its properties (for example, the right-hand inequality
On the class of piecewise-smooth immersed mani- in (5)). There are two notable features of L(M,f): for
folds, the area is: a) non-negative; b) equal to 1 on the L(M,f) =0 it may happen that the volume
k-dimensional unit cube in Rn; c) preserved under V({(M)) >0, and, secondly, even for k =2, decompos-
orthogonal transformations; d) additive; e) semi- ing Minto M 1 and M 2 with a common boundary in
continuous, i.e. F({):S;;;;liminfi-+OO F(f;) if J;~f uni- the form of a curve, it may happen that
formly. Further: f) if </>: Rn~Rn is a non-stretching L(M,j) > L(M lo fIM,)+L(M 2 ,fIM,). (7)
mapping, then F(</>0j):S;;;;F(j); and g) if {Pi} is a set of
Studies have also been made concerning which of the
(V pairwise-orthogonal k-dimensional planes and if Pi
properties a) - g) can be retained when additivity is
is the projection onto Pi, then
(;)
replaced by semi-additivity, e.g. assuming an inequal-
F(p;0f> .;;;; F(j)';;;; ~F(P;0f>. (5) ity of the type of (7) one still gets the Lebesgue area.
;=1 Topics of this type have not yet (1983) been examined
Further extensions. The theory of area. The extension completely [7].
of F to more general objects while retaining some of Integral-geometric areas. (Cf. [9].) Let N(/, </>,y) be
properties a) - g) is possible in various ways and leads some multiplicity function for mappings </>: M~Rn at a
to various results. The theory of area deals with these point YERk, for example N =card </>-I(y). Then for
extensions. See [10] for a review of the relations f: M ~Rn, one may define the integral-geometric area
between various concepts of area. f (N(M,Paf,y)dV(y)dv(o), (8)
The boundary between the theory of area and meas- G(n, k)A.k

ure theory is not clear-cut. Traditionally, the theory of where G(n, k) is the Grassmann manifold of k-
area relates primarily to the study of the areas of con- dimensional subspaces Rk eRn, v is the normalized
tinuous mappings, where multiplicity is taken into Haar measure on G(n, k) and pa are the orthogonal
account and the preservation of additivity is less impor- projections on eJEG(n, k). The areas (8) may differ for
tant. See, for example, Hausdorff measure; Favard various multiplicity functions. A very special choice of
measure or k-dimensional measures in n-dimensional N(M, </>,y) makes (8) coincide with the Lebesgue area
space. for k =2 for triangulable M, as well as for k >0 if the
A, area can be introduced on the basis of an Hausdorff measure Hk + 1({(M)) vanishes completely
approximating, integral-geometric, or functional [9]. For the case k =2, various integral-geometric areas
approach or else axiomatically. The most common con- have been introduced also by G. Peano, Hetz and S.
cepts are given below. Banach [7].
The Lebesgue area. It (cf. [7], [9]) is defined by The area of the boundary of a set. The Minkowski
L(M,f) = lim F(j;), (6) area was introduced in connection with the proof of the
Brunn - Minkowski inequality and the classical isoper-
where M is a finitely-triangulable k-dimensional mani- imetric inequality. It is assigned to a set A eRn, but
fold, J;: M ~Rn are all possible sequences of piecewise- characterizes the area of the boundary, and is defined
linear mappings with J;~f, and F(f;) is the k- by
dimensional area of the corresponding polyhedral sur- F(a) = lim hI (V(A +hB)- V(A, (9)
face. The Lebesgue area is the same for Frechet- h~O

equivalent mappings and therefore is a characteristic of where B is the unit sphere in Rn and V denotes the
a Frechet surface. volume. For sets A with a piecewise-smooth boundary
If k = 2, the condition L(M,f) < 00 implies various and for convex sets A, the ordinary limit in (9) exists; it

243
AREA

coincides with the (n -I)-dimensional area of the L(M,f)<oo is naturally put into correspondence with
boundary. Definition (9) still applies to sets in finite- a varifold Vf with mass L(M,f); see [13] for this in the
dimensional normed spaces, where the area (9) may language of currents.
differ from the Hausdorff measure Hn - I (aA) even for References
convex sets A. [1) LEBESGUE, H.: 'Sur la mesure des grandeurs', Monogr. de
There is another useful characteristic analogous to I'Enseign. Math. 1 (1935).
[2) Encyclopaedia of elementary mathematics, Vol. 5, Moscow, 1966
area, which is put into correspondence with a set, but (in Russian).
which characterizes mainly the boundary, namely the [3) IL'IN, V.A. and POZNYAK, E.G.: Foundations of mathematical
perimeter of a measurable set. This is a particular case analysis, 1, Moscow, 1971 (in Russian).
of the concept of a current mass. [4) LoPSHITS, A.M.: Calculating the area of oriented figures, Mos-
cow, 1956 (in Russian).
Intrinsic area. If M is metrized and f: M ~Rn is a [5) POGORELOV, A.V.: Differential geometry, Noordhoff, 1959
locally isometric mapping, one has to consider the rela- (translated from the Russian).
tion between L(M,f) and the Hausdorff measure [6) RASHEWSKl, P.K. [P.K. RAsHEvsKIi): Riemannsche Geometrie
und Tensorana/yse, Deutsch. Verlag Wissenschaft., 1959
Hk(M). When M is a two-dimensional manifold of (translated from the Russian).
bounded c1ln'ature, one has L(M,f)=H 2 (M). In gen- [7) CESARI, L.: Surface area, Princeton Univ. Press, 1956.
eral, on the other hand, a continuous f: M ~Rn induces [8) FEDERER, H.: Geometric measure theory, Springer, 1969.
[9) FEDERER, H.: 'Measure and area', Bull. Amer. Math. Soc. 58,
a generalized metric p distinguished by the possibility no. 3 (1952), 306-378.
p(x,y)=oo on the connected components f-I(u), [10) BURAGO, Yu.D. and ZALGALLER, V.A.: Geometrical inequali-
u ERn. The construction of the k-dimensional Haus- ties, Leningrad, 1980 (in Russian).
[II) BUSEMANN, H.: 'Intrinsic area', Ann. of Math. Ser. 248(1947),
dorff measure applied to p gives a characteristic that 234-267.
can be taken as the intrinsic area of an immersion. For [12) ALMGREN, F.J.: The theory ofvarifolds, Princeton Univ. Press,
fin a Lipschitz class, this coincides with L(M,f) [11]. 1965.
Current masses and varifolds. The integration of k- [13) FEDERER, H.: 'Currents and area', Trans. A mer. Math. Soc. 98,
no. 2 (1961), 204-233.
forms with respect to a k-dimensional piecewise-smooth Yu.D. Burago
oriented manifold M imbedded in Rn leads to the V.A. Zalgaller
current TMq,= t(q,(x), v(xdF(x), being a linear func- L.D. Kudryavtsev
tional of the k-forms q, on Rn. Here p is a unit k-vector AMS 1980 Subject Classification: 53A05, 53825,
tangent to M. The linear functional TM essentially 53C65, 51 M25, 26815, 28A25, 49F20, 58A22,
characterizes M. Also, a non-linear functional can be 58A25
defined (in the same way and for non-oriented M): the
varifold VMq,= 1M I (q" p) I dF. The integral norm AREA FUNCflON - The set function on a sphere Q
equal to the area S (E) of that part of the convex sur-
(masses) II TM II and II VM I coincide with the area, i.e. face F that has spherical image E C Q. This definition
with F(M). The inclusion of the class of piecewise-
remains meaningful for general convex surfaces and it
smooth submanifolds of Rn in more general current
gives a totally-additive set function on the ring of Borel
and varifold classes plays the same role in variational
sets.
calculus as do generalized solutions in the theory of
partial differential equations. References
[I) ALEKSANDROV, A.D.: Mat. Sh. 3, no. I (1938),27-44.
There are extensive classes of integer currents and [2] BUSEMAN, H.: Convex surfaces, Interscience, 1958.
varifolds among all of these. These retain many M.I. Voftsekhovskif
geometric properties of submanifolds. For example, an
Editorial comments. In the article, Q is the unit sphere in
integer-valued current is a current having a representa-
R3 with centre at the origin. If one associates to each point
tion T = ~;: I ni TM" where ni are integers and Mi are x E F the unit normal vector nx and shifts this vector nx so
C I-smooth submanifolds, subject to the condition that that its base coincides with the origin, then the end pOint of
the mass of the (k - I )-dimensional current dT defined nx is a point x on Q The point x' is called the spherical
by dT(q,) = T(dq,) is finite. The masses of integer-valued image of x. The procedure for obtaining the spherical image
currents and varifolds can be considered as an exten- of a point goes under the name of spherical mapping, cf.
sion of the concept of the area of a surface. Here again, Spherical map.
there is a relationship to the Lebesgue area. Let AMS 1980 Subject Classification: 53A05, 52AXX
piecewise-smooth mappings j;: M ~Rn converge uni-
formly, j;~f, and let L(M,f) = limi->ooL(M,j;). Then AREA-FUNCTION - A function inverse to a hyper-
the corresponding varifolds Vi, converge weakly to bolic function, i.e. one of the functions: arcsinh,
some integer-valued varifold Vf' where arccosh, arctanh, arccotanh. See also: Imerse hyper-
I Vf I = L(M, f). Thus, each f: M ~Rn with bolic functions.

244
AREA PRINCIPLE

Editorial comments. functions f(z), z EB, with B a domain, one usually


References understands any inequality having the property that
[1] ABRAMOWITZ, M. and STEGUN, l.A.: Handbook of mathemati- equality holds if and only if the area of the complement
cal functions, National Bureau of Standards, 1964. Dover G of f(B) is zero, and the same applies for a class of
reprint, 1972.
systems of univalent functions {/k(z): z EBdZ =0,
AMS 1980 Subject Classification: 26A09 n = 1,2, ... , where Bk is a domain and G is the com-
plement of U ~ =o!k(Bj ). Usually, such a theorem is
AREA METHOD - A method of solving problems in proved by means of the area principle. That is, one
the theory of univalent functions by means of area considers any regular function Q(w), or more generally,
theorems (cf. Area principle). one having a regular derivative, on G and calculates the
Editorial comments. area a(Q(G of the image of G under the mapping of
References the function Q. Therefore, (2) is a certain extremely
[A 1] DUREN, P.L.: Univalent functions, Springer, 1983, Chapt. 4. general area theorem in the class ~.
Let FE~ and let
AMS 1980 Subject Classification: 30C50
1n F(t)- F(z) = ~
"'" Wp,q
t -p
z
-q
, t, z E~ .
A'

t -z p,q~1
AREA PRINOPLE - The area of the complement to
the image of a domain under a mapping by a function One chooses a suitable function, regular in CF(I::.,), to
regular in it is non-negative. The area principle was be able to write (2) as
first used in 1914 by T.H. Gronwall [1], who in this
way proved the so-called exterior area theorem for func- q~lq Ip~lwp,qXp 12 .;;; P~I~ I Xp 1
2
,
(3)

tions of class ~ - functions


where the xp are any numbers not simultaneously equal
al
F(z) = z +Ilo +-+
z
... , to zero and such that
lim I xp II Ip < 1.
that are regular and univalent in the annulus /:l: = {z: p ..... oo

1< I z 1< oo} (cf. Univalent function). The area More general area theorems in ~ have also been
a(CF(/:l' of the complement of the image F(I::.') of t::.' obtained [5).
under a mapping W=F(Z)E~ can be determined by the Area theorems have been proved for the following:
formula the class IDC(alo ... ,an) of systems {/k(z): /k(O)=ako
z EI::. H= I of functions /k that map the disc I::. confor-
(J(CF(~' = 'TT [1- k~lk I ak 12] ;;;. mally and univalently onto domains without pairwise
common points, i.e. onto non-adjacent domains [6]; the
and consequently class ~(B) (~(B), B 300, is the class of functions F that
00
~klakI2.;;;1. (1) are regular and univalent in B \ { oo} and such that
k ~I F( 00) = 00, limz--+oo F(z) / z = 1), [7]; and non-
By means of (1) the first results were obtained for func- overlapping multiple-connected domains (see [6] and
tions of the classes ~ and S, where S is the class of also [8], [9]). All the area theorems for multiple-
functions f(z)=z + ~:=2akzk that are regular and connected domains can be proved by the method of
uni" lent in the disc I::.={z: I z I <I} (cf. Bieberbach contour integration (cf. Contour integration, method ot).
conjecture; Distortion theorems). A more general area By the area method one understands methods of solv-
theorem has been proved [2]. G.M. Goluzin [3] ing various problems in the theory of univalent func-
extended the area theorem to p-valent functions in the tions by the use of area theorems.
disc (cf. Multivalent function). For instance, from (3) one may obtain by means of
The following area theorem has been proved [4]: Let the Cauchy inequality that
!:. E~, E=CF(I::.'), and let Q(w) be a regular function in
B.1f I ~ wP.qXpX~ 12 .;;; ~ 1- I
~q~1 p~IP
Xp 12 ~ 1- I X~ 12,
q~lq
(4)
+00
Q(F(z)) = ~ ak z -k 'I< const; where xp and x~ are such that the series on the right-
k~-oo
hand side converge. If in (4), for example, xp = t -P,
then +00 x~ =z -q, I t I > 1, I z I> 1, one obtains the chord-
~ k I ak 12 .;;; 0, (2) distortion theorem
k~-oo

and equality holds only if the area a(E) of E is zero. In F(t) - F(z) 12 .;;; In I t 12 In I z 12 .
l t-z It1 2-1 Iz12-1
By the area theorem for a certain class of univalent

245
AREA PRINCIPLE

Area theorems, for example in the class WC(a I, . . . , an), is then called an Arf-invariant [1]. If this expression
give necessary and sufficient conditions for a system equals zero, then there is a symplectic basis on all ele-
{}k(z): jk(O)=ak, ZELl}Z=1 of meromorphic functions ments of which ~o vanishes; if this expression equals
jk to belong to the class WC(al,' .. ,an) (see [6], p. one, then there is a symplectic basis on all elements of
179). which, except e I and fl, the form ~o vanishes, while
References tJ;o(ed = tJ;O(jI) = l.
[1] GRONWALL, T.H.: 'Some remarks on conformal representa-
tion', Ann. of Math. Ser. 2 16 (1914 - 1915),72-76. References
[2] PRAWITZ, H.: 'Ueber Mittelwerte analytischer Funktionen', [1] ARF, c.: 'Untersuchungen tiber quadratischen Formen in
Arkiv. Mat. Astron., Fysik lOA, no. 6 (1927), 1-12. K6rpem der Charakteristik 2, 1', J. Reine Angew. Math. 183
[3] GOLUZIN, G.M.: 'On p-valued functions', Math. Sb. 8, no. 2 (1941), 148-167.
(1940), 277-284 (in Russian). A. V Chernavskif
[4] LEBEDEV, N.A. and MILIN, I.M.: 'On the coefficients of certain Editorial comments. See [A 1], Appendix 1, for material
classes of analytic functions', Mat. Sb. 28, no. 2 (1951),359-
concering the Arf-invariant for inner product spaces over a
400. (in Russian).
[5] NEHARI, Z.: 'Inequalities for the coefficients of univalent func- field F of characteristic 2 in relation to the Witt algebra of
tions', Arch. Rational Mech. and Anal. 34, no. 4 (1969),301- quadratic inner product spaces over F.
330.
References
[6] LEBEDEV, N.A.: The area principle in the theory of univalent
[A1] MILNOR, J. and HUSEMOLLER, D.: SymmetriC bilinear forms,
functions, Moscow, 1975 (in Russian).
Springer, 1973.
[7] MILIN, I.M.: Univalent functions and orthonormal systems, Mos-
cow, 1971 (in Russian). AMS 1980 Subject Classification: 10C05
[8] ALENITSYN, YU.E.: 'Area theorems for functions that are ana-
lytic in a finitely-connected domain', Izv. Akad. Nauk SSSR
Ser. Mat. 37, no. 5 (1973), 1132-1154 (in Russian). ARGUMENT - 1) The argument of a function is the
[9] GULTYANSKIi, V.YA. and SHCHEPETOV, V.A.: 'A general area variable (also called the independent variable) on which
theorem for a certain class of q-quasi-conformal mappings', the value of the function depends.
Dokl. Akad. Nauk SSSR 218, no. 3 (1974), 509-512 (in Rus-
sian). 2) The argument of a complex number
[10] GRUNSKY, H.: 'Koeffizientenbedingungen fUr schlichtabbil- Z = x + ry = r( cos <p + i sin <p), represented in the plane by
dender meromorphe Funktionen', Math. Z. 45, no. 1 (1939), the point with coordinates x and y, is the angle <p of the
29-61.
N.A. Lebedev radius vector r of this point with the x-axis.
Editorial comments. The inequalities (3) and (4) are a AMS 1980 Subject Classification: 30-01
form of the Grunsky inequalities.
References
ARGUMENT, PRINOPLE OF THE, argument prin-
[A 1] DUREN, P.L.: Univalent functions, Springer, 1983, Chap!. 1 O. ciple - A geometric principle in the theory of functions
of a complex variable. It is formulated as follows: Let
AMS 1980 Subject Classification: 30C50, 30CXX
D be a bounded domain in the complex plane C, and
ARF-INV ARIANT, invariant of Arf - An invariant of let, moreover, the boundary aD be a continuous curve,
a quadratic form modulo 2, given on an integral lattice oriented so that D lies on the left. If a function w = f(z)
endowed with a bilinear skew-symmetric form. Let II is meromorphic in a neighbourhood of D and has no
be an integral lattice of dimension k = 2m and let zeros or poles on aD, then the difference between the
~(x,y) be a form for which ~(x,y)= -~(y, x). There number of its zeros N and the number of its poles P
exists bases {e I ,II, ... , em,fm }, called symplectic inside D (counted according to their multiplicity) is
bases, in which the matrix of ~(x, y) reduces to block- equal to the increase of the argument of f when travel-
diagonal form: The diagonal contains the blocks ling once around aD, divided by 27T, i.e.
I
N- P = 27T !:J.aDargj,

l.e. where argf denotes any continuous branch of Argf on


IHei ,!,) = -tJ;(j;, e,l = 1, the curve aD. The expression on the right-hand side
while the other entries are zero. equals the index indof(aD) of the curve f(aD) with
Suppose that a mapping respect to the point w =0.
tJ;0: IT ..... Z2
The principle of the argument is used in the proofs
of various statements on the zeros of holomorphic func-
is given on II such that
tions (such as the fundamental theorem of algebra on
tJ;o(x+y) = tJ;o(x)+tJ;o(Y)+tJ;(x,yl Il)od2 polynomials, the theorem of Hurwitz on zeros, etc.).
(a 'quadratic form modulo 2'). The expression From the principle of the argument follow many other
m important geometric principles of function theory, e.g.
~ tJ;o(e,)tJ;o(j;) the principle of invariance of domain (cf. Invariance of
I I

246
ARITHMETIC

domain, principle of), the maximum-modulus principle concept of a number, methods and means of calcula-
and the theorem on the local inverse of a holomorphic tion, the study of operations on numbers of different
function. In many questions the principle of the argu- kinds, as well as analysis of the axiomatic structure of
ment is used implicitly, in the form of its corollary: the number sets and the properties of numbers. When
Rouche theorem. referring to the logical analysis of the concept of a
There are generalizations of the principle of the argu- number, the term theoretical arithmetic is sometimes
ment. The condition that f be meromorphic in a neigh- employed. Arithmetic is closely connected with algebra,
bourhood of 15 may be replaced by the following: f has which includes the study of operations performed on
only a finite number of poles and zeros in D and numbers. The properties of integers form the subject of
extends continuously to aD. Instead of the complex number theory (d. Elementary number theory; Number
plane, an arbitrary Riemann surface may be con- theory).
sidered: the boundedness of D is then replaced by the The term 'arithmetic' is also sometimes employed to
condition that 15 be compact. From the principle of the denote operations performed on objects of very dif-
argument for a compact Riemann surface it follows ferent kinds: 'matrix arithmetic', 'arithmetic of qua-
that the number of zeros of an arbitrary meromorphic dratic forms', etc.
function, not identically equal to zero, is equal to the The art of computation arose and developed long
number of poles. The principle of the argument for before the times of the oldest written records extant.
domains in C is equivalent to the theorem on the sum The oldest mathematical records are the Cahoon papyri
of the logarithmic residues (d. Logarithmic residue). and the famous Rind papyrus, which is believed to date
For this reason, the following statement is sometimes back to about 2000 B.c.. An additive hieroglyphic sys-
called the generalized principle of the argument. If f is tem for the representation of numbers (d. Numbers,
meromorphic in a neighbourhood of a domain 15 which representations of) enabled the ancient Egyptians to
is bounded by a finite number of continuous curves perform addition and subtraction operations on natural
and if f has no zeros or poles on aD, then for any fun:: numbers in a relatively simple manner. Multiplication
tion cp which is holomorphic in a neighbourhood of D was carried out by doubling, i.e. the factors were
the equality decomposed into sums of powers of two, the individual
1 1" {z I N P summands were multiplied, and the components added.
-2. (<j>(Z)T(
) dz = ~<j>(ad- ~<j>(bk) Operations on fractions (cf. Fraction) were reduced in
'lTl iD J' Z k= I k- I
Ancient Egypt to operations on aliquot fractions, i.e. on
holds, where the first sum extends over all zeros and
fractions of the type I / n. More complicated fractions
the second sum extends over all poles of finD. There
were decomposed with the aid of tables into a sum of
is also a topological generalized principle of the argu-
aliquot fractions. Division was carried out by subtract-
ment: The principle of the argument is valid for any
ing from the number to be divided the numbers
open mapping f: D~C that is locally finite-to-one and
obtained by successive doubling of the divisor. The
extends continuously to aD, while 0, 00 fI-f(aD). clumsy hexadecimal system of the ancient Babylonians
An analogue of the principle of the argument for func- gave rise to many difficulties in executing arithmetic
tions of several complex variables is, for example, the operations. There are numerous tables employed by
following theorem: Let D be a bounded domain in en ancient Babylonians to effect multiplication and divi-
with Jordan boundary aD and let f: 15~c,,- be a holo- SIOn.
morphic mapping of a neighbourhood of D such that In Ancient Greece arithmetic was conceived as the
Otlf(aD); then the number of pre-images of 0 in D study of the properties of numbers; practical calcula-
(counted according to multiplicity) is equal to tions were not included. Problems on the technique of
indof(aD). operations on numbers, i.e. methods of calculation,
References were considered to be an independent science - logis-
[I] LAWRENTJEW, M.A. [M.A. LAVRENT'EV] and SHABAT, B.V.: tics. This differentiation was inherited from the Greeks
Methoden der komplexen Funktionentheorie, Deutsch. Verlag
by Europeans in the Middle Ages. It was only with the
Wissenschaft., 1967 (translated from the Russian).
[2] SHABAT, B.V.: Introduction to complex analySiS, 2, Moscow, advent of the Renaissance that both the beginnings of
1976 (in Russian). the theory of numbers and the practice of calculation
E.M. Chirka
began to be included in the concept of arithmetic.
AMS 1980 Subject Classification: 30AXX, 32AXX Greek mathematics made a sharp distinction between
the concepts of a number and of a magnitude. To
AluTHMEnc - The science of numbers and opera- Greek mathematicians numbers meant only what is
tions on sets of numbers. Arithmetic is understood to known as natural numbers in our own days (d. Natural
include problems on the origin and development of the number); they also distinguished between concepts of

247
ARITHMETIC

different types, such as a number and a geometrical Indian mathematics exerted a decisive influence on
magnitude. There are no logistic studies by ancient the development of the knowledge of the arithmetic of
Greeks extant, but it is known that their multiplication the Arabs. The treaty on arithmetic written by
technique was close to our own. Their alphabetic Muhammed AI-Khwarizmi in the 9-th century greatly
number system severely complicated operations carried contributed to the dissemination of the Indian decimal
out on numbers. In Ancient Greece ordinary fractions notation and of the methods of addition, subtraction,
were employed in calculations; however, a fraction was multiplication, division and extraction of square roots.
not considered as a number in its own right, but merely Many ancient nations performed their calculations
as a quotient of natural numbers. on an abacus, which had replaced the primitive count-
Books 7 - 9 of Euclid's Elements (3-rd century B.C.) ing on fingers. The external form of the abacus varied,
deal exclusively with arithmetic in the sense in which but its principle remained the same: ruling a column,
the word was employed in ancient times. They mainly or some other cascade-type marking of numbers. An
deal with the theory of numbers: the algorithm for abacus was used by the Greeks long before the Chris-
finding the greatest common divisor (cf. Euclidean algo- tian era. The Chinese suan-pan and the Russian schety,
rithm), and with theorems about prime numbers (cf. which externally resemble each other, are also variants
Prime number). Euclid proved that multiplication is of the abacus.
commutative, and that it is distributive with respect to While the European studies in the domain of number
the operation of addition. He also studied the theory theory are based on Greek mathematics, in particular
of proportion, i.e. the theory of fractions. Other books on the work of Euclid and Diophantus, this is not true
of his treatise include the general theory of relations of computational techniques. The development of arith-
between magnitudes, which may be considered as the metic in Europe is connected with the arrival of the
beginning of the theory of real numbers (cf. Real Indian positional decimal system and of Arabic
number). numerals in Europe. The technique of arithmetic opera-
The manuscripts of Diophantus (probably 3-rd cen- tions was not taken over from India directly, but rather
tury A.D.) that are left to us contain operations on by way of the work of Al-Khwarizmi and other Arab
powers (not greater than 6) of numbers, and a few mathematicians.
examples of subtraction operations. These are impli- The abacus was extensively employed in the Middle
citly operations on negative numbers. Diophantus Ages. It also became a synonym of the word arithmetic,
applied his own rules to rational numbers only (cf. so that Leonardo da Pisa (13-th century) called his
Rational number). arithmetical treatise The book of abacus. The book
Chinese mathematicians in the 2-nd century per- quotes calculation methods taken from Arab sources,
formed operations on fractions and negative numbers. while introducing many original improvements. Thus,
At a somewhat later period they studied the extraction the addition of fractions is performed by way of the
of square roots and cubic roots, the approximate values least common multiple of the denominators, and the
of which were expressed as decimal fractions (cf. operations are checked not only, like the Indians, in the
Decimal fraction). The methods employed by Chinese base nine, but also in certain other bases. The problems
mathematicians in solving problems in arithmetic, in dealt with in the book include the rule-of-three, the rule
particular the rule of two false assumptions, appears in of alligation, problems involving recurrent sequences,
several manuals on arithmetic, first in Arab, then in arithmetic progressions and geometric progressions (cf.
European manuals. Very little is known about the ini- Arithmetic progression; Geometric progression). In
tial development of arithmetic in India. The simplest Europe decimal fractions began to be employed in the
fractions were utilized in India long before the Chris- 15-th century, but became widely known only in the
tian era. Our own decimal computation system is of 16-th century following the publication of the studies of
Indian origin. The earliest written Indian mathematical S. Stevin.
records extant were compiled in the 5-th century and Various methods for the multiplication and division
indicate that the knowledge of arithmetic in India of of multiplace numbers were proposed in the 15-th, 16-
that period was of a high standard. Indian mathemati- th and in the later centuries. They differed from each
cians operated on integers and fractions using methods other only by the system of notation of the intermedi-
very similar to our own. They solved many problems ate computations. A. Riese, through numerous text-
on proportions and on the rule-of-three, and could books, was most influential in the move to replace the
compute percentages. Studies on negative numbers old computation (in terms of counters and Roman
began in India in the 7-th century. The works of numerals) by the newer methods (using pen and
Bhaskara II The wreath of science (l2-th century) con- Hindu-Arab numerals).
tain rules for the multiplication and division of negative In Europe negative numbers were first used by
numbers. Leonardo da Pisa, who treated them as one would treat

248
ARITHMETIC

debts. A system of operations on negative numbers was that in order to built a theory, it is sufficient to decom-
presented in the l6-th century by M. Stiefel. He pose it into a small number of clear, simple, primary
referred to these numbers as 'fictitious'. Proofs for the assumptions and to ensure that all basic postulates of
rules of operation on negative numbers were still stu- the theory can be deduced from them in a purely logi-
died as late as the l8-th century, and it is only owing cal manner. It was tacitly assumed that the connection
to the critical reasoning in the second half of the 19-th of these fundamental principles with the real world
century that such studies have ceased to be taken seri- should be immediately perceptible.
ously. The method of models, used to give foundations of
In Europe, up to the l5-th or l6-th centuries, arith- mathematical theories, was discovered in the 19-th cen-
metic operations on irrational numbers were limited to tury. The use of such a method was inevitable, since
the extraction of square roots. Nevertheless, Leonardo certain objects and theories studied in mathematics had
da Pisa considered the problem of the approximate cal- not found a real-life interpretation. These include,
culation of cubic roots as well as square roots. S. dal mainly, complex numbers, ideals, and non-Euclidean
Ferro (around 1500) and N. Tartaglia (l6-th century) and n-dimensional geometries. The method of models
used cubic roots in solving equations of the third permitted one to reduce the consistency of one
degree. There was no general treatment of operations mathematical theory to the consistency of another.
performed on real numbers. The concept of a real Thus, on the assumption that Euclidean geometry is
number was assimilated in mathematics only gradually, consistent, is was proved that Lobachevskii's geometry
in connection with the development of analytic was consistent, while the consistency of Euclidean
geometry and mathematical analysis. Up to the l8-th geometry was reduced to the consistency of the arith-
century, proofs of operations on irrational numbers metic of real numbers.
were limited to magnitudes expressible by radicals. At the end of the 19-th century, the foundation of
Complex numbers were encountered at various times, arithmetic seemed complete. R. Dedekind and G.
beginning with Indian mathematics, in solving qua- Peano, independently of each other, gave an axiom sys-
dratic equations. However, imaginary solutions were tem for the natural numbers from which all known pro-
discarded as non-existent. The arithmetic of complex positions of this science could be deduced. K. Weier-
numbers (cf. Complex number) begins with R. Bombelli strass proposed to take pairs of natural numbers as a
(l6-th century), who gave formal rules for performing model for the integers and positive rational numbers.
arithmetic operations on such numbers. However, even The geometrical representation of complex numbers,
as late as the l7-th century such operations were per- discovered by J. Argand, C Wessel and CF. Gauss, is
formed in a manner similar to those on real numbers, in essence a model for the theory of complex numbers
which frequently led to errors. It is only in the l8-th in the framework of the theory of real numbers.
century that a precise definition of the arithmetic of Finally, the set-theoretic approach allowed Dedekind,
complex numbers could be given, owing to the formu- G. Cantor and Weierstrass to built the theory of real
las of A. de Moivre and L. Euler. numbers.
The idea of logarithms dates back to Archimedes (3- However, following the discovery of paradoxes in set
rd century B.C), who compared the terms of arithmetic theory, there arose the question of how to give a foun-
and geometric progressions. Stiefel (l6-th century) dation for the arithmetic of natural and real numbers.
extended these progressions to the left by adding nega- How can one be sure there are no paradoxes in those
tive powers. He showed that there is a connection branches of mathematics as well? Direct perception
between operations carried out on these series, thus does not tell one that the universe is infinite or that
introducing the fundamental idea of a logarithm. Tak- matter is infinitely divisible. For this reason the ideas
ing logarithms as a routine stage in calculations was of the infinite set of natural numbers and the continu-
introduced by 1. Napier and 1. Burgi in the first half of ous number axis may be considered as not directly con-
the l7-th century. nected with the physical world. On the other hand,
The first calculating machines were built in the l7-th there is no simpler model of the arithmetic of natural
century by W. Schickard and B. Pascal, who worked numbers than the theory of natural numbers itself,
independently of each other; they were the prototypes while models of the theory of real numbers are based to
of modern calculators. However, it was only in the 19- a considerable extent on the apparatus of set theory,
th century that such machines began to be extensively the reliability of which seems to be open to doubt.
employed in practical work. The rise of fast electronic What should be the methods and means of reason-
computers in middle of the 20-th century enhanced the ing, not involving models, which would directly indi-
importance of performing an algorithmic operation in a cate the absence of inconsistencies in a given theory -
minimum number of elementary operations. or, in other words, which would show that a logical
It has been considered ever since the times of Euclid reasoning based on the axioms of the theory will never

249
ARITHMETIC

yield results which contradict each other? itself. G. Gentzen in 1936 gave a proof of the con-
In the view of D. Hilbert, the reason for the appear- sistency of formal arithmetic, using transfinite induc-
ance of paradoxes in set theory is that modes of rea- tion up to the transfinite number Eo. One must natur-
soning that are undoubtedly applicable to finite sys- ally inquire into the consistency of the means used in
tems of objects, are illegitimately applied to infinite the proof. Other approaches to the problem of con-
families. This may be avoided if the symbols employed sistency of the arithmetic of natural numbers were also
are considered as objects of some new theory, and if investigated in this context.
the logical conclusions are expressed with the aid of a Attempts to overcome the difficulties involved in the
formal process. In such a case a statement in a theory foundation of the theory of real numbers played a cer-
is expressed as a formula constructed out of a finite set tain role in the development of the constructive
of symbols, while a proof is represented by a finite approach in mathematics.
chain of formulas, obtained according to certain rules References
from formulas known as axioms (cf. Axiom). In this [I] The history of mathematics, Moscow, 1970-1973 (in Russian).
way, according to Hilbert, it will be possible to operate [2] WAERDEN, B.L. VAN DER: Ontwakende wetenschap, Noordhoff,
1957.
on finite rather than on infinite objects and to obtain a
[3] KLEENE, S.c.: Mathematical logic, Wiley, 1967.
reliable procedure for establishing the consistency of [4] HILBERT, D.: Grundlagen der Geometrie, Springer, 1913.
any theory. Hilbert hoped that this approach would [5] Encyclopaedia of elementary mathematics, Vol. 1, Moscow-
yield, in the first place, a positive solution to the prob- Leningrad, 1951-1963 (in Russian).
[6] MOLODSHIi, V.N.: Fundamentals of the study of numbers in the
lem of the consistency of the arithmetic of natural 18-th and the beginning of the 19-th century, Moscow, 1963 (in
numbers and that it would show that the addition of Russian).
any unprovable formula in number theory to the for- [7] NECHAEV, V.l.: Number systems, Moscow, 1975 (in Russian).
[8] STRUIK, DJ.: A concise history of mathematics, Dover, reprint,
mulas of arithmetic converts this system of axioms into 1967.
an inconsistent system. [9] THuRSTON, H.A.: The number-system, Biackie, 1956.
However, these hopes were smashed. K. G6del in A.A. Bukhshtab
1931 demonstrated that formal arithmetic (cf. Arith- v.I. Pechaev
metic, formal) is incomplete. He found, moreover, that Editorial comments.
for any consistent formal system containing the axioms
References
of arithmetic, it is possible to find an explicit descrip- [A1) WAERDEN, B.L. VAN DER: Geometry and algebra in ancient
tion of some closed formula u such that neither the for- civilisations, Springer, 1983.
mula u itself nor its negation can be deduced in the
AMS 1980 Subject Classification: 00A25, 10-XX,
formal system. 01A05
Using this result it is possible to prove that non-
isomorphic models of formal arithmetic exist. At the ARITHMETIC CONTINUUM - The number axis.
same time, the system of Peano axioms is categorical. AMS 1980 Subject Classification: 26A03
How is this to be explained? The system of Peano
axioms contains the induction axiom: Any natural ARITHMETIC DISTRIBUTION - A discrete proba-
number has a certain property P if the number 1 has bility distribution concentrated on a set of points of the
this property and if for any natural number n that has form -+-nh, where h >0 and n = 1, 2, .... It is a special
the property P the natural number n + 1 also has this case of a lattice distribution.
property. In this axiom P may stand for any conceiv- A. V. Prokhorov
able property of natural numbers. In the corresponding AMS 1980 Subject Classification: 60-XX
axiom of formal arithmetic, P may denote only such
properties of natural numbers as can be expressed by ARITHMETIC, FORMAL, arithmetical calculus - A
the methods of the given formalism. The difference logico-mathematical calculus which formalizes elemen-
between these two axioms is immaterial if one discusses tary number theory. The language of the most common
theorems of elementary number theory, but is very kind of formal arithmetic contains the constant 0,
material in the explanation of the properties of the for- numerical variables, the equality sign, the function sym-
bols +, " ' (successor) and logical symbols. The terms
mal theory.
G6del also showed that a consistent formal system
which includes formal arithmetic contains a formula

are built from the constant and the variables of the
function symbols; in particular, natural numbers are
expressing its consistency, and that this formula cannot denoted by terms of the form 0' ... '. The atomic formu-
be proved in this system. Thus, the consistency of such las are equalities of terms; the remaining formulas are
a formal system can only be substantiated by means built from atomic formulas by means of the logical
that are stronger than those formalized in the system connectives &, V, ~, -', "d, 3. Formulas in the

250
ARITHMETIC. FORMAL

language of formal arithmetic are known as arithmetical tions and the corresponding additional postulates. For-
formulas. The postulates of formal arithmetic are the mal arithmetic satisfies both Godel incompleteness
postulates of predicate calculus (classical or intuitionis- theorems (cf. GOdel incompleteness theorem). In partic-
tic, depending on the formal arithmetic in question), ular, there exist polynomials P, Q in several variables
the Peano axioms such that the formula 'v'X(P(X)==Q(X)) cannot be
a'=b' ~a=b, .....,(a'=O), (a=b&a=c)~b=c, proved even though expressing a true statement - the
consistency of the system of formal arithmetic. A com-
a=b ~ a'=b', a+O = a, a+b' = (a+b)',
binatorial (Ramsey-type) theorem independent of for-
a'O = 0, ab' = (ab)+a, mal arithmetic has been constructed [6].
and the axiom scheme of induction: In studying the structure of a system (in particular,
A(O)& \fx(A(x)~A(x')) ~ A(x), the problems of consistency), formal arithmetic is for-
mulated without quantifiers, but with Hilbert's (-
where A is an arbitrary formula, known as the induction
symbol. The formulas of this system are quantifier-free,
formula.
but it is permitted to use terms of the type (xA (x)
The means of formal arithmetic are adequate for the
(yielding some x which satisfies condition A if it exists;
derivation of the theorems presented in standard
otherwise 0). The postulates of the (-system are the pos-
courses on elementary number theory. At the time of
tulates of propositional calculus, the equality rules, the
writing (the 1980's) it would appear that no significant
rule of substitution for the free numerical variables, the
number-theoretic theorem which can be demonstrated
axioms for the functions +, " ',pd (predecessor), and
without recourse to analysis is known which could not
the following (-axioms:
be demonstrated by formal arithmetic. To write down
and prove such theorems in formal arithmetic, the A(a) ~ A (xA (x)); A(a) ~ xA(x)=rt=a';
expressive potential of the latter must be investigated. -A (xA (x)) ~ xA(x)=O.
The expression of many number-theoretic functions in
The quantifiers are introduced by means of abbrevia-
terms of formal arithmetic is especially important. In
tions:
particular, statements about primitive (and even partial) 3xA(x)~A(xA(x)); \fxA ~.....,3x-A.
recursive functions (cf. Recursive function) may be The induction axiom turns out to be derivable.
rewritten in the language of formal arithmetic. In this The formulas of formal arithmetic with free variables
way it is possible to prove formulas expressing impor- define number-theoretic predicates. Formulas not con-
tant properties of partial recursive functions, in particu- taining free variables (closed formulas) express proposi-
lar their defining equations. Thus, the equation tions. A k-place predicate g; about natural numbers is
f(x')=g(x,f(x)) is expressed by the formula called arithmetic if there is an arithmetical formula
\fa \fb \fc(F(x', a) & F(x, b) & G(x, b, c) ~ a =c), P (x], ... ,Xk) such that for any natural numbers
n 1, . . . ,nk, the relation
where F and G are formulas representing the graphs of
the functions f and g, respectively. It is possible to for- P(nj, ... ,nd ~ <nj, ... ,nk>E9
mulate statements about finite sets in the language of is valid. This determines a classification of arithmetic
formal arithmetic. Moreover, classical formal arithmetic predicates by the type of prefix in the prenex form of
is equivalent to the Zermelo- Fraenkel axiomatic set the respective formula. The class ~n (the class TIn) con-
theory without the axiom of infinity: A model of any sists of predicates Pea) representable by a formula of
one of these systems can also be constructed in the the form
other system.
QjXj ... Qnxn(f(xj, ... ,xn, a)=O),
The deductive force of the system of formal arith-
metic is characterized by the ordinal 0 (the least solu- where f is a primitive recursive function, Q1 is 3 (or,
tion of the equation w =(): It is possible to deduce in respectively, 'v') and Qi are alternating quantifiers (i.e.
formal arithmetic the scheme of transfinite induction up Qi + 1 is Qi where V is 3, and "3 is 'v'). Each such class
to any ordinal a<o, but not the scheme of induction contains a universal predicate, i.e. a predicate T(e, a)
up to 0. The class of provably-recursive functions of the such that for each one-place predicate P of this class
system of formal arithmetic (i.e. the class of partial there exists a number ep for which the identity
recursive functions, the general recursiveness of which 'v'x(T(ep ' x) P(x)) is valid. Multi-place predicates can
can be established by the means of formal arithmetic) be reduced to one-place predicates with the aid of func-
coincides with the class of ordinal recursive functions tions which code systems of natural numbers. For any
with ordinals <0. This makes it possible to extend n >0 the inequality ~n==TIn is valid, and a class with a
formal arithmetic in certain ways, e.g. to have a formal lower index is properly included in any class with a
arithmetic with symbols for all primitive recursive func- larger index. This classification of predicates generates

251
ARITHMETIC, FORMAL

a classification of arithmetic functions based on the no quantifiers, it follows that formal arithmetic is con-
classification of their graphs. Not all number-theoretic sistent. If Gooel's second incompleteness theorem is
predicates are arithmetic: an example of a non- now applied, it follows that the cut elimination theorem
arithmetic predicate is the predicate T(x) such that for from any proof in formal arithmetic cannot be
any closed arithmetical formula A the identity demonstrated by the means of formal arithmetic;
T<rAl):=A is valid, where rAl is the number of the nevertheless, for any given N a proof in formal arith-
formula A in some fixed numbering satisfying some metic is possible for any derivation with complexity of
natural conditions. The predicate T plays an important induction ';;;;;N. Passing to w-proofs makes it possible to
role in the studies of the structure of formal arithmetic, establish many meta-mathematic theorems about the
in particular in the problem of the independence of its system of formal arithmetic, in particular completeness
axioms. The addition to formal arithmetic of the sym- with respect to formulas of ~I and the ordinal charac-
bol T with axioms of the type ter of provably-recursive functions.
T<rA&Bl) T<rAl)&TnBl), which express its com- References
mutativity with logical connectives, makes it possible to [1] KLEENE, S.C.: Introduction to metamathematics, North-Holland,
prove that formal arithmetic is consistent. The same 1951.
[2] HILBERT, D. and BERNAYS, P.: Grundlagen der Mathematik, 1-
construction, this time inside formal arithmetic, may be 2, Springer, 1968-1970.
carried out for a subsystem A [n1 of formal arithmetic [3] NOVIKOV, P.S.: Elements of mathematical logic, Edinburgh,
in which the induction scheme is restricted by the fol- 1964 (translated from the Russian).
[4] KREISEL, G.: 'A survey of proof theory', J. Symbolic Logic 33,
lowing condition: The complexity of the induction for- no. 3 (1968), 321-388.
mula is ';;;;;n, i.e. it belongs to Iln U ~n' The function of [5] GODEL, K.: 'Ueber eine bisher nicht beniitzte Erweiterung des
T is successfully performed by, say, the universal predi- finiten Standpunktes', Dialectica 12 (1958), 280-287.
[6] PARIS, J. and HARRINGTON, L.: 'A mathematical incomplete-
cate for ~n + I, and the corresponding proof is per- ness in Peano arithmetic', in 1. Barwise (ed.): Handbook of
formed in the system A [n + 11' It follows from Godel's mathematical logic, North-Holland, 1977, pp. 1133-1142.
second incompleteness theorem that A [n + 11 is stronger G.E. Mints
than A [n1' so that formal arithmetic does not coincide AMS 1980 Subject Classification: 03830
with anyone of the systems A [n1' and the induction
scheme cannot be replaced by any finite set of axioms. ARITHMETIC FUNcrION, number-theoretic function
Formal arithmetic turns out to be complete with - A complex-valued function, the domain of definition
respect to formulas from the class ~I: A closed formula of which is one of the following sets: the set of natural
of this class is provable in formal arithmetic if and only numbers, the set of rational integers, the set of integral
if it is true. Since the class LI contains an ideals of a given algebraic number field, a lattice in a
algorithmically-unsolvable predicate, it follows that the multi-dimensional coordinate space, etc. These are
problem of provability in formal arithmetic is algo- arithmetic functions in the wide sense. However, the
rithmically unsolvable. term is often employed to denote a function of the
If formal arithmetic is defined as the Gentzen formal above type with special arithmetic properties. The most
system of the usual type, a cut (the analogue of modus commonly occurring arithmetic functions have tradi-
ponens in Gentzen systems) cannot be eliminated. tional symbolic notations: cp(n) is the Euler function;
Elimination of a cut becomes possible if the induction d (n) or T( n) is the number of divisors; p,( n) is the
scheme is replaced by the rule of infinite induction (w- Mobius function; A(n) is the Mangoldt function; a(n) is
rule): the sum of divisors of the number n. Arithmetic func-
A (0) ... A(N) ...
tions also include the integral part of a number, [x],
V'xA(x)
and the fractional part of a number, {x}. Arithmetic
One of the first proofs of the consistency of formal functions giving the number of solutions of an equation
arithmetic was obtained in this way. A manageable for- are also studied; for example, r (n) is the number of
mulation of the w-rule is: 'The number e is a number of integer solutions x and y of the equation x 2 +y2 = n in
the general recursive function which describes the the Goldbach problem; J (N) is the number of solutions
derivation of length <io by the w-rule' (w-prooj). This in prime numbers of the equation N=PI+P2+PJ'
predicate belongs to the class Il 2 , and the system Other arithmetic functions express the quantity of
obtained is not formal, but only semi-formal. To each numbers satisfying certain conditions; thus, for
derivation of a formula A in formal arithmetic it is pos- instance, the function 7T(X) - the number of primes not
sible to assign a number e such that the statement 'e is larger than x - describes the distribution of primes;
an w-proof of the formula A without a cut' is true, and 7T(X, q, I) gives the number of primes not larger than x
can even be proved in formal arithmetic. Since a cut- in the arithmetic progression p 1 (modq). The Che-
free w-proof of a closed quantifier-free formula contains byshev functions also deal with properties of primes:

252
ARITHMETIC GROUP

O(x) is the sum of the natural logarithms of the prime Pa(X) == (-It(X(X, (!) x)-I),
numbers up to x, while "'(x) = ~ n~x A(n) (d. Che- where n
byshev function). X(X, (!) x) == ~(-IY dimk H;(X, (!) x)
;=0
Algebraic number theory deals with generalizations
of the above arithmetic functions of a natural argu- is the Euler characteristic of the variety X with coeffi-
ment. Thus, for instance, in an algebraic field K of cients in the structure sheaf (!) x. In this form the defin-
degree n, Euler's function CP(U) - the number of resi- ition of the arithmetic genus can be applied to any
due classes by the ideal U mutually prime with U - is complete algebraic variety, and this definition also
introduced for an integral ideal U. shows the invariance of Pa(X) relative to biregular map-
Arithmetic functions appear and are employed in pings. If X is a non-singular connected variety, and
studies on the properties of numbers. However, the k = C is the field of complex numbers, then
n-I
theory of arithmetic functions is also of independent
Pa(X) == ~gn-I(X)'
interest. The laws governing the variations of arithmetic ;=0
functions cannot usually be described by simple formu- where gk(X) is the dimension of the space of regular
las, and the asymptotic behaviour in terms of numerical differential k-forms on X. Such a definition for n = 1,2
functions is determined. Since many arithmetic func- was given by the school of Italian geometers. For
tions are not monotone, the study of their average example, if n = 1, then Pa(X) is the genus of the curve
values is of great importance. An important class of X; if n =2,
arithmetic functions is constituted by the multiplicative
arithmetic functions and the additive arithmetic func- where q is the irregularity of the surface X, while Pg is
tions (cf. Multiplicative arithmetic function; Additive the geometric genus of X.
arithmetic function). The problem of the distribution of For any divisor D on a normal variety X, O. Zariski
their values is studied in probabilistic number theory (see [1]) defined the virtual arithmetic genus Pa(D) as the
[5]. constant term of the Hilbert polynomial of the coherent
References sheaf (!) xeD) corresponding to D. If the divisors D and
[I] WINOGRADOW, I.M. [I.M. VINOGRADOV]: Elemente der D' are algebraically equivalent, one has
Zahlentheorie, Oldenburg, 1956 (translated from the Russian).
[2] VINOGRADOV, I.M.: The method of trigonometric sum, in th" Pa(D) == Pa(D').
theory of numbers, Interscience, 1954 (translated from the Rus- The arithmetic genus is a birational invariant in the
sian).
[3] HUA, L.-K.: 'Abschatzungen von Exponentialsummen und ihre case of a field k of characteristic zero; in the general
Anwendung in der Zahlentheorie', in Enzyklopaedie der case this has so far (1977) been proved for dimensions
Mathematischen Wissenschaften mit Einschluss ihrer Anwen- n~3 only.
dungen. Vol. I, 1959, Heft 13, Teil I.
[4] CHANDRASEKHARAN, K.: Arithmeticalfunctions, Springer. 1970. References
[5] KUBILYUS. I.: Probabilistic methods in the theory of numbers. [I] BALDASSARRl, M.: Algebraic varieties, Springer, 1956.
Amer. Math. Soc . 1964 (translated from the Russian). [2] HIRZEBRUCH, F.: TopolOgical methods in algebraic geometry,
N.I. Klimov Springer, 1978 (translated from the German).] V D I h
. . 0 gac ev

Editorial comments. AMS 1980 Subject Classification: 14A99


References
ARITHMETIC GROUP - A subgroup H of a linear
[A1] HARDY, G.H. and WRIGHT, E.M.: An introduction to the
theory of numbers, Oxford Univ. Press, 1979, Chapt.16. algebraic group G defined over the field Q of rational
numbers, that satisfies the following condition: There
AMS 1980 Subject Classification: 10A20
exists a faithful rational representation p: G~GLn
defined over Q (d. Representation theory) such that
AluTuMETIc GENUS - A numerical invariant of
algebraic varieties (cf. Algebraic variety). For an arbi-
p(H) is commensurable with peG) n
GL(n, Z), where Z
is the ring of integers (two subgroups A and B of a
trary projective variety X (over a field k) all irreducible
components of which have dimension n, and which is
group C are called commensurable if A B is of finite n
index in A and in B). This condition is then also satis-
defined by a homogeneous ideal I in the ring
fied for any other faithful representation defined over
k[To, ... ,TN], the arithmetic genus Pa(X) is expressed
Q. More generally, an arithmetic group is a subgroup
using the constant term cp(I, 0) of the Hilbert polyno-
of an algebraic group G, defined over a global field k,
mial cp(I, m) of I by the formula
that is commensurable with the group Go of O-points
Pa(X) == (-It(CP(I, 0)-1). of G, where 0 is the ring of integers of k. An arith-
This classical definition is due to F. Severi [1]. In the metic group H n
GR is a discrete subgroup of GR.
general case it is equivalent to the following definition: If </>: G~Gl is a k-epimorphism of algebraic groups,

253
ARITHMETIC GROUP

then the image cj>(H) of any arithmetic group He G is A characteristic property of an arithmetic progression is
an arithmetic group in G) [1]. The name arithmetic an+1 +an -I
group is sometimes also given to an abstract group that an = 2
is isomorphic to an arithmetic subgroup of some alge-
If d>O, the progression is increasing; if d<O, it is
braic group. Thus, if k is an algebraic number field, the
decreasing. The simplest example of an arithmetic pro-
group Go~G~ , where G' is obtained from G by res-
gression is the series of natural numbers 1, 2, . . .. The
tricting the field of definition from k to Q, is called an
number of terms of an arithmetic progression can be
arithmethic group. In the theory of Lie groups the
bounded or unbounded. If an arithmetic progression
name arithmetic subgroups is also given to images of
consists of n terms, its sum can be calculated by the
arithmetic subgroups of the group of real points of G R
formula:
under the factorization of G R by compact normal sub-
Sn
groups.
References Editorial comments. For results on prime numbers in
[I) BOREL, A.: Ensembles fundamentaux pour les groups arithmetic progressions see Distribution of prime numbers.
arithmetiques et formes automorphes, Fac. Sc. Paris, 1967.
[2) BOREL, A. and HARISH-CHANDRA: 'Arithmetic subgroups of AMS 1980 Subject Classification: 10A40
algebraic groups', Ann. of Math. 75 (1962), 485-535.
[3) Arithmetic groups and discontinuous subgroups, Proc. Symp. ARITHMETIC PROPORTION - An equation of the
Pure Math., 9, Amer. Math. Soc., 1966.
form
VP. Platonov a-b = c-d,
Editorial comments. Useful additional references are where a, b, c, d are given numbers. An arithmetic pro-
[A1] - [A3]. [A2] is an elementary introduction to the theory portion is also called a difference proportion.
of arithmetic groups.
AMS 1980 Subject Classification: 10A99
Conjectures of A. Selberg and 1.1. Pyatetskii-Shapiro
roughly state that for most semi-simple Lie groups discrete
ARITHMETIC ROOT, arithmetical value of the n-th
subgroups of finite co-volume are necessarily arithmetic.
G.A. Margulis settled this question completely and, in root of a real number a;;'O - A non-negative number the
particular, proved the conjectures in question. See Discrete n-th power of which is equal to a. In considering the
subgroup for more detail. two real values of the n-th root, where n is an even
number, of a non-negative number, one speaks of the
References algebraic values of the root in the domain of the real
[Ai) BOREL, A.: 'Arithmetic properties of linear algebraic groups',
in Proc. Internat. Congress Mathematicians Stockholm, 1962, numbers; if, on the other hand, one is considering all n
Uppsala, 1963, pp. 10-22. values of an n-th root, then one speaks of the values of
[A2) BOREL, A.: Introduction aux groupes arithmetiques, Hermann, the root in the domain of the complex numbers.
1969.
[A3) HUMPHREYS, J.E.: Arithmetic groups, Springer, 1980. AMS 1980 Subject Classification: 26A09
AMS 1980 Subject Classification: 20G15, 20G30
ARITHMETIC SERIES of order m - The sequence of
values of a polynomial of degree m:
ARITHMETIC MEAN - The number a obtained by
dividing the sum of several numbers aI, ... an by
p(x) = ao+a\x+ ... +amx m ,

their number n: assumed by the polynomial when the variable x takes


successive integral non-negative values x = 0, 1, .... If
a = m=l, i.e. p(x)=aO+alx, one obtains an arithmetic
progression with initial term ao and difference a I. If
AMS 1980 Subject Classification: 10A99 p(x)=x 2 or p(x)=x 3 , one obtains sequences of
squares or cubes of integers. i.e. special cases of arith-
ARITHMETIC PROGRESSION, arithmetic series of metic series of the second and third orders. If a first
the first order - A sequence of numbers in which each difference series is created, constituted by the differ-
term is obtained from the term immediately preceding ences between successive terms of an arithmetic series,
it by adding to the latter some fixed number d, which is then a series of differences of the first difference series
known as the difference of this pror,ression. Thus, each (second differences) is written. and then the second
arithmetic progression has the form differences are used to form third differences, etc .. one
a. a+d, a+2d .... , finds. at the moth stage, that all moth differences are
equal. Conversely. if the m-th differences of a numeri-
in which the general term is
cal sequence are all equal. the sequence is an arithmetic
(1" = a + (n ~ l)d. series of order m. Using this property. it is possible to

254
ARITH\-IFIICAL A 'vTRAG ES. SliMMATION METHOD OJ

construct arithmetic series of different orders from their ARITHMETIC SPACE, number space, coordinate
differences. For example, the sequence I, I, I, ... , space, real n-space - A Cartesian power R" of the set of
may be regarded as the first differences of the series of real numbers R having the structure of a linear topo-
natural numbers I, 2, 3, ... ; as the second differences logical space. The addition operation is here defined by
of the series of triangular numbers I, 3, 6, 10, ... ; as the the formula:
third differences of the sequence of tetrahedral numbers
1,4, 10,20, ... , etc. These numbers are so called
because triangular numbers represent numbers while multiplication by a number AER is defined by
arranged in the form of a triangle (Fig. I), while the formula
tetrahedral numbers represent numbers arranged in the A(X" ... ,xn ) = (Ax"". , Axn ).
form of tetrahedra (pyramids) (Fig. 2).
The topology in Rn is the topology of the direct pro-
duct of n copies of R; its base is formed by open n-
a
1
4~ 10
dimensional parallelepipeda:
I = {(x" ... ,xn)ERn: aj<x,<bj, i=l, ... on},
Fig. 1. Fig. 2.
where the numbers a j, . . . ,an and b 1, . . . ,bn are
Triangular numbers are expressed by the formula given.
[n (n + 1)1/ 2, while tetrahedral numbers are given by The real n-space Rn is also a normed space with
the formula respect to the norm
n(n + l)(n +2) n = 1, 2, ....
II x I = v'XT + ... + x~,
6
where x = (x 1, . . . , Xn) E Rn , and is a Euclidean space
A generalization of triangular numbers is constituted with respect to the scalar product
by k-gonal or figurate numbers, which played an impor-
tant role in the development of arithmetic in its various <x,y> = x,y"
i ---::: 1
stages.
wherex=(Xj, ... ,xn),Y=(yj, ... ,Yn)ERn.
I. V Dolgachev
a~-a
~-~~ AMS 1980 Subject Classification: 15A03, b1 N20,
.-~-4t
9
468XX

Fig. 3. Fig. 4.
ARITHMETIC TRIANGLE - A synonym for Pascal
triangle.
k-gonal numbers are given by the formula:
_ n(n-1) _ AMS 1980 Subject Classification: 10A99
n +(k 2) 2 ' n -1, 2, ....

They form an arithmetic series of the second order, ARITHMETICAL AVERAGES, SUMMATION
with one as their first term, k as their second term and METHOD OF - One of the methods for summing series
k - 2 as their second differences. If k = 3, triangular and sequences. The series
numbers are obtained; if k = 4, one obtains squares
(n 2); if k = 5, one obtains pentagonal numbers
(3n 2 - n) / 2, etc. These appellations will become clear
is summable by the method of arithmetical averages to
from Figs. 3 and 4, in which the number of beads
the sum s if
arranged in the form of a square or a pentagon is s 0 + ... +s n
Ii
expressed by the respective square or pentagonal n~n;, n +1 = s,
number. Figurate numbers satisfy the following
theorem, proposed by P. Fermat and first proved by where Sn = ~~ = 1Uk In this case, one also says that the
A.L. Cauchy: Any natural number can be represented sequence {sn} is summable by the method of arithmeti-
as a sum of not more than k k-gonal numbers. cal averages to the limit s. The summation method of
arithmetical averages is also called the Cesaro summa-
References
[I) WAERDEN, B.L. VAN DER: Algebra, 1-2, Springer. 1967-1971
tion method of the first order (cf. Cesaro summation
(translated from the German). methods). The summation method of arithmetical aver-
[2) ARNOL'D, V.1.: Theoretical arithmetic, Moscow, 1939 (in Rus- ages is completely regular (see Regular summation
sian).
BSE-2 methods) and translative (see Translativity of a summa-
AMS 1980 Subject Classification: 10A40 tion method).

255
ARITHMETICAL AVERAGES, SUMMATION METHOD OF

References creation of the theory of real numbers in the nineteenth


[1] HARDy, G.H.: Divergent series, Clarendon, 1949.
century using set-theoretic constructions, starting from
1.1. Volkov the natural numbers.
Editorial comments. Instead of 'arithmetical averages'
the term 'arithmetical means' is sometimes used, cf. [A 1],
References
[I] GOOEL, K.: 'Ueber formal unentscheidbare Satze der Principia
and instead of 'summation' one also uses 'summability': Mathematica und verwandter Systeme 1', Monatsh. Math. Phy-
summability method. sik 38 (1931), 178-198.
[2) CHURCH, A.: 'An unsolvable problem of elementary number
References
theory', Amer. J. Math. 58 (1936), 345-363.
[A1] COOKE, R.G.: Infinite matrices and sequence spaces, Macmil-
[3) KLEENE, S.c.: Introduction to metamathematics, North-Holland,
lan, 1950.
1951.
V.A. Uspenskil
AMS 1980 Subject Classification: 40G05
Editorial comments.
AluTHMETIZATION - A method used in mathemat- [A1) SMORINSKY, c.: 'The incompleteness theorems', in J. Barwise
(ed.): Handbook of mathematical logic, North-Holland, 1977,
ical logic for replacing a reasoning on the expressions
pp. 821-866.
of some logico-mathematical language by reasonings on
natural numbers. For this purpose the replacement is AMS 1980 Subject Classification: 03F40
constructed by some sufficiently simple one-to-one
mapping of the set of all words (in the alphabet of the ARRANGEMENT with repetitions of n out of m ele-
language under consideration) into the natural number ments - A finite sequence a= (ail' ... , aiJ of elements
sequence. The image of a word is called its number. from some set A ={a), ... ,am}. If all the term of a
Relations between and operations defined on words are are distinct, then a is called an arrangement without
transformed by this mapping into relations between repetitions. The number of arrangements of n out of m
and operations on natural numbers. The requirement of elements with repetitions is m n , and that without
a 'sufficiently simple' mapping leads to the fact that repetitions in (m)n = m(m - 1) ... (m - n - 1).
some basic relations (such as the relation of imbedding An arrangement can be regarded as a function </>
of one word into another, etc.) and some operations given on Zn = { 1, ... ,n} and taking values in A:
(like the operation of concatenation of words, etc.) are </>(k)=ai k=l, ... ,no The elements of A are usually
k '

transformed into relations and operations having a sim- called cells (or urns), while those of Zn are called parti-
ple algorithmic nature (e.g. are primitive recursive). In cles (or balls); </> defines the filling of the various cells
particular, if among the expressions of the language by the various particles. If one speaks of indistinguish-
under consideration there exist programs for some fam- able particles or cells, it is understood that classes of
ily of computable functions (d. Computable function), arrangements are being considered. Thus, if all the par-
arithmetization naturally leads to an enumeration of ticles are indistinguishable, then two arrangements
this family (in which for the number of each function is defined by functions </> and 1/;, respectively, belong to
taken the number of its program). the same class if and only if there is a permutation a of
The first arithmetization was constructed by K. Zn such that </>(a(k=I/;(k) for all k EZn. In this case,
G6del [1] for the proof of the incompletness of formal the number of such classes, or, as it is called, the
arithmetic (d. GOdel incompleteness theorem). More number of arrangements filling m different cells by n
precisely, G6del put the letters of the alphabet in indistinguishable particles, is the number of combina-
correspondence with some, pairwise different, natural tions of n out of m elements, with repetition allowed
numbers and attached to the word 1') . Tn the (d. Combination).
number i' ... p~, where ti is the number correspond- If one says that all the cells are indistinguishable,
ing to the letter Ti' and Pi is the i-th prime number in then one means that the arrangements are put into the
the natural number sequence. Such an enumeration is classes in such a way that two arrangements, defined
called a Codel enumeration. In a broad sense every by functions </> and I/; respectively, belong to the same
enumeration of words arising from arithmetization is class if there exists a permutation l' of A for which
called a Gbdel enumeration and the number T(</>(k=I/;(k) for all k EZn- In this case the number of
corresponding to a word is called its Codel number. arrangements of n different particles in m indistinguish-
In 1936 A. Church used arithmetization to obtain the able cells, that IS, the number of classes, IS
first example of an unsolvable algorithmic problem of ~;=)S(n,k), where S(n,k) is the Stirling number of
arithmetic. the second kind:
The term 'arithmetization' (in the phrase 'arithmeti-
zation of analysis') is also used in the literature on the S(n,k) = S(n-l,k-I)+kS(n-l,k), S(O, 0) = I,
foundations of mathematics for the denotation of the S(n,k) = fork>n andk=O, n>O.

256
ARTINIAN RING

If one does not distinguish either particles or cells, then submodules, quotient modules, finite direct sums and
an arrangement of n identical particles in m identical extensions. Extension in this context means that if the
cells is obtained; the number of such arrangements is modules B and A / Bare Artinian, then so is A. Each
L.~=IPn(k), where Pn(k) is the number of additive par-
Artinian module can be decomposed into a direct sum
titions of n into k natural numbers. One can also con- of sub modules which are no longer decomposable into
sider other partitions of arrangements into classes, for a direct sum. A module has a composition series if and
example when the above-mentioned permutations (J and only if it is both Artinian and Noetherian. See also
T are taken from subgroups of the symmetric groups of
Artinian ring.
L.A. Skornyakov
degrees n and m, respectively (this and other generaliza- Editorial comments.
tions can be found in [I], [2]). Synonyms of 'arrange-
References
ment' are the terms 'n-permutation', and 'ordered n- [A1] FAITH, c.: Algebra: rings, modules and categories, 1,
sample of a population'. Springer, 1973.
[A2] FAITH, c.: Algebra II: ring theory, Springer, 1976.
References
[I] SACHKOV, A.V.: Combinatorial methods of discrete mathematics, AMS 1980 Subject Classification: 13E1 0
Moscow, 1977 (in Russian).
VM. Mikheev ARTINIAN RING, right Artinian ring - A ring that
Editorial comments.
satisfies the minimum condition for right ideals, i.e. a
ring in which any non-empty set M of right ideals that
References
[A 1] COMTET, L.: Advanced combinatorics, Reidel, 1977.
is partially ordered by inclusion has a minimal element
[1] - a right ideal from M that does not strictly con-
AMS 1980 Subject Classification: 05A05, 05A 17 tain right ideals from M. In other words, an Artinian
ring is a ring which is a right Artinian module over
ARTINIAN GROUP, group with the minimum condi-
itself. A ring A is an Artinian ring if and only if it
tion for subgroups, group with the descending chain condi-
satisfies the decreasing chain condition for right ideals,
tion - A group in which any decreasing chain of distinct
i.e. for any decreasing sequence of right ideals
subgroups terminates after a finite number. Artinian
Bid B 2 d . .. there exists a natural number m such
groups are periodic, and the question of their structure
that Bm = Bm + I = .... The definition of a left Artinian
hinges on Schmidt's problem on infinite groups with
ring is similar.
finite proper subgroups [3] and the minimality problem:
Each associative Artinian ring with a unit is right
Is an Artinian group a finite extension of an Abelian
Noetherian (cf. Noetherian ring). Every finite-
group? Both these problems have been solved for
dimensional algebra over a field is an Artinian ring.
locally solvable groups [I] and locally finite groups [3],
The properties of Artinian rings in the class of alterna-
[4].
tive rings, and particularly in the class of associative
References rings, have been studied most fully (cf. Alternative rings
[I] CHERNIKHOV, S.N.: 'Infinite locally solvable groups', Mat. Sb.
7 (49), no. I (1940), 35-64 (in Russian). and algebras; Associative rings and algebras). The
[2] CHERNIKHOV, S.N.: 'The finiteness condition in general group Jacobson radical of an associative Artinian ring is nil-
theory', Uspekhi Mat. Nauk 14, no. 5 (1959), 45-96 (in Rus- potent and contains every one-sided nil-ideal. A ring A
sian).
[3] KARGAPOLOV, M.I.: 'On a problem of O.Yu. Schmidt', Sibirsk.
is a simple associative Artinian ring if and only if it is
Mat. Zh. 4, no. I (1963),232-235 (in Russian). isomorphic with the ring of all matrices of some finite
[4] SHUNKOV, V.P.: 'On the minimality property for locally finite order over some associative skew-field. In the class of
groups', Algebra and Logic 9, no. 2 (1970), 137-151
alternative rings any simple Artinian ring is either asso-
(Algebra i Logika 9, no. 2 (1970), 220-248)
ciative, or else is a Cayley - Dickson algebra over its
VP. Shunkov
centre, which is then a field. The structure of associa-
Editorial comments. Schmidt's problem actually states:
tive Artinian rings with zero Jacobson radical has been
Under what conditions does an infinite group have proper
described (cf. Semi-simple ring). There exists a variant
infinite subgroups?
of this theorem in the case of alternative rings. For
References
[A1] KEGEL, O.H. and WEHRFRITZ, B.V.: Locally finite groups, associative rings with non-zero Jacobson radical a
North-Holland, 1973. fairly-advanced structure theory has been developed [1],
AMS 1980 Subject Classification: 20FXX, 20E07, [2]. Several classes of Artinian rings - quasi-Frobenius
20E28 rings, uniserial rings, balanced rings (cf. Quasi-
Frobenius ring; Uniserial ring; Balanced ring) - are
ARTINIAN MODULE - A module that satisfies the being intensively studied.
decreasing chain condition for submodules. The class of References
Artinian modules is closed with respect to passing to [I] ARTIN, E., NESBITT, C.l. and THRALL, R.M.: Rings with

257
ARTINIAN RING

minimum condition, Univ. Michigan Press, Ann Arbor, 1946. References


(2) JACOBSON, N.: Structure of rings, Amer. Math. Soc., 1956. (1) ARzELA, C: Rend. Accad. Sci. Bologna 9, no. 2 (1905), 100-
(3) Itogi Nauk. Algebra Topol. Geom. 1965 (1967), 133-180. 107.
(4) 1togi Nauk. Algebra Topol. Geom. 1968 (1970),9-56. (2) HAHN, H.: Theorie der reel/en Funktionen, 1, Springer, 1921.
K.A. Zhevlakov B.1. Golubov
Editorial comments. AMS 1980 Subject Classification: 26830
References
[A 1) FAITH, C: Algebra: rings, modules and categories, 1, ASSERTION, proposition, statement, (assertive) sen-
Springer, 1973. tence - A narrative communication which by virtue of
[A2] FAITH, c.:Algebra II: ring theory, Springer, 1976.
its meaning is true or false. In a narrower sense, the
AMS 1980 Subject Classification: 13E1 0 term assertion in mathematical logic is understood to
be a closed formula in a logico-mathematical language
ARZELA - ASCOLI THEOREM - The name of a which by virtue of the semantics of the language can be
number of theorems that specify the conditions for the classified as true or false.
limit of a sequence of continuous functions to be a con- Thus, in axiomatic set theory, various mathematical
tinuous function. One such condition is the quasi- statements, for example, the axiom of choice or the con-
uniform convergence of the sequence. tinuum hypothesis, are written in the form of formulas
References that for general semantic reasons can be interpreted as
[1) ARZELA, C: Mem. Accad. Sci. Bologna (5) 5 (1893),225-244. expressing the contents of the statement. It is by no
[2) ASCOLI, G.: Rend. Accad. Lincei 18 (1883),521-586.
means necessary that there exists a method of recogniz-
P.s. Aleksandrov ing which assertions in the language are true and which
Editorial comments. false. Moreover, the semantics need not actually be
References sufficiently developed, or may present fundamental dif-
[A 1) DUNFORD, N. and SCHWARTZ, IT.: Linear operators. General ficulties in the solution of the question of whether cer-
theory, 1, Interscience, 1958. tain assertions are true. The un solvability of assertions
within the framework of a theory is explained by the
ARZELA VARIATION - A numerical characteristic
formalization method (for examples, see Axiomatic set
of a function of several variables, which may be
theory).
regarded as the multi-dimensional analogue of the vari- A. G. Dragalin
ation of a function III one unknown. Let AMS 1980 Subject Classification: 00A25, 03A05
f(x)=f(x], ... ,xn) be a rea)-valued function given on
an n-dimensional parallelepipedon ASSOCIATED FUNCTION of a complex variable - A
D n =[a1,bJlX'" X[an,b n ], n=2,3, ... , and let Gn function which is obtained in some manner from a
be the class of all continuous vector functions given function f (z) with the aid of some fixed function
x(t)=(X1(t), ... ,xn(t, O.:;;;t.:;;;l, such that each of the F(z). For example, if
functions Xk is non-decreasing on [0, 1], and with
= :L ak zk
00

xk(O)=ako xk(l)=bk , k = 1, ... ,no Then f(z)


k =0
m
is an entire function and if
A(j,Dn) = _sup S~? ~ If(x(t,))-fCx(t,-d) I,
x(t)""G/I =1
= :L bkz k
S 00

F(z)
where II={0=t o <t1<'" <tm =1} is an arbitrary k =0

system of points in [0, 1]. This definition for the case is a fixed entire function with bk*O, k~O, then
n =2 was proposed by C. Arzela [I] (see also [2], p. oc ak
543). If A(j, Dn)<oo, one says thatfhas bounded (fin- y(z) = :L-Z-(k+I)
k =0 bk
ite) Arzela variation on D n , and the class of all such
functions is denoted by A (Dn). For a function is a function which is associated to f (z) by means of
f(x 1, . . . ,xn) to helong to the class A (Dn) it is neces- the function F(z); it is assumed that the series con-
sary and sufficient that there exists a decomposition verges in some neighbourhood I z I > R. The function
f= fl -h, where fl and h are finite non-decreasing f (z) is then represented in terms of y(z) by the formula
functions on Dn. A function f is called non-decreasing f(z) = 1.
-2 f y(t)F(zt)dt.
on Dn if 'TTl It I =R,>R
f(x;, ... ,x;;) ~ f(Xj, . ... x;,),
In particular, if ak
for ak ':;;;x~ <x~ .:;;;bk (k = I, ... ,IJ). The class A (Dn) f(z) = :L -zk
00

contains the class of functions of bounded Hardy varia- k=(j k!


tion on Dn. is an entire function of exponential type and F(z)=e z ,

251-\
ASSOCIA TIVE CALCULUS

then stituting the pair are equivalent in m:. In the algebraic


=
'Xl
y(z) ~akZ-(k+I) interpretation, this problem is the identity problem for
k ~O
the associative system Kill:. Thue succeeded in solving
is the Borel-associated function of f (z) (cf. Borel the problem in a few special cases only. In the modern
transform). interpretation of the problem (1930's and later) the
A.F. Leont'ev
algorithm of this problem is sought in some exact
AMS 1980 Subject Classification: 30BXX, 44AXX meaning of the word, such as a partial recursive
function, a Turing machine or a normal algorithm. If the
ASSOCIATIVE CALCULUS - A name given to cal- problem is thus modernized, the problem of finding a
culi (cf. Calculus) of a very definite type, especially suit- specific associative calculus for which the desired algo-
able for specifying finitely-presented associative systems rithm does not exist becomes meaningful. Markov [3]
(semi-groups, cf. Semi-group). The term 'associative and E.L. Post [4], working independently of each other,
calculus' was introduced by A.A. Markov, who also constructed unsolvable associative calculi, i.e. associative
developed the theory of associative calculi [2]. calculi with an unsolvable algorithmic problem of
Any associative calculus m: is defined by specifying recognition of word equivalence. These results yield a
some alphabet A and a finite list a of relations in A - negative solution of Thue's problem in its modern
pairs of words (cf. Word) in this alphabet. The words form. However, if one accepts the Church thesis or any
constituting a relation are often called its parts - left other equivalent assumption regarding the adequacy of
and right. A a-permissible operation on words in A is recursive functions for rendering the concept of an
any substitution of one part of any relation belonging algorithm more precise, one comes to the inevitable
to a, where it occurs imbedded in a word in m: (cf. conclusion that Thue's problem in its original form as
Imbedded word), by the other part of the same relation. well has a negative solution for some concrete associa-
An associative calculus m: is the performance of a tive calculi.
sequence of a-permissible operations, starting from any The original examples constructed by Markov and
word P in A. One says that all words Q obtained in Post were extremely complex. Simpler unsolvable asso-
this way (including the initial word P itself) are ciative calculi were subsequently presented. They
equivalent to P in the associative calculus m: (in sym- include, for example, an associative calculus with seven
bolic notation, m:: PrrQ). The relation rr for any associ- very simple relations [5], and one with only three such
ative calculus m: is reflexive, symmetric and transitive. relations [6]. Thue's problem has been almost fully
In addition, it follows from m:: PrrQ and m:: RrrS that solved for the case of an associative calculus with one
m:: PRrrQS. This makes it possible, in a natural relation [7].
manner, to assign to each associative calculus m: a In a natural way one defines an isomorphism of one
finitely-presented associative system Kill:, whose ele- associative calculus into another and onto another [2].
ments are equivalence classes of words in A and whose Of special interest from the algebraic point of view is
multiplication operation is induced by the operation of the study of those properties of associative calculi that
concatenation of words in A. The associative system are invariant under isomorphisms; they are the
Kill: thus constructed will have a unit (the element properties of abstract associative systems. Markov [2],
represented by the empty word); the elements of K & who based himself on his own studies on the recogni-
represented by the letters of the alphabet A will consti- tion of word equivalence in an associative calculus,
tute a system of generating elements for Kill:, while the obtained a very general result, which yields a negative
list of relations a represents a complete system of rela- solution to practically all algorithmic problems con-
tions between these generating elements of Kill: in the cerning the basic classifications of associative calculi
sense that the elements of Kill: represented by two studied at that time. He showed, in particular, that if I
words P and Q are identical in Kill: if and only if P and is an isomorphism-invariant property of an associative
Q are equivalent in m:. Thus, the recognition of the calculus, if there is a unique associative calculus with
identity of elements in Kill: is reduced to the recognition the property I and if there exists also an associative
of equivalence of words in m:. Hence the importance of calculus not included in any associative calculus with
the study of the solvability of the algorithmic problem the property I, then the algorithmic problem of recog-
of recognition of equivalent words in an arbitrary asso- nition of an associative calculus with the property I
ciative calculus. This problem was first formulated by among other associative calculi is unsolvable for any
A. Thue [1]: Construct, for an arbitrary associative cal- alphabet with more than three letters. It immediately
culus m:, an algorithm which, for any pair of words in follows that algorithmic problems of recognition of
the alphabet of this associative calculus, determines, in uniqueness of associative calculi, finiteness of an associ-
a finite number of steps, whether or not the words con- ative calculus, the semi-group character of an associa-

259
ASSOCIA T1VE CALCULUS

tive calculus, inclusion in group-associative calculi, and and all a, b from the algebra.
isomorphism of pairs of associative calculi, are unsolv- The first examples of associative rings and associative
able for any alphabet with more than three letters. It algebras were number rings and fields (the field of
was subsequently proved [8] that the set of pairs of iso- complex numbers and its subrings), polynomial alge-
morphic associative calculi is recursively enumerable; bras, matrix algebras over fields, and function fields.
the method used in this connection also makes it possi- The theory of associative rings and algebras became an
ble to establish the recursive enumerability of the independent part of algebra at the beginning of the 20-
number of invariant properties of associative calculi. th century. This theory has many contact points with
References numerous fields of mathematics, especially so with alge-
[lA) TRUE, A.: 'Probleme tiber Veriinderungen von Zeichenreihen braic geometry and algebraic number theory (commuta-
nach gegebener Regeln', Kra. Vidensk. Selsk. Skrifter. 1. Mat. tive rings), functional analysis (commutative normed
Nat. KI., no. 10 (1914).
[lB) TRUE, A.: 'Probleme tiber Veriinderungen von Zeichenreihen rings, operator rings and function rings), and topology
nach gegebener Regeln', in Selected Math. Papers, Univ. For- (rings of continuous functions on topological spaces).
laget, Oslo, 1977, pp. 493-524. The theory of fields, the theory of commutative rings
[2) MARKov, A.A.: Theory of algorithms, Israel Progr. Sci. Transl.,
Jerusalem, 1961 (translated from the Russian). Also: Trudy
(cf. Field; Commutative ring, see also Commutative
Mat. Inst. Steklov. 42 (1954). algebra) and the theory of representations of associative
[3) MARKov, A.A.: 'On the impossibility of certain algorithms in algebras have become independent branches of the
the theory of associative systems', c.R. Acad. Sci. URSS 55
theory of associative rings and algebras. The theory of
(1948),533-586. (Dokl. Akad. Nauk SSSR 55, no. 7 (1947),
587-590) topological rings and skew-fields forms part of topologi-
[4) POST, E.L.: 'Recursive unsolvability of a problem of Thue', J. cal algebra.
Symbolic Logic 12, no. I (1947), I-II. The classical part of the theory of associative rings
[5) TSEITIN, G.S.: 'Associative calculi with undecidable
equivalence problems', Dokl. Akad. Nauk SSSR 107, no. 3
and algebras is formed by the theory of finite-
(1956), 370-371 (in Russian). dimensional associative algebras [2]. The principal
[6) MATIYASEVICH, YU.V.: 'Simple examples of undecidable asso- results of this theory are: A finite-dimensional simple
ciative calculi', Soviet Math. Dokl. 8, no. 2 (1967), 555-557.
(Dokl. Akad. Nauk SSSR 173, no. 6 (1967), 1264-1266)
associative algebra (i.e. one without proper ideals) over
[7) ADJAN, S.I. [S.I. ADYAN): 'Defining relations and algorithmic a field F is a complete matrix algebra over some skew-
problems for groups and semigroups', Proc. Steklov Inst. Math. field that is finite-dimensional over F (Wedderburn's
85 (1967). (Trudy Mat. Inst. Steklov. 85 (1966
theorem); a finite-dimensional associative algebra over a
[8) NAGORNYi, N.M.: 'On the search of isomorphisms of associa-
tive calculi', Z. Math. Logik Grundl. Math. 6 (1960),319-324 field of characteristic zero (and, even more generally -
(in Russian). German abstract. NM N - a separable finite-dimensional associative algebra) is the
. . agornYI
direct sum (as linear spaces) of its radical I (i.e. its
Editorial comments. The term 'associative calculus' maximal nilpotent ideal) and some semi-simple (i.e.
seems to be restricted to the Russian literature; in the West
with a radical zero) sub algebra S, any two complemen-
the term 'Thue system' is often used. For an account of
tary semi-simple sub algebras Sand S 1 being conjugate
Thue systems and related combinatorial decision problems,
(cf. Wedderburn- Mal'tsev theorem).
see [A1], Chapt. 6, or [A2].
One of the most important classes of associative alge-
References bras are skew-fields (cf. Skew-field, i.e. associative rings
[A 1) DAVIS, M.: Computability and unsolvability, McGraw-Hili,
1958. in which the equations ax = band ya = b are solvable
[A2) DAVIS, M.: 'Unsolvable problems', in J. Barwise (ed.): Hand- for all a, b in the ring, a=r!=O). Skew-fields that are alge-
book of mathematical logic, North-Holland, 1977, pp. 567- bras over a certain field are called division algebras (cf.
594.
Division algebra). The theory of finite-dimensional divi-
AMS 1980 Subject Classification: 03003 sion algebras forms a classical part of the theory of
fields. All finite-dimensional associative division alge-
ASSOCIATIVE RINGS AND ALGEBRAS - Rings bras over the field of real numbers have been
and algebras with an associative multiplication, i.e. sets described: these are the field of real numbers, the field
with two binary operations, addition + and multiplica- of complex numbers and the skew-field of quaternions
tion " that are Abelian groups with respect to addition (cf. Frobenius theorem). All finite skew-fields are com-
and semi-groups with respect to multiplication, and in mutative (Wedderburn's theorem on skew-fields). A
which the multiplication is distributive (from the left Galois theory of skew-fields has been constructed [5].
and from the right) with respect to the addition. More- The key concepts in the structural theory of associa-
over, an associative algebra should be a. vector space tive rings are the concepts of the Jacobson radical.
over a fixed field F, and the multiplication in it is com- semi-simplicity and primitivity. An associative ring is
patible with multiplication by field elements in the called semi-simple (in the sense of Jacobson) if its
sense of the condition a(ab)=(aa)b=a(ab) for all aEF Jacobson radical is zero. A ring is called primitive

260
ASSOCIATIVE RINGS AND ALGEBRAS

(from the right) if it has an irreducible exact right finitely-generated F<X>-modules contain the same
module. All semi-simple associative rings are a sub- number of elements (Cohn's theorem). Other examples
direct sum of primitive rings. Any primitive associative of rings with free ideals are group algebras of free
ring R is a dense ring of linear transformations of some groups, and free products of associative division alge-
vector space V over a skew-field (Jacobson's density bras. The free associative algebra F <X > is also a
theorem); here the meaning of density is that for all unique factorization domain: Any non-invertible ele-
linearly independent elements v I, . . . ,Vk from V and ment aEF<X> has a representation a=pI .. 'pn,
all elements U I, . . . ,Uk from V there exists a transfor- where pj are irreducible elements, and this representa-
mation rER such that Vjr=Uj for l';;;;i.;;;;k. The general tion is unique except for the order of the sequence and
theory of radicals occupies an important place in the similarities (two element a and b of a ring R are called
structural theory of rings (cf. Radical of rings and alge- similar if R / aR and R / bR are isomorphic as right
bras). R-modules). The centralizer of each non-scalar element
The theory of Artinian rings (from the right, cf. of the algebra F < X> is isomorphic to the algebra of
Artinian ring), i.e. rings with the descending chain con- polynomials F[t] in one variable t (Bergman's theorem).
dition (minimum condition) for right ideals, forms a Group algebras and PI-algebras are important classes
classical part of the theory of associative rings. The of associative algebras (cf. Group algebra; PI-algebra).
main result of this theory is that an associative ring is a The theory of varieties of rings is in the process of
semi-simple Artinian ring if and only if it is a direct being developed.
sum of a finite number of complete matrix rings over The importance of ring theory in mathematics has
skew-fields (the Wedderburn- Artin theorem). increased with the development of homological algebra.
The concept of the (classical) ring of fractions is Many known classes of rings may be characterized in
important in the structural theory of associative rings. terms of properties of the category of modules (cf.
A ring Q(R) is called the (right) ring of fractions of a Modules, category of) over these rings. For instance, a
subring R of it if in Q(R) all the regular elements (i.e. ring R is a semi-simple Artinian ring if and only if all
not divisors of zero) of R are invertible and if any ele- right (left) modules over R are projective (injective). A
ment of Q(R) has the form ab- I , where a, bER. An ring R is regular (in the sense of von Neumann) if and
associative ring has a ring of fractions if and only if for only if all right (left) modules over R are flat. See also
all elements a, b ER, where b is regular, there exist ele- Regular ring; Homological classification of rings;
ments aJ, b l ER such that ab l =bal and b l is regular Quasi-Frobenius ring.
(Ore's theorem). A ring R has a semi-simple Artinian
References
ring of fractions if and only if it is semi-primary (i.e. [1] WAERDEN, B.L. VAN DER: Algebra, 1-2, Springer, 1967-1971
/2=1=0 for all non-zero ideals / c;;;,R), satisfies the (translated from the German).
minimum condition for right annihilating ideals of the [2] ALBERT, A.A.: Structure of algebras, Amer. Math. Soc., 1939.
[3] ARTIN, E., NESBITT, C.l. and THRALL, R.M.: Rings with
form minimum condition, Univ. Michigan Press, Ann Arbor, 1946.
r(S) = {xER: Sx=O},
[4] JACOBSON, N.: The theory of rings, Amer. Math. Soc., 1943.
where S is a subset of R, and contains no infinite direct [5] JACOBSON, N.: Structure of rings, Amer. Math. Soc., 1956.
sums of right ideals (Goldie's theorem). In addition to [6] KUROSH, A.G.: Lectures on general algebra, Chelsea, 1963
(translated from the Russian).
the classical rings of fractions, rings of fractions having [7] HERSTEIN, I.: Noncommutative rings, Math. Assoc. Amer., 1968.
other meanings are also studied - mainly maximal or [8] LAMBEK, J.: Lectures on rings and modules, Blaisdell, 1966.
complete rings of fractions [8]. [9] CURTIS, C.W. and REINER, I.: Representation theory offinite
Special stress is laid on the study of free associative groups and associative algebras, Interscience, 1962.
[10] PONTRYAGIN, L.S.: Topological groups, Princeton Univ. Press,
algebras (cf. Free associative algebra). Let F be a field 1958 (translated from the Russian).
and let X be a set. The free associative algebra F <X > [II] COHN, P.M.: Free rings and their relations, Academic Press,
over F with a unit element and with base X is the alge- 1971.
[12] ZARISKI, O. and SAMUEL, P.: Commutative algebra, 1-2, van
bra of non-commutative polynomials with free terms in Nostrand, 1960. (reprinted: Springer, 1975).
the set of variables X and with coefficients from F. The [13] CARTAN, H. and ElLENBERG, S.: Homological algebra, Prince-
algebra F < X> is characterized by the fact that it is is ton Univ. Press, 1956.
[14] Itogi Nauk. Algebra Topol. 1962 (1963), 59-79.
generated by X as an algebra with a unit element and
[15] Itogi Nauk. Algebra Topol. Geom. 1965 (1967), 133-180.
any mapping of X into an associative algebra R with a [16] Itogi Nauk. Algebra Topol. Geom. 1968 (1970),9-56.
unit element can be extended (even uniquely) to a [17] BOKUT', L.A., KUZ'MIN, E.N. and SHIRSHOV, A.I.: Rings, 1-3,
homomorphism from F<X> into R. A free Novosibirsk, 1973 (in Russian).
[18] DIVINSKY, N.J.: Rings and radicals, Allen and Unwin, 1965.
associative algebra is a ring with free ideals, i.e. the [19] PASSMAN, D.S.: Infinite group rings, New York, 1971.
right (left) ideals of the ring F <X > are free right [20] PROCESI, C.: Rings with polynomial identities, New York, 1973.
(left) F <X>-modules, and all bases of the free L.A. Bokut'

261
ASSOCIATIVE RINGS AND ALGEBRAS

Editorial comments. from the point A is


References
[A1] FAITH, c.: Algebra I: Rings, modules and categories, I = ; R sin2 t.
Springer, 1973.
[A2] FAITH, c.: Algebra If: Ring theory, Springer, 1976.
The length of the entire curve is 6R. The radius of cur-
[A3] GILMORE, R.: Multiplicative ideal theory, M. Dekker, 1972. vature is
[A4] McDoNALD, B.R.: Finite rings with identity, M. Dekker, 1974. 'k = 23 R'sm 2t
AMS 1980 Subject Classification: 16-XX The area bounded by the curve is

ASSOCIATIVITY, law of associativity - A property of s = .l'ITR2


8 .
an algebraic operation. For the addition and multipli-
The astroid is the envelope of a family of segments of
cation of numbers associativity is expressed by the fol-
constant length, the ends of which are located on two
lowing identities:
mutually perpendicular straight lines. This property of
(a+b)+c = a+(b+c) and (ab)c = a (bc). the astroid is connected with one of its generalizations
A binary algebraic operation * is associative (or, - the so-called oblique astrOid, which is the envelope of
which is the same thing, satisfies the law of associativity) the segments of constant length with their ends located
if the identity on two straight lines intersecting at an arbitrary angle.
(X*y)*z = x*(y*z)
References
is valid in the given algebraic system. In a similar [I] SAVELOV, A.A.: Plane curves, Moscow, 1960 (in Russian).
manner, associativity of an n-ary operation w is defined D.D. Sokolov
by the identities Editorial comments.
References
= XI .. Xj(Xj+1 . . . Xj+nw)Xj+n+1 . X2n-IW, [A1] LAWRENCE, J.D.: A catalog of special plane curves, Dover,
reprint, 1972.
for all i= 1, ... ,no [A2] LOCKWOOD, E.A.: A book of curves, Cambridge Univ. Press,
o.A. Ivanova 1961.
D.M. Smirnov
AMS 1980 Subject Classification: 51 N1 0, 51 A03
AMS 1980 Subject Classification: 08A02
ASTRONOMETRY, MATHEMATICAL PROBLEMS
ASSOCIATOR of three elements a, b, c of a ring R - OF - Problems in astronomy related to the creation of
The element equal to an inertial reference coordinate system in space and a
(ab)c-a(bc). unification of the totality of fundamental astronomic
It is denoted by (a, b, c). constants by determining the coordinates of celestial
K.A. Zhevlakov bodies and by studying the rotation of the Earth.
AMS 1980 Subject Classification: 16-01,17-01 Methods in astronometry are based on the theory and
the practice of geometrical measurements on the celes-
ASTROID - A plane algebraic curve of order six, tial sphere and the determination of distances of stars.
described by a point M on a circle of radius r rolling An important element in astronometric studies is the
on the inside of a circle of radius R =4r; a hypocycloid most accurate possible determination of the direction of
with module m =4. Its equation in orthogonal Carte- the straight line 'observer - star' at the moment of the
sian coordinates is observation. Since the study of the mutual location of
x2/3+ y 2/3 = R2/3; points is more convenient and more illustrative than
the study of mutual location of directions, an auxiliary
and a parametric representation is spherical surface (the so-called celestial sphere) is intro-
X = R cos3 .!... y = R sin 3 .!... duced in the treatment, and all objects are assumed to
4' 4'
be at an equal distance from the observer and located
y
on this sphere. Spherical trigonometry makes it possible
to use various coordinate systems on the celestial
sphere and to determine numerous relations between
the angles and the arcs of configurations of celestial
objects. These relations determine a large proportion of
astronometric methods of observation and the geometr-
ical foundation of astronometric telescopes.
There are four cusps (see Fig.). The length of the arc The phenomena studied in astronometry are related

262
ASTRO!'JOMY. MATHEMATICAL PROBLEMS OF

to various irregularities in the rotation of the Earth (the hand, mathematical techniques specially intended for
irregular rotation of the Earth, migration of poles, vari- Earth-bound problems, which may be modified as
ation of the rotation axis due to precession and nuta- required, are extensively used in astronomy.
tion, etc.) on one hand, and to the actual displacements Astronomy is a complex science which deals with
of celestial bodies on the other. These considerations celestial objects and their systems from various aspects,
determine the temporal and the qualitative structure of which may be very remote from one another. This is
the series of observations which are performed and why the number of mathematical problems in astron-
necessitate the use of specific mathematical methods for omy is very large.
their analysis. Hence the importance and the urgency A very important chapter of astronomy is
of solving and studying large systems of linear equa- astrometry, one of the main problems in which is to
tions involving thousands of equations with hundreds determine an inertial reference coordinate system in
of unknowns, which is complicated even further by the space. The coordinate systems which are traditionally
fact that some of them can only be insufficiently deter- employed in astronomy, geodesy and other fields of sci-
mined. A typical example of the genesis of such prob- ence, based on the planarity of the Earth's equator and
lems is the processing of observations made on the the vernal equinox (i.e. the intersection line of the
bodies of the solar system (mainly small planets) in Earth's equator with the plane of the ecliptic), are not
order to determine the origin of the fundamental coor- inertial and cannot be firmly fixed in space, since both
dinate system. Another example is the derivation of the these planes continuously execute a complex motion (as
constituents of the gravitation field of the Earth from a result of precession, nutation and migration of the
the observations of the Earth made by artificial satel- Earth's poles). For the sake of comparability, the coor-
lites, realized by the worldwide network of observation dinates of the stars are usually referred to by the posi-
stations. tion of the plane of the equator and of the point of ver-
The complexity of the rotation of the Earth, the nal equinox at some fixed date (epoch), while the coor-
difference between the real Earth, which is a visco- dinate system itself, which has been fixed in this way, is
elastic body, from the model as an absolutely solid based on very careful determinations of the coordinates
makes it necessary to search for latent relationships by of a number of stars, recorded in special catalogues
methods such as correlation-spectral analysis and vari- (fundamental star catalogues). However, a substantial
ous smoothing methods. Such studies include the difficulty remains: In order to reproduce such a coordi-
studies of the variation in latitudes and longitudes and nate system at a moment of time other than the epoch
the resulting pole migration, as well as evaluations of of the catalogue, one must know the changes in the
fine and complicated fluctuations in the rate of rotation locations of the fundamental stars relative to the coor-
of the Earth. dinate system, due to their proper motion. In order to
Owing to the nature of the observed phenomena it is overcome this difficulty, it has been attempted since the
often impossible to complete the observations in a sin- middle of the 20-th century to determine the inertial
gle cycle, and it is then necessary to work with partly coordinate system with respect to remote galaxies, the
overlapping segments of observations and with a vari- proper motion of which is negligible. In this context
able origin. Such observations are processed by special importance is given in astronometry to prob-
methods of solving finite-difference equations. This is lems involved in the calculation of the most probable
typical of instrumental research in astronometry and of values of the parameters which determine the directions
the problems in astronometry which involve the meas- of the stars on the strength of repeated observations,
urement of some linear combinations of errors in the and to estimating the probability characteristics of
coordinates of celestial objects. these values. The solution of this problem is also
Methods of mathematical statistics (mostly linear important in most other branches of astronomy, since
methods) are employed to allow for errors in astro- the latter is to a large extent an observational science.
nometric observations. Various mathematical problems must be solved in
v. V. Nesterov theoretical astrophysics, which on the basis of observa-
V. V. Podobed tions of celestial objects studies the structure of these
AMS 1980 Subject Classification: 85-01 objects, the physical processes taking place in them and
their evolution. One of principal problems of astrophy-
ASTRONOMY, MATHEMATICAL PROBLEMS OF - sics is the structure and evolution of stars. The theory
Mathematical problems in the study of celestial objects. of the internal structure of stars leads to differential
Special methods have been developed for the solution equations which describe the conditions of mechanical
of a number of such problems. These methods also and energetic equilibrium of the stars. The solutions of
found use in other branches of science. On the other these equations can sometimes be expressed in terms of

263
ASTRONOMY, MATHEMATICAL PROBLEMS OF

elementary functions, but most often they are so com- bodies which attract each other according to the law of
plicated that they must be solved by numerical gravity under arbitrary initial conditions is solved by
methods. numerical integration. However, the solutions given by
The study of atmospheres of stars and of processes this method are only satisfactory during a limited
taking place in the nebulae and in the interstellar space period of time, and no conclusions as to the evolution
is based on the mathematical theory of radiative of the system of objects are possible. The partial prob-
transfer, which was intensively developed in astrophy- lem of three bodies was more satisfactorily solved using
sics. In certain cases, e.g. in the study of the passage of power series in the powers of time; however, these
radiation through a plane material layer, the transfer series converge very slowly and are therefore unsuitable
equation reduces to integral equations, the solution of for application to the observations on the motion of
which permits one to determine the characteristics of actual objects. Particular problems - the 3-body prob-
the radiation field inside the star, and also of the radia- lem, the lO-body problem (the Sun and the 9 major
tion emitted by the (interstellar) medium and accessible planets), etc. - have also been studied.
to measurements. Problems involving the motion of specific celestial
In studying the motion of gas masses in stars and bodies are solved by series expansions with respect to
nebulae, in studying the processes involved in the powers of a small parameter and subject to various
extension of gas clouds, their collisions with each other assumptions that simplify the calculation.
and with the interstellar space, extensive use is made of Astrodynamics, which deals with the motion of artifi-
the mathematical apparatus of gas dynamics and elec- cial celestial objects, makes use of specific differential
trodynamics. equations of motion. Solutions of problems involved in
Stellar astronomy, which is concerned with the laws the motion of artificial satellites must allow for the dis-
governing the structure, dynamics and evolution of stel- turbing forces due to the non-spherical shape of the
lar systems, makes use of mathematical relationships Earth, the resistance of the atmosphere, the solar radia-
between the distribution of some given characteristics tion pressure (in the case of balloon satellites) and for
of the stellar system (so-called distribution functions) certain other factors.
and the distribution of the observed characteristics. For For more details see the following entries:
instance, the study of the connections (under certain Astrometry, mathematical problems of; Astrophysics,
supplementary assumptions) between the distribution mathematical problems of; Stellar astronomy, mathemat-
functions of the distances of stars in a given solid angle ical problems of; aassical celestial mechanics,
and their absolute and apparent (observational) values mathematical problems in.
yields an integral equation, the solution of which offers N.P. Erpylev
an explanation of the law governing the distribution of AMS 1980 Subject Classification: 85-01
the stellar density in this solid angle. Similar equations
result on comparing the distribution functions of the ASTROPHYSICS, MATIlEMATICAL PROBLEMS
sought-for spatial velocities of the stars and the OF - A type of problems in theoretical astrophysics in
observed radial velocities. which mathematical methods of investigation are exten-
In stellar kinematics the problem of determining the sively employed. The principal task of theoretical astro-
components of the Sun's velocity and the rotational physics is the interpretation of results of observations
characteristics of the Galaxy from statistical studies on with the purpose of studying the structure of the
the coordinates, the proper motion and the radial velo- objects encountered in the Universe, and to study the
city, leads to an overdetermined system of constrained physical processes taking place in it. In addition, there
equations compiled for individual stars (or individual are certain mathematical problems of a general nature
areas in the sky). The mathematical techniques in which are important not only in astrophysics, but also
mechanics are used in solving problems of stellar in other branches of physics and in mathematics. The
dynamics connected with the study of star clusters, most obvious example of the influence of mathematical
galaxies and clusters of galaxies. The individual objects problems in astrophysics on the development of
which constitute the system are regarded as material mathematics is the so-called principle of invariance (cf.
points which interact in accordance with the law of Invariance, principle of) [1], which was first considered
gravitational attraction, with allowance for the specific in astrophysics, after which it was used to solve a wide
features of celestial bodies. A solution of the problems class of problems in mathematical physics and certain
of celestial mechanics, which is concerned with the problems in probability theory.
motion of celestial bodies in a gravitation field, leads to One of the principal problems in astrophysics is the
systems of differential equations of motion. The solu- study of the structure and the evolution of stars, since
tion of the most general problem of the motion of n the bulk of the matter present in the Universe is con-

264
ASTROPHYSICS. MATHEMATICAL PROBLEMS OF

centrated in the stars. It is common to deal with the brium is also frequently postulated. In theory, the solu-
internal structure of the stars separate from the struc- tion of the transfer equation under the above condi-
ture of stellar atmospheres. The problems in the theory tions yields the spectrum of the star, the comparison of
of the internal structure are formulated as differential which with the results of observations provides infor-
equations, resulting from the conditions of mechanical mation on the structure of the stellar atmosphere and
and energetic equilibrium of the star [2]. The condition on the processes taking place in it. In particular, the
of mechanical equilibrium of a star is expressed as the determination of the chemical composition of stellar
equality of the force of attraction directed towards the atmospheres is based on the study of the line spectra of
interior of the star and the force of the pressure exerted the star.
by the gases and by light, which is directed outwards. Radiative transfer processes also take place in nebu-
The condition of energetic equilibrium is an expression lae and in the interstellar space. The transfer process
of the equality of the amount of energy produced in a often consists in repeated light scattering as it passes
given volume of the star and the amount of energy through matter. Sunlight, after being dispersed in the
emitted by this volume. In addition to the equilibrium planetary atmosphere, provides information about its
conditions, the equations of state of the stellar matter, structure and its physical properties. Anisotropic light
the absorption coefficient of the radiation and the law scattering takes place in atmospheres of planets and in
of emission of energy as a result of thermonuclear reac- dusty nebulae.
tions, as a function of the density of the matter and of The theory of radiative transfer is one of the most
the temperature for the given chemical composition, are important branches of theoretical astrophysics. This
assumed to be known. Owing to the complexity of the theory owes its impressive development [3], [4] to the
equations, they are usually solved by numerical solution of the astrophysical problems mentioned
methods. The results of the theory of stellar atmo- above. Its methods are also extensively employed in
spheres are sometimes employed as the boundary con- other branches of physics such as geophysics (cf. Ge0-
ditions, which renders the conclusions more reliable. physics, mathematical problems in), in the theory of
These equations of stellar equilibrium may be reduced neutron transfer and in the calculations of the lumines-
in a significant number of cases to the Emden equation: cence of plasmas. Most frequently, radiation takes
place through a plane layer of matter. An example of
_1 -.!L
x 2 dx
[X2~]
dx
+yn = 0 (1) this are stellar and planetary atmospheres, since their
thickness is usually small in comparison to their radius.
with the conditions y = 1 and y' = 0 when x = O. If n In this case, for isotropic scattering, the radiative
assumes one of the values 0, 1,5, the solutions of equa- transfer equation is reduced to the integral equation
tion (1) may be expressed by elementary functions.
Equation (1) is applicable, for example, in the zones of y(x} ="2AXl K( I x-x ' I}y(x}dx
" +f(x}, (2)
the star in which the energy transfer mainly takes place
by a convective flow, while the transfer of radiation where f (x) defines the distribution of the radiation
energy is relatively small. Another difficulty in develop- sources in the layer, while y (x) is the unknown radia-
ing the theory of the internal structure of stars is that tion field. The kernel K (x) of the integral equation and
the interiors of stars cannot be observed. Observational the parameter "11. include the law of interaction of radia-
data are usually compared only with the calculated tion with matter, while Xo is the thickness of the layer,
integral characteristics of the star such as its mass, its expressed in unit lengths of the free path of radiation.
radius and the integral amount of energy emitted by The study of integral equations of the type (2) is one of
the star in unit time. The use of statistical material con- the principal mathematical problems of radiation
cerning these characteristics makes it possible to theory. The mathematical problems of astrophysics
present hypotheses on the possible evolution of stars. include the search for exact analytic solutions of such
The theory of stellar atmospheres is based on the stu- equations [4]. In particular, if Xo = 00, "11.= 1 and
dies of processes of radiative transfer (cf. Radiative f(x) 0, as well as K(x)=E 1(x), where E 1(x) is the
transfer theory). The energy generated in the interior of first integral exponential function, equation (2) is
the star undergoes a complex process of transfer known as Milne's equation. Its solution (cf. Milne prob-
through the atmosphere, after which it emerges to the lem) under the assumption of certain supplementary
outside. The transfer equation is solved in this case conditions yields the distribution of the temperature in
under conditions of radiation equilibrium, which the interior of the stars. In more complicated problems
expresses the fact that each volume element of the (anisotropic scattering, scattering of polarized light,
atmosphere radiates and absorbs equal amounts of light scattering in a medium of arbitrary shape, etc.)
energy. The presence of a local thermodynamic equili- the resulting unknown function is not the function

265
ASTROPHYSICS, MATHEMATICAL PROBLEMS OF

y(x), but functions which depend not only on the coor- using the electrodynamic equations (Maxwell equations)
dinates but also on the direction and on other magni- and the hydrodynamic equations. The solution of this
tudes which characterize the radiation field at the given system is highly complicated, since the equations are
point. These functions are determined with the aid of non-linear. There usually exist simplified variants, e.g.
appropriate systems of integral equations, which are a one-dimensional and auto-modelling motions (cf.
generalization of (2). Magneto-hydrodynamics, mathematical problems in).
These methods are used to find the complete solution Results obtained by radio-astronomic methods are
of the problem, which makes it possible to find both highly important in the development of astrophysics.
the radiation field inside the medium and the charac- Since the radiation of radio waves usually takes place
teristics of the radiation emitted by the medium. In the in an ionized gas (a plasma), there arise the problems
applications of the theory it is often only necessary to of determining the properties of the plasma under the
know the emitted radiation. It has been shown that it conditions that exist in stars and in interstellar space
is possible to determine the characteristics of the radia- [7]. The behaviour of the plasma can be most fully
tion emitted by the medium directly, without previously described by means of a kinetic equation. The differ-
finding the complete solution. This often simplifies the ence between the kinetic equation for a plasma and the
task considerably, and is realized either by using the kinetic Boltzmann equation for an ordinary gas consists
integral equation (2) or by applying the principle of in the fact that the Coulomb interaction of the charged
invariance. The other approach consists in representing particles of the plasma extends to relatively large dis-
the given layer as a sum of two layers and establishing tances, while in a gas consisting of neutral atoms and
the relations between the characteristics of the radiation molecules, the interactions are significant only during
emitted by the medium as a whole and the radiation direct collisions.
characteristics at the boundaries of the two layers. The study of the distribution of matter in the Galaxy
These techniques are also applied in the theory of neu- is made more difficult by the absorption of light by the
tron transfer, where they are known as the Albedo interstellar dust particles. This effect is particularly
method. marked in directions close to the Milky Way, since the
Another prominent subject in astrophysics is the layer of interstellar dust is concentrated near the plane
application of methods of gas dynamics [5] and electro- of the Galaxy. This effect of light absorption is also
dynamics [6] to the study of stars and interstellar space. responsible for the decrease in the number of observed
These methods are indispensable in the study of the galaxies as the direction of viewing approaches the
motion of gas masses in stars and nebulae. The hydro- plane of our Galaxy. However, the absorbing substance
dynamic effects (cf. Hydrodynamics, mathematical prob- is not uniformly distributed; rather, it forms discrete
lems in) in the atmospheres of the Sun and other stars, clouds of differing magnitudes, randomly distributed in
the expansion of gas clouds and their collisions with space. Thus there arises the problem of the statistical
each other and with the more rarefied interstellar study of the apparent distribution of the brightness of
medium, as well as resulting shock waves and turbulent the Milky Way and of the apparent distribution of
motions in it are some of the problems studied. galaxies. These problems were studied in detail [1] in
One of the more important features of interstellar gas the theory of fluctuations in brightness of the Milky
dynamics is the necessity of considering the interaction Way. The relevant equations were derived using the
of an ionized gas with magnetic fields. This interaction principle of invariance. The mathematical problems of
affects both the motion of the gas and the changes in the theory of the fluctuation of brightness are close to
the configuration and the energy density of the mag- the theory of certain types of random processes. In
netic field. When the gas is in motion, its particles their simplest variant it is assumed that the entire equa-
'adhere' all the time, as it were, to the magnetic line of torial plane of the galaxy is infinitely filled, at a uni-
force, moving along it or dragging it with them as they form density, with stars and with light-absorbing
move across the field. One says that that the lines of clouds. If it is assumed that the proportion of the
force are 'frozen' into the matter (cf. Frozen-in transmitted light q is the same for all clouds, and that
integral). The equations of motion of interstellar mag- g(u)du is the probability of the dimensionless bright-
netic gas dynamics include: the continuity equation ness being included between u and u + du, the function
(the law of conservation of mass); the induction equa- g(u) will be given by the equation
tion of the magnetic field, which expresses the frozen-in
principle; the equation of the influx of the energy of g(u)+g ,(u) = -qg u }.
1 [ -q (3)
interstellar radiation (the law of conservation of
energy); and the equation which expresses the law of Equations of a similar type result on considering more
conservation of momentum. The system is established general models as well. The results of the observations

266
ASTROPHYSICS, \IATHEMA TICAL PROBLEMS OF

are frequently interpreted by using moments of the tifiable initial assumptions to a minimum. A classical
magnitudes under study; accordingly, the solution of example of such an approach is the problem of finding
the complicated initial equations is not invariably the distribution of the spatial density of the stars from
necessary. The relationships of the moments, derived their distribution in a projection [8]. The connection
from the initial equations, are quite simple and rapidly between the observed and the spatial density distribu-
yield the desired results. tion is obtained in the form of an Abel integral equa-
The study of the structure, dynamics and evolution tion, the solution of which yields the desired result. The
of stellar systems forms the subject of statistical same equation is also obtained in a number of other
mechanics of stellar systems. Here, systems of bodies problems, e.g. in the study of the distribution of atoms
are usually considered as material points interacting in with respect to altitude in the chromosphere and of the
accordance with the law of mutual attraction. The most electrons in the corona of the Sun. In all such cases
important systems are star clusters, galaxies and clus- only one assumption - on spherical symmetry - is
ters of galaxies. The given problems lead to questions made. A more complex integral equation is obtained
which are studied in physical kinetics. However, as dis- in the derivation of the distribution function of the spa-
tinct from studies of ordinary gases, the study of a stel- tial velocities of the stars from the distribution of the
lar gas involves very significant characteristic features. observed radial velocities. Here, the function is repro-
First, a stellar gas is not at a complete statistical equili- duced in three-dimensional space from the values of its
brium. Secondly, it is important to note that the gas is integrals in all possible planes. The examples here con-
situated in its own gravitation field, which should be sidered refer to so-called inverse problems. Inverse
determined together with other unknown magnitudes. problems in astrophysics involve considerable difficul-
Finally, the interaction between stars differs from the ties and uncertainties both of mathematical nature and
interaction between gas molecules in that it concerns a in obtaining the necessary results of observations. The
remote action discharge, which is analogous to the mathematical aspects, in particular, have to do with
Coulomb force acting between the charged particles in possible ill-posedness of the problems. The method of
a plasma. These features seriously complicate the regularization of the solutions of ill-posed problems
theoretical considerations. However, in studying the proves valuable in solving many astrophysical ques-
motion of stars in a galaxy it is permissible to neglect tions.
the interaction at a first approximation, since the relax- An example of an ill-posed inverse mathematical
ation times prove to be longer than the period of time problem in astrophysics is finding the optical properties
which determines the time of evolution of the Universe of the atmospheres of planets by measuring their pho-
[1], [8]. tometric characteristics. Repeated scattering of light in
For star clusters the relaxation time is relatively the atmosphere and reflections from the surface of the
short, and it is then necessary to consider the interac- planet result in a loss of information on the magnitudes
tion between the stars. Here the usual practice is to which determine the unknown optical properties of the
take advantage of the analogy between the Coulomb atmosphere. To obtain a single-valued solution, auxili-
forces and the gravitational forces and to use Landau's ary simplifying assumptions must be made, which
kinetic equation. This equation was obtained for a strongly limit the possibilities of the investigation.
plasma using the Fokker- Planck approximation (cf. Inverse mathematical problems in astrophysics often
Fokker- Planck equation) in representing the diffusion involve the solution of an integral equation
process in the velocity space. Landau's equation cannot +00
give a sufficiently rigorous description of the process of F(x) = f A(x-y)j(y)dy (4)
evolution of a stellar system, which inevitably involves
star losses. Accordingly, more general kinetic equations of the first kind, which determines the transition from
are employed in studying the evolution of star clusters. the function F(x), which represents the known data
One of the more important stages of astrophysical about some magnitude, obtained by observations, to
studies is to obtain the most complete and reliable the unknown function f (x), which gives the true values
information possible about the objects being studied by of this magnitude. The results of observations may be
comparing the results obtained by observations with distorted for various reasons - a fact which is quanti-
theoretical results. The usual practice is to study a cal- tatively described by the function A (x) in equation (4),
culated theoretical model and to check the validity of which is usually known for a given problem. For
the assumptions on which its construction is based. In instance, the functions F (x) and f (x) determine the
certain cases model representations can be dispensed experimental and the true profile of the spectral line,
with and the observed characteristics can be directly while the function A (x) corresponds to the observed
related to the theory, while reducing the number of jus- broadening profile of the ideally monochromatic spec-

267
ASTROPHYSICS, MATHEMATICAL PROBLEMS OF

tralline, caused by the properties of the spectrograph. without an orientation-reversing homeomorphism.


Many inverse problems arise in radio-astronomy Thus, for instance, the complex projective plane is an
both in processing the results of observations and in asymmetric variety, since the self-intersection of the
the interpretation of the data thus obtained. The pas- complex straight line is + 1 or - 1, depending on the
sage from the observed distribution of temperature to orientation. Certain knots can differ from their mirror
the true distribution in the one-dimensional case is image owing to the fact that their branched coverings
effected by the use of equation (4), in which A (x) are asymmetric varieties.
A. V. Chernavskif
determines the property of the radio-telescope antenna
and is called its diagram. An important feature of Editorial comments. This notion can be found, e.g., in
radio-astronomical studies is the use of the 21-cm radio [A1], Chapt. 5.
wave-length, which is generated by the transition References
[A1] HIRSCH, M.W.: Differential topology, Springer, 1976.
between the sublevels of the superfine structure of the
hydrogen atom. The study of the interstellar radio-type AMS 19aO Subject Classification: 57MXX
radiation on this wave-length yields valuable informa-
tion on the structure of the Galaxy and its rotation, on ASYMMETRY COEFFICIENT - The most frequently
the distribution of the interstellar gas in it, and on the employed measure of the asymmetry of a distribution,
structure of the central part of the Galaxy which is defined by the relationship
inaccessible for optical methods. The line profiles _ IL3
observed in the various directions have a complicated "Yl - 3'72'
IL2
shape due to various factors (thermal motion of atoms,
where JL2 and J.L3 are the second and third central
chaotic motion of the interstellar gas, the rotation of
moments of the distribution, respectively. For distribu-
the Galaxy, etc.). Accordingly, a number of inverse
tions that are symmetric with respect to the mathemati-
problems arise on passing from the measured to the
sought characteristics. In order to eliminate the blurring cal expectation, YI =0; depending on the sign of YI one
effect of thermal and chaotic macroscopic movements speaks of positive asymmetry (YI >0) and negative asym-
metry (YI <0). In the case of the binomial distribution
of the gas from the observed profile, equation (4) is
corresponding to n Bernoulli trials with probability of
employed, in which A (x) is a description of the velocity
success p,
distribution. This routine systematically yields gas velo- _ i-2p
(*)
Y1 - -Y"'n=rp=(}:'_=p=) ,
cities and other magnitudes.
References one has: If p = 1 /2 (YI =0), the distribution is sym-
[l) AMBARTSUMYAN, V.A.: Scientific work, Vol. I, Erevan, 1960 (in metric; if p < 1 / 2 or p> 1 / 2, one obtains typical dis-
Russian). tribution diagrams with a positive (Fig. a) and negative
[2) ScHWARZSCHILD, M.: Structure and evolution of stars, Prince-
ton Univ. Press, 1958. (Fig. b) asymmetry.
[3) SOBOLEV, V.V.: A treatise on radiative transfer, v. Nostrand,
1960 (translated from the Russian).
[4) SOBOLEV, V.V.: Course in theoretical astrophysics, NASA,
Washington, D.C., 1969 (translated from the Russian).
[5) KAPLAN, S.A.: Interstellar gasdynamics, Moscow, 1958 (in Rus-
sian).
[6) PIKEL'NER, S.B.: Fundamentals of cosmic electrodynamics, Mos-
cow, 1966 (in Russian).
a) P(k, 10, 1/5)
I.N. Minin
Editorial comments.
References
[A1] EHLERS, J. (ED.): Relativity theory and astrophysics, 1-3,
Amer. Math. Soc., 1967.
[A2) KURT, R.: Introduction to stellar statistics, Pergamon Press, k
1967.
[A3] LANDSBERG, P.T. and EVANo, D.A.: Mathematical cosmology,
Clarendon Press, 1977. b) P(k, 10,4/5)
[A4) CHANDRASEKHAR, S.: Radiative transfer, Dover, reprint, 1960.
[AS] CHANDRASEKHAR, S.: Stellar structure, Dover, reprint, 1957.
[A6) HAWKING, S.W. and ELLIS, G.F.R.: The large scale structure Diagrams of the binomial distribution
of space-time, Cambridge Univ. Press, 1973. P (k, n, p) corresponding to n = 10 Ber-
AMS 1980 Subject Classification: 85-01 noulli trials: a) with positive asymmetry
(p = 1/5); b) with negative asymmetry
ASYMMETRIC VARIETY - An oriented variety M (p=4/5).
ASY.\1PTOTIC BASIS

The asymmetry coefficient (*) tends to zero as n~oo, asymptote can be vertical or inclined. The equation of
in accordance with the fact that a normalized binomial a vertical asymptote is x = a, where f (x)~ + 00 ( - 00)
distribution converges to the standard normal distribu- as x~a (from one side) is satisfied. An inclined
tion. asymptote, with equation y = kx + I, exists if and only if
The asymmetry coefficient and the excess coefficient the limits
are the most extensively used characteristics of the
k = limM., I = lim[f(x)-kx),
accuracy with which the distribution function Fn(x) of x
the normalized sum exist as x~+oo (or as x~-oo).
(X, + ... + Xn)- nIL, Similar formulas are also obtained for parametrized
y;;;; (unbounded) curves in general parametric representa-
tion. In polar coordinates an asymptote of a curve
where XI, ... ,Xn are identically distributed and mutu-
r = r( cp), where r >0, with slope angle lX, is defined by
ally independent with asymmetry coefficient 8 1, may be
the condition r~ + 00 as cp~a. The distance p of this
approximated by the normal distribution function
asymptote from the coordinate origin is calculated by
CP(x) = . b-
v2'IT
Je-,2 12 dz.
the formula
-x
p = lim It Ir(a+t) as t~+O (or as t~-O).
Under fairly general conditions the Edgeworth series If there exists a limit position of the tangent line to
yields the infinite branch of the curve, this position is an

Fn(x) = CP(x)- Fn ~ cp(3)(x)+O l~l' asymptote. The converse is not always true. Thus, the
curve y = (sin x 2) / x has the asymptote y = as
x~-+- 00, even though a limit position of the tangent
where qP)(x) is the derivative of order three.
line does not exist. Hyperbolas are the only second-
References order curves with asymptotes. The asymptotes of the
[I] CRAMER, H.: Mathematical methods o/statistics, Princeton
Univ. Press, 1946. hyperbola (x 2 / a 2 )_(y2 / b 2 )= 1 are given by the
[2] WILKS, S.S.: Mathematical statistics, Wiley, 1962. equations (x / a)-+-(y / b)=O. An inclined asymptote
A. V Prokhorov yields a simple (linear with respect to x) asymptotic
approximation of the function:
Editorial comments, The asymmetry coefficient is usu-
ally calied the coefficient of skewness. One correspondingly f(x) = kx+l+o(l)
speaks of the skewness of a distribution and of positive,
as x~+oo (or as x~-oo).
respectively negative, skewness.
The excess coefficient is more often called the coefficient References
[I] RASHEVSKIi, P.K.: A course 0/ differential geometry, Moscow,
of kurtosis.
1956 (in Russian).
AMS 1980 Subject Classification: 62-XX [2] KUDRYAVTSEV, L.D.: Mathematical analysis, Moscow, 1973 (in
Russian).
L.P. Kuptsov
ASYMMETRY OF A DISTRIBUTION - A qualita- Editorial comments.
tive property of the curve of a distribution, indicating References
that the distribution deviates from a symmetric one. [A 1) POGORELOV, A.V.: Differential geometry, Noordhoff, 1959
The asymmetry of a distribution is positive (negative) if (translated from the Russian).

the asymmetry coefficient is positive (negative). If the AMS 1980 Subject Classification: 53A04, 26A 12
asymmetry of a distribution is positive (negative), then
the 'longer' part of the density curve of the distribution ASYMPTOTIC BASIS, asymptotic basis of order k -
is found to the right (left) of the mode. A sequence of natural numbers and zero, which as a
K.P. Latyshev result of its summation repeated k times yields all suffi-
ciently large natural numbers. The number k is called
Editorial comments. In Western literature one commonly
calis this notion the skewness of a distribution, and accord-
the order of the asymptotic basis. Thus, the sequence of
ingly one speaks of the skewness coefficient. prime numbers is an asymptotic basis of order 4 (LM.
Vinogradov, 1937); the sequence of cubes of natural
AMS 1980 Subject Classification: 60EXX, 62EXX
numbers is an asymptotic basis of order 7 (Yu.V. Lin-
nik, 1942).
ASYMPTOTE of a curve y = f (x) with an infinite B.M. Bredikhin
branch - A straight line the distance of which from the Editorial comments. See also Waring problem; Gold-
point (x, f (x on the curve tends to zero as the point bach problem.
moves along the branch of the curve to infinity. An AMS 1980 Subject Classification: 10105

269
ASYMPTOTIC DENSITY

ASYMPTOTIC DENSITY - A variant of the general References


concept of the density of a sequence of natural [A1] HsIUNG, c.c.: A first course in differential geometry, Wiley,
1981.
numbers, which measures how large a part of the [A2] STRUIK, 0.1.: Lectures on classical differential geometry,
sequence of all natural numbers belongs to the given Addison-Wesley, 1950.
sequence A of natural numbers including zero. The AMS 1980 Subject Classification: 53A05
asymptotic density of a sequence A is expressed by the
real number a defined by the formula ASYMPTOTIC EQUALITY - Two functions f (x)
I . A (x\ and g(x) are called asymptotically equal as X-Ho if in
a = lmmf~,
x------.oc X some neighbourhood of the point Xo (except possibly at
where
A(x) = ~ I, x;;;;.l.

x itself)
j(x) = (x)g(x),
UEA.
where
=
O<a~x
lim (x) I,
The number X-Xo

f3 = lim sup~
x I.e.
x----"'oo
f(x) = g(x)[1 +0(1)),
is known as the upper asymptotic density. If the numbers
a and f3 coincide, their common value is called the as X-Ho (xo is a finite or an infinite point of the set
natural density. Thus, for instance, the sequence of
on which the functions under consideration are
defined). If g(x) does not vanish in some neighbour-
numbers that are free from squares has the natural den-
hood of xo, this condition is equivalent to the require-
sity () = 6/"? The concept of an asymptotic density is
ment
employed in finding criteria for some sequence to be an lim M.. = 1.
asymptotic basis. X~Xo g(x)
B. M. Bredikhin
In other words, asymptotic equality of two functions
Editorial comments. The number a as defined above is
f(x) and g(x) as X-Ho means, in this case, that the
also called the lower asymptotic density.
relative error of the approximate equality of f (x) and
References g(x), i.e. the magnitude [f(x)-g(x)]/ g(x), g(x)=,FO,
[A 1] HALBERSTAM, H. and ROTH, K.F.: Sequences, 1, Clarendon
Press, 1966.
is infinitely small as X-Ho. Asymptotic equality of
functions is meaningful for infinitely-small and
AMS 1980 Subject Classification: 10L02
infinitely-large functions. Asymptotic equality of two
ASYMPTOTIC DERIVATIVE - A synonym for functions f(x) and g(x) is denoted by f(x)~g(x) as
X~Xo, and is reflexive, symmetric and transitive.
approximate derivative.
Accordingly, the set of infinitely-small (infinitely-large)
AMS 1980 Subject Classification: 26A24
functions as X~Xo is decomposed into equivalence
ASYMPTOTIC DIRECTION - A direction on a regu- classes of such functions. An example of
lar surface in which the curvature of the normal section asymptotically-equal functions (which are also called
of the surface is zero. For the direction du: dv at a equivalent functions) as X~Xo are the functions u(x),
point P to be an asymptotic direction, the following sin u(x), In(l +u(x)], eU(x) -I, where limHXo u(x)=O.
condition is necessary and sufficient: If f~fl and g~gl as X~Xo, then

Ldu 2 +2Mdudv+Ndv 2 = 0, lim 1.l!:l = lim flex) ,


,~\u g(x),~xu gl(X)
where u and v are interior coordinates on the surface,
while L, M and N are the coefficients of the second where the existence of anyone of the limits follows
fundamental form of the surface, calculated at P. At an from the existence of the other one. See also Asymptotic
elliptic point of the surface the asymptotic directions expansion of a function; Asymptotic formula.
are imaginary, at a hyperbolic point there are two real M.1. Shabunin
asymptotic directions, at a parabolic point there is one Editorial comments. One also says that f (x) and g(x)
real asymptotic direction, and at a flat point any direc- are of the same order of magmtude at x(] instead of asymp-
tion is asymptotic. Asymptotic directions are self- totically equal.
conjugate directions (d. Conjugate directions). References
[A 1] COURANT, R. Differential and integral calculus, 1, Blackie
References and Son, 1948, Chapt. 3, Sect. 9.
[II RASHEVSKli, P.K.: A course of differential geometry, Moscow,
1956 (in Russian). AMS 1980 Subject Classification 26E99
E. V Shikin
Editorial comments. ASYMPTOTIC EXPANSION of a function j(x) - A

270
ASYMPTOTIC LINE

senes often disregarded if asymptotic approximations can be


relatively easily found.
n =0
References
such that for any integer N;;.O one has [I] POINCARE, H.: 'Sur les integrales irregulieres des equations
N lineaires', Acta Math. 8 (1886),295-344.
f(x) = 2:,"'n(X)+O(<PN(X (x-Ho), (I) [2] WHITTAKER, E.T. and WATSON, G.N.: A course of modern
n =0
analysis, Cambridge Univ. Press, 1952.
where {<Pn(x)} is some given asymptotic sequence as [3] ERDELYI, A. and WYMAN, M.: 'The asymptotic evaluation of
certain integrals', Arch. Rational Mech. Anal. 14 (1963),217-
X~Xo. In such a case one also has
oc
260. M. V. Fedoryuk
f(x) ~ 2:, "'n(x), {<Pn(x)}, (x-no) (2) Editorial comments.
n =0
References
The sequence {<Pn(x)} is omitted from formula (2) if it [A1] BRUUN, N.G. DE: Asymptotic methods in analysis, Dover,
is clear from the context which sequence is meant. reprint, 1981.
The asymptotic expansion (2) is called an asymptotic [A2] ERDELYI, A.: Asymptotic expansions, Dover, reprint, 1956.
expansion in the sense of Erdelyi [3]. An expansion of [A3] COPSON, E.T.: Asymptotic expansions, Cambridge Univ.
Press, 1965.
the type 00
[A4] MURRAY, J.P.: Asymptotic analysis, Springer, 1984.
f(x) ~ 2:, an<Pn(X) (x-Ho), (3) [A5] BLEISTEIN, N. and HANDELSMAN, R.A.: Asymptotic expan-
n =0
sions of integrals, Dover, reprint, 1986.
where an are constants, is called an asymptotic expan- AMS 1980 Subject Classification: 41 A60
sion in the sense of Poincare. If the asymptotic sequence
of functions {<Pn(x)} is given, the asymptotic expansion ASYMPTOTIC EXPRESSION - A synonym for
(3), contrary to the expansion (2), is uniquely defined asymptotic formula.
by the function f (x) itself. If (1) is valid for a finite AMS 1980 Subject Classification: 41 A60
number of values N=O, ... ,No<oo, then (1) is called
an asymptotic expansion up to O(<PNo(X. The series ASYMPTOTIC FORMULA - A formula containing
x 00 the symbols 0, 0 or the equivalence sign "-' (asymptotic
2:, "'Ax), 2:, an <Pn (X) equality of functions).
n =0 n =0
Examples of asymptotic formulas:
are known as asymptotic series. As a rule such series are sinx = x +o(x2), x~O;
divergent. Asymptotic power series are the ones most
commonly employed; the corresponding asymptotic COSX = 1+O(X2), x~O;

expansions are asymptotic expansions in the sense of x 3+x+1 ~ x3, x~oo;


Poincare. X
17(X) ~ Inx' x~oo
The following is an example of an asymptotic expan-
sion in the sense of ErdeIyi: ('1T(x) is the amount of prime numbers not exceeding x).
B.M. Bredikhin
I,(x) ~ -..Vf2[COS
-:;;; [x- 17V2 _!!....J
4
~(-I)na2nX-2n-
n =0
Editorial comments. On the meaning of the symbols
0,0 and ~, see e.g. [A1] or [A2].

-sin [x- 17{ - : t~0(-I)"a2n+lx-2n-l] References


[A1] BRUUN, N.G. DE: Asymptotic methods in analysis, Dover,
reprint, 1981.
(x~ + 00), where J vex) is the Bessel function, and [A2] ERDELYI, A.: Asymptotic expansions, Dover, reprint, 1956.
a = rev +n + I /2) AMS 1980 Subject Classification: 41 A60
n 2n n!r(v-n+I/2)
The concepts of an asymptotic expansion of a func- ASYMPTOTIC LIMIT - A synonym for approximate
tion and of an asymptotic series were introduced by H. limit.
Poincare [1] in the context of problems in celestial AMS 1980 Subject Classification: 26A24
mechanics. Special cases of asymptotic expansions were
discovered and utilized as early as the 18-th century [2]. ASYMPTOTIC LINE - A curve r on a regular sur-
Asymptotic expansions play an important role in many face F such that the normal curvature along r is zero.
problems in mathematics, mechanics and physics. This An asymptotic line is given by the differential equa-
is because many problems do not admit exact solutions, tion:
but their solutions can be obtained as asymptotic II 2= Ldu +2Mdudv+Ndv
2 = O.
approximations. Moreover, numerical methods are where II is the second fundamental form of the surface.

271
ASYMPTOTIC LINE

The osculating plane of an asymptotic line r, if it [A4] SPIVAK, M.: Differential geometry, 3, Publish or Perish, 1975.
exists, coincides with the tangent plane to F (at the [AS] HICKS, N.J.: Notes on differential geometry, v. Nostrand,
1965.
points of f), and the square of the torsion of an [A6] BLASCHKE, W. and LEICHTWEISS, K.: Elementare Differential-
asymptotic line is equal to the modulus of the Gaussian geometrie, Springer, 1973.
curvature K of the surface F (the Beltrami- Enneper
AMS 1980 Subject Classification: 53A05
theorem). A straight line I EF (e.g. a generating line of a
ruled surface) is always an asymptotic line. If r is a ASYMPTOTIC NEGLIGIBILITY - A property of
parabolic curve (e.g. a circle on a standard torus random variables indicating that their individual contri-
separating the domains of Gaussian curvatures of dif- bution as components of a sum is small. This concept
ferent signs), it is an asymptotic line. is important, for example, in the so-called triangular
A unique asymptotic line, which coincides with the array. Let the random variables Xnk (n = 1, 2, ... ;
rectilinear generator, passes through each point of a k = 1, ... ,kn ) be mutually independent for each n, and
parabolic domain (where K=O, but n*O). Through let
each point of a hyperbolic domain (where K <0) there
pass exactly two asymptotic lines, forming the so-called If for all i>O and S>O, at sufficiently large values of n,
asymptotic net, which plays an important role in the the inequality
study of the spatial form of a surface of negative curva- (1)
ture (cf. Negative curvature, surface of). For instance,
on a complete surface this net is homeomorphic to the is .satisfied, the individual terms Xnk are called asymp-
Cartesian net on the plane if totically negligible (the variables Xnk then form a so-
called zero triangular array). If condition (1) is met,
grad I vi ~ K I ~ q, q = const. one obtains the following important result: The class of
limit distributions for Sn - An (An are certain 'center-
Asymptotic nets on surfaces of constant negative curva- ing' constants) coincides with the class of infinitely-
ture are Chebyshev nets (cf. Chebyshev net), and the divisible distributions (cf. Infinitely-divisible distribu-
surface area of a quadrangle formed by asymptotic tion). If the distributions of Sn converge to a limit dis-
lines is proportional to the excess of the sum of its inte- tribution, kn----,>oo, and the terms are identically distri-
rior angles (Xi over 2w: buted, condition (1) is automatically met. If the
I K IS = 2'1T-0:] -0:2 -0:3 -0:4 requirement for asymptotic negligibility is strengthened
by assuming that for all {>o and S>O for all suffi-
(Hazzidakis' formula).
ciently large n one has
Under a projective transformation w of the space, the
asymptotic lines of a surface F become the asymptotic P [max
l~k~k"
I Xnk I >] < 0, (2)
lines of the transformed surface w(F).
Asymptotic lines on surfaces in a three-dimensional then the following statement is valid: If (2) is met, the
Riemannian space are defined in a similar manner. limit distribution for Sn - An can only be a normal dis-
Various generalizations of the concept of an asymptotic tribution (in particular with variance equal to zero, i.e.
line on manifolds imbedded in a multi-dimensional a degenerate distribution).
A. V. Prokhorov
space are known; the most frequently-used one involves
Editorial comments.
the concept of the second fundamental form, which is
associated with a given normal vector. References
[A 1] FELLER, W.: An introduction to probability theory and its
References applications, 2, Wiley, 1966.
[I] POGORELOV. A.V.: Differential geometry. Noordhoff. 1959
(translated from the Russian). AMS 1980 Subject Classification: 60-XX
[2] RASHEVSKli, P.K.: A course of differential geometry, Moscow,
1956 (in Russian). ASYMPTOTIC NET - A net on a surface formed by
M.I. Voitsekhovski(
two families of asymptotic lines (cf. Asymptotic line).
Editorial comments. Hazzidakis' formula can be found in An asymptotic net only exists on non-developable sur-
[A1] and [A2J, p. 204.
faces of non-positive curvature. The orthogonality of an
References asymptotic net characterizes a minimal surface.
[A 1] HAZZIDAKIS. 1.N. 'Uber elnlge Eigenschaften der Flachen
mit konstanten Krummungsmasz', Crelle's J Math. 88
V T Bazr/ev
(1880), 68-73. AMS 1980 Subject Classification: 53A05
[A2] STRUIK. D.l.: Lectures on classical differential geometry,
Addison-Wesley, 1950.
[A3] HSIUNG. c.c.: A first course In differential geometry, Wiley,
ASYMPTOTIC POINT - A singular point of a curve
1981. around which it twists an infinite number of times,

272
ASYMPTOTIC VALCE

approaching it without limit. For example, the asymp- uniformly in argz as I z I~oo inside any closed angle
totic point of the logarithmic spiral. contained in D, then
AMS 1980 Subject Classification: 51 N1 0 , ~ nan
f(z) ~ - ...-
n=1 zn+1
ASYMPI'OTIC POWER SERIES - An asymptotic
series with respect to the sequence
uniformly in arg z as I z I~ 00 in any closed angle con-
tained in D.
{x-n} (x~oo)
References
or with respect to a sequence [1) COPSON, E.T.: Asymptotic expansions, Cambridge Univ. Press,
1965.
{(x - xo)"} (x~xo) [2) ERDELYI, A.: Asymptotic expansions, Dover, reprint, 1956.
[3) WHITTAKER, E.T. and WATSON, G.N.: A course of modem
(cf. Asymptotic expansion of a function). Asymptotic analysis, Cambridge Univ. Press, 1952.
power series may be added, multiplied, divided and M.l. Shabunin
integrated just like convergent power series. Editorial comments.
Let two functions f(x) and g(x) have the following References
asymptotic expansions as x~oo: [A1] BRUUN, N.G. DE: Asymptotic methods in analysis, Dover,
x a x b reprint, 1981.
f(x) ~ ~~, g(x) ~ ~~.
n=OX n=OX AMS 1980 Subject Classification: 41 A60
Then
~ Aan+Bbn ASYMPTOTIC REPRESENTATION - A synonym for
I) Af(x) +Bg(x) ~ ... n
n=O X asymptotic formula.
(A, B are constants); AMS 1980 Subject Classification: 41 A60
ex; C
2)f(x)g(x) ~ ~~;
n=O X ASYMPTOTIC SEQUENCE - A sequence of func-
1 1 d tions {<Pn(x)} such that
3) - - ~ -+ ~_n ao=FO
00

f(x) ao n=1 xn ' <Pn+I(X) = o(<Pn(x, x~Xo, xEM,


(cn , dn are calculated as for convergent power series); where Xo is a limit point of the set M (finite or infin-
4) if the function f (x) is continuous for x >a >0, ite). If the nature of M is clear from the context, then
then ex; one simply writes X~Xo. If {<Pn(x)} is an asymptotic
jff(t)-ao-!!J...]dt ~ ~ an+nl ; sequence and !f,{x) is a function defined on M, then
xl t n=lnx N(x )<Pn(x)} will also be an asymptotic sequence.
5) an asymptotic power series cannot always be dif- Examples of asymptotic sequences:
ferentiated, but if f (x) has a continuous derivative 1) {(x-xo)n}, X~Xo;
which can be expanded into an asymptotic power 2) {x- n }, x~oo;
series, then 3) {eXx- n }, x~oo;
j'(x) ~ _ ~ (n-I~an-I. 4) {z-n}, z~oo, ZED, where D is an unbounded
n=2 X domain in the complex plane. Asymptotic sequences
Examples of asymptotic power series. such as 1), 2) and 4) are called asymptotic power
sequences.
OOje x - t ex; (_l)n-I(n-I)! M.l. Shabunin
-dt ~ ~ - - - -'
x t n= I xn ' Editorial comments.
Yxe-iXmp(x) ~ ~ e- l7T
/
4(-i)"[f(n+I/2)j2 References
n=O 2n-I/2~/2n!xn ' [A1] BRUUN, N.G. DE: Asymptotic methods in analysis, Dover,
reprint, 1981.
where Hb1) (x) is the Hankel function of order zero (cf. [A2] ERDEL YI, A.: Asymptotic expansions, Dover, reprint, 1956.
Hankel functions) (the above asymptotic power series
AMS 1980 Subject Classification: 41 A60
diverge for all x).
Similar assertions are also valid for functions of a
ASYMPI'OTIC SERIES - See Asymptotic expansion
complex variable Z as z~oo in a neighbourhood of the
of a function.
point at infinity or inside an angle. For a complex vari-
able 5) takes the following form: If the function f (z) is AMS 1980 Subject Classification: 41 A60
regular in the domain D = { I z I >a, a< I argz I <f3}
and if ASYMPTOTIC VALUE - A limit value along some
00 a
f(z) ~ ~~ path. More exactly, a complex number a or a= 00 is
n=O Z called an asymptotic value for a function f (z) of the

273
ASYMPTOTIC VALUE

complex variable z at a point a of the closure 15 of its observation. An estimator Yn E~ which attains the
domain of definition D if there exists a path L: lower bound just mentioned is asymptotically efficient.
z =z(t), O.;;;;t<l, LCD, terminating at a, i.e. so that Under certain conditions this property is satisfied by
lim z(t) ::::: a, the maximum-likelihood estimator for (), which makes
t~I-O
the classical definition meaningful. If the
along which asymptotically-efficient estimator Yn exists, the magni-
limf(z) ::::: a, ZEL.
z~a

tude rJ2 (Vn r..


For instance, at the point a = 00 the function f (z) = eZ lim n)
n~oo 02(Vn Tn)
has the asymptotic values a] = 0 and a2 = 00 along the
paths L]: z=-t, O.;;;;t<+oo, and L 2 : z=t, is called the asymptotic relative efficiency of Tn. Cer-
O';;;;t < + 00, respectively. Sets of asymptotic values play tain variants of the concept of an asymptotically-
an important role in the theory of limit sets (cf. Limit efficient estimator are due to R.A. Fisher, C.R. Rao
set). and others.
If f (z) has two different asymptotic values at a, a is References
called a point of indeterminacy for the function f (z). [1) RAo, C.R.: Linear statistical inference and its applications,
For any function fez), defined in a simply-connected Wiley, 1965.
O. V Shalaevskil
plane domain, the set of points of indeterminacy is at
Editorial comments. More modern definitions of this
most countable.
concept are due to J. Hajek, L. LeCam and others.
The above definition of asymptotic value refers to
References
asymptotic point values. If the limit set of a curve L is a [A 1] IBRAGIMov, J.A. and HAS'MINSKII, R.Z. [R.Z. KHAS'MINSKIT]:
set E C aD rather than a single point a EaD, one also Statistical estimation. Asymptotic theory, Springer, 1981
speaks of the asymptotic value A (j, E) associated with (translated from the Russian).
E. AMS 1980 Subject Classification: 62F12
References
[I] COLLINGWOOD, E.F. and LOHWATER, A.1.: The theory of clus- ASYMPTOTICALLY-STABLE SOLUTION - A solu-
ter sets, Cambridge Univ. Press, 1966, Chapt. I; 6.
tion of a differential system that is stable according to
[2] MAcLANE, O.R.: Asymptotic values of holomorphic functions,
Rice Univ. Studies, Math. Monographs, 49, Rice Univ., Hous- Lyapunov (d. Lyapunov stability) and that attracts all
ton, 1963. the other solutions with sufficiently close initial values.
VI. Gavrilov
Thus, the solution
E.D. Solomentsev
XC-T, ~o), x (lX, ~o) ::::: ~o,
Editorial comments. The most famous results on asymp-
totic values is the Oenjoy- Carleman- Ahlfors theorem. Let of the system dx
fez) be an entire function with n distinct (finite) asymptotic dT ::::: J(T, x) (*)
values at the point 00. Then fez) must be of order :?n / 2.
with a right-hand side f(T, ~), given for all T;;;'a, ~ERn,
This result was conjectured by A. Oenjoy (1907). The first
complete proof was given by L. Ahlfors (1929), after T.
and which is such that solutions of (*) exist and are
Carleman had obtained a less sharp result. See, for exam- unique, will be an asymptotically-stable solution if,
ple, [A1], Sect. 60. together with all its sufficiently close solutions
References X(T,~), I~-~o I <h, h>O,
[A 1] DING HAS, A.: Vorlesungen Uber Funktionentheorie, Springer,
1961.
it is defined for all T;;;'a and if for an arbitrary >0
there exists a 0, O<o<h, such that I ~-~o I <0 implies
AMS 1980 Subject Classification: 30D30, 30D40
II X(T, ~)- X(T, ~o) II < {
ASYMPTOTICALLY-EFFICIENT ESTIMATOR - A for all T;;;'a and
concept which extends the idea of an efficient estimator
to the case of large samples (cf. Effective estimator).
An asymptotically-efficient estimator has not been as T _____ + 00.
uniquely defined. Thus, in its classical variant it con- The concept of an asymptotically-stable solution was
cerns the asymptotic efficiency of an estimator in a introduced by A.M. Lyapunov [I]; it, together with
suitably restricted class R of estimators. In fact, let Tn various special types of uniform asymptotic stability, is
be a consistent estimator of a one-dimensional parame- extensively used in the theory of stability [2], [3], [4].
ter () constructed from a random sample of size n. Then References
Tn ESt if the variance rl-cVn Tn) exists,and if it is [1] LYAPliNOV, A.M.: 'ProbJeme general de la stabilite du mouve-
ment', in Ann. of Math. SlUdies, Vol. 17, Princeton Univ. Press,
bounded from below, as n-H/J, by the inverse of the 1947.
Fisher amount of information corresponding to one [2] KRASOVSKI1, N.N.: Stahility o(mOlion. Applications of

274
ATOM

Lyapunov's second method to differential systems and equations is called the asymptotic power function of the test Rn.
with delay, Stanford Univ. Press, 1963 (translated from the 0. V. Shalaevskil
Russian).
[3] HAHN, W.: Theorie und Anwendung der direkten Methode von AMS 1980 Subject Classification: 62F05
Ljapunov, Springer, 1959.
[4] RoveHE, N., HABETS, P. and LALOY, M.: Stability theory by
Liapunov's direct method, Springer, 1977. v S B d
ASYMPTOTICS OF ARITIIMETIC FUNCflONS,
1 U.. og anov asymptotics of number-theoretical functions - Approxi-
Editorial comments. mate representations of arithmetic functions (functions
defined for all natural number values of the argument)
References
[A1) HAHN, W.: Stability of motion, Springer, 1967. by means of comparatively simple expressions with
arbitrary small error terms. More precisely, for an
AMS 1980 Subject Classification: 34005, 34020
arithmetic function f(x) there exists an asymptotic if
one has an asymptotic indentity
ASYMPTOTICALLY-UNBIASED ESTIMATOR - A
concept indicating that the estimator is unbiased in the j(x) = <j>(x)+R(x),
limit (cf. Unbiased estimator). Let XI, X 2 , . . . , be a where cp(x) is the approximating function, and R(x) is
sequence of random variables on a probability space the error term, about which one knows in general only
(n, S, P), where P is one of the probability measures in that
a family fJJ. Let a function g(P) be given on the fam- lim..BQl = o.
x~oo<j>(x)
ily fJJ, and let there be a sequence of S-measurable
A short notation is: f(x)=cp(x)+o(cp(x or f(x)~cp(x)
functions Tn(X I , . . . ,Xn), n = 1, 2, ... , the mathemat-
(cf. Asymptotic formula).
ical expectations of which, Ep Tn (X I, . . . ,Xn), are
Finding asymptotics of arithmetic functions is one of
gJven. Then, if, as n .....H:fJ,
the most important problems in analytic number
EpTn(Xj"",Xn)~g(P), PE9, theory. This is explained by the fact that almost-all
one says that Tn is a function which is asymptotically arithmetic functions with interesting arithmetical pro-
unbiased for the function g. If one calls X I, X 2, . . . , perties are characterized by extreme irregularity in their
'observations' and Tn 'estimators', one obtains the changes as the argument increases. If one considers
definition of an asymptotically-unbiased estimator. In instead of an arithmetic function f(x) its average value
the simplest case of unlimited repeated sampling from a (~n .;;)"(n / x (n a natural number), then the 'irregu-
population, the distribution of which depends on a larity' of f(x) is smoothed out. Hence, a typical prob-
one-dimensional parameter ()Ee, an asymptotically- lem for an arithmetic function is that of obtaining an
unbiased estimator Tn for g(), constructed with respect asymptotic for its average value function. For example,
to the sample size n, satisfies the condition the average value of the function T(n), whose value at n
Eo Tn(X j , ... ,Xn) ~ g(8) is the number of divisors of n, is equal to
for any ()Ee, as n~oo.
I
- ~ T(n) ~ Inn.
0. V. Shalaevskil n m~n
AMS 1980 Subject Classification: 62F12 The problem that arises here, of the best possible
bound for the error term in the asymptotic identity, is
ASYMPTOTICALLY-UNBIASED TEST - A concept still unsolved (1984) for many functions, in particular
indicating that the test is unbiased in the limit. For for the function T(X) (cf. Analytic number theory).
example, in the case of n independent samples from a Asymptotics of arithmetic functions play an impor-
one-dimensional distribution depending on a parameter tant a role in additive problems (cf. Additive number
()En, let H be the null hypothesis: ()En H , and let K be theory). For many of them there is no known direct
the alternative: proof of the existence of decompositions of a number
8 E S2 K, S2 H U S2K = Q, QH U QK = 0. into terms of a given form. However, as soon as one
has an asymptotic for the number of decompositions of
The critical set Rn in the n-dimensional Euclidean
the type one is looking for, one can deduce that the
space, n = 1, 2, ... , is an asymptotically-unbiased test
required decomposition exists for all sufficiently large
of the hypothesis H with level LX if
numbers n.
lim P(R n 18) .:;;; a, 8EQH, B.M. Bredikhin
n~x

AMS 1980 Subject Classification: 10H25


a .:;;; lim P(R n 18), 8EQK'
n~x

The function ATOM - A minimal non-zero element of a partially


lim P(R n 18) ordered set with a zero 0, i.e. an element p such that
n~oo

275
ATOM

O<x~ implies x=p. boundary point). A conformal mapping of a simply-


L.A. Skornyakov
connected domain G onto the unit disc D: I z I < 1 can
Editorial comments. By a natural extension of meaning, be continuously extended to the non-terminal points of
the term atom is also used for an object of a category hav- an attainable boundary arc to a homeomorphism of the
ing no subobjects other than itself and the null subobject
open attainable boundary arc onto some open arc of
(cf. Null object of a category).
AMS 1980 Subject Classification: 06A 10
the circle I z I = 1. E.P. Dolzhenko
Editorial comments. The standard Western terminology
ATOMIC DISTRIBUTION - A probability distribu- is accessible boundary arc and accessible boundary point,
tion on a a-algebra s1 of subsets of a space Q concen- see e.g. [A 1].
trated on the set of its atoms (cf. Atom). A set A E s1 References
in a probability space (Q, s1, P) is called an atom of the [A1) COLLINGWOOD, E.F. and LOHWATER, A.I.: The theory of
cluster sets, Cambridge Univ. Press, 1966.
distribution if peA 0 and if it follows from A ~ BE s1 [A2] MA1ucUSHEVICH, A.I.: Theory of functions of a complex vari-
that P(B)=O or that peA \B)=O. In the particular able, 3, Chelsea, 1977, Chap!. 2 (translated from the Russian).
case in which the atoms are discrete points, an atomic
distribution coincides with a discrete distribution. AMS 1980 Subject Classification: 30040
A. V. Prokhorov
ATTAINABLE BOUNDARY POINT - A point on the
AMS 1980 Subject Classification: 60EXX
boundary of a domain together with the class of
equivalent paths leading from the interior of the
ATOMIC LATTICE - A lattice with a zero in which,
domain to that point. Let ~ be a point on the boundary
for each non-zero element a, there exists an atom p ";;a.
aG of a domain G in the complex z-plane and let there
o.A. Ivanova
exist a path described by the equation z =z(t), where
AMS 1980 Subject Classification: 06BXX the function z(t) is defined and continuous on a certain
segment [a, ,8], Z(t)EG if a";;t<,8, z(,8)=~. One then
ATOMIC RING - An integral domain with a unit ele- says that this path leads to the point ~ (from the inside
ment, satisfying the ascending chain condition (max- of G) and defines the attainable boundary point
imum condition) for principal ideals. In other words, represented by f Two paths leading to ~ are said to be
any family of principal ideals of an atomic ring has a equivalent (or, defining the same attainable boundary
maximal element. An element of the ring is called an point) if there exists a third path which also leads to ~
atom, or an extremal element, or an indecomposable ele-
from the inside of G and which has non-empty intersec-
ment, if it cannot be decomposed into a product of
tions inside G as close to ~ as one pleases with each of
non-invertible elements. An integral domain is an
the two paths considered. The totality of a point ~EaG
atomic ring if and only if each non-zero non-invertible
and the class of equivalent paths leading to ~ from the
element is a product of atoms. All Noetherian rings are
interior of G is said to be an attainable boundary point
atomic. Factorial rings are atomic rings in which each of the domain G. Not every point ~EaG represents an
atom is a prime element (i.e. an element that generates a attainable boundary point; on the other hand, the same
prime ideal). An atomic Bezout ring is a principal point ~EaG can represent several, or even an infinite
ideal ring. v.I. Danilov set of different, attainable boundary points.
An attainable boundary point is the unique point of
Editorial comments. There seems to be no special name
established in Western literature for these rings. Instead one a prime end (cf. Limit element) of the first kind; a
simply speaks of rings which satisfy the ascending chain (multi-point) prime end of the second kind contains
condition for principal ideals or the divisor chain condition. exactly one attainable boundary point, while prime
References
ends of the third and fourth kinds do not contain
[A1) GILMORE, R.: Multiplicative ideal theory, M. Dekker, 1972. attainable boundary points. Each point of the boun-
dary of a Jordan domain is attainable.
AMS 1980 Subject Classification: 13EXX, 13FXX,
1 6A02, 16A 14, 16A 1 5, 16A34 References
[I) MARKUSHEVICH, A.I.: Theory ojjunclIons of a complex variable.
3, Chelsea, 1977 (translated from the Russian).
ATTAINABLE BOUNDARY ARC of a domain G in (2) COLLINGWOOD, E.F. and LOHWATER, A.J.: The theor}' ofclus.
the z-plane - A Jordan arc forming part of the boun- ter sets, Cambridge Univ. Press, 1966.
E.P. Dolzhenko
dary of G and at the same time forming part of the
boundary of some Jordan domain g C G. Each point Editorial comments. The standard Western terminology
on an attainable boundary arc is an attainable (from is accessible boundary point.
the inside of g) boundary point of G (d. Attainable AMS 1980 Subject Classification: 30040

276
ATTRACTION. PARTIAL DOMAIN 01

ATIAINABLE SUBGROUP - A subgroup H that can F(-x) c,


- ~ - - as x~oo,
be included in a finite normal series of a group G, i.e. [l-F(x)+F(-x)] C,+C2
in a series for some Cj, C2~0, Cl +C2>0, determined by Vex),
{I} c H = Hoe H, c ... C H. =G and [1- F(x)+ F( - x)] a
[1-F(tx)+F(-tx)] ~t asx~oo,
in which each subgroup Hi is a normal subgroup in
Hi + 1. The property of a subgroup to be attainable is for each constant t >0. Restriction on the behaviour of
transitive. An intersection of attainable subgroups is an the normalizing coefficients Bn, n = 1, 2, ... , leads to
attainable subgroup. The subgroup generated by two narrower classes of distribution functions for which the
attainable subgroups need not be an attainable sub- convergence in distribution (*) holds. The set of distri-
group. A group G all subgroups of which are attainable bution functions F(x) for which (*) converges weakly,
satisfies the normalizer condition, i.e. all subgroups for a suitable choice of An, C >0 and Bn =cn -1 / 2 ,
differ from their normalizers (cf. Nonnalizer of a sub- n = 1,2, ... , to a stable distribution function Vex)
set). Such a group is therefore locally nilpotent. with exponent a, is called the normal domain of attrac-
tion for V(x). The normal domain of attraction of a
References
[I] KUROSH, A.G.: The theory of groups, 1-2, Chelsea, reprint, normal distribution coincides with the set of non-
1955-1956 (translated from the Russian). VM v t degenerate distributions with a finite variance.
. . I\.0PY ov
The normal domain of attraction of a non-degenerate
Editorial comments. Instead of attainable subgroup, the stable distribution function V(x) with exponent 0:
term accessible subgroup is used in [1]. In the Western (0<0:<2) is formed by the functions F(x) for which
literature the term subnormal subgroup is standard for this
kind of subgroup. lim
x~-oo
IF(XI~
x
= c, ;;;.. 0,
References
lim 1 - F(x) -- ::;;, 0
C2 ".-
[A1] SUZUKI, M.: Group theory, 2, Springer, 1986. x--+oo xa
AMS 1980 Subject Classification: 20F26 exist and are finite, where Cj, C2 are determined by
Vex).
ATfRACflON DOMAIN OF A STABLE DISTRIBU-
References
TION, domain of attraction of a stable distribution - The [I] GNEDENKO, B.Y. and KOLMOGOROV, A.N.: Limit distributions
totality of all distribution functions F(x) such that for for sums of independent random variables, Addison-Wesley,
a sequence of independent identically-distributed ran- 1954 (translated from the Russian).
[2] IBRAGIMov, LA. and LINNIK, Yu.V.: Independent and station-
dom variables Xl, X 2 , . , with distribution function ary sequences of random variables, Wolters-Noordhoff, 1971
F(x) and for a suitable choice of constants An and (translated from the Russian).
En >0, n = 1, 2, ... , the distribution of the random [3] PETRov, V.V.: Sums of independent random variables, Springer,
variable 1975 (translated from the Russian).
B.A. Rogozin

~Xk-A. AMS 1980 Subject Classification: 60E07, 60F05
-,,-k~---,''--_ _ , n =1,2, ... , (*)
B.
ATfRACflON, PARTIAL DOMAIN OF, of an
converges weakly, as n-Hx:;, to a non-degenerate distri- infinitely-divisible distribution - The set of all distribu-
bution function V(x), which is necessarily stable. tion functions F(x) such that for a sequence of
One of the fundamental problems in the theory of independent identically-distributed random variables
stable laws is the description of domains of attraction XI, X 2, . . . , with distribution function F, for an
of stable laws. Thus, for the normal distribution, A.Ya. appropriate choice of constants An and En >0,
Khinchin, W. Feller and P. Levy established in 1935 n = 1,2, ... , and a subsequence of integers
that F(x) belongs to the domain of attraction of a nor- n 1 <n2 < .. " the distribution functions of the ran-
mal law if and only if, as X-HlO, dom variables

x2 IY~XdF(y)
!
~o.
y2 dF(y)
Iy <x
converge weakly, as k-HIJ, to a (given) non-degenerate
Later B.V. Gnedenko and W. Doeblin (1940) gave a distribution function V(x) that is infinitely divisible;
description of the domain of attraction of a stable law every infinitely-divisible distribution has a non-empty
with exponent a, 0<a<2: F(x) belongs to the domain partial domain of attraction. There exist distribution
of attraction of a non-degenerate stable law V(x) with functions that do not belong to any partial domain of
exponent 0: if and only if: attraction and there also exist distribution functions

277
ATTRACTION. PARTIAL DOMAIN OF

that belong to the partial domain of attraction of any freedom may be defined as systems whose equations of
infinitely-divisible distribution. motion have one or more limit cycles in the phase
References plane (cf. Limit cycle).
[1) GNEDENKO, B.V. and KOLMOGOROV, A.N.: Limit distributions An important typical property of auto-oscillations is
for sums of independent random variables, Addison-Wesley, that their amplitude is largely independent of the initial
1954 (translated from the Russian).
B.A. Rogozin conditions, i.e. the existence of one or more ranges of
Editorial comments. The notion defined in this article initial conditions such that to any initial set of condi-
is also commonly called the domain of partial attraction. tions within anyone of these ranges there corresponds
the same auto-oscillation amplitude. This means that
AMS 1980 Subject Classification: 60E07, 60F05
oscillating systems may contain several stationary
processes with different amplitudes; each one of these
AUTO-CORRELATION of a stochastic process XI -
becomes realized in the system, depending on a choice
Correlation of the values of XI and XI +h. The term
of a region of initial conditions. This property consti-
'auto-correlation', along with the term 'correlation func-
tutes the basic difference between periodic motion in
tion', is mostly employed in studies of stationary sto-
an auto-oscillating system and periodic motion in a
chastic processes, in which the auto-correlation depends
conservative system. Another typical property of auto-
only on h and not on t (d. Stationary stochastic pro-
oscillating systems is the fact that the periods of the
cess).
A. V Prokhorov auto-oscillations are determined by the properties of
the system itself and are not imposed from outside.
Editorial comments. I.e. the auto-correlation of the pro-
This is the basic difference between auto-oscillations
cess Xt is the correlation coefficient of Xt and Xt + h.
and forced oscillations.
AMS 1980 Subject Classification: 62M10 An essential feature of auto-oscillations consists in
the fact that the loss of energy must be compensated by
AUTO-CORRELOGRAM of a stochastic process - The
a constant source of energy. Since in an autonomous
same as the correlogram. system the forces do not explicitly depend on time, this
AMS 1980 Subject Classification: 62M10 source of energy must produce a force which is not a
given function of time and which is determined by the
AUTO-OSCILLATION - A non-damped oscillation system itself. Constant sources of energy include the
in a non-linear dynamical system, whose amplitude and winding mechanism in a clock, a shaft rotating at a
frequency can remain constant during a long period of constant angular velocity, an endless belt moving at a
time, are largely independent of the initial conditions, constant rate, or a jet of liquid or gas having a constant
and are determined by the properties of the system velocity; in systems in which the motive force is an
itself (see also Oscillations, theory of). The term 'auto- electric flow, an electric battery may serve as such a
oscillation' was introduced by A.A. Andronov (see [1 D. source.
Dynamical systems capable of performing oscilla- An example of a very simple auto-oscillating system
tions are known as auto-oscillating systems. They is a pendulum in a viscous frictional medium, acted
include clocks, generators of electric vibrations, electric upon by a force of constant magnitude which always
bells, blow and bow musical instruments, etc. Under acts in the direction of the motion. The differential
certain conditions auto-oscillations can also be pro- equation of this dynamical system is
duced in dynamical systems which normally function x+2hx+k 2 x = F sign x,
without auto-oscillations (e.g. auto-oscillations in the
front suspension of a car-'noise', the flutter of an air- where h >0, F>O, k 2 are constant coefficients. Under
craft wing, or auto-oscillations in automatic control and any initial conditions a periodic oscillation becomes
monitoring systems). The simplest auto-oscillating sys- established in the system, the maximum deflection of
tem can be represented as consisting of a constant the pendulum from its position of equilibrium (ampli-
source of energy, a mechanism regulating the supply of tude) being
] +e- hT F
energy to the oscillating system, and the oscillating sys- XII = ]-e hT k2 '

tem itself. An essential feature of such a system is its


feedback nature: on the one hand, the regulating where T=1T / Yk 2 -h 2 is the half-period of the
mechanism controls the motion of the oscillating sys- periodic motion. A stable limit cycle in the phase plane
tem but, on the other, it is the motion of the oscillating xx corresponds to this motion [3].
system that influences the operation of the regulating A typical property of all auto-oscillating systems is
mechanism [6]. From the mathematical point of view, the connection between the constant source of energy
autonomous auto-oscillating systems with one degree of and the system. which is such that the energy given by

27R
AUTO-REGRESSIVE PROCESS

the source varies periodically, the period of this varia- [2) ANDRONOV, A.A., VITI, A.A. and KHAiKIN, S.E.: Theory of
tion being determined by the properties of the system. oscillations, Moscow, 1959 (in Russian).
[3) BUfENIN, N.V.: Elements of the theory of non-linear oscillations,
This may be expressed by saying that an auto- Blaisdell, 1965 (translated from the Russian).
oscillating system is a system which executes a periodic [4) GoRELIK, G.S.: Oscillations and waves, Moscow-Leningrad,
process at the expense of a non-periodic energy source. 1950 (in Russian).
[5) FEL'DBAUM, A.A: Introduction to the theory of non-linear
The shape of auto-oscillations can be close to chains, Moscow-Leningrad, 1948 (in Russian).
sinusoidal, but can also be very different. The latter [6) KIiARKEvICH, AA: Auto-oscillation, Moscow, 1953 (in Rus-
type of auto-oscillations is known as relaxation oscilla- sian). N. V Butenin
tion. Auto-oscillations with a shape close to sinusoidal
Editorial comments. The general mathematical definition
usually occur in systems in which the loss in energy
of 'autonomous auto-oscillating systems' would be: vector
during one period is small, and consequently, the
fields having attractors which are non-constant periodic
amount of the energy fed into the system is also small. solutions. The 'ranges of initial conditions' are the basins of
They may also occur when the energy losses in the sys- attraction of these solutions. In conservative systems,
tem during one period are large, but if the parameters periodic solutions cannot be attractors and, if all solutions in
are properly chosen, the losses are compensated not an open set are periodic, the amplitude and period will usu-
merely for each period, but also for each small fraction ally depend on the initial conditions. The 'soft auto-
of a period. Such systems include the so-called Re- oscillations' can be identified with the Hopf bifurcation in
generators of sinusoidal oscillations [4]. During relaxa- dynamical systems.
tion oscillations energy losses are usually large and are In [A1], especially pages 517 and 504 are useful.
compensated for after one period by almost the entire References
energy of the oscillating system. [A1] ABRAHAM, R. and MARSDEN, J.E.: Foundations of mechanics,
Benjamin-Cummings, 1978.
The generation of auto-oscillations may be 'soft' or
'hard' [2]. In the former case, variation of some param- AMS 1980 Subject Classification: 58F21, 58FXX,
eter of a system in a stable equilibrium results in the 93CXX
generation of auto-oscillations whose amplitude steadily
increases from zero as the parameter is continuously AUTO-REGRESSION - A regressive dependence of
varied. If the parameter is now varied in the opposite the values of Xn of a given random sequence {Xn:
direction, the amplitude of the auto-oscillation steadily n =0, + 1, ... } on the preceding values of
decreases to zero, and the system returns to a stable Xn - I , . ,Xn - m . A linear auto-regression scheme of
equilibrium. If the generation of the auto-oscillations is order m is defined by a linear regression equation
hard, the system passes from a state of stable equili- between Xn and Xn - k , k=I, ... ,m, i.e.
brium to an auto-oscillation with finite amplitude as Xn == !31Xn- 1+ ... +!3m Xn-m+n, (*)
one of its parameters is slowly and continuously varied; where 131, ... ,13m are constants and the random vari-
as the parameter is further varied, the rate of increase ables (n are identically distributed with average zero,
of the amplitude becomes constant. If the parameter is variance r? and are uncorrelated (sometimes they are
varied in the reverse direction, the amplitude continu- assumed to be independent). An auto-regression
ously decreases and, after it has attained a certain scheme is a useful stochastic model for the description
value, the system returns to the state of stable equili- of certain time series (the concept of a linear auto-
brium. It is important to note in this connection that regression scheme was first introduced by G. Yule in
the system passes from stable equilibrium to auto- 1921) in order to analyze time series describing a sys-
oscillations and from auto-oscillations back to stable tem which is oscillating under the effect of internal
equilibrium at different values of the parameter. forces and random external shocks. The auto-regression
Auto-oscillating systems display an interesting and scheme (*) may be regarded as a stochastic process of a
important property - the effect of enforced synchroni- special type: an auto-regressive process.
zation, which is sometimes called 'capture' or 'mode A. V Prokhorov
locking'. This takes place when the difference between
the frequency of the auto-oscillating system and that of AMS 1980 Subject Classification: 62JXX
the external force acting on it is sufficiently small. The
AUTO-REGRESSIVE PROCESS - A stochastic pro-
steady periodic motion of the system then assumes the
cess {Xt } whose values satisfy an auto-regression equa-
frequency of the external force, i.e. the external force
tion with certain constants aI, ... , ap :
imparts its own frequency to the auto-oscillating system
[3], [5]. X, == a1X'-1 + ... +apX,_p + Y" (*)

References where p is some positive number and where the vari-


[1) ANDRONOV, A.A: Collected work, Moscow, 1956 (in Russian). ables Yt are usually assumed to be uncorrelated and

279
AUTO-REGRESSIVE PROCESS

identically distributed around their average value 0 AUTOMATA, ALGEBRAIC THEORY OF - A branch
with a variance 01. If all the zeros of the function of the theory of automata (cf. Automata, theory of) in
cp(z) =zP - a 1zP -I - . . . - ap of a complex variable z which algebraic tools are employed in the study of
lie inside the unit circle, then equation (*) has the solu- automata. The algebraic theory of automata is based on
tion 00
the fact that automata can be regarded as special alge-
~ = ~b.Y,-" bras or algebraic systems. In addition, events which are
.=0
representable by finite automata form an algebra with
where the b. are connected with the aj by the relation respect to the operations of union, composition and
iteration, generated by a finite set of so-called elemen-
tary events, each one of which consists of a one-letter
For example, let {Yt } be a white noise process with or empty word. The algebraic approach makes it possi-
spectral density 01 /2'IT; in such a case the only kind of ble to utilize algebraic results in the theory of automata
auto-regressive process satisfying equation (*) will be a directly, and in certain cases aids in establishing a con-
process {Xt } with spectral density nection between the theory of automata and other
branches of mathematics. E.g., the theory of automata
f(A) = [~) 1 cp(e iA ) 1- 2 , was used to obtain a proof of the solvability of certain
second-order arithmetical theories, and also to obtain a
which is stationary in the wide sense if </l(eo,) has no new, simpler solution of the Burnside problem.
real zeros. The autocovariances (cf. Autocovariance) from the algebraic point of view, a finite or infinite
rk = EXtXt -k of the process satisfy the recurrence rela- automata 2f=(A, S, B, cp,l[;) is a tribasic algebra, i.e. an
tion algebra with three sets of elements A, S and B and two
rk = al'k-l + ... +ap'k-p, k>O,
operations cp: S XA ~S and 1[;: S XA ~B. On the other
and, in terms of the b., have the form hand, a transition system (A, S, cp), where
00
A ={ai, ... ,an} is the input alphabet, S is the set of
'k = rl- ~b.b.+k' states (cf. Automaton, fmite) can be regarded as an
.=0
The parameters ak of the auto-regression are connected algebra <S, a b . . . ,an> with unary operations,
with the auto-correlation coefficients Pk =rk / ro of the denoted by the letters aj of the input alphabet, such
process by the matrix relation that ajsj = CP(Sj , aj). Concepts such as a homomorphism
of automata (cf. Automata, homomorp1sm of), an iso-
A = R-lp*
P P' morphism, a sub-automaton, etc., as applied to auto-
where A =(ab ... ,apl, P;=(PI,'" ,pp ) and Rp is mata, are thus naturally defined. This approach also
the matrix of auto-correlation coefficients (the permits the assignment to the automaton
Yule - Walker equation). 2f=(A, S, E, cp,l[;) of a semi-group of transformations
References of the set S (under the composition operation), gen-
[l] GRENANDER, U. and ROSENBLAIT, M.: Statistical analysis of erated by the operations aj, so that for any arbitrary aj,
stationary time series, Wiley, 1957.
[2] HANNAN, EJ.: Time series analysis, Methuen, London, 1960.
aj from A and S from S one puts
A. V. Prokhorov ajajS = cp(cp(s, aj), aj).
AMS 1980 Subject Classification: 62JXX
This semi-group is known as the semi-group of the auto-
AUTOCOVARIANCE of a stochastic process Xl - The maton 2 and is used as a means of describing certain
covariance of Xl and Xl + h . If EX denotes the properties of automata (classification, decomposability,
mathematical expectation of a random variable X, then isomorphism, etc.) in algebraic language. Also, to any
the autocovariance equals semi-group Q with a unit element there may be
assigned an automaton with a given input alphabet
E(Xt - EXt )(Xt +h -- EXt f h)'
A = { ai, ... ,an} and a set of states Q as follows. To
The term 'autocovariance' is usually applied to station- each letter aj of A is assigned some element qj from Q,
ary (in the wide sense) stochastic processes (d. Station- and the transition function is then defined as
ary stochastic process). For such processes the autoco- </l(q, aj)=qqj. The semi-group of such an automaton is
variance depends only on h and differs from the auto- isomorphic to the sub-semi-group of Q generated by
correlation by the presence of a single factor which is the elements q I, . . . ,qn, which means that if
equal to the variance of XI' The terms 'covariance q 1, . ,qn are generators of Q, then the semi-group of
function' and 'autocovariance function' are used the automaton is isomorphic to the original semi-group
together with the term 'autocovariance'. Q. The semi-group of an automaton is clearly iso-
A. V. Prokhorov morphic to a quotient semi-group of the input semi-
AMS 1980 Subject Classification: 62M10 group A' of all words over the alphabet A with the

280
AUTOMATA. COMPLETE SYSTEMS OF

operation of successive union (concatenation) of words tain set 0 of operations with values in M has been
according to the congruence defined. These subsets have the following fundamental
R = {(a, fJ): a, fJEA' and for all s ES(I[>(S, a)=l[>(s, fJ))}. completeness property: The set of all automata obtained
by a finite number of operations from 0 to automata
For any state s ES the congruence R is the maximal
from a given subset A c;;,M, is identical with M. The
subcongruence of the relation
problem of whether a set is complete or not is known
Rs = {(a, fJ): a, fJEA' and I[>(s, a)=l[>(s, fJ)}. as the completeness problem for automata. It has been
This means, in particular, that an event which can be studied for various models of automata and operations.
represented by the initial acceptor 2(s =(A, S, cp, S') is The range of objects of such studies includes, in order
a union of certain R-classes. Since the semi-group of an of increasing complexity, true automata, automata real-
automaton characterizes it up to an isomorphism, vari- izing functions with delays (i.e. functions of k-valued
ous classes of semi-groups have their own classes of logic with a certain shift in time), finite automata, etc.
automata. If the semi-group of an automaton is free, The completeness problem of true automata with super-
Abelian, cyclic, nilpotent, etc., or is itself a group, the position operations (which are usually considered) is
automaton is known, respectively, as a free, Abelian, essentially the same as the completeness problem for
cyclic, nilpotent or a group automaton; these last are also functions of k-valued logic (see Many-valued logic) and
called permutation automata. Another approach, con- has been fairly thoroughly studied. Synchronous super-
nected with the algebraic classification of transition and position is defined as superposition of automata for
output functions, leads to the classes of linear, subsitu- which the automata realizing functions with the same
tion and other automata (cf. Automaton). Substitution delay are combined at all inputs. The criteria of com-
automata realize one-to-one functions and are used in pleteness, found under these conditions, imply, in par-
coding theory. Linear automata are of interest because ticular, the existence of an algorithm by means of
of their simple structure. which the completeness or incompleteness of any finite
An automaton (A, S, B, CP,1/I) is said to be a linear system of automata can be established. Fundamental
automaton if A, S and B are linear spaces over some criteria of completeness are given in terms of the so-
field P, called pre-complete classes (i.e. subsets of the class M
that are closed with respect to the operations from 0,
I[>(s, a) = (l>I(s)+~(a), #.s, a) = lh(s)+1jJ2(a), each one of which forms together with any automaton
where CPI, <f>2, 1/11, 1/12 are, respectively, linear mappings of not belonging to it a complete system). It has been
S into S, A into S, S into B, and A into B. It is usually shown that a set A is complete if and only if it is not a
assumed that the field P is finite, and that the spaces subset of any pre-complete class. These classes, in tum,
A, S and B are finite-dimensional; in such a case a have been fully described. This approach was success-
linear automaton is a finite automaton. If multiple fully employed in a large number of other cases. It is
operations are permitted in the representation of a fin- also applicable, in principle, to finite automata if the
ite acceptor as an algebra whose operations are letters operation of superposition of automata and the feed-
of the input alphabet, the resulting generalization is back operation are chosen as 0 (cf. Automata, composi-
known as an automaton over terms (automaton over tion of). The criterion given above is applicable also in
trees; generalized automaton). Such automata are used this case; it has been established, however, that the
to prove the solvability of certain second-order family of pre-complete classes is a continual class,
mathematical theories. which excludes the existence of any effective criteria in
References these terms. One may note that in all the cases men-
[1) Algebraic theory of automata, languages and semi-groups, Mos- tioned here there exist finite complete systems, and for
cow, 1975 (in Russian). this reason the completeness problem for arbitrary sys-
[2] GLUSHKOV, Y.M.: 'The abstract theory of automata', Russian
Math. Surveys 16, no. 5 (1961), 1-53. (Uspekhi Mat. Nauk 16,
tems of automata is reduced to the completeness prob-
no. 5 (1961), 3-62) lem for finite systems of automata. One must consider
(3) THATCHER, J.W. and WRIGHT, J.B.: Kibernet. Sb. 6 (1969), in this context, as above, the problem of algorithmic
112-114.
solvability of the completeness problem for finite sys-
See also the references to Automaton. tems of finite automata. The problem may be general-
V.B. Kudryavtsev ized as follows: To determine, for a given automaton a
Yu.I. Yanov and set B, whether or not a may be obtained from
AMS 1980 Subject Classification: 18820, 68D05 automata in B by effecting a given set of operations.
One must thus study the predicate P (x, y) - 'the auto-
AUTOMATA, COMPLETE SYSTEMS OF - Special maton x is realized by means of a set y'. It was found
subsets of a given class M of automata on which a cer- that the problem of recognition of realizability is algo-

281
AUTOMATA, COMPLETE SYSTEMS OF

rithmically unsolvable for any given a, i.e. that the ;=1, ... ,n, is the automaton 2l=(A, S, B, Cp,o/) for
one-place predicate P (a, y) is non-recursive. On the which
other hand, for various values B of the parameter y, the S = SIX ... XSn , A =AIX ... XA n ,
predicate P(x, B) may be either recursive or non-
recursive. A problem connected with the algorithmic B = BI X .. , XBn
unsolvability of the completeness problem for automata while the functions <p(s, a) and o/{s, a) are defined by
is the problem of finding classes of sets for which it has the relations
an effective solution. In particular, there exists an algo-
rithm for the recognition of the completeness of sys-
tems consisting only of Moore automata and all true = (4)I(SIo al), ... ,4>n(sn, an,
automata. A related problem is that of finding concrete 1/{(SIo ... ,sn)' (a 10 ... ,an =
complete sets of automata with given properties. It was = (lh (s 10 al), ... ,I/In(sn, an.
found that, for any positive integer n, there exists a
The superposition of automata
complete system of automata no proper subset of which
211 =(A h SI, B I , </>1, 0/1) and 212 =(Bh S2, B 2 , </>2, 0/2) is
is complete, and that, for a given n, the number of such
the automaton ~ = (A I, S, B 2, cp, 0/) for which
systems is infinite. There also exists a most simple, in a
S=S2 XS I and
certain sense, automaton with two states, two input
channels and one output channel that constitutes a 4>(S2,SI),a) = (f/>z(S2, I/I1(SIo a, 4>1 (s 10 a,
complete system. The completeness problem has also 1/{(S2,sl),a) = 1/12(S2,lh(sloa,
been studied for different generalizations of finite auto-
where (S2, SI)ES and a EA I.
mata and operations performed on them; another kind In problems of completeness and synthesis of auto-
of generalization concerns the introduction of various mata the feedback operation is very important. This
equivalence relations into the class of automata under operation is applicable to automata with n inputs and
consideration. m outputs 2l=(A, S, B, </>,1/;) where A =A I X ... XA n ,
References B=BI X ... XB m ,
[I] YABLONSKII, S.V.: 'Functional constructions in k-placed logic',
Trudy Mat. Inst. Steklov. SI (1958), 5-142 (in Russian). 1/{s, (a 10 ... ,an =
[2] KUDRYAVTSEV, V.B.: 'The completeness theorem for one class
of feedback automata', Problemy Kibernet. 8 (1958),91-115 (in
= (I/II(S, (a 10 ,an, ... ,I/Im(s, (a 10 . ,an),
Russian). such that for certain i, j the relation B; =Aj is true and
[3] KUDRYAVTSEV, V.B.: 'On the cardinality of sets of pre-
complete sets of certain function systems, related to automata', the function 0/; is independent of aj, i.e.
Problemy Kibernet. (1965),45-74 (in Russian).
[4] KRATKO, M.l.: 'On the existence of non-recursive bases of fin-
I/Ii = I/Ii(S, (a 10 ... ,aj-I, aj+lo ... ,an,
ite automata', Algebra i Logika 3, no. 2 (1964), 33-44 (in Rus- Under these conditions, the feedback operation on
sian).
[5] LETICHEVSKll, A.A.: Conditional completeness in the class of the ;-th output and j-th input of automaton 21 yields
Moore automata, Kiev, 1963 (in Russian). the automaton 2f = (A', S, B, </>',1/;') such that
[6] BUEVICH, V.A.: 'The construction of universal finitely- A'=AIX'" XA} - IXA+IX'" XA n,
}
determined functions in two input variables, having two inter-
nal states', Problemy Kibernet., no. 22 (1970), 75-83 (in Rus- <I>'(s, (aI, ... ,aj-lo aj+l, ... ,an =
sian).
V.B. Kydryavtsev = 4>(s, (aI, ... ,aj-lo
Editorial commp.nts. The topic of this article is rather
esoteric. The problem of completeness (in this sense) has
1/1' (s, (a 10 , aj - 10 aj + I, ... , an =
not really been studied outside the Soviet Union and is not
considered very significant in the Western literature. = 1/{s, (aI, ... ,aj_1>
AMS 1980 Subject Classification: 68D05, 03D05 "'i(S, (al,. ,aj_l, aj+I, .. ,an, aj+I, .. ,an,

There are also other kinds of compositions of auto-


AUTOMATA, COMPOSmON OF - Operations by
mata, e.g. the product, direct sum, semi-direct product,
which more complex automata are obtained from other
etc.
automata by combining the initial automata in accor-
dance with certain rules. Composition of automata References
[1) GLUSHKOV, V.M.: 'The abstract theory of automata', Russian
plays an important role in problems of synthesis and
Math. Surveys 16, no. 5 (1961), I-53. (Uspekhi Mat. Nauk 16,
decomposition of automata. The most important and no. 5 (1961), 3-62)
the most frequently used compositions of automata are [2) KUDRYAVTSEV, V.B.: 'On the cardinality of sets of pre-
the direct product, superposition and feedback. complete sets of certain function systems, related to automata',
Problemy Kibernet. (1965),45-74 (in Russian). VN. Red'ko
The direct product of automata 21; =(A;, S;, B;, cp;, 1/;;),

282
AUTOMATA. EXPERIMENTS WITH

Editorial comments. For more information see [A1]. Another problem in the study of equivalence of auto-
References mata is the problem of equivalent transfonnations of
[A 1] GEISEG, F.: Products of automata, Springer, 1986. automata. This problem was studied for two of the pos-
AMS 1980 Subject Classification: 68005, 20M35 sible ways of defining finite automata - diagrams and
logical networks. Generally speaking, the problem con-
AUTOMATA, EQUIVALENCE OF - An equivalence sists in finding a system of transformation rules which
relation on the set of automata, which arises in the con- satisfy certain conditions and which enable one to
text of studying some individual internal properties of transform any given automaton into an automaton
automata. Such a property is usually the behaviour of equivalent to it: a so-called complete system of rules. In
automata (cf. Automaton, behaviour of an), since two both cases the transformation rule is a pair of schemes
automata are considered equivalent if their behaviour is (fragments of diagrams or logical networks) which real-
identical. In this context the behaviour of an automa- ize identical choices of mappings. The rule is applied
ton is usually understood to be the totality of functions by substituting one fragment for another. For finite
realized by it (cf. Automaton, fmite). For finite auto- automata a complete system of rules cannot exist, but
mata such an equivalence relation is solvable, and for they do exist for logical networks with a limited
this reason there exists a minimization algorithm for number of delays.
automata, i.e. an algorithm used to construct for a The fundamental concepts, problems and methods
given automaton an equivalent automaton with the which arise in the study of the equivalence of finite
smallest possible number of states (a minimal automa- automata are usually extended to other types of auto-
ton). mata, with due regard for their particular features. For
Equivalence of automata is conveniently defined in references see Automaton.
V.B. Kudryavtsev
terms of the concept of equivalence of states: Two states
Yu.I. Yanov
s and s' of the automata ~ and ~' (they may be identi-
cal) are said to be equivalent if the initialized automata Editorial comments. The reader is referred to the arti-
~s and ~~. realize the same function. Such an cles Formal languages and L-systems. Most of the topics
equivalence of the automata ~ and ~' will then mean in this article are nowadays not considered to be in the
that it is possible to find, for an arbitrary state of ~, a mainstream anymore.
state of ~' equivalent to it, and vice versa. An automa- AMS 1980 Subject Classification: 68005
ton is minimal if and only if no two states of it are
equivalent. The minimal automaton is unambiguously AUTOMATA, EXPERIMENTS WITH - A method of
defined for any automaton, up to an isomorphism. The obtaining information about the internal structure of an
algorithm of solution of this equivalence relation for automaton from its behaviour as can be deduced from
finite automata is based on Moore's theorem, according external experiments (that is, experiments where input
to which states s and s' of ~ are equivalent precisely if words are fed into an automaton, the corresponding
the functions realized by the initialized automata ~s sequence of output words is examined and conclusions
and ~s are identical on words of length n -1, where n are drawn on the basis of these observations).
is the number of states of ~. If the states sand s' Experiments with automata can be used to seek
belong, respectively, to two automata ~ and ~', the approaches to the solution of the following problems:
estimate is equal to n +n' -1, where n' is the number 1) Given that an automaton is in an initial state it is
of states of 2{'. This theorem also forms the principle of required to determine this state.
a well-known minimization algorithm of finite auto- 2) The construction of an experiment that transfers
mata, which involves the construction of the so-called an automaton from any state to some pre-assigned state
reduced automaton, the states of which are classes of (the adjustment problem).
equivalent states, while its transition and output func- 3) The verification of the correct functioning of an
tions are naturally induced by the corresponding func- automaton. It is required to find out by means of an
tions of the original automaton. A reduced automaton experiment if a given automaton functions correctly.
is minimal, since any two of its states are non- 4) The diagnostic of an automaton. It is required to
equivalent. find out not only if an automaton is in good working
There exist asymptotic estimates of the number order, but also what kind of malfunction occurs.
A (m, n, p) of minimal automata with n states, m input 5) The problem of identifying an automaton from a
and p output letters; if m+n+p-HX) for m;;;;;'3, the given class (deciphering). Knowing that the automaton,
estimate is A (m, n, p ),....,,(np )mn / n !, while viewed as a 'black box', belongs to a given set, it is
(n" \2" required to find out which element the automaton is.
A(2, n,p) ~ e-l/(2P')~.
n. Let ~ be a class of initialized automata with input

283
AUTOMATA, EXPERIMENTS WITH

alphabet X and output alphabet Y. Next, let X be the automaton. The following theorem on the distinguisha-
collection of all words in X, let aEX and let A em:. bility of states by a simple experiment holds: If A is an
An experiment with the automaton A is a set [a, A] of all (r, m, p )-automaton with all states distinct, then the dis-
pairs of the form (x,y) such that x Ca and y is a word tinguishability of any two states can be established by
into which A produces x. means of a simple experiment of length r - 1, and the
The classification of experiments. The following types bound r -1 cannot be lowered (see [4]).
of experiments are usually considered: The problem of determining the initial state of an
1. A simple unconditional experiment - a set [a, A], automaton can be solved by means of a simple uncon-
where a consists of a single word. In the process of ditional experiment of length A, where
experimenting the single element a is fed into the input 3,(I+'d/ 6 ,,;;; A< 3,(1+<2)/6 (j,(2- 0 as r_oo).
of the automaton aLd the resulting output is some
word from Y. Adjustment for an automaton with r states of which
2. A simple conditional experiment - a set [a, A], k are admissible can always be solved by means of a
where a consists of a single word X=(X(I), ... ,X(T simple unconditional experiment of length A, where
and where for any j E {1, ... ,T} the values X(j) A:OS:::(2r-k)(k-l)/2, and this bound cannot be
depend on [a',A], a'=(X(I), ... ,X(j-I. Thus, in improved.
the process of experimenting with automata every sub- As a rule, the above estimates are exact only for a
sequent letter of the input word depends on the output small number of automata. It has been established (see
word obtained earlier. [4]) that the problem of determining the initial state for
3. A multiple unconditional experiment - a generali- aUnost-all (r, m, p )-automata with two admissible states
zation of a simple unconditional experiment to the case can be solved by means of a simple unconditional
when the set [a, A] is such that a consists of more than experiment of length A, where A:os:::log", lo&> r +4, and
one word. It is usually assumed that in the process of the adjustment problem for almost-all (r, m, p)-
execution of the given experiment there is more than automata by means of a simple unconditional experi-
one specimen of the automaton A, or that the automa- ment of length A, where A< 510&> r.
ton can be reset in its initial state. There is no algorithm that would permit the decy-
4. A multiple conditional experiment - a generaliza- phering of initialized Mealy automata with unknown
tion of a simple conditional experiment to the case number of states. However, it turns out that a great
when the set [a, A] is such that a consists of more than part of such automata can nevertheless be decyphered.
one word: Experiments with automata are also used as tools for
the control of the functioning of automata. A unified
a = {(XI(I), ... ,X1(TI,'" , (Xs(l), ... ,X,(TJ)}. approach has been developed to problems of control of
Moreover, two possibilities can be distinguished here: automata, which makes it possible to indicate funda-
a) for any i E {l, ... ,s} and j E {l, ... ,T;} the value mental methods for their solution. An effective algo-
of Xi(j) depends only on [a', A], where a' consists of rithm has been constructed for finding all dead-end
the single word (Xi(l), ... ,Xi(j -1; or b) for any multiple experiments for various automata (see [2]).
iE{1, ... ,s} and jE{l, ... ,T;} the value of x,(j) References
depends on [a", A], where a" is the set consisting of the [1] MOORE, E.F.: 'Gedanken experiments on sequential machines',
. \
coII ectlOns {(X\(l), ... ,X\(tj, ... , in C. Shannon and 1. MacCarthy (eds.): Automata studies,
Princeton Univ. Press, 1956, pp. 179-210.
(Xs(l), ... ,X,(tj} and the [2] YABLONSKli, S.V.: 'Mathematical logic, theory of algorithms
index t;
for i E { 1, ... ,s} coincides with Ti if j - I >Ti and theory of sets', Trudy Mat. [nst. Steklov. 133, 263-272 (in
Russian).
or withj-l ifj-l:OS:::Ti' [3] TRAKHTENBROT, B.A. and BARZDIN', YA.M.: Finite automata.
Measures of the complexity of experiments. These are Behaviour and synthesiS, North-Holland, 1973 (translated from
some numerical characteristics that estimate the degree the Russian).
[4] HIBBARD, T.N.: 'Lower upper bounds on minimal terminal
of complexity of an experiment. For a multiple experi-
state experiments for two classes of sequential machines', J.
ment the usual measures of complexity are the heighl of Assoc. Comput. Mach. IS, no. 4 (961),601-612.
the experiment - the number of symbols of the largest Kh.A. Madatyan
input word, the multiplicity - the number of input AMS 1980 Subject Classification: 68050
words, and the length - the total number of symbols
in all the input words. It is considered that the measure AUTOMATA, HOMOMORPlDSM OF - A mapping
of complexity of a simple experiment is its length (the of the input and output alphabets and of the set of
number of symbols of the input word used). states of an automaton into analogous sets of a second
Some results. A Moore automaton A with r states, m automaton that preserves the transition and output
input and p output symbols is called an (r, m, p)- functions. More precisely, a homomorphism of an auto-

284
AUTOMATA, METHODS OF SPECIFICATION OF

maton = (A I, S I, B I, <1>1,1/;1) into an automaton


~I automaton (cf. Automaton, behaviour of an). The func-
~2=(A2,S2,B2,<i>2,1/;2) (cf. Automaton, fmite) is a tions <I> and I/; are often defined by means of a table, a
mapping h=(h l ,h 2 ,h 3 ) of the set AIXSIXB I into diagram or a matrix of transitions. The table of transi-
the set A 2 X S 2 X B 2 tions T of the automaton ~ consists of two subtables
T.p=llttll and T",=llttll, i=l, ... ,n,
hl:A I - A 2, h2: SI - S2, h3: BI - B 2,
j = 1, ... ,m, tt ES, tt EB. The functions <I> and I/; are
such that the following equalities are valid for any s defined by </>(Sj, aj)=tt, I/;(Sj, aj)=tt. If all the
from S I and a from A I : columns of T", coincide, the table T defines a Moore
h 2<PI(S, a) = </l2(h2(S), hl(a)), automaton. For instance, let A 0 = {a I, a2},
h3!/JI(S, a) = !/J2(h2(S), hl(a)). SO={SI,S2}, Bo={bJ,b 2 }; the table T (Fig. 1) will
then specify the functions <Po and 1/;0 of some automa-
Initialized automata are subject to the additional
ton ~ (Fig. 2 shows the subtables T.p and T", of 1).
requirement that h maps the initial state to the initial
state. The automata ~I and ~2 are said to be I
0, I 02
homomorphic if there exists a homomorphism h of auto- I
I
mata mapping A I XS I XB I into A2 XS 2 XB 2 If, in s, s,.b 2 : S2.b2 s, s, I S2 S, b2 : b2
- ----1--- ---t--- ---r--
addition, h is one-to-one, h is called an isomorphism, S2 S2.b, I s,.b 2
I
S2 I
I
S, S2 b, I b2
and the automata ~I and ~2 are said to be isomorphic
automata. If the alphabets A I and A 2, and also the '---~v,..-----' ~--------~yr--------~
Fig. 1. Fig. 2.
alphabets B I and B 2, are identical and the mappings
h I and h 3 are the identity mappings, the homomor-
The diagram of (transitions oj) an automaton is an
phism (isomorphism) h is known as a state homomor-
oriented graph G with a one-to-one correspondence
phism (state isomorphism). Input and output homomor-
between the vertices of the graph and the elements of
phisms (isomorphisms) are defined in a similar manner.
S, and between the edges of the graph and certain sets
State-isomorphic automata and state-homomorphic ini-
of pairs of the form (a, b), a EA, b EB. Each vertex of
tialized automata are equivalent (cf. Automata,
G is the starting point of at least one edge, and the set
equivalence of).
~ of all pairs assigned to the edges with the same ver-
The concept of a homomorphism of automata is used
tex as their ongm has the fonn
in the context of problems concerning minimization,
~={(al,bjJ, ... ,(am,b;J}. The functions <I> and I/;
decomposition and completeness of automata, among
are then defined as follows: </>(Sj, aj)=s" I/;(Sj, aj)=bp if
others.
the pair (aj, bp ) is assigned to the edge issuing from
References from Sj and if this edge ends in the vertex s,. It is con-
[II GLUSHKOV, Y.M.: 'The abstract theory of automata', Russian
Math. Surveys 16, no. 5 (1961), 1-53. (Uspekhi Mat. Nauk 16, venient to fonnulate certain properties of automata in
no. 5 (1961), 3-62) the language of diagrams (connectivity of an automa-
A.A. Letichevskil
ton, attainability of states, etc.). Figure 3 shows the
AMS 1980 Subject Classification: 68005, 20M35 transition diagram of the automaton ~.

AUTOMATA, METHODS OF SPECIFICATION OF -


The various ways of describing automata and their
functioning or behaviour. The method of specification
of an automaton will depend on the approach to the Fig. 3.
concept of an automaton. If the macro-approach is The transition matrix is used to describe the function-
adopted (cf. Automaton, fmite), it is the external ing of the transition system <A, S, <1 (cf. Automaton,
behaviour of an automaton which is described. If the finite). It is represented by the (n Xn)-matrix
micro-approach is adopted, the definition should P = II Pij II whose elements are subsets of the alphabet
include a description of the elements from which the A (including the empty subset) such that a EPij if and
automaton is constructed, and the schemes of their only if </>(Sj, a)=sj, so that for any i= 1, ... ,n the rela-
composition. Methods of specifying finite automata are tions Pjp n
P j, = 0 if P =l=r and U n = IPij = A are true.
described below. In order to extend the function <l>i to the set S XA
Macro-approach. To specify a finite automaton (where A is the set of all words over the alphabet A
~= <A, S, B, <1>,1/;> for given alphabets including the empty word), one considers the sequence
A = {a I, ... ,am}, S = {s I, ... , Sn } and of powers of the matrix P. Multiplication of P by itself
B = {b I, ... , bd is tantamount to describing the func- is carried out by the usual algorithm, using the opera-
tions <I> and I/; or to describing the behaviour of this tions of concatenation and union of word sets instead of

285
AUTOMATA. METHODS OF SPECIFICATION OF

multiplication and addition. If a EA is a word of semi-Thue grammar, etc.). In this case the semi-Thue
length d and aEP~f> where P~f> is an element of the system is given as the quadruple T= <A, S,~, w>
matrix pd, one has q,(Si' a)=Sj. Thus, the transition where A, S are finite alphabets, A n S = 0, ~ is an
matrix P of the transition system <Ao, So, 1/10> and axiom scheme of the form soX and w is the set of
the matrix p2 have, respectively, the forms: schemes of derivation rules of the form saX~s' X, r~r,

/I:: :~ II,
where s, s', rES, aEA and X is a variable which
P = assumes values from A *. If, moreover, saX~s' X and
saX~S" X belong to w, one has s' = s". The word a EA
{alaI> a2a2} {ala2, a 2a d "
p2 = PXP = {a2al> ala2} {a2 a2, alai} .
is deducible in the system T if there exists a sequence
of words soa=wJ, ... ,Wn such that Wi+1 is obtained
"
A number of minimization (reduction) algorithms and from Wi, i = 1, ... ,n -1, by the application of a cer-
algorithms of automaton synthesis are associated with tain rule from w, Wn ES and w does not contain the rule
these methods of specifying automata. wn~wn' A grammar generating a regular event has a
In order to specify the behaviour of an initialized similar form. It is given by the quadruple
(not necessarily finite) automaton ~s, (transformer), f=(A, S, SJ, w) where SI from S is an axiom, w is the
one must define the functionj(a)=t/{sJ, a) which maps set of rules of the form s;~asj or s;~a. The word
A into B* (or A 00 into Boo, where A 00, BOO are the a=al ... an is deducible in f if w includes the rules
sets of all the superwords over the alphabets A and B, SI ~als;" s;, ~a2s;" ... ,Sin ~an' Algorithms which
respectively). yield the transition matrix of an automaton from for-
mal schemes of this type are known. Thus, an event
represented in the acceptor <Ao, So, 1/10, SI > by the
state S2 may be defined, for example, as the set of
words that are deducible in the semi-Thue system

Fig. 4.
.r = <Ao,SO,~={SIX},
"'={Sla2X~S2X; slaIX~sIX;
This function may be defined by means of an informa-
s2aIX~s2X; s2a2X~sIX; sl~sd>.
tion tree. From each vertex of the information tree there
issue m edges, which are in one-to-one correspondence There exist several other ways of specifying an auto-
with the respective letters of A. To each vertex is maton. For instance, a (necessarily finite) transition
assigned a state of the automaton ~s" while to each system <A, S, <P> may be specified as an algebra
edge is assigned a letter of B in the following way. The < S, P> where P = {<Pa: a EA} is the set of unary
state S I is assigned to the root. If the state Si has been operations on S such that <Pa(s)=q,(a, s). Thus, the
assigned to some vertex, the letter b = t/{Si' a) is transition system <Ao, So, 1/10> may be regarded as
assigned to the edge corresponding to the letter a from the algebra <So, {<Pa"<Pa,}>, where <Pa,(SI)=SI,
A, while to the vertex in which this edge terminates the <Pa,(S2)=S2, <Pa,(SI)=S2,_ <Pf,(S2)=SI' ?ne may also
state Sj=q,(Si' a) is assigned. To each word consider the algebra <S, P> where S is the set of
a = al ... ar EA * corresponds a unique sequence words of the form sa, S ES, aEA and 4>~ {~a: a EA}
P = PI ... Pr of edges of this tree such that PI issues ~s the set of unary oper~ti~ns on S such that
from the root and Pi + I issues from the vertex of <Pa(sa)=saa. The algebra <S, P> is defined by the
termination of Pi' The word f3 = f31 ... f3n where f3i is system of S-generators and the set of defining relations
the letter from B assigned ~o the edge Pj, i = 1, ... ,r, is w={sja;=s;a;: i=1,2, ... }. Such an algebra specifies
identical with the value t/{s I, a). If the function j is a (usually partial) automaton <A, S, <P> such that if
realized by a finite automaton, the corresponding infor- s;aj =s;a; is a relation from w, the relation
mation tree may be effectively determined by means of q,(Sj, a;)=q,(s;,a;) is true. For example, the transition
a finite subtree. Figure 4 shows Lhe subtree of the ini- system <110, So, *0> may be specified by a system of
tialized automaton ~~, (the left-hand edges, issuing So-generators and the set of defining relations
from the vertices, correspond to the symbol a I, while w= {SI =Slal, SI =S2a2, S2 =Sla2, S2 =S2al}. It IS
the right-hand edges correspond to the symbol a2)' then assumed that q,(SI' A)=SI, q,(S2' A)=S2.
The behaviour of a finite automaton (acceptor) in The behaviour of an automaton may be described in
terms of a representable event (super-event) may be the language of the logic of one-place predicates. A
effected with the aid of a regular expression (cf. Regu- choice of the class of formulas defining a finite automa-
lar event). Such events may also be specified as sets of ton may be performed in various ways. The descrip-
words generated (derived) in some formal system (a tion may be incomplete, because the choice defines a

286
AUTOMATA, METHODS OF SPECIFICATION OF

class of automata whose the behaviour is identical to I 2


within the accuracy of this description. For instance, 2 2 3 3
MI
I , M2 = 3 2
,
the 'questionnaire' approach involves specifying classes I
2 2 5 5
of automata with the aid of fragments of information
trees, a partial definition of the functions cp and "', etc. 3
4 4 I 0
The above ways of specifying automata may also be MI
I ,
M2=
I 2 3
applied, with suitable modifications, to the behaviour
2 2 5 5
of certain generalized finite automata (non-
deterministic, infinite, etc.; cf. Automaton). Thus, the In order to define a finite automaton over terms
elements of the table T </> of a finite, non-deterministic <A, S, 0, aa> when the alphabets A ={a I, . . . ,am}
automaton may consist of arbitrary subsets of the set S. and S = {S I, . . . ,sn} are known one must first specify
The behaviour of a finite, non-deterministic acceptor is a mapping 0 of the set A into the finite set of non-
described by a regular expression, as in the case of a negative integers such that there exists at least one ele-
deterministic acceptor. Other generalizations of finite ment a EA for which o(a)=O and, secondly, one must
automata are finite probabilistic automata (cf. Automa- define, for each a; EA, i = I, ... ,m, a o(a;)-ary function
ton, probabilistic), automata over terms, mosaic struc- aa, mapping the set [st., =sx ... XS into S. To
tures, etc. each element a EA for which o(a)=O there corresponds
To specify a probabilistic automaton <A, S, D, J-L> an element aa ES, called an initial state of the automa-
on known alphabets A = {a I, . . . ,am}, ton. For example, if A={al,a2}, S={S],S2},
D={b], ... ,bd, S={SI' ... ,sn}, means to specify, o(ad=O, 0(a2)=2, aal =s], aa,(s], SI)=S2,
for any given i and j (1:OS;;;i:OS;;;n, I:OS;;;j:OS;;;m) an arbitrary aa, (s I, S2) =s], aa, (S2' SI )=S2, aa, (S2' S2)=S2, then the
probability measure J-LiSn bq ) on the set of all pairs functions 0, aa l , aa, define some automaton over terms
(s"b q ), r=I, ... ,n, q=I, ... ,k. To do this, one usu-
<A, S, 0, {aa l , aa,} > with initial state S I.
ally considers a system of square matrices with non-
In order to specify a mosaic structure (an infinite
negative elements
union of transition systems of the form <A, S, cp>,
= II p.r.} II,
where A = S', cf. Automaton), one must define for each
Mq,j

i,r = I, ... ,n; j = I, ... ,m; q = I, ... ,k, integral point of the n-dimensional space an ordered set
such that each matrix M J = "'2:/ = I Mq,J is stochastic. of integral points; this set is called its neighbourhood.
The measure J-L is defined by the relation The input alphabet A of the transition system
J-Lij(sn bq )=J-L7.'/. The probabilistic automaton <A, S, cp> placed at a given point is the Cartesian
<A, S, D, J-L> is considered in conjunction with a cer- product of the set of states of the transition systems
tain initial probability distribution on the set S: placed at the points of its neighbourhood. For exam-
n ple, let S ={S ], S 2 }, and let the ordered set
J.I<J = %(s I), ... , J.I<J(sn, J.I<J(Si);;;' 0, ~J.I<J(Si) = 1. {(a-I, P), (a, P+ I), (0.+ 1, P)} be the neighbourhood
Da,p of an integral point (a, P) of the two-dimensional
i=1

Sometimes probabilistic automata are defined by speci- space. In order to specify a uniform two-dimensional
fying only the matrices MJ or only the matrices mosaic structure, the function cp is defined as follows:
MJ = II p)"q II, where cp(s],sl,sd=sl; in all other cases cp(s',s",Sm)=S2'
Here, the input alphabet A is the Cartesian product
r=1 SXSXS.
i=I, ... ,n; q=I, ... ,k;j=I, ... ,m. Any finite
Micro-approach. A micro-approach definition of a
Markov chain can be regarded as a finite probabilistic
structural automaton describes its constituent elements
automaton in which the matrices MJ, j = 1, ... ,m, are
and the scheme of their combination. This description
identical. The system of matrices shown below defines
may be detailed or more general. It is frequently res-
some probabilistic automaton ~ (n = 2, k = 2, m = 2)
tricted to the so-called canonical scheme of the con-
and the matrices MI, M2, MI, M2 of this automaton:
struction of an automaton. The elements then divide
I 0 I 2 into two groups: functional elements (one-state auto-
4 3 3
MI,I = I I ' M I ,2 = I I '
mata) and memory elements. The canonical scheme
4 4 5 5 (Fig. 5) consists of two functional blocks f and g to
which are added Moore automata
~; = <R;, S;, Q;, cp;, "'; >, i = I, ... ,I, as memory ele-
= o 0
4 2
I
, M 2,2
2 I ' ments. The blocks f and g consist of functional ele-
4 4 5 5 ments. When the structure is defined in this way, these

287
AUTOMATA, METHODS OF SPECIFICATION OF

blocks are not further described, but a specification is where 1 = 1,2, ... , is an integer parameter, So ES,
given (e.g. by means of tables) of the vector functions while the functions j;, i = 1, ... ,I, g;, i = 1, ... ,m, and
realized by these blocks: the variables Xn r = 1, ... ,n, take their values from the
set A. A canonical scheme in which all the memory ele-
ments are identical corresponds to this system:
21'; = <A, S, B, </>,1/;, so>, where A =S =B, i = 1, ... ,I;
</>(a, a')=a', 1/;{a, a')=a. The functioning of the auto-
g = (g1(XI> ... ,Xn' Ul> ... , U/), ... , maton 21'; may essentially be described as follows. At
the moment 1 suppose that the letter a (I) is assigned to
, ... ,g/(XI>'" ,Xn, UI>'" ,U/:
the input of 21';, then the same letter is produced as the
AnXQI X ... XQ/ ~ Hm, output of 21'; at the moment 1 + 1. Let Figure 6
where A n and Bm
are the input and the output alpha- represent an automaton 21' whose canonical equations
bets, respectively, of the canonical scheme. have the form
u(l) = 0,
u(t+l) = x(t)+u(t) (mod 2),
yet) = x(t)Vu(t).

In the general case, a description of structural auto-


mata involves defining a selection of elementary auto-
mata and a certain class of 'correctly constructed'
9 9 schemes (networks), the latter usually being defined by

fY ly . . t
J 2 m fy
induction.
References
Fig. 5. Fig. 6. [1] SHANNON,C. and MACCARTHY, J. (EDS.): Automata studies,
Princeton Univ. Press, 1956.
[2] GLUSHKOV, V.M.: SyntheSiS of numerical automata, Moscow,
This scheme specifies the structural automaton
1962 (in Russian).
<An, S, B m, </>, 1/;>, where S=SI X ... XS/, while the V.A. Buevich
functions </> and I/; are defined as follows: S. V. Aleshin
<f>SI, ... ,s/), (a I> ... ,an = (S'I, ... ,s~), Editorial comments. See the Editorial Comment to the
article Automata, equivalence of.
Sj = cf>j(S;' j;(a I> , an, 1/11 (S I), ... ,I/I/(s/)));
I/;(SI> ,s/), (a I> ... ,an = (bl> ... ,bm ), AMS 1980 Subject Classification: 68005

b; = gj(al>'" ,an,I/II(SI), ... ,I/I/(s/. AUTOMATA, MINIMIZATION OF - Minimization


An important example of structural automata are of the parameter values of automata which results in
logical networks (cf. Automaton, finite). F~Ee _6 ~h<?ws equivalent and, in a sense, optimal automata. The
the canonical scheme of an automaton <A, S, B, </>, 1/;> problem of minimization of automata arises during the
isomorphic to _ th~ au!omaton ~ (cf. Fig. 3 for its synthesis of automata, and its specific nature will
diagram), A =S =B = {O, 1}. The automaton depend on the approach adopted in the study of the
21" = <s, q;,
S, S, 1/;' > is a Moore automaton such that problem.
q;
</>'(1, z)=z, (0, z)=z. If the macro-approach is adopted, the usual pro-
Structural automata are frequently described by cedure is to minimize the number of states of automata,
means of canonical equations, i.e. systems of the form thus obtaining minimal or reduced automata. The actual
method of search for reduced automata will depend on
the form in which they are defined and on their type of
behaviour. If, for example, the behaviour of a finite
automaton (cf. Automaton, finite), defined with the aid
of, for example, a transition diagram, IS understood to
mean the system of finitely-determined functions (cf.
Finitely-detennined function) realized by the automa-
ult+ I) = ji(UI(t), ... ,u,(t), x\(t), ... ,xn(t, ton, the minimal finite automaton equivalent to the ini-
tial automaton can be effectively found. This is based
YI(t) = g\(UI(t), ... ,U,(t),XI(t), ... ,Xn(t,
on Moore's theorem, according to which it may be esta-
blished by an experiment of length n - I whether or not
two states of an automaton with n states are distin-

288
AUTOMATA, THEORY OF

guishable from one another. The mmmuzation algo- Given an automaton m, defined, for example, by
rithm consists of constructing an automaton as defined canonical equations, it is required to exhibit a logical
above whose states correspond to the classes of indis- network S, constructed from the elements of the basis
tinguishable states of the initial automaton, whereas the ~ using superposition and feedback operations, realiz-
transition and the output functions are defined by ing mand containing a minimum number L(m) of ele-
means of representatives of these classes. The minimal ments of ~ necessary for the realization of this automa-
automaton is unique up to a state-isomorphism. If the ton. In this sense the network S will be the optimal
final automaton is considered as an acceptor represent- scheme of the automaton m. In the general case, the
ing - by means of the subset of isolated states - an problem of realizing an arbitrary automaton mwith the
event described with the aid of the given regular aid of networks over a basis ~ is unsolvable, and for
expression, the minimal automaton is effectively con- this reason the complexity function L(m) of the auto-
structed, and the algorithm of its construction may be maton mcannot be calculated. If a basis of mis known
divided into two stages. In its first stage, the initial reg- to be complete (cf. Automata, complete systems of), the
ular expression is used to construct some automaton, construction of any optimal network for m can be
not necessarily minimal, representing the corresponding effectively realized e.g. in the following manner. It is
regular event. Such an automaton may be constructed, known that the realizability check of an automaton m
for example, by induction from the operations of union with the aid of a given network S can be effectively
U, concatenation 0 and iteration * which are included established. Moreover, for a given number I the number
in the regular expression. The automata m1, m 2 and m3 of networks over a base ~ of complexity at most I can
which represent, respectively, the events R 1, R2 and R3 be calculated, and all these networks can be effectively
serve to effect the special constructions of the automata constructed. Therefore, a sequential inspection of all
m4, ms and m6 which represent the events R 1U R 2, networks of complexity 1, ... ,L(m) for the realizabil-
R 1 0R 2 and R;; moreover, the number of states of m 4 ity of the automaton m may be used as the algorithm
does not exceed the product of the number of states of for constructing all optimal networks for a given auto-
m1and m2; the number of states of ms does, generally, maton m. The problem of the behaviour of the function
not exceed the product of the number of states of m1 L(m) and of certain of its generalizations is part of the
with the number of all subsets of the set of states of general problem of synthesis of automata. There exist
m2; while the number of states of m6 does not exceed several heuristic algorithms for the synthesis of minimal
the number of subsets of the set of states of m 3 During schemes for automata, based on the special properties
the second stage of the algorithm the number of states of the bases and on specific interpretations of the con-
of the resulting automaton is minimized in the usual cept of optimality. For references see Automaton, fm-
way, the classes of indistinguishable final states of the ite.
V.A. Buevich
initial automaton corresponding to the final states of
the minimal automaton. The minimal automaton AMS 1980 Subject Classification: 68005
representing a given super-event is constructed in the
same manner. There are unique (up to a state- AUTOMATA, THEOR.Y OF - A branch of the theory
isomorphism) minimal automata which represent given of control systems whose subject is the study of
events and super-events. matematical models of transformers of discrete infor-
A reduced automaton, which is unique up to a state- mation, known as automata. In a sense, such
isomorphism, also exists for non-deterministic finite transformers may be both real mechanisms (computers,
automata, and for both deterministic and non- automata, living organisms, etc.) and abstract systems
deterministic infinite automata. In the former case, this (mathematical machines, axiomatic theories, etc.). The
automaton is found using a method similar to that in theory of automata was born in the mid-twentieth cen-
the case of finite automata discussed above, while in tury in connection with finite automata (d. Automaton,
the second case its construction is, as a rule, non- fmite), which are mathematical models of nervous sys-
effective. For a stochastic automaton with a finite tems and electronic computers. The class of objects
number of states there exists, generally speaking, a con- and the scope of problems dealt with by the theory of
tinuum of various reduced automata are equivalent to automata was subsequently enlarged, and came to
the initial automaton. This is why there is no effective include certain concepts in and problems of other
way of describing the set of reduced automata derived branches of mathematics. The theory of automata is
from a given stochastic automaton. most closely connected with the theory of algorithms
In the micro-approach to the study of finite auto- (d. Algorithms, theory of), particularly so with the
mata, a given automaton is constructed starting from theory of abstract machines, since automata can be
some finite selection of a basis ~ of logical networks. regarded as a special case of such machines.

289
AUTOMATA, THEORY OF

Most problems of the theory of automata are similar about the structure of an automaton by observing its
to those of the other main types of control systems. reactions to certain outside effects. This involves a wide
They include the problems of analysis and synthesis of circle of problems concerning the classification of
automata, completeness problems, minimization prob- experiments and problem-solving by certain kinds of
lems, problems on equivalent transformations of auto- experiments, as well as estimates of the shortest experi-
mata, etc. The problems of analysis concern the ment sufficient to solve a given problem. The concept
description of the behaviour of a given automaton or of experiments with automata is also used in control
the determination of certain of its properties from its problems concerning automata (cf. Reliability and
behaviour and from given incomplete data on its inspection of control systems). Problems of the interac-
behaviour (cf. Automaton, behaviour of an; Automata, tion of automata - either with each other or with
experiments with). Problems in the synthesis of auto- given external environments - are dealt with in
mata include the construction of automata with pre- branches dealing with automaton games and the
defined properties or automata functioning in a desired behaviour of automata (cf. Automaton, behaviour of an).
manner (cf. Synthesis problems). Related problems con- Many of these problems may be considered as being
cern estimates of the complexity of automata behaving algorithmic in nature. Most of them have a positive
in a given manner and the construction of automata solution for finite automata.
which are, in some sense, optimal (cf. Automata, minim- The theory of automata has found applications both
ization of). In addition, there arises the problem of in other branches of mathematics and in solving practi-
completeness in connection with classes of initialized cal.problems. Thus, it can be used to prove the solva-
automata or mappings of automata (cf. Functional sys- bility of certain formal calculi. The application of the
tem; Automata, complete systems of). The problem of methods and concepts of the theory of automata to the
equivalent transformations may be posed both for auto- study of formal and natural languages gave rise to a
mata and in various problems concerning their special branch of the theory of algorithms: mathemati-
behaviour (cf. Equivalent transformations). In addition cal linguistics. The concept of an automaton can serve
to problems of the above type, which are common to as model in a great variety of problems, and can thus
many control systems, there are problems specific to be employed in many studies, both in pure and applied
automata. Thus, depending on the particular problem sciences.
at hand, it may be convenient to specify the behaviour References
of automata in various languages (regular expressions, [1) SHANNON, C. and MACCARTHY, J. (EDS.): Automata studies,
canonical equations, language of logical predicates, etc., Princeton Univ. Press, 1956.
[2) GLUSHKOV, Y.M.: SyntheSiS of numerical automata, Moscow,
cf. Automata, methods of specification of), which gives 1962 (in Russian).
rise to problems concerning the choice of an adequate [3) ThAKHTENBROT, B.A. and BARZDIN', YA.M.: Finite automata.
and convenient language, and concerning the Behaviour and synthesis, North-Holland, 1973 (translated from
the Russian).
translation from one language to another. Other prob- VB. Kudryavtsev
lems, closely connected with synthesis and equivalent Yu.l. Yanov
transformation problems, is the problem of minimiza- Editorial comments. This article refers to only some
tion of the number of states of an automaton and that aspects of the theory. One should also consult the articles
of obtaining corresponding estimates. Fairly simple Formal languages; L-systems; Complexity.
algorithms have been developed for finite automata AMS 1980 Subject Classification: 680XX, 93AXX,
making it possible, with the aid of regular expressions, 03005, 68F10
to obtain automata with the minimum possible number
of states and representing appropriate events (cf. Auto- AUTOMATIC CONTROL THEORY - The science
mata, minimization of). A related set of problems con- dealing with methods for the determination of laws for
cerns the modelling of the behaviour of automata in controlling systems that can be realized by automatic
one class by automata in another class. Problems of the devices. Historically, such methods were first applied
minimization of modelling automata and estimation of to proce~~t:~ which ",-ere mainly technical in nature [1].
their complexity are also of interest. For instance, on Thus, for instance, an aircraft in flight is a system, the
passing from a non-deterministic automaton (source) control laws of which ensure that it remains on the
representing (generating) a regular set of words to a required trajectory. The laws are realized by means of a
finite automaton representing the same set, the number system of transducers (measuring devices) and actua-
of states may increase exponentially. A special field of tors, which is known as the automatic pilot. This
the theory of automata are the so-called experiments development was due to three reasons: many control
with automata (cf. Automata, experiments with). Here, systems had been identified by classical science (to
the principal problem is to obtain certain information identify a control system means to write down its

290
AUTOMATIC CONTROL THEORY

mathematical model, e.g. relationships such as (I) and wntmg (1977) only insignificant progress has been
(2) below); long before the development of automatic made towards its solution. If equation (1) is linear
control theory, thanks to the knowledge of the funda-
x = Ax+Bu, (3)
mental laws of nature, there was a well-developed
mathematical apparatus of differential equations and where A, B are stationary matrices, the criterion of
especially an appparatus for the theory of steady complete controllability is formulated as follows: For
motion [2]; engineers had discovered the idea of a feed- (3) to be completely controllable it is necessary and suf-
back law (see below) and found means for its realiza- ficient that the rank of the matrix
tion. Q = II B AB ... A n -I B II (4)
The simplest control systems are described by an be n. The matrix (4) is known as the controllability
ordinary (vector) differential equation matrix.
x = j(x, u, t) (1) If A, B are known differentiable functions of t, the
controllability matrix is given by
and an inequality
(5)
N(x, u, t) ;;;. 0, (2) where
where x {x I, . . . ,xn } is the state vector of the system, dL -

u {U I, . . . ,u,} is the vector of controls which can be


LI(t) = B(t), Lk(t) = A(t)Lk - I -----;;;-,
k 1
k=2, ... ,no

suitably chosen, and t is time. Equation (1) is the The following theorem applies to this case: For (3) to
mathematical representation of the laws governing the be completely controllable it is sufficient if at at least
control system, while the inequality (2) establishes its one point t* E.rthe rank of the matrix (5) equals n [3].
domain of definition. Criteria of controllability for non-linear systems are
Let U be some given class of functions u (t) (e.g. unknown (up to 1977).
piecewise-continuous functions) whose numerical values The first principal task of automatic control theory is
satisfy (2). Any function U(t)E U will be called a per- to select the permissible controls that ensure that the
missible control. Equation (1) will be called a target xf is attained. There are two methods of solving
mathematical model of the control system if: this problem. In the first method, the chief designer of
1) A domain N(x, u, t);a.O in which the function the system (1) arbitrarily determines a certain type of
f (x, u, t) is defined has been specified; motion for which the target xf is attainable and selects
2) A time interval .r= [ti' tf] (or [ti' tf)' if tf= 00) dur- a suitable control. This solution of the programming
ing which the motion x(t) is observed, has been speci- problem is in fact used in many instances. In the
fied; second method a permissible control minimizing a
3) A class of permissible controls has been specified; given cost of controls is sought. The mathematical for-
4) The domain N;;;'O and the function f(x, u, t) are mulation of the problem is then as follows. The data
such that equation (1) has a unique solution defined for are: the mathematical model of the controlled system
any t E.'T, Xo EN, whatever the permissible control u (t). (1) and (2); the boundary conditions for the vector x,
Furthermore, f (x, u, t) in (1) is always assumed to be which will be symbolically written as
smooth with respect to all arguments.
(i, j) = 0; (6)
Let Xi =x (t;) be an initial and let xf=x (tf) be a
(desired) final state of the control system. The state xf a smooth function G(x, t); and the cost of the controls
is known as the target of the control. Automatic control used
t:J.G
If= G(x, t) ;- (7)
theory must solve two major problems: the problem of
The programming problem is to find, among the permis-
programming, i.e. of finding those controls u(t) permit-
sible controls, a control satisfying conditions (6) and
ting the target to be reached from Xi; and the determi-
yielding the minimum value of the functional (7).
nation of the feedback laws (see below). Both prob-
Necessary conditions for a minimum for this non-
lems are solved under the assumption of complete con-
classical variational problem are given by the L.S. Pon-
trollability (1).
tryagin 'maximum principle' [4] (cf. Pontryagin max-
The system (1) is called completely controllable if, for
imum principle). An auxiliary vector tPN]' ... , tPn}
any Xi, XfEN, there is at least one permissible control
and the auxiliary scalar function
u(t) and one interval .rfor which the control target is
attainable. If this condition is not met, one says that H(l/I, x, u, t) = l/I-j(x, u, t) (8)
the object is incompletely controllable. This gives rise to are introduced. The function H makes it possible to
a preliminary problem: Given the mathematical model write equation (1) and an equation for the vector tP in
(1), find the criteria of controllability. At the time of the following form:

291
AUTOMATIC CONTROL THEORY

. aH' aH system (1) is completely controllable does not mean


X = ai' I/; = aX (9)
that the system (11) is completely controllable as well.
Equation (9) is linear and homogeneous with respect to One says that (11) is observable along the coordinates
I/; and has a unique continuous solution, which is flo ... ,f" r~n, if one has at his disposal a set of
defined for any initial condition 1/;{ti ) and t EY. The vec- measuring instruments that continuously determines the
tor I/; will be called a non-zero vector if at least one of coordinates at any moment of time t EY. The signifi-
its components does not vanish for t EY. The following cance of this definition can be illustrated by consider-
theorem is true: For the curve xo, Uo to constitute a ing the longitudinal motion of an aircraft. Even though
strong minimum of the functional (7) it is necessary aviation is more than 50 years old, an instrument that
that a non-zero continuous vector 1/;, as defined by would measure the disturbance of the attack angle of
equation (9), exists at which the function H(I/;, x, u, t) the aircraft wing or the altitude of its flight near the
has a (pointwise) maximum with respect to u, and that ground has not yet been invented. The totality of meas-
the transversality condition ured coordinates is called the field of regulation and is
denoted by P(y], ... ,Yr), r~n.
[8G-H8t+~l/;a8xay = 0
Consider the totality of permissible controls ~, deter-
is met. Let XO(t, Xi' Xj), Uo(t, Xi, Xj) be solutions of the mined over the field P:
corresponding problem. It has then been shown that
for stationary systems the function H (1/;0, xo, UO) satis- ~ = ~], ... ,f" t,p), r~n, (12)

fies the condition where p is a vector or a matrix parameter. One says


H(I/;,xO,UO) = C, (10)
that the control (12) represents a feedback law if the
closure operation (i.e. substitution of (12) into (11
where C is a constant, so that (10) is a first integral. yields the system
The solution uO, XO is known as a program control.
Let uO, XO be a (not necessarily optimal) program y= Ay+B~(y], ... ,y"t,p)+ (l3)
control. It was found that the knowledge of only one +cJ>(y'~(yI"" ,y,,!,p),!),
program control is insufficient to attain the target. This such that its undisturbed motion y =0 is asymptotically
is because the program uO, XO is usually unstable with stable (cf. Asymptotically-stable solution). The system
respect to, however small, changes in the problem, in (13) is said to be asymptotically stable if its undisturbed
particular to the most important changes, those in the motion y = 0 is asymptotically stable.
initial and final values (i, f) or, in other words, the There are two classes of problems which may be for-
problem is ill-posed. However, this ill-posedness is such mulated in the context of the closed system (13): The
that it can be corrected by means of automatic stabili- class of analytic and of synthetic problems.
zation, based solely on the 'feedback principle'. Hence Consider a permissible control (12) given up to the
the second main task of control: the determination of selection of the parameter p, e.g.:
feedback laws.
Let y be the vector of disturbed motion of the system PII Plr

and let ~ be the vector describing the additional deflec- ~=


tion of the control device intended to quench the dis- prl prr
Yr
turbed motion. To realize the deflection ~ a suitable
control source must be provided for in advance. The The analytic problem: To determine the domain S of
disturbed motion is described by the equation: values of the parameter p for which the closed system
(13) is asymptotically stable. This domain is con-
y = Ay+B~+cJ>(y,~,t)+f'(t). (II)
structed by methods developed in the theory of stability
where A and B are known matrices determined by the of motion (cf. Stability theory), which is extensively
motion of xv, uo, and are assumed to be known func- employed in the theory of automatic control. In partic-
tions of the time: cf> i." the non-linear part of the ular, one may mention the methods of frequency
development of the function f(x, u, t); f'(t) is the con- analysis; methods based on the first approximation to
stantly acting force of perturbation, which originates Lyapunov stability theory (the theorems of Hurwitz,
either from an inaccurate determination of the pro- Routh, etc.), on the direct Lyapunov method of con-
grammed motion or from additional forces which were structing v-functions, on the Lyapunov- Poincare
neglected in constructing the model (l)._Equation ill) theory of constructing periodic solutions, on the
is defined in a neighbourhood II f II ~H,. where H is method of harmonic balance, on the B.V. Bulgakov
usually quite smalL but in certain cases may be any fin- method, on the A.A. Andronov method, and on the
ite positive number or even 00. theory of point transformations of surfaces [5). The last
It should be noted that, in general, the fact that the group of methods makes it possible not only to con-

292
AUTOMATIC CONTROL THEORY

struct domains S in the space P, but also to analyze the many other formulations in all cases in which r(t) is a
parameters of the stable periodic solutions of equation random function - in servomechanisms for example,
(13) which describe the auto-oscillatory motion of the [5], [8]. Other formulations are concerned with the pos-
system (13). All these methods are widely employed in sibility of a random alteration of the matrices A and B
the practice of automatic control, and are studied in or even of the function cp [5], [8]. This gave rise to the
the framework of various specifications in schools of development of methods for studying random
higher learning [5]. processes, methods of adaptation and learning
If S is non-empty, a control (12) is called a feedback machines [9]. Transition processes in systems with delay
law or regulation law. Its realization, which is effected mechanisms and with distributed parameters (see [10],
using a system of measuring instruments, amplifiers, [11]) and with a variable structure (see [12]) have also
converters and executing mechanisms, is known as a been studied.
regulator. The synthesis problem: Given equation (11), a field of
Another problem of considerable practical regulation P(y], ... ,y,), r';;;;n, and a set
importance, which is closely connected with the ana- ~], ... ,y" t) of permissible controls, to find the
lytic problem, is how to construct the boundary of the whole set M of feedback laws [13]. One of the most
domain of attraction [6], [7]. Consider the system (13) in important variant of this problem is the problem of the
which pES. The set of values y(ti)=yo containing the structure of minimal fields. A field P(y], ... ,r,), r ';;;;n,
point y =0 for which the closed system (13) retains the is called a minimal field if it contains at least one feed-
property of asymptotic stability, is known as the back law and if the dimension r of the field is minimal.
domain of attraction of the trivial solution y =0. The The problem is to determine the structure
problem is to determine the boundary of the domain of P(Ya" ... ,YaJ of all minimal fields for a given equa-
attraction for a given closed system (13) and a point tion (11) and a set of permissible controls. The
pES. following example illustrates the nature of the problem:
Modem scientific literature does not contain effective
methods of constructing the boundary of the domain of z = Az+Bu,
attraction, except in rare cases in which it is possible to o mOO 00
construct unstable periodic solutions of the closed sys- -m 0 0 n 1 0
tem. However, there are certain methods which allow A o OOk,B= o0
one to construct the boundary of a set of values of yo o -n k 0 o1
totally contained in the domain Qf attraction. These
methods are based in most cases on the evaluation of a where m, n, k are given numbers. The permissible con-
domain in phase space in which the Lyapunov function trols are the set of piecewise-continuous functions
U2, U4 that take their values from the domain
satisfies the condition v ~O, V';;;;O [7].
Any solution y(t,yo,p) of the closed system (13) Iu21 .;;; U2, Iu41 .;;; U4-
represents a so-called transition process. In most cases The minimal fields in this problem are either the field
of practical importance the mere solution of the stabil- P(Z2) or the field P(Z4). The dimension of each field is
ity problem is not enough. The development of a pro- one and cannot be reduced [13].
ject involves supplementary conditions of considerable So far (1977) only one method is known for the syn-
practical importance, which require the transition pro- thesis of feedback laws; it is based on Lyapunov func-
cess to have certain additional features. The nature of tions [13]. A relevant theorem is that of
these requirements and the list of these features are Barbashin - Krasovskil [6], [10], [15]: In order for the
closely connected with the physical nature of the con- undisturbed motion y = 0 of the closed system
trolled object. In analytic problems it may often be
possible, by a suitable choice of the parameter p, to j = cp(y) (14)
preserve the desired properties of the transition process, to be asymptotically stable, it is sufficient for a
e.g. the pre-set regulation time t. The problem of positive-definite function v (y) to exist, such that by
choosing the parameter p is known as the problem of equation (14) its complete derivative is a function w(y)
quality of regulation [5], and methods for solving this which is semi-definite negative, and such that on the
problem are connected with some construction of esti- manifold w (y) = 0 no complete trajectory of the system
mates for solutions y(t,yo,p): either by actual integra- (14), except for y =0, lies. The problem of finding out
tion of equation (13) or by experimental evaluation of about the existence and the structure of the minimal
such solutions with the aid of an analogue or digital fields is of major practical importance, since these
computer. fields determine the possible requirements of the chief
The analytic problems of transition processes have designer concerning the minimum weight, complexity

293
AUTOMATIC CONTROL THEORY

and cost price of the control system and its maximum The synthesis problem has been solved in greatest
reliability. The problem is also of scientific and practi- detail for the case in which the field P has maximal
cal interest in connection with infinite-dimensional sys- dimension, r = n, while the cost index of the system is
tems as encountered in technology, biology, medicine, characterized by the functional
economics and sociology.
In designing control systems it is unfortunately J = fw(y,~,t)dt, (16)
impracticable to restrict the work to solving problems
where w(y,~, t) is a positive-definite function of y, f
of synthesis of feedback laws. In most cases the
requirements of the chief designer are aimed at securing The problem is then known as the problem of analytic
certain important specific properties of the transition construction of optimum regulators [14] and is in fact
process in the closed system. The importance of such thus formulated. The data include equation (11), a class
requirements is demonstrated by the importance of of permissible controls g(y, t) defined over the field
monitoring an atomic reactor. If the transition process P (y) of maximal dimension, and the functional (16).
takes more than 5 seconds or if the maximum value of One is required to find a control g=g(y, t) for which
some of its coordinates exceeds a certain value, an the functional (16) assumes its minimum value. This
atomic explosion follows. This gives rise to new prob- problem is solved by the following theorem: If equation
lems of synthesis of regulation laws, based on the set (11) is such that it is possible to find an upper semi-
M. Below the formulation of one such problem is given. continuous positive-definite function v(y, t) and a
Consider two spheres Ilyo I =R, YO=Yi, Ily(td II =f; function go(y, t) such that the equality
Rf are given numerical values. Now consider the avO avO
--at+ ay(Ay+B~+</>{y,~,t=O (17)
set M of all feedback laws. The closure by means of
any of them yields the equation: is true, and the inequality
j = Ay+BYh'" ,y" t)+ avO avO
(15) --at + ay (Ay + B~+</>{y,~, t ;;;. O.
</>{y, Y l, ... ,y" t), t). 00

Consider the entire set of solutions y(t, Yo) of equation V(to,yo) = min fw(y,~, t)dt,
(15) which start on the sphere R and call them R- ~
solutions. Since the system is asymptotically stable for is true. The function v(y, t) is known as the Lyapunov
any Yo on the sphere, there exists a moment of time t1 optimum function [15]. It is a solution of the partial dif-
during which the conditions ferential equation (17), of Hamilton - Jacobi type, satis-
fying the condition v (y( (0), (0) = O. Methods for the
Ily(tl,yo) I = (, Ily(t,yo) II < (, effective solution of such a problem have been
are valid for any t>t1. developed for the case in which the functions wand cp
Let can be expanded in a convergent power series in y, g
t* = supt l .
Yo with coefficients which are bounded continuous func-
It is clear from the definition of t 1 that t * exists. The tions of t. Of fundamental importance is the solvability
interval t * - ti is called the regulation time (the time of of the problem of linear approximation to equation (11)
damping of the transition process) in the closed system and the optimization with respect to the integral of
(15) if any arbitrary R-solution starts at the f-sphere if only second-order terms contained in the development
t1 "';;;t*, but remains inside it if t>t1. It is clear that the of w. This problem is solvable if the condition of com-
regulation time is a functional of the form plete controllability is satisfied [15].
t * =t * (R, f, ~). Let T be a given number. There arises References
the problem of synthesis of fast-acting regulators: Given [I] MAKSVELL, D.K., VISHNEGRADSKli. I.A. and STODOLA. A.: The
theory of automatic regulation, Moscow, 1942 (in Russian).
a set M of feedback laws, one has to isolate its subset
[2] CHETAEV, N.G.: Stabili~v of motion, Moscow, 1965 (in Rus-
M 1 on which the regulation time in a closed system sian).
satisfies the condition [3] KRASOVSKII. N.N.: Theory of control of motion. Moscow. 1968
(ill Russian).
t* -t, ~ T
[4] PONTRYAGIN. L.S., ET At.: The mathematical theorv of optimal
One can formulate in a similar manner the problems of processes. Moscow. 1969 (in Russian).
synthesis of the sets M 2, . . . ,Mk of feedback laws, [5] Technical cybernetics. Moscow, 1967.
[6] BARBASHIN, E.A.: inrroduction to the theon' of stabilitr.
which satisfy the other k -1 requirements of the chief Wolters-Noordhoff, 1970 (translated from the Russian).
designer. [7] ZUBOV. V.1.: Mathematical methods of stu ding svstem.1 of
The principal synthesis problem of satisfying all the automatic control. Leningrad. 1959 (in Russian).
[R] PuGACHEV, V.S.: The theory' random functions and its applica-
requirements of the chief designer is solvable if the sets
tions to problems of automatic control, Moscow, 1962 (in Rus-
M 1 , ,Mk have a non-empty intersection [13]. sian).

294
AUTOMATIC CONTROL THEORY

[9] TSYPKIN, Y A.Z.: Foundations of the theory of serving systems, tem is said to be completely control/able (to x(f if for
Moscow, 1970 (in Russian).
every possible initial state x there is a control which steers x
[10] KRAsovSKli, N.N.: Stability of motion. Applications of
Lyapunov's second method to difJerential systems and equations to x(f). In the case of continuous-time, time-invariant,
with delay, Stanford Univ. Press, 1963 (translated from the finite-dimensional systems the two notions coincide, but this
Russian). is not always the case.
[II] BESEKERSKIi, V.A. and PoPov, E.P.: The theory of automatic There are also many result on reachability and controlla-
control systems, Moscow, 1966 (in Russian).
bility for non-linear systems, especially for local controllabil-
[12] EMELYANOV, S.V. (ED.): Theory of systems with a variable struc-
ture, Moscow, 1970 (in Russian). ity and reachability. These results often are stated in terms
[13] LETOV, A.M.: 'Some unsolved problems in control theory', Dif- of Lie algebras associated to the control system. E.g., for a
ferential equations N. Y 6, no. 4 (1970), 455-472. control system of the form
(DifJerentsial'nye Uravneniya 6, no. 4 (1970),592-615) . m
[14] LETOV, A.M.: Dynamics offlight and control, Moscow, 1969 (in X = f(x) + ~u;g;(x), xER n , uER m, (A3)
Russian). ;=1
[15] MALKIN, 1.0.: Theorie der Stabilitiit einer Bewegung, R. Olden- one considers the Lie algebra .P(~) spanned by the vector
burg, Miinchen, 1959 (translated from the Russian).
fields (ad f)k(g;) , i=1, ... ,m; k=0,1, ... , where
A.M. Letov ad f(g)=[f, g], (ad f)k=(ad f)o(ad f)k-1. Given a Lie algebra
of vector fields .P on a manifold M the rank of .P at a point
Editorial comments. The article above reflects a different
xEM, rkx.P' is the dimension of the space of tangent vec-
tradition and terminology than customary in the non-Russian
tors {X(x): XE.P} C TxM. A local reachability result now
literature. It also almost totally ignores the vast amount of
says that rkx .P(~) = n implies local reachability at x. There
important results concerning automatic (optimal) control
are also various global and necessary-condition-type results
theory that have appeared in the non-Russian literature.
especially in the case of analytic systems. Cf. [A2], [A 10],
Rougly speaking a control system is a device whose
[A14] for a first impression of the available results.
future development (either in a deterministic or stochastic
Given a control system (A 1) an important general ques-
sense) is determined entirely by its present state and the
tion concerns to what extent it can be changed by feed-
present and future values of the control parameters. The
back, in this case state space feedback. This means the fol-
'recipe' which determines future behaviour can be almost
lowing. A state space feedback law is a suitable mapping
anything, e.g. a differential equation
k: Rn~Rm, xf->u=k(x). Inserting this in (A1) results in the
x = t(x, u, t), xER n , uER m, tER, (A1) closed loop system x= f(x, k(x), t). Cf. Fig. 1.
as in the main article above, or a difference equation
Xt +1 = teXt, Ut, t), XtER n , utER m , t=O, 1, ... , (A2) V 1"""'1.--"' U
L
or more generally, a differential equation with delays, an
evolution equation in some function space, a more general
partial differential equation, ... , or any combination of
these. Frequently there are constraints on the values the If
control uER m may take, e.g., I U II ~M where M is a con- Fig. 1.
stant. However there certainly are (engineering) situations
when the controls u(t) which can be used at time t depend Often 'new' controls vER m are also introduced and one
both explicitly on time and the current state xU) of the sys- considers, e.g. the new control system x = f(x, k(x) + v, t)
tem (which of course in turn depends on the past controls or, more generally, x= f(x, k(x, v), t). Dynamic feedback
used). This may make it complicated to describe the space laws in which the function k(x, v) is replaced by a complete
of admissible or permissible controls in simple terms. input-output system (cf. below)
It may even be not very well possible to describe the con-
trol structure in such terms as equations (A 1) and con- y = g(y, x, v, t), U = hey), (A4)
straints N(x, u, t);;:.0 defined on Rn X RmX R. This happens, are also frequently considered. Important questions now
e.g., in the case of the control of an artificial satellite where are, e.g., whether for a given system (A1) a k(x) or k(x, v)
the space Rm in which the controls take their value may or system (A4) of various kinds can be found such that the
depend on the state x, being, e.g., the tangent space to a resulting closed loop system is stable. Or, whether by
sphere at the point x. Then controls become sections in a means of feedback a system can be linearized or imbedded
vector bundle usually subject to further size constraints. in a linear system (4). Cf. [A6] for a selection of results.
At this point some of the natural and traditional problems Using (large) controls in engineering situations can be
concern reachability, and control/ability and state space expensive. This leads to the idea of a control system with
feedback. Give an initial state x(O) the control system is said cost functional J which is often given in terms such as

l
to be completely reachable (from x(O if for all states x t,
there is an admissible control steering x(O) to x. This is J(u) = g(x, u, t) dt+ F(XU1 .
termed complete controllability in the article above. Control-
lability in the Western literature is usually reserved for the There result optimal control questions such as finding that
opposite notion: Given a (desired) final state x(f) the sys- admissible function U which minimizes J(u) (and steers x to

295
AUTOMATIC CONTROL THEORY

the target set) (optimal open loop control) and finding a pletely observable if for any two initial states xc, X~ ERn and
minimizing feedback control law u=k(x) (optimal closed (known) control u there holds that
loop control). In practice the case of a linear system (4) with y(x(t, u;xo), u, t)=y(x(t, u;x~), u, t) for all t';;il=ta implies
a quadratic criterion is very important, the so called LO Xo =x~. This is more or less what is meant by the phrase
problem. Then 'observable along the coordinates y" ... ,Yn' in the main
article above. In the case of a time-invariant linear system
G(x, u, t) = u TRu+2u TSx+X TOx, F(x) = xTMx. (A5)
Here the upper T denotes transpose and R, S, 0, M are suit-
x= Ax+Bu, y = Cx, (A9)

able matrices (which may depend on time as may A and 8). complete observability holds only if and only the (block)
In this case under suitable positive-definitness assumptions observabi/ity matrix" CTCTA T ... CT(AT)n-,,, is of full
on R, M and the block matrix rank n. This is completely dual to the reachability (controla-

II~ ~II'
bility) result for linear systems in the article above.
In this setting of input-ouput systems many of the prob-
lems indicated above acquire an output analogue, e.g. out-
the optimal solution exists. It is of feedback type and is put feedback stabilization, where (in the Simplest case) a
obtained as follows. Consider the matrix Riccati equation x
function u = key) is sought such that = f(x, k(h(x, u, t), t
k = -AK-KA+(S+BTKlR-'(S+BTK)-O, (A6) is (asymptotically) stable, the dynamic output feedback
K(t,) = M. problem and the optimal output feedback control problem.
In addition new natural questions arise such as whether it is
Solve it backwards up to time to. Then the optimal control is possible by means of some kind of feedback to make cer-
given by tain outputs independent of certain inputs (decoupling prob-
(A7) lems). In case that there are additional stochastic distur-
bances possibly both in the evolution of x and in the meas-
The solution (method) extends to the case where the linear
urements y problems of filtering are added to all this. E.g.
system is in addition subject to Gaussian stochastic distur-
bances, cf. [A9] , [A 17]. the problem of finding the best estimate x(t) of the state of
Also for non-linear systems there are synthesis results for the system given the observations yes), ta o;;;;so;;;; t, [A9].
optimal (feedback) control. One can, for instance, use the There is also the so-called realization problem. Given an
Pontyagin maximum principle to determine the optimal initial state xa a system such as (AB) defines a mapping
(open loop) control for each initial state xERn (if it exists from a space of input functions u to a space of output func-
tions y, and the question arises which mappings can be
and is unique). This yields a mapping x .... u which is a candi-
realized by means of systems such as (AB). Cf. [A6] for a
date for an optimal feedback control law and under suitable
survey of results in this direction in the non-linear deter-
regularity assumptions this is indeed the case, cf. [A3] ,
ministic case and [A9] for results in the linear stochastic
[A 15]. Some standard treatises on optimal control in various
case.
settings are [A5] , [A 12], [A 13].
With the notable exception of the output feedback prob-
In many situations it cannot be assumed that the state x
lem one can say at the present time that the theory of linear
of a control system (A1) is directly accesible for control pur-
time-invariant finite-dimensional systems, possibly with
poses, e.g. for the implementation of a feedback law. More
generally only certain derived quantities are immediately Gaussian noise and quadratic cost criteria, is in a higly-
observable. (Think for example of the various measuring satisfactory state. Cf., e.g., the standard treatise [A 11]. Gen-
devices in an aircraft as compared to the complete state eralization of this substantial body of results to a more gen-
description of the aircraft.) This leads to the idea of an eral setting seems to require sophisticated mathematics, e.g.
imput-output (dynamicaO system, or briefly (dynamical) sys- algebraic geometry and algebraic K-theory in the case of
tem, also called plant (cf. Fig. 2), families of linear systems [AB] , [A 16], functional analysis
and contraction semi-groups for infinite-dimensional linear
x= f(x, u, t), y = hex, u, t), (AB) systems [A4], functional analysis, interpolation theory and
xERn, UER m, YERP. Fourier analysis for filtering and prediction [A9] , and folia-
tions, vector bundles, Lie algebras of vector fields and other
notions from differential topology and geometry for non-
lineai systems theory [A2], [A9].
u >
A great deal of research at the moment is concerned with
systems with unknown (or uncertain) parameters. Here
adaptive control is important. This means that one attempts
Fig. 2. to design e.g. output feedback control laws which automati-
cally adjust themselves to the unknown parameters.
Here the U E Rm are viewed as controls or inputs and the A good idea of the current state of the art of system and
YERP as observations or outputs. Let x(t, u; xo) denote the control theory can be obtained by studying the proceedings
solution of the first equation of (AB) for the initial condition of the yearly IEEE CDC (Institute of Electronic and Electical
x(to)=xa and a given u(t). The system (AB) is called com- Engineers Conference on Decision and Control) confer-

296
AUTOMATIC TRANSLATION

ences and the biyearly MTNS (Mathematical Theory of Net- At present (the nineteen-seventies) the concept of
works and Systems) conferences. automatic programming is being extended to cover a
References comprehensive procedure for the synthesis of an algo-
[A 1] BARNETI, S.: Introduction to mathematical control theory, rithm in the framework of a much wider language for
Oxford Univ. Press, 1975.
[A2] BROCKETI, R.W.: 'Nonlinear systems and differential formulating problems, not limited to a specific class.
geometry', Proc. IEEE 64 (1976), 61-72. Such a synthesis entails a correctness proof for the
[A3] BRUNOVSKY, P.: 'On the structure of optimal feedback sys- algorithm and employs formal approaches such as vari-
tems', in Proc. Internat. Congress Mathematicians Helsinki,
ous models for the semantics of natural languages,
1978, Vol. 2, Acad. Sci. Fennica, 1980, pp. 841-896.
[A4] CURTAIN, R.F. and PRITCHARD, AJ.: Infinite-dimensional predicate calculus and theorem-proving techniques.
linear system theory, Springer, 1978. A.P. Ershov
[A5] FLEMING, W.H. and RISHEL, R.W.: Deterministic and sto- Editorial comments.
chastic optimal control, Springer, 1975.
[A6] FLIEss, M. and HAzEWINKEL, M. (EDS.): Algebraic and References
geometric methods in nonlinear control theory, Reidel, 1986. [A1] BALZER, R.: 'A 15-year perspective on automatic program-
[A7] HAZEWINKEL, M.: 'On mathematical control engineering', ming', IEEE Trans. Software Engineering 11, no. 11 (1985),
Gazette des Math. 28, July (1985), 133-151. 1257-1267. Special issue on artificial intelligence and
[A8] HAZEWINKEL, M.: '(Fine) moduli spaces for linear systems: software engineering, Guest Ed.: J. Mostow.
What are they and what are they good for', in C.1. Byrnes
AMS 1980 Subject Classification: 68RXX
and C.F. Martin (eds.): Geometric methods for linear system
theory, Reidel, 1980, pp. 125-193.
AUTOMATIC TRANSLATION, machine translation -
[A9] HAZEWINKEL, M. and WILLEMS, J.e. (EDS.): Stochastic
systems: The mathematics of filtering and identification, Translation of texts from one natural language to
Reidel, 1981. another using automatic devices.
[A 10] JURDJEVIC, V. and KUPKA, I.: 'Control systems on semi-
Automatic translation is a problem in the area of
simple Lie groups and their homogeneous spaces', Ann. Inst.
Fourier 31 (1981), 151-179. modelling and automation of human mental activities
[A11] KWAKERNAAK, H. and SIVAN, R.: Linear optimal control sys- - in the present case, of language. It is performed by
tems, Wiley, 1972. an algorithm, the rules of which do not interact with
[A12] LEE, E.B. and MARKUS, L.: Foundations of optimal control
theory, Wiley, 1967.
the knowledge and intuition of man, i.e. are strictly for-
[A 13] LIONS, lL.: Optimal control of systems governed by partial mal. An algorithm for automatic translation can be
differential equations, Springer, 1971. implemented for any suitable automatic device, e.g. by
[A14] LOHRY, e.: 'Controlabilite des systemes non-lineaires', in
a digital computer.
Outils et modeles mathematiques pour I'automatique,
I'analyse des systemes et Ie traitement du signal, Vol. 1, Automatic translation is closely connected with the
CNRS, 1981, pp. 187-214. development of modern structural and mathematical
[A15] SUSSMANN, HJ.: 'Analytic stratifications and control theory', linguistics, including difficult and fundamental linguis-
in Proc. Internal. Congress Mathematicians Helsinki, 1978,
Vol. 2, Acad. Sci. Fennica, 1980, pp. 865-871. tic problems, many of which were neglected in the past
[A16] TANNENBAUM, A.: In variance and system theory: algebraic or were not even explicitly stated.
and geometric aspects, Springer, 1981. The theoretical basis of the development of
[A 17] WILLEMS, J.e.: 'Recursive filtering', Statistica Neerlandica 32
(1978), 1-39.
automatic translation is the theory of formal grammars
[A18] Handbuch der Systemtheorie, Akademie Verlag, 1986. (cf. Grammar, fonnal). An automatic translation algo-
AMS 1980 Subject Classification: 93-XX, 49EXX, rithm realizes a certain correspondence between two
49BXX languages which are defined by their grammars - a
so-called translation correspondence. It is formulated in
AUTOMATIC PROGRAMMING - The use of com- terms of structural descriptions of the properties of
puters to produce computer programs (cf. Program) these languages within the framework of their gram-
automatically from some initial description which is mars. The most frequently used structural descriptions
closer to the original formulation of the problem. The are constituent structures and dependency trees (cf.
meaning of 'automatic programming' has shifted in Syntactic structure). An automatic translation algorithm
time, reflecting the general evolution of human- accordingly consists of three main parts: 1) a parsing of
computer communication and of programming the text in the source language, i.e. an analysis of the
methods. Originally, automatic programming consisted structure of the input text, based on the given grammar
in leaving it to the cmputer to produce a translation of the input language; 2) a transfer, i.e. a transforma-
into machine code of the description of an algorithm tion from the structure of the text in the source
for solving some problem, expressed in some algo- language to the structure of the text in the target
rithmic language. Subsequently, automatic programming language, based on the given translation correspon-
was understood to include a method for the directed dence; and 3) a synthesis of the text in the target
synthesis of an algorithm from a compact problem language, i.e. a transition from the structure of the out-
description, restricted to some fixed class of problems. put text to the specific word sequence.

297
AUTOMATIC TRANSLATION

During the early period of studying automatic trans- serve in the foreseeable future for translating scientific
lation the transition rules were usually formulated and technical publications only; automatic translation
directly, in the form of algorithmic rules, without a of literature and fiction is both unrealistic and unneces-
preliminary construction of the formal grammars of the sary.
source and target languages, and without an explicit References
formulation of the translation correspondence. As a [1] PANOV, D.Yu.: Automatic translation, Moscow, 1958 (in Rus-
result, purely linguistic information became confused sian). S. vLa. rltW
D" IOV
with the mathematical formulation of the algorithm. In
Editorial comments. Machine translation started in the
a later period, automatic translation algorithms were
USA around 1955.
constructed on the base of general schemes applicable
From a linguistic point of view the programs were naive
to a large number of languages, with the maximum (in principle they gave word-by-word translations), and they
possible particularization of the translation stages; the were designed in an unstructured way (e.g. linguistic infor-
synthetic and the analytical stages were made mutually mation was not separated from the translation algorithm).
independent, with an itemization of all possible The programs were not satisfactory: they were slower, less
varieties of interpretation of the text, permissible gram- accurate and more costly than translations provided by
mars and translation correspondences (so-called multi- human translators. Therefore it was decided in the USA in
variant translation). Information about the specific 1966 to stop all support by the government for machine
language was in most cases rigidly separated from the translation projects. In the years 1966 - 1975 hardly any
algorithmic part. In this way it became possible to fundamental research concerning machine translation was
disregard a large number of details specific for each done. The programs from the first period were developed
further in commercial surroundings, and it are mainly these
language, and to concentrate on the development of
programs that are used practically nowadays. Their use is
general procedures for obtaining solutions valid under
not in the quality of the translations, but in the saving of
the specified conditions. time they give to professional translators when used as auxi-
As regards the purely linguistic aspects of automatic liary tool (e.g. as text editor). There is up till now only one
translation, all morphological and almost all syntactic fully automatic translation system: the Canadian system
problems in a given sentence may be considered as TAUM that translates wheather forecasts from English into
solved. The principal difficulties in the creation of com- French. Since 1975 there is a revival of research in
pletely automatic systems of high-quality translation machine translation, espeCially in western Europe and
are due to the relatively backward state-of-the-art of Japan. Important factors in this research are developments
the semantic theory of languages, which could be used in theoretical linguistics, hardware developments which
for an exact formulation of the rules of processing the change the factors of costs and speed, and the increased
sense and the various meanings, and also for under- need for specialized translations (especially for technical
manuals).
standing the logical connection between the separate
sentences of a coherent text. Unless due allowance is References
[A1] HUTCHINS. W.J.: Machine translation: past, present, future,
made for the semantics of the text, the translation often
Ellis Norwood limited, Chichister, England, 1986.
proves ambiguous or faulty. A faulty translation is usu- [A2] SLOCUM, J.: 'A survey of machine translation: its history,
ally due to the fact that the translation rules do not current status, and future prospects', Computational Linguis-
tics 11 (1985), 1-17.
include all possible meanings of the text to be
translated or do not invariably ensure a proper selec- AMS 1980 Subject Classification: 68F30, 68F05
tion of the meaning, which may depend on a fairly
wide context or even on the knowledge of the subject AUTOMATON - A control system, which is usually
matter of the text being translated, on general understood to mean a finite automaton (cf. Automaton,
knowledge, etc. finite) or one of its modifications obtained by changing
Operational industrial systems of high-quality components or the mode of operation. The principal
automatic translations are as yet (nineteen-seventies) concept - a finite automaton - originated in the
non-existent. There exist experimental systems of vary- mid-twentieth century in connection wiin aiiempls iu
ing degrees of complexity. Simplified and specialized describe, in mathematical language, the functioning of
systems of automatic translation are successfully used nervous systems, of universal computers and of other
in practical work; these are based on automatic pro- real automata. The first such efforts are due to W.
cessing of scientific and technical information (word- McCulloch and W. Pitts (1943), S.c. Kleene (1951),
for-word translation with partial grammatical process- A.W. Burks and 1. Wright (1954), and others. A
ing, partial translation and automatic referencing for characteristic feature of such a description is the
on-the-spot information, patent documentation, and for discrete character of the corresponding mathematical
information search systems). Automatic translation can models and the finiteness of the domain of their param-

29S
AUTOMATON

eter values, which accounts for the name 'finite' auto- with finite automata are naturally extended to infinite
maton. The external effects, reactions and states are automata. Alphabets of higher cardinality extend the
considered to be letters of three alphabets named, computational capacities of automata. For instance,
respectively, the input alphabet, the output alphabet while finite automata realize finitely-determined
and the alphabet of states. The relations governing (sequential) functions (cf. Finitely-detennined function),
their interactions may then be given by two functions infinite automata may be used to realize any deter-
- a transition function and an output function - mined function. Moreover, using infinite automata it is
which map the pair 'state-input letter' into 'state letter' possible to describe the functioning of any automaton
and the pair 'state-output letter' into 'output letter', and machine. At the same time, this very general
respectively. At each discrete moment of time the sys- nature of infinite automata impairs their significance,
tem, being in a given state, receives an input signal (a so that most studies concern only special subclasses of
letter of the input alphabet), emits an output signal (a infinite automata, connected with specific models of
letter of the output alphabet, determined by the output control systems.
function) and passes into a new state determined by the 2. Non-deterministic and asynchronous automata
transition function. Studies of finite automata are sup- (second group). A non-deterministic automaton is for-
plemented by studies of their various generalizations mally defined as a system (A, S, B, X), where A, S and
and modifications which reflect various features of real B are alphabets in the sense explained above, while
systems. In the case of a finite automaton XeS XA X S X B is a transition-to-output relation. If
(A, S, B, cp, t/J) the existing modifications may be subdi- the relation X is a function mapping S XA into S X B,
vided into the following three main groups. The first the non-deterministic automaton is called a determinis-
group includes automata some alphabets A, S or B of tic automaton and is in fact identical with a finite auto-
which are infinite, and such automata are said to be maton, since in such a case X may be considered as the
infinite. The second group includes automata in which pair of functions cp, t/J which map S XA into S and B,
the functions t/J and cp are replaced by arbitrary rela- respectively. As distinct from a finite automaton, an
tionships or by random functions. These are partial, initialized non-deterministic automaton ~S, has several
non-deterministic, probabilistic and other automata. initial states, which form a subset S I of the set S. The
The third group includes automata with specific sets of behaviour of ~S, is usually understood to mean one of
input objects. These are automata with a variable struc-
the following generalizations of the behaviour of a fin-
ture, automata over terms (or tree automata, cf. Auto-
ite automaton.
mata, algebraic theory of). There are automata belong-
a) Instead of a function f, an automaton ~S, realizes
ing to several groups at the same time, e.g. fuzzy auto-
mata belong to all three groups. Special subclasses of a relation / , consisting of all pairs of words
(al ... a", b l . . . b,,)EA * XB* such that there are
finite automata are very important; these include, for
instance, memoryless automata, autonomous automata, states s 1> ,S" + 1> for which S I ES I and
reversible automata, etc. In addition, the use of con- (Sjaisi+lbi)EX is true for any i=l, ... ,no The class of
cepts and methods of other branches of mathematics relations realized by initialized non-deterministic auto-
also generates specific classes of automata and related mata coincides with the class of finitely-determined
problems. For instance, if algebraic methods are relations (cf. Finitely-determined function).
employed there result the concepts of automata over b) An initialized non-deterministic automaton ~S, in
terms, as well as linear, group, free and other automata which a set S' of final states has been indicated, while
(cf. Automata, algebraic theory of); problems in coding the alphabet B is empty (i.e. XeS XA X S), represents
theory generate the concepts of self-regulating atomata, the event L~ consisting of all words a I ... an EA * such
reversible automata, etc. (cf. Automaton, probabilistic). that there are states S 1> 'Sn + I for which S I E S I ,
Structural automata also have several generalizations, Sn+IES' and (Siaisi+I)EX is true for any i=l, ... ,no
mainly consisting in permitting infinite networks and The class of events that can be represented by the auto-
altering the interconnections between the elements dur- maton ~S, is identical with the class of regular events,
ing the operation. This leads to the concept of a grow- i.e. with respect to these aspects of their behaviour
ing automaton. The principal modifications and sub- non-deterministic automata are equivalent to finite
classes of finite automata, together with their most automata. However, the highly general nature of the
important properties, are described below. concept of a non-deterministic automaton is reflected
Macro approach. l. Infinite automata (first group) in the fact that a different number of states may be
differ from finite automata only by the fact that their necessary to represent the same event using a non-
alphabets A, B or S (input, output and set of states) deterministic automaton and using a finite automaton.
may be infinite. Most concepts and problems connected There are events which are representable by a non-

299
AUTOMATON

deterministic automaton with m states and by a finite letter which is fed in at that moment. Such automata
automaton with 2m states, but not by any finite auto- are frequently referred to as functional elements and are
maton with a smaller number of states. extensively used in problems of synthesis of automata.
A special subclass of non-deterministic automata is Automata with a finite storage space (or automata with
formed by the so-called partial automata, in which the a finite memory) are finite automata in which each out-
relation X is a partial function mapping the set S XA put letter is fully determined by a bounded segment of
into S X B, and which realize partial finitely-determined the input word fed in during the preceding moments of
functions. time, irrespective of the initial state. For automata with
The term 'asynchronous automaton' usually denotes a finite memory and with n states the length of such a
one of the two following types of automata. The first segment does not exceed n (n - I) / 2, and this max-
type comprises automata of type (A, S, B, cp, l/J), in imum value is in fact attained for some automata.
which the output function l/J maps the set S XA into B* Automata with a finite memory are called self-
(for a finite automaton l/J maps S XA into B). These regulating if, at a given time t, the output letter at any
automata are mainly used in coding theory. The second moment 7"~t is independent of the initial state. Such
type comprises finite automata whose transition func- automata are used in coding theory (cf. Coding and
tion cp has the following property: cp(s, aa)=cp(s, a) for decoding), and the modification of the automata usually
all s and a. These automata are used in coding theory employed in such cases satisfies the above condition
and also in modelling certain systems in technology not for the set of all input words, but only for some
and biology. subset of this set. Automata with a finite memory are
3. Automata with a variable structure (third group) are realized by logical networks without feedback.
finite automata 2f=(A XA, S, B, cp, l/J) with two input Reverse automata or automata without loss of informa-
channels, together with some fixed infinite sequence a tion are finite automata which realize one-to-one func-
(a superword) in the alphabet A. Arbitrary words in tions. Such automata are also used in coding theory.
the alphabet A are sent to the first input (channel) of Micro-approach. There are three types of generalized
such an automaton, while initial segments of the same structural automata, which may be considered as gen-
length of the sequence a are sent to the second input eralized logical networks: 1) generalized logical net-
(channel). This imposes a restriction on the set of pairs works with a permanent structure, in which both the
of input words. If the automaton with a variable struc- elements and the relation between them remain
ture is considered as an automaton having only the first unchanged when the automaton is functioning; 2) gen-
input (into which any word of A can be fed), its transi- eralized logical networks with a variable structure; and
tion and its output functions will depend on the length 3) generalized logical networks made of volume ele-
of the input word which was fed in. In its behaviour, ments and connections. The main classes of such auto-
an automaton with a variable structure is equivalent to mata are described below.
an infinite automaton with finite input and output 1. Generalized logical networks with a permanent struc-
alphabets and with a countable set of states. ture. These include mosaic structures and iterative net-
4. Fuzzy automata constitute a generalization of the works, which are finite fragments of mosaic structures
concept of a finite automaton, obtained by replacing with a similar range of problems.
the transition and output functions by fuzzy relations. Mosaic structures are infinite unions of transition sys-
A fuzzy subset of a set M is defined as a function which tems (A, S, cp) (i.e. of finite automata of the type
maps M into the segment [0, 1]. Accordingly, in a fuzzy (A, S, S, cp, cp), where the output function is identical
automaton, the transition and the output functions are with the transition function and the output alphabet is
replaced by functions mapping the sets S XA X Sand identical with the set of states). Such systems are placed
S X A X B into [0, 1], where S is the set of states, A is at points with integer coordinates (integral points) of
the input alphabet and B is the output alphabet. The the n-dimensional space, while for each such point
basic concepts and problems typical of finite automata there is defined a finite set of integral points, called its
have natur:ll generalizatilJ!1s to fll7zy automata: this neighbourhood. The input alphabet of the transition
applies, in particular, to the representation of fuzzy system placed at a certain point is the Cartesian pro-
events and the realization of fuzzy relations. Fuzzy duct of the sets of states of the transition systems
automata are mathematical models for certain recogni- placed at the points in its neighbourhood.
tion mechanisms, and are used in pattern recognition. A mosaic structure may be regarded as an infinite
5. Special classes of finite automata. automaton whose input and output alphabets as well as
Automata without memory are single-state finite auto- its set of states are equal and are the infinite Cartesian
mata or automata equivalent to them. In such auto- product of the sets of states of all the transition systems
mata each output letter is fully determined by the input contained in it. This makes it possible to reduce many

300
AUTOMATON. BEHAVIOUR OF AN

problems for mosaic structures to problems for infinite distinguished by the fact that a certain volume is
automata. Problems specific to mosaic structures assigned to their elementary automata and their mutual
include modelling of effective procedures; in particular, connections. As a result there arises the problem of
of computational processes. Mosaic structures in which synthesis of automata with minimum possible volume.
arbitrary automata instead of transition systems are The term 'automaton' is also used in concepts such
used are also occasionally considered. as two-sided, multi-tape, multi-headed, linearly
Uniform structures form an important class of mosaic bounded, etc., automata, which are in fact modifica-
structures. If all the transition systems are identical tions of the Turing machine. Sometimes all abstract
and if the neighbourhood of any point is obtained by a machines are included in the concept of automata.
parallel translation of a certain fixed neighbourhood, References
the mosaic structure is known as a uniform structure or [I] McCullOCH, W.S. and PIrrs, W.: 'A logical calculus of ideas
a cellular automaton. It is usually assumed, in this con- immanent in nervous activity', Bull. Math. Biophys. S (1943),
115-133.
text, that there is some 'stable' state of the transitional [2] KLEENE, S.c.: 'Reproduction of events in nerve sets and finite
system, which is preserved if the input word is a tuple automata', in Automata studies, Princeton Univ. Press, 1956,
each term of which corresponds to this state. Typical pp. 3-14.
[3] BURKS, A.W. and WRIGHT, J.B.: 'Sequence generators, graphs,
problems on uniform structures are problems of self-
and forma11anguages', Inform. and ControlS (1962), 204-212.
reproduction and the 'Garden-of-Eden' problem. [4A] GLUSHIKOV, V.M.: 'The abstract theory of automata', Russian
At any moment the states of the transition systems Math. Surveys 16, no. 5 (1961), 1-53. (Uspekhi Mat. Nauk 16,
located at points in an integral lattice generate a kind no. 5 (1961), 3-62)
[4B] GWSHIKOV, V.M.: Uspekhi Mat. Nauk 17, no. 2 (1962), 270.
of spatial mosaic pattern, which is usually denoted as a [5] RABIN, M.O. and SCOTT, D.: 'Finite automata and their deci-
configuration. A configuration containing only a finite sion problems', IBM J. Res. Develop. 3 (1959), 114-125.
set of transition systems whose states are unstable (the [6] ZADEH, L.A.: 'Probability measures of fuzzy events', J. Math.
Anal. Appl. 23 (1968),421-427.
excited part) is called finite. The self-reproduction prob- [7] ALAD'EV, V.Z.: On the theory of uniform structures, Tallinn,
lem consists of finding out whether finite configurations 1972 (in Russian).
exist that, during the operation of the uniform system, V.B. Kudryavtsev
pass into configurations, whose excited part is the mul- Yu.I. Yanov
tiply iterated excited part of the original configuration. Editorial comments. See the Editorial Comment to the
The 'Garden oj Eden' is a term denoting a configuration article Automata, equivalence of.
which cannot result from configurations different from AMS 1980 Subject Classification: 03005, 18820,
itself. The 'Garden-oj-Eden problem' is to determine the 680XX, 93AXX
existence of 'Gardens of Eden' for a given uniform
structure. AUTOMATON, BEHAVIOUR OF AN - A mathemat-
2. Generalized logical networks with a variable struc- ical concept describing the interaction of the automaton
ture. There exist various types of such logical net- with its external environment. Thus, for a finite auto-
works. The most general ones include mosaic structures maton (cf. Automaton, fmite) the external environment
in which both the neighbourhoods of the elements and usually is the set of input words, while its behaviour is
the elements themselves vary during the operation. As the word function generated by the automaton or the
an example of such a generalized logical network one event (sometimes the super-event) represented by it.
may mention the one-dimensional structure simulating For an automaton over terms (see Automata, algebraic
the functioning of a Turing machine with input. In this theory of), the set of constant terms is the environment;
case one of the nodes of the one-dimensional network its behaviour is the class of terms whose values, calcu-
corresponds to the control mechanism, while the other lated using the automaton, belong to the isolated subset
nodes correspond to the tape cells, which are con- of elements of the corresponding algebra. For a mosaic
sidered as transition systems in which the letters of the structure the environment is the set of initial configura-
working alphabet of the Turing machine serve as the tions, while its behaviour consists of the sequences of
input letters and states. The commutation is determined configurations generated at the various moments of the
by the position of the head. time set. In general, the concept of the behaviour of
Another type of generalized logical networks with a (more general) automata is similar to, or is essentially a
variable structure are the so-called growing automata. slight modification of, the concept of the behaviour of
They are uniform structures whose excited part finite automata.
increases during the operation. There are a number of The so-called behaviour oj an automaton in a random
models of such automata, which simulate various medium is a special case. Here, 'medium' is understood
features of real mechanisms. to mean a probabilistic automaton ~, which transforms
3. Generalized logical networks oj volume elements are the output signals of the automaton m: under considera-

301
AUTOMATON, BEHAVIOUR OF AN

tion to input signals for 2. Thus, it may be assumed that d rnax :;;;;.1 /2. (An automaton 2n with a linear tac-
that the automaton 2 in the random medium ~ is an tic, with an output alphabet of k letters, has kn states
autonomous logical network, constructed out of the sij' i = 1, ... ,k, j = I, ... ,n, and the following transi-
automata 2 and ~ by connecting the output of each tion function cp and output function 0/:
automaton with the input of the other. The behaviour q,(sij' 1) = Si,j+ I> if J = 1, ... ,n -1,
of the automaton 2 in the random medium ~ may
then be regarded as the functioning of this autonomous q,(Sin, 1) = Sin, q,(sij'O) = Si,j-I> if J=2, ... ,n,
logical network. The medium ~ is said to be stationary q,(Sjj, 0) = SHI,1> !J,(sij,a) = hi,
if it is a single-state automaton. If the output signals of j = 1, ... , k, J = 1, ... , n.)
the automaton ~ are regarded as 'rewards' or 'punish-
The rules governing asymptotically optimal behaviour
ments' for the automaton 2, the natural problem arises
in a stationary random medium were first studied in
of how to construct an automaton 2 whose behaviour
mathematical statistics. However, the results obtained
in the medium ~ would be optimal, i.e. would yield the
in that field can obviously be translated into the
highest possible gain in that medium. It is usually
language of the theory of automata.
assumed that the output alphabet of the medium ~
The behaviour of automata is also studied in more
consists of the letters 0 and 1 and that, as a response to
complex media, as is the behaviour of collectives of
the output signals b \, ... ,bk of the automaton 2, the
automata in random media. In the latter case the auto-
letter 1 is put out, respectively, with probabilities
mata are considered to be players, and the rules of the
p J, . . ,Pk. Only the letter 1 is considered to be the
game played by the automata are considered as the
'reward' of the automaton 2.
medium.
If the medium ~ is stationary, the set of states of the
autonomous logical network becomes identical with the References
[I) BARZDIN', Y A.M. and 'fRAKlrrENBROT, B.A.: Finite automata:
set of states of the automaton 2. If, in addition, the behavior and syntheSiS, North-Holland, 1973 (translated from
output letter of the automaton 2 is unambiguously the Russian).
[2) TSETLIN, M.L.: Studies on the theory of automata and modelling
determined by its state, the functioning of this logical
of biological systems, Moscow, 1969 (in Russian).
network may be described by a stochastic matrix Q of [3) ROBBINS, H.: 'Sequential decision problems with a finite
state transitions. The case which is usually considered is memory', Proc. Nat. Acad. Sci. USA 42, no. 12 (1956), 920-923.
that of an ergodic matrix (cf. Ergodicity). Then the VB. Kudryavtsev
function Yu.I. Yanov
k k k
W(~,)8) = ~aiPi - ~ai(I-Pi) = ~ai(2pi -1)
Editorial comments. See the Editorial Comment to the
i=1 i=1 i=1 article Automata, equivalence of.
is defined, where (Ji is the sum of the final probabilities AMS 1980 Subject Classification: 03005, 680XX
of all the states which determine the output letter hi.
Then AUTOMATON, FINITE - A mathematical model of
min(2p, -1) ,;;:;; W(~,)8) ,;;:;; max(2pi -1). a system with a finite memory which processes discrete
, ,
information. Finite automata are one of the most
If the output signals of the automaton 2 do not depend
important types of control systems (cf. Control sys-
on the effects of the medium and if they are equally
tem). Substantially, a finite automaton may be
probable, i.e. if (J\ = ... = (Jk = 1 / k, one has
described as a system with input and output channels
W(~,)8) = Wo ?'-1.
2 k
= ki that is in one of n states s \, ... ,Sn at any discrete
moment of time. These moments constitute the time set.
The function W(2,~) is the mathematical expectation At each such moment signals - that is, letters from the
of a variable called the gain of the automaton 2 in the input alphabet - are fed into the input channel and
medium ~. One says that the behaviour of the automa- signals - that is, letters from the output alphabet -
ton 2r til a medium IR is goal-conforming if are produced by the output channel. Depending on the
W(2, ~ Woo The problem of optimal behaviour in a particular poini uf view, :;uch system may include for-
random medium can now be stated as follows. One has mal systems (d. Formal system), real automata, living
to construct a so-called asymptotically optimal sequence organisms, etc.
of automata 21'\, 21'2, ... , such that the mathematical The concept of a finite automaton may be defined
expectation of the gain of automaton 2ln tends to the from different points of view. In the macro approach,
maximum possible gain in this medium, which is i.e. when the only feature of interest is the external
d max = maxi {2Pi -1: i = 1, ... ,k} as n increases. In the behaviour of the system, a finite automaton may be
case here considered such a sequence is formed by so- defined as a class of functions, as a finite oriented
called automata with linear tactics under the condition graph or (in algebraic form) as a finite algebra with

302
AUTOMATON. FINITE

unary operations (cf. Automata, methods of specification are extended in a natural manner to infinite sequences
of). If the micro-approach is adopted, a finite automaton (superwords) over A. For this reason, the functioning of
is defined as a set of elements and a scheme of their a finite automaton is sometimes understood to mean a
interconnections, i.e. not only the functioning of the relation of the type F in which a is an arbitrary super-
automaton but also its structure is considered. This word. In such a case the value of the function cp(s, a) is
conception is accordingly known as structural, while the defined as the set of those and only those states which
finite automata themselves are known as structural are encountered in the sequence ~s, a) an infinite
automata or automata networks. number of times.
Macro approach. A finite automaton is a system A finite automaton with a designated initial state s 1
(A, S, B, cp,l/I) where A, S, B are finite alphabets, usu- is known as initialized and is denoted by ~Sl' The
ally non-empty, known, respectively, as the input alpha- behaviour of an initialized finite automaton ~SI is usually
bet, the set of states and the output alphabet; cp is the defined as one of the following four relations.
transition function, which maps the set S XA into S; 1/1 1) The function f(a)=~s], a), which maps A' into
is the output function, which maps S XA into B. Such B' (or A 00 into Boo, where A 00 and Boo are the sets of
finite automata are sometimes known as Mealy auto- all superwords over A and B respectively). This func-
mata. If the output function 1/1 maps S into B (i.e. is tion is called computable or realizable by the initialized
independent of the letters of the input alphabet), the finite automaton ~SI
finite automaton is known as a Moore automaton. Any 2) The set Ls' CA', which is defined for a given
Moore automaton is also a Mealy automaton. S' C S of the final states as follows:
The most important characteristic of a finite automa- Ls' = {aEA': cp(sl,a)ES'}.
ton is its behaviour (cf. Automaton, behaviour of an),
The set Ls is called an event representable by the finite
which may be defined in different ways. Depending on
automaton ~SI with set S' of final states.
the kind of behaviour under consideration, finite auto-
mata may be grouped into transformers, acceptors 3) The set of values of the function f(a)=~(s 1, a) for
(identifiers), generators, etc. In order to define the main all possible a from A " called the set enumerable by the
types of behaviour of finite automata, the functions cp given finite automaton ~SI
and 1/1 are extended to the set S XA' (where A' is the 4) The set M'Y CA 00, defined for a system y of sub-
set of all words over A, including the empty word !\): sets of S as follows:
cp(s, !\) = s, cp(s, aa) = cp(cp(s, a), a), My = {aEA 00: CP(SI, a)EY}.

Ij;(s, !\) = !\, Ij;(s, aa) = o/(cp(s, a), a), The set M'Y is known as a super-event, representable by
where s ES, aEA', a EA, while aa denotes the word the finite automaton ~SI with a system y of subsets of
obtained by adjoining the letter a to the word a. Thus, final states. Finite automata with behaviour of type 1)
the extension of the functions cp(s, a) and I/J(s, a) to are known as finite transformers, while those with
arbitrary s and a describes, respectively, the state into behaviour of type 2) are known as finite identifiers or
which the automaton passes from the state s under the acceptors.
effect of the input word a, and the output letter which If the Cartesian products A 1 X ... XAm and
is produced by the automaton at the moment of entry B 1 X ... X Bn are taken as the input and output
of the last letter of the input word a. Let al n denote the alphabets, respectively, behaviour of type 1) will be a
initia! segment of length n of the word a, and let ~s, a) tuple of n functions of m arguments. One says in such
and I/J(s, a) be the words over S and B defined as fol- a case that the automaton has m inputs and n outputs,
lows: the alphabets AI, ... ,Am and B 1, . . . ,Bn being called
cp(s, a) = cp(s, all )lj;(s, ah) ... cp(s, a), input and output alphabets of such an automaton,
Ij;(s, a) = Ij;(s, all )lj;(s, ah) ... Ij;(s, a). respectively. The class of events representable by a fin-
ite automaton and the class of functions computable by
The functions ~s, a) and ~s, a) describe the sequence
a finite automaton may be described by various
of states assumed by the automaton during inputting
mathematical tools. The principal result is that events
the letters of the word a, and also the output word, i.e.
representable by a finite automaton are identical with
the sequence of letters of the output alphabet put out
so-called regular events, while functions computable by
by the automaton when acted upon by the input word
a finite automaton are identical with finitely-
a. The ternary relation
determinded functions (d. Finitely-determinded func-
F = {(a,~s, a),J,(s, a)): aEA', SES} tion). In addition, the class of sets enumerable by a fin-
is known as the functioning (operation) of the finite ite automaton is identical with the class of events
automaton ~=(A, S, B, CP,I/I). The functions ~ and ~ representable by a finite automaton.

303
AUTOMATON, FINITE

Regarding behaviour of types 2), 3) and 4), Moore maton as defined above with input and output alpha-
automata are equivalent to Mealy automata in the bets of the form An, where A is a finite alphabet, ident-
sense that to each Mealy automaton there corresponds ical for all elements. The construction rules of struc-
an equivalent (i.e. displaying the same behaviour) tural finite automata from these elements describe the
Moore automaton, and vice versa. This is not true of permitted unions (identifications) of the inputs and out-
behaviour of type 1). However, if in a Moore automa- puts, and thus also define the input and output sets of
ton functions of the form the finite automaton thus obtained.
;j{s, a) = t[,{t/>(s, all )N{t/>(s, ah ... t[,{t/>(s, a,
are taken instead of ~, Moore automata will be
equivalent to Mealy automata.
A Moore automaton m=(A, S, B, cp,1/;) is said to be I
g
... t
r--b--i--l
! t... i
I

... J
L_______
an autonomous automaton, or a generator, if the transi-
tion function is independent of the letters of the input
alphabet. The behaviour of an initialized autonomous
automaton m SI is usually understood to mean the super- Fig. 2.
word
The most important example of such finite automata
where cpn+l(s)=cp(cpn(s)). This sequence IS periodic
are logical networks. One version of this concept is
apart from a certain initial segment. presented below. In this case A = {a, I} and the ele-
A finite automaton mis said to be a transition system ments are the so-called functional elements, which
if B = S and if, for any s from S, the equation represent a finite automaton with a single state, and
~s, a)=s is true, or if the output alphabet B is empty. also certain special Moore automata, called delays.
Thus, a transition system is completely defined by the
three parameters (A, S, cp).
The concepts of a finite automaton and functioning

These are distinguished by the fact that they have two
states, which are usually denoted by the letters and 1
of the input alphabet, and that the transition and out-
of a finite automaton may be defined by several put functions satisfy the conditions cp(a, b)=b and
equivalent methods (cf. Automata, methods of specifica- t/;( a) = a. Since functional elements have only one state,
tion of; Automaton, behaviour of an). The so-called
they are completely determined by the output function
canonical equations are extensively employed. For any 1/;, which in this case is a Boolean function of n argu-
word a let aCt) denote the t-th letter of a and let I a I
ments, where n is the number of inputs of the func-
denote the length of a. The functioning F of a finite tional elements. The elements themselves are initialized
automaton mwill then comprise those and only those logical networks whose inputs and outputs correspond,
word triplets (a, a, 13) that satisfy the following condi- respectively, to the inputs and outputs of the elements.
tions: 1) I a I = I 13 I = I a I - 1; 2) for each t such that Subsequent construction of more complex logical net-
1~t ~ I a I the equalities a(t + 1) = cp(a(t), aCt)), works is performed according to the following rules.
f3(t)=t/;(a(t), aCt)) hold. To define the behaviour of an 1. The union of two logical networks is a logical net-
initialized finite automaton mSI one must add the work whose inputs and outputs are, respectively, those
equality a(l) =s I. The totality of these equalities unam- of the two logical networks (Fig. 1).
biguously determines the behaviour of an initialized 2. The result of identifying two arbitrary inputs of a
finite automaton, and is usually referred to as the logical network is a logical network whose outputs are
canonical equations. Canonical equations are exten- the same as those of the initial logical network and
sively employed in the analysis and synthesis of auto- whose inputs are the same as those of the initial logical
mata. network, except for one of the identified inputs (Fig. 2).
Micro-approach. 3. The result of connecting one output of a logical
network to an input of another logical network IS a log-
l--r.-:.l-----t~:-r--l ical network. Its inputs are all the inputs of the first

____~ __ J
logical network and those inputs of the second logical

l-~T--
-t~-:-t -r.~~t
network that are not identified with the output of the
first logical network; the outputs are all the outputs of
both logical networks (Fig. 3).
Fig. 1. 4. The result of identification of the output of a logi-
cal network that is the output of a delay element with
Consider a set of elements consisting of a finite auto- any input of the same logical network is a logical net-

304
AUTOMATON. FINITE

the logical network as described above is identical with


the functioning of the finite automaton
(A m, S, An, cp, tV), where m is the number of inputs, n is
the number of outputs of the logical network and S is
the Cartesian product of the sets of states of all delays
in the logical network; the transition function cp is a
Fig. 3. coordinate-wise application of the transition functions
of the delays, while the output function tV is determined
work whose inputs are all the inputs of the initial logi- in accordance with the functioning of functional ele-
cal network except for the identified input, and whose ments and delay elements as described above.
outputs are all the outputs of the initial logical network For instance, let the elements consist of delays
(Fig. 4). With certain restrictions this rule may also be (marked as rectangles in Fig. 5) and of functional ele-
applied to outputs that are not outputs of a delay ele- ments with output functions X, x Vy and x&y (marked
ment of a logical network. as triangles with the respective symbols of the func-

i40
tions).
r--~-------1
I f ~ I

I 'I i
l--i-.-::- r ___ J
Fig. 4.

5. In any logical network it is possible to consider as


output only some of the outputs as defined in 1 - 4
above. Logical networks obtained from functional ele-
ments by the rules 1, 2, 3 and 5 are usually said to be
diagrams of functional elements.
The essential functioning of a logical network may be Fig. 5.
substantially described as follows. Let specified input
letters be assigned to all inputs of the logical network Fig. 5 shows a logical network which, at the moment of
at the moment t and let the states of the delay elements time t, outputs letter 1 if and only if the input letters at
be given. In this situation letters will be assigned to all the moments 0, ... ,t contain an odd number of letters
inputs and outputs of the elements of the logical net- '1' (at the initial moment the delay has the value 0;
work, in particular to all outputs of the logical network, letters a and b denote, respectively, the input and the
in accordance with the following rules. If the letters output of the logical network). If s(t), a(t) and b(t)
ail' ... ,ai. have been assigned to the inputs of a func-
denote, respectively, the state, the input letter and the
tional element with output function tf;(x1' ... ,xn ), the output letter at the moment t, the functioning of such a
letter assigned to its output at this moment will be the logical network may be defined by the canonical equa-
value tf;(ail, ... ,aiJ. If at the moment t the delay is in tions:
state a, the letter a is assigned at the same moment to
s(O) = 0, s(t + 1) = s(t)+a(t) (mod 2), b(t) = s(t).
the output. Identical letters are assigned to identified
inputs and outputs. Moreover, at the moment t + 1 the If the macro-approach is adopted, this automaton can
states of the delay are determined by the input letters be represented in the form (A, S, A, cp, tV) where
at the moment t, as was stated above, i.e. cf>(a, b)=b. A =S={O, I}, cf>(s, a)=s+a (mod 2) and tf;(s, a)=s.
Thus, if the initial states of the delay are specified, the The concept of a finite automaton was the starting
logical network defines a certain mapping of input point of the modern theory of automata (cf. Automata,
sequences over the alphabet A m into output sequences theory 01), which is also concerned with various modifi-
over the alphabet An, where m ~ 1 is the number of cations and generalizations of this concept.
inputs and n ~ 1 is the number of outputs of the logical References
network. The class of such mappings is identical with [1] SHANNON, C. and MACCARTHY, J. (EDs.): Automata studies,
Princeton Univ. Press, 1956.
the class of functions realized by finite automata in the [2] GLUSHKOV, Y.M.: Synthesis of numerical automata, Moscow,
first meaning of the word (i.e. with the class of 1962 (in Russian).
finitely-determined functions), since the functioning of [3] KORBINSKII, N.E. and TRAlrnTENBROT, B.A.: Introduction to

305
AUTOMATON, FINITE

the theory offinite automata, Moscow, 1962 (in Russian). bilistic automaton merely represents a countable class
V.B. Kudryavtsev of events, which also includes irregular events in the
Yu.I. Yanov general case. At special cut points, the so-called iso-
Editorial comments. See the Editorial Comments to the lated points, the probabilistic automaton represents
article Automata, equivalence of. only regular events. A number II. from the segment
AMS 1980 Subject Classification: 03D05, 68DXX [0, 1] is known as an isolated cut point for this automa-
ton if there exists a positive number 8 such that for any
AUTOMATON, PROBABILISTIC, stochastic automa- input word the probability of transforming the proba-
ton - A generalization of a finite automaton (cf. Auto- bilistic automaton by this word into its final state
maton, rmite) in which the transition and the output differs by at least 8 from 11..
functions are random functions. In other words, a pro- Most of the concepts and problems typical of finite
babilistic automaton may be defined as a system automata may be generalized, in different variants, to
(A, S, B, 1/>,1/1) where A, S, B are finite alphabets having probabilistic automata. Many of the concepts and
the same meaning as for finite automata, while 1/>, 1/1 are problems preserve the properties of finite automata.
random functions mapping S XA into S and B, respec- For instance, it is possible to introduce the concept of
tively, and are represented by systems of probability equivalence of states so as to retain the well-known
measures I/>s,a, 1/Is,a, defined for all a from A and s from theorem on the possibility of differentiating between
S, on S and B, respectively. These measures are usually states by means of a simple experiment (cf. Automata,
given by a stochastic matrix (cf. Automata, methods of experiments with). On the other hand, unlike finite
specification of). If these probability measures can automata, whose minimal form is unambiguously

assume only two values and 1, the concept of a pro- defined (up to an isomorphism), a continuum of
equivalent minimal probabilistic automata may exist
babilistic automaton is practically identical with that of
a deterministic automaton. Autonomous probabilistic for a given probabilistic automaton.
automata without output are essentially equivalent to There are different ways and means of setting up
discrete Markov chains. The functioning of a proba- probabilistic automata. For instance, a probabilistic
bilistic automaton is determined similarly to that of a automaton may be represented by a deterministic auto-
non-deterministic automaton, the initial state being maton with two inputs into one of which is fed a ran-
determined by giving a probability measure (J on S. If dom sequence of input letters. Probabilistic automata
the probabilistic automaton is in state s with a certain are mathematical models of many real mechanisms and
probability p and accepts the input letter a, it will, with are used in studying the behaviour of living organisms.
probability p-w(s, a, s', b) pass into state s', and will References
output the letter b of the output alphabet. [1] BUKHAREV, R.O.: Probabilistic automata, Kazan, 1970 (in Rus-
As with finite automata, probabilistic automata can sian).
[2] STARKE, P.: Abstrakte Automaten, Deutsch. Verlag Wissen-
be classified into transformers and acceptors, in accor- schaft., 1969.
dance with their type of behaviour. In the former case V.B. Kudryavtsev
the automaton operates by transforming input words Yu.I. Yanov
into output words and into words in the alphabet of AMS 1980 Subject Classification: 68025
states with certain probabilities. If the words are of the
same length, these probabilities form a probability AUTOMORPHIC FORM - A meromorphic function
measure, since such behaviour may be regarded as giv- on a bounded domain D of the complex space en that,
ing a countable system of such measures. In the latter for some discrete group r operating on this domain,
case, a subset S' \: S of final states and a number II. satisfies an equation:
from [0, 1], called the cut point, are given. An event f(y(x = F;m(x)f(x), XED, yEf,
which is representable by the probabilistic acceptor Here j y(x) is the Jacobian of the mapping x ~y(x) and
2!" = (A, S, 1/>, S', 11.), where I/> is a random function m is an integer known as the weight of the automorphic
1I1aWllig S XA into S a.nd is defined by a system of form. If the group r acts fixed-point free, then auto-
probability measures I/>s,a defined on S, consists of all morphic forms define differential forms on the quotient
words over A under the effect of which the automaton space D / r and vice versa. Automorphic forms may be
passes into one of the final states with probability at used in the construction of non-trivial automorphic
least 11.. As distinct from finite automata, probabilistic functions (cf. Automorphic function). It has been
automata can represent a continuum class of events. shown that if g (x) is a function that is holomorphic
Moreover, even a single probabilistic automaton may and bounded on a domain D, then the series
represent a continuum class of events if ,\ is varied. If
the input alphabet is a one-letter alphabet, each proba- 2:g(y(xj~' (x)
yE"i'

306
AUTOMORPHIC FUNCTION

converges for large values of m, thus representing a at that time is a simply-connected domain. Three cases
non-trivial automorphic function of weight m. These are distinguished: M=pl(C) (the complex projective
series are called Poincare theta-series. line, or the Riemann sphere), M=C and M=H (the
The classical definition of automorphic forms, given upper half-plane {ZEC: Imz>O}). In the first case the
above, has recently served as the starting point of a discrete groups f are finite, the curves M / f are alge-
far-reaching generalization in the theory of discrete braic curves of genus 0 (cf. Genus of a curve) and, con-
subgroups of Lie groups and adele groups. sequently, the automorphic functions generate a field of
References rational functions. Examples of automorphic functions
[1] POINCARE, H.: Oeuvres de Henri Poincare, Gauthier-Villars, in the case M = C are periodic functions (thus, the
1916-1965. function e 2wiz is invariant under the translation group
[2] SIEGEL, c.L.: Automorphe Funktionen in mehrerer Variablen,
{z~z +n: n EZ}) and, in particular, elliptic functions.
Math. Inst. Gottingen, 1955.
A.N. Parshin In this latter case, the curve M / f is compact and is
Editorial comments. References [A2] and [A3] can an elliptic curve, while the field K (f) is the field of all
serve to get some idea of modern developments and topics algebraic functions on M / f. Finally, for M=H and
in the theory of automorphic forms and its relations with a discrete group f such that M / f is compact or has a
other parts of mathematics. (Cf. the comments to the article finite volume (in the Poincare metric), M / f is an alge-
Automorphic function for a more general notion). braic curve and K(f) is again the field of algebraic
References functions on M / r. The genus g of this curve may be
[A1] BAILY, W.L. JR.: Introductory lectures on automorphic forms, determined by constructing a fundamental domain for
Iwanami Shoten & Princeton Univ. Press, 1973.
f in the form of a polygon in the upper half-plane H
[A2] BOREL, A. and CASSELMAN, W. (EDS.): Automatic forms,
representations and L-functions, 1-2, Amer. Math. Soc., (here regarded as the Lobachevskii plane). The basic
1979. method for constructing an automorphic function in
[A3] GELBART, S.S.: Automorphic forms on adele groups, Prince- this situation is to consider the quotient of two auto-
ton Univ. Press, 1975.
morphic forms (cf. Automorphic form) of the same,
AMS 1980 Subject Classification: 10005, 32NXX sufficiently large, weight. The method is due to Poin-
AUTOMORPHIC FUNCfION - A meromorphic func-
care, who used it to prove the results mentioned above
tion of several complex variables that is invariant under concerning the structure of the fields of automorphic
some discrete group f of analytic transformations of a functions [2], [3], [4]. An analogous construction for
given complex manifold M: elliptic functions is to represent such functions in terms
of quotients of theta-functions. It can be shown, using
I(y(x)) = I(x), XEM, YEf.
uniformization theory, that all fields of algebraic func-
Automorphic functions are often defined so as to tions of a single variable are obtained in this way [3J.
include only functions defined on a bounded connected These results, which were obtained as early as the
domain D of the n-dimensional complex space cn that nineteenth century, give a full description of the fields
are invariant under a discrete group f of automor- of automorphic functions for n = I and of the groups f
phisms of this domain. such that the space H / f has finite volume. The case
The quotient space X =M / f can be given a com- of groups f for which H / f has infinite volume
plex structure and automorphic functions are then (Kleinian groups) is much more difficult; the problems
meromorphic functions on X. A large number of cases involved are still being intensively investigated [5], [6].
studied concern a space X with a compactification X. It In the twentieth century the theory of automorphic
is then natural to include in the definition of an auto- functions concentrates on functions of several variables.
morphic function the requirement that it can be Perhaps the only case of automorphic functions of n
extended to the entire space X as a meromorphic func- variables studied in detail in the nineteenth century
tion. If M =D (i.e. M is a bounded connected domain), concerned Abelian functions, which are related to
this condition must be required for n = I only (if n > I Abelian varieties in a way similar to the relationship
or if M / f is compact, the condition is automatically between elliptic functions and elliptic curves [I], [7].
fulfilled). It can readily be shown that the automorphic The first example of automorphic functions of n vari-
functions constitute a field K(f) and the study of this ables on a bounded domain D are the modular func-
field is one of the main tasks in the theory of auto- tions of c.L. Siegel [7] (cf. Modular group). Their
morphic functions. domain of definition is an n-dimensional generalization
Automorphic functions of a single variable have been of the upper half-plane H, and is one of the main
very thoroughly studied. The theoretical foundations examples of a bounded, symmetric domain. Siegel
were laid by F. Klein [1] and H. Poincare [2] in the must also be credited with the first general results
nineteenth century. The manifold M usually considered obtained concerning arbitrary automorphic functions

307
AUTOMORPHIC FUNCTION

on a bounded domain D. He generalized Poincare's are also of importance in number theory. At the time
construction of automorphic functions, mentioned of writing they are the only tool in the study of zeta-
above, and showed that the field K(f) always contains functions of algebraic varieties [ 11 ]. Another very
at least n algebraically independent functions. promising number-theoretical direction in the theory of
Subsequent efforts were aimed at exhibiting domains automorphic functions is the study of p-adic auto-
D and groups f for which the following theorem on morphic functions and forms [9]. Finally, one must
algebraic relations is true. If JI, ... ,f" are algebraically mention the application of automorphic functions to
independent automorphic functions, then the field K(f) the study of ordinary differential equations in a com-
is a finite algebraic extension of the field of rational plex domain [I2] and in the construction of solutions of
functions e(fJ, ... ,f,,) CK(f). algebraic equations of degrees higher than four.
At the time of writing (1977) this theorem was References
proved for the following cases: 1) if the quotient space [1] KLEIN, F.: Development of mathematics in the 19th century, 1,
D / f is compact [7]; 2) if the group f is pseudo- Math. Sci. Press, 1979, Chapt.8 (translated from the Ger-
man).
concave [8]; and 3) if D is a symmetric domain and f [2] POINCARE, H.: Oeuvres de Henri Poincare, Vol. 4, Gauthier-
is an arithmetic group. A pseudo-concave group is Villars, 1916-1965.
defined as follows. Let X be a subdomain of a domain [3] FORD, L.R.: Automorphic functiOns, Chelsea, reprint, 1951.
[4] BATEMAN, H. and ERDELYI, A.: Higher transcendental functions.
D with closure also contained in D. In this situation a
Automorphicfunctions, Vol. 3, McGraw-Hill, 1955.
boundary point Xo EaX is said to be pseudo-concave if [5] La geometrie non-euclidienne dans la tMorie des fonctions auto-
for any open neighbourhood U of Xo and for any func- morphes, Moscow, 1952 (in Russian; translated from the
tion cp(x) regular in U there exists a point x E U n X French).
[6) KRA, I.: Automorphic forms and Kleinian groups, Benjamin,
for which 1 cp(x) 1 ~ 1 cp(xo) I. A group f is said to be 1972.
pseudo-concave if there exists a subdomain XC D such [7) SIEGEL, C.L.: Automorphe Funktionen in mehrerer Variablen,
Math. lost. Gottingen, 1955.
that each boundary point x EaX can be transformed by
[8) ANDREOTII, A. and GRAUERT, H.: 'Algebraische Korper von
means of an element of f into an interior point of X or automorphen Funktionen', Nachr. Akad. Wiss. G6ttingen 3
into a pseudo-concave point of the boundary ax. (1961).
The nature and the properties of the algebraic [9) SERRE, J.-P., DELIGNE, P. and KUYK, W. (EDS.): Modular func-
tions of one variable. Proc. Internat. Summer Scool RUCA 1972,
varieties occurring in the theory of automorphic func- 1-3, Springer, 1973.
tions of n variables have not been intensively studied, [10) JACQUET, H. and LANGLANDS, R.P.: Automorphicforms on
as distinct from the case of a single variable. GL(2), Springer, 1970-1972.
[il) SHIMURA, G.: Introduction to the arithmetic theory of auto-
Important generalizations of the concept of auto-
morphiC functiOns, Math. Soc. Japan, 1971.
morphic functions - automorphic forms, theta- [12) GoLUBEV, V.V.: Vorlesungen Uber Differentialgleichungen im
functions (cf. Theta-function) and certain other gen- Komplexen, Deutsch. Verlag Wissenschaft., 1958 (translated
eralizations - are all special cases of the following from the Russian).
A.N. Andrianov
general construction. Consider a fibre bundle (cf. Fibra- A.N. Parshin
tion) Lover M and an action of a group f on it. It is
Editorial comments. The result mentioned above that
then possible to consider the sections of L that are the field K(f) is a finite algebraic extension of a field of
invariant under r. An automorphic function is rational functions C(f1, ... , fn) (the theorem on algebraic
obtained if the fibre bundle L and the action of the relations) in the case of a symmetric domain 0 and an arith-
group r are both trivial. metic group r is due to, independently, W.L. Baily jr. and A.
The study of automorphic functions revealed the Borel [A6], and 1.1. PyatetskiT-Shapiro [An
important role played by the group of automorphisms Let X be some kind of space (e.g. complex- or real-
of a domain D. It is in this way that the concepts and analytic, a smooth manifold), r a group of automorphisms
methods of the theory of automorphic functions were of X and H a group acting on a space V. Let Mor(X, H) be
applied in the theory of algebraic groups, in which they the set of morphisms from X into H. An automorphy factor
play an important part in the description of infinite- of r is a 1-cocycle (crossed homomorphism) j of r with
values in Mor(X, H). That means it is a mapping j: XX r--,>H
di..~ensiona! representations [10]
such that llX, yy') = j(x, y)j(xy, y"). All eXctli1pie is the Jaco-
From the very beginning of its development, the
bian of j as a diffeomorphism X--,>X (chain rule). An auto-
theory of automorphic functions has been connected in
morphic form of type j is now a morphism f: X-'> V such that
numerous ways with other branches of mathematics. f(x) = j(x, y) f(xy). Taking the Jacobian as an automorphy
This applies in particular to algebraic geometry. In factor and H=GL(C) acting on C via the m-th power of the
addition to the results discussed above, methods in the determinant one recovers the more classical notion of an
theory of automorphic functions are important in the automorphic form of weight m, cf. Automorphic form. The
study of moduli varieties for objects such as algebraic automorphy factor j can be used to define an action of r on
curves and Abelian varieties. Automorphic functions XX V by (x, v)y=(xy, j(x, y)v). If r now operates freely on

30~
ACT00iOMOUS SYSTEM

xx Vas a properly discontinuous group of transformations, AUTONOMOUS SYSTEM of ordinary differential


then (XX V) / f is a fibre bundle over X / f with fibre Vand equations - A system of ordinary differential equations
the automorphic forms are the cross-sections of this bundle, which does not explicitly contain the independent vari-
or, equivalently, the f-equivariant cross-sections of the able t (time). The general form of a first-order auto-
trivial bundle XX V~x. nomous system in normal form is:
In a still more group-theoretical setting let G be a real
semi-simple Lie group with Lie algebra g. Identify the univer- Xj = jj(Xh ... ,xn ), )=1, ... ,n,
sal enveloping algebra Ug of g with the right-invariant dif- or, in vector notation,
ferential operators D( G) on G by extending the mapping x = f(x). (1)
which assigns to a E g the corresponding right-invariant vec-
tor field. Let K be a maximal compact subgroup of G and f x
A non-autonomous system = f (t, x) can be reduced
a discrete subgroup and let p: K~GL(V) be a representation to an autonomous one by introducing a new unknown
of K. A smooth vector-valued function f: G~ V is called an function Xn + I = t. Historically, autonomous systems
automorphic form for f if f(kgy) =p(k) f(g), (Zg)f is a finite first appeared in descriptions of physical processes with
vector space, where Z(g)CUg=D(G) is the centre of Ug, a finite number of degrees of freedom. They are also
and f satisfies a certain growth condition. The link with the called dynamical or conservative systems (cf. Dynamical
notion 'automorphic form of type j' discussed just above is system).
provided by X = K \ G, the left coset space of Kin G, and a A complex autonomous system of the form (1) is
canonical automorphy factor (with H= Kc) which can be
equivalent to a real autonomous system with 2n
defined in this setting. Cf. [A 1] for more details on all this.
unknown functions
Besides the applications of automorphic functions in ordi-
d d
nary differential equations and algebraic equations men- dt (Rex) = Ref(x), dt (1m x) = Imf(x).
tioned above there is also a most revealing connection
between the harmonic analysis of functions automorphic The essential contents of the theory of complex auto-
with respect to a discrete subgroup of SL2 (R) and nomous systems - unlike in the real case - is found
Lax- Philips scattering theory applied to the non-Euclidean in the case of an analytic f (x) (cf. Analytic theory of
wave equation, cf. [A4], [A5]. differential equations).
For more material closely related to automorphic forms Consider an analytic system with real coefficients
and automorphic functions, cf. also the articles Modular and its real solutions. Let x =q,(t) be an (arbitrary)
form; Modular function; Fuchsian group; Discrete sub-
solution of the analytic system (1), let ~ =(L , t +) be
group; Discrete group of transformations.
the interval in which it is defined, and let x (t; to, xo)
References be the solution with initial data x 1/=10 =xo. Let G be a
[A1] BOREL, A.: 'Introduction to automorphic forms', in A. Borel
and G.D. Mostow (eds.): Algebraic groups and discontinuous
domain in Rn and fEe 1(G). The point x EGis said to
subgroups, Amer. Math. Soc., 1966, pp. 199-210. be an equilibrium point, or a point of rest, of the auto-
[A2] FRICKE, R. and KLEIN, F.: Vorlesungen Dber die Theorie der nomous system (1) if f(xo)-O. The solution cp(t)=-X,
automorphen Funktionen, 1-2, Teubner, 1926.
[A3] BOREL, A. and CASSELMAN, W.: Automorphic forms, t E R = ( - 00, + 00), corresponds to such an equilibrium
representations and L-functions, 1-2, Amer. Math. Soc., point.
1979. Local properties of solutions. I) If 4>(t) is a solution,
[A4] FADDEEV, L.D. and PAVLOV, B.S.: 'Scattering theory and
automorphic functions', Proc. Steklov Inst. Math. 27 (1972),
then 4>(t + c) is a solution for any c ER.
161-198. 2) Existence: For any to ER, xO EG, a solution
[AS] LAX, P.D. and PHILLIPS, R.S.: 'Scattering theory for auto- x(t; to, xo) exists in a certain interval ~3t.
morphic functions', BUll. Amer. Math. Soc. (New Ser.) 2
3) Smoothness: If fEcP(G), p~l, then
(1980), 261-296.
[A6] BAILY, W.L., JR. and BOREL, A.: 'Compactifications of arith- q,(t)E cP + 1 (~).
metic quotients of bounded symmetric domains', Ann. of 4) Dependence on parameters: Let f= f (x, a),
Math. 84 (1966), 442-528. aEG a CRm , where G a is a domain; if fEcP(GXG a ),
[A7) PYATETSKlT-SHAPIRO, I.I.: 'Arithmetic groups on complex
domains', Russ. Math. Surveys 19 (1964),83-109. p ;;:'1, then x (t; to, xO, a)E cP(~ X G a) (for more details
see [1] - [4]).
AMS 1980 Subject Classification: 32NXX
5) Let xO be a non-equilibrium point; then there
exist neighbourhoods V, W of the points xO,f(xo),
AUTOMORPIDSM - An isomorphism (isomorphic
mapping) of a system of objects onto itself. The totality respectively, and a diffeomorphismy=h(x): V~W such
of all automorphisms of an arbitrary algebraic system that the autonomous system has the form y =const in
forms a group, and the study of this group is an impor- W.
tant and powerful tool in the study of the properties of A substitution of variables x = <I>(y) in the auto-
the system itself (cf. Algebraic system, automorphism of an). nomous system (1) yields the system
AMS 1980 Subject Classification: 18A20 (2)

309
AUTONOMOUS SYSTEM

where q; (y) is the Jacobi matrix. of a point other than an equilibrium point is trivial (cf.
Global properties of solutions. 1) Any solution x =4>(t) local property 5) of the solutions): The family of phase
of the autonomous system (1) may be extended to an trajectories is diffeomorphic to a family of parallel
interval a=(I-, t +). If a=R, the solution is said to straight lines. For a linear autonomous system the
be unboundedly extendable; if t + = + 00, t _ > - 00, the structure of phase trajectories in a neighbourhood of an
solution is said to be unboundedly extendable forwards equilibrium point is known, since the autonomous sys-
in time (and, in a similar manner, backwards in time). If tem is integrable [5J. For non-linear autonomous
t + < + 00 then, for any compact set K cO, xO EK, systems this problem has not yet been completely
there exists a T=7{K)<t + such that the point solved, even for n =2 (d. Qualitative theory of differen-
x(t;to,xo) is outside K for t>7{K) (and, analogously, tial equations). One aspect of this problem is the ques-
for t _ > - 00; cf. Prolongation of solutions of differen- tion of stability of an equilibrium point (cf. Stabllity
tial equations). theory). A few results will be given below. Let xO,yO
2) The extension is unique in the sense that any two be equilibrium points of the system (1), let
solutions with common initial data are identical j = g(y) (1')
throughout their range of definition. and let U, V be neighbourhoods of the points xO ,yo.
3) Any solution of an autonomous system belongs to The systems (1) and (1') are said to be equivalent in
one of the following three types: a) aperiodic, with neighbourhoods of their equilibrium points xO ,yO if there
4>(t)=F4>(t2) for all t(=I=t2, tjER; b) periodic, non- exist neighbourhoods U, V and a bijective mapping
constant; or c) 4>(t)=const. h: U~V such that (hof)x=(g'oh)x (for XEU, fXEU,
Geometric interpretation of an autonomous system. To (gl oh)x E V), i.e. as a result of the substitution y =h (x)
each solution x =4>(t) is assigned a coresponding curve the trajectories of the autonomous system (1) go into
r: x =4>(t), tEa, inside the domain G. G is then said trajectories of the autonomous system (1'). The
to be the phase space of the autonomous system, r is equivalence is said to be differentiable (topolOgical) if h
its trajectory in the phase space, and the solution is is a diffeomorphism (homeomorphism). Let xO be an
interpreted as motion along this trajectory in the phase equilibrium point of the autonomous system (1), let the
space. The mapping gl: G~G defined by the formula matrix / (xo) be non-degenerate, and let it not possess
gIXO=x(t;O,xo) (Le. each point moves along the phase any pure imaginary eigen values. Then the autonomous
trajectory during time t) is called the phase flow. In its system (1) in a neighbourhood of xO is topologically
domain of definition the phase flow satisfies the follow- equivalent to its linear part y =/ (xo)y. An important
ing conditions: 1) gl X is continuous in (t, x); and 2) the example is the autonomous system x=Ax,y=By where
II
group property +1, X =gIl gl, x. A, B are constant matrices with pure imaginary eigen
The Liouville theorem is valid: Let D C G be a values and n > 2; it is not known when these auto-
domain with a finite volume and let VI be the volume of nomous systems are topologically equivalent. One of
the domain gl D C G, then the most fundamental problems in the theory of auto-

~: Lo = div f(x) dx.


nomous systems is that of the structure of the entire
(3) family of phase trajectories. The most complete results
have been obtained for n = 2, but even in this case the
For a Hamiltonian system, a consequence of (3) is solution is far from complete.
the conservation of the phase volume by the phase
References
flow. A second variant of (3) is obtained as follows. Let [1) PETROWSKI, I.G. [I.G. PETROVSKIi): Vorlesungen aber die
x =cf>(t, a) be a family of solutions of (1), Theorie der gew6hnlichen Differentialgleichungen, Teubner, 1954
a=(a), ... ,an-dEG a , let Ga be a domain and let (translated from the Russian).
(2) PONTRYAGIN, L.S.: Ordinary differential equations, Addison-
q,EC)(aXG a ), then Wesley, 1962 (translated from the Russian).

:r lnI(t, a) = div f(x), (3')


(3) CODDINGTON, E.A. and LEVINSON, N.: Theory of ordinary dif-
ferential equations, McGraw-Hill, 1955.
(4) ARNOL'D, V.I.: Ordinary differential equations, M.LT., 1973
where I (t, a) =det ax / a(t, a). (translated from the Russian).
Structure of phase trajectories. 1) Any two phase tra- (5) NEMYTSKli, V.V. and STEPANOV, V.V.: Quantitative theory of
differential equations, Princeton Univ. Press, 1960 (translated
jectories have either no points in common or coincide. from the Russian).
2) Any phase trajectory belongs to one of the M. V. Fedoryuk
following types: a) a smooth, simple, non-closed Jor- Editorial comments. The question of the topological
dan arc; b) a cycle, i.e. a curve diffeomorphic to a cir- equivalence (homeomorphic equivalence of orbit systems)
cle; or c) a point (an equilibrium point). The local x y
of two linear dynamical systems = Ax and = By has been
structure of phase trajectories in a small neighbourhood examined and solved [A2]. Let V,t, VA, \.1 be the decompo-

310
AVERAGE

sition of Rn corresponding to (generalized) eigen values of must be made between a given linguistic term used as a
A with positive, negative and zero real parts respectively, name, and the term used for the object denoted by the
and let AD be the restriction of A to \.1. Then the orbit sys- name. The differentiation between the denotation and
tems of x=Ax and y=By are homeomorphic if the object denoted may be achieved by using terms spe-
dim lit =dim vt, dim VA =dim Va and AD and BO are cially created for the purpose, or by using quotation
linearly equivalent (i.e. AD = S-1 B OS for some constant
marks; the term between quotation marks is then
invertible matrix 5).
assumed to differ from the same term used without the
The analogous question for discrete dynamical systems:
when does there exist for two linear endomorph isms
quotation marks. Autonymous use of a term merges the
A, B: Rn~Rn a homeomorphism t/> such that At/> = t/>B, is meaning of the term with the term itself: it is both the
much deeper (topological similarity of matrices) and has to object itself and the name by which it is denoted.
do with such things as lens spaces and the P.A. Smith con- References
jecture. See [A1] for a full account. A theorem of G. de [1] CHURCH, A.: Introduction to mathematical logic, 1, Princeton
Rham says that if the orthogonal matrices A and B are topo- Univ. Press, 1956.
[2] KLEENE, S.c.: Introduction to metamathematics, North-Holland,
logically similar and the topological (non-linear) similarity
1951.
preserves the unit sphere and restricts to a diffeomorphism [3] CURRY, H.B.: Foundations of mathematical logic, McGraw-Hill,
in it, then the matrices A and B are also linearly similar. 1963.
Results of S.E. Cappell and J.L. Shaneson imply that for A.S. Kuzichev
matrices of eigen values of absolute value 1 (the crucial AMS 1980 Subject Classification: 00A25
case) topological and linear equivalence are the same for
dimensions ";;;5. Combined with results of Kuiper-Robbin AVERAGE with weight q=(ql,'" ,qn), qi>O,
[A3] this gives a complete topological classification of ~qi = 1, of a set of real numbers a =(aj, ... ,an) - A
matrices of dimension at most 5. In higher dimensions it is variable
definitely not true in general that for matrices of eigen
values of modulus 1 topological similarity implies linear simi-
ffiC</o(a, q) = t/>-I [~qjt/>(a,)],
I

larity [A 1]. These results can be used to tell at what times t where <t>(x) is a continous strictly-monotone function on
the phase flows for continuous-time systems become topo- R. When <j>(x) = x', one obtains
logically equivalent. This can happen repeatedly even for
systems that are not topologically equivalent. ffiCr(a, q) = [~qjaT / r
References
[A 1] CAPPELL, S.E. and SHANESON, J.L.: 'Non-linear similarity', and, in particular, when r = 1, qi = 1/ n, i = 1, ... , n,
Ann. of Math. 113 (1981), 315-355. 1JR,(a, 1/ n)=m:(a) will be the arithmetic average of the
[A2] KUIPER, N.H.: 'The topology of the solutions of a linear dif-
ferential equation on Rn', in Proc. Internat. Congress on numbers aI, ... , an, while when r = - 1, it will be the
Manifolds Tokyo, 1973, pp. 195-203. harmonic average. The concepts of the geometric average
[A3] KUIPER, N.H. and ROBBIN, J.W.: 'Topological classification of (a) = (IIai)1 / n and the weighted geometric average
linear endomorphisms', Inv. Math. 19 (1973), 83-106. I

AMS 1980 Subject Classification: 34A34


@(a, p) = [1/afT /~'p,
AUTONYMY - An expression (a word) used as its are introduced separately.
own proper name. Such a use of an expression is said One of the basic results of the theory of averages is
to be autonymous (as distinct from its use in the ordi- the inequality (a)<m:(a), except when all ai are equal
nary sense). For instance, if one says 'x forms part of to each other. Other results are:
the equation x + 3 = 2', then one is using x as the name 1) IJRcp(ka,p)=klJRcp(a,p), k>O;
of the letter x, and is using 'x + 3 = 2' as the name of 2) f1J4(a,p)=lJRcp(a,p) if and only if 1/;= acp +/3,
the expresion x+3=2. If one says '12 is divisible by /3ER, a=O;
2', then the term '12' is used in a non-autonymous 3) 1JR",(a,p)~lJRcp(a,p) if and only if cpol/;-I is a con-
manner (as the designation of an integer), while if one vex function; in particular 1JR,(a,p)~IJRS<a,p) if r<s.
says '12 consists of two digits' one says the term '12' is The concept of an average can be extended to infin-
used autonymously (as the name of itself). ite sequences under the assumption that the
In natural languages the context and the syntax are corresponding series and products converge, and to
usually sufficiently reliable indicators for distinguishing other functions. The following is such an example:
between the autonymous and non-autonymous use of b

an expression. However, such a differentiation may be t/>-I jp(x)<p(j(xdx


difficult in certain cases. Special care is then required ffiC</o (t, p) = - - - " -a- b - - - - - - " ' -
to avoid ambiguity: the object itself must be dis- jp(x)dx
tinguished from its name (its denotation); a difference a

311
AVERAGE

given the condition that J(x)~O almost everywhere on existence of an average rotation for any uniforml
the corresponding interval and that p(xO. Thus, almost-periodic function for which inf IJ (t) I >0 (see
b b [4]). In this case, the difference argJ(t)-ct is a uni-
jf(x)p(x)dx ",;;; lJJl.pif,p)jp(x)dx. formly almost-periodic function and is bounded. The
a a
average rotation of analytic almost-periodic functions
References in the general case has also been studied (see [1], [4]).
[1) HARDY, G.H., LITTLEWOOD, J.E. and P6LYA, G.: Inequalities, In this case, the average rotation does not always exist,
Cambridge Univ. Press, 1934.
VI. Sobolev but if it does, then the difference argJ (t) - ct is not
Editorial comments. Instead of 'average' the term necessarily bounded. Nonetheless, it can still possess
'mean' is also quite often used: arithmetic mean, geometric certain generalized properties of almost-periodicity; this
mean, etc. is true, in particular, for trigonometric polynomials [5].
References
Except in the analytic case, only isolated results con-
[A 1] MITRINOVlt, D.S.: Analytic inequalities, Springer, 1970. cerning the average rotation of a function J for which
[A2] MITRINOVlt, D.S.: Elementary inequalities, Noordhoff, 1964. IJ(t) I~O, inf IJ(t) I =0, exist (see [6], [7]).
AMS 1980 Subject Classification: 26DXX References
[1) JESSEN, B. and TORNEHAVE, H.: 'Mean motions and zeros of
almost periodic functions', Acta Math. 77 (1945), 137-279.
AVERAGE ROTATION of the argument argJ(t) of a
[2) JESSEN, B.: 'Some aspects of the theory of almost periodic
complex-valued, uniformly almost-periodic function functions', in Proc. Internat. Congress Mathematicians Amster-
J (t)- A phenomenon consisting of the existence (given dam, 1954, Vol. 1, North-Holland, 1954, pp. 304-351.
certain conditions, see below) of the limit [3) WEYL, H.: 'Mean motion', Amer. J. Math. 60 (1938),889-896.
(4) LEVITAN, B.M.: Almost-periodic functions, Moscow, 1953 (in
lim arg f(t) = c. Russian).
' ..... 00 t (5) Doss, R.: 'On mean motion', Amer. J. Math. 79, no. 2 (1957),
389-396.
The limit itself is also called the average rotation (mean (6) LEVITAN, B.M.: Mat. Zametki 1, no. 1 (1967), 35-44.
motion). If IJ (t) I~O for all t, then the selection of a (7) GoRIN, E.A.: 'A function algebra invariant of the Bohr-van
continuous branch of argJ (t) is implied. For analytic Kampen theorem', Mat. Sb. 82 (124) (1970), 260-272 (in Rus-
almost-periodic functions, the concept of average rota- sian). D. V Anosov
tion can be retained even when J contains zeros. Editorial comments. Instead of the term 'average rota-
Namely, the concepts of 'right' and 'left' arguments are tion' one also uses 'mean motion', cf. e.g. [2], p. 310.
introduced, whose difference jumps by +k'TT at a zero AMS 1980 Subject Classification: 42A75, 42A99
of multiplicity k of f, and one therefore also speaks of
right and left average rotation, unless they coincide, in AVERAGE VALUE, THEOREM ON VARIATIONS
which case one simply speaks of average rotation. OF THE, in statistical mechanics - A statement on the
The question of the average rotation arose in connec- variation of the average value of a dynamic magnitude
tion with the fact that in celestial mechanics the longi- in the Gibbs statistical aggregate as a result of an infini-
tude of the perihelion of a planet is expressed approxi- tesimal change in the Hamiltonian. The variation of the
mately as the argument of a certain trigonometric poly- average depends, generally speaking, on the way of
nomial 'inclusion' of the change in the Hamiltonian and on the
initial conditions. Let a system of many interacting par-
ticles (a quantum system or a classical system), that can
J.L. Lagrange studied two simple cases, namely, where
explicitly be described by a time-independent Hamil-
one of the I aj I is greater than the sum of the remain-
tonian H, be in thermodynamic equilibrium at the ini-
ing coefficients, and for n = 2; he noted that in other
tial moment of time t~- 00. As a result of an adia-
cases the question is complicated. The study of this
batic inclusion of an infinitesimal, time-dependent per-
question has been taken up only in the twentieth cen-
turbation of the Hamiltonian
tury (for its history, see [I] - [3]). The final result that
a trigonOlllt:lllC polynomial ahvays has an average rot::l- H ~ H+8V(t),
tion was stated in 1938 by B. Jessen (for its proof, see where
[I)). (From the point of view of the theory of dynami-
8V(t) = ~e"-iE'8VE' >O, ~+O,
(E)
cal systems, it is a matter of averaging a certain func- the Gibbs equilibrium average <A > of the explicitly
tion on a torus along the trajectories of the flow time-independent dynamic variable A will change (in a
defined by shifts by elements of a one-parameter sub- linear approximation with respect to the perturbation)
group. However, this function has singularities that by the magnitude
obstruct the automatic use of the corresponding general
theorem.) Even earlier than this, H. Bohr proved the 8<A(t = -2'7Ti~e't-lE' A 18VE~et),
(E)

312
AVERAGING

where A 18VE~et) is the Fourier transform of 3) Multi-frequency non-autonomous systems


the retarded commutator Green function (cf. Green
function in statistical mechanics).
~ = p.X(x,y, t),
!!l!.. _
1 (3)
The main application of the theorem is in the theory dt - w(x,y, t)+p.Y(x,y, t).
of non-equilibrium irreversible processes (in which one
formulation of the theorem is also known as the Instead of the systems (1) - (3), 'simpler' averaged
fluctuation-dissipation theorem), and in deducing the systems of a first approximation are considered:
chains and systems of equations for the Green func-
tions from the chains and systems of equations for the ~; = p.Xo(x); (1 ')
correlation functions (cf. Correlation function in statisti-
~; = p.X1(X),}
cal mechanics).
(2')
References !!1.
dt
= w(x)'
[I) KUBO, R.: J. Phys. Soc. Japan 12 (1957),570.
[2) BOGOLYUBOV, JR., N.N. and SADOVNIKOV, B.I.: Zh. Eksper.
Teoret. Fiz., no. 43 (1962), 677. ~; = p.X2 (x, xo,Yo, t),}
[3) BOGOLYUBOV, JR., N.N. and SADOVNIKOV, B.I.: Some ques- (3')
tions in statistical mechanics, Moscow, 1975 (in Russian). !!1.-
dt - w(x, y, t),
[4) TYABLIKOV, S.V.: Methods of the quantum theory of magnetism,
Plenum Press, 1967 (translated from the Russian). where
1T
VN. Plechko f
Xo(x) = lim -T X (x, t, 0) dt,
T_oo 0
(4)
Editorial comments.
References (5)
[A1] BOGOLYUBOV, N.N. and BOGOLYUBOV, IR.N.N.,: Introduction 1 2" 2"
to quantum statistical mechanics, World Scientific, 1982. = (27T)" [ ... [X(X,Yl, ... ,Yn)dYI ... dYn,
AMS 1980 Subject Classification: 82A05
X 2 (x, xo,Yo, to) = (6)
to+T
AVERAGING - The operation of calculating the
mean values of functions forming part of the structure
= lim T1
T _00
f X(x, <j>(xo,Yo, to, t), t)dt.
10

of differential equations which describe periodic,


The formulas (4) - (6) express the most widespread
almost-periodic, and, generally, oscillating processes.
averaging methods.
The operation of averaging can be considered as a type
Formula (6) expresses the scheme of averaging 'along
of smoothing operator. Averaging methods first came
the generating solution'. In the function X(x,y, t), the
into use in celestial mechanics in the study of planetary
vector y is initially substituted by the generating solu-
motion around the Sun. Later, they propagated to a
tion of the system
wide variety of areas: to the theory of non-linear oscil- dx =0
lations, to physics, to the theory of automatic control, dt '
to astrodynarnics, and to others. Averaging methods !!l!..-
dt - w(x, Y, t),
have often provided approximate solutions for the ini-
tial equations. The most typical classes of differential after which the integral average (6) is calculated.
equations for which averaging methods are used are the The principal question which arises when the systems
following. (1) - (3) are changed is the construction of f-estimates
1) Standard systems in the sense of N.N. Bogolyubov for the norms
dx
dt = pX(x, t, p.), (1)
I x(t, p.)-x(t, p.) II, I y(t, p.)-y(t, p.) I
on the largest possible interval (of order 1/ fL) of time,
where x, X are vectors, t is the time, and fL is a small if
positive parameter. x (0, p.) = x(O, p.), Y (0, p.) = yeO, p.).
2) Multi-frequency autonomous 2'lT-periodic systems This is the essence of the problem of averagmg

~~ = p.X(X,y), 1 methods. For systems (1), this problem of averaging


methods was proposed by Bogolyubov, whose results
!
d
= w(x)+p.Y(x,y),
(2)
formed the basis of the modem algorithmic theory of
ordinary differential equations.
where x, y, X, Y are vectors, References
X(X,y+(27T)) X(x,y), Y(x,y+(2'lT)) Y(x,y), [I) BOGOLYUBOV, N.N. and MITROPOL'SKIi, YU.A.: Asymptotic
methods in the theory of non-linear oscillations, Gordon and
and w(x) is the frequency vector. Breach, 1961 (translated from the Russian).

313
AVERAGING

[2] MrrROPOL'SKIi, Yu.A.: Averaging methods in non-linear which can be moved in space to form two spheres
mechanics, Kiev, 1971 (in Russian). identical to it.
[3] VOLOSOV, V.M. and MORGUNOV, B.I.: Averaging methods in
the theory of non-linear oscillatory systems, Moscow, 1971 (in Many postulates equivalent to the axiom of choice
Russian). were subsequently discovered. Among these are: I) The
[4] GREBENIKOV, E.A. and RYABov, Yu.A.: Constructive methods well-ordering theorem: On any set X there exists a total
of analysis of non-linear systems, Moscow, 1979 (in Russian).
order Rc;;,XXX such that any non-empty set Vex
E.A. Grebenikov contains a least element in the sense of the relation R;
Editorial comments. A recent up-to-date treatment of 2) the maximality principle (Zorn's lemma): If any
averaging matters is given in [A 1]. totally ordered subset V of a partially ordered set X is
References bounded from above, X contains a maximal element; 3)
[A1] SANDERS, J.A. and VERHULST, F.: Averaging methods in non- any non-trivial lattice with a unit element has a maxi-
linear dynamical systems, Springer, 1985.
mal ideal; 4) the product of compact topological spaces
AMS 1980 Subject Classification: 58FXX, 34C29 is compact; and 5) any set X has the same cardinality
as XXx.
AXIAL VECfOR, pseudovector - A vector in oriented The axiom of choice does not contradict the other
space which, when the orientation of space is reversed, axioms of set theory (e.g. the system ZF) and cannot be
is transformed into the inverse vector. An example of logically deduced from them if they are non-
an axial vector is the vector product of vectors. contradictory. The axiom of choice is extensively
BSE-3 employed in classical mathematics. Thus, it is used in
AMS 1980 Subject Classification: 51 NXX, 15A72 the following theorems. I) Each subgroup of a free
group is free; 2) the algebraic closure of an algebraic
AXIOM - A fundamental assumption, a self-evident field exists and is unique up to an isomorphism; and 3)
principle. In deductive scientific theories the axioms are each vector space has a basis. It is also used in: 4) the
the fundamental primitive assumptions of the given equivalence of the two definitions of continuity of a
theory from which its remaining contents is derived by function at a point (the t:-8-definition and the defini-
deduction, i.e. by purely logical procedures. Cf. tion by limits of sequences) and in proving 5) the
Axiomatic method. countable additivity of the Lebesgue measure. The last
P. S. N ovikov two theorems follow from the countable axiom of
AMS 1980 Subject Classification: 03A05 choice (the formulation of the axiom includes the
condition of countability of the family F). It was
AXIOM OF CHOICE - One of the axioms in set proved that the theorems I) to 5) are not deducible in
theory. It states that for any family F of non-empty the system ZF if ZF is non-contradictory.
sets there exists a function f such that, for any set S A model of set theory has been constructed which
from F, one has f(S)ES if is called a choice function meets the countable axiom of choice and in which each
on F). For finite families F the axiom of choice can be set of numbers is Lebesgue-measurable. This model was
deduced from the other axioms of set theory (e.g. in the constructed on the assumption that the system ZF does
system ZF). not contradict the axiom of the existence of an inacces-
The axiom of choice was explicitly formulated by E. sible cardinal number.
Zermelo (1904) and was objected to by many mathema- References
ticians. This is e.\.plained, first, by its purely existential [I) FRAENKEL, A.A., BAR-HILLEL, Y. and LEVY, A.: Foundations of
set theory, North-Holland, 1973.
character which makes it different from the remaining [2) JECH, TJ.: Lectures in set theory: with particular emphasis on
axioms of set theory and, secondly, by some of its the method of forcing, Springer, 1971.
implications which are 'unacceptable' or even contrad- [3) JECH, TJ.: The axiom of choice, North-Holland, 1973.
ict intuitive 'common sense'. Thus, the axiom of choice v.N. Grishin
implies: the existence of a Lebesgue non-measurable set Editorial comments.
of real numbers; the existence of three subdivisions of References
the sphere B [A 1] JECH, TJ.: Settheory, Acad. Press, 1978.
B = Uj U ... U Un,
AMS 1980 Subject Classification: 03E25, 04A25
B = Vj U ... U Vm ,
B = X U '" U Xn +m'
j AXIOM OF EXTENSIONALITY - One of the axioms
such that Vi is congruent with Xi, I~i~n, and Vj is of set theory, asserting that two sets are equal if they
congruent with Xn +j, I ~j ~m. Thus, the sphere B is contain the same elements:
divisible into a finite number of parts XI, ... , Xn +m \tu \tv (\tx (x EU=X EV) => U =v).

314
AXIOMATIC METHOD

In a language not containing the equality symbol and An example of a scheme of non-logical axioms is the
having only one predicate symbol E, the axiom of following variant of the induction scheme in the tradi-
extensionality has the form tional axiomatizations of arithmetic:
Vu Vv (\fx (x EU~X EV) ~ Vz (u EZ~V EZ. (a(O)&V'x (a(x):,) a(x') :,) Vx a(x),

The axiom of extensionality has no real importace where a and x are assumed not to belong to the alpha-
for the formalization of mathematics in the bet of the language of the formalization considered and
Zermelo- Fraenkel system ZF. Anything that can be are interpreted, respectively, as an arbitrary formula
constructed within the system ZF can be formalized in and an arbitrary variable of this formalization.
a system without the axiom of extensionality. Let ZF- The use of an axiom scheme usually permits one to
be the system obtained from ZF by removing the dispense with the rule of substitution when constructing
axiom of extensionality and by replacing formulas of formal theories. Thus, in any sufficiently strong propo-
the form u = v in the remaining axioms by the formula sitional calculus with two derivation rules - the rule of
Vx (XEU '> XEV).
substitution and the rule of conclusion - it is possible
to conduct proofs by substituting into axioms only,
Then it can be shown that there exists an interpretation which makes it possible to modify the calculus in an
of ZF in ZF-. A similar assertion is valid for the equivalent manner by replacing each axiom by the
theory of types. respective axiom scheme, and removing the rule of sub-
For Quine's system NF, obtained from the theory of sitution as a derivation rule.
types by the 'erasure' of the type indices, the situation
is different: It is not possible to interpret NF in NP- . References
[I] KLEENE, S.c.: Introduction to metamathematics, North-Holland,
The system NF- (NF without the axiom of exten- 1951.
sionality) is a rather weak system, and its consistency [2] CHURCH, A.: Introduction to mathematical logic, I, Princeton
can be proved in formal arithmetic. The sytem NF, Univ. Press, 1956.
F.A. Kabakov
however, is not weaker than the theory of types with
AMS 1980 Subject Classification: 038XX, 03FXX
the axiom of infinity.
References AXIOMATIC METHOD - A way of arriving at a
[I] BARWISE,J. (ED.): Handbook of mathematical logic. North-
Holland, 1977. scientific theory in which certain primitive assumptions,
[2] BOFFA, M.: 'The consistency problem for NF', J. Symbolic the so-called axioms (d. Axiom), are postulated as the
Logic 42, no. 2 (1977),215-220. basis of the theory, while the remaining propositions of
VN. Grishin
the theory are obtained as logical consequences of these
Editorial comments.
axioms.
References In mathematics, the axiomatic method originated in
[A1] ScOTI, D.S.: 'More on the axiom of extensionality', in Y.
Bar-Hillel, E.I.J. Poznanski and M.O. Rabin, et al. (eds.): the works of the ancient Greeks on geometry. The most
Essays on the foundation of mathematics, North-Holland, brilliant example of the application of the axiomatic
1962. method - which remained unique up to the nineteenth
AMS 1980 Subject Classification: 03E30, 03E65 century - was the geometric system known as Euclid's
Elements (ca. 300 B.c.). At the time the problem of the
AXIOM SCHEME - A unified manner of formulating description of the logical tools employed to derive the
axioms (cf. Axiom) with the same syntactic structure. A consequences of an axiom had not yet been posed, but
specific axiom scheme is usually realized by an expres- the Euclidean system was a very clear attempt to
sion ~ (which is most often formulated in a language obtain all the basic statements of geometry by pure
different from that in which the axioms are written), derivation based on a relatively small number of postu-
which fixes the syntactic structure of the scheme, and lates - axioms - whose truth was considered to be
by using rules which make it possible to obtain any self-evident.
axiom of the initial structure starting from ~. The discovery of a non-Euclidean geometry by N.I.
In the context of previously formulated or unambigu- LobachevskiI and J. Bolyai at the beginning of the
ously understandable rules for generating axioms using nineteenth century stimulated further development of
an expression ~, an axiom scheme is usually referred to the axiomatic method. They showed that if the tradi-
as the self-expression of ~. For instance, one speaks of tional, and apparently the only 'objectively true', fifth
the axiom scheme a~(f3~a) of propositional calculus postulate of Euclid concerning parallel lines is replaced
P when one means the set of axioms of the form by its negation, then it is possible to develop in a
A ~(B~A), where A and B are arbitrary formulas in purely logical manner a geometry which is just as
P. elegant and meaningful as is Euclidean geometry. The

315
AXIOMATIC METHOD

attention of mathematicians of the nineteenth century axioms Aj in T, is interpreted in T 1 by some proposi-


was thus drawn to the deductive manner of construct- tion A, which is deducible in T 1 from the interpreta-
ing mathematical theories; this in turn gave rise to a tions A; of the axioms Aj , and is therefore true. This
new problem, connected with the concept of the last statement is based on an additional implicit
axiomatic method itself and with the formal (axiomatic) assumption to the effect that the logical tools of the
mathematical theory. With the gradually increasing theories T and T 1 are to some extent similar, in prac-
number of mathematical theories which had been tice this condition is usually satisfied. (In the initial
axiomatically derived - one can, in particular, mention applications of the method of interpretation, this
the axiomatic derivation of elementary geometry by M. assumption was not even discussed, since it was
Pasch, G. Peano and D. Hilbert which, unlike Euclid's assumed to be self-evident; in fact, during the first
Elements is logically unobjectionable, and Peano's first attempts at demonstrating a relative consistency
attempt at the axiomatization of arithmetic - the con- theorem, the logical tools of T and T 1 were simply
cept of a formal axiomatic system became more identical; this was the classical theory of predicates.)
rigorous (see below), resulting in a class of specific Now let the theory T be inconsistent, i.e. let both some
problems which eventually established proof theory as statement A of this theory and its negation be derivable
one of the main chapters of modern mathematical in it. It then follows from the above that the proposi-
logic. tions A and non-A will both be true statements of
It was recognized as early as the nineteenth century T 1, i.e. T 1 is inconsistent. This was the method
that foundations must be created for mathematics and employed by F. Klein and H. Poincare to show that
for the relevant mathematical problems. It was mainly LobachevskiI's non-Euclidean geometry is consistent if
in analysis that the basic concepts were rendered more Euclid's geometry is consistent; also, the problem of the
precise, and that more complex ideas were reduced to consistency of Hilbert's axiomatization of Euclidean
simpler concepts by methods involving increasingly geometry was reduced by Hilbert to the problem of the
rigorous logical reasoning (the (-() language of A.L. consistency of arithmetic. The method of interpretation
Cauchy, the function-theoretic concepts of B. Bolzano can also be used to solve the problem of independency
and K. Weierstrass, and the continuum of G. Cantor of a systems of axioms: In order to prove that an
and R. Dedekind). Also, the discovery of non- axiom A of a theory T is independent of the other
Euclidean geometries stimulated the development of the axioms of that theory, i.e. cannot be deduced from
axiomatic method, the development of new ideas, and them, and that its inclusion is therefore essential in
the postulation of more general mathematical problems, order to obtain the complete theory, it is sufficient to
mainly those connected with concepts of an arbitrary construct an interpretation of T in which A is false,
axiomatic theory, such as consistency, completeness while all the other axioms are true. Another form of
and independence of a given axiom system. The first this manner of proving the independence of A is to
results in this direction were obtained by the method of establish the consistency of the theory obtained by
interpretation, which may be described as follows. Let replacing A in T by its negation. The result of the
some specific mathematical object be assigned to each above-mentioned reduction of the problem of the con-
primitive concept and to every relation of a given sistency of Lobachevskll's geometry to the problem of
axiomatic theory T. The population of such objects is the consistency of Euclidean geometry, and of reducing
called the field of interpretation. It is now natural to the problem of the consistency of Euclidean geometry
assign to each proposition IJr of T some sentence 1Jr' to the problem of the consistency of arithmetic, is the
about the elements of the field of interpretation, which conclusion that Euclid's fifth postulate is not derivable
may be true or false. One then says that the proposi- from the remaining geometrical axioms, provided that
tion IJr of the theory T is true or false, respectively, in the arithmetic of positive integers is consistent. The
this interpretation. The field of interpretation and its weak point of the method of interpretation is the fact
properties are usually themselves the object of study of that, as far as problems of consistency and indepen-
some, usually different, mathematical theory T j, which dence of axiom systems are concerned, it inevitably
can also be axiomatic. The method of interpretation yields results of merely a relative character. However,
can be used to establish relative consistency - i.e. to its important achievement is the fact that it served to
demonstrate statements of the type 'if a theory T j is reveal that arithmetic plays a special role, in that it is a
consistent, so is the theory T' in the following manner. mathematical theory to whose consistency the problem
Assume that the theory T has been interpreted in the of consistency of several other theories is reduced.
theory T j so that all axioms Ai of T are interpreted by The method of the so-called formalism of the founda-
true statements A; of T j. All theorems of T, i.e. any tions of mathematics, due to Hilbert and his school,
proposition A which is logically deducible from the was a further step (and, in a sense, a peak) in the

316
AXIOMATIC METHOD

development of the method. It rendered the concept of tion in S is any finite sequence of formulas in that sys-
an axiomatic theory more precise by introducing the tem in which each formula is either an axiom of S or
notion of a fonnal system as the next stage in the follows directly from some formulas which precede it in
development of the axiomatic method. As a result, it the sequence in accordance with one of the derivation
proved possible to treat the mathematical theories laws R; of S. A formula of S is called a theorem of this
themselves as precise mathematical objects and to con- system if there exists a derivation in S ending in this
struct a general theory of such theories - a so-called formula.
meta-theory. The possibility of solving all the main Any specific mathematical theory T may be
problems in the foundations of mathematics in this way translated into the language of a suitable formal system
appeared very attractive, and Hilbert himself was S so that any meaningful (true or false) statement of T
tempted to follow this path. The principal concept in can be expressed by one of the formulas of S.
this approach was that of a formal system. Any formal It can naturally be expected that this method of for-
system is constructed as a perfectly definite class of malization would make it possible to construct all the
expressions - that is, formulas from which a subclass meaningful elements of any mathematical theory on the
of formulas, known as the theorems of the formal sys- precise and apparently reliable basis represented by the
tem, is derived in a certain well-defined manner. The concept of a derivable formula (a theorem of the formal
formulas of the formal system are not directly meaning- system), and to solve fundamental problems such as the
ful, and in general they may be constructed from signs problem of the consistency of the mathematical theory
or symbols merely chosen for reasons of technical con- by proving the corresponding statements about the for-
venience. In fact, the way of constructing formulas and mal system that formalizes this theory. Since the formal
the concept of a theorem of some formal system are so systems of the type just described are exact, or 'finitis-
chosen that the resulting formal apparatus can be used tic' , using the term used by the school of Hilbert,
for the most adequate expression of some mathematical mathematical objects, it could be expected that it would
(or even non-mathematical) theory, more precisely, as be possible to obtain jinitistic proofs of consistency
an expression of both its factual contents and its statements, i.e. proofs which would in a certain sense
deductive structure. The general procedure for con- be effective, that is, independent of such powerful tools
structing an arbitrary formal system S is as follows. as, for example, the abstraction of actual infmity (which
I. The language of the system S. is one of the reasons for the difficulties encountered in
a) The alphabet is a list of elementary symbols of the the foundations of classical mathematical theories).
system. Thus, the requirement that the tools employed in arriv-
ing at results concerning formal systems, and in partic-
b) The construction (syntactic) rules are rules accord-
ular concerning their consistency, be finitistic,
ing to which the formulas of S are constructed from the
represented a regular feature of Hilbert's formalistic
elementary symbols. A sequence of elementary sym-
program. However, the results obtained by K. GOdel in
bols is a taken as a formula if and only if it can be
the early nineteen-thirties frustrated the main expecta-
constructed by means of syntactic rules.
tions based on this program. Godel showed that:
II. The axioms of the system S. A certain set of for-
. 1) Any natural consistent formalization of arithmetic
mulas (usually a finite or countable set), said to be the
or of any other mathematical theory which involves
axioms of the system S, is specified.
arithmetic (e.g. set theory) is incomplete and cannot be
III. The derivation rules of the system S (or deduction
completed in the sense that a) S contains (true with
rules). A (usually finite) population of predicates
respect to their contents) unsolvable formulas, i.e. for-
R;, ... ,Rk is defined on the set of all formulas of S.
mulas A such that neithtr A nor non-A are derivable in
Let Rj(x I, . . . ,xn, + I ) be one of these predicates
S (the incompleteness of formal arithmetic); b) whatever
(nj>O); if, for given formulas F I , ... ,Fn,+I, the
the finite set of supplementary axioms (e.g. of unsolv-
statement R;(FI' ... ,Fn,+d is true, then one says that able formulas in S) which are introduced to enlarge S,
the formula Fn, + I follows directly from the formulas the new, stronger, system thus produced inevitably con-
F 1> ,Pn, by the rule R;. tains its own unsolvable formulas (incompleteness; see
Specifying I, II and III is the same as specifying the [5] and GOdeI incompleteness theorem).
formal system S as a precise mathematical object, since 2) If formal arithmetic is in fact consistent then,
the concept of a theorem or a derivable formula of S is while the statement of its consistency is expressible in
formed for all formal systems in a uniform manner (the its own language, it is impossible to prove this state-
degree of accuracy depending on the level of accuracy ment by the methods of formalized arithmetic itself.
of the alphabet, of the syntactic rules and of the deriva- This means that, even in the case of arithmetic, it is
tion laws, i.e. of the predicates R 1> ,Rk ). A deriva- intrinsically impossible to exhaust all its contents-wise

317
AXIOMATIC METHOD

true statements by a class of derivable formulas of any AxIOMATIC SEI' 1HEORY - The branch of
formal system, and that there is no hope of obtaining mathematical logic in which one deals with fragments
any finitistic proof of the consistency of arithmetic, of the informal theory of sets by methods of mathemat-
since it seems that any reasonable precision of the con- ical logic. Usually, to this end, these fragments of set
cept of a finitistic proof is formalizable in formal arith- theory are formulated as a formal axiomatic theory. In
metic. a more narrow sense, the term 'axiomatic set theory'
All this imposes definite limitations on the possibili- may denote some axiomatic theory aiming at the con-
ties of the axiomatic method in the form it takes in the struction of some fragment of informal ('naive') set
framework of Hilbert's formalism. However, within theory.
these limits it continues to play an important role in Set theory, which was formulated around 1900, had
the foundations of mathematics. Thus, subsequent to to deal with several paradoxes from its very beginning.
the publication of GOdel's results, he himself (1938 - The discovery of the fundamental paradoxes of G. Can-
1940) and P. Cohen (1963), basing themselves on the tor and B. Russell (cf. Antinomy) gave rise to a
axiomatic approach with the use of the method of widespread discussion and brought about a fundamen-
interpretation, obtained fundamental results concerning tal revision of the foundations of mathematical logic.
the compatibility and mutual independence of the The axiomatic direction of set theory may be regarded
axiom of choice and the continuum hypothesis in set as an instrument for a more thorough study of the
theory [6], [7]. As regards consistency, which is a funda- resulting situation.
mental problem in the foundations of mathematics, it The construction of a formal axiomatic theory of sets
had become clear in view of GOdel's results that it can begins with an accurate description of the language in
be solved only with the aid of tools and ideas which are which the propositions are formulated. The next step is
not finitistic. Various approaches to this question to express the principles of 'naive' set theory in this
proved possible, not all of which are equally acceptable language, in the form of axioms and axiom schemes. A
to mathematicians, mainly because of the differences of brief description of the most widespread systems of
opinion as to the acceptability of various logical tools. axiomatic set theory is given below. In this context, an
Among the results obtained concerning the consistency important part is played by the language which con-
of formal systems, one can mention, first of all, the tains the following primitive symbols: 1) the variables
proof of the consistency of formal arithmetic [8], based x,y, z, U, V, Xl> , which play the part of common
on infinite induction up to a certain countable transfin- names for the sets in the language; 2) the predicate
ite number. Another, more recent, example of this kind symbols E (sign of incidence) and = (sign of equality);
is the attempt to use certain ideas of intuitionism for 3) the description operator L, which means 'an object
proving the consistency of the formal system of analysis such that ... '; 4) the logical connectives and quantif-
[9]. iers: B (equivalent), -+ (implies), V (or), /\ (and), -,
References (not), 'fI (for all), 3 (there exists); and 5) the
[1] HEATH, T.L.: The thirteen books Euclid's elements, 11-13, parentheses ( and ). The expressions of a language are
Dover, reprint, 1956. (Translated from the Greek). grouped into terms and formulas. The terms are the
[2] KAGAN, V.F.: The foundations of geometry, 1, Moscow-
Leningrad, 1949 (in Russian). names of the sets, while the formulas express proposi-
[3] HILBERT, D.: Grundlagen der Geometrie, Springer, 1913. tions. Terms and formulas are generated in accordance
[4] llEANo, G.: 'Principii di logica matematica', Rivista Matema- with the following rules.
tika 1 (1891), 1-10.
[5] GoDEL, K.: 'Ueber formal unentscheidbare Satze der Principia
RI. If T and 0 are variables or terms, then (TEO) and
Mathematica und verwandter Systeme 1', Monatsh. Math. Phy- (T=O) are formulas.
sik 38 (1931), 178-198. Rl. If A and B are formulas and x is a variable, then
[6] GoDEL, K.: The consistency of the axiom of choice and of the
(A BB), (A -+B), (A V B), (A I\B), -,A, 'fI xA, 3 xA are
generalized continuum hypothesis with the axioms of set theory,
Princeton Univ. Press, 1940. formulas and LXA is a term; the variable x is a term.
[7] CoHEN, PJ.: Set theory and the continuum hypothesis, Benja- For instance, the formula 'fIX(XEy-+XEZ) is tan-
min, 1966. tamount to the statement y is a subset of z', and can
[8] GENTZEN, G.: 'Die Widerspruchsfreiheit der reinen
Zahlentheorie', Math. Ann. 112 (1936),493-565. be written as y C;:z; the term LW'fIy(y EWBY c;:z) is the
[9] SPECTOR, C.: 'Provable recursive functionals of analysis', in name of the set of all subsets z and, expressed in con-
Recursive function theory, Amer. Math. Soc., 1962, pp. 1-27. ventional mathematical symbols, this is Pz. Let the
P.S. Novikov symbol <9 mean 'the left-hand side is a notation for the
Editorial comments. right-hand side'. Below a number of additional nota-
References tions for formulas and terms will be presented.
[A1] KLEENE, S.C.: Introduction to metamathematics, North- The empty set:
Holland & Noordhoff, 1959. o # u'Vy...,yex.
AMS 1980 Subject Classification: 03A05, 00A25 The set of all x such that A (x):

318
AXIOMATIC SET THEORY

{x: A(x)} ~ LZ'VX(XEZ _A(x, axiomatic system of this type was the system Z, due to
where z does not enter freely in A (x) (i.e. IS not a E. Zermelo (1908). However, this system does not allow
parameter of the formula A (x. a natural formalization of certain branches of
The unordered pair x and y: mathematics, and the supplementation of Z by a new
principle - the axiom of replacement - was proposed
{x,y} ~ {z: Z =x V z=y}. by A. Fraenkel in 1922. The resulting system is known
The single-element set consisting of x: as the Zermelo- Fraenkel system and is denoted by ZF.
{x} ~ {x,x}. b) The second group is constituted by systems the
axioms of which are selected in the context of giving
The ordered pair x and y:
some explanations for paradoxes, for example, as a
<x,y> ~ {{x}, {x,y}}. consequence of non-predicative definitions. The group
The union of x and y: includes Russell's ramified theory of types, the simple
theory of T-types, and the theory of types with
x Uy ~ {z: ZEX V zEy}.
transfinite indices (cf. Types, theory of).
The intersection of x and y: c) The third group is characterized by the use of
x ny ~ {z: ZEX I\ZEy}. non-standard means of logical deduction, multi-valued
The union of all elements of x: logic, complementary conditions of proofs and infinite
derivation laws. Systems in this group have been
Ux ~ {z: 3V(ZEV VVEX)}.
developed to the least extent.
The Cartesian product of x and y: d) The fourth group includes modifications of sys-
x Xy ~ {z: 3uv(z = <u, v> 1\ UEX 1\ v Ey)}.
tems belonging to the first three groups and is aimed at
attaining certain logical and mathematical objectives.
Notation for: w is a function: Only the system NBG of Neumann - Gooel- Bemays
Fnc(w) ~ 3v(w C;v Xv) 1\ (1925) and the system NF of W. Quine (1937) will be
1\ 'VUVI V2U, VI > EW 1\ <U, V2>EW ~ VI =V2)' mentioned here. The construction of the system NBG
was motivated by the desire to have a finite number of
The values of the function w on the element x:
axioms of set theory, based on the system ZF. The sys-
W'X ~ o/<X,y>EW. tem NF represents an attempt to overcome the stratifi-
The standard infinite set z cation of the concepts in the theory of types.
The systems Z, ZF and NF can be formulated in the
Inf(z) ~ 0 EZ 1\ 'VU(UEZ ~u U {U}EZ).
language described above. The derivation rules, and
The axiomatic theory A that follows is the most com- also the so-called logical axioms, of these systems are
plete representation of the principles of 'naive' set identical, and form an applied predicate calculus of the
theory. The axioms of A are: first order with equality and with a description opera-
AI. Axiom of extensionality: tor. Here are the axioms of equality and of the descrip-
'VX(X EY_X EZ) ~ y=z tion operator:
('if the sets x and y contain the same elements, they are x = x, x =y ~ (A(x)~A(y,
equal'); where A (x) is a formula not containing the bound vari-
A2. Axiom scheme of comprehension: able y (i.e. it has no constituents of the type '1y, 3y, ry),
3y 'Vx(x E y_A), while A (y) is obtained from the formula A (x) by
replacing certain free entries of the variable x with y:
where A is an arbitrary formula not containing y as a
parameter ('there exists a set y containing only elements 3!xA(x) ~ A (tXA(x,
x for which A'). where the quantifier 3!x means that 'there exists one
This system is self-contradictory. If, in A2, the for- and one only x', while the formula A(tXA(x is
mula -,x EX IS taken as A, the formula obtained from the formula A (x) by replacing all free
'1X(XEJ~XEX) readily yields yEy~Ey, which is
entries of the variable x with the term tXA (x). The
a contradiction. quantifier 3!x can be expressed in terms of the quantif-
The axiomatic systems of set theory may be subdi- iers 'v' and 3 and equality.
vided into the following four groups. Non-logical axioms of the system Z:
a) The construction of axiomatic systems in the first ZI.The axiom of extensionality AI.
group is intended to restrict the comprehension axioms Z2. The pair axiom:
so as to obtain the most natural means of formalization
3U'VZ(ZEU _ z=x V z=y)
of conventional mathematical proofs and, at the same
time, to avoid the familiar paradoxes. The first ('the set {x, y} exists');

319
AXIOMATIC SET THEORY

Z3. The union axiom: formula A (x) can be used to form a class, the infinite
3yV'X(XEy ~ 3t(tEz I\XEt number of ZFaxioms can be replaced by a finite
number of axioms containing a class variable. The
('the set U z exists');
axiom of choice has the form:
1.4. The power set axiom:
3X(Fnc(X) 1\ \tx(-,x = 0 ~ X' X EX
3y V'x(xEJ ~ x ~z)
('the set pz exists'); and confirms the existence of a selection function,
Z5. The separation axiom scheme: which is unique for all sets and which constitutes a
3yV'x(xEJ ~ XEZ 1\ A (x class.
The system NF has a simpler axiomatic form, viz.: 1)
('there exists a subset Z consisting of the elements x in z
the axiom of extensionality; and 2) the axioms of
for which A (x) is true'); the axioms Z2 - Z5 are exam-
comprehension in which a formula A can be stratified,
ples of axioms of comprehension;
i.e. it is possible to assign to all variables of the for-
Z6. The axiom of infinity:
mula A superscript indices so as to obtain a formula of
3z Inf(z); the theory of T-types, i.e. in the subformulas of type
Z7. The axiom of choice: x E y the index of x is one lower than the index of y.
V'z 3w(Fnc(w) 1\ V'x(x EZ 1\ oX = 0 ~ w'x EX The system NF has the following characteristics:
a) the axiom of choice and the generalized continuum
('for any set z there exists a function w which selects, hypothesis are disprovable;
out of each non-empty element x of the set z, a unique b) the axiom of infinity is demonstrable (d. Infinity,
element w'x'). The above axioms are complemented by axiom of);
the regularity axiom: c) the extensionality axiom plays a very important
ZS.
V'X(oX =0 ~ 3y(yEX I\y n X = 0, role. Thus, if the extensionality axiom is replaced by
the slightly weaker axiom:
which is intended to postulate that there are no des-
cending chains x2EX),x3EX2,X4EX3, .... Axiom ZS (3u(uEJ) 1\ \tU(UEJ~UEZ ~ y=z,
simplifies constructions in Z, and its introduction does which permits a large number of empty sets, while the
not result in contradictions. comprehension axioms of NF remain unchanged, a
The system Z is suitable for developing arithmetic, fairly weak theory is obtained: The consistency of the
analysis, functional analysis and for studying cardinal resulting system can be proved even in formal arith-
numbers smaller than ~w. However, if the alephs are metic.
defined in the usual manner, it is no longer possible to Results concerning the interrelationships between the
demonstrate the existence in Z of ~w and higher cardi- systems just described are given below.
nal numbers. a) Any formula of ZF is demonstrable in NBG if
The system ZF is obtained from Z by adding and only if it is demonstrable in ZF.
Fraenkel's replacement axiom scheme, which may be f3) In ZF it is possible to establish the consistency of
given in the form of the comprehension axiom scheme: Z, completed by any finite number of examples of the
ZF9.
3y \tx(x Ey ~ 3v(v EZ 1\ x =LtA (t, v) axiom scheme of replacement ZF9. Thus, ZF is much
stronger than Z.
('there exists a set y consisting of x, x = LtA (I, v), where
y) The consistency of T is demonstrable in Z, so that
v runs through all the elements of a set z'). In other
words, y is obtained from z if each element v of z is Z is stronger than T.
{) NF is not weaker than T in the sense that it is
replaced with LtA(/, v).
The system ZF is a very strong theory. All ordinary possible to develop the entire theory of types in NF.
mathematical theorems can be formalized in terms of The axiomatic approach to the theory of sets has
ZF. made it possible to state a proposition on the unsolva-
bility in principal (in an exact sense) of certain
The system NBG is obtained from ZF by adding a
mathematical problems and has made it possible to
new type of variables the class variables
demonstrate it rigorously. The general procedure for
X, Y, Z, . .. - and a finite number of axioms for
the utilization of the axiomatic method is as follows.
forming classes, by means of which it is possible to
Consider a formal axiomatic system S of the theory of
prove formulas of the type
sets (as a rule, this is ZF or one of its modifications)
3y\tX(XEY~A(x,
that is sufficiently universal to contain all the conven-
where A (x) is a formula of NBG which does not coun- tional proofs of classical mathematics, and for all ordi-
tain bound class variables or the symbol t. Since any nary mathematical facts to be deducible from it. A

320
AXIOMATIC SET THEORY

given problem A may be written down as a formula in choice and the continuum hypothesis have been added.
the language S. It is then established by mathematical It follows that it is impossible to disprove the axiom of
methods that neither A nor its negation can be deduced choice or the continuum hypothesis in ZF. In order to
in S. It follows that problem A cannot be solved (in prove this result, Godel constructed a model of the
either way) by tools of the theory S, but since this theory ZF consisting of the so-called GOdel construc-
theory S was assumed to contain all ordinary methods tive sets (cf. GOOel constructive set), this model plays an
of proof, the result means that A cannot be solved by important role in modern axiomatic set theory.
ordinary methods of conclusion, i.e. A is 'transcenden- 2) The problem as to whether or not the axiom of
tal'. choice or the continuum hypothesis is deducible in ZF
Results which state that a proof cannot be performed remained open until 1963, when it was shown by P.J.
in the theory S are usually obtained under the assump- Cohen, using his forcing method, that if ZF- is con-
tion that S, or some natural extension of S, is con- sistent, it will remain consistent after the addition of
sistent. This is because on the one hand, the problem any combination of the axiom of choice, the continuum
can be non-deducible in S only if S is consistent, but hypothesis or their negations. Thus, these two problems
such consistency cannot be established by the tools are independent in ZF.
offered by S (cf. GOOei incompleteness theorem), i.e. The principal method used for establishing that a
cannot be derived by ordinary methods. On the other formula A is not deducible in ZF is to construct a
hand, the consistency of S is usually a very likely model of ZF containing the negation of A. Cohen's
hypothesis; the very theory S is based on its truth. forcing method, which was subsequently improved by
Furthermore, the axiomatic approach to the theory other workers, strongly extended the possibilities of
of sets made it possible to accurately pose and solve constructing models of set theory, and now forms the
problems connected with effectiveness in the theory of basis of almost all subsequent results concerning non-
sets, which had been intensively studied during the ini- deducibility. For instance:
tial development of the theory by R. Baire, E. Borel, H. 3) It has been shown that one can add to ZF,
Lebesgue, S.N. Bernstein [S.N. Bernshtein], N.N. Luzin without obtaining (additional) inconsistencies, the
and W. Sierpiilski. It is said that an object in the hypothesis stating that the cardinality of the set of sub-
theory of sets which satisfies a property W is effectively sets of a set x may be an almost arbitrary pre-given
defined in the axiomatic theory S if it is possible to function of the cardinality of x on regular cardinals
construct a formula A (x) of S for which it can be (the only substantial restrictions are connected with
demonstrated in S that it is fulfilled for a unique Konig's theorem).
object, and that this object satisfies property W. 4) M.Y a. Suslin (1920) formulated the following
Because of this definition it is possible to show in a hypothesis. Any linearly totally ordered set such that
rigorous manner that for certain properties W in S it is any pairwise non-intersecting family of non-empty open
impossible to effectively specify an object which satis- intervals in it is 'at most countable must contain a
fies W, while the existence of these objects in S can be countable everywhere-dense subset. The non-
established. But since the chosen theory S is sufficiently deducibility of Suslin's hypothesis in ZF was esta-
universal, the fact that the existence of certain objects blished by Cohen's method.
in S is ineffective is also a proof of the fact that their 5) It was shown that the following postulate: 'Any
existence cannot be effectively established by ordinary subset of the set of real numbers is Lebesgue measur-
mathematical methods. able' is unsolvable in ZF- (without the axiom of
Finally, the methods of the axiomatic theory of sets choice).
make it possible to solve a number of difficult prob- 6) The interrelationship of many important problems
lems in classical branches of mathematics as well: in of descriptive set theory with ZF was clarified. The first
the theory of cardinal and ordinal numbers, in descrip- results relating to this problem were demonstrated by
tive set theory and in topology. P.S. Novikov [5]. The methods of axiomatic set theory
Some of the results obtained by the axiomatic theory made it possible to discover previously unknown con-
of sets are given below. Most of the theorems concern nections between the problems of 'naive' set theory. It
the axiomatic set theory of Zermelo- Fraenkel (ZF), was proved, for example, that the existence of a Lebes-
which is now the most frequently employed. Let ZF- gue non-measurable set of real numbers of the type ~1
be the system ZF without the axiom of choice Z7. In (i.e. A 2) implies the existence of an uncountable III
view of a), the results can be readily adapted to the (i.e. C s1) set without a perfect subset.
system NBG as well. 7) It was proved that an effectively totally ordered
1) It was shown in 1939 by K. GOdel that if ZF- is continuum is absent in ZF. Numerous results proved
consistent, it will remain consistent after the axiom of the absence of effectively defined objects in the descrip-

321
AXIOMATIC SET THEORY

tive theory of sets and m the theory of ordinal An elementary closed class K of models is called
numbers. complete if all its models are elementarily equivalent.
References Every axiomatized class of models is a sum of
[1) FIlAENKEL, A.A., BAR-HILLEL, Y. and LEVY, A.: Fowuiations of pairwise-disjoint complete classes. A class is said to be
set theory, North-Holland, 1973. categorical in cardinality m if all its models of cardinal-
(2) COHEN, P.J.: Set theory and the continuum hypothesis, Benja-
min, 1966.
ity m are isomorphic. A complete class of models of a
(3) JEeR, T.J.: Lectures in set theory: with particular emphasis on countable signature that is categorical in an uncount-
the method offorcing, Springer, 1971. able cardinality, is categorical in all uncountable cardi-
(4) DRAKE, F.R.: Set theory: an introduction to large cardinals,
North-Holland, 1974.
nalities, but may be non-categorical in a countable car-
(5) NOVIKOV, P.S.: 'On the consistency of certain propositions of dinality; in such a case the class has a countable
the descriptive theory of sets', Amer. Math. Soc. Transl. 29 number of pairwise non-isomorphic countable models.
(1963), 51-89. (Trudy Mat. Inst. Steklov. 38 (1951),279-316) For any n=l=2 there exists a complete axiomatized class
V.N. Grishin with exactly n non-isomorphic countable models.
A. G. Dragalin An axiomatized class K of models is called solvable if
Editorial comments. Gbdel's book [A4] contains his there exists an algorithm by which it is possible to tell,
proof of the statement in 1) above. for any closed formula of the language L, if it is true or
References false for each model in K. The following theorem
[Al] JOCR, T.J.: Set theory, Acad. Press, 1978. describes the interconnection of complete, categorical
[A2] lEMMON, EJ.: Introduction to axiomatic set theory, Routledge
and solvable classes: If K is categorical of infinite car-
& Kegan Paul, 1968.
[A3] TAKEUTI, G. and ZARING, W.M.: Introduction to axiomatic set dinality and has no finite model, then it is complete. A
theory, Springer, 1971. recursively-complete axiomatized class of models is
[A4] GOOEL, K.: The consistency of the continuum hypothesis, solvable.
Princeton Univ. Press. 1940.
Reductive and projective classes are generalizations
AMS 1980 Subject Classification: 03EXX, 04-XX ofaxiomatized clases. Projective classes are defined by a
second-order axiom in the form
AxIOMATIZED CLASS - A class of models of one
3T1 3Tn 2!(PJ, ... ,Ph Tlo ... ,Tn),
type, defined by an axiom system. A class K of models
of a formal language L is said to be axiomatized (fin- where Pi, Tj are predicate variables, and
itely axiomatized) if there exists a (finite) system ~ of 2l(Pj, ... ,Pk , Tj, ... ,Tn) is a formula of the signa-
closed formulas of L such that K contains those and ture o=(Pj, ... ,Pk> Tj, ... ,Tn). Many properties of
only those models on which all formulas of ~ are axiomatized classes can be applied to these classes.
defined and are true (cf. Algebraic system). A class of References
models of a recursive signature is said to be recurSively [11 MAL'TSEV, A.I.: Algebraic systems, Springer, 1973 (translated
from the Russian).
axiomatized if it can be specified by a recursive set of [2] MAL'TSEV, A.I.: Proc. 4-th All-Union Math. Congress 1961, Vol.
axioms. 1, Leningrad, 1963, pp. 169-198 (in Russian).
Many classes of algebraic systems studied in (3) COHN, P.M.: Universal algebra, Reidel, 1981.
mathematics are defined by a system of axioms of a A.D. Taimanov
first-order language. For instance, the cl~)ses of all Editorial comments. An axiomatized class is also called
Boolean algebras, all groups, all fields, and all lattices an axiomatic class. Elementary equivalent is also called
are finitely axiomatized. The classes of all torsion-free indiscernible.
groups, all fields of characteristic zero and all algebrai- AMS 1980 Subject Classification: 03CXX
cally closed fields are recursively axiomatized, but not
necessarily finitely axiomatized. The theory of axioma- AxONOMETRY - One of the methods of mapping
tized classes reveals regularities common to all classes three-dimensional figures into a plane. An axonometry
of objects defined by a specific language; it has been consists in projecting the figure, after an orthogonal
well developed for first-order languages, and therefore Cartesian coordinate system and a coordinate plane
only such classes and formulas will be dealt with in into which the figure will be projected have been
what follows. chosen, on the chosen plane for the diagram (figure).
Two models are said to be elementarily equivalent if Depending on the mode of the projection, the axo-
any formula of the first-order language which is true in nometry may be parallel or central. If the direction of
one of them is also true in the other. A model we is the parallel projection into the chosen plane is perpen-
said to be an elementary extension of a model ~ if any dicular to this plane, the axonometry is said to be nor-
formula which is defined and is true in ~ is also true in mal or orthogonal; otherwise it is said to be skew. The
Wl. parameters of an orthogonal axonometry are the

322
AXONOMETRY

cosines of the angles between the coordinate axes and theorem: Any tetrahedron can be projected to an arbi-
the chosen plane, and are sometimes called distortion trarily given plane tetragon.
indices. If two distortion indices are equal, the axo-
References
nometry is called a dimetry, and if three distortion [I] GLAZUNov, E.A. and CmrrvERUKHIN, N.F.: Axonometry, Mos-
indices are equal, an isometry; if all distortion indices cow, 1953 (in Russian). AB I
are different, it is called a trimetry. The principal .. vanov
theorem on axonometries is the Pohlke- Schwartz AMS 1980 Subject Classification: 51 M15

323
B
/l-DISTRIBUTION - See Beta-distribution. BAER MULTIPLICATION - A binary operation on
AMS 1980 Subject Classification: 62EXX the set of classes of extensions of modules, proposed by
R. Baer [1]. Let A and B be arbitrary modules. An
(B, <p)-STRUCTURE - A structure on a vector bun- extension of A with kernel B is an exact sequence:
dle (or sphere bundle, etc.) that is a generalization of o --4 B --4 X --4 A --4 O. (I)
the concept of the structure group of a fibration.
The extension (1) is called equivalent to the extension
Let CPn: Bn~BOn be a fibration and let ~ be an n-
dimensional vector bundle over a space X, classified by 0--4 B --4 XI --4 A --40

the mapping ~: X ~BOn' Then the homotopy class lift- if there exists a homomorphism a: X~Xl forming part
ing the mapping ~: X ~BOn to a mapping in Bn is of the commutative diagram
called a (Bn' cpn)-structure on ~, i.e. it is an equivalence
class of mappings r: X ~Bn such that CPn or =~, where
two mappings rand r': X~Bn are said to be
equivalent if they are fibrewise homotopic. No method
of consistently defining (Bn' <pn)-structures for
equivalent fibrations exists, because this consistency
depends on the choice of the equivalence. The set of equivalence classes of extensions is denoted
Let there be a sequence (B, cP, g) of fibrations by Ext(A, B). The Baer multiplication on Ext(A, B) is
CPr: Br~BOr and mappings gr: Br~Br+l such that induced by the operation of products of extensions
jrocpr=CPr+logr (jr:BOr~BOr+l is the standard map- defined as follows. Let
ping). The family {B" cP" gr} (and sometimes only f3 a

(B" CPr is called a structure series. An equivalence class o --4 B --4 X --4 A --40, (2)
of sequences of (B" CPr )-structures on the normal bundle
(3)
{~r} of a manifold M n is called a (B, cP )-structure on M;
they coincide beginning from some sufficiently large r. be two extensions. In the direct sum XEB Y the submo-
A manifold M n with a fixed (B, cp)-structure on it is dules
c= {(x,y): a(x)=al(Y)}
called a (B, cp)-manifold.
Instead of BOn, a more general space BGn, classify- and
D = {(-x,y): x=/3(b),y=/3I(b)}
ing sphere bundles, can be considered and (B, cp)-
are selected. Clearly, DeC, so that one can define the
structures can be introduced on them.
quotient module Z = C / D. The Baer product of the
References extensions (2) and (3) is the extension
[1] LASHOF, R.: 'Poincare duality and cobordism', Trans. Amer.
Math. Soc. 109 (1963), 257-277.
[2] STONG, R.E.: Notes on cobordism theory, Princeton Univ. Press,
1968. where
Yu.B. Rudyak /32(b) = [/3(b), O] = [O,/3'(b)],
Editorial comments. Here and
BOn = lim infGrasn(R'+n)
r~oo

References
is the limit of the Grassmann manifolds of r-planes in R n + r [I] BAER, R.: 'Erweiterung von Gruppen und ihren lsomorphis-
AMS 1980 Subject Classification: 55R25, 55R35 men', Math. Z. 38 (1934).374-416.

325
BAER MULTIPLICATION

[2) CARTAN, H. and ElLENBERG, S.: Homological algebra, Prince- References


ton Univ. Press, 1956. [1] BAlRE, R.: Lefons sur les fonctions discontinues, professees au
VE. Govorov college de France, Gauthier-Villars, 1905.
[2] HAUSDORFF, F.: Set theory, Chelsea, reprint, 1978.
AMS 1980 Subject Classification: 18G 15 [3] NATANSON, I.P.: Theorie der Funktionen einer reellen
Veriinderlichen, H. Deutsch, Frankfurt a.M., 1961 (translated
from the Russian).
[4) GAGAEFF, B.M.: 'Sur les suites convergentes de fonctions
BAlKE CLASSES - Families of real functions which
mesurables B', Fund. Math. 18 (1932), 182-188.
are defined inductively using the ordinal number of
I.A. Vinogradova
limit transitions involved in the definition of the func-
tion, and which constitute the classification of functions Editorial comments. A subset of a topological space is
called dense-in-itse/f if it has no relatively-isolated points. A
proposed in 1899 by R. Baire [1], known as the Baire
modern English reference for the notion of Baire classes is
classification. The set of all continuous functions
[A 1]. For an introduction to the topic and some applications,
f : A ~R, where A is a metric space, is called the zero-
cf. [A2].
th Baire class H o. The first Baire class HI is the set of
discontinuous functions f: A ~R which are the limit of References
[A1] Rooy, A.C.M. VAN and SCHlKHOF, W.H.: A second course
a sequence of continuous functions converging at each on real functions, Cambridge Univ. Press, 1982.
point. The Baire class H a' where a is an ordinal [A2] BOAS, R.P., JR.: A primer of real functions, Math. Assoc.
number of the first or second class, is defined as the set Amer., 1981.
of functions f: A ~R which do not form part of any of AMS 1980 Subject Classification: 26A21 , 54C20,
the preceding classes, but which may be represented as 04A15
f=limn~ooJ,., where J,. EH P.' f3n<a. The union of the
Baire classes H a over all ordinal numbers of the first BAIRE PROPERTY of a set A in a topological space -
and second class constitutes the set of Baire functions. A property analogous to the measurability of a set. A
This is the minimal class of functions f :A ~R contain- set A has the Baire property if there exists an open set
ing all the continuous functions and closed with respect G such that the differences A \ G and G \ A are sets of
to pointwise convergence. A linear combination, pro- the first category according to Baire (d. Category of a
duct and quotient (for non-zero denominators) of func- set); the term 'open' may be replaced by 'closed'.
tions of a Baire class not higher than a is a function in There are other equivalent definitions; thus, for
a Baire class not higher than a. A uniformly convergent instance, a set has the Baire property if it is the union
sequence of functions of a Baire class not higher than IX of a set of type Go and a set of the first category. The
has as limit a function in a Baire class not higher than class of sets with the Baire property is closed under the
IX. Necessary and sufficient conditions have been esta- operations of complementing, taking countable unions
blished for a sequence of functions in a Baire class not and taking countable intersections. For an example of
higher than IX to converge to a function in a Baire class a set which does not have the Baire property, see [1].
not higher than IX [4]. The union of all dense-in-itself
References
sets MeA is called the kernel of the topological space A. [I) KURATOWSKl, K.: Topology, Acad. Press, 1966-1968 (translated
If A is a complete space with a non-empty dense-in- from the French). Paragraph 40.
VA. Skvortsov
itself kernel, none of the Baire classes is emr)y [2]. The
set of Baire functions coincides with the set of Borel- Editorial comments. A set with the' Baire property is
measurable functions (cf. Borel measure), and for this often called a Baire set or an almost-open set. A set of the
reason they are all Lebesgue-measurable (cf. Lebesgue first category is often called meager.
measure). A Lebesgue-measurable function g: A ~ R is
References
equivalent to a Baire function of a Baire class not [A 1] CECH. E. Topological spaces, Wiley, 1966.
higher than two [3]. Baire, who considered functions
defined on Rn (mainly on R), made the most detailed . AMS 1980 Subject Classification: 54C32, 54C50
study of functions of the first class. He showed that a
necessary and sufficient condition for a discontinuous BAIRE SET in a locally compact Hausdorff space X -
function to belong to the first class is the existence of a A set belonging to the a-ring generated by the class of
point of continuity of the induced function on each all compact sets in X that are Go-sets. A Baire set
perfect set (Baire's theorem). This theorem is applicable serves to define the concept of a Baire-measurable func-
to a function f: A ~R if A has the Baire .property [2]. tion. In all classical particular cases in which measure
The concept of a Baire function can be naturally gen- theory is developed in topological spaces, e.g. in
eralized to functions cp: A ~ Y, where Y is an arbitrary Euclidean spaces, the concept of a Baire set coincides
metric space. with that of a Borel set.

326
BALANCED SET

References Editorial comments. A function in the first Baire class is


[I) HALMOS, P.: Measure theory, v. Nostrand, 1950. also called a Baire function.
VA. Skvortsov References
AMS 1980 Subject Classification: 28A05, 54C50 [A1] RUDIN, W.: Principles of mathematical analysis, McGraw-Hili,
1964.
BAIRE SPACE - 1) Any space in which the Baire AMS 1980 Subject Classification: 54E32, 54C50,
theorem on complete spaces is valid. 26A21
2) The metric space the points of which are infinite
sequences {ni} = { n 1 , n 2, . . . } of natural numbers, and BALANCED MODULE - A module M such that the
the distance is given by the formula: natural ring homomorphism cp: R ...... EndEndRM M, where
I
M is regarded as a right module over EndR M, defined
p( {n,}, {m,}) = k;' by cp(r)(m)=mr for any rER and mEM, is surjective.
A module P over a ring R is a generator of the category
where ko is the first natural number k for which
of R-modules if and only if P is balanced as an R-
nk=l=mk' This is a complete metric separable zero-
module, projective and finitely generated as an
dimensional space containing the topological image of
EndR P-module.
any metric separable zero-dimensional space.
P. S. A leksandrov References
[I] FAITH, c.: Algebra: rings, modules and categories, 1-2, Springer,
AMS 1980 Subject Classification: 54E32 1973-1976.
L.A. Skornyakov
BAIRE THEOREM - 1) Baire's theorem on complete AMS 1980 Subject Classification: 16A53
spaces: Any countable system of open and everywhere-
dense sets in a given complete metric space has a non- BALANCED RING, on the left (right) - A ring over
empty, and even an everywhere-dense, intersection in which all left (right) modules are balanced. A ring is
this space. An equivalent formulation is the following: balanced on the left if and only if all its quotient rings
A non-empty complete metric space cannot be are QF-l-rings, that is if all the exact left modules over
represented as a countable sum of its nowhere-dense it are balanced. In particular, a ring is balanced if all
subsets. Stated by R. Baire [I]. its quotient rings are quasi-Frobenius. Every balanced
ring can be split into a direct sum of a uniserial ring
References
[lJ BAIRE, R.: Ann. Mat. Pura Appl. 3 (1899),67. and rings of matrices over local rings of a special type.
P.s. Aleksandrov Every balanced ring is semi-perfect. A Noetherian bal-
anced ring is an Artinian ring.
Editorial comments. This theorem is also known as the
Baire category theorem (cf. [A1], p. 200). References
[I] Itogi Nauk. i Tekhn. Algebra Topol. Geom. 19 (1981),31-134.
References [2) FAITH, c.: Algebra, 1-2, Springer, 1973-1976.
[A 1] KELLEY, J.L.: General topology, van Nostrand, 1955.
L.A. Skornyakov
2) Baire's theorem on semi-continuous junctions: Let A
be a subset of a metric space M, and let j: M ~R. The AMS 1980 Subject Classification: 16A36
condition: For any number a the set
BALANCED SET - A set U in a real or complex vec-
{x: xEA, j(x)~a} (or, respectively,
tor space X such that x E U and 1,\ I .;:;; I imply Ax E U.
{x: xEA, j(x)';:;;a}) is closed in A, is necessary and
An example of a balanced set is given by the unit ball
sufficient for j (x) to be semi-continuous from above
in a normed vector space and, generally, by a neigh-
(Oi, respectively, from below) on A. Demonstrated by
bourhood U of zero in a base of neighbourhoods of
R. Baire for j: R~R [1]. It follows from this theorem
zero in a topological vector space. These neighbour-
that semi-continuous functions belong to the first Baire
hoods of zero are moreover absorbing, i.e. such that for
class (cf. Baire classes). A stronger theorem is valid: A
any x EX there exists an 0:>0 such that x E'\U for
function that is semi-continuous from above (from
below) and that does not assume the value + CXJ ( - (0)
I ,\ I ~a. If U is a convex, absorbing and balanced set,
then the functional Pu(x)=inf{ 1,\ I: x E'\U} is a
is the limit of a monotone non-increasing (non-
semi-norm, i.e. it has the properties
decreasing) sequence of continuous functions.
References Pu(x +y) ,.;; Pu(x)+pu(y), PU(Ax) = 1,\ Ipu(x).
[I] R.: La tMorie desfonctions discontinues, Gauthier-
BAIRE, A balanced set is also called centred.
Villars, 1905.
[2) NATANSON, LP.: Theorie der Funktionen einer reellen References
Veriinderlichen, H. Deutsch, Frankfurt a.M., 1961 (translated [I] KANrOROVICH, L.V. and AKILOv, G.P.: Functional analySis,
from the Russian). I .A . V'znogra dova Moscow, 1977 (in Russian).
VI. Sobolev

327
BALANCED SET

Editorial comments. The functional Pu mentioned above plest form is to find, fora given mass distribution J.L
is also called the Minkowski functional of the convex, inside a closed domain 15, a mass distribution p on
absorbing and balanced set U. f= aD such that the potentials of both distributions
References coincide outside D. If the boundary f is smooth, the
[A1] YOSIDA, K.: Functional analysis, Springer, 1980. solution of the balayage problem for J.L will be an abso-
AMS 1980 Subject Classification: 46AXX lutely continuous measure P. Its density, or the
derivative p' 0'), y Ef, may be written down in terms of
BALAYAGE METHOD - A method for solving the the Green function G(x, y) of the domain D in the
Dirichlet problem for the Laplace equation, developed
by H. Poincare ([1], [2], see also [4]), which will now be
form
p'(y) = j t,
aGe Y) dp.(x),
ny
p.;;;'O, yEf, (*)
described. Let D be a bounded domain of the where aG / any is the derivative of G (x, y) in the direc-
Euclidean space Rn , n;:' 2, let f = aD
be the boundary tion of the interior normal to f at the point y Ef.
of D. Let 8y be the Dirac measure concentrated at the Inside the domain D the potentials satisfy the inequal-
point y ED, let U(x; 8y ) be the Newton potential of the ity U(x; p)";;; U(x; p.), i.e. balayage inside the domain
measure 8y for n;:' 3, or the logarithmic potential of the results in a decrease of the potential. If p.=8x is the
measure 8y if n = 2. A balayage (or sweeping) of the Dirac measure at the point x ED, formula (*) yields
measure 8y from the domain D to the boundary f is a p'(y)=aG(x,y)/any , i.e. the normal derivative of the
measure /3y on f whose potential U(x; /3y ) coincides Green function is the density of the measure obtained
outside D with U(x; 8y ) and is not larger than U(x; 8y ) by balayage of the unit mass concentrated at the point
inside D; this measure /3y is unique and coincides with x ED. Generalization of formula (*) yields an expres-
the harmonic measure on f for the point y ED. The sion for the balayaged measure p(E) of an arbitrary
balayage of an arbitrary positive measure, concentrated Borel set E cf for an arbitrary domain D:
on D, is defined in a similar manner. If D is a sphere,
pCE) = jw(x;E, D)dp.(x),
the density of the mass distribution /3Y' i.e. the deriva-
tive of the measure /3y, is identical with the Poisson where w(x; E, D) is the harmonic measure of E with
kernel (cf. Poisson integral). In general, if the boundary respect to the domain D at the point x.
r is sufficiently smooth, the measure /3y is absolutely If K is an arbitrary compact set in Rn and p. is a
continuous, and the density of the mass distribution /3y bounded positive Borel measure, the balayage (or
coincides with the normal derivative of the Green func- sweeping) of the measure p. onto the compact set K is a
tion for D. The measure /3y serves to write down the measure p on K such that U(x; p)";;;; U(x; p.) every-
solution of the Dirichlet problem as the so-called for- where, and such that quasi-everywhere on K, i.e. with
mula of de la Val!ee-Poussin: the possible exception of a set of points of exterior

U(y) = /f (x) d/3y(x),


capacity zero, U(x; p)= U(x; p.). Such a formulation of
the balayage problem, which is more general than
balayage from a domain, may also be extended to
where f (x) is a function defined on f. potentials of other types, e.g. Bessel potentials or Riesz
In his original publication on the balayage method, potentials (cf. Bessel potential; Riesz potential). Balay-
Poincare began by demonstrating the geom) trical con- age of measures onto arbitrary Borel sets K is also con-
struction of the process for a sphere. Then, Lasing him- sidered.
self on Harnack's theorems (cf. Harnack theorem) and The problem of balayage for superharmonic func-
on the fact that it is possible to exhaust the domain D tions (cf. Superharmonic function) has been similarly
r
by a sequence of spheres {Bd = I, he constructed an formulated. Let v be a non-negative super harmonic
infinite sequence of potentials {Un} ~= I in which each function on a domain D eRn. The balayage of the
potential Un + I is obtained from the preceding one, Un, function v onto a compact set K C 15 is the largest
by the balayage method of moving the masses from the superharmonic function B~ such that 1) its associated
domain U: = 1Bk to its boundary, and which reduces measure is concentrated on K; 2) B~";;;;v everywhere;
to solving the Dirichlet problem for a sufficiently and 3) B~ = v quasi-everywhere on K.
smooth domain D (for a detailed discussion of the con- In abstract potential theory (cf. Potential theory,
ditions of applicability of the balayage method, see [3]). abstract) the balayage problem in both its formulations
In modern potential theory [5], [6] the balayage prob- is solved for sets K in an arbitrary harmonic space X,
lem is treated as an independem problem, resembling i.e. in a locally compact topological space X which per-
the Dirichlet problem, and it turns out that the balay- mits the isolation of an axiomatically defined sheaf of
aged measure can be considered on sets of a general harmonic functions. This axiomatic approach makes it
nature. For instance, the balayage problem in its sim- possible to consider the balayage problem for potentials

328
BANACH ALGEBRA

connected with partial differential equations of a more '11'/2 s"-I


general nature [7]. For the balayage method in stochas- V" = f(n /2+ 1) R" = -n- R ,
tics cf. [8]. where sn -I is the surface of the boundary sphere and
References f is the gamma-function: f(n + l)=n!,
[1) POINCARE, H.: 'Sur Ies equations aux derivees partielles de Ia f(n + I / 2)=2- n (l'3 ... (2n -1)Jn.l /2. In particular,
physique mathematique', Amer. J. Math. 12, no. 3 (1890), 211-
294. VI = 2R
,
V 2 = wR2
,
V3 = .i
3 wR 3' V4 = ~2R4 .
(2) POINCARE, H.: Theorie du potentiel Newtonien, Paris, 1899.
(3) VALLEE-POUSSIN, Cu.J. DE LA: Le potentiellogarithmique,
balayage et representation confonne, Gauthier-Villars, 1949. With the increase of the dimension, the volume of a
(4) SRETENSKIi, L.N.: Theory of the Newton potential, Moscow- ball 'concentrates' at its surface:
Leningrad, 1946 (in Russian).
vn(R2)- V"(R I )
(5) LANDKOF, N.S.: Foundations of modem potential theory,
V"(R 2) ~ 1, RI <R 2, n~oo.
Springer, 1972 (translated from the Russian).
(6) BRELOT, M.: Elements de la tMorie c/assique du potentiel, Sor-
bonne Univ., Paris, 1965. A ball is the simplest geometrical figure. Its topology
(7) CONSTANTINESCU, C. and CORNEA, A.: Potential theory on is trivial. Among all bodies of an equal volume, a ball
harmonic spaces, Springer, 1972. has minimal surface, and among all bodies of an equal
(8) MEYER, P.A.: Probability and potentials, Blaisdell, 1966.
surface, it has maximal volume.
E.D. Solomentsev
In exactly the same manner a ball can be defined in
Editorial comments. 8alayage is also referred to as a metric space; however, in this case it need not be, for
sweeping of a measure. A classic reference for problems in example, strictly convex, its surface may have non-
potential theory related to Green functions is [A 1]. smooth points, etc., that is, it may have all phenomena
In probabilistic potential theory the swept measure P.s on characteristic of arbitrary convex bodies.
f=aD of a probability measure p. concentrated on D turns
Unlike a finite-dimensional ball, an inJinite-
out to be the distribution of a standard Brownian motion on
dimensional ball, being the direct limit of a sequence of
D, which has initial distribution p., at the moment of first hit-
balls of succesive dimensions imbedded in one another,
ting f.
Another link with probabilistic potential theory is provided
does not have a compact closure. On the contrary, the
by the fact that, for each sufficiently nice harmonic space, compactness of a ball in a topological vector space
there exists a Hunt process (Xt ) whose excessive functions indicates the finite dimensionality of the latter.
are the positive hyper-harmonic functions. If PK denotes the For references, see Sphere.
I.S. Sharadze
hitting distribution of a compact set K, then PKu=B~ for
positive superharmonic functions u, and the balayage of a AMS 1980 Subject Classification: 51 N20, 46805
measure p. on K is given by P.PK' Therefore, the notion of
balayage of a function or a measure can also be defined in BANACH ALGEBRA - A topological algebra A over
terms of the potential kernel of a semi-group of kernels, see the field of complex numbers whose topology is defined
[A3]. by a norm which converts A into a Banach space, the
References multiplication of the elements being separately continu-
[A1) TSUJI, M.: Potential theory in modern function theory, Chel- ous for both factors. A Banach algebra is said to be
sea, reprint, 1975. commutative if xy = yx for all x, y EA (cf. Commutative
[A2] DooB, J.L.: Classical potential theory and its probabilistic
counterpart, Springer, 1984. Banach algebra). A Banach algebra is said to be an
[A3] DELLACHERIE, C. and MEYER, P.A.: Probabilites et potentiel, algebra with a unit if A contains an element e such that
1-2, Hermann, 1975-1983. ex =xe =x for any x EA. If a Banach algebra has no
AMS 1980 Subject Classification: 31825, 31815, unit, a unit may be adjoined, i.e. it is possible to con-
31820,60J45 s_truct a Banach algebra A with a unit element such that
A contains the initial algebra A as a closed sub algebra
BALL - A set vn of points x in a Euclidean space of codimension one. In any Banach algebra A with a
En at a distance from a given point Xo (the centre of unit element e it is possible to change the norm for an
the ball) less than (an open ball von), or not greater equivalent one so that in the new norm the relation-
than (a closed ball V") a quantity R (the radius of the ships II ab I , ; ; ; I a II I b II, I e I = I are valid. In what
ball), i.e. follows it is, as a rule, assumed that the algebra does
contain a unit and that it satisfies the norm conditions
von (V') = {XEE": p(x, xo)<R (";;;;;R)}.
given above.
A ball Vi is a line segment, V2 is a disc, vn for n > 3 is Examples. I) Let X be a compact topological space
sometimes called a hyperball. The boundary (surface) of and let C (X) be the set of all continuous complex-
a ball is a sphere. valued functions on X. C (X) will then be a Banach
The volume of a ball is algebra with respect to the usual operations, with norm

329
BANACH ALGEBRA

11I11 = m:x III mutative Banach algebra with respect to the pointwise
operations, with norm
2) The set of all bounded linear operators on a
Banach space forms a Banach algebra with respect to II I II = sup
get;
1/(g) I
the usual operations of addition and multiplication of 6) The skew-field of quaternions does not form a
linear operators with the operator norm. Banach algebra over the field of complex numbers,
3) Let V be a bounded domain in n-dimensional since the product of elements of a Banach algebra A
complex space en. The set of bounded holomorphic should be compatible with multiplication by numbers:
functions on V is a Banach algebra with respect to the For all AE e and x, y EA the equation
usual operations, with the natural sup-norm:
>..(xy) = (Ax)y = x(>"y),
I I II = sttP II I
must be valid; it is not valid in the field of quaternions
This Banach algebra contains the closed subalgebra of if A=i, x=j,y =k.
bounded holomorphic functions on V that have a con- Any Banach algebra with a unit is a topological alge-
tinuous extension to the closure of V. The simplest bra with continuous inverses. Moreover, if (A) is the
example is the algebra of functions that are continuous set of elements of a Banach algebra A which have a
in the disc I z I ,;;;; 1 and analytic in the disc I z I < 1. (two-sided) inverse with respect to multiplication, then
4) Let G be a locally compact group and let L] (G) (A) is a topological group in the topology induced by
be the space (of equivalence classes) of all functions the imbedding (A)CA. If II e-a I <1, then aE(A),
that are measurable with respect to the Haar measure and
on G and that are absolutely integrable with respect to
this measure, with norm
I I I = I Il(g) I dg where b = e - a, and the series is absolutely convergent.
G The set of elements invertible from the right (from the
(left Haar integral). left) in A also forms an open set in A.
If the convolution operation If in a Banach algebra A all elements have an inverse
(or even a left inverse), then A is isometrically iso
(II *h)(h) = III (g)h(g-I h)dg
G morphic to the field of complex numbers (the
is considered as the multiplication in L] (G), then Gel'fand- Mazur theorem).
L] (G) becomes a Banach algebra; if G is an Abelian Since a certain neighbourhood of the unit in a
locally compact group, then the Banach algebra L] (G) Banach algebra A consists of invertible elements, the
is commutative. The Banach algebra L](G) is said to closure of any non-trivial ideal is again an ideal which
be the group algebra of G. The group algebra L] (G) does not coincide with A. In particular, a maximal (left,
has a unit (with respect to the convolution) if and only right, two-sided) ideal is closed.
if G is discrete. An important task in the theory of Banach algebras
If G is commutative it is possible to construct a faith- is the description of closed ideals in Banach algebras.
ful representation of L] (G), given by the Fourier The problem can be simply solved in a number of
transform of each function f EL] (G), i.e. bJ the func- cases. In the algebra C(X) (cf. Example 1) each closed
tion ideal has the form 1= {fE C (X): fl y ~O}, where Y is a
I(x) = Ix(g)/(g)dg, closed set in X. If A is the algebra of all bounded linear
G
operators on a separable infinite-dimensional Hilbert
on the character group G of p. The set of functions f(~)
A A

space, then the ideal of completely-continuous opera-


form;; a certain algebra A (G) of continuous functions tors is the only closed two-sided ideal in A.
on G (with respect to the ordinary pointwise opera- An element a EA has a left (right) inverse if and only
tions), called the Fourier algebra of the locally compact if it is not contained in any maximal left (right) ideal.
Abelian group G. !n particular, if G is the group of The intersection of all maximal left ideals in A coin-
integers Z, then A (Z) is the algebra of continuous func- cides with the intersection of all maximal right ideals;
tions on the circle which are expandable into an abso- this intersection is called the radical of the algebra A
lutely convergent trigonometric series. and is denoted by RadA. An element ao EA belongs to
5) Let G be a topological group. A continuous RadA if and only if e + aao E(A) for any a EA. Alge-
complex-valued function f (g) on G is said to be almost bras for which RadA =0 are said to be semi-simple.
periodic if the set of its shifts f(gog), go E G, forms a The algebras C( X) and the group algebras L] (G) are
compact family with respect to uniform convergence on semi-simple. All irreducible (i.e. not having a non-
G. The set of almost-periodic functions forms a com- trivial invariant subspace) closed sub algebras of the

330
BANACH ALGEBRA

algebra of all bounded linear operators on a Banach II a* II = I a II for all aEA, then
space are semi-simple.
The resolvent of an element a EA is the function
t/J(a' a) ~ I/{e) I a' a I
A Banach algebra A with involution is said to be
l>.~a" = (a-l>.e)-I, completely symmetric if e + a * a E (A) for any a EA; A is
defined on the set of all AEC for which a (two-sided) said to be a C* -algebra (a completely-regular algebra) if
inverse to a - Ae exists. The domain of existence of the II a *a II = I a 112 for any a EA. Any C* -algebra is com-
resolvent contains all points A with I A I > II a II. The pletely symmetric. Examples of completely-symmetric
maximal domain of existence of the resolvent is an algebras include the group algebras L 1(G) of commuta-
open set; the resolvent is continuous on this set and is tive or compact groups. Examples of C* -algebras
even analytic, moreover da}, / dA=aX. In addition, include the algebras C(X) (the involution in C(X) is
Hilbert's identity defined as transition to the complex conjugate func-
tion) and closed subalgebras of the algebra of bounded
a"2 -a", = (l>.2-l>.da",a"2 linear operators in a Hilbert space containing both the
is valid. The complement of the domain of existence of operator and the adjoint operator (involution is defined
the resolvent is called the spectrum of the element a and as transition to the adjoint operator). Any C* -algebra
is denoted by a(a). For each a EA the set a(a) is non- is isometrically isomorphic (involution being preserved)
empty, closed and bounded. with one of these algebras (the Gel'fand- Narmark
If a, bEA, then the sets a(ab) and a(ba) need not theorem). In particular, any commutative C -algebra A
coincide, but is isometrically isomorphic (involution being preserved)
a(ab) U {O} = a(ba) U {O}. with one of the algebras C(X) (this theorem includes
the Stone- Weierstrass theorem).
The number An element a of a Banach algebra with involution is
lal = max Il>.l
"Eo(a)
said to be Hermitian if a* =a. For a Banach algebra
is called the spectral radius of the element a EA; with an involution to be a C* -algebra it is necessary
Gelfand's formula and sufficient that the condition II e ia II = 1 be fulfilled
I a 1= lim I an III/n, for all Hermitian elements a. If, in a Banach algebra
with an involution, sup II e ia II < 00 (upper bound over
where the limit on the right-hand side always exists, is all Hermitian elements), then this algebra is topologi-
valid. If a ERadA, then I a I =0; the converse is true, cally *-isomorphic with a C* -algebra. If, in an arbi-
generally speaking, only in commutative Banach alge-
trary Banach algebra, II e ita II = 1 for all real t for a cer-
bras whose radical coincides with the set of generalized
tain fixed element a, then II a II coincides with the
nilpotents, i.e. elements a for which I a I = O. In any
spectral radius, i.e. I a I = I a I
Banach algebra the relationships I a k I = I a Ik,
The theory of Banach algebras, and of commutative
I Aa I = I A I I a I and I a I ~ II a II are true. If A is Banach algebras in particular, has numerous applica-
commutative, then I ab I ~ I a I I b I and tions in various branches of functional analysis and in
I a + b I ~ I a I + I b I are valid. a number of other mathematical disciplines.
Examples of non-commutative algebras in which gen- References
eralized non-zero nilpotents are absent are known. [I] BOURBAKI, N.: Elements of mathematics. Spectral theories,
However, if II a 2 II = I a 112 for any a EA, then the Addison-Wesley, 1977 (translated from the French).
[2] GAMELIN, T.W.: Uniform algebras, Prentice-Hall, 1969.
Banach algebra A is commutative. The condition [3] GUNNING, R.C. and ROSSI, H.: Analytic functions of several
II ab II = I ba II for all a, b EA is also sufficient for an complex variables, Prentice-Hall, 1965.
algebra A with a unit to be commutative. [4] GEL'FAND, I.M.: 'Normierte Ringe', Mat. Sb. 9 (51), no. I
An algebra A is said to be an algebra with involution (1941), 3-24.
[5] GLEASON, A.M.: 'Function algebras', in Proc. Sem. on analytiC
if an operation a ----'?a * is defined on A that satisfies the functions, Vol. 2, 1958, pp. 213-226.
conditions [6] HOFFMAN, K.: Banach spaces of analytic functions, Prentice
hall, 1962.
(l>.a +ph)' = Xa' +jih', (a')' = a, (ab)' =b' Q', [7] GORIN, E.A.: 'Maximal sub algebras of commutative Banach
for all a, bEA, A, IlEe. The mapping a----'?a* is said to algebras with involution', Math. Notes 1, no. 2 (1967), 173-178.
(Mat. Zametki 1, no. 2 (1967),173-178)
be an involution in A. A linear functional1/; on an alge- [8] DUNFORD, N. and SCHWARTZ, J.T.: Linear operators, 1-3,
bra A with an involution is said to be positive if Interscience, 1958-1971.
1/;(aa*r;:.O for any aEA. If the linear functional1/; is [9] ZELAZKO, W.: Banach algebras, Elsevier, 1973 (translated from
the Polish).
positive, then [10] KAPLANSKY, I.: 'Functional analysis', in Surveys in applied
mathematics, Vol. 4. Some aspects of analysis and probability,
for all a EA. If the involution in A is an isometry, i.e. if Wiley, 1958.

331
BANACH ALGEBRA

[11] LoOMIS, L.H.: An introduction to abstract harmonic analysis, v. the type p.( U, Ck , f), have local properties which
Nostrand, 1953. correspond to classical properties: primary decomposi-
[12] NAiMARK, M.A.: Normed rings, Reidel, 1984 (translated from
the Russian). tion, Hilbert's Nullstellen theorem, the local description
[13] Some questions in approximation theory, Moscow, 1963 (in Rus- theorem, etc., are all applicable [2].
sian; translated from the English).
[14] RICKART, C.E.: General theory of Banach algebras, Nostrand,
References
1960. [I] DouADY, A.: 'Les problemes des modules pour les sous-espace
[15] ROYDEN, H.L.: 'Function algebras', Bull. Amer. Math. Soc. 69, analytique compacts d'une espace analytique donne', Ann. Inst.
Fourier (Grenoble) 16, no. 1 (1966), 1-95.
no. 3 (1963), 281-298.
[2] Sous-ensembles analytiques d'une variete banachique complexe,
E.A. Gorin Springer, 1970.
D.A. Ponomarev
Editorial comments. Gel'fand's formula is also called the
spectral radius formula. AMS 1980 Subject Classification: 46BXX, 32CXX

References
[A1] KADlSON, R.V. and RINGROSE, l.R.: Fundamentals of the BANACH INDICATRIX, multiplicity function, of a
theory of operator algebras, 1, Acad. Press, 1983. continuous function y = f (x), a";;;;x,,;;;;b - An integer-
AMS 1980 Subject Classification: 46H05, 46J05, valued function N(y,j), -oo<y<+oo, equal to the
43A10 number of roots of the equation f (x) = y. If, for a given
value of y, this equation has an infinite number of
BANACH ANALYTIC SPACE An infinite- roots, then
N(Y,f) = +00,
dimensional generalization of the concept of an analytic
space, which arose in the context of the study of defor- and if it has no roots, then
mations of analytic structures (d. Defonnation). Here, N(y,f) = O.
the local model is a Banach analytic set, i.e. a subset The function N(y,f) was defined by S. Banach [1] (see
p.( U, F, j) = f- 1(0) of an open set U in a Banach space also [2]). He proved that the indicatrix N (y, f) of any
E over C, where f: U ~F is an analytic mapping into continuous function f(x) in the interval [a, b] IS a
the Banach space F. As distinct from the finite- function of Baire class not higher than 2, and
dimensional case, not one structure sheaf, but a set of +00
f N(y, f)dy,
b
sheaves <1>( W), where W is an open set in an arbitrary V(f) = (*)
Banach space G, is defined on the local model. <1>( G) is a -00

defined as the quotient of the sheaf of germs of analytic where V;;(f) is the variation of f (x) on [a, b]. Thus,
mappings U ~G by the sub sheaf of germs of mappings equation (*) can be considered as the definition of the
of the type x~cp(x)f(x), where </>: U~Hom(F, G) is a variation of a continuous function f (x). The Banach
local analytic mapping, while <1>( W) c <1>( G) is generated indicatrix is also defined (preserving equation (* for
by mappings which assume values in W. The sheaves functions with discontinuities of the first kind [3]. The
<1>( W) define a functor from the category K of open sets concept of a Banach indicatrix was employed to define
in Banach spaces and their analytic mappings into the the variation of functions in several variables [4], [5].
category of sheaves of sets on f- 1(0). References
A topological space X with a functor from the [1] BANACH, S.: 'Sur les lignes rectifiables et 1es surfaces dont
category K into the category of sheaves of s) ts in X in l'aire est finie', Fund. Math. 7 (1925),225-236.
[2] NATANSON, J.P.: Theorie der Funktionen einer reellen
which all points have neighbourhoods isomorphic to Veriinderlichen, H. Deutsch, Frankfurt a.M., 1961 (translated
some local model, is said to be a Banach analytic space. from the Russian).
Complex-analytic spaces form a complete sub- [3] LOZINSKI), S.M.: 'On the Banach indicatrix', Vestnik Len-
ingrad. Univ. Math. Mekh. Astr. 7, no. 2, 70-87 (in Russian).
category in the category of Banach analytic spaces. A
[4] KRONROD, A.S.: 'On functions of two variables', Uspekhi Mat.
Banach analytic space is finite-dimensional if each one Nauk 5, no. 1 (1950),24-134 (in Russian).
of its points x has a neighbourhood that is isomorphic [5] VITUSHKIN, A.G.: On multi-dimensional variations, Moscow,
to a model p.( U, F, j) and for which there exists a map- 1955 (in Russian).
B.I. Golubov
ping g: U ~ U inducing an automorphism of the model
Editorial comments. More generally, for any mapping
and having a completely-continuous differential dgx [1].
t: X---'> Y define NCy, t) analogously. Then, let X be a separ-
A second special case of a Banach analytic space is a
able metric space and let tCA) be /L-measurable for all Borel
Banach analytic manifold, i.e. an analytic space that is subsets A of X. Let ~Cs)=/L(fCs) for Sex and let>/; be the
locally isomorphic to open subsets of Banach spaces. measure on X defined by the Caratheodory construction
An important example is the manifold or
linear sub-
spaces of a Banach space over C that are closed and
from ~. Then
>/;CA) = f
N(f)A) d/Ly
admit closed complements. for every Borel set Aex. Cf. [A1], p. 176 ft. For significant
Finitely-defined Banach analytic sets, i.e. models of extension of ("), cf. [A2].

332
BANACH - MAZUR FUNCTIONAL

Editorial comments. each positive additive functional is continuous in norm,


References which means that the classes of regular functionals and
[A 1] FEDERER, H.: Geometric measure theory, Springer, 1969. additive functionals which are continuous in norm
[A2] FEDERER, H.: 'An analytic characterization of distributions
coincide. The space X' which is dual in the sense of
whose partial derivatives are representable by measures',
Bull. Amer. Math. Soc. 60 (1954),339. Banach to the normed lattice X is a conditionally com-
plete Banach lattice. In a normed lattice the
AMS 1980 Subject Classification: 26A15
Hahn - Banach theorem may be strengthened as fol-
BANACH LATIICE, - A vector lattice that is at the lows: For any Xo >0 there exists a positive additive
same time a Banach space with a norm which satisfies functional f, which is continuous in norm, such that
the monotonicity condition: f(xo) = II Xo II, I f II = 1.
References
I x I ~ Iy I ~ II x I ~ I y II (1) VULIKH, B.Z.: Introduction to the theory of partially ordered
A Banach lattice is also called a KB-lineal, whereas an spaces, Wolters-Noordhoff, 1967 (translated from the Russian).
[2) DAY, M.M.: Normed linear spaces, Springer, 1958.
arbitrary normed lattice, i.e. a vector lattice with a
monotone norm, is called a KN-lineal. When complet- B.Z. Vulikh
ing a normed lattice in norm, the order relations may Editorial comments. Cf. Riesz space for the notion of
be extended to the resulting Banach space so that it 'convergence with a regulator'. The terminology in the arti-
becomes a Banach lattice. If it is possible to introduce cle above is from [1], and is not in general use in the
in a lattice a Banach topology which converts it to a Western literature.
A vector lattice (i.e. a real linear (partially) ordered vector
Banach lattice, such a topology is unique. The simplest
space that is at the same time a lattice) is usually called a
example of a Banach lattice is the space C(Q) of con-
Riesz space. A norm II . lion a Riesz space L is called a
tinuous functions on an arbitrary compact topological
Riesz norm if
space Q with the natural (pointwise) order and with the
ordinary (uniform) norm. Other examples of Banach I t I ~ I g I ~ I t II ~ II g II
lattices include Lp spaces and Orlicz spaces (cf. Orlicz whenever t, gEL (called the monotonicity condition,
space). In Banach lattices convergence in norm is (*)- above). Here
convergence for convergence with a regulator. This is
ItI = (- t)V(f).
not true of normed lattices. A normed Riesz space is understood to mean a Riesz space
An important special case is a Banach lattice of provided with a Riesz norm. The definition of a Banach lat-
tice can now be rephrased as follows: A Banach lattice is a
bounded elements. If a lattice X contains a strong unit
normed Riesz space that is complete in the norm.
1, i.e. if for each x EX there exists a ,\ such that
All (Riesz) norms on a Banach lattice are equivalent.
I x I :S;;;,\I, then the smallest ,\ for which this inequality For a Banach lattice L the order dual space (cf. Riesz
is valid is taken as II x II. The normed lattice thus space) and norm dual space coincide (this is not true for
obtained is called a normed lattice of bounded elements; normed Riesz spaces in general). A Banach lattice, required
if it is complete in norm, it is called a Banach lattice of to be complete in norm, need not be Oedekind complete.
bounded elements. In a Banach lattice (and even in a However, any Banach lattice L (even, every normed Riesz
normed lattice) of bounded elements convergence in space) can be regarded as a (Riesz) subspace of a
norm is identical with convergence with a regulator, Oedekind-complete Banach lattice (the bi-dual (in norm) of
while the boundedness of a set of elements in norm is L, cf. Banach space).
identical with order boundedness. If a normed lattice of References
bounded elements is conditionally a-complete, it is [A1] SCHAEFER, H.H.: Banach lattices and positive operators,

complete in norm. Springer, 1974.


[A2] LUXEMBURG, W.A.l. and ZAANEN, A.c.: Riesz spaces, 1,
The space C(Q) is a Banach lattice of bounded ele- North-Holland, 1971.
ments in which the function x(q) 1 is taken as the [A3] LINDENSTRAUSS, 1. and TZAFRlRl, L.: Classical Banach
unit. For any Banach lattice X of bounded elements spaces, 2, Springer, 1979.
[A4] ZAANEN, A.c.: Riesz spaces, 2, North-Holland, 1983.
there exists a compact Hausdorff space Q such that X
is algebraically and lattice isomorphic to the space AMS 1980 Subject Classification: 46A40, 46830
C (Q). This is an abstract characterization of the
Banach lattice of continuous functions on a compact BANACH - MAzUR FUNCfIONAL,
Hausdorff space. Banach - Mazur operator - A concept of a computable
In any normed lattice an additive functional that is functional (operator), proposed by S. Banach and S.
continuous in norm is regular and, moreover, is Mazur [l] and concerning the computability of a func-
representable as the difference of two additive function- tional (operator) from a set M 1 into a set M 2 as its
als which are continuous in norm. In a Banach lattice property to transform any computable sequence of e1e-

333
BANACH - MAZUR FUNCTIONAL

ments of M 1 into a computable sequence of elements BANACH MODULE (left) over a Banach algebra A -
of M2 (d. Computable function). A Banach space X together with a continuous bilinear
Let R be the set of all one-place general recursive operator m:A XX~X defining on X the structure of a
functions (d. General recursive function). A functional left module over A in the algebraic sense. A right
<I> defined on R and assuming natural numbers as Banach module and a Banach bimodule over A are
values is called computable according to defined in an analogous manner. A continuous
Banach - Mazur, or a Banach - Mazur functional, if for homomorphism of two Banach modules is called a
any two-place general recursive function g there exists a morphism. Examples of Banach modules over A
general recursive function f such that include a closed ideal in A and a Banach algebra B :JA.
j(n) = <I>(g(n, m)). A Banach module over A that can be represented as a
direct factor of Banach modules A + E, (where A + is
(Here g is considered to be a function of m for any
A with an added unit and E is a Banach space and
constant n.) All general recursive functionals and
mea, bx)=abx) is called projective. Cf. Topologi-
everywhere-defined effective functionals (d. Construc-
cal tensor product.
tive metric space) are Banach - Mazur functionals. On
References
the other hand, an example of a Banach - Mazur func- [1] R.iEFFEL, M.A.: 'Induced Banach representations of Banach
tional which is not identical with any general recursive, algebras and locally compact groups', J. Funct. Anal. 1 (1967),
and hence neither with any effective, functional is 443-491. A .1.v a. lGh eIemsk'II v

known [2]. An important property of Banach - Mazur


functionals is their continuity [1]: The values of such a AMS 1980 Subject Classification: 46825
functional on any general recursive function is defined BANACH SPACE, B-space - A complete normed vec-
only by a finite number of values of this function. tor space. The function spaces introduced by D. Hil-
The concept of computability just outlined can be bert, M. Frechet and F. Riesz between 1904 and 1918
extended to functions of a real variable. Let C be the served as the starting point for the theory of Banach
set of computable sequences of computable real spaces. It is in these spaces that the fundamental con-
numbers; each sequence {ak} E C is defined by two gen- cepts of strong and weak convergence, compactness,
eral recursive functions f and g such that for all nand linear functional, linear operator, etc., were originally
k la- (k, n)-g(k,
k n+l
n) I < _1_.
n+l
studied. Banach spaces were named after S. Banach
who in 1922 began a systematic study of these spaces,
based on axioms introduced by himself, and who
A function cf> of a real variable is said to be computable obtained highly advanced results.
according to Banach - Mazur (the set of such functions
The theory of Banach spaces developed in parallel
is denoted by 91) if for any sequence {ak} from C the
with the general theory of linear topological spaces (cf.
sequence {cf>( ak)} also belongs to C. All functions cf> E 91
Linear topological space). These theories mutually
are continuous at all computable points [1]. For
enriched one another with new ideas and facts. Thus,
instance, sgn x f/:. 91. The question as to whether all
the idea of semi-norms, taken from the theory of
functions from 91 are computably continuous is still
normed spaces, became an indispensable tool in con-
(1977) open. The set 91 is closed with resr~ct to the
structing the theory of locally convex linear topological
sequence of operations used in analysis, so) that com-
spaces. The ideas of weak convergence of elements and
putable analysis can be successfully developed on this
linear functionals in Banach spaces ultimately evolved
basis [1].
to the concept of weak topology. The theory of Banach
References spaces is a thoroughly studied branch of functional
[1] MAZUR, S.: Computahle analysis. PWN. 1963.
analysis, with numerous applications in various
[2] FRIEDBERG. R.M.: '4-quantifier completeness: A Banach-
Mazur functional not uniformly partial recursive', Bull. Acad. branches of mathematics - directly or by way of the
Polan. Sci. Ser. Sci. Malh., ASlr. Phys. 6, no. I (1958 J. 1-5. theory of operators.
[3] MARKOV, A.A.: 'On constructive functions'. Trud)' Mal. Jnsl. The problems involved in Banach spaces are of dif-
Sleklov. 52 (1958), 315-348 (in Russian).
[4] ROGERS, JR .. H.: Theory of recursive junclions and ejfeclive ferent types: the geometry of the unit ball, the
compulahilill'. McGraw-Hill, 1967. h geometry of subspaces, the linear topological classifica-
B.A. Kus ncr
tion, series and sequences in Banach spaces, best
Editorial comments. For a more recent exposition of approximations in Banach spaces, functions with values
recursive analysis, see [A1].
in a Banach space, etc. Regarding the theory of opera-
References tors in Banach spaces it should be pointed out that
[A1] AIIERTH. O. Computable analysis. McGraw-Hili. 1980.
many theorems are directly related to the geometry and
AMS 1980 Subject Classification: 03020, 03F60 the topology of Banach spaces.

334
BANACH SPACE

Examples. The Banach spaces encountered in Lp(S;~, JL). It is the space of Lebesgue-measurable
analysis are mostly sets of functions or sequences of functions, summable of degree p, with the norm
numbers which are subject to certain conditions.
I) ~,p;;;;'1, is the space of numerical sequences II x I = [[I xes) IPdsfIP.
x={~n} for which
00
13) AP is the Bohr space of almost-periodic func-
~ I ~n IP < 00, tions, with the norm
n=1

with the norm Ilxll= sup Ix(t)l.


-00<1<::+00

The spaces qa, b], C[a, b], Lp[a, b], c, ~ are


separable; the spaces M[ a, b], m, AP are non-
separable; C[K] is separable if and only if K is a com-
2) m is the space of bounded numerical sequences
pact metric space.
with the norm
I x I = sup
n
I ~n I A (closed linear) subspace Y of a Banach space, con-
sidered apart from the enveloping space X, is a Banach
3) c is the space of convergent numerical sequences space. The quotient space X / Y of a normed space by
with the norm a subspace Y is a normed space if the norm is defined
I x II = sup I ~n I
n
as follows. Let Y 1 = X 1 + Y be a coset. Then
4) Co is the space of numerical sequences which
converge to zero with the norm
II YII = YEY
infII X1+Y II
If X is a Banach space, then X / Y is a Banach space
II x II = max I ~n I
n
as well. The set of all continuous linear functionals
5) qa, b] is the space of continuous functions defined on the normed space X, with the norm
x=x(t) on [a, b] with the norm
11/11 = sUP I ((x) I , x~O,
IxI = max
a:<;;;,ts;,b
I x(t) I XEX I x II
is said to be the dual space of X, and is denoted by X'.
6) C[K] is the space of continuous functions on a
It is a Banach space.
compactum K with the norm
Banach spaces satisfy the Hahn - Banach theorem on
II x II = max I x(t) I
IEK
the extension of linear functionals: If a linear func-
tional is defined on a subspace Y of a normed space X,
7) C[a, b] is the space of functions with continuous
it can be extended, while preserving its linearity and
derivatives up to and including the order n, with the
continuity, onto the whole space X. Moreover, the
norm
I x II = k~=(j1";;I,,;;b
max I X(k)(t) I
extension can be made to have the same norm:
IIfllx = sUP I (x) I = IIflly = sUP I (Y) I.
8) cn[/m] is the space of all functions defined in an XEX II x II YEt II y I
m-dimensional cube that are continuously differentiable Even a more general theorem is valid: Let a real-valued
up to and including the order n, with the norm of uni- function p (x) defined on a linear space satisfy the con-
form boundedness in all derivatives of order at most n. ditions:
9) M[a, b] is the space of bounded measurable func- p(x +y) ~ p(x)+p(y),
tions with the norm p(Ax) = 'Ap(x), A;;;' 0, X,YEX,
I x II = ess max I x (t) I and let f (x) be a real-valued linear functional defined
a~t~b
on a subspace Y C X and such that
10) A (D) is the space of functions which are analytic
in the open unit disc D and are continuous in the I(x) ~ p(x), XE Y.
closed disc ]5, with the norm Then there exists a linear functional F(x) defined on
the whole of X such that
II x I = rna]( I x (z) I
::'ED
F(x) = I(x), XE Y; F(x) ~ p(x), XEX
11) Lp(S;~,JL),p;;;;'l, is the space of functions x(s)
A consequence of the Hahn - Banach theorem is the
defined on a set S provided with a countably-additive
'inverse' formula which relates the norms of X and X':
measure JL, with the norm
- I (x) I
IP II x II - ~ 11/11 ,/~O, XEX
IIxll = [fl x(s) IPJL{ds)f .
The maximum in this formula is attained for some
12) Ip[a, b], p;;;;.l, is a special case of the space f= Ix EX'. Another important consequence is the

335
BANACH SPACE

existence of a separating set of continuous linear func- A Banach space is said to be strictly convex if its unit
tionals, meaning that for any x \*x 2 E X there exists a sphere S contains no segments. Convexity moduli are
linear functional f on X such that f(x\)*f(X2) (cf. introduced for a quantitative estimation of the convex-
Complete set of functionals). ity of the unit sphere; these are the local convexity
The general form of a linear functional is known for modulus
many specific Banach spaces. Thus, on Ip[a, b], p> 1,
all linear functionals are given by a formula S(X,f) = inf{l-II X;y II:YES, Ilx-YII;;a.+
b
f(x)= !x(t)y(t)dt,
a XES, 0<f';;;2,
where Y ELq[a, b], 1 / p + 1 / q = 1, and any function and the uniform convexity modulus
y(t)ELq defines a linear functional f by this formula,
moreover b ]I/q 8(f) = inf 8(x, f).
XES

Ilfll = [ [Iy(t) Iqdt . If ~(x, 0 for all x ES and all >0, the Banach space
is said to be locally uniformly convex. If ~(O, the
Thus, the dual space of Lp is Lq: L; = Lq. Linear func- space is said to be uniformly convex. All uniformly
tionals on L \ [a, b] are defined by the same formula, convex Banach spaces are locally uniformly convex; all
but in this case y EM, so that Li =M. locally uniformly convex Banach spaces are strictly
The space X", dual to X', is said to be the second
convex. In finite-dimensional Banach spaces the
dual. Third, fourth, etc., dual spaces are defined in a converses are also true. If a Banach space is uniformly
similar manner. Each element in X may be identified
convex, it is reflexive.
with some linear functional defined on X':
A Banach space is said to be smooth if for any
F(f) = f(x) for allfEX' (FEX", XEX), linearly independent elements x and y the function
where II F II = I x II One may then regard X as a sub- 1f;(t) = II x + ty II is differentiable for all values of t. A
space of the space X" and X ex" C XIV ... , Banach space is said to be uniformly smooth if its
x' ex'" c .... If, as a result of these inclusions, the modulus of smoothness
Banach space coincides with its second dual, it is called
reflexive. In such a case all inclusions are equalities. If
per) = sup { I X +ry "; II X -ry II
X.YES
-I}, r>O,

X is not reflexive, all inclusions are strict. If the quo- satisfies the condition
tient space X" / X has finite dimension n, X is said to
be quasi-reflexive of order n. Quasi-reflexive spaces exist
lim 2S2l = o.
for all n. In uniformly smooth spaces, and only in such spaces,
RefleXiVity criteria for Banach spaces. 1) X is reflex- the norm is uniformly Frechet differentiable. A uni-
ive if and only if for each f EX' it is possible to find an formly smooth Banach space is smooth. The converse is
x EX on which the 'sup' in the formula true if the Banach space is finite-dimensional. A
II f II -- ~~~ I I[(x) I
x II ' X=F0,
Banach space X is uniformly convex (uniformly
smooth) if and only if X is uniformly smooth (uni-
is attained. formly convex). The following relationship relates the
2) In reflexive Banach spaces and only in such spaces convexity modulus of a Banach space X and the
each bounded set is relatively compact with respect to smoothness modulus of X':
weak convergence: Anyone of its infinite parts con-
pX'(r) suo {f(2T)
= 0<,l;;2 -8 X(f)}.
tains a weakly convergent sequence (the
Eberletn - Shmul'yan theorem). The spaces Lp and Ip' If a Banach space is uniformly convex (uniformly
p> 1, are reflexive. The spaces L \, 11, C, M, c, m, AP smooth), so are all its subspaces and quotient spaces.
are non-reflexive. The Banach spaces Lp and Ip, p> 1, are uniformly con-
A Banach space is said to be weakly complete if each vex and uniformly smooth, and
weak Cauchy sequence in it weakly converges to an ele-
2 (I <p .;;;2),
ment of the space. Every reflexive space is weakly com- {
8() c:o::: f!' (2.;;;p<oo);
plete. Moreover, the Banach spaces L 1 and 11 are
weakly complete. The Banach spaces not containing a i' (l<p.;;;2),
{
subspace isomorphic to Co form an even wider class. per) c:o::: ~ (2';;;p<oo);
These spaces resemble weakly-complete spaces in
several respects.
lrf()c:o:::CP() ~ a <M<hl.
CP() J

336
BANACH SPACE

The Banach spaces M, C, A, L I , AP, m, c, II are not plete normed space is not homeomorphic to any
strictly convex and are not smooth. Banach space. All infinite-dimensional separable
The following important theorems for linear opera- Banach spaces are homeomorphic.
tors are valid in Banach spaces: In the class of separable Banach spaces, C [0, 1] and
The Banach - Steinhaus theorem. If a family of linear A (D) are universal (cf. Universal space). The class of
operators !if = {T a} is bounded at each point, reflexive separable Banach spaces contains even no iso-
sup II Tax II < 00, XEX, morphic universal spaces. The Banach space II is
a
universal in a somewhat different sense: All separable
then it is norm-bounded: Banach spaces are isometric to one of its quotient
sup II Ta II < 00.
spaces.
a
Each of the Banach spaces mentioned above, except
The Banach open-mapping theorem. If a linear con- L2 and 12 , contains subspaces without a complement.
tinuous operator maps a Banach space X onto a In particular, in m and M every infinite-dimensional
Banach space Y in a one-to-one correspondence, the separable subspace is non-complementable, while in
inverse operator T- I is also continuous. C[O, 1] all infinite-dimensional reflexive subspaces are
The closed-graph theorem. If a closed linear operator non-complement able. If all subspaces in a Banach
maps a Banach space X into a Banach space Y, then it space are complementable, the space is isomorphic to a
is continuous. Hilbert space. It is not known (1985) whether or not all
Isometries between Banach spaces occur rarely. The Banach spaces are direct sums of some two infinite-
classical example is given by the Banach spaces L2 and dimensional subspaces. A subspace Y is complement-
12 The Banach spaces C[Kd and C[K 2 ] are isometric able if and only if there exists a projection which maps
if and only if K] and K2 are homeomorphic (the X onto Y. The lower bound of the norms of the projec-
Banach-Stone theorem). A measure of proximity of tions on Y is called the relative projection constant
isomorphic Banach spaces is the number ,\( Y, X) of the subspace Y in X. Each n-dimensional
d(X, Y)=lninf II T 11'11 T- 1 II, subspace of a Banach space is complementable and
T
'\(Yn' X)';;;; Vn. The absolute projection constant ,\(Y) of
where T runs through all possible operators which real- a Banach space Y is
ize a (linear topological) isomorphism between X and A(Y) = S~A(Y, X),
Y. If X is isometric to Y, then d(X, Y)=O. However,
non-isometric spaces for which d(X, Y)=O also exist; where X runs through all Banach spaces which contain
they are said to be almost-isometric. The properties of Yas a subspace. For any infinite-dimensional separable
Banach spaces preserved under an isomorphism are Banach space Y one has '\(Y)= 00. Banach spaces for
said to be linear topological. They include separability, which '\(Y)';;;;'\<oo form the class fJ'A ('\;;'1). The
reflexivity and weak completeness. The isomorphic clas- class fJ'1 coincides with the class of spaces C(Q) where
sification of Banach spaces contains, in particular, the Q are extremally-disconnected compacta (d.
following theorems: Extremally-disconnected space).
Fundamental theorems on finite-dimensional Banach
Lr =/=- Ls; Ir =/=- I" r=/=-s;
spaces. 1) A finite-dimensional space (a Minkowski
Lr =/=-1" r=/=-s; Lr = I,. r=s=2; space) is complete, i.e. is a Banach space. 2) All linear
M = m; C[O,I)=/=-A(D); operators in a finite-dimensional Banach space are con-
C[K] = C [0, 1] if K is a metric compactum with the tinuous. 3) A finite-dimensional Banach space is reflex-
cardinality of the continuum; ive (the dimension of X is equal to the dimension of
X). 4) A Banach space is finite-dimensional if and only
C[Im) =/=- C[O, I).
if its unit ball is compact. 5) All n-dimensional Banach
Each separable Banach space is isomorphic to a locally spaces are pairwise isomorphic; their set becomes com-
uniformly convex Banach space. It is not known (1985) pact if one introduces the distance
if there are Banach spaces which are isomorphic to d(X, Y) = lninf II T 11'11 r- I II
T
none of their hyperplanes. There exist Banach spaces
A series
which are not isomorphic to strictly convex spaces. (*)
Irrespective of the linear nature of normed spaces, it is
possible to consider their topological classification. Two is said to be convergent if there exists a limit S of the
spaces are homeomorphic if a one-to-one continuous sequence of partial sums:
correspondence, such that its inverse is also continuous,
can be established between their elements. An incom-
lim
n_oo
Ils- ~>k I = 0.
k =I

337
BANACH SPACE

If relations: Ji(ej)=8ij. In such a case the system {ek,fk}


is said to be biorthogonal. A set of linear functionals is
said to be total if it annihilates only the zero element of
the series (*) is convergent, and is said in such a case to the space. In each seperable Banach space there exists
be absolutely convergent. A series is said to be uncondi- a complete, minimal system with a total adjoint. Each
tionally convergent if it converges when its terms are element x EX can formally be developed in a series by
arbitrarily rearranged. The sum of an absolutely con- the biorthogonal system:
vergent series is independent of the arrangement of its
00
terms. In the case of series in a finite-dimensional x ~ ~.fk(x)eb
space (and, in particular, for series of numbers) uncon- k =1

ditional and absolute convergence are equivalent. In but in the general case this series is divergent.
infinite-dimensional Banach spaces unconditional r
A system of elements {ek} is said to be a basis in X
convergence follows from absolute convergence but the if each element x EX can be uniquely represented as a
converse is not true in any infinite-dimensional Banach convergent series
space. This is a consequence of the Dvoretskii- Rogers 00

theorem: For all numbers ak ;;"0, subject to the condi- x = ~ akeb ak =ak(x),
k=1
tion ~at < 00, there exists in each infinite-dimensional
Each basis in a Banach space is a complete uniform
Banach space an unconditionally convergent series
minimal system with a total adjoint. The converse is
~Xk such that I Xk II =ab k = 1, 2, .... In the space
not true, as can be seen from the example of the system
Co (and hence also in any Banach space containing a {eint}~oo in crO,2'IT] and LdO,2'IT].
subspace isomorphic to co), for any sequence ak;;"O A basis is said to be unconditional if all its rearrange-
that converges to zero, there exists an unconditionally ments are also bases; otherwise it is said to be condi-
convergent series ~Xb II Xk II =ak' In Lp(S;~, f.L) the tional. The system {eint}~oo in Lp [0,2'IT],p>l,p=2, is
unconditional convergence of the series ~Xk implies a conditional basis. The Haar system is an uncondi-
that 00 tional basis in Lp ' p> 1. There is no unconditional
~ II Xk liS < 00, basis in the spaces C and L \. It is not known (1985)
k =1
whether or not each Banach space contains an infinite-
where
s _
-
{2P (l';;;p';;;2), dimensional subspace with an unconditional basis. Any
(P~2). non-reflexive Banach space with an unconditional basis
contains a subspace isomorphic to 1\ or co.
In a uniformly convex Banach space with convexity
Two normalized bases {e~} and {e~} in two Banach
modulus 8(i) the unconditional convergence of the
spaces X \ and X 2 are said to be equivalent if the
series ~Xk implies that
00
correspondence e~e~, k = 1,2, ... , may be extended to
~o(11 Xk II) < 00. an isomorphism between X \ and X 2. In each of the
k =1
spaces 12 , 1\, Co all normalized unconditional bases are
A series ~Xk is said to be weakly unconditionally equivalent to the natural basis. Bases constructed in
Cauchy if the series of numbers ~ If (xd I converges Banach spaces which have important applications are
for each fEX. Each weakly unconditiona) y Cauchy not always suitable for solving problems, e.g. in the
series in X converges if and only if X contains no sub- theory of operators. T-bases, or summation bases, have
space isomorphic to co. been introduced in this context. Let {tij} r be the
r
A sequence of elements {ek} of a Banach space is matrix of a regular summation method (d. Regular
said to be minimal if each one of its terms lies outside summation methods). The system of elements {en} eX
the closure of x(n) = [ekh""'n' the linear hull of the is said to be a T-basis corresponding to the given sum-
remaining elements. A sequence is said to be uniformly mation method if each x EX can be uniquely
minimal if represented by a series
p(en;x(nJ)~Yllenll, O<y,;;;l, n=1,2, .... oc
X ~ ~akeko
If y = 1, the series is said to be an A uerbach system. In k =I
each n-dimensional Banach space there exists a com- which is summable to x by this method. The tri-
plete Auerbach system {e n }7. It is not known (1985) gonometric system {e 'nl } ~ x in C[O, 27T] is a summation
whether or not a complete Auerbach system exists in basis for the methods of Cesaro and Abel. Each T-
each separable Banach space. For each minimal system basis is a complete minimal (not necessarily uniformly
there exists an adjoint system of linear functionals {j,,}, minimal) system with a total adjoint. The converse is
which is connected with {ek} by the biorthogonality not true. Until recently (the 1970's) one of the principal

33~
BAND OF SEMI-GROUPS

problems of the theory of Banach spaces was the basis sequence of continuous linear mappings of E into F,
problem dealt with by Banach himself: Does a basis and if the sequence un(x) converges in F for all x EM,
exist in each separable Banach space? The question of then Un converges uniformly on any compact subset of
existence of a basis in specifically defined Banach E to a continuous linear mapping v of E into F.
spaces remained open as well. The first example of a References
separable Banach space without a basis was constructed [I] BANACH, S. and STEINHAUS, H.: 'Sur Ie principe de la conden-
in 1972; bases in the spaces cn(Im) and A(D) have sation de singularites', Fund. Math. 9 (1927),50-61.
[2] BOURBAKI, N.: Elements of mathematics. Topological vector
been constructed. spaces, Addison-Wesley, 1977 (translated from the French).
References [3] SCHAEFER, H.H.: Topological vector spaces, Macmillan, 1966.
[I] BANACH, S.: 'Sur les operations dans les ensembles abstraits et A.I. Shtern
leur application aux equations integrales', Fund. Math. 3
(1922), 133-181. Editorial comments.
[2] BANACH, S.S.: A course offunctionol analysis, Kiev, 1948 (in References
Ukrainian). [A 1] KOTHE, G.: Topological vector spaces, 1, Springer, 1969.
[3] DUNFORD, N. and SCHWARTZ, J.T.: Linear operators. General [A2] KELLEY, 1.L. and NAMIOKA, I.: Linear topological spaces,
theory, I, Interscience, 1958. Springer, 1963.
[4] DAY, M.M.: Normed linear spaces, Springer, 1958.
[5] BOURBAKI, N.: Elements of mathematics. Topological vector AMS 1980 Subject Classification: 46A32
spaces, Addison-Wesley, 1977 (translated from the French).
[6] SINGER, I.: Bases in Banach spaces, 12, Springer, 1970-1981. BAND OF SEMI-GROUPS of a given family {Sa} -
[7] LINDENSTRAUSS, J. and TZAFRIRI, L.: Classical Banach spaces,
1-2, Springer, 1977-1979. A semi-group S that has a partition into sub-semi-
[8] DIESTEL, J.J.: Geometry of Banach spaces. Selected topics, groups whose (isomorphism) classes are just the semi-
Springer, 1975. groups Sa, and such that for any Sa, S f3 there is an S y
[9] BEAUZAMY, B.: Introduction to Banach spaces and their
geometry, North-Holland, 1985.
such that Sa S f3 CS y' S is also said to be decomposable
M.l. Kadets into the band of semi-groups Sa. In other words, S has
B.M. Levitan a partition into a band of semi-groups Sa if all the Sa
Editorial comments. The second dual of a space is also are sub-semi-groups of S and if there is a congruence p
called the bidual. on S such that the p-classes are just the Sa. The semi-
References groups Sa are called the components of the given band.
[A 1] SEMANEDl, Z.: Banach spaces of continuous functions, Polish The term 'band of semi-groups' is consistent with the
Sci. Publ, 1971. frequent use of the word 'band' as a synonym of 'semi-
AMS 1980 Subject Classification: 468XX, 46EXX group all elements of which are idempotents', since a
congruence p on a semi-group S determines a partition
BANACH - STEINHAUS THEOREM - A general of S into a band if and only if the quotient semi-group
appellation for several results concerning the linear- S / p is a semi-group of idempotents.
topological properties of the space of continuous linear Many semi-groups are decomposable into a band of
mappings of one linear topological space into another. semi-groups with one or other 'better' property; thus,
Let E and F be locally convex linear topological spaces, the study of their structure is reduced in some measure
where E is a barrelled space, or let E and F be linear to a consideration of the types to which the com-
topological spaces, where E is a Baire space. The fol- ponents of a band belong, and of semi-groups of idem-
lowing propositions are then valid. I) Any subset of the potents (see, e.g. Archimedean semi-group; Completely-
set L(E, F) of continuous linear mappings of E into F simple semi-group; Oifford semi-group; Periodic semi-
which is bounded in the topology of simple conver- group; Separable semi-group).
gence is equicontinuous (the uniform boundedness princi- A band of semi-groups Sa is said to be commutative
ple); 2) If a filter P in L(E, F) contains a set bounded if for the corresponding congruence p the quotient
in the topology of simple convergence, and converges semi-group S / p is commutative; then S / p is a semi-
in the topology of simple convergence to some mapping lattice (in this case, S is frequently called a semi-lattice
v of E into F, then v is a continuous linear mapping of of semi-groups Sa; in particular, if S / p is a chain,
E into F, and P converges uniformly to v on each com- then S is called a chain of semi-groups Sa). A band of
pact subset of E [2], [3]. semi-groups is called rectangular (sometimes matrix) if
These general results make it possible to render the S / p is a rectangular semi-group (see Idempotents,
classical results of S. Banach and H. Steinhaus [I] more semi-group of). Equivalently, if the components of the
precise: Let E and F be Banach spaces and let M be a band can be indexed by pairs of indices Si"A, where i
subset of the second category in E. Then, I) if and A run over certain sets I and A, respectively, such
HCL(E, F) and sup{ II u(x) II: uEH} is finite for all that for any So" Sj/" one has Si"ASJ/" CSi/". Any band of
xEM. then sup {II u II: uEH}<oc; 2) if Un IS a semi-groups is a semi-lattice of rectangular bands, that

339
BAND OF SEMI-GROUPS

is, its components can be arranged into subfamilies so SnJ, ... ,nm , k, then Sn), ... ,nm ; this
that the union of the components of each subfamily is property is known as the step of the bar induction.
a rectangular band of components, and the original If the conditions 1) through 4) above are met, one
semi-group is decomposable into a semi-lattice of these can deduce from the principle of bar induction that
unions (Clifford's theorem [1]). Since the properties of S .
being a semi-group of idempotents, a semi-lattice or a The use of bar induction was proposed by L.E.1.
rectangular semi-group are characterized by identities, Brouwer as an intuitionistic method of reasoning which
for each of the listed properties () there is a finest indicates that the collection of free choice sequences is
congruence on any semi-group S for which the incomplete and to some extent uncountable. It was
corresponding quotient semi-group has the property (), demonstrated, in particular, by S.c. Kleene and,
that is, there exist greatest (or biggest quotient) partitions independently, by AA Markov that it follows from the
of S into a band of semi-groups, into a commutative principle of bar induction (in fact, even from a corol-
band of semi-groups and into a rectangular band of lary of bar induction, the fan theorem) that not all
semi-groups. choice sequences are recursive (d. Fan).
The term strong band concerns special types of bands The forms of bar induction which one started to util-
of semi-groups [4]: For any elements a and b from ize in the foundations of mathematics in the 1960's do
different components, the product ab is a power of one not deal with tuples of natural numbers, but rather
of these elements. An important special case of a with tuples of more complex objects, such as tuples of
strong band, and also a special case of a chain of choice sequences.
semi-groups, is the ordinal sum (or sequentially- Bar induction can be written down in the language
annihilating band): The set of its components {Sa} is of formal intuitionistic mathematical analysis as fol-
totally ordered, and for any Sa, S p such that Sa <S p, lows:
'Vx (R(x)V -,R(x & 'Va 3x R(a(x &
and for any aES a , bESp one has ab=ba=a; the ordi-
nal sum is defined uniquely up to an isomorphism, by & 'Vx (R(xPS(x &
specifying the components and their ordering. & 'Vx ('Vy S(x*'yPS(x:J S(Q).
References References
[I] CLIFFORD, A.H.: 'Bands of semi-groups', Proc. A mer. Math. [I] KLEENE, S.c. and VESLEY, R.E.: Thefoundations of intuition is-
Soc. 5 (1954), 499-504.
tic mathematics: especially in relation to recursive functions,
[2] CLIFFORD, A.H. and PRESTON, G.B.: The algebraic theory of
North-Holland, 1965.
semigroups, I, Amer. Math. Soc., 1961. A. G. Dragalin
[3] LYAPIN, E.S.: Semigroups, Amer. Math. Soc., 1974 (translated
from the Russian). Editorial comments.
[4] SHEVRIN, L.N.: 'Strong bands of semi-groups', lzv. Vyssh. References
Uchebn. Zaved. Mat., no. 6 (1965), 156-165 (in Russian). [A 1] DUMMETJ, M.: Elements of intuitionism, Clarendon Press,
L.N. Shevrin 1977.
[A2] TROELSTRA, A.S.: Choice sequences, Clarendon Press, 1977.
Editorial comments. A congruence on a semi-group 5 is
an equivalence relation such that for all a, b, CES one has AMS 1980 Subject Classification: 03F55
a~b =} ac~bc and ca~cb.
BARBER PARADOX - The same as the antinomy of
AMS 1980 Subject Classification: 20M10 the 'village barber'.
AMS 1980 Subject Classification: 03A05, 00A25
BAR INDUCTION - An inductive way of reasoning,
used in intuitionistic mathematics (cf. Intuitionism), BARBIER THEOREM on curves of constant width - If
which may be described as follows. Let certain proper- the distance between any two parallel supporting
ties Rand S be defined on finite tuples (cf. Tuple) of straight lines to a curve is constant and equal to a. then
natural integers, such that I) the property R is decid- the length of the curve is 1Ta. Discovered by E. Barbier
able, i.e. it is possible to effectively find out whether it in 1860.
A.B. Ivanov
is fulfilled on any tuple <n 1, . . . ,nm > or not; 2) for
Editorial comments. The original work of E. Barbier IS
any choice sequence 0: it is possible to find a tuple of
the form <0:(0), ... ,o:(n -I for which R is fulfilled.
[A 1].
References
Moreover, if 2) is fulfilled, one says that R 'bars' the
[A1] BARBIER. E.: 'Note sur Ie probleme de l'amgullle et Ie leu du
empty tuple < > (hence the name); 3) for any tuple 1T Joint couvert', J Math. Pure Appl 5 (1860), 273-286.
of natural integers, if R(1T), then S(1T) (the so-called [A2] BO""lsF~. T. and FEVHH. W. Theorie der konvexen
basis of the bar induction); and 4) if <n], ... ,n m > is K6rper, Springer. 1934.

a tuple such that for any natural number k one has AMS 1980 Subject Classification: 53A04, 52A 10

340
BARRIER

BARRELLED SPACE - A locally convex linear topo- BARllIER, Lebesgue barrier, in potential theory - A
logical space displaying several properties of Banach function the existence of which is a necessary and suffi-
spaces and Frechet spaces without the metrizability cient condition for the regularity of a boundary point
condition (cf. Banach space; Frechet space). It is one of with respect to the behaviour of a generalized solution
the most extensive class of spaces to which the of the Dirichlet problem at that point (cf. Perron
Banach - Steinhaus theorem applies. Barrelled spaces method; Regular boundary point).
were first introduced by N. Bourbaki. Let D be a domain in a Euclidean space Rn, n ~ 2,
A set A in a vector space E is said to be a balanced and let ~ be a point on its boundary f=aD. A barrier
set if ax EA for all x EA and for all a such that for the point ~ is any function w(x), continuous in the
I a I ~ 1. A balanced set ACE is said to be an absorb- intersection (D U f) n
B of the closed domain D U f
ing set if it absorbs each point of E, i.e. if for each with some ball B =B(R, ~ with centre at ~, which is
x EE there exists an a>O such that ax EA. superharmonic in D n
B and positive in (D U f) B, n
A barrel in a linear topological space is a closed, bal- except at ~, at which it vanishes. For instance, if n ~3
anced, absorbing, convex set. A barrelled space is a and ~ is any boundary point for which there exists a
linear topological space with a locally convex topology closed ball B(R, y) in D U f which meets f only in ~,
in which every barrel is a neighbourhood of zero. one can take as a barrier the harmonic function
Frechet spaces and, in particular, Banach spaces are
1 1
examples of barrelled spaces. Montel spaces (cf. Montel w~(x) = Rn-2 - I X -y In- 2'
space) are an important class of barrelled spaces, and
display remarkable properties. A quotient space of a where R is the radius of B(R, y) and y is its centre.
barrelled space, a direct sum and inductive limits of A barrier in the theory of functions of (several) com-
barrelled spaces are barrelled spaces. Every pointwise-
plex variables is a function the existence of which for all
bounded set of continuous linear mappings of a bar- boundary points of the domain D implies that D is a
domain of holomorphy. Let D be a domain in the com-
relled space into a locally convex linear topological
space is equicontinuous. In a space dual to a barrelled plex space en, n ~ 1, and let ~ be a point of the boun-
space, a bounded set in the weak topology is bounded dary f=aD. Any analytic function fez) in D with a
in the strong topology and relatively compact in the singular point at ~ will then be a barrier at ~. Thus, the
weak topology. function 1 / (z -~) is a barrier for the boundary point ~
of any plane domain DeC. There also exists a barrier
References
[1] BOURBAKI, N.: Elements of mathematics. Topological vector
at any point ~=(~l' ... '~n) of the boundary of the ball

= {z = (Zh ,zn): I
spaces, Addison-Wesley, 1977 (translated from the French).
[2] EDWARDS, R.E.: Functional analysis: theory and applications., D Zi 1 2 <R 2 },
Holt, Rinehardt, Winston, 1965. I~l

V. M. Tikhomirov
e.g. the function I / (~l Z I + ... +Inzn - R 2).
Editorial comments. Barrelled spaces are the most A barrier exists at a boundary point ~ of a domain D
extensive class of locally convex spaces to which the if there is an analytic function defined in D that is
Banach-Steinhaus theorem can be extended. They were
unbounded at ~, i.e. is such that for some sequence of
first introduced in [A4).
points {z(k)} CD which converges to ~ one has:
A not necessarily balanced set A in E is called absorbing
if for every xEE there is an ao such that xEaA for all lim
k ..... oo
If (Z(k) I = + 00.
I a I ;;"aa For the dual of a barrelled space the following The converse is true for domains D C en in the fol-
four statements are equivalent: 1) A is weakly bounded; 2)
A is strongly bounded; 3) A is equi-continuous; and 4) A is
lowing strong form: For any set E of boundary points
weakly compact. The last statement follows from the of a domain D at which a barrier exists, one can find a
stronger statement that the dual of a barrelled space is function holomorphic in D which is unbounded at all
quasi-complete for any a-topology. (For the last notion see points of E. If E is everywhere dense in the boundary
Topological vector space; Space of mappings, topologi- of D, then D is a domain of holomorphy.
cal.) References
References [1] COURANT, R. and HILBERT, D.: Methods of mathematical phy-
[A 1] ScHAEFER, H.H.: Topological vector spaces, Macmillan, sics. Partial differential equations, 2, Interscience, 1962
(translated from the German).
1966.
[A2] KELLEY, J.L. and NAMIOKA, I.: Linear topological spaces, [2] VLADIMIROV, V.S.: Methods of the theory offunctions of several
Springer, 1963. complex variables, M.I.T., 1966, Chapt. 3 (translated from
the Russian).
[A3] KOTHE, G.: Topological vector spaces, 1, Springer, 1969.
[A4] BoURBAKI, N.: 'Sur certains espaces vectoriels topologiques', [3] SHABAT, B.Y.: Introduction to complex analysis, 1-2, Moscow,
1985, Chapt. 3 (in Russian).
Ann. Inst. Fourier 2 (1950), 5-16.
E.D. Solomentsev
AMS 1980 Subject Classification: 46A07 M. Shirinbekov

341
BARRIER

Editorial comments. Good English references for the of homogeneous coordinates; they are affine invariants.
Lebesgue barrier are [A1] and [A2]. Barycentric coordinates of a simplex are used in alge-
References braic topology [2]. Barycentric coordinates of a point of
[A1] HAYMAN, W.K. and KENNEDY, P.B.: Subharmonic functions, an n-dimensional simplex a with respect to its vertices
1, Cambridge Univ. Press, 1976.
A a, ... ,An is the name given to its (ordinary) Carte-
[A2] HELMS, L.L.: Introduction to potential theory, Acad. Press,
1975 (translated from the German). sian coordinates III the basis of the vectors
OAa, ... ,0A n , where 0 is any point that does not lie
AMS 1980 Subject Classification: 31825, 31 A25,
in the n-dimensional subspace carrying a (if it is con-
32A40
sidered that a lies in some Euclidean space, then the
definition does not depend on the point 0), or to pro-
BARTLETf TEST - A statistical test in the
jective coordinates with respect to A a, ... ,An in the
Behrens- Fisher problem. It refers to the case of two
projective completion of the subspace containing a. The
samples Xl, . . . ,Xn , EN(a\, aT) and
barycentric coordinates of the points of a simplex are
y\, ... 'Yn, EN(a2' a~) having equal size, n\ =n2 =n. non-negative and their sum is equal to one. If the i-th
The critical regions are given by the inequality barycentric coordinate becomes zero, this means that
Vn i-- the point lies at the side of the simplex a opposite to
----~--~~~===-------~ > c the vertex Ai' This makes it possible to consider the
barycentric coordinates of the points of a geometric
complex with respect to all of its vertices. Barycentric
for some constant c >0, where coordinates are used to construct the barycentric subdi-
_ 1 n _ 1 n vision of a complex.
x = - ~Xi' Y = - ~Yi'
n i=1 n 1=1 Barycentric coordinates of abstract complexes are
The left-hand side of the inequality has a Student formally defined in an analogous manner [3].
distribution with n - 1 degrees of freedom, connecting References
the significance level of the test to the constant c. [I] MOBIUS. A.F.: 'Der barycentrische Kalkul', in Gesammelte
Werke, Vol. I, Hirzel. Leipzig, \885.
Bartlett's test is a special case of the Scheffe test and [2] PONTRYAGIN. L.S.: Grundzuge der kombinatorischen Topologie,
has analogous extremal properties, but was presented Deutsch. Verlag Wissenschaft., 1956 (translated from the Rus-
by M.S. Bartlett [1] prior to this solution. A second test sian).
[3] SPANIER, E.H.: Algebraic topology, MacGraw-Hill, 1966.
of Bartlett is connected with the comparison of the
E. G. Sklyarenko
variances of many samples.
AMS 1980 Subject Classification: 51 N1 0, 51 N20
References
(1) BARTLETT, M.S.: 'The information available in small samples', BARYCENTRIC SUBDIVISION of a geometric com-
Proc. Cambridge Phi/os. Soc. (1) 32, no. 3 (1936), 560-566.
plex K - A complex K I obtained by replacing the sim-
0. V. Shalaevskif
plices of K by smaller ones by means of the following
AMS 1980 Subject Classification: 62FXX procedure. Each one-dimensional simplex (segment) is
halved. On the assumption that all simplices of dimen-
BARYCENTRIC COORDINATES - CoordiT1ates of a sion ~n - I are already subdivided. the subdivision of
point in an n-dimensional vector space En, v'th respect any n-dimensional simplex a is defined by means of
to some fixed system Po, ... ,pn of points that do not cones over the simplices of the boundary of a with a
lie in an (n -I )-dimensional subspace. Every point common vertex that is the barycentre of the simplex a,
x EE n can uniquely be written as i.e. the point with barycentric coordinates 1/ (n + I).
x = I\)PII + ... +t..II PII' The vertices of the resulting complex K I are in a one-
where Ao, ... ,An are real numbers satisfying the condi- to-one correspondence with the simplices of the com-
tion Ao + ... + An = I. The point x is by definition the plex K, while the simplices of the complex K I are in
centre of gravity of the masses Ao, ... , An located at such a correspondence with inclusion-ordered finite
the points po . ... ,pn' The numbers Ao, ... ,An are tuples of simplices from K. The formal definition of a
called the barl'centric coordinates of the point x; the barycentric subdivision for the case of an abstract com-
point with barycentric coordinates I / (n + I) is called plex is analogous.
E. C. Sklvarenko
the barvcentre. Barycentric coordinates were introduced Editorial comments.
by A.F. Mobius in 1827. [I]. as an answer to the ques- References
tion about the masses to be placed at the vertices of a [A1 j LA!-.IOTKI, K. SemislmpilZla/e aigebralsche Top%gle,
Springer, 1968
triangle so that a given point is the centre of gravity of
these masses. Barycentric coordinates are a special case AMS 1980 Subject Classification 51 N 10, 51 N20

342
BASE CHANGE

BASE of a topological space X, base of a topology, if and only if it is paracompact with a uniform base (a
basis of a topology, open base - A family IB of open sub- Kolmogorovor To-space with a k-uniform base).
sets of X such that each open subset G C X is a union A base IB of the space X is called regular if for each
of sub collections U C lB. The concept of a base is a fun- point x EX and an arbitrary neighbourhood Ox of it
damental concept in topology: in many problems con- there exists a neighbourhood O~ such that the set of all
cerned with open sets of some space it is sufficient to the elements of the base which intersect both with O~
restrict the considerations to its base. A space can have and X \ Ox is finite. An accessible or T I-space is
many bases, the largest one of which is the family of all metrizable if and only if it has a regular base.
open sets. The minimum of the cardinalities of all bases A generalization of the concept of a base is the so-
is called the weight of the topological space X. In a called w-base (lattice base), which is a family IB of open
space of weight 7" there exists an everywhere-dense set sets in the space X such that each non-empty open set
of cardinality ';;;;7". Spaces with a countable base are in X contains a non-empty set from IB, i.e. IB is dense
also referred to as spaces satisfYing the second axiom of in X according to Hausdorff. All bases are w-bases, but
countability. The dual concept of a closed base, formed the converse is not true; thus, the set Z+ in the
by the complements of the elements of a base, is not Stone- Cech compactification of the set of natural
used to any significant extent. numbers in Z+ forms only a w-base.
A local base of a space X at a point x EX (a base of References
the point x) is a family IB(x) of open sets of X with the [I] ALEKSANDROV, P.S.: EinfUhrung in die Mengenlehre und die
following property: For any neighbourhood Ox of x it Theorie der reelen Funktionen, Deutsch. Verlag Wissenschaft.,
1956 (translated from the Russian).
is possible to find an element V E IB(x) such that [2] URYSOHN, P.S. [P.S. URYSON]: Works on topology and other
x EVe Ox' Spaces with a countable local base at every fields of mathematics, Vol. 1-2, Leningrad, 1951 (in Russian).
point are also referred to as spaces satisfYing the first [3] ALEKSANDROV, P.S. and PASYNKOV, B.A.: An introduction to
the theory of topological spaces and general dimension theory,
axiom of countability. A family IB of open sets in X is a
Moscow, 1973 (in Russian).
base if and only if it is a local base of each one of its [4] ARKHANGEL'SKll, A.V. and PONOMAREV, V.I.: Fundamentals of
points x EX. general topology: problems and exercises, Reidel, 1984
Let m, n be cardinal numbers. A base IB of the space (translated from the Russian).
[5] BOURBAKI, N.: Elements of mathematics. General topology,
X is called an m-point base if each point x EX belongs Addison-Wesley, 1966 (translated from the French).
to at most m elements of the family IB; in particular, if A.A. Mal'tsev
m = 1, the base is called disjoint; if m is finite, it is
Editorial comments. Closed bases are useful in com-
called bounded point finite; and if m = ~o, it is called pactification theory, cf. Compactification.
point countable. Besides the notions of a bounded point-finite base and a
A base IB of the space X is called m-local if each bounded local-finite base one also uses that of a point-finite
point x EX has a neighbourhood Ox intersecting with base and a local-finite base. A base (or any family of sub-
at most m elements of the family IB; in particular, if sets ~) is called point finite if every point x belongs to fin-
m = 1, the base is referred to as discrete; if m is finite, it itely many members of ~, i.e. if ~x={BE~: xEB} is finite
is called bounded locally finite; and if m = ~o, it is called for every x. Note that the families ~x can have arbitrary
locally countable. A base IB is called an (n - m)-point large finite cardinalities, in contrast to the definition of
base (or an (n-m)-local base) if it is a union of a set of bounded point finiteness, when the cardinalities of ~x are
cardinality n of m-point (m-Iocal) bases; examples are, bounded by a fixed finite m. Similar remarks apply to local
finiteness.
for n=~o, a-disjoint, a-point finite, a-discrete and a-
locally finite bases. AMS 1980 Subject Classification: 54AXX
These concepts are used mainly in the criteria of
metrizable spaces. Thus, a regular space with a count- BASE CHANGE, change of base - A category-
able base, or satisfying the first axiom of countability theoretical construction; special cases are the concept
and with a point countable base, is metrizable; a regu- of an induced fibration in topology, and extension of
lar space with a a-discrete or a-locally finite base is the ring of scalars in the theory of modules.
metrizable (the converse proposition is true in the Let C be a category with fib red products and let
former case only). g: S I ---"S be a morphism of C. A base change by means
A base IB of the space X is called uniform (k-uniform) of g is a functor from the category of S-objects (i.e. the
if for each point x EX (each compact subset F) and for category of morphism f: X ---"S, where X is an object of
each one of the neighbourhoods Ox (OF) only a finite C) to the category of S I -objects, taking an S-object
number of elements of the base contain x (intersect f: X ---"S to the S I-object fl : X I ---"S I, where
with F) and at the same time intersect with the X I = X X sS I and fl is projection onto the second fac-
complement X\ Ox (X\ OF)' A space X is metrizable tor. The morphism g is then called the base-change mor-

343
BASE CHANGE

phism. One also says that X I is obtained from X by [2] GROTHENDIECK, A and DIEUDONN~, J.: 'Elements de
base change. geometrie algebrique', Publ. Math. IHES 11 (1961).
[3] ARTIN, M., GROTHENDIECK. A and VERDIER, J.L. (EDS.):
A special case of a base change is the concept of a 'Theorie des topos et cohomologie etale des scMmas', in Sem
fibre of a morphism f: X ~S of schemes S: The fibre of Geom Alg., Vol. 4, Springer, 1973.
the morphism f over a point s E S is the scheme v.l. Danilov
AMS 1980 Subject Classification: 18-XX
Xs = xXss,
i.e. the scheme obtained from X by base change via the BASE OF A DEFORMATION - A conjugate net on a
natural morphism S~S. A similar definition yields the surface F and its deformation r
outside their points of
geometric fibre Xs; it is obtained by base change via the congruence. The base of a deformation is characterized
morphism Spec K ~S associated with a geometric point by the fact that the bend - the relation between the
of S, where K is an algebraically closed field. Many normal curvatures k and k* at isometrically-
properties of the S-scheme X are preserved under a corresponding points of F and r along corresponding
base change. The inverse problem - to infer the pro- directions - has extremal values along the directions
perties of a scheme X from those of the schemes of the base of the deformation.
obtained from X by base change - is considered in
References
descent theory (see also [3]). [I] KAGAN, V.F.: Foundations of the theory of surfaces in a tensor
Let fl: X I ~S I be the morphism obtained from setting, Moscow-Leningrad, 1948, Chapt. 2 (in Russian).
f: x~S via a morphism g: SI ~S, so that one has a M.l. Voftsekhovskif
Cartesian square g,
Editorial comments. For more references on the topic of
XI ~X
deforming or bending surfaces, cf. the article Deformation,
!I~ U isometric.
SI ~S
)( AMS 1980 Subject Classification: 53A07
Let F be a sheaf of sets on X. Then there exists a
natural sheaf mapping tfi:gj.(F)~fl.gI(F). If Fis a BASIC COMMUTATOR, regular commutator - An
sheaf of Abelian groups, then for every q ~o there object inductively constructed from the elements of a
exists a natural sheaf homomorphism given set R and from brackets, in the following manner.
The elements of R are considered by definition to be
>fq:g'(Rqf.(F)) ~ Rq!I,(g'I(F)).
basic commutators of length 1, and they are given an
Under these conditions, tfi and tfiq are also called base- arbitrary total order. The basic commutators of length
change morphisms. It is usually said that the base- n, where where n > 1 is an integer, are defined and
change theorem is valid if tfi (or tfiq) is an isomorphism. ordered as follows. If a, b are basic commutators of
In other words, the base-change theorem is a proposi- lengths smaller than n, then [ab] is considered to be a
tion about the compatibility (commutability) of the basic commutator of length n if and only if the follow-
functors Rqj. with the base-change functor. In particu- ing conditions are met: 1) a, b are basic commutators
lar, if g is the imbedding of a point s ES, the base- of lengths k and I, respectively, and k +I=n; 2) a>b;
change theorem states that there exists a natural iso- and 3) if a =[cd], then d~b. The basic commutators of
morphism (Rqf.(F))x :::::=.Hq(Xx, F IxJ bel),'een the length not exceeding n thus obtained are arbitrarily
fibre of the q-th direct image of the sheaf F and the q- ordered, subject to the condition that lab ]>b, while
dimensional cohomology group of the fibre of the mor- preserving the order of the basic commutators of
phism f The base-change theorem is valid in the lengths less than n. Any set of basic commutators con-
following situations: 1) f is a proper mapping of structed in this way is a base of the free Lie algebra
paracompact topological spaces, S is a locally compact with R as set of free generators [1].
space [1]; 2) f is a separable quasi-compact morphism References
of schemes, g is a flat morphism, F is a quasi-coherent [I] SHIRSHOY, AI.: 'On bases of free Lie algebras', Algebra i
sheaf of Ox-modules (the comparison theorem for the Logika 1, no. I (1962), 14-19 (in Russian)v M G h k
Lorc a ov
U. .
cohomology of ordinary and formal schemes - see [2]
Editorial comments. Let M(R) be the free magma on R,
- can also be interpreted as a base-change theorem);
i.e. the set of all non-commutative and non-associative
or 3) f is a proper morphism of schemes, F is a torsion words in the alphabet R. The basic commutators are to be
sheaf in the etale topology. Some other cases in which seen as a subset of M(R). This subset is also often called a
base-change theorems are valid are considered in [3]. P. Hall set. The identity on R induces a mapping
References </>: M(R)-,>LK(R), where LK(R) is the free Lie algebra on R
[I] GODEMENT, R.: Topologie algebrique et theorie des faisceaux, over the ring K. Let K(R) be a P. Hall set in M(R) (i.e. a set
Hermann, 1958. of basic commutators), then <P(H(R)) is a basis of the free

344
BASIS

K-module LK(R), called a P. Hall basis. Other useful bases erates it. Generation here means that by application of
of LK(R) are the Chen- Fox- Lyndon basis and the operations of a certain class 0 to elements b EB it is
Shirshov basis (these two are essentially the same), and the possible to obtain any element x EX. This concept is
Spitzer- Foata basis; cf. [A4] for these. Let R be finite of related to the concept of dependence: By means of
cardinality r= #R. Let I,(n) be the number of basic commu- operations from 0 the elements of X become dependent
tators on R of length n. Then
on the elements of B. Minimality means that no proper
I,(n) = .l ~p.(d)rn/d subset BleB generates X. In a certain sense this pro-
n dl n
perty causes the elements of B to be independent:
where p.: {1, 2, ... }~{ -1,0, 1} is the Mobius function, None of the elements b EB is generated by the other
defined by p.(1) = 1, p.(k) = 0 if k is divisible by a square, and elements of B. For instance, the set of all natural
p.(p, ... Pm) =(-1)m if p" ... , Pm are distinct prime
numbers Zo has the unique element 0 as basis and is
numbers.
generated from it by the operation of immediate succes-
References sion and its iteration. The set of all natural numbers
[A1] BOURBAKI, N.: Groupes etalgebres de Lie, Hermann, 1972,
Chapt. 2; 3.
> 1 is generated by the operation of multiplication
[A2] HALL, M., JR.: The theory of groups, Macmillan, 1959. from the basis consisting of all prime numbers. A basis
[A3] MAGNUS, W., KARRAss, A. and SOLITAR, D.: Combinatorial of the algebra of quaternions consists of the four ele-
group theory, Interscience, 1966.
ments {I, i, j, k} if the generating operations consist of
[A4] VlENNOT, G.: Algebres de Lie libres et monoides libres, Lect.
Notes in Math., 691, Springer, 1978. addition and of multiplication by real numbers; if, in
addition to these operations, one also includes multipli-
AMS 1980 Subject Classification: 17BXX, 08B20
cation of quaternions, the basis will consist of three ele-
BASIC SET of a linear system - The set of points of ments only - {l, i, j} (because k = ij).
an algebraic variety (or of a scheme) X which belong to A basis of the natural numbers of order k is a subse-
all the divisors of the movable part of the given linear quence 0 of the set Zo of natural numbers including 0,
system L on X. which, as a result of k-fold addition to itself (the gen-
Example. Let erating operation) yields all of Zo. This means that any
natural number n can be represented in the form
AoFn(xo,Xj,X2)+AIGn(XO,Xj,X2) = 0
be a pencil of n-th order curves on the projective plane.
The basic set of this pencil then consists of the set of where ai EO. For example, every natural number is a
common zeros of the forms F' and G', where sum of four squares of natural numbers (Lagrange's
F'H = Fn, G'H = Gn, theorem), i.e. the sequence of squares is a basis of Zo of
and H is the greatest common divisor of the forms Fn order 4. In general, the sequence of m-th powers of
and Gn natural numbers is a basis of Zo (Hilbert's theorem), the
U cf>L: X~P(L) is the rational mapping defined by L, order of which has been estimated by the Vinogradov
then the basic set of L is the set of points of indeter- method. The concept of a basis of Zo has been general-
minacy of cf>L. A basic set has the structure of a closed ized to the case of arbitrary sequences of numbers, i.e.
sub scheme B in X, defined as the intersection of all functions on Zo.
divisors of the movable part of the linear system. The A set X always contains a generating set (in the
removal of the points of indeterminacy of cf>L can be trivial case: X generates X), but minimality may prove
reduced to the trivialization of the coherent sheaf of to be principally impossible (such a situation is typical
ideals defining the sub scheme B (cf. Birational of classes 0 containing infinite-place operations, in par-
geometry). ticular in topological structures, lattices, etc.). For this
For any linear system without fixed components L reason the minimality condition is replaced by a
on a smooth projective surface F there exists an integer weaker requirement: A basis is a generating set of
no such that if n >no, then the basic set of the com- minimal cardinality. In this context a basis B is defined
plete linear system I nL I is empty (Zariski's theorem). as a parametrized set (or population), i.e. as a function
This is not true in the multi-dimensional case. b(t) on a set of indices T with values in X, such that
References b(T)=B; the cardinality of T is sometimes called as
[1) Algebraic surfaces, Moscow, 1965 (in Russian). the dimension (or rank) of the basis of X. For example,
[2) HARTSHORNE, R.: Algebraic geometry, Springer, 1977. a countable everywhere-dense set B in a separable
V.A. Iskovskikh topological space P may be considered as a basis for it;
AMS 1980 Subject Classification: 14C20 P is generated from B by the closure operation (which,
incidentally, is related to generation in more general
BASIS of a set X - A minimal subset B that gen- cases as well, see below).

345
BASIS

A basis for a topology of a topological space X (a O{1x(A)} = Jy(O{A}).


base) is a basis IS of the set of all open subsets in X; An algebraic system with a free basis is said to be free.
the generation is effected by taking unions of elements A typical example is a basis of a (unitary) module M
of IS. over a ring K, that is, a free family of elements from M
A basis of a Boolean algebra m: (a dual base of m: in generating M [3]. Here, a family A = { at: t E T} of ele-
the sense of Tarski) is a dense set S (of minimal cardi- ments of a K-module M is said to be free if ~~tat = 0
nality) in m:; the generation of m: from S (and hence S (where ~t =0 for all except a finite number of indices t)
itself) is determined by the condition s~a = V (which
implies that ~t =0 for all t, and the generation is real-
is equivalent to s Ca), where s ES, a Em:, V is the unit
ized by representing the elements x as linear combina-
of m: and '~' is the operation of implication. One also
tions of the elements at: There exists a set (dependent
introduces in an analogous manner a basis for a filter
on x) of elements ~t EK such that ~t =0 for all except a
v as a set S such that for an arbitrary a E v there finite number of indices t, and such that the decompo-
exists an SES with s Ca.
sition
More special cases of bases of a set X are introduced x = ~~tat
according to the following procedure. Let B (X) be the is valid (i.e. X is the linear envelope of A). In this sense,
Boolean algebra of X, i.e. the set of all its subsets. A the basis M is free basis; the converse proposition is
generating operator (or a closure operator) J is a map- also true. Thus, the set of periods of a doubly-periodic
ping of B(X) into itself such that if A CB, then function f of one complex variable, which is a discrete
J(A)CJ(B); A CJ(A); JJ(A)=J(A). Abelian group (and hence a module over the ring Z),
An element x E X is generated by a set A if x EJ (A); has a free baSis, called the period basis of f; it consists
in particular, A generates X if J (A) = X. A minimal set of two so-called primitive periods. A period basis of an
B possessing this property is said to be a basis of X Abelian function of several complex variables is defined
defined by the operator J. A generating operator J is of in a similar manner.
finite type if, for arbitrary A cX and x CX, it follows If K is a skew-field, all bases (in the previous sense)
from x EJ (A) that x EJ (A 0) for a certain finite subset are free. On the contrary, there exist modules without
A 0 CA; a generating operator J has the property of sub- a free basis; these include, for example, the non-
stitution if, for any y, Z EX and A C X, both Y fl.J (A) principal ideals in an integral domain K, considered as
and y EJ (A U {z }) imply that Z EJ (A U (y }). A gen- a K-module.
erating operator J of finite type with the substitution A basis of a vector space X over a field K is a (free)
property defines a dependence relation on X, i.e. a sub- basis of the unitary module which underlies X. In a
division of B (X) into two classes - dependent and similar manner, a basis of an algebra A over a field K is
independent sets; a set A is said to be dependent if a basis of the vector space underlying A. All bases of a
y EJ (A \y) for some YEA, and is said to be indepen- given vector space X have the same cardinality, which
dent if y fl.J(A \y) for any y EA. Therefore, A is depen- is equal to the cardinality of T; the latter is called the
dent (independent) if and only if some (arbitrary) non- algebraic dimension of X. Each element x EX can be
empty finite subset(s) A 0 CA is dependent (are represented as a linear combination of basis elements in
independent). a unique way. The elements ~t(x)EK, which are linear
For a set B to be a basis of the set X it) necessary functionals on X, are called the components (coordi-
and sufficient for B to be an independent generating nates) of x in the given basis {ad.
set for X, or else, a maximal independent set in X. A set A is a basis in X if and only if A is a maximal
If A is an arbitrary independent set, and C is an X- (with respect to inclusion) free set in X.
generating set containing A, then there exists a basis B The mapping
in X such that A C B C C. In particular, X always has a
basis, and any two bases of it have the same cardinal- where ~X<t) = ~t(x) if ~I is the value of the t-th com-
ity. ponent of x in the basis A, and 0 otherwise, is called
In algebraic systems X an important role is played the basis mapping; it is a linear injective mapping of X
by the concept of the so-called free basis B, which is into the space KT of functions on T with values in K.
characterized by the following property: Any mapping In this case the image :::( X) consists of all functions
of B C X into any algebraic system Y (of the same sig- with a finite number of non-zero values (functions of
nature) may be extended to a (unique) finite support). This interpretation permits one to
(homo)morphism from X into Y or, which is the same define a generalized basis of a vector space X over a
thing, for any (homo )morphism B: X ~ Y and any set field K as a bijective linear mapping from it to some
A C X, the generating operators J x and J y satisfy the subspace K(T) of the space KT of functions on T with
condition: values in K, where T is some suitably chosen set. How-

346
BASIS

ever, unless additional restrictions (e.g. an order) and where ~x<t)=~t(x), so that, symbolically,
additional structures (e.g. a topology) are imposed on {~t(X)}=K(n and {~X<T)}=X. Owing to its concrete,
T, and corresponding compatible conditions on K (n effective definition, the structure of K (n is simpler
are introduced, the concept of a generalized basis is sel- and more illustrative than that of the abstractly given
dom of use in practice. X. For instance, an algebraic basis of an infinite-
A basis of a vector space X is sometimes called an dimensional Banach space is not countable, while in a
algebraic basis; in this way it is stressed that there is no number of cases, if the concept of a basis is suitably
connection with additional structures on X, even if they generalized, the cardinality of T is substantially smaller,
are compatible with its vector structure. and K (n simplifies at the same time.
A Hamel basis is a basis of the field of real numbers The space K(n contains all functions of finite sup-
R, considered as a vector space over the field of port, and the set of elements of the basis {at} is the
rational numbers. It was introduced by G. Hamel [4] to bijective inverse image of the set of functions {~t(s)}
obtain a discontinuous solution of the functional equa- with only one non-zero value which is equal to one:
tion f(x +y)= f(x)+ f(Y); the graph of its solution is at = E-l[~t(s)l,
everywhere dense in the plane R2. To each almost-
periodic function corresponds some countable Hamel where ~t(s)=l if t=s, and ~t(s)=O if t=l=s. In other
words, at is the generator of a one-dimensional sub-
basis /3 such that each Fourier exponent An of this
space At which is complementary in X to the hyper-
function belongs to the linear envelope of /3. The ele-
plane defined by the equation Mx)=O.
ments of /3 may be so chosen that they belong to a
Thus, the role of the basis {at} is to organize, out of
sequence {Ai}; the set /3 is said to be a basis of the
the set of components ~t(x) which constitute the image
almost-periodic functions. An analogous basis has been
of x under the basis mapping, a summable (in some
constructed in a ring containing a skew-field P and
sense) set {~t(x)at}, i.e. a basis 'decomposes' a space X
which has the unit of P as its own unit. An algebraic
into a (generalized) direct sum of one-dimensional sub-
basis of an arbitrary vector space is also sometimes
spaces:
referred to as a Hamel basis.
A topological basis (a basis of a topological vector
A basis is defined in a similar manner in vector spaces
space X over a field K) is a set A ={at : tET} eX with
with a uniform, limit (pseudo-topological), linear (L-),
properties and functions analogous to those of the alge-
proximity, or other complementary structure.
braic basis of the vector space. The concept of a topo-
Generalizations of the concept of a basis may be and
logical basis, which is one of the most important ones
in fact have been given in various directions. Thus, the
in functional analysis, generalizes the concept of an
introduction of a topology and a measure on T leads to
algebraic basis with regard to the topological structure
the concept of the so-called continuous sum of elements
of X and makes it possible to obtain, for each element
from X and to corresponding integral representations;
X, its decomposition with respect to the basis {at},
the decomposition of the space X into (not necessarily
which is moreover unique, i.e. a representation of x as one-dimensional) components is used in the spectral
a limit (in some sense) of linear combinations of ele- theory of linear operators; the consideration of arbi-
ments at: trary topological algebras over a field K (e.g. algebras
x = lim ~~t(x)at, of measures on T with values in K or even in X, alge-
where ~t(x) are linear functionals on X with values in bras of projection operators, etc.) instead of K (n
K, called the components of x in the basis A, or the makes it possible to concretize many notions of
coefficients of the decomposition of x with respect to the abstract duality for topological vector spaces and, in
basis A. Clearly, for the decomposition of an arbitrary particular, to employ the well-developed apparatus of
x to exist, A must be a complete set in X, and for such the theory of characters.
a decomposition to be unique (i.e. for the zero element A countable basis, which is the most extensively stu-
of X to have all components equal to zero), A must be died and, from the practical point of view, the most
a topologically free set in X. important example of a basis, is a sequence {ai} of ele-
The sense and the practical significance of a topolog- ments of a space X such that each element x is in
ical basis (which will be simply denoted as a 'basis' in unique correspondence with its series expansion with
what follows) is to establish a bijective linear mapping respect to the basis {ai}
of X, called the basis mapping, ::: into some (depending
~~i(x)ai' ~i(X)EK,
on X) space K (n of functions with values in K,
defined on a (topological) space T, viz.: which (in the topology of X) converges to x. Here,
T=Z, and there exists a natural order in it. A count-
E(x):x E X ~ ~At) E K(T), able basis is often simply called a 'basis'. A weak count-

347
BASIS

able basis is defined in an analogous manner if weak tional basis; in the space C[a, b] of continuous func-
convergence of the expansion is understood. For tions on the interval [a, b] no basis can be uncondi-
instance, the functions e ikt , k EZ, form a basis in the tional. An orthonormal countable basis of a Hilbert
spaces Lp ' I <p < 00 (periodic functions absolutely space is an unconditional basis. A Banach space with
summable of degree p); on the contrary, these functions an unconditional basis is weakly complete (accordingly,
do not form a basis in the spaces L I, L 00 (measurable it has a separable dual space) if and only if it contains
functions which almost everywhere coincide with no subspace isomorphic to Co (or, correspondingly, II)'
bounded functions) or C I (continuous periodic func- Two bases {ai} and {bi } of the Banach spaces X and
tions). A necessary, but by far not sufficient, condition Y, respectively, are said to be equivalent if there exists a
for the existence of a countable basis is the separability bijective linear mapping T: ai~bi that can be extended
of X (e.g. a countable basis cannot exist in the space of to an isomorphism between X and Y; these bases are
measurable functions on an interval [a, b] with values said to be quasi-eqUivalent if they become equivalent as
in R). Moreover, the space 100 of bounded sequences, a result of a certain rearrangement and normalization
not being separable in the topology of 100 , has no of the elements of one of them. In each of the spaces,
countable basis, but the elements ai = {~ik}, where 11,1 2 , Co all normalized unconditional bases are
~ik = I if i =k, and ~ik =0 if i=Fk, form a basis in the equivalent. However, there exist normalized bases not
weak topology a(loo, II)' The question of the existence equivalent to orthonormal ones.
of a countable basis in separable Banach spaces (the A summable basis - a generalization of the concept
basis problem) has been negatively solved [6]. The of an unconditional basis corresponding to a set T of
analogous problem for nuclear spaces also has a nega- arbitrary cardinality and becoming identical with it if
tive solution [7]. T=Z - is a set A ={at : tET} such that for an arbi-
A countable basis is, however, not always 'well- trary element x EX there exists a set of linear combina-
suited' for applications. For example, the components tions (partial sums) of elements from A, which is called
~t(x) may be discontinuous, the expansion of x need a generalized decomposition of x, which is summable to
not converge unconditionally, etc. In this connection x. This means that for any neighbourhood U ex of
one puts restrictions on the basis or introduces general- zero it is possible to find a finite subset Au eA such
izations of it. that for any finite set A' =:J A u the relation
A basis of countable type is one of the generalizations
of the concept of a countable basis in which, although [ ~,~tat - x] EO U,
tEA
T is not countable, nevertheless the decomposition of is true, i.e. when the partial sums form a Cauchy sys-
x EX with respect to it has a natural definition: the tem (Cauchy filter). For instance, an arbitrary ortho-
corresponding space K (T) consists of functions with normal basis of a Hilbert space is a summable basis. A
countable support. For instance, a complete orthonor- weakly summable basis is defined in a similar way. A
mal set {ar) in a Hilbert space H is a basis; if x EH, totally summable basis is a summable basis such that
then ~t(x)= <x, at> (where <','> is the scalar pro- there exists a bounded set B for which the set of semi-
duct in H) for all (except possibly a countable set of) norms {pB(~tal)} is summable. A totally summable
indices t E T, and the series ~~Ial converges to x. The basis is at most countable. In a dual nuclear space all
basis mapping is determined by the orthog() lal projec- weakly summable bases are totally summable.
tions onto the closed subspaces generated oy the ele- An absolute basis (absolutely summable basis) is a
ments al . A basis of the space AP of all complex-valued summable basis of a locally convex space over a
almost-periodic functions on R consists of the functions normed field such that for any neighbourhood U of
eilA; here, T=R, K(T) is the set of countably-valued zero and for each t E T the family of semi-norms
functions, and the basis mapping is defined by the for- {p u( al )} is summable. All unconditional countable
mula: fT bases are absolute, i.e. the series ~ I ~Jx) Ip(ai)
:=:[x (A)] = r-----ox
lim -2 1 f x (A)e IlA dA. converges for all x EX and all continuous semi-norms
T T
pC} Of all Banach spaces only the space II has an
An unconditional basis is a countable basis in a space absolute countable basis. If a Frechet space has an
X such that the decomposition of any element x con- absolute basis, all its unconditional bases are absolute.
verges unconditionally (i.e. the sum of the series does In nuclear Frechet spaces any countable basis (if it
not change if an arbitrary number of its terms is rear- exists) is absolute [13].
ranged). For instance, in Co (sequences converging to A Schauder basis is a basis {a l : t E T} of a space X
zero) and lp (sequences sum mabIe of degree p, such that the basis mapping defined by it is continuous
I ~p < (0) the elements a, = {8,k} form an uncondi- (and is therefore an isomorphism onto some space
BATEMAN METHOD

K(T), i.e. a basis in which the components ~t(x) for (1974),304-313. (Funktsional. Analiz. i Prilozhen. 8, no. 4
any x EX and, in particular, the coefficients of the (1974), 35-47)
[8] EDWARDS, R.E.: Functional analysis: theory and applications.,
decomposition of x with respect to this basis, are con- Holt, Rinehardt, Winston, 1965.
tinuous functionals on X. This basis was first defined [9] DIEUDONNE, J.: 'Sur les espaces de Kothe', J. d'Analyse Math.
by 1. Schauder [5] for the case T=Z. The concept of a 1 (1951), 81-115.
[10] ARSOVE, M.G.: 'The Paley-Wiener theorem in metric linear
Schauder basis is the most important of all modifica-
spaces', Pacific J. Math. 10 (1960), 365-379.
tions of the concept of a basis. [II] BESSAGA, C. and PELCZYNSKl, A.: 'Spaces of continuous func-
A Schauder basis is characterized by the fact that tions IV', Studia Math. 19 (1960),53-62.
{at} and {~t} form a bi-orthogonal system. Thus, the [12] JAMES, R.C.: 'Bases and reflexivity in Banach spaces', Ann. of
Math. (2) 52, no. 3 (1950), 518-527.
sequences ai = {t3 ik } form countable Schauder bases in [13] DYNIN, A. and MITYAGIN, B.: 'Criterion for nuclearity in
the spaces Co and ~, p ~ 1. A countable Schauder basis terms of approximate dimension', Bull. A cad. Polon. Sci. ser.
forms a Haar system in the space C[a, b]. In complete Sci. Math., Astr. Phys. 8 (1960), 535-540.
[14] DAY, M.M.: Normed linear spaces, Springer, 1958.
metric vector spaces (in particular, in Banach spaces)
[15] PIETSCH, A.: Nuclear locally convex spaces, Springer, 1972
all countable bases are Schauder bases [10]. In Frechet (translated from the German).
spaces the concept of a weak basis and a Schauder [16] SINGER, I.: Bases in Banach spaces, 1-2, Springer, 1970-1981.
basis are identical [11]. In barrelled spaces in which M.l. Vortsekhovskir
there are no linear continuous functionals, there is also M.l. Kadets
no Schauder basis [8]. However, if a weak Schauder AMS 1980 Subject Classification: 08-XX, 10l05,
basis exists in these spaces, it is an ordinary Schauder 46815, 46A35
basis [9]. A barrelled locally convex space with a count-
able Schauder basis is reflexive if and only if this basis BATEMAN FUNCTION, kjunction - The function
is at the same time a shrinking set, i.e. if the {~t} 1T/2

corresponding to it will be a basis in the dual space X kv(x) = 1. f cos(x tg8-p8)d8, (I)
'IT 0
and will be boundedly complete, i.e. if the boundedness
where x and /l are real numbers. The function was
of the set of partial sums of a series ~~iai implies that
I defined by H. Bateman [1]. The Bateman function may
this series is convergent [12]. If a Schauder basis is an be expressed in the form of a confluent hypergeometric
unconditional basis in a Banach space, then it is a function of the second kind \{tea, h, x):
shrinking set (or a boundedly complete set) if and only
if X does not contain subspaces isomorphic to II (or,
[(p+l)k 2v (x) = e- x 'I'(-p,O;2x). x>O. (2)

respectively, to co). The relation (2) is conveniently taken as the definition


A Schauder basis in a locally convex space is of the Bateman function in the complex plane with the
equicontinuous if for any neighbourhood U of zero it is cut (- 00, 0]. The following relations are valid: for case
possible to find a neighbourhood V of zero such that (1)
kvC - x) = Lv(x),
I ~,(x) Ipu(a,) ~ pv(x)
for case (2)
for all x EX, t E T. All Schauder bases of a barrelled k2v(-~iO) = L2vW-e='=""'e~CP(-p.O; -2~).
space are equicontinuous, and each complete locally
convex space with a countable equicontinuous basis can where ~>O, and !lJ(a, b ; x) is a confluent hyper-
be identified with some sequence space [15]. An geometric function of the first kind.
equicontinuous basis of a nuclear space is absolute. References
[I] BATEMAN. H.: 'The k -function, a particular case of the con-
References fluent hypergeometric function', Trans. A mer. Math. Soc. 33
[1] COHN, P.M.: Universal algebra, Reidel, 1981. (1931),817-831.
[2] MAL'TSEV, A.I.: Algebraic systems, Springer, 1973 (translated [2] BATEMAN, H. and ERDELYI, A. (EDs.): Higher transcendental
from the Russian). functions, I, McGraw-Hill. 1953.
L.N. Karmazina
[3] BOURBAKI, N.: Elements of mathematics. Algebra: Algebraic
structures. Linear algebra, 1, Addison-Wesley, 1974, AMS 1980 Subject Classification: 33A30
Chapt. 1; 2 (translated from the French).
[4] HAMEL, G.: 'Eine Basis aller Zahlen und die unstetigen BATEMAN METHOD - A method for approximating
Losungen der Funktionalgleichung: f(x +y) =f(x) +f(Y)',
Math. Ann. 60 (1905), 459-462. the integral operator of a one-dimensional integral
[5] SCHAUDER, J.: 'Zur Theorie stetiger Abbildungen in Fredholm equation of the second kind; it is a particular
Funktionalraumen', Math. Z. 26 (1927), 47-65; 417-431. case of the method of degenerate kernels (cf. Degen-
[6] ENFLO, P.: 'A counterexample to the approximation problem
erate kernels, method of). In Bateman's method, the
in Banach spaces', Acta Math. 130 (1973),309-317.
[7] ZOBIN, N.M. and MITYAGIN, B.S.: 'Examples of nuclear linear degenerate kernel KN(X, s) is constructed according to
metric spaces without a basis', Functional Anal. Appl. 8, no. 4 the rule:

349
BATEMAN METHOD

KN(x, S) = Editorial comments.


o K(x, Sl)
References
K(xJ, S) K(xJ, Sl) [A1] LIPTSER, R.S. and SHIRYAYEV, A.N.: Statistics of random
processes, 1, Springer, 1977, Section 7.9.

AMS 1980 Subject Classification: 62A 15

BAYESIAN APPROACH to statistical problems - An


approach based on the assumption that to any parame-
ter in a statistical problem there can be assigned a
definite probability distribution. Any general statistical
where Si, Xi, i = I, ... ,N, are certain points on the decision problem is determined by the following ele-
integration segment of the integral equation considered. ments: by a space (X, f!4 x) of (potential) samples X, by
The method was proposed by H. Bateman [I]. a space (8, f!4 e) of values of the unknown parameter
References B, by a family of probability distributions {Po: BEe}
[I] BATEMAN, H.: Messeng. Math. 37 (1908), 179-187.
on (X, !J8 x), by a space of decisions (D, f!4 D) and by a
[2] KANTORovICH, L.V. and KRYLOV, V.I.: Approximate methods
of higher analysis, Noordhoff, 1958 (translated from the Rus- function L(B, d), which characterizes the losses caused
by accepting the decision d when the true value of the
sian). A.B. Bakushinskit
parametar is B. The objective of decision making is to
Editorial comments. find in a certain sense an optimal rule (decision func-
References tion) o=o(x), assigning to each result of an observation
[A1] BATEMAN, H.: Proc. Roy. Soc. A (1922),441-449.
XEX the decision o(x)ED. In the Bayesian approach,
AMS 1980 Subject Classification: 45805, 45L05 when it is assumed that the unknown parameter B is a
random variable with a given (a priori) distribution
BAYES FORMULA - A formula with which it is pos-
7T=7T(d8) on (8, 88 e) the best decision function
sible to compute a posteriori probabilities of events (or (Bayesian decision function) o = o (x) is defined as the
of hypotheses) from a priori probabilities. Let function for which the minimum expected loss
AI, ... ,An be a complete group of incompatible
inf,s p(7T, 0), where
events: UAi=n, Ai nAJ== 0 if i=/=j. Then the a
posteriori probability P(A i I B) of event A, if given that p(7T, 8) = j p(O, 8)7T(dO),
EI
event B with P(BO has already occurred may be
and
found by Bayes' formula: p(O, 8) = jL(0.8(xP e(dx)
.\
P(A,)P(B I A,)
peA, I B) = n
(*) is attained. Thus,
~ peA, )P(B I A,)
p(7T.8*) = inf j j L(O, 8(x Pe(dX)7T(dO).
ii (-J X
where P(Ai) is the a priori probability of Ai, PCB IAi) is
the conditional probability of event B occurring given In searching for the Bayesian decision function
event A, (with P(AiO) has taken place. The formula 0* = 0* (x), the following remark IS useful. Let
was demonstrated by T. Bayes in 1763. ) Po(dx) =p(x I B) d,u(x), w(dB) = 7T( B) dv( B). where ,u and v
Formula (*) is a special case of the following abstract are certain a-finite measures. One then finds, assuming
variant of Bayes' formula. Let B and ~ be random ele- that the order of integration may be changed.
ments with values in measurable spaces (8, BE) and j jL(0,8(xP o(dx)7T(dO) =
(X,B x ) and let EIK(B)I<oo. Put for any set (oJ .\

A EF~=a{w: ~(w)}. = jjL(0.8(xp(xI0)7T(0)dp.(x)dv(0) =


(oJ .\
G(A) = jg(0(wE[14(w) I Fel(w)P(dw).
11

where F8=a{w: B(w)} and JA(w) is the indicator of the


= f.\ dp.(x) lr(j-) L(O, 8(x)p(x I O)7T(O)dV(O)l

set A. Then the measure G is absolutely continuous It is seen from the above that for a given x EX, 0* (x) is
with respect to the measure P (CP) and that value of d* for which
E[g(B) I Fd(w)=(dG / dP)(w). where (dC / dP)(w) is the
Radon - Nikodym derivative of C with respect to P. in: j L(O. d)p(x 10)7T(O)dv(O)
d (_)

References is attained, or. what is equivalent for which


[I J KOl.MOGORm. 1\.1\.: FOIlI1c/mio/ls oj (he (heorl' "/pl'lllwlllllll.
Chelsea. reprint. 1950 (translated fmIll the Ru"ian). inf f L(O, d) p(xp(x)
10)77(0) dv(O).
A. S. Sh /'"),(1('1' d H

3S0
BA YESIAN APPROACH, EMPIRICAL

is attained, where ter X=x is known. If, as a result of some experiment,


only the realization of Y is observed, while the
p(X) ::: [p(X I O)TT(O) dp(O).
corresponding realization of X is unknown, and if it is
But, according to the Bayes fonnula necessary to estimate the value of a given function cJ>(X)
of the non-observed realization, then, in accordance
(L(O, d) p(x IO)TT(O) dv(O) ::: E[L(O, d) Ix]. with the empirical Bayesian approach, the conditional
~ p(x)
mathematical expectation E{ cJ>(X) I Y} should be used
Thus, for a given x, 13* (x) is that value of d* for which as an approximate value tf(Y) for cJ>(X). In view of the
the conditional average loss E[L(O, d) Ix] attains a Bayes fonnula, this expectation is given by the formula
minimum.
Example. (The Bayesian approach applied to the case Y - jcp(x)p(Ylx)p(x)d}L(x) (1)
1/;( ) - q(Y) ,
of distinguishing between two simple hypotheses.) Let
8={0],02}, D={d l ,d2 }, Lij=L=(Oi,dj ), i,j=I,2; where
7T(Od=7TJ, 7T(02) = 7T2, 7TI +7T2 = 1. If the solution di is q(y)::: jP(Ylx)p(x)d}L(x), (2)
identified with the acceptance of the hypothesis Hi: p (x) is the density of the unconditional (a priori) distri-
O=Oi' it is natural to assume that LII <L 12 , L 22 <L 21 bution of X, }L(x) is the corresponding a-finite measure;
Then and the function q(y) represents the density of the
p(TT,O) ::: j[TTIP(X IOI)L(OI, o(x+
x unconditional distribution of Y.
+TT2P(X I(2)L(02, o(x]d}L(x) If the a priori density p (x) is unknown, it is not pos-
sible to compute the values of tf and q. However, if a
implies that infs p( 7T, (3) is attained for the function
sufficiently large number of realizations of the random
variables Y I, . . . , Yk , which are drawn from the distri-
bution with density q(y), is known, it is possible to
o'(X) ::: construct a consistent estimator q(y), which depends
only on Y I , . . . , Y k S.N. Bernshteln [I] proposed to
estimate the value of tf( Y) by substituting q(y) for q (y)
in (2), finding the solution p(x) of this integral equa-
The advantage of the Bayesian approach consists in tion, and then substituting p
and q
in the right-hand
the fact that, unlike the losses p(O, (3), the expected side of (1). However, this method is difficult, since solv-
losses p(7T, (3) are numbers which are dependent on the ing this integral equation (2) is an ill-posed problem in
unknown parameter 0, and, consequently, it is known numerical mathematics.
that solutions 13; for which In certain special cases the statistical approach may
p(TT, 0;) .;;; inf p(TT, 0)+(, be employed not only to estimate q, but also tf [3]. This
6
is possible if the identity
and which are, if not optimal, at least (-optimal (>0),
cp(x)p(y Ix) ::: A(y)r[z(y) Ix], (3)
are certain to exist. The disadvantage of the Bayesian
approach is the necessity of postulating both the involving x and y, is true. In (3), '\(y) and z (y) are
existence of an a priori distribution of the unknown functions which depend on y only, while r(z Ix), being
parameter and its precise form (the latter disadvantage a function of z, is a probability density (i.e. may be
may be overcome to a certain extent by adopting an regarded as the density of an arbitrary distribution of
empirical Bayesian approach, cf. Bayesian approach, some random variable Z for a given value X=x). If (3)
empirical). is true, the numerator of (I) is equal to the product
References '\(Y)s [z (Y)], where s(z)= fr(z Ix)p(x) d}L(x) is the
[1] WALD, A.: Statistical decision functions, Wiley, 1950. density of the unconditional distribution of Z. Thus, if
[2] GROOT, M.H. DE: Optimal statistical decisions, McGraw-Hill,
1970.
a sufficiently large number of realizations of indepen-
A.N. Shiryaev dent random variables Z I, . . . ,Zm with density distri-
AMS 1980 Subject Classification: 62A15, 62C10 bution s (z) is available, then it is possible to construct
a consistent estimator s(z) Jor
s(z), and hence also to
BAYESIAN APPROACH, EMPIRICAL - A statistical find a consistent estimator tf( Y) for tf( Y):
interpretation of the Bayesian approach yielding conclu-
sions on unobservable parameters even if their a priori CP(X) ~ I/;(Y) ~ ~Y) ::: A(Y!S[Z(Y)]. (4)
q(Y)
distribution is unknown. Let (Y, X) be a random vector
for which the density p (y Ix) of the conditional distri- For instance, if one has to estimate tf(X)=xh, where
bution of Y for any given value of the random parame- h is a positive integer, and p(y Ix)=xYe- x Iy!

351
BA YESIAN APPROACH, EMPIRICAL

(y=o, I, ... ; x>O), then cp(x)p(y IX)=A(y)p(y+h Ix), tions of the empirical Bayesian approach to hypotheses
where A(y)=(y +h)! IY!. Since, here, r(z Ix)=p(z Ix), testing concerning the values of random parameters.
one has s (z) = q (z). Accordingly, References
[1] BERNSHTEiN, S.N.: 'On 'fiducial' probabilities of Fisher', lzv.
~(y) = A( Y)q( Y + h) I q( y), i.e. only the sequence of Akad. Nauk SSSR Ser. Mat. 5 (1941),85-94 (in Russian).
realizations Y I, Y 2, . . . , is required to find 1/;. If, on English abstract.
the other hand, p(Ylx)=b(Ylx)=C{xY(l-x)n- y [2] BOL'sHEv, L.N.: 'Applications of the empirical Bayes
approach', in Internat. Congress Mathematicians Nice, 1970,
(y=O, ... ,n; n is a positive integer; O':;;;x.:;;;l,
Vol. 3, Gauthier-Villars, 1971, pp. 241-247.
C{ =<;, then 1/;(x)p(y I x)=A(y)r(y +h I x), where [3] ROBBINS, H.: 'An empirical Bayes approach to statistics', in
A(Y)= C{ I C{ ti and r(z Ix)=b n +h(z Ix)*p(z Ix). For Proc. Berkeley Symp. Math. Statist. Probab., Vol. 1, Berkeley-
Los Angeles, 1956, pp. 157-163.
this reason two sequences of empirical values Y i and Z} L.N. Bol'shev
are required in this case to construct 1/;( Y).
AMS 1980 Subject Classification: 62A 15, 62C15
This form of the empirical Bayesian approach is
applicable to the very narrow class of densities p (y Ix) BAYESIAN DECISION FUNCflON - A rule (func-
and functions cp(x) which satisfy condition (3); even if tion) ~=~(x) which associates with each result x of a
this condition is in fact met, the construction of the statistical experiment a decision ~(x) with values in a
estimator (4) is subject to the observability of the ran- given set of decisions, and which attains the minimum
dom variables Zj' the distribution of which usually expected loss as defined in the framework of the Bayes-
differs from that of the variables Yi which are observed ian approach to statistical problems.
directly. For practical purposes, it is preferable to use A.N. Shiryaev
the empirical Bayesian approach in a modified form, in AMS 1980 Subject Classification: 62C10
which these disadvantages are absent. In this modifica-
tion the approximation which is constructed does not BAYESIAN ESTIMATOR - An estimator of an
yield a consistent estimator of 1/;(y) (such an estimator unknown parameter from the results of observations
may even be non-existent), but rather upper and lower using the Bayesian approach. In such an approach to
estimators of this function, which are found by solving the problem of statistical estimation it is usually
a problem of linear programming, as follows. Let assumed that the unknown parameter 0 E e C Rk is a
'II I (Y) and 'II2(Y) be the constrained minimum and random variable with given a priori distribution
maximum of the linear functional (with respect to the 7T=7T(dO), that the space of decisions D is identical to
unknown a priori densi ty p (x in the n umera tor of (1), the set e and that the loss L(O, d) expresses the devia-
calculated under the linear constraints p(x);;?O, tion between the variable 0 and its estimator d. It is
jp (x) dJL(x) = I and q(Y) jp(Ylx)p(x)dJL(x)=q(y), therefore supposed, as a rule, that the function L (0, d)
where q( Y) is the estimator of q (Y) mentioned above, has the form L(O, d)=a(O)A(O-d), where A is some
constructed from the results of the observations non-negative function of the error vector 8-d. If k = 1,
Y I, . . . , Yk . One may conclude in such a case that it is often assumed that A(O-d)= 18-d I", 0:>0; the
'III (Y) I q(Y).:;;;1/;(Y).:;;;'lI 2 (Y) I q(Y), where the proba- most useful and mathematically the most convenient is
bility of the truth of this conclusion tends to one (by the quadratic loss function L(O, d)= 18-d 12. For such
virtue of the law of large numbers) as the number of a loss function the Bayesian estimator (Bayesian deci-
random variables Yi , used to construct thl) estimator sion function) 0* = 0* (x) is defined as the function for
q(Y), increases without limit. Other modifications of which the minimum total loss
the empirical Bayesian approach are also possible - inf p(?T. 8) = inf f f 10-8(x) 12 Po (dx)?T(dO),
8 8 (_) X
for example, by adding to the last-named condition
q(Y)=q(Y) a finite number of conditions of the form IS attained. or, equivalently. for which the minimum
q (Yi) = q(Yi), where y/ are preliminarily given numbers; conditional loss
q
if is replaced by the corresponding confidence bounds infE{[0-8(x)f I x)
Ii
for q, the conditions are obtained in the form of ine-
is attained. It follows that in the case of a quadratic
qualities ql(YJ':;;;q(Yi)':;;;Q2(Y/). etc.
loss function the Bayesian estimator 0* (x) coincides
In certain cases. which are important in practice.
with the a posteriori average 0*(x)=E(8Ix). and the
satisfactory majorants. which can be computed without
Bayes risk is
the use of the laborious method of linear programming. p(?T,8') = E[O(Oix)j,
can be found for the functions 'II I and 'lI2 (see the where 0(01 xl is the variance of the a posteriori distri-
example in the entry Sample method whieh deals with bution:
statistical control). O(Olx) = E{[O-E(Olx)flx}.
See the entry Discriminant analysis for the applica- Example. Let x = (x 1 . . , x n ), where XI, . . . 'X n are

352
BEHRENS - FISHER PROBLEM

independent identically-distributed random variables tions JJ.l, JJ.2, of the variances aT, o'~ and of their ratio
with normal distributions N(), a2), a2 is known, while aT / o'~ are unknown. A sufficient statistic in the case
the unknown parameter () has the normal distribution nl, n2~2 is a four-dimensional vector (~\, X2, ST, sh
N (JJ., ~). Since the a posteriori distribution for () the components of which are expressed by the formulas
(where x is given) is normal N(JJ.n, ,.~) with
- -2 +
_ nxo
JLn - -2
fJ.T
-2
-2'
-2 _
Tn - no
-2 + T-2,
no +,. n, n2_

where x = (x I + ... + Xn) / n, it follows that for the Sy = ~(Xli-:\\)2, S~ = ~(X2)-X2)2,


i=1 )=1
quadratic loss function 1 ()-d 12 the Bayesian estima-
and which are mutually indeJ>endent random variables;
tor is 5' (X) =JJ.n, while the Bayesian risk IS
,.~ =0'2~ / (n~ +0'2).
~(XI-JJ.I)/O'l and yr;;;(X2 -JJ.2)/(J2 have a
A.N. Shiryaev standard normal distribution, while Sf / (JT and
Editorial comments. S~ / (J~ have a 'chi-squared' distribution with n I - I
and n2 -} degrees of freedom, respectively. Since a suf-
References
[A1] SVERDRUP, E.: Laws and chance variations, 1, North-Holland, ficient statistic contains the same information about the
1967, Chap!. 6, Section 4. unknown parameters JJ.I, JJ.2, (JT, (J~ as the initial
AMS 1980 Subject Classification: 62F15 n I + n 2 random variables X Ii and X 2j' it follows that
only the sufficient statistics need be considered in test-
BEHNKE - STEIN THEOREM - The umon of ing hypotheses about the values of these parameters.
domains of holomorphy Gk C en , k = I, 2, ... , for In particular, this idea is the basis of the modern for-
which Gk C Gk + I for all k, is also a domain of holo- mulation of the problem of hypotheses testing, concern-
morphy. The Behnke - Stein theorem is valid not only ing the hypothesis JJ.I - JJ.2 =~, where ~ is a previously
in the complex Euclidean space en, but also on any given number; here the Behrens- Fisher problem is
Stein manifold. If the sequence Gk is not monotone reduced to finding a set Ka in the space of possible
increasing by imbedding, the theorem is not valid; e.g. values of the random variables XI - X 2, ST, S~ such
the union of the two domains of holomorphy that, if the hypothesis being tested is correct, the proba-
bility of the event (XI-X2,Sf,S~)EKa does not
G1 = {(ZI,Z2): I ZI I <I, I Z2 I <2}
depend on all the unknown parameters and is exactly
and equal to a given number a in the interval O<a< l.
G2 = {(zJ, Z2): I ZI I <2, I Z2 I <I}
The question of the existence of a solution to the
in C2 is not a domain of holomorphy.
Behrens - Fisher problem was discussed at length by
References prominent mathematicians (mainly in connection with
[1] BEHNKE, H. and STEIN, K.: 'Konvergente Folgen von
Regularitatsbereichen und die Meromorphiekonvexitlit', Math.
the approach to the problem taken by R.A. Fisher,
Ann. 116 (1938),204-216. which passed beyond the borders of probability
[2] VLADlMIROV, V.S.: Methods of the theory offunctions of several theory). It was shown by Yu.V. Linnik et aI., in }964,
complex variables, M.LT., 1966 (translated from the Russian).
that if the sample sizes n I and n 2 are of different pari-
E.M. Chirka ties, a solution Ka to the Behrens- Fisher problem
AMS 1980 Subject Classification: 32005 exists. If the parities of n I and n2 are equal, the
existence of a solution remains an open question.
BEHRENS- FISHER PROBLEM - An analytical The Behrens- Fisher problem has often been gen-
problem which arose in the context of the statistical eralized and modified. A. Wald, in particular, posed the
problem of comparing, starting from empirical data, the problem of finding a set K a in the sample space of the
mathematical expectations of two normal distributions, two variables (XI - X 2) / ST and ST / S~. The question
the variances of which are unknown (it is assumed that of the existence of a solution to this problem remains
the ratio of the variances is also unknown). This prob- open. However, it is effectively possible to construct a
lem was posed by W.U. Behrens [l] in connection with set K*,. such that if the hypothesis JJ.I - JJ.2 = ~ being
processing crop data. The modern formulation of the tested is in fact correct, the probability of the event
Behrens - Fisher problem is due to R. Fisher and is X 1 -X 2)/ST,ST /S~)EK:, while still depending on
based on the concept of sufficient statistics. Let the unknown ratio aT / O'~, will deviate from the given
XII, ... ,X ln , and X 21 , ... ,X2n , be mutually a only by a small amount. This fact is the basis of
independent random variables with a normal distribu- modern recommendations for the practical construction
tion, and let EX li =JJ.J, E(Xli - JJ.d =O'f (i = I, ... ,n I) of tests to compare JJ.l and JJ.2. Simple tests for the
and EX2j=JJ.2' E(X2j-JJ.d=0'~ (j=I, ... ,n2). It is comparison of JJ.l with JJ.2, which are also computation-
assumed that the values of the mathematical expect a- ally convenient, were proposed by V.T. RomanovskiI,

353
BEHRENS - FISHER PROBLEM

M. Bartlett, H. Scheffe and others. However, the statis- the optimal solution with the aid of Bellman's equation
tics of these tests are not expressed in terms of suffi- is known as dynamic programming.
cient statistics and are, for this reason, usually less References
powerful than tests based on the solution of the [1] BELLMAN, R.: Dynamic programming, Princeton Univ. Press,
Behrens- Fisher problem and its generalizations. 1957.
V. G. Karmanov
References Editorial comments. The Bellman equation for
[I] BEHRENS, W.U.: Landwirtsch.lahresber. 68, no. 6 (1929),807-
837. continuous-time optimal control problems is also often
[2] LiNNIK, YU.V.: Statistical problems with nuisance parameters, called the dynamic programming equation. Cf. the article
Amer. Math. Soc., 1968 (translated from the Russian). Optimality, sufficient conditions for for examples and more
[3] LINNIK, Yu.V., ROMANOYSKIi, LV. and SUDAKOY, V.N.: 'A details. There is also a variant for stochastic optimal control
nonrandomized homogeneous test in the Behrens - Fisher problems.
problem'. Soviet Math. Dok!. 5, no. 2 (1964),570-572. (Dok!.
Akad. Nauk SSSR 155, no. 6 (1964), 1262-1264) References
L.N. Bol'shev [A1] FLEMING, W.H. and RISHEL, R.W.: Deterministic and sto-
chastic optimal control, Springer, 1975.
Editorial comments.
AMS 1980 Subject Classification: 35QXX, 90C39
References
[A 1] LiNNIK, YU.V.: 'Randomized homogeneous tests for the BELLMAN - IlARR.J.s PROCESS - A special case of
Behrens - Fisher problem', Selected Transl. in Math. Stat.
and Probab. 6 (1966), 207-217 (translated from the Rus- an age-dependent branching process (cf. Branching pro-
sian). cess, age~dependent). It was first studied by R. Bellman
AMS 1980 Subject Classification: 62FXX and T.E. Harris [1]. In the Bellman - Harris process it
is assumed that particles live, independently of each
BELL-SHAPED GAME - A game on the unit square other, for random periods of time, and produce a ran-
whose pay-off function takes the form cf>(x - y), where cp dom number of new particles at the end of their life
is a positive analytic proper P6lya frequency function, time. If G (t) is the distribution function of the life
1.e. : times of the individual particles, if h (s) is the generat-
1) cp(u) is defined for all UE(-oo, 00); ing function of the number of direct descendants of one
2) for any n and any sets - 00 <x 1< ... <xn < 00 particle, and if at time t =0 the age of the particle was
and - 00 <y I < ... <Yn < 00 there is an inequality zero, then the generating function F(t;s)=Esl'(t) of the
det II cf>(x, - Yj) II ~O; number of particles p,(t) satisfies the non-linear integral
3) for any set {xd (correspondingly, {Yd) there is a equation
set {Yd (correspondingly, {Xk }) such that
det II cf>(Xj - Yj) II >0; F(t;s) = jh(F(t-u;sdG(u)+s(l-G(t.
o
4) fjJoo
cf>(u) du < 00.
If
An example of a bell-shaped game is a game with G(t) = l-e- A1 , t?O,
pay-off function e -(x _y)2. The optimal strategies of the Bellman - Harris process is a Markov branching
players in a bell-shaped game are unique and are process with continuous time.
piecewise-constant distributions with a finite number of
References
steps. The value of a game with pay-o), function [I] BELLMAN. R. and HARRIS. T.E.: 'On the theory of age-
cf>(?-.(x -y)), as ?-.~OO, moves towards zero, while the dependent stochastic branching processes', Proc. Nat. Acad.
number of points in the supports of the optimal stra- Sci. USA 34 (1948), 601-604.
B.A. Sevast ')'anov
tegies grows unboundedly.
AMS 1980 Subject Classification: 60J80
References
[I] KARl.IN, S.: Mathematical method.1 and theory in Rames, pro- BELTRAMI COORDINATES - Coordinates on a sur-
gramming and economics, Addison-Wesley, 1959.
face in which all geodesic lines are expressed by linear
v.K. Domanskir equations. Surfaces on which the introduction of Bel-
AMS 1980 Subject Classification: 90005 trami coordinates is possible are said to he projective
(projective transformations of such surfaces convert
BELLMAN EQUATION - I) A partial differential
geodesics into geodesics). So called after E. Beltrami.
equation of a special type to solve a problem of optimal
who studied such surfaces in 1866.
control. If the solution of Cauchy's problem for the E. V S'hikin
Bellman equation can be found, the optimal solution of AMS 1980 Subject Classification: 53A05
the original problem is readily obtained.
2) A recurrence relation for the solution of a discrete BELTRAMI - ENNEPER THEOREM - A theorem
problem of optimal control. The method for obtaining estahlishing the following property of asymptotic lines

354
BENDIXSON CRITERION

(d. Asymptotic line) on a surface of negative curvature Editorial comments.


(E. Beltrami, 1866; A. Enneper, 1870): If the curvature References
of an asymptotic line at a given point is non-zero, then [A 1] KLEIN, F.: Vorlesungen uber nicht-Euklidische Geometrie,
the square of the torsion of this line is equal to the Springer, 1968.

absolute value of the curvature of the surface at this AMS 1980 Subject Classification: 51 M1 0, 53A35
point. The theorem can also be applied to the case
when the curvature of the asymptotic line at a given BELTRAMI METHOD - A method for solving wave
point is zero. The square of the torsion is replaced by equations with three spatial variables, proposed by E.
the square of the rate of rotation of the tangent plane Beltrami in 1864. The method is based on the fact that
to the surface at this point during the motion along the the wave equation on the surface of the characteristic
asymptote. Asymptotic lines emanating from the same cone can be converted with the aid of interior differen-
point have torsions of equal absolute values and of tial operators to an especially simple form which can be
opposite signs. used for the solution u(x, t)
E. V Shikin
Editorial comments. A useful general reference is [A 1]. 4'lT t5u(xo, to) = /} [u+to ~~ J do,
References
[A1] SPIVAK. A.M.: Differential geometry, 3, Publish or Perish, where g is the sphere {x: 1 x-xo 1 =to}, da is its sur-
1975. face element and a/ an
denotes differentiation in the
AMS 1980 Subject Classification: 53A05 direction of the outward normal on the cone
1 x - x 0 12 = (t - to)2. The method is applicable to the
BELTRAMI EQUATION - See Laplace-Beltrami cases of inhomogeneous equations and to equations
equation. with any odd number of spatial variables.

AMS 1980 Subject Classification: 58G05, 58G25 References


[I] COURANT, R. and HiLBERT, D.: Methods ofmathematicalphy-
sics. Partial differential equations, 2, Interscience, 1965
BELTRAMI INTERPRETATION - A realization of a (translated from the German).
Sh.A. Alimov
part of the Lobachevskii plane on a pseudo-sphere - a
VA. II'in
surface of constant negative curvature. In the Beltrami
in terpretation, the geodesic lines (cf. Geodesic line) and Editorial comments. The formula above is called
their segments on the pseudo-sphere play the role of Beltrami's formula.
straight lines and their segments on the Lobachevskil References
plane. The isometric mappings of the pseudo-sphere [A 1] JOHN, F.: Partial differential equations, Springer, 1978.

onto itself represent the movements in the Lobachevskii AMS 1980 Subject Classification: 35L05, 35C15
plane. with preservation of horocircles. The lengths,
angles and areas on the pseudo-sphere correspond to BENDIXSON CRITERION - A theorem that permits
the lengths, angles and areas on the Lobachevskii one to establish the absence of closed trajectories of
plane. Under these conditions, to each theorem of dynamical systems in the plane, defined by the equa-
Lobachevskii planimetry which refers to a part of the tion
Lobachevskii plane there corresponds an analogous x' = P(x,y), y' = Q(x,y). (*)
theorem of the intrinsic geometry of the pseudo-sphere. The criterion was first formulated by I. Bendixson [1]
The Beltrami interpretation realizes a part of the as follows: If in a simply-connected domain G the
Lobachevskii plane, but the entire Lobachevskii plane expression P~ + Q~ has constant sign (i.e. the sign
cannot be realized in three-dimensional Euclidean remains unchanged and the expression vanishes only at
space as a regular surface (Hilbert's theorem). isolated points or on a curve), then the system (*) has
The Beltrami interpretation was proposed by E. Bel- no closed trajectories in the domain G. This criterion
trami in 1868 [1]. This publication represented the first was generalized by H. Dulac [2] as follows: If G is a
realization of Lobachevskii's 'imaginary geometry' in simply-connected domain in the (x, y)-plane, if the
three-dimensional Euclidean space. functions P and Q E C 1(G), and if a function
For other interpretations of the Lobachevskii j(X,Y)EC 1(G) can be found such that

f ({Mf!l
geometry see Klein interpretation; Poincare model.
References ax + 2!J!2l}
ay dx dy =1= 0
[1] BELTRAMI. E.: 'Saggio di interpretazione della geometria non-
euc1idea'. in Opere Mat .. Vol. 1. 1868. pp. 374-405. for any simply-connected subdomain D C G, then the
[2] KAGAN. V.F.: The foundations of geometry. 1, Moscow- domain G does not contain any simple rectifiable
Leningrad. 1949 (in Russian).
E. V Shikin closed curve consisting of trajectories and singular

355
BENDIXSON CRITERION

points of the system (*). If the domain G is an annulus, of the Euclidean (x, y)-plane punctured at the point
a similar theorem states that a closed trajectory of (*), (0,0) into a similar Euclidean (u, v)-plane. It is the
if it exists, is unique. A generalization applying to the coordinate expression of the bijection <p generated by
case of system (*) with cylindrical phase space [3] is the Bendixson sphere. If the planes (u, v) and (x, y)
also possible. coincide, the Bendixson transformation is the inversion
References of the plane (x,y) with respect to the circle x 2 +y2 =4.
(I] BENDIXSON, I.: 'Sur les courbes definies par des equations References
differentielles'. Acta Math. 24 (1901), 1-88. [I] ANDRONOY, A.A., LEONTOYICH, E.A, GORDON, 1.1. and
[2] DULAC, H.: 'Recherches des cycles Iimites. C.R. Acad. Sci. MAIER, A.G.: Qualitative theory of second order dynamic sys
Paris Ser. 1 Math. 204 (1937). 1703-1706. te~, Wiley, 1973 (translated from the Russian)A FAd
[3] ANDRONOY, A.A., VITI, AA. and KHAiKIN, S.E.: Theory of .. n reev
oscillations, Moscow, 1959 (in Russian).
N.Kh. Rozov AMS 1980 Subject Classification: 51 N20, 34C20
Editorial comments. Bendixson's criterion is also called
BERGMAN KERNEL FUNCTION, Bergman kernel -
the Poincare- Bendixson theorem.
A function of complex variables with the reproducing
References
[A 1] ANDRONOY, A.A, LEONTOYICH, E.A., GORDON, 1.1. and
kernel property, defined for any domain D ccn in
MAIER, A.G.: Qualitative theory of second-order dynamic which there exist holomorphic functions f=l=0 of class
systems, Wiley, 1973 (translated from the Russian). L 2 (D) with respect to the Lebesgue measure dV. The
AMS 1980 Subject Classification: 34C05, 34C25 function was introduced by S. Bergman [1]. The set of
these functions f forms the Hilbert space
BENDIXSON SPHERE - The sphere in real analysis L 2,h(D)CL 2 (D) with orthonormal basis {<PI, </>2, ... };
which is known as the Riemann sphere in the theory of L 2,h(D)=L 2 (D) n
OeD), where OeD) is the space of
functions of a complex variable. holomorphic functions. The function
Let ~: X2 + y2 + Z2 = I be the unit sphere in the
Euclidean (X, Y, Z)-space, and let N(O, 0, I) and
S (0, 0, - I) be its north and south pole, respectively;
let p and a be planes tangent to ~ at the points Nand z = (z" ... ,zn), ~ = (~" ... '~n)'
S respectively; let xSy and uNv be coordinate systems is called the Bergman kernel function (or simply the
in a and p with axes parallel to the corresponding axes kernel function) of D. The series on the right-hand side
of the system XOY in the plane Z=O and pointing in converges uniformly on compact subsets of D, and
the same directions; let n be the stereographic projec- belongs to L 2,h(D) for each given fED, the sum does
tion of ~ onto a from the centre N, and let II' be the not depend on the choice of the orthonormal basis
stereographic projection of ~ onto p from the centre S. {<Pj }. The Bergman kernel function depends on 2n com-
Then ~ is the Bendixson sphere with respect to anyone plex variables, and is defined III the domain
of the planes a, P. It generates the bijection <p=n'n- 1 D X D C C2n ; it has the symmetry property
of the plane a (punctured at the point S) onto the plane K(f, z)=K(z, n,it is holomorphic with respect to the
P, which is punctured at the point N. This bijection is variable z and anti-holomorphic with respect to If r.
employed in the study of the behaviour of the trajec- D =D' XD", D' cern, D" ccn- rn , then
tories of an autonomous system of real algf) Iraic ordi- KD(z, n = K[)(z', nKlJ(z", n
nary differential equations of the second order in a
neighbourhood of infinity in the phase plane (the wherez'=(z], ... ,zrn), z"=(zrn+],'" ,zn).
right-hand sides of the equations are polynomials of the The most important characteristic of the Bergman
unknown functions). This bijection reduces the problem kernel function is its reproducing property: For any
to a similar problem in a neighbourhood of the point function !EL 2,h(D) and for any point z ED the follow-
(0, 0). Named after I. Bendixson. ing integral representation is valid:

References I(z) = jlmK(z, ndVm


[)
[I] ANDRONOY, A.A., LEONTOYICH, E.A, GORDON, 1.1. and
MAIER, AG.: Qualitative theory of secondorder dynamic sys Extremal properties of the Bergman kernel function
tem~, Wiley, 1973 (translated from the Russian)A FAd
. . n reev are:
I) For any point ZED
AMS 1980 Subject Classification: 51 N20, 34C20
K(::,::) = sup{i/(z) 1
2 :/r=L 2h iD), 1I/Ik(/J)~l},
BENDIXSON TRANSFORMATION - The mapping 2) Let a point fED be such that the class L 2,h(D)
4.\
=- -,--,. \' 4\'
= ---
contains functions satisfying the condition fm = 1. The
function K (::, n / K (r, n then satisfies this condition
U
x-+v- x'lr'

356
BERGMAN KERNEL FUNCTION

and has norm K (t f) - I /2, which is minimal for all References


such f The function K(z, f) / K(t f) is called the [II BERGMAN, S.: The kernel function and conformal mapping,
Amer. Math. Soc., 1950.
extremal function of D. [2] FUKS, B.A.: Special chapters in the theory of analytic functions
Changes in the Bergman kernel function under biho- of several complex variables, Amer. Math. Soc., 1965 (translated
lomorphic mappings are expressed in the following from the Russian).
[3] SHABAT, B.V.: Introduction to complex analysis, 1-2, Moscow,
theorem: If cp is a biholomorphic mapping of a domain 1976 (in Russian).
D onto a domain D*, cp(z)=w, CP(f)=1/, then E.M. Chirka
Editorial comments. Recently, additional properties of
dz d~
K (
D' W, Tj) = KD(z, n dw dTj' the Bergman kernel have been discovered. For a large class
of pseudo-convex domains with Coo boundary, which
where dz / dw is the Jacobian of the inverse mapping.
includes strictly pseudo-convex domains, and domains of
Owing to this property the Hermitian quadratic form finite type, the kernel function K(z, w) on 0 X 0 is smooth
ds 2 -- {-. (l2JogK(z, z) d di up to the boundary in z if w remains fixed in D. This is a
~ ~ 4
J.k = 1 az) (lzk consequence of the compactness of the Neumann operator
N for the complex Laplacian on 0 and the identity
is invariant under biholomorphic mappings.
The function K(z)=K(z, z), which is also called a P = I-a ''Na.
kernel function, plays an important role in the intrinsic Here P is the Bergman projection, that is, the orthogonal
geometry of domains. In the general case it is non- projection of L 2 (0) onto L 2h (0) given by integration against
negative, while the function log K(z) is plurisubhar- K; a is the Cauchy- Riemann operator and a
its Hilbert
space adjoint. In fact, for these domains P satisfies the so-
monic. In domains D where K (z) is positive (e.g. in
called 'condition R', that is P maps L 2s + 2(0) continuously
bounded domains), the functions K(z) and 10gK(z) are
into L 2s (0), where L 2k (0) denotes the Sobolev space of
strictly plurisubharmonic. The latter is tantamount to
order k. This property of the Bergman kernel function is
saying that in such domains D the form ds 2 is positive employed in the study of proper holomorphic and biholo-
definite and. consequently, gives a Hermitian Riemann- morphic mappings. (See [A2], [A4], [AS].) Moreover, the
ian metric in D. This metric remains unchanged under asymptotic behaviour of K(z, w) has been studied; for
biholomorphic mappings and is called the Bergman strictly pseudo-convex domains 0 one has
metric. It may be considered as a special case of a
Kahler metric. It follows from extremal property I) that
K(z, w) = F(z, w)(il/;(z, w-n-1 +G(z, w) log (il/;(z, w,

the coefficients of the Bergman metric increase to infin- where F, G and I/; are Coo functions on 75 X 75 and I/; satisfies
ity on approaching certain boundary points. If D cen a) I/;(z, z) = p(z) / i, where p is a strictly-plurisubharmonic
defining function for 0;
is a strictly pseudo-convex domain or an analytic
b) aA and awl/; vanish to infinite order at z= w; and
polyhedron, then K (z) increases to infinity for any
c) I/;(z, w)= -I/;(w, z).
approach of z to the boundary of the domain. Every
Similar results have been obtained for certain weakly
domain which has this property of the Bergman kernel
pseudo-convex domains, see [A1], [A3], [A4].
function is a domain of holomorphy. The Bergman kernel has also been studied for other
For domains of the simplest type, the Bergman ker- domains, e.g. Cartan domains (cf. [A6]) and Siegel
nel function can be explicitly calculated. Thus, for the domains (cl. [A7], Siegel domain).
ball B = {z: I z I <R} in en, the Bergman function has

]
References

l
the following form:
[A 1) BOUTET DE MONVEL, L. and SJOSTRAND, 1.: 'Sur la singular-
-n-I
n!Rn n_ ite des noyaux de Bergman et de Szego', Asterisque 34-35
KB(z, n = -- R2 - ~z)~) (1976), 123-164.
1T" )=1
[A2) CATLIN, D.: 'Global regularity of the a-Neumann problem', in
Proc. Symp. Pure Math., Vol. 41, Amer. Math. Soc., 1984,
and for the polydisc U={z: I z) I <Rj , )=1, ... ,n},
pp. 39-49.
in en: [A3] DIEDERICH, K., HERBORT, G. and OHSAWA, T.: 'The Bergman
kernel on uniformly extendable pseudo-convex domains',
Math. Ann. 273 (1986),471-478.
[A4) FEFFERMAN, c.: 'The Bergman kernel and biholomorphic
In the special case when n = I and U = B is the disc mappings of pseudo-convex domains', Invent. Math. 37
{z: I z I <R} in the complex z-plane, the Bergman (1974),1-65.
metric becomes the classical hyperbolic metric [AS) RANGE, R.M.: Holomorphic functions and integral represen-
tations in several complex variables, Springer, 1986,
ds 2 = 2R2 1 dz 12, Chap!. 7.
(R 2 - 1Z 12)2 [A6) HUA, L.K.: Harmonic analysis of functions of several com-
plex variables in the classical domains, Amer. Math. Soc.,
which is invariant under conformal mappings and 1963.
which defines the LobachevskiI geometry in U. [A7] GINDIKlN. S.G.: 'Analysis on homogeneous domains', Rus-

357
BERGMAN KERNEL FUNCTION

sian Math. Surveys 19 (1964),1-89. (Uspekhi Mat. Nauk 19 An important property of the Bergman - Weil
(1964), 3-92)
representation is that its kernel is holomorphic in z.
AMS 1980 Subject Classification: 32H10 Accordingly, if the holomorphic function f is replaced
by an arbitrary function which is integrable over 0,
BERGMAN - WElL REPRESENTATION, then the right-hand side of the Weil representation
Bergman - Weil formula, Wei! formula - An integral gives a function which is holomorphic everywhere in U
representation of holomorphic functions, obtained by S. and almost-everywhere in D \ au; such functions are
Bergman [I] and A. Weil [2] and defined as follows. Let called integrals of Bergman - Wei! type. If f is holo-
D be a domain of holomorphy in en,
let the functions morphic in U and continuous on fl, then its integral of
WI, ... , Uj be holomorphic III D and let Berg~an - Weil type is zero almost-everywhere on
V={zED: I Wk(z) 1<1, k=l, ... ,N} compactly D\U.
belong to D. It is then possible to represent ~y func- Bergman - Weil representations in a Weil domain V
tion f holomorphic in V and continuous on V at any yield, after the substitution
point Z E V by the formula: W], (z)
f
00

I fm det (Pi].) ~ W,+lm


fez) = -'-n ~ n dr, (*) v ~O lk

(2m) II(~.m- ~.(z the Wei! decomposition


k =1
where the summation is performed over all fez) =
j 1 < ... <jn, while the integration is carried out over ="2, ~ Q),j,S,s'{z)(W;,'(z)'" w;:(z
suitably-oriented n-dimensional surfaces aj" ... ,j,' form- Sir. ~o j I < ... <jk

ing the skeleton of the domain V (d. Analytic into a series of functions, holomorphic in D, and this
polyhedron), d~=d~11\ ... I\d~n' Here the functions series is uniformly convergent on compact subsets of V.
Pij(~' z) are holomorphic in the domain D X D and are
References
defined, in accordance with Hefer's lemma [3], by the [I] BERGMAN, S.B.: Mat. Sb. 1 (43) (1936), 242-257.
equations [2] WElL,A.: 'l'Integrale de Cauchy et des fonctions de plusieurs
n variables', Math. Ann. 111(1935), 178-182.
~m- ~(z) = "2,(ri -z,)Pi/(r, z). [3] VLADIMIROV, V.S.: Methods of the theory of functions of several
i=1 complex variables, M.I.T., 1966 (translated from the Russian).
The integral representation (*) IS called the E.M. Chirka
Bergman - Wei! representation.
Editorial comments.
The domains V appearing in the Bergman - Weil
References
representation are called Wei! domains; an additional [A1] HENKIN, G.M. and LEITERl:R, 1.: Theory of functions on com-
condition must usually be imposed, viz. that the ranks plex manifolds, Birkhauser, 1983.
of the matrices (aJ.fj, /az,,), v=l, ... ,k, fL=l, ... ,n,
AMS 1980 Subject Classification: 32A25
k ~n, on the corresponding sets
BERNOULLI AUTOMORPHISM - An automor-
1= ... =1 ~k I=l}
{ZEV: I~,
phism of a measure space, which describes Bernoulli
are maximal (= k) for all j I < ... <jl: (such Weil trials and their generalization - a sequence of indepen-
domains are called regular). The Weil dom) ins in the dent trials with the same result and with the same pro-
Bergman - Weil representations may be replaced by bability distribution.
analytic polyhedra U compactly belonging to 0, Let A be the collection of all possible outcomes of a
U = {zED: Wj(z)ED/, j=1. ... ,N}, trial, and let the probability of the event B CA be given
by the measure v; for a countable set A, denote its ele-
where the Dj are bounded domains with piecewise-
ments by ai and their probabilities by Pi = v(a,). The
smooth boundaries aDj in the plane C. The
phase space of a Bernoulli automorphism is the direct
Bergman - Weil representation defines the value of a
product of a countable number of copies of the set A.
holomorphic function f inside the analytic polyhedron
i.e. the points of the phase space are infinite sequences
U from the values of f on the skeleton a; for n > I the
h = {hk }. where k runs through the set of integers and
dimension of a is strictly lower than that of au. If
each hk c=A. The transformation T consists in shifting
n = I, analytic polyhedra become degenerate in a
all members of each sequence one place to the left:
domain with piecewise-smooth boundary, the skeleton
T { hI: } = {hl:,- I }. The measure fl is defi ned as the direct
and the boundary become identical, and if. moreover,
product of a countable number of measures v; thus if A
N=I and W(z)=z, then the Bergman-Wei! represen-
is countable. then
tation becomes identical with Cauchy's integral for-
mula.

35H
BERNOULLI l.EMNISCATE

In this case, the entropy of the Bernoulli automorphism References


is - V, logp,. [I] BERNOULLI, J.: Acta Erud. (1695), 59-67; 537-557.
[2] KAMKE, E.: DifJerentialgleichungen: LOsungsmethoden und
In ergodic theory, Bernoulli automorphisms (or, more LOsungen, 1, Chelsea, reprint. 1971.
exactly, the cascades generated by iteration of them) N.Kh. Rozov
play the role of a standard example of a dynamical sys- Editorial comments.
tem, the behaviour of which displays statistical features. References
A Bernoulli automorphism is a K-automorphism but [A 1) INCE, W.L.: Ordinary differential equations, Dover, reprint, 1956.
there exist K-automorphisms which are metrically non-
AMS 1980 Subject Classification: 34A08, 34-XX
isomorphic to a Bernoulli automorphism, even though
many K-automorphisms are metrically isomorphic to a BERNOULLI INTEGRAL of the equations of hydro-
Bernoulli automorphism. Two Bernoulli automor- dynamiCS - An integral which detennines the pressure p
phisms are metrically isomorphic if and only if they at each point of a stationary flow of an ideal homo-
have the same entropy [1]. A Bernoulli automorphism geneous fluid or a barotropic gas (p =F(p)) in terms of
is a quotient automorphism of any ergodic automor- the velocity v of the flow at that point and the body
phism of a Lebesgue space with a larger entropy [2]. force function per unit mass u(x,y, z):
References
[IA) ORNSTEIN, D.: 'Bernoulli shifts with the same entropy are iso-
f7=c-iIVI2+U. (1)
morphic', Adv. Math. 4 (1970), 337-352.
[IB) D.ORNSTEIN: 'A Kolmogorov automorphism that is not a Ber-
The constant C has a specific value for each flow line
noulli shift', Matematika 15, no. 1(1971), 131-150 (in Rus- and varies from one flow line to another. If the motion
sian). is potential, the constant C is the same for the entire
[2] SINAi, YA.G.: 'On weak isomorphism of transformations with
invariant measure'. Mat. Sb. 63 (lOS), no. I (1964).23-42 (in
flow.
Russian). For a non-stationary flow, the Bernoulli integral
D. V Anosov (sometimes called the Cauchy- Lagrange integral) holds
Editorial comments. A metric isomorphism between two in the presence of a velocity potential:
Bernoulli automorphisms can be given by means of a fini-
tary code ([A1], see also [A4D. f .EP... = ~_ll v 12+u+j(t), (2)
References
p at 2
where
[A 1) KEANE, M. and SMORODINSKY, M.: 'Bernoulli schemes with v = 'V <p(x, y, z, t),
the same entropy are flnltarily isomorphic', Ann. of Math. 109
(1979), 397-406. and f (t) is an arbitrary function of time.
[A2] SMORODINSKY. M.: Ergodic theory, entropy, Springer, 1971. For an incompressible liquid the left-hand sides of
[A3) ORNSTEIN, D.: Ergodic theory, randomness, and dynamical equations (l) and (2) are converted to the p / p form;
systems, Yale Univ. Press, 1974.
[A4) CORNFELD. I.P., FOMIN, S.Y. and SINAI, YA.G.: Ergodic
for a barotropic gas (p =F(p to the form
theory, Springer, 1982 (translated from the Russian).
f.EP... = f p' (p).EP....
p p
AMS 1980 Subject Classification: 28DXX
The integral was presented by D. Bernoulli in 1738.
BERNOULLI DISTRIBUTION - A synonym for References
binomial distribution. [ 1] MILNE-THOMSON, L.M.: Theoretical hydrodynamiCS, Macmillan,
1950.
AMS 1980 Subject Classification: 62EXX L.N. Sretenskir

BERNOULLI EQUATION - An ordinary first-order AMS 1980 Subject Classification: 76EXX


differential equation BERNOULLI LEMNISCATE - A plane algebraic
ao(x)/ +al(x)y = j(x)yu. curve of order four, the equation of which in orthogo-
where a is a real number other than zero or one. This nal Cartesian coordinates is:
equation was first studied by 1. Bernoulli [1]. The sub- (Xl +y2)2 -2a 2(x 2 _yl) = 0;
stitution i-a = z converts the Bernoulli equation to a
and in polar coordinates
linear inhomogeneous first-order equation, [2]. If a>O,
the solution of the Bernoulli equation is y 0; if p2 = 2a 2 COS 2cp.
O<a< 1, at some points the solution is no longer The Bernoulli lemniscate is symmetric about the coor-
single-valued. Equations of the type dinate origin (Fig.), which is a node with tangents
[f (y)x + g(y )xulr,' = h(y), 0:*0, I, y = +- x and the point of inflection.

~
are also Bernoulli equations if y is considered as the ) .I/~
independent variable, while x is an unknown function
\:

~t;-
ofy.

359
BERNOULLI LEMNISCATE

The product of the distances of any point M to the two m-1, m-2, ... , and the resulting equations are solved in
given points FI(-a,O) and F 2 (a, 0) is equal to the xP' x~, ....
square of the distance between the points F I and F 2. If two roots xp and Xq (of multiplicity p and q) of the same
The Bernoulli lemniscate is a special case of the Cassini maximum modulus occur, then y(m) satisfies, as m~oo, a
ovals, the lemniscates, and the sinusoidal spirals (cf. linear difference equation with (x- xp)P(x- Xq)q as charac-
teristic polynomial, and (A1) must be replaced by the
Cassini oval; Sinusoidal spiral).
corresponding difference equation of order p + q.
The Bernoulli spiral was named after lakob Ber-
noulli, who gave its equation in 1694. References
[A1] HILDEBRAND, F.: Introduction to numerical analysis,
References McGraw-Hili, 1956.
[1] SAYELOY, A.A.: Plane curves, Moscow, 1960 (in Russian).
D.D. Sokolov AMS 1980 Subject Classification: 65H05
Editorial comments.
References BERNOULLI NUMBERS - The sequence of rational
[A 1] BRIESKORN, E. and KNORRER, H.: Plane algebraic curves, numbers B 0, B I, . . . , discovered by 1acob Bernoulli
Birkhauser, 1986 (translated from the German).
[1] in connection with the calculation of the sum of
AMS 1980 Subject Classification: 53A04 equal powers of natural numbers:

BERNOULLI METHOD - A method for finding the


m -lIn
~ kn= - - ~ s Bsm n + l - s [n)
k =0 n + 1 s=o
real root of algebraic equations of the type
n =0, 1, ... , m = 1,2, ....
(*)
The values of the first Bernoulli numbers are:
with the largest modulus (absolute value). The method
1 1
was proposed by D. Bernoulli [1] and is based on the Bo = 1, BI = -2' B2 = 6' B3 = 0,
following principle. Let y (0), ... ,y (n - 1) be random
1 1
numbers and let the values of y(n),y(n + 1), ... , be B4 = -30' Bs = 0, B6 = 42' B7 = 0,
calculated by the following difference equation:
1 5
aoy(m +n)+aly(m +n -1)+ ... +any(m) = 0. B8 = -30' B9 = 0, BIO = 66' BII = 0.

In general, as m -HY), the expression y (m + 1) / y (m ) All odd-indexed Bernoulli numbers except for B I are
tends to the value of the root of equation (*) with the zero, and the signs of B 2n alternate. Bernoulli numbers
largest modulus. are the values of the Bernoulli polynomials at x =0:
Bn = Bn(O); they also often serve as the coefficients of
References
[1] BERNOULLI, D.: Comm. A cad. Sci. Imper. Petropolitanae 3 the expansions of certain elementary functions into
(1732), 62-69, Petropolis. power series. Thus, for example,
[2] WHITIAKER, E.T. and ROBINSON, G.: Mathematische Bear-
x ox; x'
beitung von Resultaten der Observation, Leningrad-Moscow, -x-
e -1
= ,=0
~Bp-" I x I <2w
1935 (in Russian; translated from the German). P.

L.N. Dovbysh (the so-called generating function of the Bernoulli


Editorial comments. The expression Y(I + 1) / y(m) numbers);
tends to the root of largest modulus only if this root is sim- 00 22,
ple and if no other roots of the same maximum modulus xcotgx = ,~(-I)'B2'(2p)!x2', Ix 1<'17,
occur. If not, then one can proceed as follows (cf. [A1]). Let
_ ox; 22, (22, - 1) 2,
there be one (not necessarily real) root xp of largest tgx - ,~IB2' I (2p)! x, Ix 1<;.
modulus, with multiplicity p. Then the solution of the linear
difference equation for y(m) is L. Euler in 1740 pointed out the connection between
Bernoulli numbers and the values of the Riemann
rp]
y( m) ;::,;
p-l
l
j~O C; j x~, zeta-function rC~) for even s = 2m:

as m--,>x. This expression is the solution of a linear differ- 1'(2 ) - ~2m - (2'17)2m I B I
~ m - :'_( - 2(2n)1 2m'
ence equation with characteristic polynomial (x - xp)P, so
that y(m) satisfies the linear difference equation Bernoulli numbers are used to express many improper
integrals, such as
p-l
y(m) ;::,; ~ xb [p]. y(m- j) (A 1) 1
f '- .-I
x X2n dx I
;=0 J = -4n I B 211 I, n = I, 2.....
II e-"\
as m-HXJ. If p~2, then this equation can be solved expli-
citly in xP ' otherwise (A 1) is rewritten with m replaced by Certain relationships involving Bernoulli numbers are:

360
BERNOULLI POLYNOMIALS

n'2,l [~]Bk '23 x 2 + '21 x.


B ()_3
3 X - X -
Bo = 1, = 0, n;;;'2,
k =0
Bernoulli polynomials may be computed by the
(the recurrence formula); recurrence formula

n~I[:JBs(X)=nxn-l, n=2,3, ....


s =0

For a natural argument x =m Bernoulli polynomials


were first studied by Jacob Bernoulli in 1713, in con-
The estimates: nection with the computation of the sum
2(2n)! < (-1)" -I B ~ ~(2n)!, n ;;;'1, fk n.
(21Tfn 2n 3(21T)2n
k =0
hold. Extensive tables of Bernoulli numbers are avail- L. Euler [1] was the first to study Bernoulli polynomials
able; for instance, [2] contains accurate values of B 2n for arbitrary values of x. The term 'Bernoulli polynomi-
for n ~90 and approximate values for n ~250. als' was introduced by J.L. Raabe in 1851. The funda-
Bernoulli numbers have found many applications in mental property of such polynomials is that they satisfy
mathematical analysis, number theory and approximate the finite-difference equation
calculations.
Bn(x + 1)-Bn(x) = nx n- I,
References
[I] BERNOULLI, J.: Ars conjectandi, Basle, 1713. and therefore play the same role in finite-difference cal-
[2] DAVIS. H.T.: Tables of the higher mathematical functions, 2, culus as do power functions in differential calculus.
Bloomington, 1935. Bernoulli polynomials belong to the class of Appell
[3] SAALSCHUETZ, L.: Vorlesungen uber die Bernoullischen Zahlen.
Berlin, 1893.
polynomials, i.e. they satisfy the condition
[4] CHISTYAKOV, I.I.: Bernoulli numbers, Moscow, 1895 (in Rus- B~(x) = nBn-l(x)
sian).

l
[5] NIELSEN, N.: Traite eiementaire de nombres de Bernoulli, Paris, and are closely connected with the Euler polynomials
1923.
[6] KUDRYAVTSEV, V.A.: Summation of powers of natural numbers
and Bernoulli numbers, Moscow-Leningrad, 1936 (in Russian).
En-I(x) = ~ [Bn(X)-2 nBn [ ] ~
[7] NORLUND, N.E.: Vorlesungen uber DijJerenzenrechnung, Berlin,
1924. The generating function of the Bernoulli polynomials
[8] GELFOND, A.O. [A.O. GEL'FOND]: DijJerenzenrechnung, IS
Deutsch. Verlag Wissenschaft., 1958 (translated from the Rus- ~ = ~ Bn(~) (n.
sian). e t -1 n =0 n.
[9] HARDY, G.H.: Divergent series, Clarendon, 1949.
Bernoulli polynomials are expandable into Fourier
Yu.N. Subbotin
series: For n = 1
Editorial comments. Bernoulli numbers play an impor-
tant role in the theory of cyclotomic fields and Fermat's last BI(x) = x--.l = - ~ sin 27TSX , O<x<l,
2 s= I S7T
theorem, see [A1], pp. 40-41, and [A2]. E.g., if P is an odd
prime number that does not divide the numerators of and for n = 2, 3, ... ,
8 2 , 8 4 , . . . ,8p - 3 , then x P + yP =zP has no solutions in
COS
x, y, ZEN. (See also Cyclotomic field; Fermat great 00

theorem.) Bn(x) = -2n!~--'-------'--, O~x~l.


s= I (21TS)n
References
[A 1] LANG, S.: Cyclotomic fields, Springer, 1978. Bernoulli polynomials satisfy the relationships
[A2] RiBENBOIM, P.: Thirteen lectures on Fermat's last theorem,
Springer, 1979. Bn(mx) = mn-Imi:,IBn [x+...i...]
s=o m
AMS 1980 Subject Classification: 1OL 10, 10H40
(the multiplication theorem);
BERNOULLI POLYNOMIALS - The polynomials Bn(l-x) = (-I)nBn(x)
(the complement theorem);
Bn(x) = s~o [:]Bsnn-s (n=O, 1, ... ),
Bn(x +y) = s~o [:]Bs(y)Xn-s
where Bs are the Bernoulli numbers. Thus, for
n=O,I,2,3, (the theorem on addition of arguments).
Bernoulli polynomials are employed to express the
residual term of the Euler- MacLaurin formula, and for

361
BERNOULLI POLYNOMIALS

the expansion of functions into series. Many important Thus, a Bernoulli random walk may be described in
properties of Bernoulli numbers are a consequence of the following terms. A particle moves 'randomly' along
the properties of Bernoulli polynomials. Bernoulli poly- the x-axis over a lattice of points of the form kh (k is
nomials are employed in the integral representation of an integer, h >0). The motion begins at the moment
differentiable periodic functions ( = 0, and the location of the particle is noted only at

I/;{x) = 2~ ff(t)dt+
-17
! f !/>k(t)P- I )(x+'1T-t)dt,
-'Tt
discrete moments of time 0, d(, 2dt, .... At each step
the value of the coordinate of the particle increases or
decreases by a magnitude h with probabilities p and
2k -I~ ['1T+t]
!/>k(t) = -k-! -Bk ~ , q = 1 - p, respectively, irrespective of the nature of the
previous motion. Thus, shifts in the positive and in the
and play an important part in the theory of approxima- negative directions ('successes' and 'failures') are
tion of such functions by trigonometric polynomials described by a Bernoulli trial scheme with probability
and other aggregates. Cf. Favard problem. of success p. The Bernoulli random walk is usually
Various generalizations of the Bernoulli polynomials represented geometrically: take the (-axis as the
have been proposed. N.E. Norlund introduced general- abscissa, and the x-axis as the ordinate (cf. Fig. 1,
ized Bernoulli polynomials of order JI and degree n: showing the initial segment of the graph of a random
B~v)(x Iw) = B~v)(x IWI. ... ,wv ) walk of a particle beginning to move from zero).
(certain special cases of such polynomials had been pre-
viously considered by V.G. Immenetskii, N.Ya. Sonin 4h
and D.M. Sintsov). Let 3h
2h
Il rex)
wJ'
= (x +w)- (x)
W h

and D.t i
I 3.6.t 5.c\1 I
B~O)(xlw)=xn, n=O,I, ... ; -h 2.c\1 4.6.1 6.c\t

then BAP)(x Iw) is successively determined as the poly-


nomial solution of degree n of the finite-difference Fig. 1. The initial segment of the graph
equation representing the motion of a particle per-
forming a Bernoulli random walk.
Ilw.B~O)(xlwl"" ,wv) = nB~V_-II)(xlw" ... ,Wv-I),
JI = 1, 2, ... , with initial conditions Let Xj be the random variable corresponding to the
B~P)(Olw" ... ,wp) = B~P)[WI"" ,wv), displacement of the particle in the j-th step. Then
XI, X 2 , , is a sequence of independent random
where B~P)[WI"" ,wp] (a generalized Bernoulli
variables. The coordinate of the randomly-moving par-
number) is found from the recurrence relations
ticle at the moment nM is equal to the sum
~
~
[n]
s WSII B(p)
n-s
= wn"B(v-I)
n-l
Sn = X I + ... + Xn. This is why the graph of a Ber-
s =1 noulli random walk is an illustrative representation of
(B~')[wil = w1Bn , Bd P ) = I, B~O) = 0, Bd P) = I). the behaviour of the cumulative sums of random vari-
ables; moreover, many typical features of the fluctua-
References tions are preserved for the sums of much more general
[1] EULER, L.: Institutiones calculi differentialis, 10, Teubner, 1980.
[2] NORLUND, N.E.: Vorlesungen iiber Differenzenrechnung, Berlin, random variables as well. This graph also shows the
1924. changes in the capital at the disposal of one of the
[3] BATEMAN, H. and ERDELYl, A.: Higher transcendental functions.
players in the classical ruin problem (the formulas giv-
The Gamma function. The hypergeometric function. Legendre
functions, Vol. I, McGraw-Hill, 1953. ing the probabilities of numerous events taking place in
[4] LIKHIN, V.V.: Istor Mat. Issled. (1959),59-134. the course of a Bernoulli random walk were in fact
Yu.N. Subbotin developed in connection with this problem).
AMS 1980 Subject Classification: 33AXX, 1OA 10 A Bernoulli random walk is used in physics as a
rough description of one-dimensional diffusion
BERNOULLI RANDOM WALK - A random walk processes (cf. Diffusion process) and of the Brownian
generated by Bernoulli trials. The example of a Ber- motion of material particles under collisions with
noulli random walk may be used to explain certain molecules.
basic features of more general random walks. In partic- Important facts involved in a Bernoulli random walk
ular, even in this very simple scheme there appear pro- will be described below. In so doing, it is assumed that
perties of 'randomness' which are intuitively paradoxi- 6.t = I, h = 1.
cal. Probabilities of returning. Let the walk begin from

362
BERNOULlI RANDOM WALK

zero. The probability of at least one return to zero then side more than 9930 units of time with a probability
is I - Ip - q I, that is, one in the symmetric case ;;'0.1, i.e. roughly speaking, such a graph will be
p = q = 1 /2, and less than one if p=!=q. In the sym- observed not less frequently than in one case out of ten
metric case the values of T] (i.e. the time elapsed until (even though this seems absurd at first sight).
the first return to zero) and T2 (the time between the Maximum deviation. If p >q or p <q, the randomly-
first and the second return to zero), etc., are indepen- moving particle will move towards + 00 or - 00 with
dent random variables with an infinite mathematical probability one. Thus, if p<q, one defines the random
expectation. The time elapsed until the N-th return, i.e. variable
the sum T] + ... +TN, increases as N 2 , while the aver- M+ = max S,
O<;;j< 00 J
age number N 2n of returns during 2n steps is given by
and then the probability of M+ =x is
the formula
E(N 2n ) = 2n22n+ I [2n1-
n
I

which increases as Vn: A bounded Bernoulli random walk. One often consid-
Vn
E(N2n)~2y:;;' ers a Bernoulli random walk in the presence of absorb-
ing or reflecting barriers. For instance, let the walk
The consequence is paradoxical: In a symmetric Ber- begin from zero. The presence of an absorbing barrier
noulli random walk, the intervals ('waves') between the at a point a is manifested by that, on reaching this
successive returns to zero on the graph are surprisingly point, the particle ceases to move. In the presence of a
long (Fig. 2).
reflecting barrier at the point a =(k + 1/2), k;;'O an
St 2v'1 integer, the particle will move from k to k - 1 with pro-
800 bability q, and will remain in place with probability p.
600
Finite-difference equations are a principal tool in com-
puting the probabilities of the absorption and of attain-
400 ing specified points. For instance, let the absorbing
barrier be located at the point - a (a >0). If Zt.x is the
200
,,'i probability of a particle located at x at the moment of
o fr---,f-!!J.?:-t<:!Y:-::=---+---~~~:'---_.J.-:-:15,fO~OO:;;"~--
-;-
',.-, ---'Fj2;::O::-:OO::::oo'!- t time t to be absorbed before or at the moment n, then
-\1'''.1 \ { "'I..",; >,,\ ""'; : ,.;,-, the following equation is valid
- 200 I : :' \ _"" . If :t v'....'V~it

- 400
Zt,x = qZt+l,x-1 +pZt+l,x+l, x>-a,
-Vi
with the following obvious boundary conditions:
- 600
Zt, -a = I, O~t~n,
- 800
-2vt Zn,x = 0, x> - a.
Fig. 2. Graphs of three Bernoulli random The solution of this problem for p = q = 1 /2 was
walks: each one was observed during known to A. de Moivre and to P. Laplace. The Laplace
200,000 units of time. formula is:
2 /2 . (
j
1T

Another related feature is that the least probable values = 1-- sm a+x)p( .J..)"d.J.. (*)
of Tn / n (the fraction of time that the graph is above
Zt x
, 'IT .
smcJ>
COS,/, ,/"

the abscissa) are those close to 1/2. More exactly, the where
following theorem is valid: If k---'?oo, n -k---'?oo, then P = a+x+2 [ n-t-x-a]
2 +1.
the formula
p2n,2k ~
'lTn
Vx(l- x) ' The transition to diffusion processes. As an example,
where x = xn,k = k / n, gives the probability p 2n, 2k of let p = q = 1 /2, 6.1 = 1 / N, h = 1 / Vii. Then, as
the equality T 2n = 2k. A corollary is the so-called N ---'?oo, many probabilities, calculated for a Bernoulli
arcsine law: For each O<a< I the probability of the random walk, tend to limits which are equal to the
inequality Tn / n <a tends to respective probabilities of a Brownian motion. As an
example, consider the probability of a particle, which
-Ija vi dx
= -2 .,/
arc Sill Va. has departed from zero, reaching a barrier located at a
'IT x(1-x) 'IT
point 0: before or at the moment T. The limit transition
It is possible to prove, starting from this fact, that dur- from formula (*), for n / N = T, 1 =0, a / Vii =a,
ing 10,000 steps the particle remains on the positive gives the magnitude

363
BERNOULLI RANDOM WALK

__1_ ~ .!!!. _1. ~ _1_


1_ _
2_ J e- z'/2dz = _2_ f
a/VT 00
e- z'/2dz, 50 -. n 5 -. 50
--Ii,; 0 --Ii,; a/ VT will be higher than 0.999 if n ;;;;:'25,550. By introducing
which is equal to the probability that the coordinate a slight improvement in the original reasoning of Ber-
X(p) of a particle executing a Brownian motion satisfies noulli, it is possible to conclude that it is sufficient to
the inequality select a value of n obeying the condition
min X(p) ..;; -a, 1 + 1 1
O....... T n > --Iog-+-,
2 '1/
i.e. to the probability that the particle is absorbed at
the barrier -a. For the sake of a more or less complete which gives in turn, for the probability 1- P of the ine-
description of such limit relationships it is expedient to quality
adopt a general standpoint and to consider the transi-
tion from the discrete process of 'cumulative sums' to a
an estimate of the form
continuous random process (cf. Limit theorems).
The scheme of a Bernoulli random walk is very use- 2exp { -+n~}
ful in explaining the characteristic features of the
behaviour of sums of random variables such as the The condition obtained for the above example is
strong law of large numbers and the law of the iterated n ;;;;:'17,665 (more sophisticated estimates show that it is
logarithm. sufficient to take n ;;;;:'6502; one may note, for the sake
of comparison, that the de Moivre- Laplace theorem
References
[I] FELLER, W.: An introduction to probability theory and its appli- yields 6498 as the approximate value of no). Other esti-
cations, I, Wiley, 1957-1971, Chapt. 14. Yu. V. Prokhorov mates for I - P may be obtained using the Bernstein
inequality and its analogues. See also Binomial distribu-
AMS 1980 Subject Classification: 60J15 tion.
References
BERNOULLI SCHEME - A synonym for Bernoulli [I] BERNOULLI, J.: Ars conjectandi, Vol. 4, Basle, 1713.
trials. [2] MARKOV, A.A.: Calculus ofprobabilities, Moscow, 1924 (in
Russian).
AMS 1980 Subject Classification: 60-XX [3] BERNSHTEiN, S.N.: Probability theory, Moscow-Leningrad, 1946
(in Russian).
Yu. V. Prokhorov
BERNOULLI THEOREM - The (historically) original
form of the (weak) law of large numbers. The theorem Editorial comments.
appeared in the fourth part of Jacob Bernoulli's book References
[A1] FELLER, W.: An introduction to probability theory and its
Ars conjectandi (The art of conjecturing). This part may
applications, 1, Wiley, 1965.
be considered as the first serious study ever of proba- [A2] SERFLING, R.I.: Approximation theorems of mathematical
bility theory. The book was published in 1713 by N. statistics, Wiley, 1980, 6, 96.
Bernoulli (a nephew of Jacob Bernoulli). The theorem AMS 1980 Subject Classification: 60F05
deals with sequences of independent trials, in each one
of which the probability of occurrence of some event BERNOULLI TRIALS - Independent trials, each one
('success') is p. Let n be the number of trial!. and let m of which can have only two results ('success' or
be the random variable equal to the number of success- 'failure') such that the probabilities of the results do
ful events. The Bernoulli theorem states that, whatever not change from one trial to another. Bernoulli trials
the value of the positive numbers t: and 1/, the probabil- are one of the principal schemes considered in proba-
ity P of the inequality bility theory.
Let p be the probability of success, let q = 1- P be
the probability of failure, and let 1 denote the
will be higher than 1-1/ for all sufficiently large n occurrence of success, while 0 denotes the occurrence of
(n ;;;;:.no). The proof of this theorem, which was given by a failure. The probability of a given sequence of suc-
Bernoulli and which was exclusively based on a study cessful or unsuccessful events, e.g.
of the decrease of probabilities in the binomial distribu- 1 0 0 1 1 0 1 0 ... I,
tion as one moves away from the most probable value,
is equal to
was accompanied by an inequality which made it possi-
ble to point out a certain bound for the given no if t: pqqppqpq "'p=pmqn-m,
and 1/ were given. Thus, it was found by Bernoulli that where m is the number of successful events in the series
if p =2 / 5, the probability of the inequality of n trials under consideration. Many frequently occur-

364
BERNSTEIN INEQUALITY

ring probability distributions are connected with Ber- [3] KAc, M.: Statistical independence in probability, analysis and
noulli trials. Let Sn be the random variable which is number theory, Math. Assoc. Amer., 1963. A V P kh
. . ro orov
equal to the number of successes in n Bernoulli trials.
AMS 1980 Subject Classification: 60-XX
The probability of the event {Sn = k} is then
BERNSTEIN INEQUALITY - 1) Bernstein's inequal-
[knJ P q ,k -
k n -k -
0, ... ,n,
ity in probability theory is a more precise formulation of
the classical Chebyshev ineqUality in probability theory,
i.e. Sn has a binomial distribution. As n-HYJ, this distri-
proposed by S.N. Bernshtein [1] in 1911; it permits one
bution can be approximated by the normal distribution
to estimate the probability of large deviations by a
or by the Poisson distribution. Let Y 1 be the number of
monotone decreasing exponential function (cf. Probabil-
trials prior to the first success. The probability of the
ity of large deviations). In fact, if the equations
event {Y 1 = k} then is
EX) = 0, EX] = hj , j= I, ... ,n,
qkp, k=O,I, ... ,
hold for the independent random variables XI, ... , Xn
i.e. Y 1 has a geometric distribution. If Yr is the number with
of failures which precede the r-th appearance of a suc-
cessful result, Yr has the so-called negative binomial dis-
tribution. The number of successful outcomes of Ber- (where 1>2 and H is a constant independent of j), then
noulli trials can be represented as the sum the following inequality of Bernstein (where r>O) is
XI + ... + Xn of independent random variables, in valid for the sum Sn=X) + ... +Xn:
which J0 = I if the j-th trial was a success, and Xj =0
otherwise. This is why many important laws of proba- P{ I Sn I >r} ~ 2 exp { - 2(Bnr~ Hr) }, (I)
bility theory dealing with sums of independent vari-
ables were originally established for Bernoulli trial where Bn = ~bj' For identically-distributed bounded
schemes (cf. Bernoulli theorem weak) Law of large random variables Xl (E.Aj =0, EX] =0'2 and I J0 I ~L,
numbers); Strong law of large numbers; Law of the j = 1, ... ,n) inequality (1) takes its simplest form:
iterated logarithm; Central limit theorem; etc.).
A rigorous study of infinite sequences of Bernoulli P{ISnl>t(JVn}~2exp{-2(l::/3)}' (2)
trials requires the introduction of a probability measure
where a=Lt / VnO'. A.N. Kolmogorov gave a lower
in the space of infinite sequences of zeros and ones.
estimate of the probability in (1). The
This may be done directly or by the method illustrated
Bernstein - Kolmogorov estimates are used, in particu-
for the case p = q = 1 /2 below. Let w be a number,
lar, in proving the law of the iterated logarithm. Some
uniformly distributed on the segment (0, 1), and let
idea of the accuracy of (2) may be obtained by compar-
00 X(w) ing it with the approximate value of the left-hand side
w=~~
j =1 2J ' of (2) which is obtained by the central limit theorem in
where J0( w) = 0 or 1, be the expansion of w into a the form
binary fraction. Then the J0, j = 1,2, ... , are indepen-

dent and assume the values and 1 with probability
-2-Je-
V2;
u2 /2du = _2_
I V2;t
[1-~Je-12/2,
t
I / 2 each, i.e. the succession of zeros and ones in the
where 0<0<1. Subsequent to 1967, Bernstein's ine-
expansion of w is described by the Bernoulli trial
qualities were extended to include multi-dimensional
scheme with p = 1 /2. However, the measure on (0, 1)
and infinite-dimensional cases.
can also be specified so as to obtain Bernoulli trials
with any desired p (if r:j= I /2 the measure obtained is References
[I] BERNSHTEiN, S.N.: Probability theory, Moscow-Leningrad, 1946
singular with respect to the Lebesgue measure). (in Russian).
Bernoulli trials are often treated geometrically (cf. [2] KOLMOGOROFF, A.N. [A.N. KOLMOGOROV]: 'Ueber das Gesetz
Bernoulli random walk). Certain probabilities of a large des iterierten Logarithmus', Math. Ann. 101 (1929), 126-135.
[3] HOEFFDING, W.: 'Probability inequalities for sums of indepen-
number of events connected with Bernoulli trials were dent random variables', 1. Amer. Statist. Assoc. 58 (1963), 13-
computed in the initial stage of development of proba- 30.
bility theory in the context of the ruin problem. [4] YURINSKli, V.V.: 'Exponential inequalities for sums of random
vectors', 1. Multivariate Anal. 6 (1976), 473-499.
References A. V Prokhorov
[I] GNEDENKO, B.Y.: The theory of probability, Chelsea, reprint,
1962 (translated from the Russian).
2) Bernstein's inequality for the derivative of a tri-
[2] FELLER, W.: An introduction to probability theory and its appli- gonometric or algebraiC polynomial gives an estimate of
cations, 1-2, Wiley, 1957-1971. this derivative in terms of the polynomial itself. If

365
BERNSTEIN INEQUALITY

Tn(x) is a trigonometric polynomial of degree not BERNSTEIN INTERPOLATION METHOD - A


exceeding n and if sequence of algebraic polynomials converging uniformly
on [ - 1, 1] to a function f (x) that is continuous on this
M = max I Tn{x) I,
O~x~2'1T
interval. More precisely, Bernstein's interpolation
then the following inequalities are valid for all x (cf. method is a sequence of algebraic polynomials
[1]): I T'n'){x) I .;;; Mn', r= 1, 2, ....
n
~A~n)Tn{x)
These estimates cannot be improved, since the number Pn(j;x) = ---,-k_~,--:I_ __ n=I,2, ... ,
Tn(x~n)xx - x~n) ,
M= 1 for
Tn (x ) = cos n{x - xo) where the
Tn(x) = cos{n arc cos x)
is sharp:
max I T'n'){x) I = n'. are the Chebyshev polynomials; the
x

Bernstein's inequality for trigonometric polynomials is


x~n) = cos [ (2k 2~ 1)'/T ]
a special case of the following theorem [2]: If f(x) is
an entire function of order ~a and if are the interpolation nodes; and
M= sup
-oo<x<oo
I/(x)l, A~n) = I(x~n)
then one has if k=l=2/s, I is an arbitrary positive integer, n = 21q + r,
sup Ij<')(x) I .;;; Mo" (r=I,2, .. .). q;;;.l, O~r<2/, s=l, ... ,q; otherwise
-oo<x<oo /-1 /-1
Bernstein's inequality for an algebraic polynomial A?} = ~/(X~?(s-I)+2i+I)- ~/(X~?(s-I)+2i)'
i~O i~1
has the following form [1]: If the polynomial
The ratio between the degree of the polynomial
Pn(x) = ~ akxk Pn(f;x) and the number of points at which Pn(f;x)
k ~o
equals f(x) is (n-l)/(n-q), which tends to
satifies the condition 21 / (2/-1) as n~oo; if I is sufficiently large, this limit
I Pn{x) I .;;; M, a,;;;x';;;b, is arbitrary close to one. The method was introduced
by S.N. Bernstein [S.N. Bernshtein] in 1931 [l].
then its derivative P~(x) has the property
References
I P~{x) I .;;; vex -~~b -x)' a<x<b,
[I] BERNSHTEIN, S.N.: Collected works, Vol. 2, Moscow, 1954, pp.
130-140 (in Russian).
P.P. Korovkin
which cannot be improved. As was noted by S.N.
Bernshtein [1], this inequality is a consequence of the Editorial comments. This method of interpolation seems
not very well known in the West. There is, however, a well-
proof of the Markov inequality given by A.A. Markov.
known method of Bernstein that uses the special interpola-
Bernstein's inequalities are in fact employed in prov-
tion nodes k / n, k=O, ... , n, for bounded functions on
ing converse theorems in the theory of approximation [0, 1]. This method is given by the Bernstein polynomials.
of functions. There are a number of generalizations of The sequence of Bernstein polynomials Bn( f ; x) constructed
Bernstein's inequality, in particular for entire functions for a bounded function f on [0, 1] converges to f(x) at each
in several variables. point of continuity XE[O, 1] of f. If f is continuous on [0, 1],
References the sequence converges uniformly (to f) on [0, 1]. If f is dif-
[I] BERNSTEIN, S.N. [S.N. BERNSHTEIN]: 'Sur l'ordre de la meil- ferentiable, B~( f ; x)-. f(x) (at each point of continuity of
leure approximation des fonctions continues par des f), cf [A 1].
polynomes', Acad. R. Belgique. Cl. Sci. Mem. Coil. 4. Stir. II 4 This method of Bernstein is often used to prove the
(1922).
Weierstrass theorem (on approximation). For a generaliza-
[2] BERNSTEIN, S.N. [S.N. BERNSHTEIN]: 'Sur une propriete des
fonctions entieres', C.R. Acad. Sci. Paris 176 (1923),1603-1605. tion of the method (the monotone-operator theorem), see
[3] NIKOL'SKII, S.M.: Approximation offunctions of several variables [A2], Chapt. 3, Sect. 3. See also Approximation of func-
and imbedding theoremf, Springer, 1975 (translated from the tions, linear methods.
Russian).
N.P. Kornelchuk References
VP. Motornyr [Ai] DAVIS, P.J.: Interpolation and approximation, Dover, reprint,
Editorial comments. 1975.
[A2] CHENEY, E.W.: Introduction to approximation theory, Chel-
References
sea, reprint, 1982.
[Ai] NATANSON, I.P.: ConstructIVe theory ot tunctlOns, 1-3, Ungar,
1964-1965 (translated from the Russian), AMS 1980 Subject Classification: 41 A 10
[A2] LORENTZ, G.G.: Approximation ot tunctlons, Holt, Rinehart
and Winston, 1966.
BERNSTEIN METHOD, method of auxiliary func-
AMS 1980 Subject Classification: 60E15, 42A05, tio/1.1 - A method which is employed in the theory of
41AXX linear and non-linear partial differential equations.

366
BERNSTEIN METHOD

Bernstein's method consists in introducing certain new


(auxiliary) functions, which depend on the solution +~u [a [~J2
ox
+2b~~+c
ox oy
[~J2]
oy
+
being sought, and which make it possible to establish a
priori estimates of the maximum modulus of the deriva- ou ou u_
+2d-+2e-+g- = Q.
oX oy a
tives of this solution of the required order.
A simple example of the application of Bernstein's Let M be the upper bound of I a I, I b I, I c I in D,
method is the a priori estimate of the modulus of the and let a = 1 /8M. If au / ax and au / ay are con-
derivatives of the solution of the Dirichlet problem for sidered as current coordinates in the plane, and x, y, z
the non-linear (quasi-linear) elliptic equation as parameters, the equation Q =0 is the equation of an
ellipse, since the determinant ale 1 - bt > 2u 2 /3, where
02Z 02 Z
OX 2 + oy2 =f
[ OZ OZ ] _
x,y, Z, ~' oy =
a) = ~ [1+ s~ J, b) = stu' C) = ~ [1+ s~ ].
= [~J2 +2b~~+c [~J2 +
a
ox ox oy oy
(*)
Thus, for any au / ax and au / ay, Q will not be
smaller than a certain negative number - P, Q;:;:' - P
+ 2d~ + 2e~ +g in the disc D, (the number P is readily obtained in explicit form). If
ox oy
one introduces the function u 1 given by the formula
z Ie = 0, P
u) =: U+ 4(x 2 +y2),
where a, b, c, d, e, g are smooth functions of x, y, z;
C is the circle, the boundary of the disc D with radius one obtains
R (the assumption to the effect that D is a disc and 02U) 02U)
z I c = 0 is immaterial, since the general case of an 2 + -2-
-0
x oy
= Q+ P ;;;. 0,
arbitrary-connected domain and inhomogeneous boun-
and u 1 atains its maximum on the boundary C of the
dary condition is readily reduced to the case under con-
domain D and, since u 1 is constant on C, one has
sideration by a change of the function and a conformal
transformation of the domain).
If .fz ;:;:'0, then the estimated maximum modulus n
OU 1
-op ;;;. and ~op""'" 0
:>-, -
1
-PR
2 '
where R is the radius of the circle C. Hence it is possi-
max I z(x,y) I
= (x.I')ED+C ble to find a negative lower bound for az / ap:
n

of the solution of problem (*) is immediately obtained ~ _ ~~:>-, _ aPR


from the maximum principle. op - u op ~ 2e(2n+a)/a'
In order to prove that a regular solution of problem If the same reasoning is applied to a second auxiliary
(*) exists it is sufficient to have a priori estimates of the function u
maximum modulus of the derivatives of the solution up
1
to the third order (cf. Continuation method (to a Z = l/>2(u) =: -n -a+aln - - ,
l-u
parametrized family. To estimate maxc I az / ax I
one obtains an estimate from above
and maxc I az / ay I, it is sufficient to estimate
maxc I az / ap I (since z Ie =0), where p, () are polar ~ ,;::: aP)R (n +a)/2
op '" 2 e
coordinates in the disc D. Now introduce a new (auxili-
ary) function u, given by the formula Thus, maxc I az / ap I is estimated, which means that
maxc I az / ax I and maxc I az / ay I are estimated as
Z = I/(u) = -n -a+alnu, well. The estimate of the maximum modulus of the
where 0:>0 will be selected later. The function first derivatives inside the domain D is performed in a
u =u (x, y) varies from e to e(2n +<>l / <> in the same direc- similar manner: introduce an auxiliary function u given
tion as z(x,y) (-n~z~n). Since by the formula
OZ a OU Z = cJ>3(U) = -n+alnlnu.
ox u ox'
The function u varies in the same direction as z, from e
2n la
to ee . In view of (*), on may write the following
expression for u
and similar results for derivatives with respect to y, it 02U + 02u = _1_ [(l+lnu+aa) [~J2 +2ab~~+
follows that u satisfies the equation ox 2 oy2 u Inu ox ox oy

~+~
ox 2 oy2
= ~U [ [~J2
ox
+ [~J2] +
oy +(1+1nu+ac) [ OUJ2]
oy OU OU ulnu
+2d~+2e oy +g-a- - Q).

367
BERNSTEIN METHOD

Considerations similar to those given above show that segment O~x ~ I, the convergence
if the function B~kV;X) ~ j<k)(X)

is uniform on this segment. A study was made ([IB],


[5]) of the convergence of Bernstein polynomials in the
attains a maximum in the domain D, this maximum complex plane if f is analytic on the segment O~x ~ 1.
does not exceed some number, the value of which References
depends solely on nand M. This yields the required [IA] BERNSHTEiN, S.N.: Collected works, Vol. I, Moscow, 1952, pp.
estimates of maxD 1 oz / ox
1 and maxD 1 I. oz / oy 105-106.
[IB) BERNSHTEiN, S.N.: Collected works, Vol. 2, Moscow, 1954, pp.
Bernstein's method may also be used to estimate, in 310-348.
a similar manner, the maximum modulus in the domain [2) GoNCHAROY, V.L.: The theory of interpolation and approxima-
D + C of all highest derivatives of the solution (the only tion offunctions, Moscow, 1954 (in Russian).
[3) BASKAKOV, V.A.: 'An instance of a sequence of linear positive
other operation which is required is the differentiation operators in the space of continuous functions', Dokl. Akad.
of the initial equation). Nauk SSSR 113, no. 2 (1957), 249-251 (in Russian).
The method was first utilized by S.N. Bernstein [S.N. [4) KOROVKIN, P.P.: Linear operators and approximation theory,
Hindushtan Publ. Comp., Delhi, 1960 (translated from the Rus-
Bernshteln] [I]. The method was subsequently sian).
extended and was systematically utilized in the study of [5) KANTOROYICH, L.V.: Izv. Akad. Nauk SSSR Ser. Mat. 8
various problems for elliptic and parabolic differential (1931), 1103-1115.
P.P. Korovkin
operators [3], [4], [5].
Editorial comments. There is also a multi-variable gen-
References eralization: generalized Bernstein polynomials defined by the
[IA] BERNSTEIN, S.N. [S.N. BERNSHTEIN]: 'Sur la generalisation du
completely analogous formula
probleme de Dirichlet (premiere partie)', Math. Ann. 62 (\ 906),
253-271. BnU, Xl,'" ,Xk) =
[lB] BERNSTEIN, S.N. [S.N. BERNSHTEIN]: 'Sur la generalisation du
probleme de Dirichlet (deuxieme partie)', Math. Ann. 69
(\ 910), 82-136.
~
n,

i, =0
~ f[~'
1,=0nl
... ,i...] [~l]
nk 1
[~k]k X
[2] BERNSHTEIN, S.N.: Collected works, Vol. 3, Moscow, 1960.
XX;'(1-Xlt,-I,. X~(1-Xk)n'-I'.
[3] LADYZHENSKAYA, O.A. and URAL'TSEYA, N.N.: Linear and
quasilinear elliptiC equations, Academic Press, 1968 (translated Here n stands for the multi-index n = (nl , ... ,nk)'
from the Russian). As in the one variable case these provide explicit polyno-
[4] POGORELOY, A.V.: Die Verbiegung konvexer Fliichen,
Akademie-Verlag, 1957 (translated from the Russian). mial approximants for the more-variable Weierstrass approx-
[5] OLEJNIK, O.A. and KRUZHKOY, S.N.: 'Quasi-linear parabolic imation and Stone-Weierstrass theorems. For the
equations of second order in several independent variables', behaviour of Bernstein polynomials in the complex plane
Russian Math. Surveys 16, no. 2 (1961), 105-146. (Uspekhi and applications to movement problems, cf. also [A3].
Mal. Nauk 16, no. 5 (\961),115-155)
I.A. Shishmarev References
[A1) DAYIS, P.J.: Interpolation and approximation, Dover, reprint,
AMS 1980 Subject Classification: 35J65, 35K60
1975, pp. 108-126.
[A2] RIVLIN, TH.J.: An introduction to the approximation of func-
BERNSTEIN POLYNOMIALS - Algebraic polynomi- tions, Dover, reprint, 1981.
als defined by the formula [A3) LORENTZ, G.G.: Bernstein polynomials, Univ. of Toronto
Press, 1953.
Bnif; x) = Bn(x) =
AMS 1980 Subject Classification: 33AXX, 41 A 10,
k~![ ~ j [~jxk(l_x)n-k, n=1,2, .... 41A17

Introduced by S.N. Bernshtein in 1912 (cf. [1]). The BERNSTEIN - ROGOSINSKl SUMMATION
sequence of Bernstein polynomials converges uniformly METHOD - One of the methods for summing Fourier
to a function f on the segment O~x ~ 1 if f is continu- series; denoted by (BR, an). A trigonometric series
ous on this segment. For a function which is bounded
by C, 0 < C < 1, with a discontinuity of the first kind,
f( C - ) +f( C + ) is summable by the Bernstein - Rogosinski method at a
Bnif; C) ~ 2 .
point Xo to the value S if the following condition IS
The equation satisfied:

Bnif; c)- f(c) = (c)c(l-c)


2n
+0 [11
'n

is valid if f is twice differentiable at the point c. If the


k-th derivative f(k) of the function is continuous on the

36R
BERRY-ESSEEN INEQUALITY

where {an}, an >0, an~O, is a sequence of numbers, minimal surfaces; if k <n - 1, then already for n;;;:.4 it
and where the Sn(x) are the partial sums of the series is possible to find non-linear minimal surfaces Fk,
(*). defined over any Ek.
W. Rogosinski [1] first (1924) considered the case References
an = p7T / 2n, where p is an odd number, and then [I] BERNSTEIN, S.N. [S.N. BERNSHTEiN]: 'D'ber ein geometrisches
(1925) the general case. S.N. Bernstein [S.N. Theorem und seine Anwendung auf die partiellen Differential-
gleichungen vom elliptischen Typus', Math. Z. 26 (1927),551-
Bernshtein] [2] considered (1930) the case 558 (translated from the French).
an =7T / (2n + 1). The (BR, an)-method sums the Fourier [2] NITSCHE, J.e.e.: Vorlesungen uber Minimaljldchen, Springer,
series of a function f EL [0, 27T] in the cases an =p7T / 2n 1975.
[3] OSSERMAN, R.: 'Minimal varieties', Bull. A mer. Math. Soc. 75
and an =7T / (2n + 1) at the points of continuity of the (1969), 1092-1120.
function to its value and is one of the regular :summa- [4] OSSERMAN, R.: A survey of minimal surfaces, v. Nostrand, 1969.
tion methods. [5] FOMENKo, A.T.: Plateau's problem, Gordon and Breach, 1987
(translated from the Russian).
The Bernstein - Rogosinski sums Bn(x, an) are I.Kh. Sabitov
employed as an approximation procedure. In both
Editorial comments. As an important reference for the
cases described above they realize an approximation of generalizations of Bernstein's theorem the paper of
the same order as the best approximation for functions Bombieri~de Giorgi~Giusti [A1] can be Quoted. The origi-
of the classes Lip a and WI Lip a. nal of [1] is [A2].
References References
[I] ROGOSINSKl, W.W.: 'Ueber die Abschnitte trigonometischer [A 1] BOMBIERI, E., GIORGI. E. DE and GIUSTI, E.: 'Minimal cones
Reihen', Math. Ann. 95 (1925), 110-134. and the Bernstein theorem', InventlOnes Math. 7 (1969),
[2] BERNSHTEiN, S.N.: Collected works, Vol. I, Moscow, 1952, p. 243-269.
37 (in Russian). [A2] BERNSTEIN, S.N.: 'Sur une theoreme de geometrie et ses
[3] HARDY, G.H.: Divergent series, Clarendon, 1949. applications aux equations derivees partlelles du type ellip-
A.A. Zakharov tique', Comm Soc. Math. Kharkov 15 (1915-1917), 38-45.
Editorial comments. AMS 1980 Subject Classification: 53A 10, 53A07
References
[A1] BEEKMANN, W. and ZELLER, K.: Theorie der Limitierungsver- BERRY - ESSEEN INEQUALITY - An inequality
fahren, Springer, 1970. giving an estimate of the deviation of the distribution
AMS 1980 Subject Classification: 40GXX, 42A24 function of a sum of independent random variables
from the normal distribution function. Let X I, . . . ,Xn
BERNSTEIN THEOREM on minimal surfaces - If a be independent random variables with the same distri-
minimal surface is given by the equation z =/ (x, y), bution such that '
where / has continuous partial derivatives of the first
EX} = 0, EX] = 02 > 0, E 1 JS 13 < 00.
and second orders for all real x and y, then F is a
plane. A proof of this theorem, which is due to S.N. Let
Bernstein [S.N. Bernshtein] [1], is a consequence of a
more general theorem on the behaviour of surfaces with and
= -- f
1 x 1
non-pOSitive curvature. Various generalizations of <l>(x) e- r l2dt;
Bernstein's theorem have been proposed, most of them yf2; -00

being of the three following kinds: 1 Quantitative then, for any n,

,I; X}';;;;X}-<l>(X) I ,; ; A+,


improvements; e.g. obtaining a priori estimates of the
form I K(O) I ~const/ R2 where R is the radius of the sup Ip{ ovn;=1
x vn
disc over which the minimal surface z = / (x, y) is
defined and K(O) is the Gaussian curvature of the sur- where A is an absolute positive constant. This result
face at the centre of the disc. 2) The search for other a was obtained by A.e. Berry [1] and, independently, by
priori geometric conditions under which the minimal e.G. Esseen [2].
surface would be of a specific kind - a plane, a References
catenoid, etc.; for instance, if the spherical image of a [I] BERRY, A.e.: 'The accuracy of the Gaussian approximation to
the sum of independent variables', Trans. Amer. Math. Soc. 49
complete minimal surface contains no open set on the (1941), 122-136.
sphere, then such a minimal surface is a plane. 3) The [2] EsSEEN, e.G.: 'On the Liapunoff limit of error in the theory of
generalization of Bernstein's theorem to minimal sur- probability', Ark. Mat. Astr. Fysik 28A, no. 2 (1942), 1-19.
[3] I'ETROV, V.V.: Sums of independent random variables, Springer,
faces Fk of dimension k, located in a Euclidean space 1975 (translated from the Russian).
En; for example, if k = n -1, any minimal surface over v. V. Petrov
all En - I is uniquely determined if n ~ 8, and is a Editorial comments. The constant A can be taken to be
hyperplane, while if n >8, there exist non-planar 33/4, cf. [A1], p. 515 fl.

369
BERRY - ESSEEN INEQUALITY

References and if the limit (finite or infinite)


[A 1] FELLER, W.: An introduction to probability theory and its
applications, 2, Wiley, 1966. B = lim Bn
n~oc

AMS 1980 Subject Classification: 60E15 exists, then the series is convergent if B> 1 and IS
divergent if B < 1. Established by J. Bertrand.
BERTINI THEOREMS - Two theorems concermng References
the properties of linear systems (cL Linear system) on [I] FICHTENHOLZ, G.M.: Differential und Integralrechnung. I,
algebraic varieties, due to E. Bertini [1]. Deutsch. Verlag Wissenschaft., 1964.
L.D. Kudryavtsev
Let V be an algebraic variety over an algebraically
closed field k of characteristic 0, let L be a linear sys- AMS 1980 Subject Classification: 40A05
tem without fixed components on V and let W be the
BERTRAND CURVES, conjugate curves, Bertrand
image of the variety V under the mapping given by L.
pair - Two space curves L and L with common princi-
The following two theorems are known as the first and
pal normals. Let k I and k 2 be the curvature and the
the second Bertini theorem, respectively.
torsion of L respectively. For the curves L and L to
1) If dim W> 1, then almost all the divisors of the
be conjugate it is necessary and sufficient that
linear system L (i.e. all except a closed subset in the
parameter space P (L) not equal to P (L are irreduci- ak J sinw+ak 2 cosw = sinw
ble reduced algebraic varieties. is true. Here a is a constant, and w is the angle between
2) Almost all divisors of L have no singular points the tangent vectors of L and L. The name Bertrand
outside the basis points of the linear system L and the curve is also given to a curve L for which there exists a
singular points of the variety V. conjugate curve L. They were introduced by J. Ber-
Both Bertini theorems are invalid if the characteristic trand in 1850.
of the field is non-zero. E. V. Shikin
Conditions under which Bertini's theorems are valid Editorial comments. Bertrand's original paper is [A2]. A
for the case of a finite characteristic of the field have general reference is [A 1].
been studied [3], [6]. If dim W= 1, Bertini's theorem is References
replaced by the following theorem: Almost all fibres of [A 1] BLASCHKE, W. and LEICHTWEISS, K.: Elementare Differential-
geometrie, Springer, 1973.
the mapping CPL: V ~ Ware irreducible and reduced if [A2] BERTRAND, J.: 'Memoire sur la theorie des courbes a double
the function field k (W) is algebraically closed in the courbure', Liouvilles Journal 15 (1850).
field k(V) under the imbedding cP~: k(W)~k(V). If
AMS 1980 Subject Classification: 53A04
the characteristic of k is finite, the corresponding
theorem is true if the extension k (V) / k (W) is separ- BERTRAND PARADOX (in probability theory) - A
able [3], [6]. The Bertini theorems apply to linear sys- paradox connected with an inaccurate formulation of
tems of hyperplane sections, without restrictions on the the initial assumptions in solving problems in probabil-
characteristic of the field [5]. ity. Noted by J. Bertrand [1]. Bertrand's problem is con-
References cerned with the probability that the length of a chord,
[I] BERTINI. E.: Introduction to the projective geometry of hyper- chosen at random in a disc of radius one, is larger than
spaces. Messina, 1923 (in Italian).
the side length of the inscribed equilateral triangle. Ber-
[2] Algebraic surfaces. Moscow. 1965 (in Russian).
[3] BALDASSARRI, M.: Algebraic varieties, Springer, 1956. trand singled out three various values of the unknown
[4] AKIZUKI. Y.: 'Theorems of Bertini on linear systems', 1. Marh. probability (1 /2, 1 /3, I /4), depending on the
Soc. Japan 3, no. 1(\951).170-180. parameters characterizing the location of the chord. (In
[5] NAKAI, Y.: 'Note on the intersection of an algebraic variety
with the generic hyperplane'. Mem. Call. Sci. Univ. K\,oto Ser.
the first case by the distance p to the centre of the cir-
A Marh. 26. no. 2 (1950). 185-187. cle and the angle (j between the normal to the chord
[6] ZARISKI. 0.: 'The theorem of Bertini on the variable singular and the x-axis; in the second by the angular coordi-
points of a linear system of varieties'. Trans. Amer. Math. Soc.
nates ex and f3 of the points of intersection of the chord
56 (1944). 130-140.
[7] HARTSHORNE. R.: Algebraic Keometrv. Springer. 1977. with the circumference of the disc; in the third case by
V.A. Iskovskikh the Cartesian coordinates (x, y) of the base of the nor-
mal dropped from the centre of the disc. In all three
AMS 1980 Subject Classification 14C20
cases the coordinate origin coincides with the centre of
BERTRAND CRITERION of convergence of series the disc.) It was shown by H. Poincare [2] that the ori-
gin of the paradox is due to the fact that in each of the
' " ryJ
~n 1
a fl with f70sitive numhers as terms - If .
three cases the respective pair of parameters is assumed
= rn l' ~ - to be uniformly distrihuted in the given zone, so that
E"
l a" I I
I j - Illn 11
J three different problems are in fact solved. If the distri-

370
BESSEL EQUATION

bution of some pair (say, IX and {3) is fixed, the distri- sumrnable to degree p on any finite interval of the real
bution of all other parameters can be uniquely calcu- axis, is called a Besicovitch almost-periodic function if
lated (and is not necessarily uniform, even if IX and f3 to each >0 there corresponds a sufficiently homogene-
are uniformly distributed). From the geometrical point ous set of numbers (the so-called (BP, )-almost-periods
of view, the most natural assumption is that p and () of f(x)):
are independent and uniformly distributed in the inter- ... < 'T -2 < 'T -I < 'To < 'TI < ... ,
val O~p~ 1, 0~()~2'lT (cf. [3]).
such that for each i
References
[I) BERTRAND, J.L.: Calcul des probabilites, Gauthier-Villars, 1907. Mx{ I I(x +'Ti)-I(x) IP} < f!',
[2) POINCARE, H.: Calcul des probabilites, Gauthier-Villars, 1912.
[3) KENDALL, M.G. and MORAN, P.AP.: Geometric probability, and for each c >0
Griffin, 1963.
A. V Prokhorov
Editorial comments.

f
References
where
[A 1) SZEKELY, G.G.: Paradoxes in probability theory and 1 F(x)dx,
Mx{F(x)} = lim -2
7----+00 T ~T
mathematical statistics, Reidel, 1986, pp. 43-48.
- - 1 n
AMS 1980 Subject Classification: 60005, 62AXX M,{F(i)} = n~~ 2n + 1 i~nF(i).
BERTRAND POSTULATE - For any natural number Here F(x) is a real-valued function, defined, respec-
n > 3 there exists a prime number that is larger than n tively, for a real variable and an integer argument.
and smaller than 2n - 2. In its weaker formulation
References
Bertrand's postulate states that for any x> 1 there [1) BESICOVITCH, AS.: 'On mean values of functions of a complex
exists a prime number in the interval (x, 2x). The pos- and of a real variable', Proc. London Math. Soc. (2) 27 (1927),
tulate was advanced by J. Bertrand in 1845 on the 373-388.
[2) BESICOVITCH, AS.: 'On Parseval's theorem for Dirichlet series',
strength of tabulated data. and was proved by P.L Proc. London Math. Soc. (2) 26 (1927),25-34.
Chebyshev (cf. Chebyshev theorems on prime numbers). [3) LEVITAN, B.M.: Almost-periodic junctions, Moscow, 1953 (in
Russian).
References E.A. Bredikhina
[I) CHEBYSHEV, P.L: Oeuvres de P.L. Chebyshev, I, Chelsea.
reprint, 1961 (translated from the Russian). B M B d'kh' Editorial comments. Besicovitch developed his theory in
. . re I In
[A1], rather than in [1], [2].
Editorial comments. As is implicit in the article, for each p;;;.1 there is a class
of almost-periodic functions, denoted by 8 P . The first part of
References
[A 1) HARDY. G.H. and WRIGHT, E.M.: An introduction to the
the article deals with 8 2 , the rest is more general. General
theory of numbers, Clarendon Press, 1965, p. 343ff. references may be found under Almost-periodic function.
References
AMS 1980 Subject Classification: 1OL 10, 10H15 [A 1) BESICOVITCH, AS.: 'On generalized almost periodic func-
tions', Proc. London Math. Soc. (2) 25 (1926),495-512.
BESICOVITCH ALMOST-PERIODIC FUNCflONS -
A class of almost-periodic functions in which the analo- AMS 1980 Subject Classification: 42A75
gue of the Riesz- Fischer theorem is valid: Any tri-
gonometric series BESSEL EQUATION - A second-order linear ordi-
nary differential equation
x 2/ ' + xi +(x 2 -'?)y = 0, p=const, (I)
where
or, in self-adjoint form:

is the Fourier series of some B 2 -almost-periodic


func-
(xy ") + [x - ,? ]y
~ = o.
tion. The definition of these functions [I], [2] is based
on a generalization of the concept of an almost-period, The number JI is called the order of the Bessel equation;
and certain additional ideas must be introduced in it. in the general case x, y and JI assume complex values.
A set E of real numbers is called sufficiently homo- The substitution y = ux -I /2 yields the reduced form of
geneous if there exists an L >0 such that the ratio equation (1):
between the largest number of members of E in an
interval of length L and the smallest number of u"+ [ I+~
1-4'? ] u = O. (2)
members in an interval of the same length L is less
than 2. A sufficiently homogeneous set is also relatively A Bessel equation is a special case of a confluent
dense. A complex-valued function f (x), - 00 <x < 00, hypergeometric equation; if x = z /2i is substituted into

371
BESSEL EQUATION

(2), equation (2) becomes a Whittaker equation. In The substitution y =x'w transforms (1) into the
equation (1) the point x =0 is weakly singular, while Laplace equation:
the point x = 00 is strongly singular. For this reason a xw" +(21'+ l)w' +xw = 0;
Bessel equation does not belong to the class of Fuchs-
ian equations (cf. Fuchsian equation). F. Bessel [1] was which permits one to represent the solutions of (1) by
the first to study equation (1) systematically, but such contour integrals in the complex plane.
equations are encountered even earlier in the works of In applications it is often required to find the eigen
D. Bernoulli, L. Euler and J.L. Lagrange. values of the equation
A Bessel equation results from separation of vari- x 2y" +xy' +(Ax2_~)y = 0, (3)
ables in many problems of mathematical physics [2], where p is fixed while A is a parameter. Equation (3) on
particularly in the case of boundary value problems of the segment O:S;;;;x';;;;;a with the boundary conditions:
potential theory for a cylindrical domain.
The solutions of Bessel equations are called cylinder
y(x) is bounded as x~o, yea) = 0,
functions (or Besselfunctions). These may be subdivided is an example of a problem with a discrete spectrum
into the cylinder functions of the first kind (Bessel (the e1gen values are determined by the condition
functions) J.(x), the cylinder functions of the second J .(a VA )=0 in terms of the zeros of a Bessel function).
kind (Weber functions or Neumann functions, (cf. Equation (3) with the boundary condition:
Weber function; Neumann function) Y.(x) and the y(x) is bounded on the semi-axis O.;;;x<oo,
cylinder functions of the third kind (Hankel functions)
m
H\}l (x), 2) (x). If the order p is fixed, all these func- represents a problem with a continuous spectrum (eigen
values A~O).
tions are analytic functions of the complex argument x;
The inhomogeneous Bessel equation
for all these functions, except for the functions In(x) of
integer order, the point x = 0 is a branch point. If the x 2y" +xy' +(X2_~)y = f(x) (4)
argument x is fixed, all these functions are single- has the particular solution
valued entire functions of the complex order P [3].
If the order p is not an integer, then the general solu- y =; Yv(x)jxJv(x)f(x)dx- ; Jv(x)jxYv(x)f(x)dx.
tion of equation (1) may be written as Solutions of equation (4) have been studied in more
y = C 1J v(X)+C 2J -vex), detail for a right-hand side of special form. Thus, if
where C 1, C 2 are arbitrary constants. For a given f(x)=x P, equation (4) is satisfied by a Lommel func-
order, any two of the functions J .(x), Y .(x), HVl (x), tion; if 4( v+ I

m 2 ) (x) are linearly independent and may serve as a


f(x) = _x /2)
y;f(p+ 1/2)
fundamental system of solutions of (1). For this reason, it is satisfied by a Struve function; if
the general solution of equation (1) can be represented,
in particular, in the following forms: f(x) = -.L(x-p)sinJI'IT,
'IT
y = C 1J v(X)+C 2 Yv(x), y = C 1HV)(x)+C 2 He)(x). it is satisfied by an Anger function; and if
The following equations are closely connected with
1
equation (l): the equation f(x) = --[(x+p)+(X-V)COSJI'IT],
'IT

z2y " +zy' _(Z2 +~)y = 0, it is satisfied by a Weber function.


which becomes (1) as a result of the substitution z = ix, There are linear equations of higher orders with solu-
and with as a fundamental system of solutions the tions whose properties are analogous to those of Bessel
modified cylinder functions (Bessel functions of imaginary functions. The general n-th order equation of Bessel type
argument), and the equation has the form
Z2 y " +zy' _(iz2+~)y = 0,
which becomes equation (1) as a result of the substitu-
IT
,=I
[x~
dx
+c,lJy+xny = 0,
tion z = Vix and which has the Kelvin functions as its
fundamental system of solutions. Many other second- C, = const, ~CI = 0,
order linear ordinary differential equations (e.g. the
Airy equation) can also be transformed into equation and its solution depends on n - 1 parameters. In partic-
(1) by a transformation of the unknown function and ular, a third-order equation of Bessel type (which has a
the independent variable. The solution of a series of solution with two parameters lX, {3) may be represented
linear equations of higher orders may be written in the in the form:
form of Bessel functions (4). xY" +3x 1y" +[1 +9a,B-3(a+f3)lJxy' +

372
BESSEL INEQUALITY

+[x 3-9ap(a+p)+2(a+p)3]y = 0, conditions the following expansion is valid:


a, P = const _ 00
f(x) - ~cnJp - , xJ'
[JL~ 1
.-1
References
[ JL~
[1) BESSEL, F.: Abh. d. K. Akad. Wiss. Berlin (1824), 1-52.
(2) GRAY, A. and MATHEws, G.B.: A treatise on Besselfunctions c. = ,2J~+12 (JL~) [f(x)Jp
I ]
-,-x x dx, o<x<,.
and their application to physics, Macmillan, 1931.
(3) WATSON, G.N.: A treatise on the theory of Bessel functions, 1-2, In an infinite interval this expansion is replaced by the
Cambridge Univ. Press, 1952.
Fourier- Bessel integral
(4) KAMKE, E.: Differentialgleichungen: Losungsmethoden und
Losungen, 1, Chelsea, reprint, 1971. 00 00
N.Kh. Rozov f(x) = [c~..Jp(Ax)dA, CA = [f(X)Jp(Ax)xdx,
Editorial comments.
o<x<oo.
References
[A1) LEBEDEV, N.N.: Special functions and their applications, The following formulas play an important role in the
Dover, reprint, 1972 (translated from the Russian).
theory of Bessel functions and their applications:
AMS 1980 Subject Classification: 34830, 33A40 1) the integral representation

BESSEL FUNCflONS - Cylinder functions of the In(z) = -I "


fcos(z sin<p-n<p)d<p,
'1T 0
first kind. A Bessel function of order p can be defined
as the series 2) the generating function
_ 00 (-It [Z]P+2k (*) ez(~-Cl)/2 = ~ In(z)/!,,
Jp(z) - k~or(k+l)r(k+p+l) "2 n= -00

3) the addition theorem for Bessel functions of order


_ [z]P (-It [z]2k
- "2 k~or(k+l)r(k+p+l) "2 '
00
zero
Jo(Va2+b2-2abcos<p) =
which converges throughout the plane. A Bessel func- 00

tion of order p is the solution of the corresponding


= J o(a)Jo(b)+2 ~Jk(a)Jk(b)cosk<P,
k=1
Bessel equation. If the argument and the order v are
4) the recurrence formulas
real numbers, the Bessel function is real, and its graph
has the form of a damped vibration (Fig.); if the order Jp_I(Z)+Jp+l(z) -2:P..
- Jp(z),
z
is even, the Bessel function is even, if odd, it is odd.
[JpCz)r = t[Jp-I(Z)-Jp+l(z).
For references, see Cylinder functions.
P.I. Lizorkin
I
AMS 1980 Subject Classification: 33A40

Graphs of the functions y =J 0 (x) and BESSEL INEQUALITY - The inequality


y=J 1(x). IIfl12 = (j,j);;;;' ~ I (j, <Pa) 12
The behaviour of a Bessel function in a neighbourhood aEA (<Pa, <Pa)
of zero is given by the first term of the series (*); for
large x, the asymptotic representation a~ IVII:: II ] 12,
where f is an element of a (pre-) Hilbert space H with
Jp(x) ~ .,.V12cos
-:;;; [x-:!!...v-:!!...]
2 4 scalar product (j, </ and {</>a: aEA} is an orthogonal
system of non-zero elements of H. The right-hand side
holds. The zeros of a Bessel function (i.e. the roots of
of Bessel's inequality never contains more than a count-
the equation J .(x) =0) are simple, and the zeros of
able set of non-zero components, whatever the cardinal-
J .(x) are situated between the zeros of J.+ 1(x). Bessel
functions of 'half-integral' order v=n + 1/2 are expres- ity of the index set A. Bessel's inequality follows from
sible by trigonometric functions; in particular the Bessel identity

J I/ 2(X) = .,.
V12
-:;;; sin x, J -1/2(X) = .,.
Vf2
-:;;; cosx. Ik-i~xa;<Pa, Ir IfI2-i~Aa; I x a, 12,
The Bessel functions J .(p.~/-1 x) (where p.~ are the posi- which is valid for any finite system of elements {</>a,:
tive zeros of J .(x), v> -1 /2) form an orthogonal sys- i = 1, ... ,n}. In this formula the x a; are the Fourier
tem with weight x in the interval (0, I). Under certain coefficients of the vector f with respect to the orthogo-

373
BESSEL INEQUALITY

nal system {cf>a l , ,cf>a.}, i.e. xo, Xo +h, Xo -h, ... ,Xo +nh, Xo -nh, Xo +(n + I)h,
at the point x=xo+th:
xa, = fif' <Pa), Aa; = (<Pa;, <PaJ
G 2n +2(XO+th) =/0+1112t+1o ~
a;
2! + ... + (1)
The geometric meaning of Bessel's inequality is that
the orthogonal projection of an element f on the linear + n+lt(t2-I),(t 2 -n 2)
fil2 (2n+I)! '
span of the elements cf>a, aEA, has a norm which does
not exceed the norm of f (i.e. the hypothenuse in a and the Gauss formula of the same order for backward
right-angled triangle is not shorter than one of the interpolation with respect to the node Xl =xo+h, i.e.
other sides). For a vector f to belong to the closed with respect to the population of nodes
linear span of the vectors cf>a, aEA, it is necessary and Xo +h, Xo, Xu +2h, Xu -h, ... ,Xo +(n + I)h, Xo -nh:
sufficient that Bessel's inequality becomes an equality. t{t - }\
If this occurs for any fEH, one says that the Parseval G 2n +2(XO+th) = /1 +1112(t-I)+fi~+ ... + (2)
2!
equality holds for the system {cf>a: aEA} in H. + n+1 t(t 2 -I)'" [t 2 -(n-1)2](t-n)(t-n-I)
Fora system {cf>a: a = 1, 2, ... } of linearly indepen- fij2 (2n+I)! .
dent (not necessarily orthogonal) elements of H Bessel's Putting
identity and Bessel's inequality assume the form

Ik-a,~ b~Pif, I <Pp)<Pa /1


2
Bessel's interpolation formula assumes the form ([ 1],
[2]):
B2n+2(XO+th) = (3)

= /1/2 +11/2 [t - tJ+ fi I 2 t(t; I) + ... +


II! 1/ 2 ;;;. ~ b~Pif, <Pa)(/, <Pp),
a,p=1
+ n t(t 2 -1)'" [t 2 -(n-I?](t-n) +
where b~{J are the elements of the matrix inverse to the fil2 (2n)!
Gram matrix (d. Gram determinant) of the first n vec- + n + I t(t 2 -1) ... [t 2 -en -1)2](t -n)(t - I ! 2)
tors of the initial system. fi / 2 (2n+I)! .
The inequality was derived by F.W. Bessel in 1828 Bessel's interpolation formula has certain advantages
for the trigonometric system. over Gauss' formulas (1), (2); in particular, if the inter-
References polation is at the middle of the segment, i.e. at t = 1 /2,
[I] KUDRYAVTSEV, L.D.: Mathematical analysis, 2, Moscow, 1973 all coefficients at the differences of odd orders vanish.
(in Russian). If the last term on the right-hand side of (3) is omitted,
L.P. Kuptsov
the polynomial B 2n + I (x 0 + th), which is not a proper
Editorial comments. Usually, the orthogonal system of
interpolation polynomial (it coincides with f (x) only in
elements {<Pa} is orthonormalized, i.e. one sets
the 2n nodes Xo -en -1)h, ... ,Xo +nh), represents a
!f;a =<Pa / II <Pa II Bessel's inequality then takes the form
better estimate of the residual term (d. Interpolation
~ I (t, !f;a) I .;;;; II f 11 2 , formula) than the interpolation polynomial of the same
aEA
degree. Thus, for instance, if x = X 0 + th EO (x 0, XI)' the
which is easier to remember. In this form i is used in
approximation theory, Fourier analysis, the theory of orthog-
estimate of the last term using the polynomial which is
onal polynomials, etc. most frequently employed
References
[A1) YOSIDA, K.: Functional analysis, Springer, 1980.
[A2) CHENEY, E.W.: Introduction to approximation theory, Chel-
sea, reprint, 1982. wri tten wi th respect to the nodes
[A3) DAVIS, P.l.: Interpolation and approximation, Dover, reprint, x0 - h, x 0, x 0 + h, x 0 + 2h, is almost 8 times better than
1975.
that of the interpolation polynomial written with
[A4) HEWITT, E. and STOMBERG, K.: Real and abstract analysis,
Springer, 1965. respect to the nodes Xo -h, xo, Xo +h or
xo,xo+h, xo+2h ([2]).
AMS 1980 Subject Classification: 42CXX, 47 A30
References
[IJ BEREZIN. IS and ZHIDKOY. N.P.: Computing methodl" L Per-
BESSEL INTERPOLATION FORMULA - A formula gamon Press. 1973 (translated from the Russian).
which is defined as half the sum of the Gauss formula [2J BAKHYALOY, N.S.: Numerical methodl': analvsis. algebra, ord,
(d. Gauss interpolation formula) for forward interpola- narr differential equations. Mir. 1977 (translated from the Rus-
sian).
tion on the nodes M.K. Samarin

374
BEST APPROXIMATION

Editorial comments. Editorial comments. The function Kp(z) is usually called


References the modified Bessel function of the third kind.
[A 1] ABRAMOWITZ, M. and STEGUN, I.A.: Handbook of mathemati- AMS 1980 Subject Classification: 31815
cal functions, Nat. Bur. Stand., Appl. Math. Ser., 55, Dover,
1970.
[A2] HILDEBRAND, F.: Introduction to numerical analysis, BESSEL SYSTEM - A concept in the theory of
Addison-Wesley, 1956. orthogonal systems. Let {!{In} and {gn} be two complete
AMS 1980 Subject Classification: 65D05 systems of functions in L 2(a, b)=L2 (i.e. measurable
functions that are square-integrable on the segment
BESSEL POTENTIAL - A potential of the form [a, b D, forming a biorthogonal system of functions. The
system {!{In} is said to be a Bessel system if, for any
Pa(x) = jGa(x-y)dp.(y), a>O,
function fEL2' the series
R'
00
where X=(Xh'" ,xn), y=(Yh'" ,Yn) are points in ~c;
the Euclidean space Rn, n ;;'2; dp. is a Borel measure on n=1

Rn; is convergent; here, Cn=(j, gn) are the coefficients of


G a(x)=2(2-n-a)/2'/T-n/2 [r [; ]r~(n-a)/2(' x I) 1x l(a-n)/2,
the expansion 00

f "" ~cno/n

r
n=1

1 x1 = [i~ 1 xT 1 /2,
of the function fwith respect to the system {o/n}. For a
system {!{In} to be a Bessel system it is necessary and
and K p(z) is the modified cylinder function (or Bessel sufficient that it be possible to define a bounded linear
function, cf. Cylinder functions) of the second kind of operator A on the space L2 such that the system {cf>n}
order P or the Macdonald function of order P; Ga(x) is defined by the equation A !{In =cf>n (n = I, 2, ... ) is a
called a Bessel kernel. complete orthonormal system. If the system {!{In} is a
The principal properties of the Bessel kernels Ga(x) Bessel system, there exists a constant M such that for
are the same as those of the Riesz kernels (d. Riesz any fEL2 ao

potential), viz., they are positive, continuous for X=F0, ~(f, gnf ~ MIl/ilL
n=1
can be composed
References
jGa(x-y)Gp(y)dy = Ga+p(x), [1) KACZMARZ, S. and STEINHAUS, H.: Theorie der Orthogonal
R' reihen, Chelsea, reprint, 1951.
P.I. Lizorkin
but, unlike the Riesz potentials, Bessel potentials are
applicable for all 0:>0, since AMS 1980 Subject Classification: 42CXX

Ga(x) "" 2(I-n-a)/271 I- n )/2 [r (;]] -II X l(a-n-I)/2e-lxl, BEST APPROXIMATION of a function x(t) by func-
tions u(t) from a fixed set F - The quantity
as I x I~oo. E(x, F) = inf p.(x, u),
If 0:>2m, where m is a natural number, and the ueF
measure dp. is absolutely continuous with square- where p.(x, u) is the error of approximation (see
integrable density f(y)EL 2 (R2m ), the Bessel potentials Approximation of functions, measure of). The concept of
satisfy the identities: a best approximation is meaningful in an arbitrary
(l-~rPa(x) = P a - 2m (x), metric space X when p.(x, u) is defined by the distance
and between x and u; in this case E(x, F) is the distance
(l-~rp2m(X) = I(x), from x to the set F. If X is a normed linear space, then
where Ll is the Laplace operator on R2m. In other for a fixed Fe X the best approximation
words, the function G2m (x) is a fundamental solution E(x, F) = ueF
inf II x-u II (1)
of the operator (1- Ll)m .
may be regarded as a functional defined on X (the
References
[I) NIKOL'SKIi, S.M.: Approximation offunctions of several variables
functional of best approximation).
and imbedding theorems, Springer, 1975 (translated from the The functional of best approximation is continuous,
Russian). whatever the set F. If F is a subspace, the functional of
(2) ARONSZAJN, M. and SMITH, K.T.: 'Theory of Bessel potentials
1', Ann. Inst. Fourier (Grenoble) 11 (1961), 385-475.
best approximation is a semi-norm, i.e.
E.D. Solomentsev E(xI + X2, F) ~ E(Xlo F)+ E(X2, F)

375
BEST APPROXIMATION

and The concept of a best uniform approximation of con-


E(Xx, F) = I A IE(x, F) tinuous functions by polynomials is due to P.L. Che-
for any AER. If F is a finite-dimensional subspace, then byshev (1854), who developed the theoretical founrla-
for any x EX there exists an element Uo EF (an element tions of the concept and established a criterion for
of best approximation) at which the infimum in (1) is polynomials of best approximation in the metric space
attained: C (see Polynomial of best approximation).
E(x, F) = II x -Uo II
The best approximation of a class of functions is the
In a space X with a strictly convex norm, the element
supremum of the best approximations of the functions
of best approximation is unique.
f in the given class WC by a fixed set of functions F, i.e.
Through the use of duality theorems, the best the quantity
approximation in a normed linear space X can be
expressed in terms of the supremum of the values of E(IJH, F) = suoE(j, F) = suoinfJ.t(f, cp).
!E~ !E~4>EF
certain functionals from the adjoint space X (see, e.g. The number E(WC, F) characterizes the maximum devi-
[5], l8]). If F is a closed convex subset of X, then for ation (in the specific metric chosen) of the class WC
any XEX from the approximating set F and indicates the
minimal possible error to be expected when approxi-
E(x,F) = J,!1f [r(x)-~~y/(u)]; (2)
mating an arbitrary function f E WC by functions of F.
II!II<;;]
Let WC be a subset of a normed linear function space
in particular, if F is a subspace, then
={
X, let U U 1(t), U 2 (t), . . .} be a linearly independent
E(x, F) = s~ I(x), (3) system of functions in X and let Fn, n = 1, 2, ... , be
{!II<;;] the subspaces generated by the first n elements of this
where Fl. is the set of functionals fin x' such that system. By investigating the sequence E(WC, F n ),
f(u)=O for any UEF. In the function spaces Cor L p ' n = 1, 2, ... , one can draw conclusions regarding both
the right-hand sides of (2) and (3) take explicit forms the structural and smoothness properties of the func-
depending on the form of the linear functional. In a tions in WC and the approximation properties of the sys-
Hilbert space H, the best approximation of an element tem U relative to WC. If X is a Banach function space
x EH by an n-dimensional subspace Fn is obtained by and U is closed in X, i.e. U Fn =X, then E(WC, Fn)~O
orthogonal projection on Fn and can be calculated; one as n ~ 00 if and only if WC is a compact subset of X.
has: In various important cases, e.g. when the Fn are sub-
G(x, u], ... ,un)
E(x, Fn) = spaces of trigonometric polynomials or periodic splines,
G(uJ, ... ,un) ,
and the class WC is defined by conditions imposed on
where U1, . . . ,Un form a basis of Fn and the norm or on the modulus of continuity of some
G(u), ... ,un) is the Gram determinant, the elements derivative t'l, the numbers E(WC, Fn) can be calculated
of which are the scalar products (Uj, Uj), i, j = 1, ... ,n. explicitly [5]. In the non-periodic case, results are avail-
If {Uk} is an orthonormal basis, then able concerning the asymptotic behaviour of E(WC, Fn)
as n~oo.
E2(X, Fn) = I X 112_ ~(x, ud
k=]
References
In the space C = qa, b] one has the foU) wing esti- [1] CHEBYSHEV, P.L.: Complete collected works, 2, Moscow, 1947
mate for the best uniform approximation of a function (in Russian).
[2] ACHIEZER, N.!. [N.I. AKHIEZER]: Theory of approximation, F.
X(t)EC by an n-dimensional Chebyshev subspace Ungar, 1956 (translated from the Russian).
Fn cC (the de la Vallee-Poussin theorem): If for some [3] DZYADYK, V.K.: Introduction to the theory of uniform approxi-
function U(t)EFn there exist n + 1 points tk, mation offunctions by polynomials, Moscow, 1977 (in Russian).
[4] GONCHAROV, V.L.: The theory of interpolation and approxima-
a~tl < ... <tn+1 ~b, for which the difference
tion of functions, Moscow, 1954 (in Russian).
~(t) = x(t) - u(t) [5] KORNEicHUK, N.P.: Extremal problems in approximation theory,
Moscow, 1976 (in Russian).
takes values with alternating signs, then [6] NIKOL'SKIi, S.M.: Approximation offunctions of several variables
and imbedding theorems, Springer, 1975 (translated from the
E(x, Fn);;;' min I ~(tk) I Russian).
]<;;k<;;n+]
[7] TIMAN, A.F.: Theory of approximation of functions of a real
For best approximations in L 1(a, b) see Markov cri- variable, Pergamon, 1963 (translated from the Russian).
terion. In several important cases, the best approxima- [8] TIKHOMIROV, V.M.: Some questions in approximation theory,
tions of functions by finite-dimensional subspaces can Moscow, 1976 (in Russian).
[9] LAURENT, PJ.: Approximation et optimisation, Hermann, 1972.
be bounded from above in terms of differential-
difference characteristics (e.g. the modulus of con- N.P. Kornelchuk
tinuity) of the approximated function or its derivatives. VP. Motorny!

376
BEST COMPLETE APPROXIMA nON

Editorial comments. In Western literature an element, a zero depends both on the system of subspaces Fn and
functional or a polynomial of best approximation is also on the smoothness characteristics of the approximated
called a best approximation. function x (the modulus of continuity, the existence of
References derivatives up to a specific order, etc.), and it can be
[A1] LORENTZ, G.G.: Approximation offunctions, Holt, Rinehart estimated in terms of these characteristics. Conversely,
and Winston, 1966.
[A2] CHENEY, E.W.: Introduction to approximation theory, Chel-
knowing the rate of convergence to zero of the
sea, reprint, 1982. sequence {E(x, Fn )}, one can draw conclusions with
[A3] RICE, J.R.: The approximation of functions, 1. Linear theory, respect to the smoothness of ,x (t) (see Approximation of
Addison-Wesley, 1964.
functions, direct and inverse theorems).
[A4] PINKUS, A: n-widths in approximation theory, Springer,
1985. References
AMS 1980 Subject Classification: 41 A50, 41 A65, [1] BERNSTEIN, S.N. [S.N. BERNSHTEiN]: Collected works, 2, Mos-
cow, 1954 (in Russian).
41A46 [2] GoNCHAROV, V.L.: The theory of interpolation and approxima
tion offunctions, Moscow, 1954 (in Russian).
BEST APPROXIMATION IN THE MEAN - The best [3] TIMAN, AF.: Theory of approximation offunctions of a real
variable, Pergamon, 1963 (translated from the Russian).
approximation of a function x by functions u from a
fixed set F when the measure (error) of approximation N.P. Kornerchuk
is expressed in terms of an integral metric (see Best VP. Motornyr
approximation; Approximation in the mean). Editorial comments. Theorems inferring smoothness
N.P. Kornelchuk characteristics of a function x E C or Lp from properties of
VP. Motornyr E(x, Fn) were first given by D. Jackson in 1911 for algebraic
or trigonometric approximation, see Jackson theorem.
AMS 1980 Subject Classification: 41 A50
Theorems converse to these, i.e. inferring properties of
E(x, Fn} from smoothness characteristics of the function x,
BEST APPROXIMATIONS, SEQUENCE OF - A have been proved by S.N. Bernstein [S.N. BernshteTn] and
sequence {E(x, F n )}, n = 1, 2, ... , of numbers, where A. Zygmund, cf. Bernstein theorem. See also [A2], Chapt.
E(x, Fn) is the best approximation of an element x of a 4, Sect. 6 and Chapt. 6, Sect. 3.
normed linear space X by elements of an n-dimensional
References
subspace Fn ex, with Fl eF2 e .. " so that [A1] NATANsoN, J.P.: Constructive function theory, 1-3, F. Ungar,
E(x, Fd~E(x, F2)~ .. '. Usually, Fn is the linear 1964-1965 (translated from the Russian).
span of the first n elements of some fixed system [A2] CHENEY, E.W.: Introduction to approximation theory, Chel-
sea, reprint, 1982.
{u 1, u2, ... } of linearly independent elements of X.
In the case X = q a, b] and Fn = F~ is the subspace AMS 1980 Subject Classification: 41 A50
of algebraic polynomials of degree n - 1, sequences of
best approximations were first considered in the BEST COMPLETE APPROXIMATION - A best
eighteen-fifties by P.L. Chebyshev; the fact that approximation of a function f(Xl, ... ,Xk) in k vari-
E(x, F~} ~ 0 as n~oo ables (k~2) by algebraic or trigonometric polynomials.
Let X be the space C or ~ of functions f(XIo ... ,Xk),
for any function X(t)E qa, b] was established in 1885 217-periodic in each variable, that are either continuous
by K. Weierstrass. In the general case, the relation or p-summable (p ~ 1) on the k-dimensional period
lim E(x, Fn}
n .... oo
=0 for all x EX cube with edges of length 217.
The best complete approximation of a function
is always satisfied when the union of the subspaces F n , f(Xh ... ,Xk)EX by trigonometric polynomials is the
n = I, 2, ... , is everywhere dense in X, quantity
UFn = X
En, . ... .n, (f)x = T,,~f,. II f - Tn, .. .. .n, II x,
(essentially, this is an equivalent statement). However,
the sequence {E(x, Fn)} may converge to zero arbi- where the infimum is taken over all trigonometric poly-
trarily slowly. This follows from a theorem of Bern- nomials of degree ni in the variable Xi (1 ~ i ~k).
stein: If {Fn} is a sequence of subspaces of dimension Together with the best complete approximation, one
n = 1, 2, ... , of a normed linear space X, such that also considers best partial approximations.
FIe F 2 e . .. and U Fn = X, then, for any monotone A best partial approximation of a function
decreasing null sequence {JLn} of non-negative real f(XIo . .. ,XdEX is a best approximation by functions
numbers, there exists an XEX such that E(x, Fn)=/Ln, Tn." ... ,n., (x 1, . . . ,Xk) E K that are trigonometric poly-
n = 1, 2, .... In the function spaces C and Lp ' the rate nomials of degree n v" .. ,n v, (l~r<k), respectively,
at which a sequence of best approximations tends to in the fixed variables x v,, ... ,x v, with coefficients

377
BEST COMPLETE APPROXIMATION

depending on the remaining k - r variables, i.e. approximating set F if, for all x Em,
En,!' ... ,n",oo(f)x =~ inf IIf-Tn,!, ... ,n"llx. II x-Ax I .;;; ~~gE(x, F)
11'1 .. ..,.

It is obvious that (E(x, F) is the best approximation of x by F) and if,


moreover, for all x EX,
I x-Ax II = E(x, F).
S.N. Bernstein [S.N. Bernshtein] [1] proved the fol-
lowing inequality for continuous functions in two vari- The latter is certainly true if X is a Hilbert space,
ables: F = Fn IS an n-dimensional subspace of X,
(1) n = 1, 2, ... , and A is the orthogonal projection onto
.;;; A In(2 + min { n J, n2 })(En"oo(f)c + En"oo(f)c),
Ax = ~ (x, ek)eb
where A is an absolute constant. It has been shown [3] k=l

that the term In(2+min{nbn2}) in inequality (1) (and where {e I, . . . ,en} is an orthonormal basis in Fn.
in the analogous relation for the space L I) cannot be Let X be a Banach space of functions defined on the
replaced by a factor with a slower rate of increase as entire real line, with a translation-invariant norm:
min{nb n2}-HX). II X('+T) II = II xO II (this condition holds, e.g. for the
In the space Lp (p> 1) one has the inequality norms of the spaces C=C[O,2'7T] and Lp =Lp(0,2'7T),
k l~';;;;oo, of 2'7T-periodic functions); let F=Tn be the
En" ... ,n,(f)Lp .;;; Ap,k ~Eni,oo(f)Lp' (2)
i=l subspace of trigonometric polynomials of order n.
There exist best linear methods (relative to Tn) for a
where the constant Ap,k depends only on p and k.
class m of functions X(t)EX that contains x(t+a) for
Similar definitions yield best complete approxima-
any aER whenever it contains x(t). An example is the
tions and best partial approximations of functions
linear method
defined on a closed bounded domain n C Rk by alge-
braic polynomials, and in this case inequalities similar /loao
to (1) and (2) have been established.
A(x;t;/L,p) = -2-+ (*)
n
References + ~ (/Lk(ak cos kt + bk sinkt)+ Pk(ak sin kt - bk coskt)},
[1) BERNSHTEiN, S.N.: Collected works, 2, Moscow, 1954 (in Rus- k=l
sian).
(2) TIMAN, A.F.: Theory of approximation offunctions of a real where ak and bk are the Fourier coefficients of x(t)
variable, Pergamon, 1963 (translated from the Russian). relative to the trigonometric system, and ILk and Pk are
(3) TEMLYAKov, V.N.: 'On best approximations of functions in
numbers.
two variables', Dokl. Akad. Nauk SSSR 223, no. 5 (1975),
1079-1082 (in Russian). NP v ~ h k Now consider the classes W'aoM (and WiM),
. . .n.ornelC U r = 1, 2, ... , of 2'7T-periodic functions x(t) whose
v.P. Motornyr derivatives x(r-l)(t) are locally absolutely continuous
Editorial comments. and whose derivatives x(r)(t) are bounded in norm in
References Lao (respectively, in L I) by a number M. For these
[A 1) LORENTZ, G.G.: Approximation of functions, Holt, Rinehart classes, best linear methods of the type (*) yield the
and Winston, 1966. ) same error (over the entire class) in the metric of C
AMS 1980 Subject Classification: 41 A50, 41 A63 (respectively, Ld as the best approximation by a
subspace Tn; the analogous assertion is true for these
BEST LINEAR METHOD - With respect to the classes with any rational number r>O (interpreting the
approximation of elements in a given set m, the linear derivatives x(r)(t) in the sense of Weyl). For integers
method that yields the smallest error among all linear r = 1, 2, ... , best linear methods of type (*) have been
methods. In a normed linear space X, a linear method constructed using only the coeff~cients ILk (all Pk = 0).
for the approximation of elements x Em C X by ele- If F = S~ is the subspace of 2'7T-periodic polynomial
ments of a fixed subspace Fe X is represented by a splines of order r and defect 1 with respect to the parti-
linear operator that maps the entire space X, or some tion kw / n, k = 0, +- 1, ... , then a best linear method
linear manifold containing m, into F. If !> is the set of for the classes W'ao+ I M (and Wi + I), r = 1, 2, ... , is
all such operators, a best linear_ method for m (if it achieved in Lp ' l:!(p';;; 00 (resp. in L I) by splines in S~
exists) is defined by an operator A E!> for which interpolating the function x(t) at the points
.~~~ I x - Ax II = }~t .~~~ I x - Ax II
kw /n+[l+(~IY1'7T /4n .
References
The method defined by an operator A in !> will cer- [1) ACHIEZER, N.I. [N.I. AKHIEZER): Theory of approximation, F.
tainly be a best linear method for m relative to the Ungar, 1956 (translated from the Russian).

378
BETA-DISTRIBUTION

[2] KORNEicHUK, N.P.: Extremal problems in approximation theory, drature formula generalizes in a natural way to func-
Moscow, 1976 (in Russian).
tions of several variables (cubature formulas).
[3] TIKHOMIROV, Y.M.: Some questions in approximation theory,
Moscow, 1976 (in Russian). References
N.P. Korneichuk [I] NIKOL'SKli,S.M.: Quadratureformulae, H.M. Stationery Office,
v.P. Motornyr London, 1966 (translated from the Russian).
[2] KRYLOV, N.M.: Approximate calculation of integrals, Macmil-
Editorial comments.
lan, 1962 (translated from the Russian).
References [3] LAURENT, PJ.: Approximation et optimisation, Hermann, 1972.
[A 1] KrnSEWETIER, H.: Vorlesungen Dber lineare Approximation, [4] ZHENSYKBAEV, A.A.: 'Monosplines of minimal norm and qua-
Deutsch. Verlag Wissenschaft., 1973. drature formulas', Uspekhi Mat. Nauk 36, no. 4 (1981), 107-159
[A2] RICE, J.R.: The approximation of functions, 1. Linear theory, (in Russian). N P v "h k
. . n.ornelC U
Addison-Wesley, 1964.
v.P. Motornyr
AMS 1980 Subject Classification: 41 A65, 41 A50
Editorial comments. The terminology 'best formula' is
often encountered in the literature on numerical analysis,
BEST QUADRATURE FORMULA, optimal quadra-
but, as was obseNed in [A2J, p. 75, it should be taken with
ture formula - An approximate integration formula that
a large dose of salt, because, after all, any quadrature for-
guarantees the minimum error for a given class of func- mula, no matter how the weights Pkl and the nodes Xk are
tions, relative to all formulas of a specified type. As an chosen, will exactly integrate an infinite-dimensional family
example, consider the quadrature formula of functions.
b n m A few recent textbooks are listed below.
jp(x)f(x)dx = ~ ~Pk,f(I)(Xk)+R(f), (*)
a k=li=O References
[A 1] BRASS, H.: Quadraturverfahren, Vanden hoek & Ruprecht,
where p(x) is a weight function. The remainder (error) 1977.
term R(j)=R(j, Xn, Pnm ) depends both on the func- [A2] DAVIS, P.J. and RABINOWITZ, P.: Methods of numerical
tion f(x), and on the vector (Xn' P nm ) consisting of the integration, Acad. Press, 1984.
[A3] ENGELS, H.: Numerical quadrature and cubature, Acad.
interpolation nodes Xk (it is usually assumed that Press, 1980.
Xk E[a, b D and the coefficients hi, k = 1, ... ,n;
i = 0, ... , m. Fixing n ~ I and m ~O, let A denote AMS 1980 Subject Classification: 65030, 65032
some set of vectors (Xn, P nm ) (and hence also some set
of quadrature formulas), defined by some restrictions BETA-DISTRIBUTION - A continuous probability
distribution concentrated on (0, 1) with density
on the interpolation nodes and coeffi~ents (in particu-
lar, one might consider the ~et A =A(Xn) of coefficients f3 m,n (x) = 1
B(rn, n)
xm-I(I- xt- I
,
(I)
hi for a fixed node vector Xn). Let we be some class of
functions f(x), it being assumed that the integral and where the parameters m, n are non-negative and the
the sum in (*) exist. The best quadrature formula of normalizing factor B (m, n) is Euler's beta-function
type (*) for the class WC relative to the set A is defined 1

by a vector (x:, P~m) for which


B(
rn, n
) = jx m-I(I_ x )n-I dx = f(rn)f(n)
0 rem + n) ,
sup I R(f, X~, P~m)
f c '1fl
I= where r(n) is the gamma-function. The distribution
function is expressed as the incomplete beta-function

The constmction of best quadrature formulas is mtI- Bm.n(x) = 1 j'ym-I(I_y)"-1 dy, O<x<1
B(rn, n) 0
mately connected with certain problems in spline
approximation; in many cases it reduces to minimizing (this function has been tabulated, see [1], [2]). The
the norm of a monospline (see [ID, Best quadrature for- moments of the beta-distribution are given by the for-
mulas
mulas, together with sharp estimates for the remainder _ B(rn+k, n) _
rnk - B(rn, n) , k-I,2, ....
term, are known for many important classes of continu-
ous and differentiable functions. From a more general In particular, the mathematical expectation and the
point of view, the problem of finding best quadrature variance are m / (m + n) and
formulas and the corresponding errors for a class we mn / {(m+n)2(m+n+ I)}, respectively. If m>l and
may be viewed as the problem of optimal recovery of a n > L the density curve f3m.n(x) has a single mode at
functional b the point x =(m -1) / (m +n -2) and vanishes at the
1(f) = jp(x)f(x)dx, ends of the interval. If either m < 1 or n < 1, one ordi-
nate at the end of the graph becomes infinite, and if
where fEWC, on the basis of the information {fU)(xd} , both m < I and n < 1, both ordinates at the ends of the
k = 1, ... ,n; i =0, ... ,m. The concept of a best qua- interval are infinite and the curve is U-shaped. If m = 1

379
BET A-DISTRIBUTION

and n = I the beta-distribution reduces to the uniform abIes p and q which, for. p, q >0, is defined by the
distribution on the interval (0, I). Another special case equation I
of the beta-distribution is the so-called arcsine distribu- B(p, q) = jxP-I(l-X)q-1 dx. (*)
tion: o
I
f31/2,1/2(X) = ?TVx(l-x) The values of the beta-function for various values of
the parameters p and q are connected by the following
If one substitutes x = I / (I + t) in (I), then one obtains
relationships:
a distribution having the density B(P, q) = B(q,p),
1 1m - q-I
(J'
fJ (I) -
I
0<1<00, (2) B(P,q) =
p
+ q- \B(p,q-I), q>l.
m,n - B(m, n) (1 +t)m+n-2 '

This distribution is called a beta-distribution of the The following formula is valid:


second kind, as distinct from the beta-distribution (l). ?T
B(p, I-p) = -.-,
smp?T
O<p<l.
The distributions (l) and (2) correspond to 'type I' and
'type VI' distributions in the system of Pearson curves, If P and q are complex, the integral (*) converges if
An important case of generation of a beta-distribution Rep>O and Req>O. The beta-function can be
is the following: If X I and X 2 are independent and expressed by the gamma-function:
have gamma-distributions (cf. Gamma-distribution) with B(P q) = f(P)f(q)
respective parameters m and n, then the random vari- , f(P +q)
able X I / (X I + X 2) will have a beta-distribution with VI. Bityutskov
density f3m,n(x). This fact to a large extent explains the AMS 1980 Subject Classification: 33A 15
role played by beta-distributions in various applica-
tions, in particular in mathematical statistics: The dis- BETTI GROUP - In a broad sense, the same as a
tributions of several important statistics are reducible homology group; in a narrow sense, the Betti group is
to beta-distributions. For instance, the distribution the free part of the homology group with as domain of
function of the F-relationship coefficients the group Z of integers, if this homology
group is finitely generated. Named after E. Betti (1823
- 1892).
References
(the random variable X~ has a i-distribution with k [IJ SEIFERT. H. and THRELFALL. W.: Lehrbuch der Topologie. Chel-
degrees of freedom) is expressed by the formula sea, reprint, 1980.
[21 ALEKSANDROV. P.S.: An introduction to homological dimension
theory and general combinatorial topology. Moscow. 1975 (in
'
P(Fmn<X)=Bm/2n/2r
' l n +mxmx J Russian).
M. 1. VoUsekhovskil
(the values of the F-distribution are usually calculated Editorial comments.
with the aid of tables of beta-functions). The beta- References
distribution function also allows one to compute the [A 11 SPANIER. E.H.: Algebraic topology, McGraw-Hili, 1966.
values of the binomial distribution functions, in view of AMS 1980 Subject Classification: 55NXX
the relationship
BETTI NUMBER, r-dimensional Betti number pr of a
= k~O [z )pk(1_p)"-k
m )
Bn-m,nd l(l-P) complex K - The rank of the r-dimensional Betti group
with integral coefficients. For each r the Betti number
Beta-distributions are used m fields other than pr is a topological invariant of the polyhedron which
mathematical statistics; thus, the density of the beta- realizes the complex K, and it indicates the number of
distribution is the weight function for the system of pairwise non-homological (over the rational numbers)
orthogonal Jacobi polynomials. cycles in it. For instance, for the sphere sn:
References pO = I. pi = ... =p,,-I = O. p" = I;
[lJ BOL'SHEV, L.N. and SMIRNOV. N,V.: Tables of mathematical
statistics, Libr. of mathematical tables, 46. Nauka, Moscow, for the projective plane P2(R):
1968 (in Russian). Processed by L.S. Bark and E.S. Kedova.
[2J PEARSON, K.: Tables of the incomplete beta:function, Cambridge po = I. pi = p2 = 0;
U niv, Press, 1932.
A. V Prokhorol' for the torus T2:
AMS 1980 Subject Classification: 62EXX p" = p2 = I. pi = 2.
For an n-dimensional complex K" the sum
BETA-FUNCTION, Bjunction. Euler Bjunction.
Euler integral of the first kind - A function of two vari- ~(-llP'
, 1I

380
BIANCHI IDENTITY

is equal to its Euler characteristic. Betti numbers were equal to the product of the degrees of the equations.
introduced by E. Betti [1]. The multiplicity of the solutions is, by definition, the
References intersection index (in algebraic geometry) of the hyper-
(1) BETII, E.: Ann. Mat. Pura Appl. 4 (1871), 140-158. surfaces (*) at the respective point. The theorem is
M.1. Voitsekhovskii called after E. Bezout [1], who studied systems of alge-
Editorial comments. braic equations of higher degrees.
References References
[A1] SPANIER, E.H.: Algebraic topology, McGraw-Hili, 1966. [1) BEZOUT, E.: Theorie generale lies equations algebriques, Paris,
1779.
AMS 1980 Subject Classification: 55NXX VN. Remeslennikov
V.E. Voskresenskii
BEZOUT RING - An integral domain with a unit ele- Editorial comments.
ment in which any ideal of finite type is principal. Any References
principal ideal ring and also any valuation ring is [A1] SHAFAREVICH, I.R.: Basic algebraic geometry, Springer,
Bezout. A Bezout ring is integrally closed, and its local- 1977, Chapt. 4, Sect. 2 (translated from the RUSSian).
izations (i.e. its rings of fractions with respect to multi- AMS 1980 Subject Classification: 14C17, 12E05
plicative systems S, cf. Localization in commutative
algebra) are again Bezout rings. For a finite set BIANCHI CONGRUENCE, B-congruence A
ai, ... ,an of elements of a Bezout ring A there exist congruence of straight lines in which the curvatures of
the greatest common divisor (the greatest common divi- the focal surfaces at the points situated on the same
sor of (ai, ... ,an) has the form ~bA, b;EA, a so- straight line of the congruence are equal and negative.
called Bezout identity) and the lowest common multiple. The principal surfaces of a B-congruence cut out conju-
A Noetherian ring (and even a ring that satisfies the gate line systems on its focal surfaces. The straight
ascending chain condition only for principal ideals) lines of the congruence map the asymptotic nets of the
which is Bezout is a principal ideal ring. As for princi- focal surfaces onto an orthogonal net on a sphere. The
pal ideal rings, a module of finite type over a Bezout curvature of a focal surface of a Bianchi congruence is
ring is a direct sum of a torsion module and a free expressed in asymptotic parameters u and v by the for-
module. mula:
VI. Danilov K= 1
Editorial comments. (</>(u)+tf;(V2 .

References Surfaces whose curvatures satisfy this condition are


[A1] GILMER, R. and DEKKER, M.: Multiplicative ideal theory, called Bianchi surfaces (B-surfaces, cf. Bianchi surface).
1972. Bianchi congruences were studied by L. Bianchi [1].
AMS 1980 Subject Classification: 13G05, 13F99, References
13F10 [I) BIANCHI, L.: Ann. Mat. Pura Appl. 18 (1890), 301-358.
[2] FINIKOV, S.P.: Theorie der Kongruenzen, Akademie-Verlag,
BEZOUT THEOREM - 1) Bezout's theorem on the 1959 (translated from the Russian).
[3) FINIKOV, S.P.: Bending and related geometrical problems,
division of a polynomial by a linear binomial: The
Moscow-Leningrad, 1937 (in Russian).
remainder of the division of the polynomial [4] ZHULIKOVSKIi, V.I.: Classical differential geometry in a tensor
j(x) = aox'+ ... +a. setting, Moscow, 1963 (in Russian).
VT. Bazylev
by the binomial x - a is f (a). It
is assumed that the AMS 1980 Subject Classification: 53A25, 53A05
coefficients of the polynomials are contained in a cer-
tain commutative ring with a unit element, e.g. in the BIANCHI IDENTITY - A relation connecting the
field of real or complex numbers. A consequence of components of the covariant derivatives of the curva-
Bezout's theorem is the following: A number a is a root ture tensor R;Jk of a Riemannian space:
of the polynomial f (x) if and only if f (x) is divisible Ri'k.l + Ri%,.j + Ri~j.k = 0,
by the binomial x -a without remainder.
where h, i, j, k, 1= 1, ... , n. First established by L.
2) Bezout's theorem on homogeneous equations: If a
Bianchi [1] in 1902.
system of n homogeneous equations in n + 1 unknowns
References
j;(xo, ... ,x.) = 0, ;=1, ... ,n, (*) [I] BIANCHI, L.: Lezioni di geometria differenziale, 1-2, Zanichelli,
Bologna, 1923-1927.
has only a finite number of non-proportional non-zero M.l. Voftsekhovskif
solutions in an algebraically closed field containing the Editorial comments. Here Ri7k,l denotes of course the
coefficients of the system, then the number of these covariant derivative of ROk with respect to the /-th coordi-
solutions counted according to their multiplicity is nate.

381
BIANCHI IDENTITY

The identity described above is often called the second Bianchi congruence). If S is a pseudo-sphere then S'
Bianchi identity. The first Bianchi identity is then given by also is a pseudo-sphere. The pseudo-spheres S' which
Rjk,+Rk'J+R/ik =0 are Bianchi transforms of S are orthogonal trajectories
(see [A1], [A2]). Generalized versions of these identities for of the following congruence of circles. They are
curvature forms and curvature tensors of connections with situated in the tangent plane to S and have the same
torsion are given in [A2]. radius as S.
VT. Bazylev
References Editorial comments. Cf. also [A2], articles 803, 804 in
[A1] HICKS, N.J.: Notes on differential geometry, v. Nostrand,
1965. volume III.
[A2] KOBAYASHI, S. and NOMIZU, K.: Foundations of differential References
geometry, 1, Wiley (Interscience), 1963. [A 1]EISENHART, L.P.: A treatise on the differential geometry of
curves and surfaces, Boston, 1909.
AMS 1980 Subject Classification: 53820
[A2] DARBOUX, G.: Ler;;ons sur la theorie generale des surfaces,
Chelsea, reprint, 1972.
BIANCHI SURFACE - A surface of negative Gauss-
AMS 1980 Subject Classification: 53A05, 53A25
ian curvature K which can be expressed in asymptotic
parameters (u, v) as BIASED ESTIMATOR - A statistical estimator
K= whose expectation does not coincide with the value
[U(u)+ v(v)f ' being estimated.
where U(u) and V(v) are arbitrary functions; thus, Let X be a random variable taking values in a sam-
Bianchi surfaces can be characterized by the fact that pling space (1:, !Jd, PII ), OEe, and let T= T(X) be a sta-
the function (- K) - I /2 is diagonal with respect to its tistical point estimator of a function f(O) defined on the
asymptotic net, i.e. parameter set e. It is assumed that the mathematical
a2(-K)I/2 _ expectation Ell {T} of T exists. If the function
auav - o. b(8) = Ee{T}-f(8) = Ee{T-f(8)}
For instance, a ruled Bianchi surface is a conoid - a is not identically equal to zero, that is, b( 0) = 0, then T
surface attached to the Peterson surface. If a Bianchi is called a biased estimator of f(O) and b(O) is called the
surface is given, it is possible to determine the classes bias or systematic error of T.
of surfaces which can be obtained by a deformation Example. Let XI. ... ,Xn be mutually-independent
over a principal base and to classify them. Thus, if the random variables with the same normal distribution
principal base contains two families of geodesic lines, N I (a, 0 2 ), and let
the functions U and V are constant and the bent sur-
XI + ... +X"
face is a Voss surface (the class B 0)' X= - - - - - -
n
The class B I is characterized by the fact that only
one family of lines of the principal base are geodesics Then the statistic
(one of the functions U, V is constant); conoids may S~ = 1- (X, - X)2
serve as an example. The class B 2 corresponds to func- n,=1
tions U, V which both depend non-trivia )'y on their is a biased estimator of the variance 02 since
arguments. See also Bianchi congruence.
n-i a2
J.Kh. Sabitov E{S~} = - - a2
n
= a"'--,
1

n
Editorial comments. The notion of a 'deformation over a that is, the estimator S~ has bias b( 0 2 ) = - 0 2 / n. The
principal base' is not very common in Western literature, mean-square error of this biased estimator is
and this type of deformation has no standard name even. It
can best be characterized as a deformation preserving a E{(S~-~i} = 2n~i a4 .
n-
conjugate net.
The best unbiased estimator of 02 is the statistic
References
[A1] L Leziom di geometria differenziale. 2. Zanichelli.
BIANCHI.
Bologna, 1927. Chap!. 1.
S2 = _n_S2 = _i_(x,_X)2,
" n-i n n-i,cl
[A2] FINIKOV. S.P.: Theorie der Kongruenzen Akademle-Verlag.
1959 (translated from the Russian). with mean-square error
AMS 1980 Subject Classification. 53A05 D{s~} = E{(s;-~f} = _2_a4
n-i
BIANCHI TRANSFORMATION - The
transition When n > 2, the mean-square error of the biased esti-
from one focal surface S of a Bianchi congruence to the mator S~ is less than that of the best unbiased estima-
other focal surface S' of the same congruence (cL tor s~.

382
BICOMPLEX

There are situations when unbiased estimators do not Here Q(~}, ... '~n' X}, . . ,xn ) is the principal symbol
exist. For example, there is no unbiased estimator for of the linear partial differential operator, the dot indi-
the absolute value 1 a 1 of the mathematical expecta- cates differentiation with respect to the parameter s
tion a of the normal law N I (a, a2), that is, it is only and, if ~j =f/>x;, the equation Q =0 is the characteristic
possible to construct biased estimators for 1 a I. equation of the differential operator. Thus, the solution
References Xj=Xj(s), ~j=~j(s), i=I, ... ,n, of the system (*) for
[1] CRAMER, H.: Mathematical methods of statistics, Princeton Q =0 defines the bicharacteristic strip Q =O. This
Univ. Press, 1946.
M.S. Nikulin bicharacteristic strip belongs to the characteristic
AMS 1980 Subject Classification: 62F11, 62F12, q,(x}, ... ,xn)=O, i.e. q,(XI(S), ... ,xn(s=O, if the
62H12
equations

and
BICATEGORY - A category ~ in which some sub- ~;(S) = c/lx;(XI(S), ... ,xn(s, j=I, ... ,n,
categories of epimorphisms @ and of monomorphisms are valid for at least one value of s, then it follows that
me have been distinguished such that the following con- they are valid for all values of s.
ditions are met:
References
1) all morphisms a in st are decomposable into a
[I] COURANT, R.: Methods of mathematical physics. Partial dif-
product a=vJ-t, where VE@, J-tE9n; ferential equations, 2, Interscience, 1965 (translated from the
2) if VJ-t=pT, where v, PE@, J-t, TEme, then there exists German). B.L Rozhdestvenskit
an isomorphism () such that p= vO, and T=(}-I J-t;
n
3) @ 9n coincides with the class of isomorphisms
Editorial
=
comments. The projections x;=x;(s),
i 1, ... , n, into x-space are called the bicharacteristic
in the category ~. curves (or rays). The bicharacteristic curves are tangent to
The epimorphisms in @ (the monomorphisms in me) the characteristic hypersurfaces q,(X1, ... ,xn)=O due to
are called the permissible epimorphisms (monomor- the homogeneity of the principal symbol, as a function of
phisms) of the bicategory. (~1' ... ,~n), of degree equal to the order of the linear par-
The concept of a bicategory axiomatizes the possibil- tial differential operator (cf. also Principal part of a dif-
ity of a decomposition of an arbitrary mapping into a ferential operator; Symbol of an operator).
product of a surjective and an injective mapping. The Nowadays, the standard reference on these matters is
category of sets, the category of sets with a marked [A1J, or the older, more concise, [A2].
point and the category of groups are bicategories with a References
unique bicategorical structure. In the category of all [A1] HORMANDER, L.: The analysis of partial differential operators,
1, Springer, 1983, 271; 302.
topological spaces and in the category of all associative [A2] HORMANDER, L.: Linear partial differential operators,
rings there are entire classes of different bicategorical Springer, 1963, 29; 31.
structures. AMS 1980 Subject Classification: 58FXX, 58GXX
References
[1] TSALENKO, M.SH. and SHUL'GEiFER, E.G.: Fundamentals of BICOMPLEX, binary complex, double complex - A
category theory, Moscow, 1974 (in Russian). M Sh T. I k graded module, i.e. one representable as the direct sum
. . sa en 0
~ m,n of its submodules A m,n, together with a pair of
Editorial comments. In the literature there is much con-
differentials d I : A m,n ~ Am + I.n ,
fusion about the terms bicategory, double category and 2-
category. As the terminology is not settled yet the reader d 2 :A m,n ~Am,n+l,
should take care. In this Encyclopaedia the term bicategory
is always used as defined above. which satisfy the conditions
AMS 1980 Subject Classification: 18005 dld l = 0, d 2 d2 = 0, d 2 d l +d l d 2 = O.
Instead of the direct sum, the set {A m,n} and the dif-
BICHARACTERISTIC, bicharacteristic strip, of a ferentials
linear partial differential operator - A line of tangency d I : A m,n ~ Am -I.n ,
of any two characteristics (cf. Characteristic) d 2 :A m.n ~ Am.n-I,
q,(xJ, ... ,xn) = 0, I/;{xJ, ... ,xn) =0 satisfying the above conditions, may be considered.
of this linear partial differential operator. If the param- V. E. Govorov
eter s is introduced on the bicharacteristic strip, then its Editorial comments.
equations Xj=Xj(s), i=I, ... ,n, are defined by solving References
a system of 2n ordinary differential equations [A1] MAc LANE, S.: Homology, Springer, 1963.
x;(S) = Q~, g; = -Qx" j= 1, ... ,n. (*) AMS 1980 Subject Classification: 18G35

383
BICONNECTED SPACE

BICONNECfED SPACE - A space that cannot be where O~(J<'1T, - 00<7<00. The coordinate surfaces
decomposed into a sum of two connected non- are: the family of pairs of circular cylinders with paral-
intersecting proper subsets containing more than one lel axes (7=const), the family of circular cylinders
point. orthogonal to the former (a=const), and the planes
A.A. Mal'tsev
(z =const). The system of bicylindrical coordinates is
AMS 1980 Subject Classification: 54D05 obtained as the result of translation of the system of
bipolar coordinates in the xy-plane parallel to the z-
BICYCLIC SEMI-GROUP - The semi-group with aXIS.
unit element and two generators a, b subject to the sin- The Lame coefficients (d. Lame coefficients) are:
gle generating relation ab = 1. One of the realizations of a2
the bicyclic semi-group is the Cartesian square N X N, L. = LT = (coshT-cosaf
, L.
"
= 1.
where N is the set of non-negative integers, with respect
The Laplace operator is:
to the operation
(k, 1)* (m, n) = (k + m - min(/, m), 1+ n - min(/, m)). Do! = 1
-(coshT-cosaf
a2
[naa n] naz
2
+
a~
+ 2
.
The bicyclic semi-group is an inversion semi-group and
D.D. Sokolov
as such is monogenic, i.e. is generated by a single ele-
AMS 1980 Subject Classification: 51 N20
ment. The idempotents (d. Idempotent) of the bicyclic
semi-group form a chain, which is ordered with respect
BICYLINDRICAL DOMAIN - A domain D in the
to the type of the positive numbers. The bicyclic semi-
complex space C2 that can be represented in the form
group is bisimple (cf. Simple semi-group).
of the Cartesian product of two planar domains D I
The bicyclic semi-group often occurs in theoretical
and D 2 , i.e.
investigations concerning semi-groups, not only as a
representative of certain important classes of semi- D = {(ZhZ2): zIED], Z2ED2}.
groups, but also as a 'block', which defines the struc- A special case of a bicylindrical domain is the bidisc
ture of individual semi-groups. Thus, for any idempo- (bicylinder) B(a, r)={(zj,Z2): I zl-al I <rl,
tent e of a O-simple, but not completely O-simple semi- I Z2 -a2 I <r2} of radius r=(r\, r2) with centre at
group S there exists a bicyclic sub-semi-group in S con- a=(aj,a2)' The Cartesian product of n (for n~3)
taining e as the unit element (d. [1], Para. 2.7). The ele- planar domains is said to be a polycylindrical domain. A
ments a and b of the bicyclic semi-group B defined as polydisc (polycylinder) is defined in a similar way.
above, are, respectively, its left and right multiplying M. Shirinbekov
elements (i.e. there exist proper subsets X and Y in B AMS 1980 Subject Classification: 32A07
such that aX=B, Yb =B). Moreover, in a semi-group S
with unit element e the element c will be a left multi- BICYLINDRICS - Two closed curves obtained as the
plier if and only if S contains the bicyclic semi-group intersection of two cylinders the axes of which intersect
whose unit element is identical with c; a similar at right angles. The parametric equations of bicylindrics
theorem is also valid for right multiplying elements, so are
that S has left multiplying elements if an c') only if it x = a cost. y = Vb'--a-sm-t.
,., z .
= aSlnt, b :?a,
also has right multiplying clements.
where a and b are the radii of the cylinders and t is a
References
[I] CLIFFORD, A.H. and PRESTON. G.B.: The algebraic theory of
parameter. If a = b, the bicylindrics is a pair of
semigroups, 1-2. Amer. Math. Soc .. 1961-1967. congruent ellipses.
E. V. Shikin
[2] LYAPIN. E.S.: Semigroups. Amer. Math. Soc .. 1974 (translated
from the Russian). AMS 1980 Subject Classification: 51 N20
L.N. Shevrin
AMS 1980 Subject Classification: 20MXX BIEBERBACH CONJEcruRE - A hypothesis enun-
ciated in 1916 by L. Bieberbach [I]: For all functions
BICYLINDER - A synonym for bicylindrical domain. j (z) of class S, i.e. functions j (z) which are regular
AMS 1980 Subject Classification: 32A07 and univalent in the disc I z I < I and which have the
expansIOn %

BICYLINDRICAL COORDINATES - The numbers !(z) = z + ~cnzn


2
a and z related to the rectangular Cartesian coordi-
11
T.

nates x, y and z by the formulas III this disc, one has the estimate I Cil I ~11, 11 ~2. and
I Cil I = 11 only for the Koehe jUl1c[iol1s
. _ a sinh T , _ a Sin a
~\ - COShT-cosa' ) - COShT-cosa' ~ h(z) = ;:(I-ed'z) "

3R4
BIEBERBACH CONJECTURE

where () is a real number. Bieberbach proved his con- (I cn 1 e/2)n+const), and in 1965 by I.M. Milin
jecture for n = 2. The problem of finding an accurate (ICn < 1.243 n, n~2).
1

estimate of the coefficients for the class S is a special The best estimates to date (1977) were obtained in
case of the coefficient problem. 1972 [7]:
Owing to its simple formulation and profound signi-
ficance, Bieberbach's conjecture attracted the attention
of numerous mathematicians and stimulated the
1 Cn 1 < -vf" n < 1.081 n, n ;;'2,

and in 1976 [10]:


development of different methods in the geometric
theory of functions of a complex variable. At the time 1 Cn 1 < 1.0691 n, n;;.2.
of writing (1977) the validity of the Bieberbach conjec- For reviews of studies on this subject see [2], [3], [9].
ture had been established for n ";;;6. It was first proved References
for n = 3 in 1923 by K. Loewner who introduced the [I] BIEBERBACH. L.: 'Uber die Koeffizienten derjenigen
parametric method (cf. Parametric representation Potenzreihen, welche eine schlichte Abbildung des
Einheitskreises vennitteln', Sitzungsber. Preuss. Akad. Wiss.
method); other proofs of the estimate 1C3 1.,;;;3 subse- Phys-Math. Kl. (1916),940-955.
quently appeared, and were based on variational [2] GOLUZIN, G.M.: Geometric theory of functions of a complex
methods, on parametric methods and on the method of variable, Trans!. Math. Monographs, 26, Amer. Math. Soc.,
1969 (translated from the Russian).
the extremal metric. The Bieberbach conjecture for
[3] MILIN, I.M.: Univalent functions and orthonormal systems,
n = 4 was first proved in 1955 by the simultaneous use Trans!. Math. Monographs, 49, Amer. Math. Soc., 1977
of variational and parametric methods. In 1960 the esti- (translated from the Russian).
[4] HAYMAN, W.K.: 'The asymptotic behaviour of p-valent func-
mate C4 .,;;;4 was obtained much more simply, with
1 1
tions', Proc. London Math. Soc. (3) 5 (1955),257-284.
the aid of Grunsky's univalence condition. This esti- [5A] OZAWA, M.: 'On the Bieberbach conjecture for the sixth coeffi-
mate was also obtained by variational methods, and cient', Kodai Math. Sem. Rep. 21 (1969), 97-128.
with the aid of geometric reasoning; it was once more [5B] OZAWA, M.: 'An elementary proof of the Bieberbach conjec-
ture for the sixth coefficient', Kodai Math. Sem. Rep. 21 (1969),
obtained by using the Grunsky inequalities in matrix 129-132.
form. The validity of Bieberbach's conjecture for n = 6 [6] PEDERSON, R.N. and SCHIFFER, M.M.: 'A proof of the Bieber-
was demonstrated in 1968 with the aid of the Grunsky bach conjecture for the fifth coefficient', Arch. Rat. Mech. and
Anal. 45 (1972), 161-193.
inequalities; it was proved for n = 5 in 1972 by varia-
[7] FITZGERALD, C.H.: 'Quadratic inequalities and coefficient esti-
tional methods. Among other results obtained in trying mates for the fifth coefficient', Arch. Rat. Mech. and Anal. 46
to prove the Bieberbach conjecture, the following are (1972), 356-368.
worthy of note. [8] SHIROKOV, N.A.: 'On a regularity theorem of Hayman', J.
Soviet Math. 2, no. 6 (1974), 693-710. (Zap. Nauchn. Sem.
W. Hayman [4] obtained a number of results on the Leningr. Otd. Mat. Inst. Steklov. 24 (1972), 182-200)
asymptotic behaviour of the coefficients of functions [9] BAZILEVICH. I.E.: Mathematics in the USSR during 40 years:
which are p-sheeted in 1 z 1 < 1, in particular for the 1917-1957, Vo!' I, Moscow, 1959, pp. 444-472.
[10] HOROWITZ, D.: 'A refinement eStimate for univalent func-
class S. He proved that the limit tions', Proc. A mer. Math. Soc. 54 (1976), 176-178.

. 1 Cn 1
E. G. Goluzina
hm--=a/
n-'X; n Editorial comments. Actually Bieberbach proved
1 C2 1.;;;2 and then asked in a footnote if perhaps generally
exists and that af";;; 1 with an equality sign only for the
1 c n I.;;;n, ct. [1].
Koebe functions.
In the laborious progress on the Bieberbach conjecture
A number of studies are available on the local from 1950 until 1975, M. Schiffer's variational methods
Bieberbach conjecture, i.e. the conjecture that the Koebe played a major role. For a detailed history, see Duren's
function gives max Cn at least for those functions in
1 1
book [A3].
the class S which are close to it in the relevant The case n=4 was proved by P.R. Garabedian and M.M.
topology (cf. Univalent function). It was found that for Schiffer in 1955.
any n = 3, 4, ... , there exists a sufficiently small n >0 In 1984 the Bieberbach conjecture was established in
such that for a function 1 (z) E S which satisfies the con- complete generality by the French-born U.S. mathematician
dition C2 - 2 <n the estimate Re cn.,;;;n is valid, and
1 1
Louis de Branges [A 1], [A2]. In fact, he proved a remark-
able hypothesis of the Soviet mathematicians N.A. Lebedev
Recn =n only for 10(z).
and I.M. Milin which was known to be even stronger than
An estimate of the coefficients for all n, accurate as
the Bieberbach conjecture, cf. [3], [A3]. The Lebedev-Milin
regards the order of dependence on n, was first
hypothesis asserted that for n = 2, 3,
obtained in 1925 by J.E. Littlewood by reducing the
estimate for the coefficients to an estimate for the
(average) integral of the modulus. More accurate esti-
QnCO) = ~1 {k 1 Yk 12 -1-}cn
k=1 k
- k) .;;; O. C)

mates were obtained in 1951 by I.E. Bazilevich Here the numbers Yk are the logarithmic coefficients of fCz) ,

385
BIEBERBACH CONJECTURE

defined by principle) to some function in R. Inequality (2) yields


the following sharp inequality for j(z)ER:
Every function fez) in S is the starting point fez, 0) of a 1 j' (z) 1 ~ Ill.:=- ~(i ~ I, 1 z 1 < 1. (3)
Loewner parametric family {f(z, t)}, t~O, of univalent func-
tions. The latter map the disc onto a continuously-increasing The following bound on the modulus of a function in
family of domains with limit C. Normalized so that R has been obtained: If j(z)ER, then
fez, t) / e t is in S, the functions of a Loewner family satisfy
Loewner's partial differential equation 1 fez) 1 ~ (1_:2)1/2' 1 z 1 =r, O<r<I, (4)
af af
at = Z~p(z, t), and (4) becomes an equality only for the functions
where Rep(z, tO [A4] , [A3]. De Branges introduced +j(zeiO;r), where () is real and
functionals Qn(t) corresponding to the logarithmic coeffi- . _ (l-r 2 )1/2 z
f(z,r)- l' .
cients Yk(t) for fez, t) / e t , replacing the weight factors n - k +lrz
in ("') by undetermined weights 0nk(t) with initial value n - k.
The method of the extremal metric (cf. Extremal metric,
By ingenious use of Loewner's differential equation he could
method of the) provided the solution of the problem of
show that the weights 0nk(t) may be chosen such that
tpe maximum and rnipmum of I j (z) I in the class
Q~(t)~O for all t~O and all n. The proof also used a sophis-
ticated positivity result for hypergeometric functions due to R(c) of functions in R with a fixed value I Cl L=c,
R. Askey and G. Gasper. Since Qn(t)~O as t~oo it followed O<c.:;;;l, in the expansion (1): For j(z)ER(c),
that Qn(O)~O. The latter inequality established the O<c < 1, the following sharp inequalities are valid:
Lebedev - Milin hypothesis ("') and thereby also ImH(ir;r, c) ~ 1 f(re i9 ) 1 ~ ImF(ir;r, c). (5)
Bieberbach's conjecture I en I ~n. The case of equality was
dealt with both by de Branges [A2] and by C.H. FitzGerald Here the functions w=H(z;r, c) and w=F(z;r, c)
and C. Pommerenke [A5]. map the disc I z I < 1 onto domains which are sym-
References
metric with respect to the imaginary axis of the w-
[A 1] BRANGES, L. DE: 'A proof of the Bieberbach conjecture' . plane, and the boundaries of which belong to the union
Preprint E-5-84. Steklov Math. Inst. Leningrad (1984), 1-21. of the closures of certain trajectories or orthogonal tra-
[A2] BRANGES, L. DE: 'A proof of the Bieberbach conjecture', Acta
jectories of a quadratic differential in the w-plane with a
Math. 154 (1985),137-152.
[A3] DUREN, P.L.: Univalent functions, Springer, 1983. certain symmetry in the distribution of the zeros and
[A4] POMMERENKE, c.: Univalent functions, Vandenhoeck and Eoles [4], [5]. Certain optimal results for functions in
Ruprecht, 1975. R(c) were obtained by the simultaneous use of the
[A5] FITZGERALD, C.H. and POMMERENKE, c.: 'The de Branges
theorem on univalent functions', Trans. Amer. Math. Soc.
method of the extremal metric and the symmetrization
290 (1985), 683-690. method [4].
AMS 1980 Subject Classification: 30C50
_ Many results obtained for the functions in the classes
Rand R are consequences of corresponding results for
BIEBERBACH - ElLENBERG FUNCfIONS in the systems of functions mapping the disc I z I < 1 onto
disc I z I < 1 - The class R of functions j (z), regular in disjoint domains [6]. The analogue of R for a finitely-
the disc I z I <1, which have an expansion of the form connected domain G without isolated boundary points
) and not containing the point z = 00, is the class
j(Z)=CIZ + ... +cnz n+ ... (1)
Ra(G),aEG, of functions j(z) regular in G and satisfy-
and which satisfy the condition
*'
ing the conditions j(a)=O, j(ZI)j(Z2)=I=l, where
j(Zdj(Z2) 1, 1 Zl 1 <1, 1 Z2 1 <1. z I, Z 2 are arbitrary points in G. The class Ra( G)
This class of functions is a natural extension of the extends the class Ba( G) of functions j (z), regular in G
class B of functions j (z), regular in the disc I z I < 1, and such that j(a)=O, I j(z) I <1 in G. The following
with an expansion (1) and such that I j (z) I < 1 for sharp estimate is an extension of the result of
I z I < 1. The class of univalept functions (cf. Univalent Bieberbach - Eilenberg and of inequality (3) to func-
function) in R is denoted by R. The functions in R were tions of class Ra(G): If j(Z)ERa(G), then
named _after L. Bieberbach [1], who showed that for I/(z) 1 ~ 1 1- j2(Z) 1 F'(z, z), ZEG.
j(z)ER the inequality where F(z, b), bEG, is that function in Bb(G) for
1 CI 1 ~ 1 (2) which F'(b, b)=max I feb) I in this class.
is valid, while equality holds only for the function References
j(z)=eiOz, where () is real, and after S. Eilenberg [2], [IJ BIEBERBACH, L.: 'Ueber einige Extremalprobleme im Gebiete
who proved that the inequality (2) is valid for the der konformen Abbildung', Math. Ann. 77 (1916), 153-172.
[2J ElLENBERG, S.: 'Sur quelques proprietes topologiques de la sur-
whole class R. It was shown by W. Rogosinski [3] that face de sphere', Fund. Math. 25 (1935),267-272.
every function in R is subordinate (cf. Subordination [3J ROGOSINSKI, W.: 'On a theorem of Bieberbach - Eilenberg', 1.

386
BIFURCA nON

London Math. Soc. (1) 14 (1939), 4-11. BIFACfORIAL MAPPING - A mapping f of a topo-
[4] JENKINS, J.A.: Univalent functions and conformal mappings,
Springer, 1958. logical space X into a topological space Y, in which for
[5] JENKINS, J.A.: 'On Bieberbach - Eilenberg functions III', any covering of the inverse image [-I (y) of any point
Trans. Amer. Math. Soc. 119, no. 2 (1965), 195-215. Y E f (X) by sets open in X it is possible to select a fin-
[6] LEBEDEV, N.A.: The area principle in the theory of univalent
functions, Moscow, 1975 (in Russian).
ite number of sets so that y is located inside the image
G. V. Kuz'mina of their union. It is particularly important that the pro-
Editorial comments. duct of any collection of bifactorial mappings is a
References bifactorial mapping. Bifactorial mappings constitute an
[A 1] DUREN, P.L.: Univalent functions, Springer, 1983. extensive class of factorial mappings (cf. Factorial
AMS 1980 Subject Classification: 30C70 mapping), but nevertheless preserve the fine topological
properties of spaces. Thus, continuous bifactorial s-
BIEBERBACH POLYNOMIALS - Extremal polyno- mappings preserve a pointwise-countable base, and a
mials which approximate a function that conformally factorial s-mapping of a space with a pointwise-
maps a given simply-connected domain onto a disc. countable base onto a space of pointwise-countable
These polynomials were first studied by L. Bieberbach type is bifactorial.
[1] in the context of the problem of the approximate
v. V. Filippov
computation of a conformal mapping. AMS 1980 Subject Classification: 54C10
Let G be a simply-connected domain in the finite
part of the plane bounded by a curve r, and let the BIFUNCfOR - A mapping T: 2f X SB~C, defined on
function w = cp(z) map this domain conformally and the Cartesian product of two categories 2f and SE with
univalently onto the disc I w I <ro under the condi- values in C, which assigns to each pair of objects A E 2f,
tions cp(zo)=O and Ij>'(zo) = 1, where Zo is an arbitrary B ESE some object C EC and to each pair of mor-
fixed point of G and ro depends on zoo The polynomial phisms
a:A ~ A', fJ:B ~ B'
'lTn(z) which minimizes the integral
the morphism
I(Fn) = f f G 1 F~(z) 12 dx dy T(a, fJ): T(A', B) -> T(A, B'). (I)

in the class of all polynomials Fn(z) of degree n subject The following conditions
to the conditions Fn(zo)=O and F~(zo)= I is called the T(lA, 1B ) = 1T(A. B),
Bieberbach polynomial. In the class of all functions
T(a' oa, fJ' of3) = T(a, fJ')oT(a', fJ), (2)
which are analytic in the domain G and which satisfy
the same conditions, this integral is minimized by the must also be met. In such a case one says that the func-
mapping function w =cp(z). If the contour r is a Jordan tor T is contravariant with respect to the first argument
curve, the sequence {'ITn(z)} converges uniformly to the and covariant with respect to the second.
function cp(z) inside G. In the closed domain G there V. E. Govorov
need not be convergence [2]. If the contour r satisfies Editorial comments. What is described above is a
certain additional smoothness conditions, the sequence bifunctor contravariant in its first argument and covariant in
{'lTn(z)} converges uniformly in the closed domain, and its second. A bifunctor covariant in both arguments, the
the rate of convergence depends on the degree of more fundamental notion ([A 1D, has (1) and (2) replaced by
smoothness of r. T(a, f3): T(A, B) ~ T(A, B), (1 ')

References T(a'a, fJfJ) = T(a, fJ")oT(a, fJ) (2')


[I] BIEBERBACH, L.: 'Zur Theorie und Praxis der konfonnen Similarly one can define bifunctors contravariant in both
Abbildung', Rend Circ. Mat. Palermo 38 (1914),98-112.
arguments and covariant in the first and contravariant in the
[2] KEWYSH, M.V.: 'Sur l'approximation en moyenne quadratique
des fonctions analytiques', Mat. Sb. 5 (47), no. 2 (1939), 391- second argument.
401. References
[3] MERGELYAN, S.N.: Some questions of the constructive theory of [A 1] MITCHELL, B.: Theory of categories, Acad. Press, 1965,
functions, Moscow, 1951 (in Russian).
[4J SUETIN, P.K.: 'Polynomials orthogonal over a region and AMS 1980 Subject Classification: 18A22
Bieberbach polynomials', Proc. Steklov Inst. Math. 100 (1974).
(Trudy Mat. Inst. Steklov. 100 (1971
P.K. Suetin BIFURCATION - A term in certain branches of
mathematics, applied to situations in which some object
Editorial comments. A good additional reference is [A 1].
:I) = :I)(A) depends on a parameter A (which is not
References necessarily scalar) and which is such that in any neigh-
[A1] GAlER, D.: Vorlesungen uber Approximation im Komplexen,
Birkhauser, 1980. bourhood of a certain value Ao of that parameter (a
bifurcation value or a bifurcation point) the qualitative
AMS 1980 Subject Classification: 30C10, 30C70 properties under consideration of the object :I)(A) are

387
BIFURCATION

not the same for all X. The corresponding rigorous and results consists in the fact that, subject to relatively
definitions vary in different cases, but mainly follow (in weak limitations, the branching of the solution x =0
a more or less modified form) the two variants below: can be established; in particular, it is possible to prove
A) The qualitative properties of the object ~ being that the solution of the non-linear problem is not
studied are the existence of other objects 0, associated unique. The methods of the theory of branching of
with it in some manner. The bifurcation is character- solutions of non-linear equations often yield more accu-
ized by the fact that when X is varied, the objects 0 rate information [5].
appear or disappear (in particular, they may coincide 2) The theory of smooth dynamical systems studies
or else one object may generate several ones). See below one-parameter (and sometimes also two-parameter [6])
(Paragraph 1. families of flows (and cascades; only the former will be
B) The first step is to determine under what cir- considered here), and the conditions under which the
cumstances two objects ~(X) are considered to be bifurcation is 'typical', i.e. preserves its character under
equivalent. (This definition must be such that all the a small change of the family in question [9]; both vari-
qualitative properties of interest be identical for ants A) and B) above are useful. In the second variant
equivalent objects.) A change in the qualitative proper- two flows are considered equivalent if there exists a
ties of ~(X) in a neighbourhood of a bifurcation point homeomorphism of the phase space which converts the
means, by definition, that X-values with non-equivalent trajectories of the one into trajectories of the other
~(X) are found near this point. See below (Paragraph while preserving the direction of motion. There exists a
2. completely satisfactory theory of bifurcations of one-
I) In the theory of operators the initial object ~(X) is parameter families of flows with a two-dimensional
a non-linear operator fl>(x, X) in a real Banach space, phase space [7], [9], and a local variant referring to a
depending on a real parameter X, defined in a neigh- neighbourhood of an equilibrium position or of a
bourhood of the point x =0 and such that fl>(0, X)-O. periodic solution in the n-dimensional case [6].
For each fixed X one associates new objects 0 to this In variant A) the objects 0 under study, which are
fl>; they are the solutions x of the non-linear operator associated to the given dynamical system, are the posi-
equation fl>(x, X)=x. A bifurcation point is a point at tions of an equilibrium or periodic solution, and occa-
which a new, non-trivial, solution of this equation is sionally certain invariant manifolds (mainly tori) or
generated. It is in fact a point Ao such that for any t:>0 hyperbolic invariant sets. The 'growth' of these objects
there exists a X, I X- Ao I <t:, for which the equation - taking place 'locally' near an equilibrium position,
x = fl>(x, X) has a solution x(X) satisfying the conditions or near a periodic solution, or 'semi-locally' in a neigh-
0< " x(X) "<t:. If fl>(x, X) AAx, where A is a linear bourhood of a 'closed contour' formed by several tra-
completely-continuous operator, the concept of a bifur- jectories which tend to an equilibrium position or to a
cation point coincides with the concept of a characteris- periodic solution - is considered. A case of bifurca-
tic value of A. tion which is connected with a similar contour in a cer-
If fl>(x, X) is a non-linear completely-continuous tain sense, but which takes place (with a change of the
operator which is continuously Frechet differentiable parameter ,\) prior to its arrival, is also possible [8]. The
and is such that fl>x(O, X) M, then only the charac- growth of periodic solutions is often conveniently con-
teristic values of A can be bifurcation poir) s of fl>. It sidered by rewriting the differential equation and the
was found by a topological method [1], [2] that each periodicity condition in the form of an integral equa-
odd-fold (in particular, simple) characteristic value of A tion to which suitable methods are applied [5].
is a bifurcation point of fl>. An analogous sufficient 3) Various bifurcations of different objects (both the
condition for the case of even-fold characteristic values initial objects and those associated to them) are
is formulated using the concept of the rotation of a vec- encountered in the theory of singularities of mappings.
tor field. As a result, this term (or rather a term derived from it)
If x =0 is a non-isolated solution of the equation is used in several different ways [10], [6], [11], but it is
x =fl>(x, ,\), then Ao is a bifurcation point of fl>. It was more common to assign independent names to the
shown by a variational method [1], [2] that if fl>(x) is a corresponding concepts. They include, for instance, ver-
non-linear completely-continuous operator in a Hilbert sal families (or deformations) [6], [11], [12], which
space which is the gradient of a weakly continuous describe, in a certain sense, all possible bifurcations
functional, while A = fl>' (0) is a completely-continuous which may take place under a small deformation of the
self-adjoint operator, then all characteristic values of A object under consideration. These include, in particu-
are bifurcation points of fl>. The concept of a bifurca- lar, seven elementary catastrophes [12], which are
tion point is also modified in the case of large solutions represented by 'typical' k-parametric (k ";;4) families of
X('\)-HX) for X~Ao. The importance of these concepts functions which include a function with a degenerate

388
BIHARMONIC FUNCTION

critical point and which are defined in a neighbourhood il 2 u _ il(ilu) = 0,


of that point; they accordingly describe the correspond-
where d is the Laplace operator. This equation is
ing bifurcation. In non-Soviet literature on the theory
known as the biharmonic equation. The class of bihar-
of singularities, the term 'catastrophes' is often
monic functions includes the class of harmonic func-
employed for 'bifurcations'.
tions and is a subclass of the class of poly-harmonic
References functions (cf. Harmonic function; Poly-harmonic func-
[1) KRASNOSEL'sKIi, M.A.: Topological metods in the theory ofnon- tion). Each biharmonic function is an analytic function
linear integral equations, Pergamon, 1964 (translated from the
Russian). of the coordinates Xi'
[2) KREiN, S. (ED.): Functional analysis, Handbook of Math. From the point of view of practical applications the
Libraries, Moscow, 1964 (in Russian). biharmonic functions in two variables u(xJ, X2) are the
[3) KRAsNOSEL'sKIi, M.A: Positive solutions of operator equations,
Wolters-Noordhoff, 1964 (translated from the Russian). most important. Such biharmonic functions can be
[4) VAiNBERG, M.M.: Variational methods for the study of nonlinear represented using harmonic functions UI, U2 or VI, V2,
operators, Holden-Day, 1964 (translated from the Russian). in the form
[5) VAiNBERG, M.M. and TRENOGIN, V.A.: Theory of branching of
solutions of non-linear equations, Noordhoff, 1974 (translated
from the Russian).
[6) ARNOL'D, V.I.: 'Lectures on bifurcations and versal families', or
Russian Math. Surveys 27, no. 5 (1972), 54-123. (Uspekhi Mat.
u (x J, X2) = (r2 - ra)v 1(x J, X2)+ V2(X 1, X2),
Nauk 27, no. 5 (1972), 119-lS4) where r2 =xT + x~, while r6 is a constant. The main
[7) ANDRONOV, A.A., LEONTOVICH, E.A., GoRDON, 1.1. and boundary value problem for biharmonic functions is
MAIER, A.G.: Theory of bifurcations of dynamic systems on a
plane, Israel Progr. Sci. Transl., 1971 (translated from the Rus- the following: To find a biharmonic function in the
sian). domain D that is continuous together with its first-
[SA) GAVRlLOV, N.K. and SHIL'NIKOV, L.P.: 'On three-dimensional order derivatives in the closed domain D= Due and
systems close to systems with a structurally unstable homoc-
linic curve 1', Math. USSR-Sb. 17, no. 4 (1972), 467-4S5. satisfies on the boundary C the conditions
(Mat. Sb. 88 (130), no. 4 (1972),475-492)
[SB) GAVRlLOV, N.K. and SHIL'NIKOV, L.P.: 'On three-dimensional
systems close to systems with a structurally unstable homoc-
ule = !1(S), ~~ Ie = h(s), (*)

linic curve II', Math. USSR-Sb. 19, no. 1 (1973), 139-156.


(Mat. Sb. 90 (132), no. 1 (1973), 139-156) where au / an is the derivative with respect to the nor-
[9) PEIXOTO, M.M.: 'On bifurcations of dynamical systems', in mal on C, and 11 (s), h (s) are given continuous func-
Proc. Internat. Congress Mathematicians Vancouver, 1974, Vol. tions of the arc length s on the contour C. The above
2, Vancouver, 1974, pp. 315-319.
[10) THOM, R.: 'The bifurcation subset of a space of maps', in representations of biharmonic functions give the solu-
Manifolds, Amsterdam 1970, Springer, 1971, pp. 202-20S. tion of problem (*) in an explicit form for the case of
[II) ARNOL'D, V.I.: 'Critical points of smooth functions and their the disc D, starting from the Poisson integral for har-
normal form', Russian Math. Surveys 30, no. 5 (1975), 1-75. monic functions [1].
(Uspekhi Mat. Nauk 30, no. 5 (1975), 3-65)
[I 2) BROCKER, P. and LANDER, L.: Differentiable germs and catas- Biharmonic functions in two variables may also be
trophes, Cambridge Univ. Press, 1975. represented as follows:
D. V. Anosov
V.A. Trenogin u(XJ, X2) = Re{zcj>(z)+x(z)} =

Editorial comments. A standard reference for bifurcation = ~ {zcj>(z)+zcj>(z)+x(z)+x(z)}, Z=Xl -iX2,


theory is [A 1]. The presence of symmetry in a bifurcation
problem can often be a powerful tool [A2]. For more pre- using two analytic functions cp(z), X(z) of the complex
cise details in the case of bifurcations of solutions to equa- variable z = x I + ix 2. This representation makes it pos-
tions cf. Branching of solutions. sible to reduce the boundary value problem (*) for an
References arbitrary domain D to a system of boundary value
[A1) CHOW, S.-N. and HALE, J.K.: Methods of bifurcation theory, problems for analytic functions, for which a method of
Springer, 1982. solving was developed in detail by G.V. Kolosov and
[A2) SALLINGER, D.H.: Branching in the presence of symmetry,
SIAM,1983. N.!. Muskhelishvili. This method was developed while
solving various planar problems of elasticity theory (cf.
AMS 1980 Subject Classification: 35832, 58E07, 58- Elasticity theory, planar problem of), in which the main
XX biharmonic function is the stress function or the Airy
function [2], [3] (d. Airy functions).
BIHARMONIC FUNCTION A function
u (x) = u (x I, . . . ,xn ) of real variables, defined in a References
domain D of the Euclidean space Rn, n ;"2, with con- [I] TlCHONOFF, AN. [AN. TIKHoNov) and SAMARSKll, A.A.: Dif-
ferentialgleichungen der mathematischen Physik, Deutsch. Ver-
tinuous partial derivatives up to the fourth order lag Wissenschaft., 1959 (translated from the Russian).
inclusive, that satisfies in D the equation (2) MUSKHELISHVILI, N.I.: Some basic problems of the mathemati-

389
BIHARMONIC FUNCTION

cal theory of elasticity, Noordhoff, 1975 (translated from the A biholomorphic mapping 'is proper (i.e. the pre-image
Russian).
[3] LAVRENT'EV, M.A. and SHABAT, B.V.: Methoden der komplexen
of a compact set is compact), since ,-1
is continuous.
Riemann's theorem does not hold in the following sense:
Funktionentheorie, Deutsch. Verlag Wissenschaft., 1967
(translated from the Russian). There is no proper holomorphic mapping from the polydisc
E.D. Solomentsev in en onto the ball in em for any n, m> 1, cf. [A4]. Thus,
Editorial comments. An axiomatic treatment of bihar- function theory in en, n;;;.1, is strongly related to the
monic functions, similar to that of harmonic functions, is domain of definition of the functions. For function theory in
given in [A1], [A2]. the (unit) ball of en see [A5]; for function theory in
References polydiscs see [A6]. For entire holomorphic mappings and
[A 1] SMYRNELIS, E.P.: 'Axiomatique des fonctions biharmoniques', their value distribution see [A7].
Ann. Insf. Fourier (Grenoble) 25, no. 1 (1975), 35-97. References
[A2] SMYRNELIS, E.P.: 'Axiomatique des fonctions biharmoniques', [A1] BAOUENDI, M.S., JACOBOWITZ, H. and TREVES, F.: 'On the
Ann. Inst. Fourier (Grenoble) 26, no. 3 (1976), 1-47. analyticity of CR mappings', Ann. ot Math. 122 (1985), 365-
400.
AMS 1980 Subject Classification: 31 A30, 31830,
31C05
a
[A2] BELL, ST.: 'Biholomorphic mappings and the problem', Ann.
otMath. 114 (1981),103-113.
[A3] FEFFERMAN, c.: 'The Bergman kernel and biholomorphic
BmOLOMORPHIC MAPPING, holomorphic isomor- mappings of pseudoconvex domains', Inv. Math. 26 (1974),
phism, holomorphism, pseudo-conformal mapping - A 1-65.
generalization of the concept of a univalent confonnal [A4] KRANTZ, S.G.: Function theory ot several complex variables,
Wiley, 1982, Chapt. 10.
mapping to the case of several complex variables. A [A5] RUDIN, W.: Function theory in the unit ball otcn, Springer,
holomorphic mapping of a domain D ccn onto a 1980.
domain D' ccn is said to be a biholomorphic mapping if [A6] RUDIN, W.: Function theory in polydiscs, Benjamin, 1969.
[A7] GRIFFITHS, PH.A.: Entire holomorphic mappings in one and
it is one-to-one. A biholomorphic mapping is non- several variables, Princeton Univ. Press, 1976.
degenerate in D; its inverse mapping is also a biholo-
morphic mapping. AMS 1980 Subject Classification: 32HXX
A domain of holomorphy is mapped into a domain of
BIJECTION, bijective mapping, of a set A into a set
holomorphy under a biholomorphic mapping; holo-
B - A mapping f: A --"?B under which different elements
morphic, pluriharmonic and plurisubharmonic func-
in A have different images in f (A) = B. In other
tions are also invariant under a biholomorphic map-
words, f is a one-to-one mapping of A onto B, i.e. a
ping. If n > 1, biholomorphic mappings are not confor-
mapping that is both an injection and a surjection. A
mal (except for a number of linear mappings) and the bijection establishes a one-to-one correspondence
Riemann theorem is invalid for biholomorphic map- between the elements of the sets A and B. A bijection
pings (e.g. a ball and a polydisc in C2 cannot be biho- of a set A onto itself is also called a permutation of A.
lomorphically mapped onto each other). A biholo- o.A. Ivanova
morphic mapping of a domain D onto itself is said to
be a (holomorphic) automorphism; if n > 1, there exist AMS 1980 Subject Classification: 04A05
simply-connected domains without automorphisms
BIUNEAR DIFFERENTIAL - An analytic differen-
other than the identity mapping.
tial on a Riemann surface, depending on two points P
References and Q, and having the form
[I] SHABAT, B.V.: Introduction to complex analysis, 1-2, Moscow,
1976 (in Russian). fez, ndz dt
E.D. Solomentsev
Editorial comments. Concerning boundary behaviour of where z and ~ are local uniformizing parameters III a
biholomorphic mappings the following results have been neighbourhood of P and Q respectively, and f (z, n is
obtained. C. Fetterman's theorem: A biholomorphic mapping an analytic function of z and r Bilinear differentials
between strongly pseudo-convex domains with Coo -smooth are used to express many functionals on a finite
boundary extends Coo -smoothly to a diffeomorphism Riemann surface.
between the closures of the domains, see [A3]. The same
References
result holds if the domains are only pseudo-convex and one [I] SCHIFFER,M. and SPENCER, D.C.: Functionais offinite
of them satisfies condition R for the Bergman projection, Riemann surfaces. Princeton Univ. Press. 1954.
see [A2]. For strongly pseudo-convex domains with C k _ E.D. Solomentsev
boundary, k>2, C k - 1 -, extendability was obtained (>0 if
k=2, 3, ... , =0 otherwise) by L. Lempert and by S. AMS 1980 Subject Classification: 30F30
Pincuk. For (weakly) pseudo-convex domains with real-
analytic boundary one has even holomorphic extension to a BIUNEAR FORM on a product of modules V X W -
neighbourhood of the closure, see [A 1]. Similar results were A bilinear mapping f: V X W --"?A, where V is a left uni-
obtained for proper holomorphic mappings. tary A-module, W is a right unitary A-module, and A is

390
BILINEAR FORM

a ring with a unit element, which is also regarded as an dimensional and f is non-degenerate, then
(A, A)-bimodule. If V= W, one says that fis a bilinear
form on the module V, and also that V has a metric
dim V = dim W
structure given by f Definitions involving bilinear and for each basis v], ... ,Vn in V there exists a basis
mappings make sense also for bilinear forms. Thus, one w], ... ,Wn in W which is dual with respect to f; it is

speaks of the matrix of a bilinear form with respect to defined by the conditionf(v;, w)=8 ij , where 8ij are the
chosen bases in V and W, of the orthogonality of ele- Kronecker symbols. If, in addition, V= W, then the
ments and submodules with respect to bilinear forms, submodules V ~ and W ~ are said to be the right and
of orthogonal direct sums, of non-degeneracy, etc. For the left kernel of f, respectively; for symmetric and
instance, if A is a field and V = W is a finite- anti-symmetric forms the right and left kernels are
dimensional vector space over A wi th basis e], ... , en, identical and are simply referred to as the kernel.
then for the vectors Let f be a symmetric or an anti-symmetric bilinear
form on V. An element v E V for which f(v, v)=O is
v =V I eI + ... + l'n en
said to be an isotropic element; a submodule MeV is
and
w=wlel + ... +wne" said to be isotropic if M n
M ~ =1= {O}, and totally isotro-
pic if Me M ~. Totaliy isotropic sub modules play an
the value of the form will be
important role in the study of the structure of bilinear
n
1(1', w) = ~ a,lv,It'I' forms (d. Witt decomposition; Witt theorem; Witt
/)~ I ring). See also Quadratic form for the structure of bil-
where ai,. = f (e;. e). The polynomial 2: I,)n . _ ai, V; w) in inear forms.
th e vanables v], ... , vn , W] . . . 'W n IS sometImes
-]. .
Let A be commutative, let HomA (V, W) be the A-
identified with f and is called a bilinear form on V. If module of all A-linear mappings from V into W, and
the ring A is commutative, a bilinear form is a special let L 2 (V, W, A) be the A-module of all bilinear forms
case of a sesquilinear fonn (with the identity automor- on VX W. For every bilinear form f on VX Wand for
phism). each Vo E V, the formula
Let A be a commutative ring. A bilinear form on an 't"o(w) = 1(1'0, W), WE W,
A -module V is said to be symmetric (or anti-symmetric
or skew-s)immctric) if for all VI, V 2 E V one has defines an A-linear form on W. Correspondingly, for
f(V],V2)=j(V2,vd (or f(V],V2)=-f(V2,vd), and is Wo E W the formula
said to be alternating if f(v, v)=O. An alternating bil- rr,wo(v) = 1(1', wo). VEV,
inear form is anti-symmetric; the converse is true only
defines an A-linear form on V. The mapping
if for any aEA it follows from 2a=0 that a=O. If V
l( vo~lf,vo is an element of
has a finite basis. symmetric (or anti-symmetric or
alternating) forms on V and only such forms have a HomAV, HomA(W, A,
symmetric (anti-symmetric, alternating) matrix in this The mapping rf in
basis. The orthogonality relation with respect to a sym-
HomAW, HomA(V ,A
metric or anti-symmetric form on V is symmetric.
A bilinear form f on V is said to be isometric with a is defined in a similar way. The mappings f~/f and
bilinear form g on W if there exists an isomorphism of f~rf define isomorphisms between the A-modules
A-modules : V ~ W such that L2(V, W, A) -> Hom4(V, HomA(W, A
g(Cv), Cw =I (v, w) and
L2(V, W, A) -> HomAW, Hom4(V, A.
for all v E V. This isomorphism is called an isometry of
the form and, if V = Wand f = g, a metric automorphism A bilinear form f is said to be left-non-singular (respec-
of the module V (or an automorphism of the form f) tively, right-non-singular) if If (respectively, rf) is an iso-
The metric automorphisms of a module form a group morphism; if f is both left- and right-non-singular, it is
(the group of automorphisms of the form f); examples of said to be non-singular; otherwise it is said to be singu-
such groups are the orthogonal group or the symplectic lar, A non-degenerate bilinear form may be singular.
group. For free modules V and W of the same finite dimen-
Let A be a skew-field and let f be a bilinear form on sion a bilinear form f on V X W is non-singular if and
V X W; let the spaces V / W 1- and W / V ~ be finite- only if the determinant of the matrix of f with respect
dimensional over A; one then has to any bases in V and W is an invertible element of the
ring A. The following isomorphisms
dim V / W ~ = dim W / V ~ ,
I
and this number is called the rank of f If V is finite- Hom A (V. V) --> L 2 (V, W, A)

391
BILINEAR FORM

and r
A =B and A is commutative, then the set L 2(V, W, H)
HOmA(W, W) ~ L 2 (V, W, A), of all bilinear mappings VX W~H is an A-module
with respect to the pointwise defined operations of
given by a non-singular bilinear form f, are defined by
addition and multiplic~tion with elements in A, while
the formulas
the correspondence f ~f establishes a canonical isomor-
1(q,Xv, w) = f(q,(v), w)
phism between the A-module L 2(V, W, H) and the A-
and
r(~Xv, w) = f(v, ~w.
module L (V w, H) of all linear mappings from
VWinto H.
The endomorphisms epE HomA (V, V) and Let V and W be free modules with bases Vi, i EI, and
~EHomA(W, W) are said to be cOnjugate with respect to Wi' j EJ, respectively. A bilinear mapping f is fully
the form f if ~ = (r -1 0/)(ep).
determined by specifying f(Vi, Wi) for all i EI, j EJ,
References since for any finite subsets I' CI, f CJ, the following
[1] BOURBAKI, N.: Elements of mathematics. Algebra: Algebraic
structures. Linear algebra, 1, Addison-Wesley, 1974,
formula is valid:
f( ~.a;v;, ~.wA] ~ aJ(v;, wj)bj.
Chapt.l;2 (translated from the French).
= (*)
[2] LANG, S.: Algebra, Addison-Wesley, 1974. ieI jeJ ier,
[3] ARTIN, E.: Geometric algebra, Interscience, 1957. jeJ

v.L. Popov Conversely, after the elements hij EH, i EI, j EJ, have
AMS 1980 Subject Classification: 15A63 been chosen arbitrarily, formula (*), where
f(Vi, Wi)=hij' defines a bilinear mapping from VX W
BILINEAR FUNCfIONAL - A bilinear mapping from into H. If I and J are finite, the matrix Ilf(Vi' wi)11 is
a module over a commutative ring K (a K-module) into said to be the matrix of f with respect to the given
K itself (considered as a K-module). bases.
M.l. Voitsekhovskii Let a bilinear mapping f: VX W ~H be given. Two
elements V E V, W E W are said to be orthogonal with
AMS 1980 Subject Classification: 16A64
respect to f if f (u, w) = O. Two subsets X C V and
BILINEAR INTEGRAL FORM - The double integral YeW are said to be orthogonal with respect to f if any
b b x EX is orthogonal to any y E Y. If X is a submodule in
J(q" ~)= j jK(x, s)</>(x~s)dx ds, V, then
= {wEW:f(x,w)=O for allXEX},
a a
X-L
where K(x, s) is a given (usually complex-valued)
square-integrable function of real variables, and cJ>(x), which is a submodule of W, is called the orthogonal
1[;(x) are arbitrary (also complex-valued) square- submodule or the orthogonal complement to X. The
integrable functions, while 1[;(s) is the complex conju- orthogonal eomplement Y ~ of the submodule Y in W
gate function of 1[;(s). If 1[;(s) = cJ>(s), J (ep, ep) is said to be is defined in a similar way. The mapping f is said to be
a quadratic integral form. right-degenerate (left-degenerate) if V ~=i={O}
B. V. Khvedelidze (W~=i={O}). The sub modules V~ and W~ are called,
AMS 1980 Subject Classification: 46CXX respectively, the left and right kernels of the bilinear
mapping f If V~ = {O} and W~ = {O}, then f is said
BILINEAR MAPPING, bilinear function - A mapping to be non-degenerate; otherwise it is said to be degen-
ffrom the product VX W of a left unitary A-module V erate. The mapping f is said to be a zero mapping if
and of right unitary B-module W into an (A, B)- V ~ = W and W ~ = V.
bimodule H, satisfying the conditions Let v., i EI, be a set of left A-modules, let Wi, i EI,
f(v +v', w) = f(v, w)+ f(v', w); be a set of right B-modules, let fi be a bilinear mapping
from Vi X W; into H, let V be the direct sum of the A-
f(v, w+w') = f(v, w)+f(v, w');
modules v.,
and let W be the direct sum of the B-
f(av, w) = af(v, w); modules Wi' The mapping f: VX W ~H, defined by
f(v, wb) = f(v, w)b; the rule
f(~v;, ~w;] = (v;, Wi),
where v, v' E V, W, W' E W, a EA, b EB are arbitrarily lEI lEI lEI

chosen elements, and A and B are rings with a unit ele- is a bilinear mapping and is said to be the direct sum of
ment. The tensor product V W over Z has the the mappings Ji. This is an orthogonal sum, i.e. the sub-
natural structure of an (A, B)-bimodule. Let module Vi is orthogonal to the sub module Jfj with
q,: VX W~VWbe a canonical mapping; any bilinear respect to fif i=i=j.
mapping f will !hen induce a homomorphism of The bilinear mapping f is non-degenerate if and only
(A, B)-bimodules f: V W ~H for which f= foep. If if fi is non-degenerate for all i EI. Moreover, if f is

392
BINARY COMPUTING SYSTEM

non-degenerate then one has See also Multimodal distribution; Unimodal distribution.
A. V. Prokhorov
Vjl. = ~ nJ, Wjl. = ~~.
Joj:-j Joj:-j AMS 1980 Subject Classification: 62E99
If A = B = H, a bilinear mapping is called a bilinear
form. BIMODULE, double module - An Abelian group B
References that is a left module over a ring R and a right module
[I) BOURBAKI, N.: Elements of mathematics. Algebra: Algebraic over a ring S, and is such that (rb)s = r (bs) for all r E R,
structures. Linear algebra, I, Addison-Wesley, 1974,
b EB, s ES. One says that this is the situation RBs , or
Chapt.!;2 (translated from the French).
[2) LANG, S.: Algebra, Addison-Wesley, 1974. that B is an (R, S)-birnodule. The birnodule B may be
v.L. Popov regarded as a left R S' -module, where S' is the ring
AMS 1980 Subject Classification: 16A64, 15A69, which is dually isomorphic (anti-isomorphic) to S,
13CXX while denotes the tensor product over the ring of
integers, and (r s)b = rbs. For every left R -module M
BIMATRIX GAME - A finite non-cooperative game one has the situation RME , where E is the ring of
between two players. A bimatrix game is specified by endomorphisms of M. Any ring A can be given the
two matrices A = II aU II and B = I bi) I of the same natural structure of an (A, A )-birnodule.
dimension m X n, which are, respectively, the payoff L.A. Skornyakov
matrices (or gain matrices) of the players I and II. The Editorial comments. A bimodule morphism is a mapping
strategy of player I is the selection of a row, that of from a bimodule RBS into a bimodule ReS that is left R-linear
player II the selection of a column. If player I chooses i and right 5-linear. The category of (R, S)-bimodules with
(l';:;;i';:;;m), while player II chooses j (l';:;;j.;:;;n), their bimodule morphisms is a Grothendieck category.
respective payoffs (gains) will be ail and bi); if The centre of an (R, R)-bimodule (also called an R-
aij + bi) = 0 for all i, j, the bimatrix game becomes a bimodule) RBR is defined to be the set
matrix game. The theory of bimatrix games is the sim- ZR(B) = {xEB: rb=brfor all rER}. Clearly ZR(B) is a two-
plest branch of the general theory of non-cooperative sided ZR(R)-module.
games, but even bimatrix games are not always solvable AMS 1980 Subject Classification: 16A64
according to Nash or strongly solvable. Various algo-
rithms are available to find equilibrium solutions in BIMORPIDSM, bijective morphism in a category -
bimatrix games: a method of describing the subma- One of the categoric-theoretical generalizations of the
trices of A, B yielding all extreme points of the set of concept of a bijective mapping between sets. A mor-
equilibrium solutions [1], [2]; and methods which phism u in a category C is said to be a birnorphism if it
reduce the problem of finding the equilibrium solutions is both a monomorphism and an epimorphism in C. A
of a birnatrix game to a problem of quadratic program- product of birnorphisms is a birnorphism, i.e. the
ming [3], [4], [5]. birnorphisms form a subcategory containing all isomor-
phisms. In the category of sets and the category of
References
[I] VOROB'EV, N.N.: 'Equilibrium points in bimatrix games', groups every birnorphism is an isomorphism. However,
Theory Probab. Appl3, no. 3 (1958), 297-309. (Teor. Veroyat the categories of rings, topological spaces, or Abelian
nost. i Primenen. 3, no. 3 (1958),318-331) groups without torsion contain birnorphisms that are
[2] KUHN, H.W.: 'An algorithm for equilibrium points in bimatrix
games', Proc. Nat. A cad. Sci. USA 47 (1961), 1657-1662.
not isomorphisms.
1. V. Dolgachev
[3] MIllS, H.: 'Equilibrium points in infinite games', J. Soc. Ind.
M. S. Tsalenko
Appl. Math. 8, no. 2 (1960), 397-402.
[4] MANGASARIAN, O.L.: 'Equilibrium points of bimatrix games', AMS 1980 Subject Classification: 18A20
J. Soc. Ind Appl. Math. 12, no. 4 (1964), 778-780.
[5] LEMKE, C.E. and HOWSON, JR., J.T.: 'Equilibrium points of
bimatrix games', J. Soc. Ind Appl. Math. 12, no. 2 (1964), BINARY COMPUTING SYSTEM - A positional sys-
413-423. tem for the representation of numbers in base 2 (cf.
E.B. Yanovskaya
Numbers, representations of). In such a system the
AMS 1980 Subject Classification: 90005
arithmetic operations can be very simply executed. For
instance, the usual addition and multiplication with
BIMODAL DISTRIBUTION, doubly-peaked distribu- single-valued numbers are reduced in the system to:
tion - A probability distribution which is characterized I + 1 = 10; 11 = 1.
by the fact that the probability curve has two local V. 1. N echaev
maxima, corresponding to two values of the modes (cf. Editorial comments. The term binary arithmetic is also
Mode). Such a distribution is often the result of 'mix- used.
ing' two normal distributions (cf. Normal distribution). AMS 1980 Subject Classification: 68M05, 10AXX

393
BINARY FORM

BINARY FORM - A form in two variables, viz. a BINARY LIE ALGEBRA, BL-algebra - A linear alge-
homogeneous polynomial bra A over a field F any two elements of which gen-
erate a Lie sub algebra. The class of all binary Lie alge-
bras over a given field F generates a variety which, if
the characteristic of F is different from 2, is given by
where the coefficients at. k =0, ... ,n, belong to a
the system of identities
given commutative ring with a unit element. Such a ring
may be the ring Z of integers, the ring of integers of x 2 = J(xy, x,y) = 0, (*)
some algebraic number field, the field R of real where
J(x,y, z) = (xy)z +(yz)x + (zx)y.
numbers or the field C of complex numbers. The
number n is called the degree of the form. If n = 2, f is If the characteristic of F is 2 and its cardinal number is
called a binary quadratic fonn. not less than 4, the class of binary Lie algebras can be
The theory of forms includes algebraic. (theory of defined not only by the system of identities (*), but
invariants), arithmetic (representation of numbers by also by the identity
forms) and geometric (theory of arithmetical minima of J([(~)y]x, x,y) = o.
forms) approaches. The purpose of the algebraic theory
The tangent algebra of an analytic local alternative loop
of binary forms (in R or q is to construct a complete
is a binary Lie algebra and vice versa.
system of invariants of such forms under linear
transformations of variables with coefficients of the References
[I] MAL'TSEV, A.I.: 'Analytic loops', Mat. Sb. 36 (78), no. 3
same field (cf. Invariants, theory of; see also [2], Chapt. (1955), 569-575 (in Russian).
5). The arithmetic theory of binary forms studies [2] GAiNOV, A.T.: 'Binary Lie algebras of characteristic two', Alge-
Diophantine equations of the form bra and Logic 8, no. 5 (1969), 287-297. (Algebra j Logika 8,
no. 5 (1969), 505-522) T ~
j(x,y) = b, A . . Gamov

where aQ, ... ,an, b EZ, their solvability and their solu- AMS 1980 Subject Classification: 17B99
tions in the ring Z. The most important result is Thue's
BINARY P-ADIC GROUP - An infinite group G of
theorem and its generalizations and sharpenings (d.
square matrices of order 2
Thue-Siegel-Roth theorem). See [5], Chapts. 9 - 17,
and the MordeD conjecture on the solvability of such
equations in the field Q and the possible number of
solutions. The theory of arithmetical minima of binary where a, b, c, d are integral p-adic numbers (cf. p-adic
forms is part of the geometry of numbers. The arithmet- number) satisfying the following conditions:
ical minimum of a form f is defined as the quantity
ad-bc = 1, c _ 0 (modp), d _ 1 (modp).
m(j) = (x,y)eiW\(O,O) I j(x, y) I, The quotient groups of such groups of the form G / N,
It has been proved for the case n = 3 that where N is the n-th member of the lower central series
of G or the n-th term of the derived series (the series of
{ II DD /2311/4
/49 11/ if D >0,
4

m(j)";; if D<O, higher commutators of G), are examples of finite p-


groups having certain extremal properties.
where D is the discriminant of f, which, in the present A.I. Kostrikin
case, IS
Editorial comments.
18aoa1a2a3 +aTa~ -4ooa~ -403a1 -27a6aj. References
These estimates cannot be improved. [A1] HUPPERT, B.: Endliche Gruppen, 1, Springer, 1979.

AMS 1980 Subject Classification: 20C99, 20E99


References
[1] BOREVICH, Z.I, and SHAFAREVICH, I.R.: Number theory, Acad.
Press, 1966 (translated from the Russian). German transla- BINARY QUADRATIC FORM - A quadratic form in
tion: Birkhauser, 1966. two variables, i.e. having the form
[2] GUREVICH, G.B.: FoundatioTL'J of algebraic invariant theory,
Moscow-Leningrad, 1948 (in Russian). j = j(x,y) = ax 2+bxy+ry2. (*)
[3] LANDAU, E. and WALFISZ, A.: Diophantische Gleichungen mit
endlich vielen LOsungen, Deutsch. Verlag Wissenschaft., 1959. If a, b and c are integers, the binary quadratic form is
[4] LEKKERKERKER, e.G.: Geometry of numbers, Wolters- said to be integral. The expression d = ac - b 2 /4 is
Noordhoff, 1969. called the discriminant or determinant of the binary qua-
[5] MORDELL, L.J.: Diophantine equatioTL'J, Academic Press, 1969.
dratic form. The expression b 2 -4ac is also sometimes
A. V Malyshev referred to as the discriminant. The arithmetic theory of
AMS 1980 Subject Classification: 10CXX binary quadratic forms originated with P. Fermat, who

394
BINARY RELATION

proved that any prime number of the form 4k + 1 can been established between the integral ideals [aJ, a2l of
be represented as the sum of two squares of integers. the field F and the integral quadratic forms
The theory of quadratic forms was completed by J.L. N(alx+a7)')
Lagrange and by C.F. Gauss. This theory is a special f(x,y) = N[a], a2l
case of the theory of quadratic forms in n variables; the
with - A/ 4 as determinant. This results in a one-to-
arithmetic theory of binary quadratic forms is
one correspondence (up to a transition to conjugate
equivalent to the theory of ideals in quadratic fields,
classes of ideals) between the ideal classes of the field F
and is one of the origins of algebraic number theory
and the classes of binary quadratic forms. In this
(cf. Quadratic form; Quadratic field).
correspondence, multiplication of ideal classes defines a
The number of genera of binary quadratic forms
composition of classes of binary quadratic forms.
with discriminant d equals 2S -J, where s is the number
As in the case of forms in n variables, the theory of
of different prime divisors of d, except for d .1
binary quadratic forms can be generalized to include
(mod4), d-O (mod8), when s is increased by one; if
forms (*) with coefficients a, b and c in a given alge-
- d is a square, the number of different binary qua-
braic number field.
dratic forms is doubled. The number r(d, m) of essen-
There are various variants in the definitions of an
tially different primitive representations o~ a number. m
integral form, the discriminant of the form, equivalence
by a complete system of binary quadratic fo~s WIth
of forms, and the classes and the genera of forms. The
discriminant d is equal to the number of solutlOns of
definition of integral forms given above is due to L.
the equation X2 -d (modm). Kronecker. Gauss [1 1 stipulated that b be even. In
As in the general case, there exists an algorithm which determining the equivalence (and the class of forms),
reduces the problem of solving a given second-degree only the substitutions with discriminant + 1 may be
Diophantine equation in two unknowns (in particu~ar, considered; in other cases discriminants + 1 are con-
an equation f(x,y)=m) to the problem on the anth- sidered. The definition of a genus given in [6] is wider
metic equivalence of two binary quadratic forms. than that given by Gauss.
All integral automorphisms of a primitive form f References
with a=FO can be represented in the form [1] GAUSS, C.F.: Disquisitiones arithmeticae, Yale Univ. Press,
1966 (translated from the Latin).

Ilt-~:/2 t+~:/211'
[2] VENKOV, B.A.: Elementary number theory, Wolters-Noordhoff,
1970 (translated from the Russian).
[3] JONES, B.W.: The arithmetic theory of quadratic forms, Math.
where t 2 +du 2 = 1, and 2t and u are integers (cf. PeU Assoc. Amer., 1950.
equation). Therefore, the problem on the equivalence of [4] GEL'FOND, A.O. and LINNIK, Yu.V.: Elementary methods in the
analytic theory of numbers, M.I.T., 1966 (translated from the
two forms is solved by the reduction theory of binary Russian).
quadratic forms. The reduction theory of positive- [5] LANDAU, E.: Vorlesungen iiber Zahlentheorie, 3, Hirzel, 1927.
definite binary quadratic forms is a special case of the [6] BOREVICH, Z.1. and SHAFAREVICH, 1.R.: Number theory, Acad.
Press, 1966 (translated from the Russian). German transla-
reduction theory of positive-definite quadratic forms tion: Birkhauser, 1966.
according to H. Minkowski. The reduction theory of [7] O'MEARA, O.T.: Introduction to quadratic forms, Springer, 1973.
integral indefinite binary quadratic forms can be A. V. Malyshev
reduced to the reduction theory of quadratic irrationali- AMS 1980 Subject Classification: 10CXX
ties ([2], [3]).
An important role in the theory of numbers is played BINARY RELATION - A two-place predicate on a
by the arithmetic function h (d) - the number of
given set. The term is sometimes used to denote a sub-
classes of primitive integral binary quadratic forms with set of the set A XA of ordered pairs (a, b) of elements
determinant d. It is known that h (d)< + 00. Some idea of a given set A. A binary relation is a special case of a
of the rate of increase of the function h (d) can be relation. Let RCA XA. If (a, b)ER, then one says that
obtained from Siegel's theorem: Let d>O, then for any
>0 there exist constants c. c:
and >0 which satisfy
the element a is in binary relation R to the element b.
An alternative notation for (a, b )ER is aRb.
the condition
The empty subset 0 in A XA and the set A XA itself
c;d 1 / 2 -, < h(d) < c,d 1 /2+,
are called, respectively, the nil relation and the universal
(a similar formula is also valid for d<O). relation in the set A. The diagonal of the set A XA, i.e.
Let I::. be an integer, 1::._1 or 0 (mod 4), sUEPose that the set 1::.= {(a, a): a EA}, is the equality relation or the
if s211::. then s=l or s=2, and let F=Q(v'I::.) be the identity binary relation in A.
quadratic field which is obtained by adjoining Vi to Let R, R 1, R 2 be binary relations in a set A. In
the field of rational numbers. A correspondence has addition to the set-theoretic operations of union

395
BINARY RELATION

Rl UR2' intersection Rl nR2, and complementation (2)


R' = (A XA) \ R, one has the inversion
R- 1 = {(a, b): (b, a)ER}, Binomial coefficients, as well as the arithmetical trian-
gle, were known concepts to the mathematicians of
as well as the operation of multiplication: antiquity, in more or less developed forms. B. Pascal
R 1oR 2 = {(a, b): (3CEA)(aR1c and cR 2 b)}. (1665) conducted a detailed st\ldy of binomial coeffi-
The binary relation R - 1 is said to be the inverse of R. cients. The binomial coefficients are also connected by
Multiplication of binary relations is associative, but as many useful relationships other than (2), for example:
a rule not commutative.
A binary relation R in A is said to be I) reflexive if
ACR; 2) transitive if RoR CR; 3) symmetric if [~J k~JkJ [~=kJ, n";;;m";;;N-n;
R- l CR; and 4) anti-symmetric if R nR~1 CA. If a
binary relation has some of the properties 1), 2), 3) or f(-I)kk m
k=O
[ZJ = 0, m=O, ... ,N-I;
4), the inverse relation R -1 has these properties as
(3)
well. The binary relation RCA XA is said to be func-
tional if R -loR CA.
k~Jkr = [2nnJ;

The most important types of binary relations are


equivalences, order relations (total and partial), and
k~JZJk(k-I)'" (k-r+l) =

functional relations (d. Equivalence; Order relation; = N(N-I) (N-r+lpN-r;


Functional relation).
D.M. Smirnov k~O(-li [ZJk(k -I)' .. (k-r+ I) = O.
AMS 1980 Subject Classification: 04A05
In particular, (3) yields
BINARY UNIT in information theory - A synonym
for bit.
AMS 1980 Subject Classification: 94A17, 68MOS (4)

BINOMIAL - A sum or a difference of two algebraic


expressions, said to be the terms of the binomial, e.g.
a +b, 5x _y2 / (1 + x 2 ), etc. For powers of a binomial, The use of the Stirling formula yields approximate
i.e. expressions of the type (x +y t, see Newton bino- expressions for binomial coefficients. Thus, if N is
mial. much larger than n:

AMS 1980 Subject Classification: 12-XX Nn


[~J ~-"
n.
BINOMIAL COEFFICIENTS - The coefficients at the In the case of a complex number a, binomial coeffi-
powers of z in the decomposition of the Newton bino- cients are generalized according to the formula
t.
mial (1 + z The binomial coefficients are denoted by

[~J
[aJn = a(a-l) (a-n+l)
n." n>,
o [aJ
0
= I.
or by CRr, and are given by In this generalization, some of the relations (2) - (4) are
en - N!
preserved, but usually in a modified form. Thus,
[NJ
n -
-
N - n!(N-n)!
(I)

N(N-I)'" (N-n+l)
, ' O";;;n";;;N. [~J+ [n~d = [~!lJ;
n.
The first-mentioned notation is due to L. Euler; the ~ [k J = 2u, Re a> - I;
k=O
notation CRr appeared in the nineteenth century, and is
probably connected with the interpretation of the bino- k~(_I)k [k J = 0, Rea>O.
mial coefficients C'J., as the number of distinguishable
non-ordered combinations (d. Combination) from N For tables of binomial coefficients see [2], [3].
different objects with n objects in each combination. References
Binomial coefficients are most conveniently written as [I) KORN, G.A. and KORN, T.M.: Mathematical handbook for

numbers in the arithmetical triangle, or Pascal triangle, scientists and engineers, McGraw-Hill, 1968.
[2) BOL'sHEv, L.N. and SMIRNOV, N.V.: Tables of mathematical
the construction of which is based on the following statistics, Libr. of mathematical tables, 46, Nauka, Moscow,
property of binomial coefficients: 1968 (in Russian). Processed by L.S. Bark and E.S. Kedova.

396
BINOMIAL SERIES

[3) Table of binomial coefficients, Cambridge, 1954. Moivre- Laplace theorem):


E.D. Solomentsev
F(y) = ell [y-n p +O.5]+R (y )
AMS 1980 Subject Classification: 05A 10 Vnp(l-p) n ,p,
where
BINOMIAL DISTRIBUTION, Bernoulli distribution - Rn(y,p) = O(n-l/2)
The probability distribution of a random variable X
uniformly for all real y. There also exist other, higher
which assumes integral values x =0, ... ,n with the
order, normal approximations of the binomial distribu-
probabilities
tion.
P{X=x} = bAn,p) = [~)px(l_p)n-x, If the number of independent trials n is large, while
the probability p is small, the individual probabilities
where G) is the binomial coefficient, and p is a parame-
bAn, p) can be approximately expressed in terms of the
ter of the binomial distribution, called the probability
Poisson distribution:
of a positive outcome, which can take values in the
interval O~ ~ 1. The binomial distribution is one of
the fundamental probability distributions connected
bAn,p) = [~)px(l-pr-x ~ (;r e- np .
with a sequence of independent trials. Let Y\, Y 2 , .. , If n-HXJ and O<C''';;y~C (where c and C are con-
be a sequence of independent random variables, each stants), the asymptotic formula
one of which may assume only one of the values I and
F(y)= ~AX,e-A+O(n-2),
o with respective probabilities p and 1 - P (i.e. all Yj are x=ox.
binomially distributed with n = I). The values of Y j where A= (2n - [y])p / (2 - p), is uniformly valid with
may be treated as the results of independent trials, with respect to all p in the interval O<p < 1.
Y j = 1 if the result of the i-th trial is 'positive' and The multinomial distribution is the multi-dimensional
Y j =0 if it is 'negative'. If the total number of indepen- generalization of the binomial distribution.
dent trials n is fixed, such a scheme is known as Ber-
noulli trials, and the total number of positive results, References
[I) GNEDENKO, B.V.: The theory ofprobability, Chelsea, reprint,
x = Y + . . . + Yn ,
I n;;;.l, 1962 (translated from the Russian).
[2) FELLER, W.: An introduction to probability theory and its appli-
is then binomially distributed with parameter p. cations, 1-2, Wiley, 1957-1971.
The mathematical expectation Ezx (the generating [3] PROHOROV, Yu.V. [Yu.V. PROKHOROV) and ROZANov, YU.A.:
Probability theory, Springer, 1969 (translated from the Rus-
function of the binomial distribution) for any value of z
sian).
is the polynomial fpz + (1-p)r, the representation of [4) PROKHORov, Yu.V.: 'Asymptotic behaviour of the binomial
which by Newton's binomial series has the form distribution', Selected Translations in Math. Stat. and Probab. 1
(1961), Amer. Math. Soc. (Uspekhi Mat. Nauk 8, no. 3 (1953),
b o+b1z + ... +bnz n. 135-142)
[5) BOL'SHEV, L.N. and SMIRNOV, N.V.: Tables of mathematical
(Hence the very name 'binomial distribution'.) The
statistics, Libr. of mathematical tables, 46, Nauka, Moscow,
moments (cf. Moment) of a binomial distribution are 1965 (in Russian). Processed by L.S. Bark and E.S. Kedova.
given by the formulas L.N. Bol'shev
AMS 1980 Subject Classification: 62EXX
= np,EX
OX = E(X -np)2 = np(l-p), BINOMIAL SERIES - A power series of the form
E(X -np)3 = np(l-pXI-2p).
n~o [~)zn = 1+ [l)z+ [i)Z2+ ... ,
The binomial distribution function is defined, for any
real y, O<y <n, by the formula where n is an integer and 0: is an arbitrary fixed
number (in general, a complex number), z = x + iy is a
F(y) = P{X";;;u} = x~J~)px(l_p)n-x, complex variable, and the
where [y 1is the integer part of y, and

F(y) = 1
I
jtlY](I-t)n-lY]-1 dt,
B([y]+I,n-[y]) p
are the binomial coefficients. For an integer 0: = m ;;;;;. 0
the binomial series reduces to a finite sum of m + 1
B(a, b) is Euler's beta-function, and the integral on the terms
right-hand side is known as the incomplete betajunction. ( l+z)m = l+mz+ m(m -1)z2+ ... +zm
As n-HX), the binomial distribution function is 2! '
expressed in terms of the standard normal distribution which is known as the Newton binomial. For other
function ell by the asymptotic formula (the de values of 0: the binomial series converges absolutely for

397
BINOMIAL SERIES

I z I < I and diverges for I z I > 1. At points of the unit and {~,}, t E T, of elements of a (topological) vector
circle I z I = I the binomial series behaves as follows: space X and the dual (topological) space X*, respec-
I) if Rea>O, it converges absolutely at all points; 2) if tively, which satisfies the conditions
Reao;;;;; -I, it diverges at all points; and 3) if
~(as) = <~(' as> = 0
-I <Reao;;;;;O, the binomial series diverges at the point
z = - I and converges conditionally at all other points. if t=l=s, and =1=0 if t = s (here, <.,. > is the canonical
At all points of convergence, the binomial series bilinear form coupling X and X*). For instance, a
represents the principal value of the function (I + z t biorthogonal system consists of a Schauder basis and
the set formed by the expansion coefficients of x in it.
which is equal to one at z = O. The binomial series is a
In a Hilbert space H with scalar product <','> and
special case of a hypergeometric series.
If z =x and a are real numbers, and a is not a non-
basis {at} the set {bs } satisfying the condition
negative integer, the binomial series behaves as follows: <a(,bs>=lisl>
I) if a >0, it converges absolutely on - I";;;x 0;;;;; 1; 2) if where ~st = I if t =s and ~st =0 if t=l=s, is also a basis; it
a 0;;;;; -I, it converges absolutely in -I <x< 1 and is said to be the basis dual to {at} and, since H = H* ,
diverges at all other values of x; and 3) if -I <a 0;;;;;0, the sets {at} and {bt } form a biorthogonal system. In
the binomial series converges absolutely in - I <x < I, particular, a basis in H is said to be orthonormal if its
converges conditionally at x = I, and diverges for dual to itself.
x = - 1; for I x I > I the binomial series always However, there also exist biorthogonal systems which
diverges. do not even form a weak basis; an example is the set of
Binomial series were probably first mentioned by I. functions eikX, k EZ, x ER, in the space of continuous
Newton in 1664 - 1665. An exhaustive study of bino- periodic functions with the norm II f II = sup If (x) I
mial series was conducted by N.H. Abel [I], and was M.I. Voftsekhovskif
the starting point of the theory of complex power
AMS 1980 Subject Classification: 46A20
series.
References BIPLANAR SPACE - A real (2n + I)-dimensional
[I] ABEL,N.H.: 'Untersuchungen tiber die Reihe
projective space, with two non-intersecting n-
I+m /2x+m(m-I)/2'lx 2 + ... ',J. Reine Angew. Math. 1
(1826), 311-339. dimensional subspaces which are real (biplanar spaces
[2] KNopp, K.: Theorie und Anwendung der unendlichen Reihen, of hyperbolic type) or complex conjugate (biplanar
Springer, 1964. English translation: Blackie, 1951. spaces of elliptic type), the fundamental group of which
[3] MARKUSHEVICH, A.I.: Theory offunctions of a complex variable,
I, Chelsea, reprint, 1977 (translated from the Russian). consists of the projective transformations that map each
E.D. Solomentsev of these subspaces into itself. These two n-dimensional
subspaces are called the absolute planes. The linear
AMS 1980 Subject Classification: 40A05, 30810
congruence of real straight lines intersecting both abso-
BINORMAL - The straight line passing through a lute planes is said to be an absolute congruence. This
point M 0 of a curve L perpendicular to the osculating congruence serves as a real model of an n-dimensional
plane to L at Mo. If r=r(t) is a parametrization of L, projective space over the algebra of double or complex
then the vector equation of the binormal at M 0 numbers. If n = I, a biplanar space is said to be a bi-
corresponding to the value to of the parameter t has the axial space. The study of properties of geometrical fig-
form ures in biplanar spaces which are preserved under the
operation of the fundamental group is the subject of
E. V. Shikin hiplanar geometry. The bi-axial geometry in which one
Editorial comments. This definition holds for space studies the theory of curves, surfaces and complexes of
curves for which r"Uo) does not depend linearly on r'Uo), straight lines, has been investigated in detail.
i.e. the curvature should not vanish. A.P. Shirokov
For curves in a higher-dimensional Euclidean space, the
binormal is generated by the second normal vector in the AMS 1980 Subject Classification: 53A35, 51 N25
Frenet frame (cf. Frenet trihedron), which is perpendicular
to the plane spanned by r'Uo) and r"Uo) and depends BIPOLAR COORDINATES - The numbers l' and a
linearly on r'Uo), r"Uo), r"'Uo) (cf. [A1]). which are connected with the Cartesian orthogonal
References coordinates x and y by the formulas
[A1] SPIVAK, M.: Differentia/geometry, 2, Publish or Perish, 1970. asinh'T a sin a
AMS 1980 Subject Classification: 53A04
x = cosh1'-cosa' Y = cosh'T-cosa'
where O';;;;a<7T, -00<1'<00. The coordinate lines are
BIORTIIOGONAL SYSTEM - A pair of sets {at} two families of circles (1'=const) with poles A and B

398
BlRATIONAL GEOMETRY

and the circles orthogonal with these (a=const). BIQUADRATIC EQUATION - An equation of the
(Joe=: const type
a.x 4 +bx 2 +c = 0,
where a, h, c are given complex numbers and a*O. The
substitution y =x 2 converts a biquadratic equation into
a quadratic equation.
AMS 1980 Subject Classification: OOA99, 12E12

BIRATIONAL GEOMETRY - The branch of alge-


braic geometry in which the main problem is the clas-
The Lame coefficients are: sification of algebraic varieties up to birational
equivalence (cf. Birational mapping). Over a fixed field
of constants k every class of birationally equivalent
LT = La = (COshT-cosai'
varieties defines a finitely generated field over k, which
The Laplace operator is: is isomorphic to the field of rational functions on any
variety in this class. Conversely, to each such field
!Jot = -(COshT-cosai
1 32 r 32 r J.
[ ~+~
a2 3~ 3~ corresponds a class of birationally equivalent varieties
- the model(s) of this field. Thus, the birational clas-
Bipolar coordinates in space (bispherical coordinates)
sification of algebraic varieties is equivalent to the clas-
are the numbers a, T and </>, which are connected with
sification (up to a k-isomorphism) of the finitely gen-
the orthogonal Cartesian coordinates x, y and z by the
erated fields regular over k.
formulas:
x = a sin a cos </> The most general birational invariant is the dimension
cosh T - cos a ' of an algebraic variety. For one-dimensional algebraic
a sin a sin</> varieties - irreducible algebraic curve~ - each class of
Y = COshT-cosa'
birational equivalence contains a non-singular model -
a SinhT
Z = -----'---
cosh cos a'
T-
a smooth projective curve, which is unique up to k-
isomorphisms. Accordingly, the birational classifica-
where - 00 <a< 00, O:S;;;T<'lT, O:S;;;</><2'lT. The coordi- tion of algebraic curves is reduced to the classification,
nate surfaces are spheres (a=const), the surfaces up to k-isomorphisms, of smooth projective curves,
obtained by rotation of arcs of circles (T=const), and which leads to the moduli problem. For dimensions ;;;;.2
half-planes passing through the Oz-axis. The system of the problem becomes much more complicated. The very
bipolar coordinates in space is formed by rotating the existence of a smooth model constitutes a problem of
system of bipolar coordinates on the plane Oxy around resolution of singularities of algebraic varieties, which at
the Oz-axis. the time of writing (1986) has been positively solved
The Lame coefficients are: only for surfaces and for varieties of arbitrary dimen-
sion over a field of characteristic zero. In such a case, if
La = LT = (COshT-cosa)2' such models exist at all, there are infinitely many of
L -
a 2 sin2 a
--=--==:........:....---:- them in the class of birationally equivalent varieties.
</> - (COShT-cosoi The minimal models (cf. Minimal model) take a special
The Laplace operator is: place among such models. Their birational classification

!Joj = (cosbr-cosha)3
a 2 sino
[..!.3T [ sin a
cosh T- cos a 3T
l1.J +
is often identical with classification up to a k-
isomorphism, as in the case of curves. However, this is

n]
not true in the general case, even for (both rational and
+..!.3a [ sino
cosh T- cos a 3a
l1J + 1
sin a( cosh T- cos a) 3cfl .
ruled) surfaces.
The principal results in the classification of algebraic
surfaces were obtained by geometers of the Italian
References
[I] MADELUNG, E.: Die mathematischen Hilfsmittel des Physikers,
school [1]. Only isolated results are so far (1986) avail-
Springer, 1957. able [3], [7], [8] for varieties of dimensions of ;;;;'3.
D.D. Sokolov The principal discrete birational invariants of smooth
Editorial comments. complete algebraic varieties over a field k of charac-
References teristic zero include the arithmetic genus, the geometric
[A 1] VEBLEN, O. and WHITEHEAD, 1.H.c.: The foundations of dif- genus, the multiple genus, the dimension of the space
ferential geometry, Cambridge Univ. Press, 1932.
of regular differential forms, the Severi torsion, the fun-
AMS 1980 Subject Classification: 51 N 10 damental group and the Brauer group.

399
BlRATIONAL GEOMETRY

One of the most important problems in birational correspondence between the sets a: I ~J and an iso-
geometry is the problem of rationality of algebraic morphism of local rings (J) X, Xi ~ (J) Y'Y"i) for each i EI.
varieties, i.e. the problem of the description of rational If the schemes X and Y are irreducible and reduced,
varieties - varieties which are birationally equivalent the local rings of their generic points become identical
to a projective space, cf. Rational variety. with the fields of rational functions on X and Y,
If the field of constants is algebraically non-closed, respectively. In such a case the birational mapping
the problems in birational geometry are closely f : X ~ Y induces, in accordance with condition 2), an
connected with the arithmetic of algebraic varieties (cf. isomorphism of the fields of rational functions:
Algebraic varieties, arithmetic of). The important prob- R(Y}~:~R(X).
lem in such a case is that of describing the birational Two schemes X and Y are said to be birationally
k-forms on a given variety V over a field k, in particu- equivalent or birationally isomorphic if a birational map-
lar if, say, V=PZ is a projective space over k [2]. An ping f : X ~ Y exists. A birational morphism is a special
important part of this problem is the description of the case of a birational mapping.
group of birational transformations of the variety V. The simplest birational mapping is a monoidal
References transformation with a non-singular centre. For smooth
[1) Algebraic surfaces, Moscow, 1965 (in Russian). complete varieties of dimension .,;;;2 any birational
(2) MANIN, YU.N.: Cubic forms. Algebra, geometry, arithmetic, mapping may be represented as the composite of such
North-Holland, 1974 (translated from the Russian).
(3) ROTH, L.: Algebraic threefolds, Springer, 1955. transformations and their inverses. At the time of writ-
(4) SHAFAREVICH, LR.: Basic algebraiC geometry, Springer, 1977 ing (1986) this question remains open in the general
(translated from the Russian). case.
(5) BALDASSARRI, M.: Algebraic varieties, Springer, 1956.
(6) DoLGACHEV, LV. and ISKovsKIKH, V.A.: 'The geometry of References
algebraic varieties', J. Soviet Math. 5, no. 6 (1976), 803-864. [1) SHAFAREVICH, I.R.: Basic algebraiC geometry, Springer, 1977
(lto~ Nauk. i Tekhn. Algebra Topol. Geom. 12 (1974), (translated from the Russian).
77-170) (2) HARTSHORNE, R.: Algebraic geometry, Springer, 1977.
(7) J. Soviet Math. 13, no. 6 (1980). (Itogi Nauk. i Tekhn. Sovrem. I. V. Dolgachev
Prob!. Mat. 12 (1979
V.A. Iskovskikh
(8) IflAKE, S.: AlgebraiC geometry, Springer, 1982.
(9) HARTSHORNE, R.: Algebraic geometry, Springer, 1977. AMS 1980 Subject Classification: 14E05
I. V. Dolgachev
V.A. Iskovskikh BIRATIONAL MORPIDSM - A morphism of
Editorial comments. A field extension K / k is regular if schemes that is a birational mapping. The most impor-
k is algebraically closed in K and K is linearly disjoint from tant examples of birational morphisms include: a nor-
the algebraic closure k of k over k. If k is algebraically malization, a blowing up and a monoidal transforma-
closed, then every extension of k is regular and there is an tion. Any proper birational transformation between
arrow-reversing equivalence of categories between the
regular two-dimensional schemes can be decomposed
category of varieties and dominant rational mappings over k
into monoidal transformations with non-singular cen-
and the category of finitely generated field extensions, cf.
[8], Sect. 104. For dimension 3 many new results have
tres [2] (cf. Monoidal transformation). This is not true
recently become available in the (birational) classification of in dimensions higher than two.
algebraic varieties, cf. [A 1]. References
[1) GROTHENDIECK, A. and DIEUDONNE, J.: 'Elements de
References geometrie algebrique', Publ. Math. IHES 8 (1960).
[A 1] CONTE, A. (ED.): Algebraic threefolds, Leet. Notes in Math,
(2) SHAFAREVICH, I.R.: Lectures on minimal models and birational
Vol. 947, Springer, 1982.
transformations of two-dimensional schemes, Tata Inst. Fundam.
AMS 1980 Subject Classification: 14E05 Res., 1966.
(3) SHAFAREVICH, I.R.: Basic algebraic geometry, Springer, 1977
BIRATIONAL MAPPING, birational isomorphism - A (translated from the Russian).
(4) HARTSHORNE, R.: Algebraic geometry, Springer, 1977.
rational mapping between algebraic varieties inducing
I. V. Dolgachev
an isomorphism of their fields of rational functions. In
V.A. Iskovskikh
a more general setting, a rational mapping of schemes
f : X ~ Y is said to be a birational mapping if it satisfies AMS 1980 Subject Classification: 14E05
one of the following equivalent conditions: 1) there
exist dense open sets U C X and V C Y such that f is BIRATIONAL TRANSFORMATION - A birational
defined on U and realizes an isomorphism of sub- mapping of an algebraic variety (or scheme) into itself.
schemes flu: U~V; 2) if {Xd;d, (Yj}jd are the sets Also called sometimes a birational automorphism. The
of generic points of the irreducible components of the group of all birational transformations of an algebraic
schemes X and Y respectively, f induces a bijective variety is canonically isomorphic to the group of auto-

400
BIRKHOFF - WITT THEOREM

morphisms of its field of rational functions over the References


field of constants. Examples of birational transforma- [I) BIRKHOFF, G.D.: 'Proof of the ergodic theorem', Proc. Nat.
Acad. Sci. USA 17 (1931),656-660.
tions include the Cremona transformations (d.
(2) KATOK, A.B., SINAi, YA.G. and STEPIN, A.M.: 'Theory of
Cremona transformation), in particular the standard dynamical systems and general transformation groups with
quadratic transformation of a projective plane, given by invariant measure', J. Soviet Math. 7, no. 6 (1977), 974-1065.
the formula (Itogi Nauk. i Tekhn. Mat. Analiz 13 (1975), 129-262)
D. V Anosov
(xo, x" X2) ~ (X]X2, XOX2, XOx]),
References
where (x 0, X I , X 2) are homogeneous coordinates in the [AI) KRENGEL, U.: Ergodic theorems, W. de Gruyter, 1985.
projective plane. [A2) PETERSON, K.: Ergodic theory, Cambridge Univ. Press, 1983.

References AMS 1980 Subject Classification: 28DXX


(1) ISKOVSKIKH, V.A.: 'Birational automorphisms of three-
dimensional algebraic varieties', J. Soviet Math. 13, no. 6
BIRKHOFF - WITf THEOREM,
(1960),815-868. (ltogi Nauk. i Tekhn. Sovrem. Probl. Mat. 12
(1979), 159-239) Poincare- Birkhoff- Witt theorem - A theorem about
l. V Dolgachev the representability of Lie algebras in associative alge-
VA. Iskovskikh bras. Let G be a Lie algebra over a field k, let U(G) be
AMS 1980 Subject Classification: 14E05 its universal enveloping algebra, and let B = {b i : i El}
be a basis of the algebra G which is totally ordered in
BIRKHOFF ERGODIC THEOREM - One of the some way. All the possible finite products b", ... b".,
most important theorems in ergodic theory. For an where al ~ . . . ~a" then form a basis of the algebra
endomorphism T of a space X with a a-finite measure /L U(G), and it thus follows that the canonical homomor-
Birkhoffs ergodic theorem states that for any function phism G-"?U(G) is a monomorphism.
jELI(X, /L) the limit It is possible to construct a Lie algebra L(R) for any
1 n -] - associative algebra R by replacing the operation of
lim - "2.f(T k x) = f(x) multiplication in R with the commutator operation
n----'+oo n k =0

(the time average or the average along a trajectory) [xy] = xy-yx.


exists _almost everywhere (for almost all x EX). More- The Birkhoff - Witt theorem is sometimes formulated
over,jELI(X, /L), and if p.{X)<oo, then as follows: For any Lie algebra G over any field k there
exists an associative algebra R over this field such that
Ifd/L = [ldp. Gis isomorphically imbeddable in L(R).
For a measurable flow {T/} in the space X with a a- The first variant of this theorem was obtained by H.
finite measure /L Birkhoffs ergodic theorem states that Poincare [1]; the theorem was subsequently completely
for any functionjELI(X, /L) the limit demonstrated by E. Witt [2] and G.D. Birkhoff [3].
The theorem remains valid if k is a principal ideal
1T _
lim -Tjf(T,x)dt = f(x) domain [4], in particular for Lie rings without opera-
0
T---H()
tors, i.e. over Z, but in the general case of Lie algebras
exists almost everywhere, with the same properties of f over an arbitrary domain of operators the theorem is
Birkhoffs theorem was stated and proved by G.D. not valid [5].
Birkhoff [1]. It was then modified and generalized in References
various ways (there are theorems which contain, in [I) POINCARE, H.: 'Sur les groupes continus', Trans. Cambro Phi-
addition to Birkhoffs theorem, also a number of state- los. Soc. 18 (1900), 220-225.
(2) WITT, E.: 'Treue Darstellung Liescher Ringe', J. Reine Angew.
ments of a somewhat different kind which are known Math. 177 (1937), 152-160.
in probability theory as ergodic theorems (d. Ergodic (3) BIRKHOFF, G.D.: 'Representability of Lie algebras and Lie
theorem); there also exist ergodic theorems for more groups by matrices', Ann. of Math. (2) 38, no. 2 (1937), 526-
532.
general semi-groups of transformations [2]). Birkhoffs [4) LAZARD, M.: 'Sur les algebres enveloppantes universelles de
ergodic theorem and its generalizations are known as certain algebres de Lie', c.R. Acad. Sci. Paris Ser. I Math. 234
individual ergodic theorems, since they deal with the (1952), 788-791.
(5) SHIRSHOY, A.I.: 'On representations of Lie rings in associative
existence of averages along almost each individual tra-
rings', Uspekhi Mat. Nauk 8, no. 5 (1953), 173175 (in Rus-
jectory, as distinct from statistical ergodic theorems - sian).
the von Neumann ergodic theorem and its generaliza- (6) COHN, P.M.: Universal algebra, Reidel, 1981.
tions. (In non-Soviet literature the term 'pointwise (7) KUROSH, A.G.: Lectures on general algebra, Chelsea, 1963
(translated from the Russian).
ergodic theorem' is often used to stress the fact that the [8) SERRE, J.-P.: Lie algebras and Lie groups, Benjamin, 1965
averages are almost-everywhere convergent.) (translated from the French).

401
BIRKHOFF - WITI THEOREM

[9] CARTAN, H. and ElLENBERG, S.: Homological algebra, Prince- v =0 then one obtains the simplest branching process
ton Univ. Press, 1956.
T.S. Fofanova without immigration. If A=O and v>O, then this type of
process with immigration can be applied to describe the
AMS 1980 Subject Classification: 17815
functioning of a telephone exchange with an infinite
number of lines. Here, the state is the number of occu-
BIRTH-AND-DEATH PROCESS - A Markov process
pied lines. The reproduction coefficient An(t)=V charac-
with states 0, 1, ... , in which in a time interval
terizes the rate of incoming telephone calls, and J.L is the
(t, t +h) transitions from the state n into the states
expected duration of a conversation.
n + 1 and n -1 occur with probabilities An (t)h +o(h)
and J.'n(t)h +o(h), respectively, and where the probabil- References
[I] FELLER, W.: An introduction to probability theory and its appli-
ity of other transitions is o(h). For a special choice of cations, 1-2, Wiley, 1957-1971.
the reproduction coefficients An(t) and the death coeffi- [2] SEWASfYANOW, B.A. [BA SEVAST'YANOV]:
cients J.Ln(t) one can obtain particular cases which pro- Verzweigungsprozesse, Akad. Wissenschaft. DDR, 1974
vide a satisfactory description of various real processes: (translated from the Russian).
[3] SAATY, R.L.: Elements of queueing theory with applicatiOns,
radioactive transformations, the running of telephone
McGraw-Hill, 1961. v.p

ChISfl.) k
.Ta ov
exchanges, the evolution of biological populations, etc.
The fact that birth-and-death processes are widely used Editorial comments.
in applications is due to the simplicity of the equations References
for the transition probabilities, which often can be [A1] ATIlREYA, K.B. and NEY, P.E.: Branching processes,
obtained explicity. For instance, in the case of a Pois- Springer, 1972.

son process An(t)=A, J.'n(t)=O, the probabilities Pn(t) AMS 1980 Subject Classification: 60J80
(Pn(t) is the probability of being at time t in the state n
if the process has started from state 0) satisfy the sys- BISECTOR PLANE of a dihedral angle, bisector - The
tem: plane passing through the edge of the dihedral angle
P'o(t) = -APo(t) and bisecting this angle.
P~(t) = -APn(t)+APn-\(t), n ~ 1, AMS 1980 Subject Classification: 51 NXX
where
I if n =0,
Pn(O) = { BISECTRIX of an angle - The half-line (ray) issuing
o otherwise. from the apex of the angle and bisecting it. In other
words, the bisectrix of an angle is the set of points
The solution of this system is:
located inside the angle and equally distant from both
Pn(t) = ~e-N,
n.
n =0,1, .... of its sides. A bisectrix of a triangle is the segment (and
also its length) of the bisectrix of an internal angle of
A more general process is the one in which An(t)=nA, the triangle from the apex to the point of intersection
J.'n(t)=O. This type of process was first studied by G. with the opposite side. A bisectrix of a triangle divides
Yule (1924) in connection with the mathematical theory a side of the triangle into segments that are propor-
of evolution. A Yule process is a particular case of a tional to the adjacent sides. The bisectrices of a triangle
pure birth process which is obtained from the general intersect at one point, which is the centre of the
birth-and-death process by assuming An (t) = An, inscribed circle in the triangle. The quadruple of points
J.'n(t)=O. If An increases very rapidly, then with positive A, B, K, L, consisting of two apices A, B of the triangle
probability one can pass through all states in a finite ABC, the point K of intersection of the bisectrix of the
time and then angle C with AB, and the point L of intersection of the
n =0
bisectrix of the external angle C with AB forms a har-
monic quadruple of points.
The condition A.B. Ivanov

n =0
AMS 1980 Subject Classification: 51 NXX

for a pure birth process is satisfied if and only if the


BIT - A binary unit of information, numerically
series ~l IAn diverges. If An(t)=nA+v, Pn(t)=np., then
equal to the amount of information obtained during a
the birth-and-death process is a branching process with
trial with two mutually exclusive equally probable alter-
immigration in which the state n designates the number
natives (PI =P2 = I /2):
of particles. Each particle dies during time (t, t + h)
with probability J.Lh + o(h), splits into two particles with I(p\,P2) = P\lo~P\ -p2Io~p2 = lo~ 2 = 1 'bit',
probability Ah + o(h) and, moreover, one particle immi- the logarithms being taken to the base 2. The bit is the
grates from the outside with probability vh +o(h). If most frequently used unit, but other units of informa-

402
BIVECTOR

tion are also employed - the 'Hartley' or the 'nit', the v, calculated with respect to some basis e = (el, . . . , en)
definitions of which involve decimal and natural loga- of the underlying space of A, then the quantities
rithms respectively.
A. V. Prokhorov
Editorial comments. The definition as given comes from
information theory. In computer science the term 'bit' usu- are called the PlUcker coordinates of the pair u, v. Two
ally refers to the representation of '0' or '1' by a suitable pairs of vectors are in the same class if their Plucker
physical device which can be (exclusively) in two alternative coordinates with respect to some basis coincide (they
states, or even to that device itself. Nits or Hartley's are will then be equal in any basis). The coordinates of the
unknown in the West.
class are then called the coordinates of the bivector [u, v]
AMS 1980 Subject Classification: 94A17 , 68A05 with respect to the basis e. These coordinates are skew-
symmetric with respect to their indices; they contain (D
BrrsADZE EQUATION - The partial differential
independent coordinates. Under a transition to another
equation that can be written in complex form as fol- basis of A, the coordinates of a bivector behave as
lows: coordinates of a twice-contravariant tensor. A bivector
is also called a free bivector. In the presence of a scalar
where w(z)=u + iv, z =x +iy, and that can be reduced
product in A, a number of metrical concepts of vector
to the elliptic system
algebra can be extended to bivectors. The measure of a
U xx - u,y - 2vxy = 0, bivector is the area of the parallelogram formed by the
vxx-vy'y+2uxy = 0, vectors u, v, -u, -v, the origin of each one being
in the real independent variables x and y. The homo- located in the end of the precedirig one. This only
depends on the class, not on the representatives u,v.
geneous Dirichlet problem in a disc C: I z - z 0 I <t:,
where the radius t: is as small as one pleases, for the The scalar product of two bivectors is the number
Bitsadze equation has an infinite number of linearly equal to the product of the measures of the factors by
independent solutions [1]. The Dirichlet problem for the cosine of the angle between their two carrier planes.
This product is a bilinear form of the coordinates of
the inhomogeneous equation Wzz = f in the disc C is
normally solvable according to Hausdorff, since it is the factors, the coefficients of which are defined by the
neither a Fredholm problem nor Noetherian; in a metric tensor of the space A alone.
If the dimension of A is 3, the bivector [u, v] may be
bounded domain containing a segment of the straight
line y = 0, this problem is not even a Hausdorff prob- identified with a vector of A which, in the presence of a
lem, even though the homogeneous problem has only scalar product, is called the vector product of the vec-
tors u, v.
one zero solution [2].
In tensor calculus a bivector is an arbitrary contra-
References variant skew-symmetric tensor of valency 2 (i.e. a ten-
[I] BrrSADZE, A.V.: 'On the uniqueness of the solution of the Diri-
chlet problem for elliptic partial differential operators', Uspekhi sor of type (2,0. Each such tensor may be represented
Mat. Nauk 3, no. 6 (1948), 211-212 (in Russian). as a sum of tensors, to which correspond non-zero
[2] BITSADZE, A. V.: Boundary value problems for second-order ellip- bivectors in the above sense with different carrier
tic equations, North-Holland, 1968 (translated from the Rus-
sian). planes. They define the sheets of the bivector. The rank
[3] MIRANDA, C.: Partial differential equations of elliptic type, of the skew-symmetric matrix of dimension n X n con-
Springer, 1970 (translated from the Italian). sisting of the coordinates of a bivector is an even
[4] BERS, L., JOHN, F. and ScHECHTER, M.: Partial differential
number 2p, where p is the number of sheets of the
equations, Interscience, 1964. A.M. Nakhushev
bivector. In a real affine space A this matrix is similar
AMS 1980 Subject Classification: 35J15 to the matrix

BIVECfOR - A class [u, v] of ordered pairs u, v of


vectors of an affine space A, starting at a common ori-
gin (considered in a basis of the underlying space). A
bivector is considered to be equal to zero if its consti- o 0
tuent vectors u and v are collinear. A non-zero bivector with the blocks
generates a unique two-dimensional space in A, its car-
rier. Two bivectors are said to be parallel if their car-
rier planes are parallel. If A has finite dimension n, and
(u 1 , ,un) are the contravariant coordinates of u, See also Exterior product; Poly-vector; PlUcker coor-
while (v 1 , ,vn) are the contravariant coordinates of dinates.

403
BIVECTOR

References sor with components R aPr8 , R~H, R~~ and the second-
[1) ScHOUTEN, JA.: Tensor analysis for physicists, Cambridge valency bitensors with components R ab , Rg, R~ are
Univ. Press, 1951.
L.P. Kuptsov associated, respectively. The study of the algebraic
Editorial comments. Assign to a non-zero bivector (u, v) structure of the curvature tensor may then be reduced
the plane it generates, i.e. the corresponding pOint of the to the study of the pencil of quadratic forms Roo - Agoo,
Grassmannian of 2 planes in (the underlying vector space the second one of which is non-degenerate ( I gab I=FO).
of) A. Then the Plucker coordinates of this element in the The study of elementary divisors of this pair results in
Grassmann manifold can be identified with the Plucker a classification of the spaces Vn . If n = 4 (N = 6) and if
coordinates of the bivector. the form g af3 has signature (- - - +), then it can be
References shown that only three types of Einstein spaces exist.
[A 1) CARTAN,E.: Geometry of Riemannian spaces. (With notes A bivector may be assigned to each rotation in Vn ;
and appendices by R. Hermann), Math. Sci. Press, 1983
(translated from the French).
this means that in RN there corresponds a vector, which
[A2) GolAB, S.: Tensor calculus, Elsevier Sci. Publ. (translated is convenient for the study of infinitesimal transforma-
from the Polish). tions. Essentially, a bivector space is identical with a
[A3] RAsHEWSKI, P.K. [P.K. RAsHEVSKIl1: Riemannsche Geometrie biplanar space [2].
und Tensoranalyse, Deutsch. Verlag Wissenschaft., 1959
(translated from the Russian). References
[1) PETROV, AZ.: New methods in general relativity theory, Mos-
AMS 1980 Subject Classification: 15A72, 53A45 cow, 1966 (in Russian).
(2) NORDEN, AP.: 'On complex representation of tensors of a
BIVECI'OR SPACE - A centro-affme space EN biplanar space', Kazan. Gos. Univ. Uchen. Zap. 114 (1954), 45-
53 (in Russian).
(where N=n(n-I}/2), which may be assigned to A.2. Petrov
each point of a space An with an affine connection (in Editorial comments. Consider a bivector as represented
particular, to a Riemannian space Vn ). Consider all by an ordered pair of vectors (u, v) as in [A1], [A4] or the
tensors with even covariant and contravariant valencies article bivector. The Plucker coordinates of (u, v),
at a point of the space An (or Vn ); the covariant and pii = u i vi - u i Vi, then constitute a bivector as in the article
- - -i .
contravariant indices are subdivided into different above. Let u=Au, v=Av, i.e. u =Aju l and similarly for v.
pairs, for each one of which the tensor is skew- Then the pii transform as pii=2A~Allpkl. Whence the for-
symmetric. Tensors with these two properties are mulas above. Here the square brackets in A~AII are a nota-
called bitensors. If each skew-symmetric pair is regarded tion signifying taking an alternating average. Thus
as a collective index, the number of new indices will be Al1 Atl = (AJ,At - A~A)) / 2. If certain indices are to be sin-
gled out, i.e. exempt from this averaging process, this is
N = n (n - I) / 2. The simplest bitensor is the bivector
indicated by II. Thus
va{3 = - vfJa ~ Va, a, f3 = 1, . . . ,n; a = 1, . . . ,N. A~ I ki'l = (A~' - Af/:' - Ai//ii - A~' + Af/:i + A~i) / 6, and
gab=2ga[ygl{3l~l' cf. above. This is a notation introduced by
If, at a point P of An,

t
R. Bach [A5]. Cf. also Alternation.

AI} = [~~: Ag = 2A~aAgl = Af~Agt, In more modern terms what is described here is the bun-
dle of bivectors over An.
A centro-affine space is an affine space with a dis-
tinguished point, i.e. practically a vector space. It is not a
v
then =Agv b , and the set of bivectors assigned to An term which is still greatly used.
(or Vn ) at a given point defines a vector space of References
[A1] CARTAN, E.: Geometry of Riemannian spaces. (With notes
dimension N such that the components satisfy the con- and appendices by R. Hermann), Math. Sci. Press, 1963
ditions (translated from the French).

*
[A2] GoLAB, S.: Tensor calculus, Elsevier Sci. Publ., 1974
(translated from the Polish).
I Ag I 0, AgA~ = 8~ [A3] RASHEWSKI, P.K. [P.K. RASHEVSKIT]: Riemannsche Geometrie
i.e. this set defines the centro-affine space EN, called und Tensoranalyse, Deutsch. Verlag Wissenschaft., 1959
the bivector space at the given point. In Vn the bivector (translated from the Russian).
[A4] SCHOUTEN, J.A: Ricci-calculus, Springer, 1954.
space may be metrized with the aid of the metric tensor [A5] BACH, R.: 'Zur Weylschen Relativitatstheorie und der Weyl-
deC schen Erweiterung des Krummungsbegriffs', Math. Zeitschr.
gab = ga{ry~ = gcrygf30-ga~g{3y, 9 (1921),110-135.
after which EN becomes a metric space R N . AMS 1980 Subject Classification: 15A72, 15A75,
Bivector spaces are used in Riemannian geometry 53805, 53820, 53830
and in the general theory of relativity. The bivector
space RN is constructed at a given point of the space BJORLING PROBLEM - A problem in the theory of
Vn , and different representations of the curvature ten- minimal surfaces (cf. Minimal surface), which is to find

404
BLASCHKE PRODUCT

the minimal surface passing through a given non-closed The sequence 0, ... ,0, a 1. a2, . .. (with n zeros),
analytic curve L, with given tangent planes along L. which is usually written out in non-decreasing order of
Bjorling's problem on minimal surfaces is the analogue I ak I, is the sequence of all zeros of the Blaschke pro-
of the Cauchy problem of differential equations. The duct (*) (each zero is written down as many times as its
problem was posed and solved by E.G. Bjorling [1]. A multiplicity). Thus, Blaschke's theorem describes the
solution of this problem always exists, is unique, and is sequences of zeros of all possible Blaschke products.
explicitly expressed by the Schwarz formula for The product (*) can be regarded as the simplest
minimal surfaces. The solution of Bjorling's problem bounded holomorphic function in the disc K with a
always permits one to find the minimal surface when- prescribed sequence of zeros. It converges absolutely
ever one of its geodesic lines or one of its asymptotic and uniformly inside K, represents a bounded holo-
lines or one of its lines of curvature is known. If the morphic function I B (z) I < I in K, with angular boun-
given curve is planar and is a geodesic on the minimal dary values of modulus one almost everywhere on aK.
surface which is desired to be found, then the plane of A necessary and sufficient condition for a bounded
the curve L will be a plane of symmetry of the minimal holomorphic function f(z) in K, I f(z) I <1, to be a
surface. Blaschke product, is
References
[In I j(re
2"
[1] BJORLING, E.G.: Arch. Grunert IV (1844),290. lim
r-+l
i8 ) I dO = O.
[2] DARBoux, G.: Lerons sur la theorie generale des surfaces et ses
applications geometriques du calcul infinitesimal, 1, Gauthier- Blaschke products may be used to give a product
Villars, 1887.
[3] BLASCHKE, W.: Vorlesungen aber DifJerentialgeometrie und
representation of important classes of holomorphic
geometrische Grundlagen von Einsteins Relativitiitstheorie, 1, functions in the unit disc K. Thus, a proof was given
Springer, 1921. for the following theorem of Blaschke: A sequence {ak}
[4] NITSCHE, J.c.c.: Vorlesungen aber Minimalfliichen, Springer,
of points in the disc K is the sequence of all zeros of
some bounded holomorphic function f (z), If (z) I < 1,
1975.
I.Kh. Sabitov
AMS 1980 Subject Classification: 49C10, 53A 10 in K if and only if the series ~/1-1 ak I) is conver-
gent. Moreover, f (z) can be represented as a product
BLASCHKE PRODUCf, Blaschke function - A regu-
j(z)=Blz)g(z),
lar analytic function of a complex variable z, defined in
the unit disc K = {z: I z I < I} by the finite or infinite where BIz) is the Blaschke product constructed with
product the zeros {ad of the function f (z), while g (z) is a
(*) zero-free holomorphic function in K, I g(z) I ~1, which
can be represented relatively simply using an integral
where n is a non-negative integer, and {ad, formula. Apart from the bounded functions, similar
k = 1,2, ... , is a sequence of points ak EK\ {O} such product representations may be constructed for func-
that the product on the right-hand side of (*) converges tions of bounded form and for Hardy classes [2] - [4]
(the convergence condition is necessary only for an (cf. Function of bounded fonn).
infinite product). The Blaschke product was introduced The above theory was considerably generalized by
by W. Blaschke [1], who proved the following theorem: M.M. Dzhrbashyan [5], [6], who constructed infinite
A sequence {ad of points akEK\ {O} defines a func- products of a more general nature, which are suitable
tion of the type (*) if and only if the series for the factorization of much larger classes of mero-
~/1-1 ak I) is convergent. Each factor of the form morphic functions. A solution was also found for the
problem of constructing analogues of Blaschke pro-
bk(z)
I ak I ak- z
= ------, ducts and Blaschke's theorem for doubly-connected
ak l-ak z
domains [7] and, in general, finitely-connected [8]
called the Blaschke factor for ab defines a univalent domains. The solution of the problem of constructing
conformal mapping of the disc K onto itself, which suitable analogues of the Blaschke product for holo-
takes z =ak to zero, with the normalization morphic functions of several complex variables is ren-
bk ( -ak / I ak 1)= 1. The factors of the form bo(z)=z dered very difficult by the fact that the zeros of such
may be interpreted as Blaschke factors which functions cannot be isolated.
correspond to the zero z = 0 with the normalization References
bo(l)= 1. The definition of Blaschke factors and [I] BLASCHKE, W.: 'Eine Erweiterung des Satzes von Vitali tiber
Blaschke products is readily carried over to a disc of Folgen analytischer Funktionen', Berichte Math..Phys. KI.,
Sachs. Gesell. der Wiss. Leipzig 67 (1915), 194-200.
arbitrary radius, and also to an arbitrary simply-
[2] l'IuwALOW, I.I. [I.I. PIuvALov]: Randeigenschaften anaiytischer
connected domain, which is conformally equivalent to a Funktionen, Deutsch. Verlag Wissenschaft., 1956 (translated
disc. from the Russian).

405
BLASCHKE PRODUCT

[3] NEVANLINNA, R.: Ana(ytic fimctions, Springer, 1970 (translated References


from the German). [1] BLASCHKE, W.: 'Ein Beweis fUr die Unverbiegbarkeit geschlos-
[4] CoUINGWOOD, RF. and LoUWATER, A.J.: The theory of clus sener konvexer Flichen', Gott. Nachr. (1912),607-610.
ter sets, Cambridge Univ. Press, 1966. [2] WEYL, H.: 'Ueber die Starrheit der Eiflachen und konvexer
[5] DzHuASHYAN, M.M.: Integral transforms and representation of Polyeder', Sitzungsber. Akad. Wiss. Berlin (1917), 250266.
fimctions in a complex domain, Moscow, 1966 (in Russian). Also: Gesammelte Abh., Vol. 1, Springer, 1968, pp. 646-662.
[6] DzHuASHYAN, M.M.: 'The theory of factorization and boun- [3] EFIMOV, N.V.: 'Qualitative problems in the theory of deforma-
dary properties of functions meromorphic in a disc', Russian tions of surfaces', Uspekhi Mat. Nauk 3, no. 2 (24) (1948), 47-
Math. Surveys 28, no. 4 (1973), 1-12. (Uspekhi Mat. Nauk 28, 158 (in Russian). Translated into German as book.
no. 4 (1973), 3-14) [4] BLASCHKE, W.: Einflihrung in die Difjerentialgeometrie,
[7] KAs'yANYUK, SA.: 'On functions of classes A and H 8 in an Springer, 1950. M.l. Voitsekhovskii
annulus', Mat. Sb. 42 (84), no. 3 (1957), 301326 (in Russian).
[8] TAMRAZOV, P.M.: 'Conformal-metric theory of doubly con AMS 1980 Subject Classification: 53A45
nected domains and the generalized Blaschke product', Soviet
Math. Dokl. 6, no. 2 (1965), 432-435. (Dokl Akad. Nauk
SSSR 161, no. 2 (1965), 308-311) BLOCH CONSTANT - An absolute constant, the
P.M. Tamrazov
existence of which is established by Bloch's theorem. Let
Editorial comments. Some standard references for H be the class of all holomorphic functions f (z) in the
Blaschke products and related subjects are [A1], [A2] and disc I z I < 1 such that J' (0) =1. The Riemann surface
[A3].
of the function f(z) contains on one of its sheets a
References largest open disc of radius Bf>O. It was shown by A.
[A1] DUREN, P.L.: Theory of HP spaces, Acad. Press, 1970.
Bloch [1] that
[A2] RUDIN, W.: Real and complex analysis, McGraw-Hili, 1974.
[A3] GARNETT, J.B.: Bounded analytic functions, Acad. Press., inf {B/ fEH} = B > O.
1981.
The most precise known estimate is V3 / 40;;;;;B 0;;;;;0.472
AMS 1980 Subject Classification: 30D50 [2]. It follows from Bloch's theorem that the Riemann
surface of an entire function contains single-sheeted
BLASCHKE SELECfION TIIEOREM, Blaschke com- discs of arbitrary radius; this is equivalent to the Picard
pactness principle - A metric space of convex bodies is theorem.
locally compact, i.e. it is possible to select, out of an References
infinite set of convex bodies belonging to a given cube, [1] BLOCH, A.: 'Les theoremes de M. Valiron sur les fonctions
a sequence which converges to some convex body in entieres et la theorie de l'uniformisation', Ann. Fac. Sci. Univ.
this cube. Toulouse (3) 17 (1925), 1-22.
[2] AHI.FORS, L.V. and GRUNSKY, H.: 'Ueber die Blochsche Kon-
The theorem was demonstrated in 1916 by W. stante', Math. Z. 42 (1937), 671-673.
Blaschke [1]. [3] GoLUZIN, G.M.: Geometric theory offunctions of a complex
variable, Amer. Math. Soc., 1969 (translated from the Russian).
References
[1] BLASCHKE, W.: Kreis und Kugel, Chelsea, reprint, 1949. E.D. Solomentsev
A.B. Ivanov Editorial comments. For the connection between the
Editorial comments. theorems of Bloch and Picard, see e.g. [A 1).

References References
[A1] KELLY, PJ. and WEISS, M.L.: Geometry and convexity, Wiley, [A 1] HEINS, M.: Selected topics in the classical theory of functions
1979.
of a complex variable, Holt, Rinehart and Winston, 1962.

AMS 1980 Subject Classification: 52A07 AMS 1980 Subject Classification: 30C75

BLASCHKE - WEYL FORMULA - A variant of the BLOCK DESIGN - A system of subsets of a finite set
Green formulas for the rotation field y of an infini- which satisfies certain conditions related to the fre-
tesimal deformation of a surface with position vector x: quency of appearance of pairs of elements of the set in
the subsets of the system. The concept of a block
2 fj [xfu-fv-] du dv = :!c(xy dy). design arose in the theory of design (planning) of (sta-
tistical) experiments in the nineteen-twenties and
The proof and the idea of applying the Blaschke - Weyl nineteen-thirties, but had been studied as early as the
formula to demonstrate the rigidity of ovaloids is due mid-nineteenth century under the name tactical confi-
to W. Blaschke [1] and to H. Weyl [2]. For other appli- gurations. The concept of a block design is a variation
cations see [3]. The Herglotz formula is an analogue of of the concepts of a hypergraph, a net and a complex.
the Blaschke- Weyl formula. The formula has been As a rule, in a block design several additional limita-
generalized to the case of infinitesimal deformations of tions are imposed on the family of subsets. A block
surfaces in spaces of constant curvature. design may be defined by a pair of sets (V, B) where

406
BLOCK DESIGN

To each block design with v elements and b blocks


Bj ~ V, i=I, ... ,b. corresponds an incidence matrix A = I cij II, where
cij = 1 if ai EBj and Cij =0 otherwise, i = 1, ... , v;
The elements of the set V are called the points (treat- j = 1, ... ,b. The theory of block designs considers
ments) of the block design, or varieties or elements, problems on the existence and classification as well as
while the elements of the set B are called its blocks. The problems involved in the construction of a block design
element ai and the block Bj are incident if ai EBj . The with given parameters. The parameters of a block
number 1 Bj 1 of elements incident with Bj is usually design are related in a certain manner. The following
denoted by kj' while the number of blocks incident equations are valid for BIB-designs:
with ai is denoted by ri' The number
vr = kb, (1)
I {B/ ajEBj , a/EBj} I
A(V -1) = r(k -1).
is denoted by Ail. The numbers v, b, ri, kj' Ail
(i, 1= 1, ... , v; j = 1, ... ,b) are said to be the parame- Equation (1) and the relationships
ters of the block design. If ri =r for all i = 1, ... ,v,
~nj = v-I,
kj=k for all j=1, ... ,b and Ai/=A, (V, B) is a bal- j=1
anced incomplete block design (BIB-design) with param- m

eters v, b, r, k, A. The meaning of the word 'balanced' is ~Ajnj = r(k -1),


i=1
that the frequencies of appearance of all elements and m. { nj if i*j,
pairs of elements in the blocks are respectively equal, ~PJ"k
k=1 = nj - 1 if'l -
- j' , l. , .j-- 1
, ... ,m,
while the word 'incomplete' indicates that, generally
speaking, not all the k-element sets are included in B. njpjk = njP{k = nkPt, i, j, k = 1, ... ,m,
Let exactly m different numbers AI, . . . ,Am be are valid for the parameters of PBIB(m)-designs. The
encountered among the numbers Ail, i, 1= 1, ... ,v, and incidence matrix of a BIB-design satisfies the fundamen-
let m symmetric association relations be introduced on tal matrix relation
the elements of the set V so that the following condi-
tions are satisfied: AAT = (r-A)E+V, (2)
1) the set V2 of all pairs of distinct elements of V is where E is the unit matrix of order v and J is the
subdivided into m disjoint subsets VI, ... , V~ and, if matrix of order v consisting exclusively of ones. The
(a, a')E VJ, then a and a' are said to be j-associated; existence of a (0, I)-matrix which satisfies condition (2)
2) 1 {B/ a EBj , a' EBj , (a, a')E Vt} 1 =Ai' is a sufficient condition for the existence of a BIB-
i=1, ... ,m,j=1, ... ,b; design with the given parameters. The inequality b ~v
3)1 {a: 3a' (a, a')EVt} 1 =ni' i=1, ... ,m; follows from (2). A BIB-design for which b = v (i.e. also
4) 1 {an: (an, a)E VJ, (an, a')E Vi, (a, a')E Vt} I =P)k> r=k) is said to be a symmetric block design or a
and, in view of the symmetry, P)k =P~j, (v, k, A)-configuration. The following theorem applies
i, j, k = 1, ... ,m. A block design with the properties 1) to symmetric BIB-designs: If there exists a symmetric
- 4) is said to be a partial balanced incomplete block BIB-design with parameters v, k, A, then: 1) if v is even,
design with m types of relations or a PBIB(m)-design. k - A is a perfect square; 2) if v is odd, the equation
The rule which specifies the association relation is Z2 = (k-A)x2+(_1)(v-I)/2 Ay 2
called the association deSign. A BIB-design is a has a solution in integers x, y, Z, not all of which are
PBIB(1 )-design. An example of a PBIB(2)-design is a zero. The conditions of this theorem are sufficient for
block design which may be represented as the table the existence of a ratiollal matrix A satisfying equation
1 222333 (2).
456456456 A special range of problems involving the existence
789978897 of BIB-designs arises in the context of the following
10 11 12 11 12 10 12 10 11, problem: Given b blocks, find the conditions necessary
where any two numbers in the same column are 1- for the completion of these blocks to form a BIB-
associated, while any two numbers in different columns design. In their most general form these conditions are
are 2-associated. Here v=12, b=9, r=3, k=4, AI =1, expressed as the requirement of positive definiteness of
A2=0, nl =9, n2=2. some quadratic form Q, as well as the possibility of
representing Q as a sum of squares of linear forms with
pI = IIp)k II = II~ ~II' p2 = IIp7k II = II~ ~II non-negative coefficients.
The following subclasses of BIB-designs have been

407
BLOCK DESIGN

most extensively studied: Steiner systems (BIB-designs Editorial comments.


=
with X 1), in particular Steiner triple systems (k 3); = References
Hadamard configurations (v =b =4t -1, r =k =2t -1, [A1] lIALMos, P.R.: A Hilbert space problem book, Springer,
X=t -1, t ~2), the incidence matrix of which is 1982.
obtained from a Hadamard matrix; affine finite AMS 1980 Subject Classification: 47BXX
geometries and projective finite geometries [1]. In the
class of PBIB-designs those most extensively studied BLOITO GAMES - A class of two-person zero-sum
are PBIB(2)-designs, which may be subdivided accord- games (cf. Two-person zero-sum game) in normal form
ing to their association design into group-divisible block in which the pure strategies (cf. Strategy (in game
designs, triangular block designs, Latin square block theory) of the players are distributions of limited
designs, cyclic block deSigns, etc. [3]. resources (indivisible or divisible) over several objects,
The methods for constructing block designs are usu- and the gain, or payoff, is equal to the sum of the gains
ally classified as direct and recursive. The latter methods on the individual objects. So named after the fictional
make it possible to use designs with smaller parameter colonel Blotto who was supposed to have participated
values to construct designs with larger parameter in one of the first games of this type.
values. The direct methods mostly utilize the properties References
of finite fields or some geometric properties. [1] KARLIN, S.: Mathematical methods and theory in games, pro-
Block designs are used in the design of experiments, gramming and economics, Addison-Wesley, 1959.

the theory of games, graph theory and in the construc- I.N. Vrublevskaya
tion of error-correcting codes. AMS 1980 Subject Classification: 90D05
References
[I] RySER, R.J.: Combinatorial mathematics, Wiley, 1963. BOCHNER ALMOST-PERIODIC FUNCfIONS -
[2] HAll, M.: Combinatorial theory, Blaisdell, Waltham (MA), Functions equivalent to Bohr almost-periodic functions;
1967. defined by S. Bochner [1]. A function f (x) which is
[3] SHIROKOVA, S.A.: 'Block designs', Russian Math. Survrys 23,
no. 5 (1968), 47-94. (Uspekhi Mat. Nauk 23, no. 5 (1968), 51-
continuous in the interval (- 00, (0) is said to be a
Bochner almost-periodic function if the family of func-
98) V.E. Tarakanov
tions {f (x + h ): - 00 < h < oo} is compact in the sense
Editorial comments. The theorem giving conditions on of uniform convergence on (- 00, (0), i.e. if it is possi-
the numbers v, k, A in order that a symmetric BIB-design ble to select from each infinite sequence f (x + hk ),
with these numbers as parameters exists, is called the k = 1, 2, ... , a subsequence which converges uniformly
Bauck- Ryser- Chawla theorem. to f (x) on (- 00, (0). Bochner's definition is exten-
AMS 1980 Subject Classification: 05B05 sively employed in the theory of almost-periodic func-
tions; in particular, it serves as the starting point in
BLOCK-DIAGONAL OPERATOR with respect to a abstract generalizations of the concept of almost-
given orthogonal decomposition H = ~k;;'1 (f) H k of a periodicity.
Hilbert space H - A linear operator A on H which References
leaves each of the subspaces H k , k~l, invariant. The [I] BOCHNER, S.: 'Beitrage zur Theorie der fastperiodischen Funk-
spectrum of A is the closure of the union of the spectra tionen I, Funktionen einer Variablen', Math. Ann. 96 (1927),
119-147.
of the 'blocks' A IH. =Ak> k~l, II A I =SUPk;;.1 I Ak II [2] LEVITAN, B.M.: Almost-periodicfunctions, Moscow, 1953 (in
A block-diagonal operator in the broad sense of the Russian).
E.A. Bredikhina
word is an operator A of multiplication by a function X
in the direct integral of Hilbert spaces Editorial comments.

H = L$H(t)dp.(t), (Aj)(t) = A(t)f(t), t EM.


References
[A1] MAAK, W.: Fastperiodische Funktionen, Springer, 1967.
[A2] AMERIO, L. and PROUSE, G.: Almost-periodic functions and
Here X(t) is a linear operator acting on the space H(t). functional equations, v. Nostrand, 1971.
Each operator which commutes with a normal operator
AMS 1980 Subject Classification: 42A75, 43A60
is a block-diagonal operator with respect to the spectral
decomposition of this operator. See also Diagonal
BOCHNER INTEGRAL - An integral of a function
operator.
with values in a Banach space with respect to a scalar
References measure. It belongs to the so-called strong integrals
[1] NAlMARK., M.A.: Normed rings, Reidel, 1984 (translated from (cf. Strong integral).
the Russian).
N.K. Nikol'skii Let F(X; E, 18, JL) be the vector space of functions
B.S. Pavlov x (f), fEE, with values in a Banach space X, given on a

408
BOCHNER - MARTINELLI REPRESENTATION FORMULA

space (E, ~,p.) with a countably-additive scalar meas- and


ure p. on a a-algebra ~ of subsets of E. A function xn(t)dpo ~ x(t)dp..
Xo EF is called simple if
4) The space .fl? is complete with respect to the norm
(d. Convergence in nonn)
_ {Xi, tEBiE'B, p.(Bi)<oo, Bi nBj= 0,
xo(t) - N'=I= '. . .-1 N II x-y II = III x(t)-y(t) II dp..
0, tEE \ UBi' 1 J, I, } - , . . . , , B
i=l
5) If T is a closed linear operator from a Banach
A function x EF is called strongly measurable if there space X into a Banach space Y and if
exists a sequence {xn} of simple functions with X E Ie(X; E, '13, po), Tx E Ie(Y; E, '13, po),
I x - Xn II ~O almost-everywhere with respect to the then
measure p. on E. In such a case the scalar function TX(t)dpo = TX(t)dp..
II x II is ~-measurable. For the simple function Xo
def N If T is bounded, the condition
ixo(t)dpo = i~Xip.(B;).
Tx E Ie(Y;E, 'B,po)
A function x is said to be Bochner integrable if it is is automatically fulfilled, [3] - [5].
strongly measurable and if for some approximating The Bochner integral was introduced by S. Bochner
sequence {xn} of simple functions [1]. Equivalent definitions were given by T. Hildebrandt
lim I II X (t) - xn(t) I dpo = O. [2] and N. Dunford (the Do-integral).
n-+oo E
References
The Bochner integral of such a function over a set B E ~ [I] BOCHNER, S.: 'Integration von Funktionen, deren Werte die
Elemente eines Vektorraumes sind', Fund Math. 20 (1933),
is
IX (t) dpo = lim
B n-+oo E
I XB(t)Xn(t) dpo, 262-276.
[2] HILDEBRANDT, T.H.: 'Integration in abstract spaces', Bull.
Amer. Math. Soc. 59 (1953), 111-139.
where XB is the characteristic function of B, and the [3] YOSIDA, K.: Functional analysis, Springer, 1980.
limit is understood in the sense of strong convergence [4] HILLE, E. and PHILLIPS, R.: Functional analySiS and semi-
in X. This limit exists, and is independent of the choice groups, Arner. Math. Soc., 1957.
[5] DUNFORD, N. and ScHWARTZ, J.T.: Linear operators. General
of the approximation sequence of simple functions. theory, I, Interscience, 1958.
Criterion for Bochner integrability: For a strongly- v.l. Sobolev
measurable function x EF to be Bochner integrable it is Editorial comments. A simple function is also called a
necessary and sufficient for the norm of this function to step function. A good recent textbook on integrals with
be integrable, i.e. values in a Banach space is [A1]; [A4] is specifically about
the Bochner integral.
III x(t) II dpo < 00.
B References
[A1] DIESTEL, J. and UHL, J.J.: Vector measures, Math. Surveys,
The set of Bochner-integrable functions forms a vec- 15, Amer. Math. Soc., 1977.
tor subspace .fl? of F, and the Bochner integral is a [A2] ZAANEN, A.C.: Integration, North-Holland, 1967.
linear operator on this subspace. [A3] BOURBAKI, N.: Elements of mathematics. Integration,
Addison-Wesley, 1975, Chapt.6; 7; 8 (translated from the
Properties of Bochner integrals: FrenCh).
1) [A4] MIKUSrNSKI, J.: The Bochner integral, Academic Press, 1978.

AMS 1980 Subject Classification: 28805, 46G10

2) A Bochner integral is a count ably-additive p.- BOCHNER. - MARTINELLI REPRESENTATION


absolutely continuous set-function on the a-algebra ~, FORMULA, Bochner- Martinelli representation,
1.e. 00 Bochner- Martinelli formula - An integral representa-
U~IB,X(t)dpo = i~jX(t)dpo, tion for hoIomorphic functions, which is defined as fol-
lows [1], [2]. Let the function f be holomorphic in a
if B j E~, B j n B = 0, i=/=}, p.(B
j j) < 00, and domain D ccn with piecewise-smooth boundary aD,
II kx(t)dp. II~O if p.(B)~O, uniformly for BE'Jj. and let f be continuous in its closure D. Then the
3) If Xn EF, Xn~X almost-everywhere with respect to expresslOn
(n -l)! ( ren x (1)
the measure p. on B E'Jj, if I Xn II.;;;;f almost-everywhere (2'TTi)" iD I r- z Iln
with respect to p. on B, and if fJ"(t) dp.< 00, then n _ _ _
x ~ (rj - Z)drl /\ drl /\ ... /\ [drjl/\ drj /\ ... /\ drn =
X E Ie j=l

409
BOCHNER -MARTINELLI REPRESENTATION FORMULA

= {fez), if ZE~ boundary values of holomorphic functions of several


0, if Z ~D, complex variables. It was introduced by S. Bochner [1]
and by E. Martinelli [2].
here [dIj] means that the term dIj is to be omitted, is References
called the Bochner- Martinelli representation. For [1] BocHNER, S.: 'Analytic and meromorphic continuation by
n = 1 this representation is identical with the Cauchy means of Green's formula', Ann. of Matk (2) 44, no. 4 (1943),
integral formula (cf. Cauchy integral), but for n > 1 its 652-673.
[2] MARTINELLI, E.: Rend Accad Ital. 9 (1938),269-283.
kernel is not holomorphic in z, which is the reason for [3] VLADIMIROV, V.S.: Methods of the theory offunctions of several
the limited applicability of the Bochner- Martinelli complex variables, M.l.T., 1966 (translated from the Russian).
representation in the theory of functions of several E.M. Chirka
complex variables. The kernel of the Editorial comments. SokhotskIT's formula is also known
Bochner- Martinelli representation is the ~-differential as Plemelj's formula. The Cauchy-Green formula is also
form of bidegree (n, n -1): known as the Cauchy- Pompeiu formula.
w Z - (n -I)! 1 x The Bochner-Martinelli kernel is a special case of a
(~, ) - (2"'i)" I ~ - Z 12n Cauchy- Fantappie kernel. The integral representation (2)
solves the a-equation:
X ~(Ij-Zj)dII /\ d~1 /\ ... /\ [dIjl/\d~j /\ ... /\ dIn /\ d~n'
j=1 af =v with av = 0,
which is defined in en, has a singular point at ~=z, for compactly-supported (0, 1)-forms v, by substituting v for
and is a-closed (i.e. aw = 0) outside the singular point. d f in the right-hand side of (2) and omitting the integral over
If n > 1, the form w is equal to aw' (~, z), where the boundary. When v is not compactly supported, the

(~, z) = - (~2-i! .I ~ _ 112n -2~ [IIdIk /\ d~k]


boundary integral causes difficulties. These can be solved
w' for strictly pseudo-convex domains and the
"'/ Z J=I k~J Bochner-Martinelli kernel then occurs in an explicit solu-
is a form of bidegree (n -I, n -1), the coefficient of tion operator for the a-equation.
which is a fundamental solution of the Laplace equa- References
tion; here [A 1]KRANTZ, S.: Function theory of several complex variables,
Wiley, 1982.
d'" - d'"
d</> = ~dZk ~ and d</>
uZk
= ~dZk-;-:r--.
uZk
[A2] HENKIN, G.M. and LEITERER, J.: Theory of functions on com-
plex manifolds, Birkhauser, 1984.
The following integral representation, which general- [A3] RANGE, R.M.: Holomorphic functions and integral represen-
tations in several complex variables, Springer, 1986.
izes formula (1), is the analogue of the Cauchy - Green
formula (cf. Cauchy integral): If the function f is con- AMS 1980 Subject Classification: 32A25
tinuously differentiable in the closure of a domain
D ccn with piecewise-smooth boundary aD, then, for BOGOLYUBOV CHAIN OF EQUATIONS, BBGKY-
any point zeD, equations, Bogolyubov - Born - Green - Kirkwood - Yvon
fez) = !J<nwc~, z)- ,ff<n/\wct z). (2)
equations - A chain of equations (hierarchy) for the
one-particle, two-particle, etc., distribution functions of
The function a classical statistical system. The functions are defined
fez) = ff<nw(~, z), as follows:
where f is a smooth hypersurface in R2n =en and f is a F,(t, rl> ... ,r" PI> ... ,pJ = (1)
function on f which is Lebesgue-integrable, is said to V' jwNdrs+1 ... drNdp,+1 ... dpN,
be an integral of Bochner- Martinelli type. As for
Cauchy-type integrals, SokhotskiI's formula, with the where s = I, 2, ... , V is the volume of the system and
usual restrictions on f and f, is applicable to WN is the N-particle normalized distribution function
Bochner - Martinelli-type integrals. A which satisfies the Liouville equation:
Bochner- Martinelli-type integral is a complex function OWN
at === {H, WN}, (2)
which is harmonic everywhere outside f; in the general
case this function is holomorphic only for n = I. If where the brackets are the Poisson brackets and H is
f=aD, then if n~l, the conditi(~.nJ(z) 0 outside Dis the Hamiltonian of the system. In the thermodynami-
equivalent to the holomorphy of finD. cal limit V~oo, V / N=v=const, the chain of equa-
The Bochner- Martinelli representation is employed tions has the form of an equation for Fs with a specific
to demonstrate other integral representations (e.g. the 'link' with the function Fs + 1 of a higher rank:
Bergman - Weil representation), in holomorphic con- of,
tinuation from the boundary, and also in the theory of at - {H" F,} === (3)

410
BOGOL YUBOV INEQUALITY

= -1 j {
V
~
I <;;;,.;s
<P( I r, -rs+1 I),Fs+I}drs+1 dps+h
tistical physics', in J. de Boer and G.E. Uhlenbeck (eds.): Stu-
dies in statistical mechanics, Vol. I, North-Holland, 1961.
[2] UHLENBECK, G.E. and FORD, G.V.: Lectures in statistical
where <P( I ri - rs + I I) is the potential of the interaction mechanics, Amer. Math. Soc., 1963. lA K ko
. . vasm v
between the i-th and the (s + l)-th particles, while Hs is
the Hamiltonian of the s-particle system. AMS 1980 Subject Classification: 82A05
Under conditions of thermodynamic equilibrium
BOGOLYUBOV INEQUALITY in statistical mechan-
when the momentum distribution of each particle is the
MaxweU distribution, one considers the s-particle coor-
ics - 1) Bogolyubov's inequality for the free-energy func-
dinate distribution functions of particles, defined by
tional is an inequality that gives rise to a variational
principle of statistical mechanics. The following ine-
relations of type (1) in terms of an N-particle function
quality is valid for any Hermitian operators U I and
WN(rh,rN) = ~exp[-~I}, (4) U2 :
~<UI-U2>UI o;;;;f[Utl-/[U2] 0;;;; (*)
where HI = H - H 0, and H 0 is the sum of the kinetic
energies of the particles of the system, while the. co~i 1
0;;;; N <UI - U2>u"
guration integral Q is defined by the normalizatIOn
condition (4). The chain of equations for these func-
tions has the form
where
flU] =- ~ In Tre- u /8.

-aFs
+ -I-aus
F+ (5)
This expression has the meaning of the free-energy d~n
aXI 0 aXI S

sity for a system with Hamiltonian U; the extenSIve


Ilja<p(lrl-rs+II)F d parameter N is the number of particles or the volume,
+-0 -v a
XI
s+1 rs+1 = 0,
depending on the system; E> is the absolute temperature
where Us is the potential energy of the interaction in energy units, and
between s particles of the system. _ Tr( ... e- u /8)
The distribution functions, in particular FI and F 2 , < >u= Tre-U /8
may be used to express all the specific characteristics of
denotes the thermodynamic average with respect to the
statistical systems. The principal difficulties involved in
Hamiltonian U.
the study of the functions (3) or (5) are caused by the
The Bogolyubov inequality (*) is used to obtain the
problems of closing the system (decoupling the chain of
exact thermodynamic limit solutions of model problems
equations) and solving the closed system under special
in statistical quantum physics [1], [2], in studies using
boundary conditions for the functions Fs. This study is
the method of molecular fields [3], in proving the
essential for physical systems of various kinds and is
existence of the thermodynamic limit, and also in order
most advanced for the case of short-range interaction,
to obtain physically important estimates for the free
when rb / v 1, where ro is the effective radius of
energies of various multi-particle systems [4]. There
interaction between the particles, and for the case of
exist generalizations of the Bogolyubov inequality (*) to
long-range interaction, when rb / v 1, in particular
the case of von Neumann algebras with a 'trace' [5] and
for systems in which electrostatic (Coulomb) forces are
general von Neumann algebras [6].
operative. In the time-dependent theory this leads
directly to the kinetic Boltzmann equation for the References
[1) BOGOLYUBOV, JR., N.N.: 'On model dynamical systems in sta-
single-particle function F I or to the V1asov kinetic tistical mechanics', Physica 32 (1966), 933-944.
equation, and in the equilibrium theory to the virial [2) BOGOLYUBOV, JR., N.N.: A method for studying model Hamil-
decomposition for the thermodynamic potential or to tonians, Pergamon Press, 1972 (translated from the Russian).
[3] TYABLlKOV, S.V.: Methods of the quantum theory of magnetism,
specific Coulomb corrections. Plenum Press 1967 (translated from the Russian).
In considering quantum-statistical systems, the [4) KUDRIN, L.P:: Statistical plasma phySiCS, Moscow, 1974 (in
BBGKY-hierarchy is constructed for s-particle statisti- Russian).
cal quantum operators <x I, . . . ,Xs I Fs Ix~ , ... ,x~ > [5) RUSKAI, M.B.: 'Inequalities for traces on von Neumann alge-
bras', Comm. Math. Phys. 26 (1972),280-289. .
which are traces over the variables s + 1, ... ,N of the [6) AIwa, H.: 'Golden - Thompson and Peierls - Bogolyubov me-
particles of the general N-particle operator - the den- qualities for a general von Neumann algebra', Comm. Math.
sity matrix. The form of these equations is similar to Phys. 34 (1973), 167-178.
that of (3), in which the classical Poisson brackets have Editorial comments.
been replaced by quantum brackets.
References .
References [A1] RUELLE, D.: Statistical mechanics: rigorous results, Benjamin,
[I] BOGOLYUBov, N.N.: 'Problems of the dynamical theory in sta- 1974.

411
BOGOLYUBOV INEQUALITY

2) Bogolyubov's inequality for Green functions and several complex variables. It was obtained in 1956 by
correlation functions. The following inequality holds for N.N. Bogolyubov in the justification of the dispersion
time-temperature commutator Green functions (cf. relations in quantum field theory ([1], Appendix A).
Green function in statistical mechanics). If one defines The modem formulation is as follows. Let a function
I f(z), z=(zJ, ... ,zn)=x+ry, be holomorphic in an
I open set ~ = {z: I z I <7J, Y EC}, where C is an open
8
A;B = 2~ <AB;s>Hds, cone in Rn with apex at zero such that C C)= 0 ,n (-
where Bis is the operator B in the Heisenberg represen- let the open set 0 eRn be contained in the ball
tation for imaginary time t = is, < ... > denotes the I x I <7J and suppose that for any test function <p(x)
thermodynamic average for Hamiltonian H and () is the from D( 0) the limit
absolute temperature, then lim jf(x +iy)cp(x)dx
y-+O
I A;B 12 os;;; (1) yeC

exists, independent from the way in which y ~O, Y E C;


os;;; I A;A+ 1'1 B+;B I
then f (z) caI'!. be analytically continued into the
Substituting A =iQ=[Q, H]_ one gets domain r;; u
0:
I B+;B I os;;; I <[Q,Bl_>HI2 (2) o = U{z: I z-~ I <8~o(~1,
277' I <[Q, [Q+, Hl-l- >H I'
where [,]_ is the commutator. One may also write where 0 is a complex neighbourhood of the set 0,
down the inequality O<()< I is a constant which depends only on the cone
C, and ~om is the distance from the point ~ to the
<BB++B+B> ~ 28 I <[Q, Bl->H 12 (3)
boundary of O. Bogolyubov's edge-of-the-wedge
...- I <[Q,[Q+,Hl-l->HI ,
theorem also remains valid if 7J = 00. In such a case,
which follows from (2). Owing to the general nature of and under certain assumptions regarding the growth of
the inequalities (2) and (3) they are extensively the function f (z), one obtains the original formulation
employed in studies of various physical systems. of Bogolyubov (cf. [1]; the light cone in R4 plays the
An improved estimate of the correlation function role of the cone). There exist various proofs and gen-
<BkBt +Bt Bk>H eralizations of this theorem [2], [5]. In particular, one
is attained in (3) by selecting as the operator Qk = Q (k) can mention the generalizations to hyperfunctions [4]
some 'quasi-integral' of motion which, for k=O, com- and to holomorphic cocYcles [3].
mutes with the Hamiltonian H. The commutator in the Bogolyubov's edge-of-the-wedge theorem is exten-
numerator of the right-hand side of (3) describes the sively employed in axiomatic quantum field theory, in
transformation properties of Bk under the infinitesimal the theory of partial differential equations and in the
transformations of the continuous symmetry group gen- theory of boundary values of holomorphic functions
erated by the operator Qk=O' The inequalities (2) and (especially of functions of several complex variables). A
(3) are effectively employed in the study of systems useful completion of the theorem is the C-convex hull
with a spontaneous symmetry breaking: In such a case theorem [2]. Let, under the conditions of Bogolyubov's
thermodynamic averages must be considered in the theorem, C=C+ U C-, C- = -C+, where C+ IS a
framework of the method of quasi-averages (cf. Quasi- convex sharp cone; then
averages, method of). Bc(O) C ReH(T;i U i)),
Similar inequalities are applicable to Green functions
in classical statistical mechanics, where the respective where H(D) is the envelope of holomorphy (cf. Holo-
commutators 'become' Poisson brackets. morphic envelope) of D, ReD is the real section of the
Bogolyubov's inequalities made it possible to estab- domain D, and Be(O) is the C-convex hull of the set
lish several relations concerning model systems of sta- 0, i.e. the smallest open set containing 0 and having
tistical physics, to study the problem of ordering in the following property: If the points x' and x" of
infinite systems, etc. For references see Bogolyubov Be(O) can be connected by a C-similar curve that is
theorem. totally contained in Be(O), then all C-similar curves
A.M. Kurbatov homotopic to it are located in Be( 0).
AMS 1980 Subject Classification: 82A 15 References
[I] BOGOLYUBOV, N.N. and SHIRKOV, D.V.: Introduction to the

BOGOLYUBOV THEOREM - I) Bogolyubov's edge- theory of quantized fields, Interscience, 1959 (translated from
the Russian).
of-the-wedge theorem is a generalization of the principle [2] VLADIMIROV, V.S.: Methods of the theory offunctions of sl!\Ieral
of analytic continuation, in particular to the case of complex variables, M.LT., 1966 (translated from the Russian).

412
BOHL ALMOST-PERIODIC FUNCTIONS

(3) MARTINEAu, A.: 'Distributions et valeurs au bord des fonc- ticles in which a spontaneous symmetry breaking mani-
tions holomorphes', in Theory of distributions. Proc. internat.
summer inst., Lisbon, 1964, pp. 193-326. fests itself in various ways: the Heisenberg model with
(4) KOMATSU, H. (ED.): Hyperfonctions and pseudodifferentiaJ equa- ferromagnetic, anti-ferromagnetic and ferrimagnetic
tions. Proc. Con! Katata, 1971, Springer, 1973. ordering, systems of superfluid and superconducting
(5) RUDIN, W.: Lectures on the edge-ofthe-wedge theorem, Amer. types and systems with crystalline ordering.
Math. Soc., 1971.
V.S. Vladimirov The appearance of singularities in the Green func-
Editorial comments. Additional generalizations of the tions as q~O is connected with the presence of a
edge-of-the-wedge theorem include the case where branch of collective excitations in the energy spectrum
Cn (- C)= 0, see [5], and the case where the 'edge' of of the system that corresponds with spontaneous sym-
the 'wedge' is a totally-real hypersurface in en
instead of an metry breaking under certain restrictions on the
open set in {lmz=O} in en,
see [A1]. For cones interaction potential.
C= C+ U C- , where C+ is convex, a C-similar curve in 0 The nature of the energy spectrum of elementary
is defined as a differentiable curve C(t): [0, 1]~O such that excitations may be studied with the aid of the mass
C '(t)EG. operator inequality constructed for Green functions of
References type (1)_ In the case of Bose systems, for a finite tem-
[A1] BEDFORD, E.: 'Holomorphic continuation at a totally real
edge', Math. Ann. 230 (1977), 213-225. perature (8=kB T>0), this inequality has the form:
2) Bogolyubov's theorem on singularities of type 1/ q2. I ~1I (0, q) - ~12 (0, q) I 0;;;; constq2. (2)
A theorem in statistical mechanics on the asymptotic If q = 0, formula (2) yields a generalization of the so-
behaviour of Green functions (cf. Green function) in called Hugenholtz - Pines formula to finite tempera-
the limit of small momenta (q~O) for Bose and Fermi tures_ If one assumes that the mass operator is regular
systems with a gauge-invariant interaction potential. in a neighbourhood of the point (E =0, q= 0), then one
Presented in 1961 by N.N. Bogo1yubov [1]. can use (2) to prove the absence of a gap in the
The following inequality is valid for systems of (phonon-type) excitation energy spectrum.
several interacting particles in the case of a degenerate In the case of zero temperatures (8=0) inequality (1)
statistical equilibrium state for the two-time tempera- establishes a connection between the density of the
ture commutator Green functions (cl. Green function in continuous distribution of the particle momenta and
statistical mechanics) in the energy representation: the minimum energy of an excited state.
I a q , at E=O I ~ constq-2, (1) Relations of type (1) should also be valid in quantum
a at
where q , are the annihilation and creation opera- field theory, in which a spontaneous symmetry break.-
tors of a particle with momentum q. ing (at a transition between two ground states) results
The singularities in the Green functions specified in [4], [5] in an infinite number of particles of zero mass
Bogo1yubov's theorem which appear when q~O (Goldstone's theorem), which are interpreted as singular-
correspond to elementary excitations in the physical ities for small momenta in the quantum field Green
system under study. Bogo1yubov's theorem also predicts functions_ Bogolyubov's theorem has been applied to a
the asymptotic behaviour for small momenta of relativistic quantum-field model with a spontaneous
macroscopic properties of the system which are con- symmetry breaking [6].
nected with Green functions by familiar theorems. References
Thus, according to (1), in the case of superfluid Bose (1) BOOOLYUBOV, N.N.: Selected woria, Vol. 3, Kiev, 1970 (in
Russian).
or Fermi systems, the particle density distribution w(q) (2) SADOVNIKOV, B.1. and FEDYANIN, V.K.: 'N.N. Bogoljubov's
tends to infinity as q~O at a rate which is not slower inequalities in systems of many interacting particles with bro-
than 1 / ri. In such a case the degeneracy of the state ken symmetry', Teoret. Mat. Fiz. 16, no. 3 (1973), 368-393 (in
Russian). English abstract.
of statistical equilibrium is due to the conservation law (3) BoooLYUBOV, JR., N.N. and SADOVNIKOV, B.I.: Some ques-
of the total number of particles, i.e. to the invariance of tions in statistical mechanics, Moscow, 1975 (in Russian).
the Hamiltonian of the system with respect to gauge (4) GoWSIONE, J.: 'Field theories with 'superconductor' solu-

transformations. However, similar singularities appear tions', Nuovo Cimento (10) 19 (1961), 154-164. Italian abstract.
(5) GoWSIONE, J., SALAM, A. and WEINBERG, S.: 'Broken sym-
in the respective Green functions, and hence also in the metries', Phys. Rev. (2) 127 (1962), 965-970.
correlation functions characterizing systems with other (6) KAzANSKIi, A.K.: Teoret. Mat. Fiz. n, no. 3 (1975), 418-421.
kinds of degeneracy caused by certain additional con- A.M. Kurbatov
servation laws, i.e. by the invariance of the Hamil- AMS 1980 Subject Classification: 32015, 46F20,
tonian of the system with respect to certain transforma- 82A15
tions. Bogolyubov's theorem has a number of non-
trivial physical consequences, involving problems of BOllI.. ALMOST-PERIODIC FUNCfIONS - A class
long-range ordering in systems of many interacting par- of functions the typical property of which is that they

413
BOHL ALMOST-PERIODIC FUNCTIONS

can be uniformly approximated on the whole real axis particular the Bohr almost-periodic function
by generalized trigonometric polynomials of the form f (x) / g(x) is U-almost-periodic, under the condition
inf
-oo<x<oo
I g(x) I > Y > O.
where n 1, . . . ,nkare arbitrary integers, while If f (x) is U-almost-periodic and if / (x) is uniformly
al> ... ,ak are given real numbers. This class of func- continuous on ( - 00, 00), then / (x) is U-almost-
tions contains the class of continuous 2'IT-periodic func-
periodic; the indefinite integral F(x)= (f(t)dt is U-
tions and is contained in the class of Bohr aImost-
almost-periodic if F(x) is a bounded funcOtion.
periodic functions. P. Bohl specified several necessary
and sufficient conditions for a function to be almost- References
(1] BOHR, H.: 'Zur Theorie der fastperiodischen Funktionen 1',
periodic. In particular, any function of the type Acta Math. 45 (1925), 29-127.
f(x) = fl(X)+ ... +Jk(x), [2] LEVITAN, B.M.: Almost-periodic junctions, Moscow, 1953 (in
Russian).
where each one of the functions f1 (x), ... ,jk(x) is E.A. Bredikhina
continuous and periodic (with possibly different Editorial comments. Bohr's treatise [A1] is a good refer-
periods), is a Bohl almost-periodic function. ence. An up-to-date reference is [A2].
References References
[1] BOHL, P.: Uber die Darstellung von Funktionen einer Variabeln [A 1] BOHR, H.: Almost periodic functions, Chelsea, 1947
durch trigonometrische Reihen mit mehreren einer Variabeln pro (translated from the German).
portionalen Argumenten, Dorpat, 1893. Thesis. [A2] CORDUNEANU, C.: Almost periodic functions, Wiley, 1968.
[2] BOHL, P.: 'Ueber eine Differentialgleichung der
StOrungstheorie', J. Reine Angew. Math. 131 (1906), 268-321. AMS 1980 Subject Classification: 42A75, 43A60
[3] LEVITAN, B.M.: Almost-periodic junctions, Moscow, 1953 (in
Russian). BOHR COMPACTIFICATION - The space X of max-
E.A. Bredikhina
imal ideals of the algebra of Bohr almost-periodic func-
Editorial comments. A very well-known reference for tions (d. Banach algebra; Bohr almost-periodic func-
this kind of topic is [A 1].
tions). The Bohr almost-periodic functions form a
References commutative C' -algebra A over the reals R. The alge-
[A1] BOHR, H.: Almost periodiC functions, Chelsea, 1947
(translated from the German).
bra A is isometrically isomorphic to the algebra C (X)
of all continuous functions on the compactum X. The
AMS 1980 Subject Classification: 42A75, 43A60
real line R is naturally imbedded in X as an
BOHR ALMOST-PERIODIC FUNcnONS, uniform everywhere-dense subset (however, this imbedding is
almost-periodic functions - The class U-a.-p. of almost- not a homeomorphism). The compactum X has the
periodic functions. The first definition, which was given structure of a connected compact group which is identi-
by H. Bohr [1], is based on a generalization of the con- fied with the group of characters of the real line if the
cept of a period: A continuous function f(x) on the latter is considered with the discrete topology. This
interval (- 00,00) is a Bohr almost-periodic function if isomorphism between the algebra of almost-periodic
for any (>0 there exists a relatively-dense set of (- functions and the algebra of all continuous functions
almost-periods of this function (d. Almost-period). In on the Bohr compactification makes it possible to sim-
other words, f (x) is U-almost-periodic (or EO U-a.-p.) if plify the proofs of a number of classical theorems. The
for any (>0 there exists an L =L() such that in each concept of a Bohr compactification is also meaningful
interval of length L there exists at least one number T for the algebras of almost-periodic functions on other
such that groups. In the case of the set of conditionally-periodic
functions with n independent fixed periods, the n-
I f(x+T)-f(x) I <l, -oo<x<oo. dimensional torus with these periods acts as the Bohr
If L(), (~O, is bounded, a Bohr almost-periodic func- compactification.
tion f (x) becomes a continuous periodic function. References
Bochner's definition (cf. Bochner almost-periodic func- [1] LOOMIS, L.H.: An introduction to abstract harmonic analysis, v.
tions), which is equivalent to Bohr's definition, is also Nostrand, 1953.
E.A. Gorin
used in the theory of almost-periodic functions. Func-
Editorial comments. Instead of Bohr compactification
tions in the class of U-almost-periodic functions are
the term Bohr compactum is also used.
bounded and uniformly-continuous on the entire real
There are several other ways to characterize the Bohr
axis. The limit f (x) of a uniformly-convergent sequence compactification and to prove its existence; see e.g. [A 1].
of Bohr almost-periodic functions {j,,(x)} belongs to The most abstract one is the characterization of the Bohr
the class of U-almost-periodic functions; this class is compactification of a topological group G as the reflection
invariant with respect to arithmetical operations (in of G in the category of all compact Hausdorff topological

414
BOLTZMANN DISTRIBUTION

groups; its existence is then guaranteed by the adjoint func- an arbitrary subdivision a=xo<xl<'" <xn=b of
tor theorem. This approach is also used for semi-groups [a, b], for an arbitrary selection of points I = {~;} 7,
and leads to several other types of compactifications (e.g. ~i E[Xi -I, x;], and an arbitrary t, the following sum is
the weakly almost-periodic compactifications). See [A2]. constructed:
References n
[A 1] ALFSEN, E.M. and HOLM, P.: 'A note on compact representa- I(t) = ~/(~i+t)[Xi-Xi-d.
i=l
tions and almost periodicity in topological groups', Math.
Scand. 10 (1962), 127-136. If, for p = maxi (Xi - Xi -I )~O, I (t) converges in measure
[A2] BERGLUND, J.F., JUNGHENN, H.D. and MILES, P.: Compact
to a definite limit I, the number I is said to be the Boks
right topological semigroups and generalizations of almost
periodicity, Springer, 1978. integral (B-integral) of f over [a, b]. Thus, the Boks
integral is an integral of Riemarm type and is a gen-
AMS 1980 Subject Classification: 43A60, 42A71
eralization of the Riemann integral.
The Boks integral represents a considerable extension
BOHR - FAVARD INEQUALITY - An inequality
of the Lebesgue integral: Any summable function is B-
appearing in a problem of H. Bohr [1] on the bounded-
integrable and these integrals coincide, but there exist
ness over the entire real axis of the integral of an
non-summable B-integrable functions; in particular, if g
almost-periodic function. The ultimate form of this ine-
is the function conjugate with a summable function f,
quality was given by J. Favard [2]; the latter materially
then it is B-integrable and the coefficients of the series
supplemented the studies of Bohr, and studied the arbi-
conjugate with the Fourier series of f are the coeffi-
trary periodic function
cients of the respective Fourier series (in the sense of
00
I(x) = ~ (ak coskx +bk sinkx) B-integration) of g (A.N. Kolmogorov). The theory of
k =n the Boks integral was not further developed, since the
with continuous derivative fr)(x) for given coastants r A-integral proved to be more convenient for the
and n which are natural numbers. The accepted form of integration of functions conjugate with summable func-
the Bohr- Favard inequality is tions.
lillie ~ K Ilf') lie, References
[1] BOKs, T.J.: 'Sur les rapports entre les methodes de l'integration
II I II e = XE[O.2"J
max II (x) I, de Riemann et de Lebesque', Rend. Cire. Mat. Palermo (2) 45
(1921),211-264.
with the best constant K=K(n, r): [2) ZYGMUND, A: Trigonometric series, 1-2, Cambridge Univ.
Press, 1979.
K = Ilf~Pc"1 II I Ik I.A. Vinogradova

The Bohr- Favard inequality is closely connected with AMS 1980 Subject Classification: 26A39
the inequality for the best approximations of a function BOLTZMANN DISTRIBUTION - The statistical
and its roth derivative by trigonometric polynomials of equilibrium distribution function f(p, r) of the momenta
an order at most n and with the notion of p and coordinates r for the particles of an ideal gas, the
Kolmogorov's width in the class of differentiable func-

,,+ .fm: l
molecules of which obey the laws of classical mechan-
tions (d. Width).
ics, in an external potential field:
References
[1] BOHR, H.: 'Un theoreme general sur l'integration d'un
polynome trigonometrique', c.R. Acad. Sci. Paris Ser. I Math.
200 (1935), 1276-1277.
ftp,,) ~ A
U
T (') (1)

[2] FAVARD, 1.: 'Sur les meilleurs procedes d'approximation de


certaines classes des fonctions par des polynomes Here k is the Boltzmann constant (a universal constant:
trigonometriques', Bull. Sci. Math. (2) 61 (1937), 209-224; 243-
256.
k = 1.38 times 1O- 16 erg / degree), T is the absolute
[3) AmEZER, N.!. [N.J. AKmEZER]: Vorlesungen fiber Approxima temperature, .; /2m is the kinetic energy of the parti-
tionstheorie, Akademie-Verlag, 1953 (translated from the Rus- cle, U(r) is the potential energy of the particle in the
sian).
LV. Talkov field, and the constant A is defined by normalization
over a dimensionless phase volume:
AMS 1980 Subject Classification: 26D05
j jl(P, r) d 3: :3r = N.
BOKS INTEGRAL - One of the generalizations of
the Lebesgue integral, proposed by A. Denjoy (1919) Here N is the total number of particles, h is the Planck
and studied in detail by T.J. Boks (1921). A real-valued constant (a universal constant
function f on a segment [a, b] is periodically extended h = 6.62 X 10- 27 erg X sec),
(with period length b - a) to the entire straight line. For d 3 p = dpx dPy dpzo d 3r = dx dy dz.

415
BOLTZMANN DISTRIBUTION

A can also be defined by the condition of normaliza- where t:.U= U(rl)- U(r2). A particu1ar case of (4)
tion in the space of velocities and coordinates, which is yields the barometric formula, which defines the particle
more usual in the kinetic theory of gases: densities in the gravity field above the surface of the
j jf(P, r)d3vd3r = N, Earth:
(5)

v =.R..., d 3 v = dvx dvy dvz where g is the acceleration of gravity, m is the mass of
m
the particle, z is the altitude above the Earth's surface,
The Boltzmann distribution is a consequence of the and no is the density at z =0.
Boltzmann statistics for an ideal gas, and is a particular The Boltzmann distribution of a mixture of several
case of the Gibbs distribution gases with different masses shows that the partial den-
nUl!. rio .Pt;. rN) sity distributions of the particles for each individual
P(PI, rl, ... ,PN, rN) = ze
1 kT
component is independent of that of other components.
For a gas in a rotating vessel, U(r) is the field of the
for an ideal gas, when
centrifugal forces:
PT + ~U(ri)'
= ~, 2m
H
, U(r) = _ m~2r ,
and the canonical Gibbs distribution becomes the pro- where w is the angu1ar velocity of rotation.
duct of the Boltzmann distributions for individual par- For references, see Boltzmann statistics.
ticles. The Boltzmann distribution is the limiting case D.N. Zubarev
of quantum statistics for an ideal gas at sufficiently
AMS 1980 Subject Classification: 82A40
high temperatures, when quantum effects can be
neglected. The average occupation number of the i-th BOLTZMANN EQUATION - An equation in kinetic
quantum state of a particle is gas theory, proposed by L. Boltzmann for the determi-
(2) nation of the single-particle distribution function of an
ideal mono-atomic gas [1]. In dimensionless variables
where f.j is the energy corresponding to the i-th quan-
the equation has the form:
tum state of the particle and p. is the chemical potential
defined by the condition ~;;=N F'lrmu1a (2) is valid Ma +(v, 'Yxj)+(F, 'Y.j)
t
= l.L(f,j).
(
(*)
for temperatures and densities at which the average dis-
tance between the particles is larger than the ratio Here f (x, v, t) is the density of the distribution func-
between the Planck constant h and the modulus of the tion of the number of particles in the phase space x v,
average thermal velocity x is the three-dimensional space coordinate, v is the
velocity, t is the time, F is the field strength of the
[~ f/3 lmu. external forces, and f. is a dimensionless parameter
(which is proportional to the ratio between the average
distance covered by the particles between successive
The MaxweU distribution is a special case of the collisions and the typical scale of the phenomena under
Boltzmann distribution (1) for U=O: consideration). In the simplest case the collision opera-
3/2
f() = N [ ~) -p'/2mkT
(3)
tor has the form:
P V 2wkT e .
L(f,j) = j[f(v')j(v'I)-f(v)f(vI)] I v -VI I dwdv lo
The distribution function (I) is sometimes referred to
where v I and v are the velocities of the molecules
as the Maxwell-Boltzmann distribution, the term
before collision, V'I and v' are the velocities of the
Boltzmann distribution being reserved for the distribu-
molecules after collision, and dw is the solid angle ele-
tion function (1) integrated over all momenta of parti-
ment in the direction of the vector VI -v.
cles representing the density of the number of particles
In deriving the Boltzmann equation it is assumed
at the point r:
(4) that the evolution of the function f (x, v, t) is deter-
where no is the density of the number of particles mined by its value at a given moment of time t and by
corresponding to the point at which U=O. The relative the pairwise collisions between the gas molecules, and
densities of the number of particles at different points that the time of interaction between two gas molecules
depend on the differences between the potential ener- during collision is much shorter than the time during
gies at these points: which they move independently of each other. From
the mathematical point of view the derivation of the
~ = e-I!.U/kT,
n2 Boltzmann equation is based on a certain algorithm

416
BOLTZMANN H-THEOREM

which constructs the operator L in accordance with the operator Lo is non-positive, <cp, Locp> ";;0, and can be
well-known laws of motion of two gas molecules which written as
Lo(cj = -P(c)<p+Gcj>,
collide with one another.
In equation (*) the range of variation of the variable where v(c) (which is sometimes called the collision fre-
t is the half-line t ~o; the range of variation of v is the quency) is a multiplication operator acting on cp and G
entire space R3; and the range of variation of x is a is a completely-continuous integral operator. The func-
subspace n in R3 (n may coincide with R 3). In accor- tion v( c) and the kernel of G have the following form
dance with its physical meaning, the function f (x, v, t) for the hard-sphere model [1]:
should be non-negative and such that
j!(x, v, t)v 2 dv < 00.
P(c) = <W [l.e-
2
c2
/ 2 + [I e I+_1_Jl
21 Ie
jCl e - da]
0
a2

'

The simplest boundary condition on an has the form


!(v -2n(n, v), x, t) = !(v, x, t), XEan, vER 3,
G(e, e') = 471" exp { -I e-e' 12 _ Ie 12_1 e' 12 }_
le-e'l 4 4Ie-e'12
where n is the normal to an. There exist a number of
rigorous statements of the Cauchy problem for equa- -271"1 e-e' I exp { _ e2~e'2}
tion (*).
References The existence theorem for the solution of the Cauchy
[I] BOLTZMANN, L.: Lectures on gas theory, Univ. of California problem as t-'?oo and E:-'?O was proved for the linear-
Press, Berkeley, 1964. ized Boltzmann equation, and the dispersion equation
[2] BOGOLYUBOV, N.N.: 'Selected works', Vol. 2, Kiev, 1970
(in Russian). was studied. The equation is principally employed in
[3] CHAPMAN. S. and COWLING, T.G.: The mathematical theory of the molecular acoustics of ideal gases. The equation
nonuniform gases, Cambridge Univ. Press, 1939. yields correct values for the transfer coefficients (viscos-
A.A. Arsen'ev ity, thermal conduction, velocity of sound) and the
Editorial comments. Stokes - Kirchhoff law of ultra-sound absorption.
References References
[A1] CERCIGNANI, c.: Theory and application of the Boltzmann
[I] CARLEMAN, T.: Probleme mathematique de la theorie kimitique
equation, Scottish Acad. Press, 1975.
des gases, 1960 (in Russian; translated from the French).
[A2] CERCIGNANI, C. (ED.): Kinetic theories and the Boltzmann
[2] ARsEN'EV, A.A.: 'The Cauchy problem for the linearized
equation, Springer, 1984.
Boltzmann equation', USSR Comput. Math. Math. Phys. 5, no.
AMS 1980 Subject Classification: 82A40 5 (1965), 116-136. (Zh. Vychisl. Mat. i Mat. Fiz. 5, no. 5
(1965), 864-882)
A.A. Arsen'ev
BOLTZMANN EQUATION, LINEARIZED, in kinetic
gas theory - A linear integro-differential equation which Editorial comments.
approximately describes the evolution of the one- References
particle distribution function of a sufficiently-rarefied [A 1] CERCIGNANI, C.: Theory and application of the Boltzmann
equation, Scottish Acad. Press, 1975.
gas without internal degrees of freedom for small devia- [A2] Kinetic theories and the Boltzmann equation, Springer, 1984.
tions from equilibrium.
This equation, AMS 1980 Subject Classification: 82A40
acj> _ 1
-at
+ <e, 'VA> - -Lo(cj,
( BOLTZMANN H-THEOREM - One of the conse-
is obtained from the Boltzmann equation quences of the kinetic Boltzmann equation, according to
which the function
H. +<e, 'V x! > -_ -Lif,!)
by substituting
at
1
( H(t) = jh(t, r)dr = j!ln!dpdr,
is a non-increasing function of the time t. Here
! = 71"-3/2 e -c 2
+p.e_c 2 /2cj>
f= f(t, r, p) is the dimensionless classical single-particle
and equating the terms in which the parameter J.L distribution function of the coordinates r and the
appears in the first degree. The operator Lo is said to momenta p, which satisfies the Boltzmann equation.
be the linearized collision operator. The linearized The evolution of the density h (t, r) with time is deter-
Boltzmann equation gives a satisfactory description of mined by the relaxational nature of the evolution of the
the evolution of the distribution function only if function f towards the local MaxweU distribution, while
sup I !(x, e, t)_71"-3/2 e -c 2 I 1.
the limit value of the H-function as t-'?oo is equal to
X,c,t the entropy of the ideal gas, calculated according to
If certain very general assumptions are made, the Gibbs, with the opposite sign. If the increments of t

417
BOLTZMANN H-THEOREM

under consideration are much larger than the time probable. The number of different modes of distribu-
needed for the local Maxwell distribution to be esta- tion of N particles over M cells of size Gj containing N j
blished, the quantity (- h (t, r)) may be identified with particles each is
the entropy density, while (- H(t)) may be identified GNi

with the non-equilibrium entropy of an ideal gas. WB ( N ... ) = N! II -'-" N = ~N;.


1.. ; .. M N;. ;
From the point of view of statistical mechanics, the
It is assumed that the particles are totally independent,
principal significance of the Boltzmann H-theorem con-
that they are distinguishable and that the state remains
sists in the mathematical expression of the fundamental
unchanged by rearrangements of particles within the
assumptions of macroscopic thermodynamics, accord-
same cell. In Boltzmann statistics this magnitude
ing to which, for example, an isolated system spontane-
defines the statistical weight, or the thermodynamic
ously tends to the state of thermodynamic equilibrium,
probability, of a state (unlike in ordinary probability, it
the process being accompanied by an increase in
is not normalized to one). In computing the statistical
entropy.
weight it is assumed that a rearrangement of identical
H-theorems are statements analogous to the original
particles does not result in a change of state, and for
Boltzmann H-theorem, but formulated for statistical
this reason the phase volume WB must be divided by a
systems of a different or a more general type, including
factor N!: WB
the case of non-ideal and quantified systems.
W("'N"') =N!
-.
The theory was presented by L. Boltzmann in 1872.
References Phases with volume reduction as above are said to be
[I) SOMMERFELD, A.: Thermodynamics and statistical mechanics, generic phases (as distinct from the original specific
Academic Press, 1956 (translated from the German).
phases).
[2) UHLENBECK, G.E. and FORD, G.V.: Lectures in statistical
mechanics, Amer. Math. Soc., 1963. All microscopic states with different particle distribu-
I.A. Kvasnikov tions over phase cells when the number of particles
AMS 1980 Subject Classification: 82A05 M
N = LN;
i=1
BOLTZMANN STATISTICS - Statistics used in a
system of non-interacting particles obeying the laws of and the total energy
classical mechanics (a classical ideal gas). The distribu-
tion of particles of an ideal gas (since they do not ;=1
interact) is not considered in the phase space of all par-
(fj is the energy of the particles in the i-th cell) are
ticles (the r-space) as in Gibbs statistical mechanics (d.
given, correspond to the same macroscopic state.
Gibbs distribution), but in the phase space of coordi-
It is assumed that the distribution of the particles in
nates and momenta of one particle (the JL-space). This
a state of statistical equilibrium is the most-probable
is because for an ideal gas the phase volume is
distribution, i.e. corresponds to the maximum
preserved in the JL-space (a special case of one of the W( ... N j ) for a given number of particles N and
Liouville theorems). energy E. The problem of an arbitrary extremum
According to the Boltzmann statistics, this phase W( ... N j ) for given Nand E yields the following
space is subdivided into a large number of small cells Boltzmann distribution for the average number of parti-
with a phase volume such that each cell contains an
cles in a cell:
even larger number of particles N j , and all possible dis- n = N; = e<p-<,)/kT
tributions of the particles over these cells are con- I Gi '

sidered. The phase volume Gj of the i-th cell is its where k is the Boltzmann constant (a universal con-
volume in the JL-space in h 3 -units, where h is the Planck stant k = 1.38 X 10- 16 erg / degree), T is the absolute
constant (a universal constant temperature, and JL is the chemical potential, defined by
h=6.62X 1O- 27 ergXsec). The meaning of such a
the condition N = ~.Nj. In the particular case of a
dimensionless Gj is the maximum number of possible I

potential field U (r):


micro-states in cell i, since the smallest value of the
product of each pair of coordinates and momenta is pl
(, 2m + U(r,),
equal to h according to quantum mechanics, and the
particle has three degrees of freedom. Boltzmann statistics is a special case of Gibbs statistics
Statistical mechanics is based on the assumption that - the canonical ensemble for a gas consisting of non-
all microscopic states corresponding to a given total interacting particles. The Boltzmann statistics is the
energy and a given number of particles are equally limit case of Fermi - Dirac statistics and

418
BOLZANO - WEIERSTRASS SELECTION PRINCIPLE

Bose- Einstein statistics at sufficiently-high tempera- of Mayer's problem; in the classical calculus of varia-
tures, when quantum effects can be neglected. tions it is considered in the form of Lagrange's prob-
The Boltzmann statistics was proposed by L. lem. The most extensively used topology is that of the
Boltzmann in 1868 - 1871. space C 1 of continuously-differentiable functions. To
References obtain necessary or sufficient conditions for the
[I] MAYER, J.E. and GoEPPERT-MAYER, M.: Statistical mechanics, extremum, smoothness requirements must be imposed
Wiley, 1940. on the functions f and g involved in the definition of
[2] SOMMERFELD, A.: Thermodynamics and statistical mechanics,
Academic Press, 1956 (translated from the German). the problem and on the mappings q, and l/I, as well as
[3] ScHRODINGER, E.: Statistical thermodynamics, Cambridge requirements concerning the regularity of these map-
Uniy. Press, 1948. pings, i.e. the matrices
[4] FOWLER, R. and GUGGENHEIM, E.: Statistical thermodynamics,
Cambridge Uniy. Press, 1960. (al/l / at 1> al/l / ax 1, al/l / at 2, al/l / ax 2) and (aq, / ax)
D.N. Zubarev must have maximal rank p and m, respectively. For the
AMS 1980 Subject Classification: 80A 15 vector function x(t) to give an extremum in the BoIza
problem (and, similarly, in the Lagrange or Mayer
problem), it is necessary that is satisfies also the Euler
BOLZA PROBLEM - One of the main problems in
equation and the Weierstrass conditions (for a varia-
the classical calculus of variations on the conditions for
tional exfremmn) with respect to the Lagrange function
an extremum, given some contraints on the type of
constructed using the data of the problem with
equations; formulated in 1913 by O. BoIza. The prob-
Lagrange multipliers, as well as the Jacobi condition
lem consists of the minimization of the functional
and the transversaiity condition.
I,
The notation employed in the above formulation of
J(x) = jJ(/, x, x)dl+g(tJ, X(tl), 12, X(t2)),
I, the BoIza problem is that used in optimal control
J:RXRnXRn ~ R, g:RXRnXRXRn ~ R, theory. In the classical calculus of variations the BoIza
problem is formulated using a different notation:
in the presence of differential constraints on the type of x,
the equation: J(y) = jJ(x,y,y')dx+g(XJ,Y(Xl),X2,Y(X2)),
.p(/,x,x)=O, q,:RXRnXRn~Rm, m<m, XI

and the boundary conditions


!J1h, X(/l)' 12, X(/2)) = 0, 1/1: RXRn XRXRn ~ RP, References
[I] BLISS, G.A.: Lectures on the calculus of variations, Chicago
P ..;;; 2n+2. Uniy. Press, 1947.
I.B. Vapnyarskir

lem, if f
If g-O, the problem is known as the Lagrange prob-
and p<2n +2 it is known as the Mayer
problem. A special feature of the BoIza problem is the
AMS 1980 Subject Classification: 49BXX

mixed nature of the functional, which is represented as BOLZANO- WEIERSTRASS SELECfION PRIN-
the sum of an integral functional and a function on the CIPLE - A method of proof which is frequently
boundary. In principle, the BoIza problem is equivalent employed in mathematical analysis and which is based
to the Lagrange problem, and in fact leads to it if one on successive subdivision of a segment into halves, after
sets which the segment having some property is chosen as
I,
the new, initial segment. This method may be
Jl(x) = j(f(/, x, x)+xn+l)dl,
I, employed if the nature of the segments is such that the
fact that the property is present in some segment
Xn+l = 0, Xn+l(tl) = ( g ) implies that at least one segment obtained by halving
12 -II
the original segment will also have this property. For
It also leads to the Mayer problem if one sets
instance, if the segment contains infinitely-many points
J 2 = XO(t2)+g(tJ, x(tl), 12, X(t2)), of some set, or if some function is not bounded on the
where segment, or if a non-zero function assumes values of
Xo = J(/, x, x), XO(tl) = 0. opposite sign at the two ends of the segment - all
The choice of a given form of the problem, and the these are properties of this type. The
choice of the topology within which the problem is then BoIzano - Weierstrass selection principle can be used to
considered, is determined by convenience or by expec- prove the Bolzano - Weierstrass theorem and a number
tion in each particular case. In the theory of optimal of other theorems in analysis.
control the problem is usually considered in the form Depending on the criterion according to which the

419
BOLZANO - WEIERSTRASS SELECTION PRINCIPLE

segments are chosen in applying the was later also independently deduced by K. Weier-
Bolzano - Weierstrass selection principle, the process strass.
obtained is effective or ineffective. An example of the References
former case is the application of the principle to prove [1] BOLZANO, B.: Abh. Biochem Ges. Wiss. (1917).
that for a continuous real function that assumes values L.D. Kudryavtsev
of opposite sign at the ends of a given segment, this AMS 1980 Subject Classification: 26A03
segment contains a point at which the function van-
ishes (cf. Cauchy theorem on intermediate values of BONNESEN INEQUALITY - One of the more pre-
continuous functions). In this case the criterion chosen cise forms of the isoperimetric inequality for convex
for the successive choice of the segments is that the domains in the plane. Let K be a convex domain in the
function assumes values of different sign at the two plane, let r be the radius of the largest circle which can
ends of the chosen segment. If there is a way of com- be inserted in K, let R be the radius of the smallest cir-
puting the value of the function at every point, then, cle containing K, let L be the perimeter and let F be
after performing a sufficient number of steps, it is pos- the area of K. The Bonnesen inequality [1]
sible to obtain the coordinates of the point at which the
function vanishes, to within a given degree of accuracy. Il = L2_4'lTF;;;' .,r.(R -r)2
Thus, in addition to proving that a root of the equation is then valid. The equality A=O is attained only if
f(x)=O exists on a segment at the ends of which the R = r, i.e. if K is a disc. For generalizations of the Bon-
values of the function are of different sign, one also has nesen inequality see [2].
a method of approximately solving this equation. An References
example of an ineffective process is the use of the [I] BONNESEN, T.: 'Ueber eine Verschlirferung der isoperimetische
Bolzano - Weierstrass selection principle to prove that Ungieichheit des Kreises in der Ebene und auf die
Kuge10berflache nebst einer Anwendung auf eine Min-
a continuous real function on a segment attains a max- kowskische Ungieichheit fUr konvexe K6rper', Math. Ann. 84
imum on the segment. Here, the segment chosen in the (1921), 216-227.
successive subdivisions is the one on which the max- [2] DISKANT, V.I.: 'A genera1ization of Bonnesen's inequalities',
Soviet Math. Dokl. 14, no. 6 (1973), 1728-1731. (Dokl. Akad.
imum of the values of the function is not less than that
Nauk SSSR 213, no. 3 (1973), 519-521)
on the other one. If, as in the former case, it is possible A.B. Ivanov
to calculate the value of the function at any point, this AMS 1980 Subject Classification: 52A 10, 52A40
is still not sufficient for an effective choice of the
required segment. Accordingly, the BONNET NET - An isothermal net whose lines have
Bolzano - Weierstrass selection principle can be used in constant geodesic curvature. The square of the line ele-
this case only to prove an existence theorem which says ment in the parameters of this net is:
that the function assumes its maximum at some point, du 2 +dv 2
ds 2 = -==----'~
but not to specify this point within a given degree of (u+V)2 '
accuracy. where U= U(u), V= V(v). Studied by O. Bonnet in
There exist various generalizations of the 1848.
Bolzano- Weierstrass selection principle, e.g. to apply A.B. Ivanov
it in the n-dimensional Euclidean space (n = 2, 3, ... ) AMS 1980 Subject Classification: 53A05, 53820
to n-dimensional cubes, which are successively subdi-
vided into congruent cubes with side-lengths of one- BONNET THEOREM - 1) Bonnet's theorem on the
half that of the original cube. existence and the uniqueness of a surface with given first
L.D. Kudryavtsev
and second fundamental forms [1]. Let the following
AMS 1980 Subject Classification: 26A03 two quadratic forms be given:
E du 2 +2F du dv +G dv 2 ,
BOLZANO - WEIERSTRASS THEOREM - Each
bounded sequence of numbers contains a convergent L du 2 + 2M du dv + N dv 2 ,
subsequence. The theorem applies both to real and the first one of which is positive definite, and let the
complex numbers. It can be generalized to include coefficients of these forms satisfy the Gauss equations
more-general objects, e.g. any bounded infinite set in (cf. Gauss theorem) and the Peterson-Codazzi equa-
n-dimensional Euclidean space has at least one limit tions. Then there exists a surface, which is unique up to
point in that space. There exist analogues of this motions in space, for which these forms are, respec-
theorem for even more-general spaces. tively, the first and the second fundamental forms.
The theorem was demonstrated by B. Bolzano [1]; it 2) Bonnet's theorem on the diameter of an oval sur-

420
BOOLEAN ALGEBRA

face: If the curvature of an oval surface is larger than notation X, x' or -x may be employed instead of Cx.
or equal to 1 / A 2 at all its points, then the external The complement of an element in a Boolean algebra is
diameter of this surface is smaller than 7TA; this esti- unique.
mate cannot be improved. Stated by O. Bonnet in 1855. A Boolean algebra can also be defined in a different
A.B. Ivanov manner. Viz. as a non-empty set with the operations C,
Editorial comments. A proof of this theorem of Bonnet V, 1\ which satisfy the following axioms:
may be found in [A1] or [A2]. The Peterson-Codazzi equa- 1) xVY = Y V x, x 1\Y = Y I\x ;
tions are usually called the Mainardi- Codazzi equations, cf 2) xV(yVz)=(xVy)Vz, x I\(yl\z)=(xl\y)l\z ;
[A1], after G. Mainardi (1857) and D. Codazzi (1868), who 3) (xl\y)Vy=y, (xVy)l\y=y;
established them. 4) xl\(yVz)=(xl\y)V(xl\z),
References x V(y I\z)=(x Vy)l\(x V z);
[A1] BLASCHKE, W. and LEiCHTWErB, K.: Elementare Differential- 5) (xI\Cx)y=y, (xVCx)l\y=y.
geometrie, Springer, 1973. If this approach is adopted, the order is not assumed
[A2] CARMO, M. DO: Differential geometry of curves and surfaces,
Prentice-Hall,1976. to be given in advance, and is introduced by the fol-
3) Bonnet's theorem on the mean value, second mean- lowing condition: x~ if and only if x=xl\y.
value theorem [2r. Let f (x), Ij>(x) be integrable func- Other axiomatics are also possible. The axioms of a
tions on a segment [a, b] and let Ij>(x) be a positive Boolean algebra reflect the analogy between the con-
decreasing function of x; then there exists a number ~ cepts of a 'set', an 'event' and a 'statement'. The
in [a, b] for which the equality 'order' relation in a Boolean algebra can be interpreted
b ~
in various ways - as set-theoretical inclusion, as causal
jf(x)<p(x)dx = <f>(a)jf(x)dx follow-up of events, as logical follow-up of statements,
a etc.
is valid. If Ij>(x) is merely required to be monotone, In addition to the basic operations C, V, 1\, other
Bonnet's theorem states that there exists a point ~ in operations in a Boolean algebra can be defined; among
[a, b] such that these the symmetric difference operation is particularly
b ~ b important:
jf(x)<p(x)dx = <f>(a)jf(x)dx+<f>(b)jf(x)dx
a a ~
x +2.Y = (x I\Cy)V(v I\Cx).
is true. Alternative notations are xLly, I x -y I
References Any Boolean algebra is a Boolean ring with a unit
[lA] BONNET, 0.: J. Ecole Poly technique 24 (1865), 204-230. element with respect to the operations of 'addition'
[IB] BONNET, 0.: J. Ecole Poly technique 25 (1867), I-lSI.
[2] BONNET, 0.: 'Remarques sur quelques integrales definies', J.
(+2) and 'multiplication' (1\); any Boolean ring with a
Math. Pures Appl. 14 (1849), 249-256. unit element can be considered as a Boolean algebra.
T. Yu. Popova
Boolean algebras first arose in the studies of G.
Editorial comments. Bonnet's original article is [A1]. Boole [1], [2] as a tool of symbolic logic. They subse-
References quently found extensive application in other branches
[Al] BONNET, 0.: C.R. Acad. Sci. Paris 40 (1855),1311-1313. of mathematics - in probability theory, topology,
AMS 1980 Subject Classification: 53A05, 26A24, functional analysis, etc. The applications of Boolean
53C45 algebras to logic are based on the interpretation of the
elements of a Boolean algebra as statements (d. Alge-
BOOLEAN ALGEBRA, Boolean lattice - A partially bra of logic), the complement Cx being interpreted as
ordered set of a special type. It is a distributive lattice the negation of the statement x, and the operations 1\
with a largest element '1', the unit of the Boolean alge- and V as conjunction and disjunction, respectively.
bra, and a smallest element '0', the zero of the Boolean Closely related to logic is another field of application of
algebra, that contains together with each element x also Boolean algebras - the theory of contact schemes (cf.
its complement - the element Cx, which satisfies the Contact scheme). Boolean algebras are used in the
relations foundations of probability theory. A field of events, as
studied in probability theory, is a Boolean algebra; here
sup{x, Cx} = 1, inf{x, Cx} = O. the inequality x ~y means that an event y follows from
The operations sup and inf are usually denoted by the an event x; the '1', the '0' and the Boolean operations
symbols V and 1\, and sometimes by U and n V, 1\ and C are interpreted corespondingly.
respectively, in order to stress their similarity to the An example of a Boolean algebra is the system of all
set-theoretical operations of union and intersection. The subsets of some given set Q partially ordered by inclu-

421
BOOLEAN ALGEBRA

sion. Such a Boolean algebra is denoted by 2Q; its zero algebra have an extension to a homomorphism if and
is the empty set, and its unit is the set Q itself. The set only if E is an independent set, i.e. if all elements of the
Q \ x is the complement of an element x; the Boolean form
operations V and 1\ coincide with union and intersec- XI;\ ... ;\xp;\CXp+l;\ ... ;\Cxm , xjEE, Xj=l=Xk,
tion, respectively.
are non-zero. A Boolean algebra generated by an
Instead of the subsets of Q it is convenient to con-
independent system is called a free Boolean algebra.
sider their characteristic functions. The system XQ of
An example of a free Boolean algebra is the algebra
all such functions, with the natural order, is a Boolean
of Boolean functions in n variables considered above.
algebra, which is isomorphic to the Boolean algebra 2Q.
In such a Boolean algebra the operations V, 1\, C, and Independent generators of it are the functions
+2 are interpreted as follows: J;:J;(x) = J;(XI> ... ,xn ) = Xj.
(xVyXq) = max{x(q),y(q)}, Stone's theorem: Every Boolean algebra X is iso-
morphic to some algebra of sets, namely, the algebra of
(x;\y)(q) = min{x(q),y(q)} = x(q)y(q),
all open-and-c1osed sets of a totally-disconnected com-
(Cx)(q) = 1-x(q), pactum l(X), defined up to a homeomorphism. This
(x+1Y)(q) = I x(q)-y(q) I = x(q)+y(q) (mod 2), (qEQ). compactum is known as Stone's compactum. To a
homomorphism of a Boolean algebra X into a Boolean
The following cases are especially important:
algebra Y corresponds a topological imbedding of l( y)
1) Q=Qn={1, ... ,n}.
into l(X); to a subalgebra of a Boolean algebra X
In this case the characteristic functions of the subsets
there corresponds a continuous image of l(X). The
are 'two-valued symbols' of the form:
Stone compactum of a free Boolean algebra is a dyadic
O,
x = (XI> ... ,xn ), Xj= { 1, discontinuum.
A Boolean algebra X is called complete if any set
which are 2n in number. For n = 1, the two-element E e X has an upper bound sup E and a lower bound
Boolean algebra, consisting only of '1' and '0', is inf E. This is equivalent to l(X) being extremal (cf.
obtained. Extremally-disconnected space). Sub algebras of a com-
2) Q=XQ plete Boolean algebra containing the bounds of all their
In this case, all possible functions, defined on the subsets calculated in X are called regular subalgebras.
system of all binary symbols of length n, and taking the The weight of a Boolean algebra X is the lowest cardi-
values '0' and '1' only, are elements of X Q They are nality of a complete generating set, i.e. a set which is not
called Boolean functions in n variables (cf. Boolean contained in any regular subalgebra other than X. If
function). Every well-constructed formula of predicate the weights of all non-zero principal ideals are identi-
logic defines some Boolean function; if two functions cal, then the Boolean algebra is called uniform; such
are identical, the formulas are equivalent. algebras invariably contain a complete generating
Under certain conditions a subset E of elements of a independent set. In other words, a complete uniform
Boolean algebra X is itself a Boolean algebra with Boolean algebra can be 'stretched onto' a free Boolean
respect to the order induced from X. This is the case, algebra. The study of an arbitrary Boolean algebra
in particular, if: readily reduces to the study of uniform Boolean alge-
(a) E is a principal ideal, i.e. a set of the form bras. An incomplete Boolean algebra can be completed
{XEX: x:s;;;u}; the element u then acts as the unit '1'; in different ways, i.e. it can be imbedded as a subalge-
(b) E is a subalgebra of a Boolean algebra X. This bra in some complete Boolean algebra.
means that if x, y EE, it follows that A complete Boolean algebra is called normed if a
x Vy, X /\y, Cx EE. The '0' and '1' of the initial real-valued function J.L (a measure) is defined on it with
Boolean algebra are also the '0' and '1' of the new the following properties: 1) if x=O, then J.L(xO; 2) if
Boolean algebra. The Boolean subalgebras of 2Q are EeX and xl\y=O when x,YEE, x=y, then
especially important; they are called algebras of sets. p.(supE) = ~p.(x).
Any set E eX generates a certain subalgebra - the XEE

smallest sub algebra that contains E. In probability theory, in which normed Boolean alge-
Homomorphisms of Boolean algebras play a special bras are particularly important, it is usually assumed
role under the mappings of Boolean algebras; they are that J.L( I) = 1. Here, the value of J.L{x) is interpreted as
mappings which commute with the Boolean operations. the probability of an event x. The classical theory of
A bijective homomorphism of Boolean algebras is an measure and integral can largely be applied to normed
isomorphism. If a Boolean algebra X is generated by a Boolean algebras. Normed Boolean algebras have been
set E, then all mappings of E into an arbitrary Boolean completely classified [4], [5], [7]. In particular, for uni-

422
BOOLEAN FUNCTION

form normed Boolean algebras the only invariant is the BOOLEAN FUNCI10N, function of the algebra of
weight. Not all Boolean algebras can be normed. Many lOgiC - A function whose arguments, as well as the func-
conditions for the existence of a measure are known, tion itself, assume values from a two-element set (usu-
but these are far from exhaustive in the problem of n.
ally {O, I Boolean functions are one of the main sub-
nonning. jects of discrete mathematics, in particular, of
A Boolean algebra can be endowed with various mathematical logic and mathematical cybernetics.
topologies. A specially important one is the so-called Boolean functions first occurred in the mathematical
(0 )- topology, which, for a normed Boolean algebra, IS formulation of logical problems, and were named after
metrizable, and corresponds to the metric G. Boole, who laid the foundation for the applications
p(x,y) = JL(x/\Cy)V(Cx/\y)], of mathematics in logic in the middle of the nineteenth
and is identical with the Tikhonov topology for century; cf. Algebra of logic.
Boolean algebras of the form 2Q. In the most general One such problem is the construction of an algebra
case there need not be a topology compatible with the of propositions. To this purpose, one of the two values
order in a Boolean algebra. o ('false') or I ('true') is assigned to each proposition;
the principal logical relations 'and', 'or', 'not', 'if ...
References
[I] BOOLE, G.: The mathematical analysis of logic: being an essay then', etc., can then be regarded as the respective 'ele-
towards a calculus of deductive reasoning, Macmillan, 1847. mentary' Boolean functions: xl\y, xVy, ex, x~y, etc.
[2] BOOLE, G.: An investigation of the laws of thought, on which are When this is done, the value of any complex proposi-
founded the mathematical theories of logic and probabilities,
Dover, reprint, 1951. tion, constructed with the aid of the fundamental logi-
[3] SIKORSKI, R.: Boolean algebras, Springer, 1969. cal connectives from given propositions, is a Boolean
[4] VLADIMIROV, D.A.: Boolesche Algebren, Akademie Verlag, 1978 function of the values of these propositions. Such a
(translated from the Russian).
[5] liALMOS, P.R.: Lectures on Boolean algebras, v. Nostrand,
Boolean function is a composition of elementary
1963. Boolean functions corresponding to the logical connec-
[6] RASIOWA, E. and SIKORSKI, R.: The mathematics of tives forming part of the complex proposition. It
metamathematics, Polska Akad. Nauk., 1963.
became clear later that the language of Boolean func-
[7] STONE, M.H.: The theory of representations for Boolean alge-
bras', Trans. A mer. Math. Soc. 40 (1936), 37-111. tions is suited for a description of the operation of
[8] BIRKHOFF, G.D.: Lattice theory, Colloq. Pub!., 25, Amer. Math. discrete control systems (cf. Control system) such as
Soc., 1973. contact schemes, diagrams of functional elements, logi-
[9] HERMES, H.: Einfiihrung in die Verbandstheorie, Springer, 1967.
[10] KOLMOGOROV, A.N.: 'Algebres de Boole metriques completes',
cal networks, switching networks, etc. These control
in VI Zjazd Matematykow Polskich, Krakow, 1950. systems are constructed in accordance with certain
[II] DUNFORD, N. and SCHWARTZ, J.T.: Linear operators. Spectral rules from a number of initial elements, just as complex
operators, 3, Interscience, 1971.
[12A] KAKUTANI, S.: 'Concrete representations of abstract (L)-spaces
statements are constructed from simple ones. The rules
and the mean ergodic theorem', Ann. of Math. (2) 42, no. 2 governing the construction of such control systems as
(1941),523-537. well as the operation of the initial elements are such
[12B] KAKUTANI, S.: 'Concrete representations of abstract (M)-
that the operation of complex control systems can be
spaces (a characterization of the space of continuous func-
tions)" Ann. of Math. (2) 42, no. 4 (1941),994-1024. described with the aid of Boolean functions. Boolean
[13] MAIiARAM, D.: Proc. Nat. Acad. Sci. USA 28 (1942), 108-111. functions are also used in certain problems of integer
[14] MACKEY, G.W.: The mathematical foundations of quantum programming that reduce to solving a system of
mechanics, Benjamin, 1963.
[15] YOSIDA, K.: Functional analysis, Springer, 1980. Boolean equations of the form
[16] KURATOWSKI, K.: Topology, 2, Acad. Press, 1966-1968
!l(Xj, ... ,xn ) = ... = !m(xj, ... ,xn ) = 0,
(translated from the French).
D.A. Vladimirov
where the j;, i = I, ... ,m, are Boolean functions.
AMS 1980 Subject Classification: 06EXX
There are also other ways of using Boolean functions in
BOOLEAN EQUATION - An equation of the form discrete mathematics, so that the study of Boolean
functions is of interest in its own right.
!(Xj, ... ,xn ) = 0, (*)
An essential feature in solving problems related to
where f is a Boolean function in n variables. The set of Boolean functions is the method of specifying Boolean
all solutions of an equation of the form (*) can be functions. There are several such methods: tables, for-
described by a system of Boolean functions depending mulas, special classes of formulas known as normal
on n arbitrary parameters. forms (cf. Boolean functions, normal forms of), subsets
References of the vertices of an n-dimensional unit cube, etc. In
[I] BIRKHOFF, G.: Lattice theory, Colloq. Pub!., 25, Amer. Math.
Soc., 1973.
the last-named case every selection of length n of values
T.s. Fofanova of the arguments (0 or I) is regarded as a vertex of the
AMS 1980 Subject Classification: 06E30 n-dimensional unit cube, in which case a Boolean func-

423
BOOLEAN FUNCTION

tion of n arguments can be defined by the subset of the nected with the minimization of Boolean functions and
comers at which it assumes the value 'I'. This subset, the theory of local algorithms. These characteristics are
when written out as a matrix whose rows are selections the dimension and the extent of functions.
of values of the arguments of the Boolean function, is Let Nf(x" ... ,x,) be the set of vertices of the n-
known as a Boolean matrix. If a Boolean function dimensional unit cube on which a function
describes the operation of control systems, the latter f(x], ... ,xn ) is equal to one. Consider all maximal
can also be regarded as a method of specifying the intervals of the function f(x], ... ,xn ) and select the
Boolean function. One usually says that this control interval of the largest dimension r. The quantity r is
system realizes the given Boolean function. The realiza- called the dimension of f (x], ... ,xn ) and is denoted
tion of Boolean functions by many kinds of control by Dimf(XI, ... ,xn ). In terms of the dimension one
systems is closely related to a large number of prob- can estimate the complexity ratio of the most complex
lems, such as synthesis, minimization, control and relia- dead-end and the shortest disjunctive normal forms of f
bility problems, etc. Another type of problem arises in (cf. Boolean functions, normal forms of). An upper esti-
the study of properties and classes of Boolean functions mate of this ratio is 2Dimf. The inequality
specified by different methods; this is the study of the Dimf(x\> ... ,xn ) E;;; [lo~nl+l
metric characterizations of various classes of normal
forms of Boolean functions and the related geometrical is valid for 'almost-all' Boolean functions.
properties of the n-dimensional unit cube (cf. Boolean In solving problems of minimization of Boolean
functions, metric theory of), as well as of the various functions it is of interest to calculate the dimension of
algebras of Boolean functions (ct. Many-valued logic; 'typical' maximal intervals. It has been proved that
Equivalent transfonnations). The system of all classes of 'almost-all' maximal intervals of 'almost-all' Boolean
Boolean functions that are closed under composition functions f(x], .. . ,xn ) are of dimension close to
was described by E. Post. It forms a countably-infinite lo&lo~n.
lattice with five maximal (pre-complete) classes. Let Sk(,iJl,j) be the k-th order smooth neighbour-
In certain cases it may be necessary to consider par- hood (cf. Algorithm, local) of an elementary conjuction
~ occurring in the abridged disjunctive normal form SRf
tial (i.e. not everywhere-defined) Boolean functions for
which the problems listed have a specific character. of f, and let k (~,f) be the minimal value of the order
of the neighbourhood in which Sk(~' j) includes all
References elementary conjunctions occurring in the abridged dis-
[1] NOVIKOY, P.S.: Elements of mathematical logic, Edinburgh,
1964 (translated from the Russian). junctive normal form SRf . The quantity
[2] YABLONSKII, S.V., GAVRILOV, G.P. and KUORYAVTSEV, V.B.:
p(j) = max k(~, f)
Functions of the algebra of logic and Post classes, Moscow, 1964 lle91f
(in Russian).
v'B. Kudryavtsev is called the extent of f For 'almost-all' Boolean func-
Editorial comments. Cf. [A2] for uses of Boolean func- tions
n
tions and equations in switching theory and logical net- p(j(x\> ... ,xn )) ~ 1 1
o~ o~n
works. Applications of Boolean functions and equations in
operations research are discussed in [A1]. Let
p(n) = j(x,max
... ,x,)
P(j(Xlo'" ,xn )).
References
[A1] HAMMER, P.L. and RuoEANU, S.: Boolean methods in opera-
tions research, Springer, 1968.
It is known that p (n) can be realized on a Boolean
[A2] RuoEANU, S.: Boolean functions and equations, North- function of a special kind, which is called a chain. A
Holland,1974. function ll-{X) , . . . ,xn ) is called a chain if the set N%f
AMS 1980 Subject Classification: 06E30 its zeros can be represented as a sequence a), ... ,aq
such that p(aj, aj + 1) = 1, where p is the Hamming dis-
BOOLEAN FUNCfIONS, METRIC THEORY OF -
The branch of mathematics in which one studies
tance (cf. Code); the distance between other pairs a" as
(possibly except for the pair (a), aq is larger than one,
numerical characteristics and metric properties of and no interval of dimension 2 is completely contained
Boolean functions. The principal parts of this theory in the set of ones of 1/;. The extent of the chain 1/; is
are concerned with properties of 'almost-all' Boolean q - 1. Therefore, the problem of calculating p (n)
functions (cf. Boolean functions, minimization of), pro- reduces to the construction of a chain with maximal q
perties of the set of all Boolean functions in a given in the n-dimensional unit cube. It has been shown by
number of variables, and special subclasses of Boolean direct construction of such chains that
functions. Another topic is the structure of the truth
domains of Boolean functions with the aid of numerical Cj2 n <p(n) < C2'2n,

characteristics that appear mainly in problems con- where CJ, C2 are constants. The constmction of closed

424
BOOLEAN FUNCTIONS, METRIC THEORY OF

chains (cycles), i.e. of chains in which p(a" aq )= 1, with values of n. The asymptotic equality
maximum cardinality of the set N", is an important n
part of the proof of the theorem that the properties of
conjuctions 'of occurring in the minimal or shortest dis-
junctive normal form' are non-computable in the class
of local algorithms. is also known.
The following result clarifies the structure of the The problem of optimal decoding of a monotone
truth domains of 'almost-all' Boolean functions. A set Boolean function has a large number of applications in
M of vertices of the n-dimensional unit cube is called the solution of discrete extremal problems. Consider a
connected if for any yoint aEM there exists a point {J given algorithm A for the computation of a monotone
a
in M such !!tat p(a, {J) = 1. Ayoint in M is called iso- Boolean function f (x), ... ,XII) at any point a. If it
pted if all {J such that p(a, {J)= I satisfy the condition can be established in the course of the computations
{Jfl.M. The following theorem holds: For 'almost-all' that f(a)=1 for some point a,
then f({J)=1 for all
Boolean functions f (x), ... ,xn ) the set of ones, p~a. If f(a)=O, the function f is known (and van-
Nf(x" ... ,x.), split~ into the sum of a connected set and ishes) at all points p~a. Therefore, to effect the decod-
a set of isolated points. The cardinality of the con- ing, i.e. to completely reproduce the function f, the
nected set is not less than algorithm A need only compute its values in a certain
2n -)-Vn2"/ 2 -constlo!;2n and the number of iso- set of points. Let m (j) denote the minimum number
lated points does not exceed constl0!;2 n. of points sufficient to reproduce f completely. Let
The results obtained in calculating the extent of m(n) = max m(j).
'almost-all' Boolean functions are closely connected j(x" .. . ,x.)

with the results obtained on calculating the radii and The quantity m(n) satisfies the asymptotic equality:

lJ
diameters of the graphs generated by Boolean func-
tions. The graph G(j) generated by a Boolean function n 1
f has as vertices the points of Nf and as edges the pairs m(n) - 2 [;
of points of Nf at H~g distance one from each
other,: The distance rG(a, {J) between two vertices a The estimate of m (j) can be improved if supplemen-
and {J of G (j) is defined as the length of the shortest tary information about the monotone Boolean function
chain connecting witha p. (It is assumed that the ver- is available.
a p
tices and belong to the same connected component A problem related to the problem of decoding mono-
a
of G(j).) The deviation of the vertex in G(j) is the tone Boolean functions is that of calculating numerical
quantity l(a)=maxrG(a, P), where the maximum is characteristics of the set of essential variables of
taken over all the vertices of G that belong to the same Boolean functions that are not defined everywhere, i.e.
connected component as a. The radius of a connected functions defined on some subset of the set of vertices
component K of G is the number R(K)=minaEKI(a). of the n-dimensional unit cube. A collection of vari-
The quantity R(G)=maxR(K), where the maximum is ables Xi" ... ,Xi. is called essential for a Boolean func-
taken over all connected components of G, is called the tion F(x" ... ,XII) that is not everywhere defined if
radius of G. The diameter of G is the number {Xi" . . . ,xd C;; { X ), . . . ,XII} and if there exists a
D(G)=maxrG(a, P), where the maximum is taken over Boolean function CP(Xi" ... ,Xi.) that is not everywhere
all pairs of vertices a, p that belong to the same con- defined such that F(x), ... ,XII)-CP(Xi" ... ,Xi.)
nected component. For 'almost-all' Boolean functions throughout the domain of definition of F. An essential
f(x), ... ,XII) the quantities R(G) and D(G) are such collection is called a dead-end collection for F if no
that n -2:s;;;;,R(G(j:s;;;;,n -I and D(G(j=n-1. proper part of it is essential for F.
Of all the results of computations of numerical Each everywhere-defined Boolean function has a
characteristics of individual classes of Boolean func- unique dead-end collection. For Boolean functions that
tions only those concerning monotone Boolean func- are not defined everywhere the structure of the collec-
~ons will be considered. Let a=(a)'" all), tions of essential variables is distinguished by its large
{J=({J) . ~ . {JII) be binary ordered samples. One says variety. Let ~ be the system of subsets of the set
that a:s;;;;,{J if ai:s;;;;,{Ji, i=I, ... ,no A Boolean function is {x), ... ,xn } that satisfies the following condition: If
called monotone if a~p implies that f(a)~f({J). It is S E~ and if S' is an arbitrary subset of {x), ... ,XII}'
of interest to determine the exact number #..n) of dis- then SUS' E~. Then there exists a Boolean function
tinct monotone Boolean functions in the variables F~(x" ... ,XII)' not everywhere defined, such that ~ is
X), ... ,XII' This number is known for only a few small a system of essential collections for F~. Let tF(n) be

425
BOOLEAN FUNCTIONS, METRIC THEORY OF

the number of distinct dead-end collections for another, of having an empty intersection or a non-
F(Xb ... ,XII) and put empty symmetric difference. Let mf be the complexity
t(n) = F(x"max tp(n).
of the minimal disjunctive normal form of a function f,
... ,x,,) let kf be the minimal complexity of its shortest disjunc-
It is known that tive normal form and let /(n) be the largest of the
ratios kf / mf over all functions in n variables. Then the
following asymptotic relation holds:
n
I(n) '" 2'
The algorithm for the construction of all dead-end
collections for F(x., ... ,XII) has maximal computa- By the problem of minimization of Boolean functions
tional labour 2([11 j2) and this maximum is in fact one usually understands that of constructing their
attained. (Here the unit of labour is defined as the minimal disjunctive normal forms. There is a trivial
labour involved in verifying whether or not a given col- algorithm for constructing all minimal disjunctive nor-
lection of variables is essential for F.) mal forms of an arbitrary Boolean function
The metric theory of Boolean functions also deals f (x 1, . . . , XII)' which operates as follows: All disjunc-
with problems of computing characteristics connected tive normal forms in the variables Xl"'" XII are
with the problem of minimization of Boolean functions. reviewed, and those which realize the function f and
have minimal complexity are selected. In fact, this algo-
References
[I] GLAGOLEV, V.V.: 'Certain estimates of the disjunctive normal rithm is not applicable even for small n, since the
forms of functions of the algebra of logic', Problemy Kibemet. number of operations required increases rapidly. Many
19 (1967),75-94 (in Russian). other algorithms have therefore been constructed,
[2] ZlruRAvuv, Yu.l.: 'On algorithmic extraction of the set of
essential variables of not everywhere defined functions of the which, however, are not effectively applicable to all
algebra of logic', Problemy Kibemet. 11 (1964), 271-275 (in functions.
Russian). The initial specification of a function in the problem
[3] KOROBKOV, V.K.: 'On monotone functions of the algebra of
logic', Problemy Kibemet. 13 (1965),5-28 (in Russian). of minimization is usually a table, a perfect disjunctive
[4] SAPOZHENKO, A.A.: 'Metric properties of almost all functions normal form (cf. Boolean functions, normal forms of) or
of the algebras of logic', Diskret. Anal. 10 (1967),91-119 (in an arbitrary disjunctive normal form. The first stage
Russian).
[5] SAPOZHENKO, AA.: 'The order of the neighbourhood of maxi-
consists in the transition to the so-called abridged dis-
mal intervals for almost all logical functions', Soviet Math. junctive normal form, which is uniquely determined for
Dokl. 9, no. 3 (1968), 591-594. (DokL Akad. Nauk SSSR 180, each function. Many methods for the realization of this
no. I (1968), 32-35) transition are available. The most universal method
[6] KLBITMAN, D.l.: 'On Dedekind's problem: the number of
monotone Boolean functions', Proc. Amer. Math. Soc. 21 consists in performing transformations on a disjunctive
(1969), 677-682. normal form of the type
Yu.I. Zhuravlev
AMS 1980 Subject Classification: 06E30 A V AB ~ A (absorption),
xA V CxB ~ xA V CxBV AB.
BOOLEAN FUNCflONS, MINIMIZATION OF - A
representation of Boolean functions by normal forms A typical property of the abridged disjunctive normal
(cf. Boolean functions, normal fonus of) that are most form is that it is possible to obtain any minimal
simple relative to some measure of complexity. The abridged disjunctive normal form from it by removing
usual meaning of the complexity of a normal form is the certain elementary conjunctions. The second, most
number of letters in it. A simplest form is then called a laborious stage is to extract from the abridged disjunc-
minimal form. A measure of complexity sometimes used tive normal form all dead-end disjunctive normal
is the number of elementary conjunctions in a disjunc- forms, among them all the minimal ones. A geometri-
tive normal form or the number of factors in a con- cal representation of Boolean functions is usually
junctive normal form. In this case a simplest form is employed at this stage. Let Ell denote the set of all
called a shortest form. In view of the duality of disjunc- vertices of the n-dimensional unit cube. Each Boolean
tive and conjunctive normal forms, it is sufficient to function f (x J, ,XII) is in one-to-one correspondence
consider disjunctive normal forms only. with the subset Nf , Nfr;;,En, of vertices a such that
The construction of shortest and minimal disjunctive f (a) = 1. Let ~ be an elementary conjunction of rank r.
normal forms each has its own specific features. The The set N m: is then called an interval of rank r
sets of minimal and shortest disjunctive normal forms corresponding to the elementary conjunction ~. One
of one and the same function may be connected by says that a system of intervals N m:" ,N m:m forms a
set-theoretical relations: of being contained one in one covering of a set N r;;,En if

426
BOOLEAN FUNCTIONS, MINIMIZATION OF

N = N., U ... UN.~. tion with at least one interval corresponding to a con-
Since the equalities junction of Sk -I (~, 9C); or 2) N'l.Jr;, U ; =1N 113" where
f= ~lV V~m and NI = N., U ... UN~
N I1J" i=l, ... ,r, satisfies I).
Quine's algorithm. Here one considers at each step
are equivalent, the problem of minimization of Boolean the neighbourhood S 1 (~, 91) of one of the conjunctions
functions is equivalent to the search for coverings with in the disjunctive normal form 91. As the algorithm is
a minimal sum of the ranks of their intervals. Such executed for each conjunction, one attempts to com-
coverings are called minimal. An interval N g{ is called pute one of the following properties:
maximal for a function f if N III r;,Nf and if there is no p 1 (~, 91) - '~ occurs in all minimal disjunctive nor-
interval NI1J such that N'I. CN I1J r;,Nf. The construction mal forms'; and
of dead-end disjunctive normal forms of a function f
p 2 (~, 91) - '~ does not occur in any minimal dis-
reduces to a search for coverings of Nf by maximal junctive normal form'.
intervals such that no proper subset of it is a covering The algorithm works as follows. I) The neighbour-
of Nf . These coverings correspond to dead-end dis-
hood SI(~' 91)={~, ~I. . . . '~m} is successively
junctive normal forms and are called irreducible. They formed for each conjunction ~ in 91. The inclusion
are obtained from ~verings corresponding to abridged
N III C U ~= 1N Ill, is verified. If this is not the case, then
disjunctive normal forms by removal of certain inter-
~ is marked in some way as belonging to all minimal
vals.
The selection of the minimal disjunctive normal disjunctive normal forms. One says that ~ is part of the
kernel of 91 (is a kernel conjunction). 2) Let the con-
forms from all dead-end disjunctive normal forms is
also a very laborious process, since 'almost-all' Boolean junctions ~I' .. ,)8q be marked in 91 in the first stage.
functions in n arguments have no fewer than 22n dif- The remaining conjunctions in 91 are ordered, and the
ferent dead-end disjunctive normal forms. The diversity inclusion N!B r;, U r= 1N 113, is checked for each such
of the complexities of dead-end disjunctive normal conjunction)8. Conjunctions satisfying this relation do
forms may be very wide, so that if a random rather not occur in any minimal disjunctive normal form and
than a dead-end minimal disjunctive normal form is are removed from 91.
selected, a large error may result. Regular points algorithm. During each step of this
Estimates of the number of dead-end disjunctive nor- algorithm one examines the neighbourhood S 2(~' 91) of
mal forms and of the scatter of their complexities show a conjunction ~ in the disjunctive normal form 91, and
that a more detailed inspection during the elimination the conjunctions not occurring in any dead-end disjunc-
of the elementary conjunctions from a disjunctive nor- tive normal form, and hence not occurring in any
mal form is a natural way of improving the effective- minimal disjunctive normal form either, are eliminated.
ness of minimization algorithms. A conjunction should The description of the algorithm involves the concept
be eliminated (or, on the contrary, retained until the a
of an a-bundle in 91, which, for this point in N~, is
end of the process) only if it can be established by the set M(a,91) of intervals N'l.J such that ~je91,
some procedure that it does not occur in any minimal a
aeN\1lJ' For a conjunction ~ in 91 a point of N\1l is
disjunctive normal form for f (occurs in all minimal called regular with respect to (~, 9C) if there exists a
disjunctive normal forms for f). The latter fact is usu- point a' such - that a'eN~ \NIll and
ally established by an analysis of the conjunctions that M(a', 91)r;,M(a, 91). A set M r;,N'I. is called regular
are close to the one in question, i.e. occur in a neigh- with respect to (~, 91) if all its points are regular with
bourhood of it (cf. Algorithm, local). Here one has to respect to (~, 91). This algorithm is based on the fact
accumulate information about elementary conjunctions that a conjunction ~ of the abridged disjunctive normal
and use it in the subsequent analysis. Procedures of this form of a function f does not occur in any dead-end
kind are known as local Simplification algorithms. Some disjunctive normal form of f if and only if N'I. is a reg-
of them are quoted below, and their description ular set with respect to (~, 91). The algorithm checks
involves the concept of the k-th order neighbourhood whether in a certain sequence the conjunction intervals
Sk(~' 91) of an elementary conjunction ~ in a disjunctive occuring in the disjunctive normal form are regular sets
normal form 91, ~e91. and eliminates those that are regular. Whether or not
The zero-order neighbourhood So(~, 91) consists of an interval N III is regular with respect to (~, 91) is com-
the single conjunction ~. If Sk -I (~, 91) is the neigh- pletely determined by the neighbourhood S2(~' 91) but
bourhood of order k - I, then the neighbourhood not, in general, by the neighbourhood S 1(~, 91).
Sk(~' 91) of order k consists of all conjunctions ~j of The A -algorithm. The concept employed here is that
the disjunctive normal form 91 that satisfy one of the of a disjunctive normal form that is minimal with
following conditions: I) N Ill} has a non-empty intersec- respect to a disjunctive normal form 91, i.e. one that is

427
BOOLEAN FUNCTIONS, MINIMIZATION OF

minima] among all disjunctive normal forms that are the A -algorithm is terminated; if this is not possible,
equivalent to 9l and are obtained from 9l by omission the procedure is applied to the second conjunction in
of certain elementary conjunctions. Two properties of the sequence, etc. If the mark does not change in any
an elementary conjunction in a disjunctive normal form one of these conjunctions, then after all the conjunc-
are considered: tions have been examined, the operation of the A-
PI (m, 9l) - 'm occurs in all disjunctive normal forms algorithm is terminated.
that are minimal with respect to 9l'; and If the abridged disjunctive normal form 9lf of a
P 2 (m,9l) - 'm occurs in no disjunctive normal form function f is taken for 9l, then no conjunctions that
that is minimal with respect to 9l'. obtained in the algorithm the mark (a, I) occur in any
It is assumed that Pi = I if the property Pi holds, and minimal disjunctive normal form of f, they are elim-
Pi =0 otherwise. It is also assumed that disjunctive nor- inated from 9lf . Conjunctions that obtained the mark
mal forms are formed from conjunctions with informa- (I, a) occur in all minimal disjunctive normal forms of
tive marks ("'10 "'2), where "'iE{O, I, a}. The equality f Various special cases of the A -algorithm have also
a
"'i :: means that the property Pi has not been calcu- been considered.
lated (i = I, 2), while the equalities "'i = I mean that The ring algOrithm places over the conjunctions infor-
=
Pi ="'i (i 1,2). A conjunction m
with informative mative marks ("'10 "'2), with the same meaning as in the
marks ("'I, "'2) is denoted by m""""'. The A-algorithm A -algorithm. At each step of the ring algorithm use is
computes the values of the properties P I and P 2 for made of the conjunctions contained in the k-th order
the conjunctions min the disjunctive normal form 9l, neighbourhood of some conjunction and of their infor-
using for this purpose only the conjunctions of the mative marks. A simplified version of this algorithm is
neighbourhood S2(m,9l) and their informative marks. outlined below. The ring algorithm in its complete form
For a conjunction mof rank r of 9l a set Mr;;.Nrtf. is is the best local algorithm with a special memory rela-
called a set of the first type relative to (m, 9l) if 9l con- tive to the neighbourhoods Sk(m,9l). The algorithms
tains conjunctions mlo , mm of ranks r., ... ,rm described above are special cases of the general ring
such that Mr;;. U j=INrtf.j and r> ~j=lrj. algorithm. If
The difference 91 1 \ 912 of two disjunctive normal Sk-1(~' 91) = {~, ~\o 0 ,~tl,
forms 911 and 912 is the disjunctive normal form con-
Sk(~' 91) = {~, ~\o '~/' ~I+\O '~m}
sisting of the elementary conjunctions occurring in 9l]
0

but not in 9l2 and


I m
Before the application of the A-algorithm all con-
N s. , = N'/I. U UN'/I." N s = N'/I. U UN'/I.J'
junctions of the disjunctive normal form 9l have the ;=1 j=1

mark (a, a). Mter i steps have been performed and the Q(Sk) = Ns \ N s. , ,
conjunctions m;, ... ,m~ have received the mark (1, a),
then to each subset Nr;;.Q(Sk) a Boolean function
and the conjunctions ~~, ... ,~; have received the
f(x 10 ... ,xn ) that is not everywhere defined is
mark (a, I), the (i + I)-th step consists of the following.
assigned, such that the set M{ of its ones is Ns. \ N
The conjunctions in the disjunctive normal form are
ordered in some way. If the disjunctive normal form and the set M6 of its zeros is En \ Ns.; the function f is
9l\(Vj=]~jVVj=]~;) is empty, the A-algorithm is not defined on N. The set of such functions is denoted
terminated. If it is not, the conjunctions of this disjunc- by Fk(m). Prior to the beginning of the algorithm the
tive normal form are ordered in some way; the first conjunctions of the disjunctive normal form 9l in ques-
conjunction ~ in the sequence and its neighbourhoods tion have the mark (a, a). If, after i steps have been
S] and S 2 in the disjunctive normal form completed, conjunctions ~'I' ... ' with the markm:
~=9l\(Vj=]~;) are singled out, and the relation (1, a) and conjunctions ~'{, ... ,~; with the mark

N'/I. = U. Nmo
!8e(S, ('/I., 91) \ 'II.)
(a, I) are obtained, then the (i + l)-th step is performed
as follows. The conjunctions of the disjunctive normal
forms are ordered in some way. If the disjunctive nor-
is checked. If it is satisfied, the mark (a, a) over ~ is mal form 9l\ Vj=l~jVVj=]~;) is empty, the algo-
replaced by (1, a) and the (i + l)-th step of the A- rithm is terminated. If not, then all conjunctions of this
algorithm is terminated. If not, then a check is made disjunctive normal form are ordered in some way. For
whether or not N rtf. can be represented in the form the conjunction ~ which is the first in the sequence and
M] UM2' where M] is a regular set relative to (m, 9l) for all f from the set Fk(m), all disjunctive normal
and M 2 is a set of the first type relative to (~, 9l). If forms which realize f on its domain of definition, which
N'l can be represented in this form, the mark (a, a) consist of conjunctions in Sk(m, 9l) and which contain
over ~ is replaced by (a, 1) and the (i + 1)-th step of the least number of variable symbols as compared with

428
BOOLEAN FUNCTIONS, NORMAL FORMS OF

other such disjunctive normal forms, are found. respectively, is called a disjunctive normal form, and
Among them one selects all disjunctive normal forms the number ~~ = 1ri is called its complexity; a formula
which, first, do not contain the conjunctions m1 ml> where m1, . , mt are different elementary
~';, ... , ~~ and, secondly, contain all the conjunctions disjunctions of ranks PI, ... , Pt, respectively, is called
~j, j= 1, ... ,s, for which Nw. n
(Ns \N)= 0. If for a conjunctive normal form, and the number ~:=IPi is
all f in Fk(~ the conjunction ~ occurs in all disjunc- called its complexity. Every Boolean function that does
tive normal forms selected, then the mark (/l, /l) over ~ not vanish identically can be defined by a disjunctive
is replaced by (1, /l) and the (i + l)-th step of the algo- normal form which is, generally speaking, not unique.
rithm is terminated. If ~ does not occur in any disjunc- The same applies to conjunctive normal forms and
tive normal form selected, then the mark (/l, /l) is functions that do not identically vanish.
changed into (Jl, 1) and the (i + l)-th step of the algo- From a table defining a Boolean function
rithm is terminated. Otherwise, the procedure just f (x 1, . . . , x") one easily obtains the perfect disjunctive
described is applied to the conjunction that is next in normal form ~l V ... V~s, where ~i =X~il ... x~.. ,
sequence, etc. If the mark cannot be changed for any i=l, ... ,s, and the sets CJil, ,CJi" are such that
conjunction, the algorithm terminates at the (i + l)-th f (CJi I, . . . , CJi") = 1. The perfect disjunctive normal form
step. All conjun6tions obtained in the ring algorithm realizing a Boolean function f is unique. The perfect
over the abridged disjunctive normal form WI of a conjunctive normal form is defined similarly.
function f with mark (1, /l) (or, respectively, (/l, 1 Since for 'almost-all' Boolean functions the number
occur in all minimal disjunctive normal forms (or, do of unit sets varies between 2" - I - Vn' 2" /2 and
not occur in any minimal disjunctive normal form) of f 2" -I + Vn' 2" /2, the asymptotic complexity of the per-
The algorithms just described yield identical results, fect disjunctive normal form for 'almost-all' Boolean
whatever the manner of ordering of the conjunctions in functions is n2" -I. The maximal complexity of the
the disjunctive normal form. perfect disjunctive normal forms for functions in n vari-
The task of selecting all conjunctions occurring in at ables is attained for functions that vanish at a single
least one or not occurring in any minimal disjunctive point. This complexity is n '(2" - 1).
normal form cannot be solved by algorithms working The main problem in the theory of normal forms of
with Sk(~, W) if k is bounded or increases too slowly as Boolean functions is that of minimizing Boolean func-
the number of variables n increases. This situation does tions, i.e. the construction of conjunctive or disjunctive
not change if to the properties PI, P 2 stored in the normal forms of minimal complexity - minimal con-
algorithm a bounded set of properties or one that junctive or disjunctive normal forms for any given
increases too slowly as n increases is added. Boolean function (cf. Boolean functions, minimization
References of; Algorithm, local). Here, by virtue of the duality
[I] YABLONSKII, S.V.: 'Functional constructions in k-placed logic', principle, it is sufficient to consider disjunctive normal
Trudy Mat. Inst. Steklov. 51 (1958), 5-142 (in Russian).
forms only. In connection with the problem of minim-
[2] ZIflJRAVLEV, Yu.l.: 'Set-theoretical methods in the algebra of
logic', Problemy Kibemet. 8 (1962), 5-44 (in Russian). izing Boolean functions one also considers abridged
[3] QUINE, W.V.: 'On cores and prime implicants of truth func- dead-end, shortest and minimal disjunctive normal
tions', Amer. Math. Monthly 66 (1959),755-760. forms. An important problem in the theory of disjunc-
Yu.I. Zhuravlev tive normal forms is the search for numerical
AMS 1980 Subject Classification: 06E30 characteristics of them, and characteristics connecting
various types of disjunctive normal forms of a given
BOOLEAN FUNCI10NS, NORMAL FORMS OF -
function.
Formulas of a special kind realizing Boolean functions.
The abridged disjunctive normal form is uniquely
One distinguishes between disjunctive normal forms (cf.
constructed from a Boolean function by means of a
Boolean functions, minimization of) and conjunctive nor-
fairly simple algorithm. Its most important property is
malforms. A product X'!l'I ... x''>'k ' where x" =x if CJ= 1 the fact that every minimal disjunctive normal form of
and x" = ex if CJ = 0, is called an elementary conjunction a function and at least one shortest form can be
of rank k if all its variables are different; '1' is con- obtained from the abridged disjunctive normal form by
sidered to be an elementary conjunction of rank zero. the elimination of certain elementary conjunctions.
A logical sum xj: V ... V xj: is called an elementary Therefore many minimization algorithms employ the
disjunction of rank r if all its variables are different; '0' abridged disjunctive normal form as the initial specifi-
is considered to be an elementary disjunction of rank cation of a Boolean function. It is of major interest in
zero. this context to define the complexity of abridged dis-
A formula ~l V ... V~t. where ~J, , ~I are dis- junctive normal forms for 'almost-all' functions and to
tinct elementary conjunctions of ranks rJ, ... , rl, determine the absolute maximum of this complexity. If

429
BOOLEAN FUNCTIONS, NORMAL FORMS OF

Sin) is the number of elementary conjunctions in the 2n 2n


abridged disjunctive normal form of a Boolean function lo~ n 'lo~ 10~ n < lfin) < 10~ n .
!(xJ, ... ,xn ) and if These estimates show that the shortest (and also the
Sen) = max Sfin), minimal) disjunctive normal forms of 'almost-all'
f(x" ... ,x,) Boolean functions account for only a small fraction of
then the following estimates hold: the number of dead-end disjunctive normal forms.
3n 3n There are also estimates of the relative complexity of
CI-::S S(n)::S C 2' _ I ' dead-end and shortest disjunctive normal forms of
n vn
Boolean functions. Let "hln) be the maximal number of
and for 'almost-all' Boolean functions
elementary conjunctions in a dead-end disjunctive nor-
n(I-()log,log, n 2n < Sfin) < n(I+()log,log,n2n,
mal form of a function !(xJ, ... ,xn ). For 'almost-all'
_0 as n_oo. Boolean functions
It is clear from these results and from the estimates of
the complexity of the perfect disjunctive normal form
10~ n < ~j=; < lo~ n '10~ 10~ n.
that the complexity of the abridged disjunctive normal The maximum value of the ratio II.ln) / lin) can be
form is much higher than that of the perfect disjunctive estimated from below by 2n (l-<), _0 as n_oo. Esti-
normal form, both in the 'typical' and in the 'record- mates of y (f) - the so-called scatter of ! (x \, ... ,xn )
breaking' cases. Unlike the perfect and the abridged - have also been obtained. Here
disjunctive normal form, a given Boolean function may
have many dead-end and minimal disjunctive normal y(j) = ~~ L(T")'
.MT.:.l
T,T
forms. Let tin) be the number of dead-end disjunctive
where T' and Tn are arbitrary dead-end disjunctive
normal forms and let min) be the number of minimal
normal forms realizing !(x\, ... ,xn ), and L(T) is the
disjunctive normal forms of a Boolean function
number of characters in the dead-end disjunctive nor-
!(xJ, ... ,xn ),
mal form T. Examples of Boolean functions with
ten) = max tfin), men) = max mfin). y (f)"~2n(\-o(\ have been constructed; it has been
f(x" . .. ,x,) f(x" ... ,x,)
established, however, that for 'almost-all' Boolean func-
Then the following estimates hold: tions
y(j) .;;; lo~ n 10& 10& n.
(22')Vn .;;; t(n)';;; (22'r/2, men) > (22')""nst.Vn,
The estimates above give a fairly complete idea of the
and for 'almost-all' Boolean functions
difficulties that arise when Boolean functions are
(22'-' )(I-,)Iog, n log, log, n < tfin) < minimized according to the scheme: perfect disjunctive
< (22H P+,)Iog, n log, log, n. normal form - abridged disjunctive normal form -
dead-end disjunctive normal form - minimal disjunc-
_0 as n_oo.
tive normal form.
A non-trivial upper estimate for m(n) and a non-
References
trivial estimate for min) for 'almost-all' functions are [1] VASIL'EV, YU.L.: 'On comparing the complexity of typical and
as yet (1977) not known. Complexity estimates of disjunctive normal forms', Problemy Kibernet. 10 (1963),
dead-end and of minimal disjunctive normal forms are 5-61 (in Russian).
[2] GLAGOLEV, V.V.: 'Certain estimates of the disjunctive normal
of great interest in problems of minimization of forms of functions of the algebra of logic', Problemy Kibernet.
Boolean functions. Let II.J(n ) be the number of 19 (1967), 75-94 (in Russian).
elementary conjunctions in a dead-end disjunctive nor- [3] KORSHUNOV, A.D.: 'The upper complexity bound of the shor-
test disjunctive normal forms of almost all Boolean functions',
mal form T of the function !(x\, ... ,xn ); let lin) be Cybernetics 5, no. 6 (1969), 705-715. (Kibernetika (Kiev), no. 6
the number of elementary conjunctions in a shortest (1969), 1-8)
disjunctive normal form of! (x \, ... ,xn ), [4] SAPOZHENKO, A.A.: 'On the greatest length of a dead-end dis-
junctive normal form for almost all Boolean functions', Math.
hen) = maxhJ(n),
f, T
len) = maxl;(n).
f
Notes 4, no. 6 (1968), 881. (Mat. Zametki 4, no. 6 (1968),
649-658) Yu.l. Zhuravlev
The following estimates then hold:
hen) ~ 2\ len) = 2"-1.
AMS 1980 Subject Classification: 06E30

For 'almost-all' Boolean functions !(xJ, ... ,xn ) and


for almost-all dead-end disjunctive normal forms T: BOOLEAN RING - An associative ring K whose ele-
ments are all idempotent, i.e. x 2 =x for any x EK. Any
hJ(n) ~ 2"-1.


Boolean ring K=I=0 is commutative and is a subdirect
For 'almost-all' Boolean functions! (x \, ... ,xn ): sum of fields ~ of two elements, and x + x = for all

430
BOOLEAN-VALUED MODEL

x EK. A finite Boolean ring K=I=O is a direct sum of dual constants: each v E VM having its own individual
fields ~ and therefore has a unit element. constant v. Let M be a B-model and let
A Boolean ring is the ring version of a Boolean alge- B = (B; 0, 1, C, u, n) be a complete Boolean alge-
bra, namely: Any Boolean algebra is a Boolean ring bra; the equalities 1) - 8) below then define the value
with a unit element under the operations of addition II e II M of each closed expression e (Le. of a formula or
and multiplication defined by the rules a term without free variables) of L M :
(x+y) = (xnCy)u(YnCx), xy =xny, 1)11 v IIM=v, where VEVM ;
where Cx is the complement of x. The zero and the
2)11 P(TJ, . ,tn ) IIM=(~M(p))(11 T\ 11M, ... , II Tn 11M),
unit of the ring are the same as, respectively, the zero where TJ, ,Tn are closed terms and p is an n-place
and the unit of the algebra. Conversely, every Boolean function or predicate symbol;
ring with a unit element is a Boolean algebra under the 3) I/>:lljl IIM= -II I/> 11M U IlljI 11M;
operations x Uy =x +y + xy, x ny =xy, Cx = 1+ x. 4) I/>VljI IIM= II I/> 11M U IlljI 11M;
References 5) l/>/\ljI IIM= II I/> 11M n IIljI 11M;
[I] STONE, M.H.: 'The theory of representations for Boolean alge- 6) -,1/> IIM= -II I/> 11M;
bras', Trans. Amer. Math. Soc. 40 (1936),37-111. 7) 3~q>(~) IIM= U VEV) q>(v) 11M;
[2] ZHEGALKIN, I.I.:"On the technique of computation of proposi-
tions in symbolic logic', Mat. Sb. 34, no. I (1927),9-28 (in 8) V~q>(~ IIM= n II q>(v) 11M.
VEV
M
Russian). French abstract.
[3] VLADIMIROV, D.A.: Boolesche Algebren, Akademie Verlag, 1978 The relations 1) - 8) define the value II e II M for cer-
(translated from the Russian). tain non-complete Boolean algebras as well; the only
[4] SIKORSKI, R.: Boolean algebras, Springer, 1969.
condition is that the infinite unions and intersections in
Yu.M. Ryabukhin
7) and 8) exist. The concept of a Boolean-valued model
Editorial comments. The operation
can also be introduced for languages with more than
x+ y=(x n Cy) u
(y n Cx) is known as the symmetric
one kind of variables. In such a case each kind of vari-
difference. Think of the Boolean algebra of all subsets of a
able has its own domain of variation VM.
given set under union, intersection and complement to inter-
pret these formulas.
A closed formula I/> is said to be true in a B-mode1 M
(MI=I/ if
References
[A 1] RUDEANU, S.: Boolean functions and equations, North-
Holland,1974. A B-model M is said to be a model of a theory T if
MI=I/> for all axioms I/> of T. If h is a homomorphism of
AMS 1980 Subject Classification: 06E20, 16A30,
a Boolean algebra B into a Boolean algebra B'
16A32
preserving infinite unions and intersections, then there
BOOLEAN-VALUED MODEL - A model defined as exists a B' model M' such that
follows. Let ~ be the signature of some first-order
language L with one kind of variables, i.e. ~ is the set for each closed formula I/> of L M If the universe of a
of symbols of functions and predicates. A Boolean-
model M is countable, then there exists a homomor-
valued model then is a triple M=(BM' VM, ~M)' where
phism h into the Boolean algebra {O, I}, under which
BM is a non-degenerate Boolean algebra, VM is a non-
M is transformed into the classical two-valued model
empty set, and ~M is a function defined on ~ such that M' such that
0M(P) E V~M
if p is an n-place function symbol, and It has been shown that a theory T is consistent if and
only if it has a Boolean-valued model. This theorem
0M(P) E BJ/ forms the basis of the application of the theory of
if p is an n-place predicate symbol. The symbol X Y Boolean-valued models to problems of the consistency
denotes the set of all functions defined on Y with of axiomatic theories.
values in X and xn = x{m: m <n}, where n ~O is a If the Boolean-valued model of a theory T is con-
natural number. The Boolean algebra BM is called the structed by means of another axiomatic theory S, then
set of truth values of the model M. The set VM is called one obtains the statement on the consistency of T rela-
the universe of M. A Boolean-valued model M is also tive to S. Thus, the result due to P. Cohen on the con-
called a B-model if the set of truth values is the sistency of the theory ZF+(i~o>Nl) relative to ZF is
Boolean algebra B, BM = B. If a Boolean algebra B is a obtained by constructing the respective Boolean-valued
two-element algebra (i.e. B = {O, I}), then the B-model model by means of the system ZF (cf. Forcing method).
M is the classical two-valued model. The construction of the Cohen forcing relation p I~I/> is
Let LM be a language, complemented by new indivi- equivalent to that of a Boolean-valued model M such

431
BOOLEAN-VALUED MODEL

that while that bounded by a hyperbolic Booth lemniscate is


II cp II M = {P: p II- .-.c/>}.
a 2 -b 2 a ab
References S = 2 arctgb+T'
[1) RAsIOWA, E. and SIKORSKI, R.: The mathematics of
metamathematics, Polska Akad. Nauk, 1963. The Booth lemniscate is a special case of a Persian
[2) JECH, T.J.: Lectures in set theory: with particular emphasis on curve.
the method offorcing, Springer, 1971. Named after J. Booth [1).
[3) TAKE1ITl, G. and ZARING, W.M.: Axiomatic set theory,
Springer, 1973. References
[4) MANIN, Yu.I.: 'The problem of the continuum', J. Soviet [1) BOOTH, J.: A treatise on some new geometrical methods, 1-2,
Math. S, no. 4 (1976),451-502. (ftogi Nauk. i Tekhn. Sovrem. London, 1873-1877.
Problemy S (1975),5-73) [2) SAVELOV, A.A.: Plane curves, Moscow, 1960, pp. 144-146 (in
v.N. Grishin
Russian). D.D. Sokolov
Editorial comments.
AMS 1980 Subject Classification: 53A04
References
[A1) BELL, J.L.: Boolean-valued models and independence proofs
BORDERING ME1HOD - A method for solving a
in set theory, Clarendon Press, 1977.
[A2) JECH, T.J.: Set theory, Acad. Press, 1978. system of linear algebraic equations Ax =b with a non-
[A3) KUNEN, K.: Set theory, North-Holland, 1980. degenerate matrix by inverting a matrix and calculating
AMS 1980 Subject Classification: 03C90 a determinant, based on passing recursively from the
solution of the problem with matrix
BOOTH LEMNISCATE - A plane algebraic curve of
order four whose equation in orthogonal Cartesian
coordinates is
ak-I.I ak-I.k-I
(x 2 +y2)2 +(2m 2 +n)x 2 +(2m 2 -n)y2 = o.
to the solution of the problem with the matrix A k ,
If I n I <2m 2, the Booth lemniscate is called elliptic (it
which can be considered as the result of bordering
has singular point 0 (Fig. 1), where O<n<2m2). If
Ak -
I n I > 2m 2 , the Booth lemniscate is called hyperbolic (it I
The calculating scheme of the bordering method for
has a nodal point at the coordinate origin, cf. Fig. 2,
the inversion of a matrix is as follows: Let Ak -I be a
where n >2m 2 ).
non-degenerate matrix. In the inversion of the matrix
y
A k , the representation

IIAk-1 I

ob
Ak = Vk
Uk
akk (1)
o x
is used, where Uk =(a I,k> ... ,ak + I,kf and
..x Vk=(ak , I, . . . ,akk-I).
, Then
A-I A-I
-I k-IUkVk k-I Ak~IUk
A k- I +
Ci.k Ci.k
Fig. l. Fig. 2. Ak l (2)
vkAk~1
Ci.k Ci.k
The equation of an elliptic Booth lemniscate in polar
coordinates is Ci.k = akk-vkAk~IUk'
p2 = a 2 cos2 cp+ b 2 sin2 cp or p o. Subsequent inversion of the matrices A I, . . . ,An by
this scheme gives the matrix A -I .
If n > 2m 2, the equation of a hyperbolic Booth lemnis-
The above scheme of the bordering method is only
cate has the form
applicable to matrices with principal minors unequal to
p2 = a2cos2cj>-b2sin2cj>; zero. Generally, a choice of the principal element is
and if n < - 2m 2 adopted. In this scheme, the bordering rows and
p2 = -a 2 cos 2 cj>+b 2 sin2 cj> columns used are those for which ak=akk-vkAk~IUk
has the maximum absolute value. The calculated matrix
(a 2 = 12m 2 +n I, b2 = 12m 2 -n I). will then differ from A -I only in the rearrangement of
The arc length of a Booth lemniscate is expressed by the rows and columns [1].
elliptic integrals. The area bounded by an elliptic The hordering method is not the fastest of the direct
Booth lemniscate is methods\ of inversion of a matrix.
The bordering method permits an effective inversion
of a triangular matrix. If Ak is a right (upper-) triangu-

432
BORDERING METHOD

lar matrix, then in (1) ing method is equivalent to the Gauss method, one of
the fastest direct methods of solving systems.
The bordering method allows one to solve higher-
order systems owing to the effective use of computer
o memory. This is caused by the fact that for the calcula-
tion of the vectors x (k,p) , P > k, only the storage of the
The amount of calculation in this instance is reduced vectors x(k -I,p), P >k -1, and the coefficients of a k-th
six-fold. order system of equations is necessary, that is, a series
The bordering method is particularly effective when of numbers of length j(k)=k(n - k + l)+(n + 1). It is
inverting Hermitian positive-definite matrices. For these therefore sufficient to have a working capacity
matrices it is not necessary to use a scheme with a (n + 1Xn + 5) /4::::::(n /2)2 to solve a system of order n.
choice of a principal element. Moreover, they are Moreover, the elements of the matrix and of the right-
determined by only half of their elements. The calcu- hand side do not have to be put into the computer
lating scheme in this instance is simplified thus: memory immediately but successively, row by row.
A-I
pr
+-k k
Pk It is advisable to use the bordering method when
k-I -
ak ak solving systems for which a truncated system has
Ak l already been solved. Relation (4) then immediately
Pi,
ak ak
gives the required solution.
The scheme of the bordering metho.d described can
be used to calculate the determinant. It follows from
The calculating scheme of the bordering method for (1) that
the solution of a system is as follows. Let
1 Ak 1 = (akk-vkAk~IUk) 1 Ak- I I,

b (kp)-(
- alp' ... ,akp')T k=l , ... , n', b(n,n+I)=-b . If Recurrent use of this relation gives I A I
Ak -I is a non-degenerate matrix and x(k -I,p) is the As with the inversion of a matrix, the solution of a
solution of the system Ak_IX(k-l,p)+b(k-l,p)=O, then system and the calculation of the determinant by means
the solution X(k,p) of the system Akx(k,p) +b(k,p) =0 is of the bordering method is possible only for matrices
obtained from the representation with non-zero principal minors. It is generally also
necessary here to use a scheme with a choice of a prin-
Ak-1 b(k-I,k) II = Ilb(k-I,P) I
A
k
=
II Vk akk '
b(k,p)
akp ,
(3) cipal element.
References
and from (2) as follows [1) VOEVODIN, V,V.: Numerical methods of algebra, Moscow, 1966

x (k,p) = -
I Ak~1 llllb(k-I,P) II + (in Russian).
(2) FADDEEv, D.K. and FADDEEVA, V,N,: Computational methods
o 0 akp
of linear algebra, Freeman, 1963 (translated from the Russian).
G.D. Kim
Editorial comments. The bordering method is a special
case of more general partitioning procedures [A 1].
= Ilx(k-I,P) 11_ VkX(k-l,p)+akp Ilx(k-l,k) II
The bordering method defined above is related to the
o akk+vkx(k-I,k) 1 (4) Sherman- Morrison formula and its generalization the
Sherman- Morrison- Woodbury formula. The
In this way, through the solutions x(k -I,p) and x(k -I,k) Sherman - Morrison formula states that a non-singular
of the systems with the same matrix Ak -I and with (m X m)-matrix M and vectors x and yare related by
various right-hand sides, it is easy to obtain the solu- [M+xyHr' = M-'-M-'xc-'yHM-', C=1+yHM-'x,
tion of the system with the bordered matrix A k The
provided that C=FO. The Sherman - Morrison - Woodbury
solution of the initial sytem is x(n,n + I). This can be
formula replaces the vectors x and y by (mxn)-matrices X
obtained by the recurrent use of relation (4). This leads and Y. These formulas were developed in [A2] - [A5] (see
to the subsequent calculation of the set of vectors x (k,p) , also [A6]) , and used for the construction of matrix inversion
k= 1, ... ,n,p>k, that is schemes resembling that of the bordering method. An
X(12), X(l3), X(I,n+l) , excellent discussion of these schemes is given in [A7].

X(23), x(2,n + I), References


[A1] WESTLAKE, l.R.: A handbook of numerical matrix inversion
and solution of linear equations, Wiley, 1968, Sect. 2.6.
x(n -I,n +I), [A2] BARTLETT, M.S.: 'An inverse matrix adjustment arising in
x(n,n+I), discriminant analysis', Ann. Math. Stat. 22 (1951),107-111.
[A3] SHERMAN, l.: Computations relating to inverse matrices, Appl.
In the amount of calculation necessary, this border- Math., 29, Nat. Bur. Standards, 1953, pp. 123-124.

433
BORDERING METHOD

[A4] SHERMAN, J. and MORRISON, W.J.: 'Adjustments of an inverse go:No ~ Mo and g:N ~ M.
matrix corresponding to changes in the elements of a given
column or a given row of the original matrix', Ann. Math. A set of mutually bordant manifolds is called a bordism
Stat. 20 (1949), 621. class, while the triplet (W, M o, M) is sometimes called
[A5] WOODBURY, M.: 'Inverting modified matrices', Memorandum
Report Stat. Res. Group Princeton 42 (1950).
a bordism (it would be more accurate to take
[A6] GoLUB, G.H. and loAN, C.F. VAN: Matrix computations, (W, Mo, M, go, g. The set of bordism classes of n-
North Oxford Acad., 1983. dimensional manifolds forms an Abelian group ~" with
[A7l HOUSEHOWER, A.S.: Principles of numerical analysis,
McGraw-Hili, 1953.
respect to unconnected union. The zero element of the
group is the bordism class consisting of the manifolds
AMS 1980 Subject Classification: 65F05, 65F10, M which constitute the boundary of a certain manifold
15A09 W (one formally speaks of the triplet (W, M o, M) with
BORDERING OF A SPACE X in a compactification
empty M 0; other appellations are: M is a 'bounding
manifold', or M is 'internally homologous to zero' or is
bX - A finite family {U 1, . . . , Uk} of sets open in X
'bordantly zero'). The element of ~" inverse to a given
such that the set K~X\ (U I U .~.. U Uk) Js com-
bordism class is this class itself (since MUM is dif-
pact, and bX=K U U I U ... U Ub where Uj is the
feomorphic to the boundary of the direct product
largest open set in bX the intersection of which with X
M X [0, I D. The direct sum ~. of the groups ~" is a
is the set U; (X is assumed to be completely regular).
commutative graded ring in which the multiplication is
The concept of a bordering of a space X in bX coin-
induced by the direct product of manifolds with unit
cides with the concept of an almost-extendable border-
element given by the bordism class of a point.
ing of a proximity space X (the proximity on X is
More complex variants comprise the bordism of
induced by the extension bX), formulated in terms of
smooth closed manifolds with a supplementary struc-
the proximity: apart from K being compact, it is neces-
ture. For instance, two oriented manifolds M 0 and M
sary that for any neighbourhood (f) K, the family
are orientedly bordant if they are bordant in the sense
{ (f) K, U J, , Uk} is a uniform covering of the space
explained above, if the 'membrane' W is oriented and
X. A bordering of a space X in its Stone - Cech com-
if, using the above notation, the orientation induced by
pactification is simply called a bordering of X. In the
the orientation of W on No and N (which are parts of
language of borderings, a series of theorems has been
the boundary) goes over under the diffeomorphisms go
formulated on the dimensions of the remainder of com-
and g, respectively, to the original orientation on M
pactifications of topological and proximity spaces.
and the opposite orientation on Mo. One then speaks
References of oriented bordism; if it is desired to stress the differ-
[II SMIRNOV, Yu.M.: 'On the dimensions of remainders of com-
pactifications of proximity and topological spaces', Mat. Sb.
ence between such bordism and bordism in the sense
71, no. 4 (1966), 554-482 (in Russiap). given above, then one speaks of the latter as non-
v. V. Fedorchuk oriented bordism. In analogy with ~n and ~. one intro-
Editorial comments. A concept related to the bordering duces the groups of oriented bordism nnand the ring
of a space is that of a border cover. A collection <t of open n.=~n".
sets such that X\ U <t is compact. Border covers work in
Historically, the first example was that of the bor-
a sense opposite to borderings. In the case of borderings a
dism of nested manifolds, introduced in 1938 by L.S.
compactification is given; from certain systems of border
covers one can construct compactifications whose
Pontryagin, who showed that the classification of such
remainders can have special properties. bordisms is equivalent to the calculation of the homo-
topy groups of the spheres 7Ti(S"), and who could
AMS 1980 Subject Classification: 54020, 54035,
54040 determine 7T" + 1(S") and 7T" +2(S") in this way (see [2]
for a detailed account of his studies, and [4] for an
BORDISM, bordantism - A term used by itself or as introductory text). Non-oriented and oriented bordisms
a part of standard expressions in a number of similar were introduced in 1951 - 1953 by V.A. Rokhlin [3],
meanings. The older term, cobordism, is still employed. who calculated ~" and nn for n ~4. It had been previ-
The simplest variant is the following. Two smooth ously shown by Pontryagin [1] that if two manifolds are
closed n-dimensional manifolds M 0 and M are bordant bordant, their characteristic numbers are identical
(cobounding or internally homologous) if there exists a (Stiefel- Whitney numbers for non-oriented manifolds,
smooth compact (n + I)-dimensional manifold W (a Stiefel- Whitney and Pontryagin numbers for oriented
'membrane') the boundary of which consists of two manifolds). It was subsequently found that the converse
manifolds No and N which are diffeomorphic, respec- proposition is also true.
tively, to M 0 and M under certain diffeomorphisms (cf. Modem methods of algebraic topology were first
Diffeomorphism) applied in the theory of bordism in 1954 by R. Thorn

434
BOREL - CANTELLI LEMMA

[5], [6], who rediscovered (for the case of both oriented 9Cn(X) and each mapping I> with the mappings I>n is a
and non-oriented bordisms) the connection between generalized homology theory. In the present case it
bordisms and certain homotopy problems. Thus, the reduces to ordinary homology, viz., for any cellular
group 9Cn is isomorphic to the group '!Tn +r(TBO(r)) for polyhedron X
a sufficiently large r; here, TBO(r) is the Thorn space of L9?n(X) ~ H.(X, ~)9?
the universal vector bundle with structure group OCr).
Owing to this connection, Thorn was able to make a (the tensor product of graded modules on the right-
complete computation of the ring 9C. and to contribute hand side of this expression is over Z:J. = Z / (2)), but
substantially to the study of ~" which was subse- this is generally not true for other bordisms (oriented,
quently continued by other workers. Thus, 9C. proved etc.). Many generalized homology theories may be
to be the ring of polynomials over the field of residues obtained by way of a so-called bordism with singulari-
modulo 2 in the generators Xi of dimension i, where i ties [14].
runs through all positive numbers not equal to 2S -1, In addition to the bordism of a space X there exist
s ~ 1; a geometric realization of these generators is generalized cohomology theories which are dual to it.
known (i.e. concrete manifolds whose bordism classes The introduction of these generalized homology and
are Xi have been given [7]). cohomology functors has made it desirable to introduce
Other variants of bordism of manifolds with a sup- the changed terminology discussed at the beginning of
plementary structure comprise the very important bor- the present article. Thus, the term 'cobordism' is res-
disms of quasi-complex manifolds (also called unitary tricted to generalized cohomology theories dual to bor-
bordism or complex cobordism [8], [9]), and bordism of dism.
manifolds acted upon by a group of transformations References
[10]. There are also variations of another kind (for [1) PONTRYAGIN, L.S.: 'Characteristic cycles of differentiable man-
ifolds', Mat. Sb. 21 (63), no. 2 (1947), 233-284 (in Russian).
piecewise-linear or topological manifolds, for Poincare [2) PONTRYAGIN, L.S.: Smooth manifolds and their applicatiOns in
complexes, etc. [11]). Special kinds of bordisms include homology theory, Moscow, 1976 (in Russian).
foliated bordism and h-bordism (previously referred to as [3) ROKHLIN, V.A.: 'Theory of inner homology', Uspekhi Mat.
Nauk 14, no. 4 (1959), 3-20 (in Russian).
J-equivalences); these are used for relating differential [4A) WALLACE, A.H.: Differentiable topology. First steps, Benjamin,
and homotopy topological properties [12]. 1968.
Further development of bordism theory is related to [4B) MILNOR, J.W.: Topology from the differentiable viewpoint, Uniy.
Press of Virginia, 1965.
the bordism groups of a topological space X (bordism of [5) THOM, R.: 'Quelques proprietes globales des yarietes
spaces for short, cf. [13]). They are defined for dif- differentiables', Comm Math. Helvetia 28 (1954), 17-86.
ferent variants of bordisms (the simplest example is [6) MILNOR, J.W. and STASFEFF, J.D.: Characteristic classes,
given below). A singular n-dimensional (sub)manifold of Princeton Uniy. Press & Uniy. of Tokyo Press, 1974.
[7) DoLO, A.: 'Erzeugende der Thomschen Algebra 9?',
a space X is a pair (Mn ,f), where M n is a closed Math. Z. 65 (1956),25-35.
smooth manifold, and f: Mn~x is a continuous map- [8) MILNOR, J.: 'On the cobordism ring (r and a complex analo-
ping. Two such pairs (M o,fo), (Mn,f) are bordant if gue 1', Amer. J. Math. 82 (1960),505-521.
[9) NOVDWV, S.P.: 'Homotopic properties of Thorn-complexes',
M 0 and M are bordant in the ordinary sense, and Mat. Sb. 57 (99) (1962), 406-442 (in Russian).
(using previous notations) if there exists a continuous [10) CONNER, P.E. and FLOYD, E.E.: Differentiable periodic maps,
o'
mapping h: W~X such that hg =fo, hg-' =fo. (If Springer, 1964.
[11) STONG, R.E.: Notes on cobordism theory, Princeton Uniy. Press,
M 0 and M are identified with No and N, one can sim- 1968.
ply say that the mapping h induces the given mappings [12) MILNOR, J.: Lectures on the h-cobordism theorem, Princeton
of M 0 and Minto x.) The bordism classes of singular Uniy. Press, 1965.
manifolds in a space X form the n-dimensional bordism [13) ATIYAH, M.F.: 'Bordism and cobordism', Proc. Cambridge Phi-
los. Soc. (2) 57 (1961), 200-208.
group 9Cn (X) of this space (the group operation is gen- [14A) BAAS, N.A.: 'On bordism theory of manifolds with singulari-
erated by the union of manifolds). ties', Math. Scand. 33 (1973), 279-302.
If X is a manifold of dimension > 2n, the elements of [14B) BAAS, N.A.: 'On formal groups and singularities in complex
bordism theory', Math. Scand. 33 (1973),302-313.
9Cn (X) can be visualized as submanifolds, as can the
D. V. Anosov
corresponding 'membranes'; in this respect bordism of
M.I. Voitsekhovskii
spaces resembles the original attempts at introducing
homologies. If X is a point, 9Cn (X) is reduced to the AMS 1980 Subject Classification: 57N70
previous 9Cn. To a mapping 1>: X~y correspond
homomorphisms I>n: 9Cn(X)~9Cn(Y)' generated by the BOREL - CANTELLI LEMMA - A frequently used
transition from a singular manifold (M, f) to another statement on infinite sequences of random events. Let
one (M, I>f). The functor by means of which each A" ... ,An, ... , be a sequence of events from a cer-
space X is brought into correspondence with the groups tain probability space and let A be the event consisting

435
BOREL - CANTELLI LEMMA

in the occurance of (only) a finite number out of the (not necessarily algebraically-closed) field k: Let V be a
events An, n =1, 2, . ... Then, according to the complete variety defined over a field k on which a con-
Borel- Cantelli lemma, if nected solvable k-split group G acts regularly, then the
set of rational k-points V(k) is either empty or it con-
(*) tains a point which is fixed with respect to G. Hence
the generalization of the theorem of conjugation of
then
P(A) = 1. Borel subgroup is: If the field k is perfect, the maximal
If the events An are mutually independent, then connected solvable k-split subgroups of a connected k-
P(A) = 1 or 0, depending on whether the series defined algebraic group H are mutually conjugate by
elements of the group of k-points of H [2].
~:=l P(An) converges or diverges, i.e. in this case the
condition (*) is necessary and sufficient for P(A) = 1; References
this is the so-called Borel criterion for 'zero or one' (cf. [1) BOREL, A.: 'Groupes lineaires algebriques', Ann. of Math. (2)
64, no. 1 (1956), 20-82.
Zero-one law). This last criterion can be generalized to [2) BOREL, A.: Linear algebraic groups, Benjamin, 1969.
include certain classes of dependent events. The [3] MOROZOV, V.V.: Dokl. Akad. Nauk SSSR 36, no. 3 (1942),
Borel- Cantelli lemma is used, for example, to prove 91-94.
v.P. Platonov
the strong law of large numbers.
AMS 1980 Subject Classification: 22A05
References
[I) BoREL, E.: 'Les probabilites denombrables et leurs applications
arithmetique', Rend. Circ. Mat. Palermo (2) 27 (1909),247-271. BOREL FUNcrION, B-function - A function for
[2) CANfELLI, F.P.: 'Sulla probabilita come limite della frequenza',
Atti Accad. Naz. Lincei 26, no. I (1917), 39-45. which all subsets of the type E(x: f(x)~a) in its
[3) LoEVE, M.: Probability theory, Princeton Univ. Press, 1963. domain of definition are Borel sets (cf. Borel set). Such
A. V. Prokhorov functions are also known as Borel-measurable functions,
Editorial comments. The Borel- Cantelli lemma can be or B-measurable functions. The operations of addition,
used in number theory to prove the so-called 'normality' of multiplication and limit transition - as in the general
almost-all natural numbers, cf. [A1], Chapt. 8, Sect. 6. case of measurable functions - do not take one out-
References side the class of Borel functions, but, unlike in the gen-
[A1) FELLER, W.: An introduction to probability theory and its eral case, the superposition of two Borel functions does
applications, 1, Wiley, 1957. also not lead outside the class of Borel functions. More-
AMS 1980 Subject Classification: 60A05 over, [1], if f is a measurable function on an arbitrary
space n and if g is a Borel function on the space of real
BOREL FIELD OF EVENTS, a-field, Borel algebra,
numbers, then the function h =g[f(x)] is measurable
a-algebra of events - A class A of subsets (events) of a on n. All Borel functions are Lebesgue-measurable (cf.
non-empty set n (the space of elementary events) which Measurable function). The converse proposition is not
is a Borel field of sets. true. However, for any Lebesgue-measurable function f
v. V. Sazonov there exists a Borel function g such that f(x)=g(x)
AMS 1980 Subject Classification: 28A05 almost-everywhere [1]. Borel functions are sometimes
called Baire functions, since the set of all Borel func-
BOREL FIELD OF SETS, family of Borel sets, gen- tions is identical with the set of functions belonging to
erated by a system of sets M - The smallest system of the Baire classes (Lebesgue's theorem, [2]). Borel func-
sets containing M and closed with respect to the opera- tions can be classified by the order of the Borel sets
tions of countable union and taking complements. E(x: f(x)~a); the classes thus obtained are identical
A.G. El'kin with the Baire classes.
The concept of a Borel function has been generalized
AMS 1980 Subject Classification: 54C50, 54C99
to include functions with values in an arbitrary metric
BOREL FIXED-POINT THEOREM - A connected space [3]. One then also speaks of B-measurable map-
solvable algebraic group G acting regularly (cf. Alge- pings. Borel functions have found use not only in set
braic group of transfonnations) on a non-empty com- theory and function theory but also in probability
plete algebraic variety V over an algebraically-closed theory [1], [4].
field k has a fixed point in V. It follows from this References
theorem that Borel subgroups (cf. Borel subgroup) of [I) HALMos, P.R.: Measure theory, v. Nostrand, 1950.
algebraic groups are conjugate (The Borel- Morozov [2] HAUSDORFF, F.: Set theory, Chelsea, reprint, 1978.
[3) KURATOWSKI, K.: Topology, Acad. Press, 1966-1968 (translated
theorem). The theorem was demonstrated by A. Borel from the French).
[l]. Borel's theorem can be generalized to an arbitrary [4) KOIMOGOROV, A.N.: Foundations of the theory ofprobability,

436
BOREL MEASURE

Chelsea, reprint, 1950 (translated from the Russian). is countably additive. A Borel measure
J.L J.L is called reg-
VA. Skvortsov ular if
11.(.E) = sup- p.(F),
AMS 1980 Subject Classification: 26A21, 26A30, E:JF

28A20, 28A05 where F belongs to the class $' of closed subsets in X.


The study of Borel measures is often connected with
BOREL ISOMORPHISM, B-isomorphism - A one- that of Baire measures, which differ from Borel meas-
to-one mapping f of a space X into a space Y such that ures only in their domain of definition: They are
both f and f- 1 transform Borel sets into Borel sets (cf. defined on the smallest a-algebra !J# 0 with respect to
Borel set). In the class of Borel subsets of complete which all continuous functions on X are measurable. A
separable metric spaces, sets of the same cardinality are Borel measure J.L (or a Baire measure v) is said to be T-
Borel isomorphic. smooth if J.L(Fa)tO for any net {Fa} of closed sets which
A.G. El'kin
satisfies the condition Fat 0 (or v(ZaHO for any net
AMS 1980 Subject Classification: 54C50, 54C99
{Z a} of sets which are zero sets of continuous func-
tions and such that Z at 0 ). A Borel measure J.L (or
BOREL - LEBESGUE COVERING THEOREM - Let
Baire measure v) is said to be tight if
A be a bounded closed set in R n and let G be an open
covering of it, i.e. a system of open sets the union of p.(E) = E :JKEf
sUP p.(K),

which contains A; then there exists a finite subsystem


where .Y( is the class of compact subsets on X (or
of sets {G;}, i = 1, ... ,N, in G (a sub covering) which
is also a covering of A, i.e. P(E) = E :JKEf
sUP p' (K),

N
A C UG,. where
i=!

The Borel- Lebesgue theorem has a converse: If


A eRn and if a finite sub covering may be extracted Tightness and T-smoothness are restrictions which
from any open covering of A, then A is closed and ensure additional smoothness of measures, and which
bounded. The possibility of extracting a finite subcov- in fact often hold. Under certain conditions Baire
ering out of any open covering of a set A is often taken measures can be extended to Borel measures. For
to be the definition of the set A to be compact. Accord- instance, if X is a completely-regular Hausdorff space,
ing to such a terminology, the Borel- Lebesgue then any T-smooth (tight) finite Baire measure can be
theorem and the converse theorem assume the follow- extended to a regular T-smooth (tight) finite Borel
ing form: For a set A eRn to be compact it is neces- measure. In the study of measures on locally compact
sary and sufficient for A to be bounded and closed. spaces Borel measures (or Baire measures) is the name
The theorem was proved in 1898 by E. Borel [1] for the sometimes given to measures defined on the a-ring of
case when A is a segment [a, b] e R 1 and G is a system sets generated by the compact (or G8 -compact) sets
of intervals; the theorem was given its ultimate form by and which are finite on compact sets. Often, by the
H. Lebesgue [2] in 1900 - 1910. Alternative names for Borel measure on the real line one understands the
the theorem are Borel lemma, Heine-Borel lemma, measure defined on the Borel sets such that its value on
Heine- Borel theorem. an arbitrary segment is equal to the length of that seg-
ment.
References
[I) BOREL, E.: Ler;ons sur la theorie des fonctions, Gauthier-Villars, References
1928. [1] VARADARAJAN, V.S.: 'Measures on topological spaces', Transl.
[2) RUDIN, W.: Principles of mathematical analySiS, McGraw-HilI, Amer. Math. Soc. Ser. 248 (1965), 161-228. (Mat. Sb. 55 (97),
1953. no. 1 (1961),35-100)
I.A. Vinogradova [2] HALMOS, P.R.: Measure theory, v. Nostrand, 1950.
AMS 1980 Subject Classification: 54030 [3] NEVEU, J.: Bases mathematiques du calcul des probabi/ites, Mas-
son, 1970.
VV Sazonov
BOREL MEASURE of sets - A non-negative function
J.L of the subsets of a topological space X possessing the Editorial comments.
following properties: 1) its domain of definition is the References
a-algebra !J# of Borel sets (cf. Borel set) in X, i.e. the [A1] ROYDEN, H.L.: Real analysis, Macmillan, 1968.
[A2] ZAANEN, A.C.: Integration, North-Holland, 1967.
smallest class of subsets in X containing the open sets
[A3] RUDIN, W.: Principles of mathematical analysis, McGraw-Hili,
and closed with respect to the set-theoretic operations 1953.
performed a countable number of times; and 2) [A4] RUDIN, W.: Real and complex analysis, McGraw-Hili, 1966.
[AS] TAYLOR, A.E.: General theory of functions and integration,
J.L( U ~=lEi)="2:~=lJ.L(Ei) if E; nEj= 0 when i-=Fj, i.e. Blaisdell, 1965.

437
BOREL MEASURE

[A6] ALIPRANTZ, C.D. and BURLEINSHAW, 0.: Principles of real


References
analysis, North-Holland, 1981. [A1] KURATOWSKI, K.: Introduction to set theory and topology,
AMS 1980 Subject Classification: 28A12 Pergamon, 1961.

AMS 1980 Subject Classification: 54C50, 54C99


BOREL SET, B-set - A set which may be obtained
as the result of not more than a countable number of BOREL SET, CRITERION FOR A - A necessary and
operations of union and intersection of closed and open sufficient condition for an d -set in a complete separ-
sets in a topological space. More exactly, a Borel set is able metric space to be a Borel set. Two criteria can be
an element of the smallest countably-additive class of stated as follows: 1) its complement must also be an
sets containing the closed sets, and which is closed with d-set (Suslin's criterion); and 2) it can be represented
respect to complementation. Borel sets are also called as the union of non-intersecting components (Luzin's
Borel-measurable sets and B-measurable sets. Open and criterion).
A.G. El'kin
closed sets are said to be Borel sets of order zero. Borel
sets of order one are sets of type F" or G 8 which are, AMS 1980 Subject Classification: 54C50
respectively, countable sums of closed sets and count-
BoREL SET OF AMBIGUOUS CLASS a - A Borel
able intersections of open sets. Borel sets of the second
subset of a metric, or (more general) of a perfectly-
order are sets of type F ,,8 (the intersection of a count-
normal topological, space that is at the same time a set
able number of sets of type F (J) and sets of type Goo
of additive class a and of multiplicative class a, i.e.
(the union of a countable number of sets of type G8 ).
Borel sets of an arbitrary finite order are defined in a
belongs to the classes Fa and Ga at the same time.
similar manner by induction. With the aid of transfinite Borel sets of ambiguous class 0 are the closed and open
numbers of the second class (cf. Transfinite number) sets. Borel sets of ambiguous class I are sets of types
this classification may be exhaustively extended to all F" and G8 at the same time. Any Borel set of class a is
Borel sets. Let a be an arbitrary transfinite number of a Borel set of ambiguous class fi for any {3>a. The
Borel sets of ambiguous class a form a field of sets.
the second class; the Borel sets of class a will include
any Borel set of order a that is not a Borel set of order References
[1) KURATOWSKI, K.: Topology, 1, Acad. Press, 1966 (translated
a' for any a' <a. Whether the classes of Borel sets are
from the French).
empty or not will depend on the basic space under con- [2) HAUSDORFF, F.: Set theory, Chelsea, reprint, 1978 (translated
sideration. In Euclidean, Hilbert and Baire spaces there from the German). A. G. El'kin
exist Borel sets of all classes.
Editorial comments. The notations Fa, Ga are still
Borel sets are a special case of d -sets. For an d -set current in topology. Outside topology one more often uses
E to be a Borel set it is necessary and sufficient that the notation ~2, II2, respectively. For a~w one has
the complement of E also is an d -set (Suslin's cri- Fa=~2, Ga =II2; but ~g+1=Fn and IIg+ 1 =Gn for n<w.
terion). In spaces with a Lebesgue measure all Borel The notation for the ambiguous classes is ~2 =~2 II~. n
sets are Lebesgue measurable. The converse proposition See also [A 1].
is not true. All separable spaces having the cardinality References
of the continuum contain sets that are not Borel sets. [A1] MOSCHOVAKIS, Y.: Descriptive set theory, North-Holland,
Borel sets were introduced by E. Borel [1]; they play 1980.
an important role in the study of Borel functions (cf. AMS 1980 Subject Classification: 54H05, 04A15
Borel function).
In a more general sense a Borel set is a set in an BOREL STRONG LAW OF LARGE NUMBERS -
arbitrary Borel system of sets generated by some sys- Historically, the first variant of the strong law of large
tem of sets. The Borel sets in a topological space are numbers, formulated and proved by E. Borel [1] in the
generated by the system of closed subsets of this space. context of the Bernoulli scheme (cf. Bernoulli trials).
References Consider independent random variables
[I] BOREL, E.: Lefons sur les fonctions discontinues, Gauthier- Xi> ... ,Xn , . . . , which are identically distributed and
Villars, 1898.
[2] KURATOWSKI, K.: Topology, Acad. Press, 1966-1968 (translated
assume one of two values 0 and 1 with probability of
from the French). 112 each; the expression Sn =~~ = I Xk will then give
[3] HAUSDORFF, F.: Set theory, Chelsea, reprint, 1978. the number of successful trials in a Bernoulli scheme in
[4] ALEKSANDROV, P.S.: EinfiIhrung in die Mengenlehre und die
Theorie der reelen Funktionen, Deutsch. Verlag Wissenschaft., which the probability of success is 112. Borel [1]
1956 (translated from the Russian). showed that
V.A. Skvortsov Sn 1
-~

Editorial comments. For notational questions, see Borel n 2


set of ambiguous class. with probability one as n~oo. It was subsequently

438
BOREL SUMMA nON METHOD

(1914) shown by G.H. Hardy and J.E. Littlewood that, a role in the theory of algebraic groups over k similar
almost certainly, to that of the Borel groups. For example, two such par-
abolic subgroups are conjugate by an element of G(k)

lim
n~oo
Is"-~
yin
I<
V2'
Inn
_1_
[2].
References
[i] BOREL, A.: 'Groupes lineaires algebriques', Ann. of Math. (2)

l=l
after which (1922) the stronger result: 64, no. I (1956), 20-82.
-!!... [2] BOREL, A. and TITS, J.: 'Groupes roouctifs', Publ. Math. IHES
n 2 27 (1965), 55-150.
P -li
m - -I-
- VP. Platonov
n-->oo Yn In Inn V2
AMS 1980 Subject Classification: 20D25, 20G99

was proved by A.Ya. Khinchin. See also Law of the


iterated logarithm. BOREL SUMMATION MEI'HOD - A method for
summing series of functions, proposed by E. Borel [1].
References
[I] BOREL, E.: 'Les probabilites denombrabies et ieurs applications Suppose one is given a series of numbers
arithmetique', Rend. Circ. Mat. Palermo (2) 27 (1909),247-271.
[2] KAc, M.: Statistical independence in probability, analysis and (*)
number theory, Math. Assoc. Amer., i963. A V P kh
. . ro orov
let Sn be its partial sums and let S be a real number.
AMS 1980 Subject Classification: 60F15 The series (*) is summable by the Borel method (B-
method) to the number S if
BOREL SUBGROUP - A maximal connected solv-
. -x 00 Xk _
able algebraic subgroup of a linear algebraic group G. x~ e k~O kf Sk - S.
Thus, for instance, the subgroup of all non-singular
upper-triangular matrices is a Borel subgroup in the There exists an integral summation method due to
general linear group GL(n). A. Borel [I] was the first to Borel. This is the B' -method: If
carry out a systematic study of maximal connected a Uk
fe- u Lkk-
00 00

solvable subgroups of algebraic groups. Borel sub- , du = S,


o k =0 .
groups can be characterized as minimal parabolic sub-
then one says that the series (*) is summable by the
groups, i.e. algebraic subgroups H of the group G for
B '-method to the number S. For conditions under
which the quotient variety G / H is projective. All
which the two methods B and B' are equivalent, cf. [2].
Borel subgroups of G are conjugate and, if the Borel
The B-method originated in the context of analytic
subgroups B I, B 2 and the group G are defined over a
extension of a function regular at a point. Let
field k, B I and B 2 are conjugate by an element of
G(k). The intersection of any two Borel subgroups of a
group G contains a maximal torus of G; if this intersec- n =0

tion is a maximal torus, such Borel subgroups are said be regular at the point 0 and let C be the set of all its
to be opposite. Opposite Borel subgroups exist in G if singular points. Draw the segment OP and the straight
and only if G is a reductive group. If G is connected, it line Lp normal to OP through any point PEe. The set
is the union of all its Borel subgroups, and any para- of points on the same side with 0 for each straight line
bolic subgroup coincides with its normalizer in G. In Lp is denoted by II; the boundary r of the domain II
such a case a Borel subgroup is maximal among all is then called the Borel polygon of the function f (z),
(and not only algebraic and connected) solvable sub- while the domain II is called its interior domain. The
groups of G. Nevertheless, maximal solvable subgroups following theorem is valid: The series
in G which are not Borel subgroups usually exist. The
commutor subgroup of a Borel subgroup B coincides
with its unipotent part B u, while the normalizer of Bu
in G coincides with B. If the characteristic of the is summable by the B' -method in II, but not in the
ground field is 0, and g is the Lie algebra of G, then domain II which is the complement of II [2].
the subalgebra b of g which is the Lie algebra of the References
Borel subgroup B of G is often referred to as a Borel [I] BOREL, E.: 'Memoire sur les series divergentes', Ann. Sci. Ecole
subalgebra in g. The Borel subalgebras in g are its Norm. Sup. (3) 16 (1899),9-131.
[2] HARDy, G.H.: Divergent series, Clarendon, 1949.
maximal solvable subalgebras. If G is defined over an
A.A. Zakharov
arbitrary field k, the parabolic subgroups which are
defined over k and are minimal for this property, play Editorial comments.

439
BOREL SUMMA nON METHOD

References sider the class of entire functions f (z) of exponential


[A1] BEEKMANN, W. and ZELLER, K.: Theorie der Limitierungsver- type ";;;0' for which
(ahren, Springer, 1970.
00

AMS 1980 Subject Classification: 40C05 j 1f (x) 12 dx < 00.


-00

BOREL SYSTEM OF SETS, B-system, generated by This class is identical with the class of functions f (z)
a system of sets M - The smallest (0', ~)-system of sets, that can be represented as
B (M), containing M. The sets belonging to B (M) are
called the Borel sets (cf. Borel set), or B-sets, generated fez) = .k-
v2'IT
!eiz1q,(t)dt,
-0

by the system M. For each ordinal number a<wI,


where cp(t)EL 2( -0',0').
where WI is the initial ordinal number of cardinality ~I'
the Borel classes Ma are defined as follows: Mo=M; References
[II BoREL, E.: Lefons sur les series divergentes, Gauthier-Villars,
M a consists of the unions if a is odd, and it consists of 1928.
the intersections if a is even, of sequences of sets [21 DZHRBASHYAN, M.M.: Integral transforms and representation of
belonging to M fJ, f3<a. In such a case functions in a complex domain, Moscow, 1966 (in Russian).
B(M)= U {Ma: a<wd. The same Borel system of A.F. Leont'ev
sets B (M) is obtained if the union and intersection Editorial comments. The statement at the end of the
above are interchanged. A Borel set belongs properly to article above is called the Paley - Wiener theorem.
a class M a if it belongs to the class M a but does not References
belong to M fJ if f3<a. (Sometimes one takes non- [A1] BOAS, R.P.: Entire functions, Acad. Press, 1954.
intersecting classes, i.e. then the system AMS 1980 Subject Classification: 44A 10, 44A 15
Ma \ U {MfJ: f3<a} is called a class.)
A.G. El'kin BORSUK PROBLEM - One of the fundamental
AMS 1980 Subject Classification: 54A99 problems in combinatorial geometry: Is it possible, for
any bounded set of diameter d>O in an n-dimensional
BOREL TRANSFORM - An integral transform of the Euclidean space, to make a decomposition into not
more than n + 1 subsets with diameters smaller than a?
type 00

yet) = jf(z)e- ZI dz,


The problem was formulated by K. Borsuk [1] who
o noted that it was not possible to subdivide an n-
where f (z) is an entire function of exponential type. dimensional simplex and an n-dimensional sphere in Rn
The Borel transform is a special case of the Laplace into n parts of a smaller diameter. The problem has a
transfonn. The function y(t) is called the Borel positive solution for n = 2, 3, but only partial results
transform of f (z). If have been obtained for n > 3. Thus, for instance, the
problem has been positively solved for any bounded
fez) = n=O
~ an n
~-,z ,
n.
smooth convex set in Rn [2]. It has been proved that
the solution of Borsuk's problem can be reduced to the
then 00 case of sets of constant width. If a (F) is the smallest
yet) = ~ avt -(v + I);
v=o number of parts of a diameter smaller than d into
which a set FeRn can be subdivided, then the equality
the series converges for I t I >0', where 0' is the type of
a (F) = 3 is valid for a figure Fe R2 of diameter d if
f (z). Let D be the smallest closed convex set contain- and only if R2 contains a unique figure of constant
ing all the singularities of the function y(t); let
width d containing F [3]. This fact cannot be directly
K(q,) = mRe(ze- i </
generalized to the case n > 2. The Borsuk problem is
ZED
closely related to the illumination problem and to the
be the supporting function of D; and let h(cp) be the Hadwiger hypothesis, which is a generalization of the
growth indicator function of fez); then K(cp)=h( -cp). Borsuk problem in which Rn is replaced by a finite-
If in a Borel transform the integration takes place over dimensional normed space.
a ray argz =cp, the corresponding integral will converge
in the half-plane xcoscp+ysincp>K(-cp). Let C be a References
[IJ BORSUK, K.: 'Drei Siitze tiber die n -dimensionale euklidische
closed contour surrounding 15; then Sphare', Fund. Math. 20 (1933). 177190.
[2) GRDNBAUM, B.: Borsuk's problem and related questions, Proc.
fez) 1 . jy(t)e ZI dt.
= -2 Symp. Pure Math .. 7. Amer. Math. Soc .. 1963. pp. 271-284.
7Tle
[3) BOLTYANSKlI, V.G.: 'On decomposition of plane figures in
If additional conditions are imposed, other representa- parts of least diameter', Col/oq. Math. 21, no. 2 (1967), 253263
(in Russian).
tions may be deduced from this formula. Thus, con- P.S. Soltan

440
BOSE - EINSTEIN STATISTICS

Editorial comments. and defines the probability of the particle distribution


References over the cells, characterized by the occupation numbers
[A1] BOLTYANSKlT, V.G. and GOKHBERG, I.Ts.: satze und Prob- NJ,N 2 ,
leme der Kombinatorische Geometrie, Deutsch. Verlag The most-probable distribution corresponding to a
Wissenschaft., 1972 (translated from the Russian).
given energy E and number of particles N:
AMS 1980 Subject Classification: 51 N20, 05899
E = ~(iNi' N = ~Ni' (2)

BOSE - EINSTEIN STATISTICS, Bose statistics -


is found from the extremum of (1) under the comple-
Quantum statistics used in systems of identical particles
mentary conditions (2). The corresponding average
with integral spin (0, 1, ... , in the unit
occupation numbers are
n=1.05XIO- 27 ergXsec). Presented by S. Bose and A.
Einstein in 1924. According to these statistics an arbi- -
ni
= KG = _1_ - I
e{J(,-I') ,
(3)
i
trary number of particles may be found in each quan-
tum state. It was shown by W. Pauli that the type of where p. is the chemical potential, 13= 1 / kT, k is the
quantum statistics is directly related to the particle Boltzmann constant (a universal constant
spin, since the population of particles with an integral k = 1.38 X 10- 16 erg / degree), and T is the absolute
spin obeys the Bose - Einstein statistics, while those temperature. The values of 13 and p. can be found from
with half-integral spin obey the Fenni - Dirac statistics. the conditions (2).
The state of a system of many particles is defined in The entropy of the system is defined as the logarithm
quantum mechanics by a wave function which, in the of the statistical weight (1) for the most-probable distri-
case of identical particles, may be symmetric with bution (3):
respect to the rearrangement of any pair of particles
(for particles with integral spin) or anti-symmetric (for
S = kIn W{n;} = k~Gi{niin [1+ ~i) +In(l+ni>}- (4)

particles with half-integral spin). For a system of parti-


Other thermodynamic functions may be derived from
cles obeying the Bose - Einstein statistics the states are
the entropy and the average energy.
described by symmetric wave functions. This is a
The usual approach to Bose - Einstein statistics con-
second, equivalent, formulation of the Bose- Einstein
sists in employing the large canonical Gibbs distribu-
statistics. Systems of a large number of particles obey-
tion for the probability Wn of the quantum level n of
ing Bose - Einstein statistics are known as Bose systems
the entire system being filled
(e.g. a Bose gas).
For an ideal quantum gas, i.e. for a system of identi-
cal particles with mass m which do not interact, in a
Wn = Q-I exp { - En;:}, (5)
cube of volume V = L 3 the quantum energy levels for
single particles are given by
where
Q = ~exp
{}
- En;: (6)
-L
2m'
(p -
is the sum over states, or partition function, and En are
where p are the eigen values of the momentum of the the energy levels of the entire system. Thus, for
individual particle: p = 2wnn / L, where n is a vector instance, for an ideal Bose gas
with integer (positive, negative or zero) components.
The quantum state of an ideal gas is defined by
En = ~..lC..np,
p2m
2
n = {np}, np=O, 1, ... ,

specifying the population of occupation numbers of the and from equations (5) and (6) formula (3) for the
levels {np }, where each np is the number of particles in occupation numbers, and formula (4) for the entropy
the one-particle state p. For Bose systems np =0, 1, .... can be found, without the use of combinatorial
For large systems the energy levels are very densely analysis. Such an approach is particularly important in
located and tend to form a continuous spectrum as the case of non-ideal Bose systems, when the use of
V ~ 00. Let the levels be grouped by small cells, con- Bose - Einstein statistics cannot be reduced to a simple
taining G; levels per cell. To each cell corresponds the combinatorial problem. In such a case the requirements
average energy ;, and the number G; is assumed to be of Bose- Einstein statistics can be met if for the Ham-
very large. The state of the system is defined by the ilton operator H the representation of second quantiza-
selection {N;}, where N; is the sum of np taken over the tion is used, in which its action is defined on the space
levels of the cell. The statistical weight, that is, the of symmetric wave functions or on the space of occupa-
number of different particle distributions over the cells, tion numbers. The sum over states will then be

H;r},
is
(I) Q = spexp {

441
BOSE-EINSTEIN STATISTICS

where N is the operator for the number of particles; A type of occurrence of a variable in a linguistic
this sum may be used to find all the thermodynamic expression. An exact definition for each formalized
functions of the Bose system. language depends on the rules of formation of the
References language. It is not possible to substitute objects in
[I] HUANG, K.: Statistical mechanics, New York, 1963. place of a bound variable, since such a substitution
[2] KUBO, R.: Statistical mechanics, North-Holland, 1967.
would lead to meaningless expressions. But the
[3] LANDAU, L.D. and LIFSHITZ, E.M.: Statistical physics, Per-
gamon, 1980 (translated from the Russian). replacement of a bound variable, where it occurs, by a
[4] SCHRODINGER, E.: Statistical thermodynamiCS, Cambridge new (for a given expression) variable leads to an
Univ. Press, 1948. expression with the same meaning. For example, in the
[5] BOGOLYUBOY, N.N.: Lectures on quantum statistics, Gordon
and Breach, 1967 (translated from the Russian). expressiOns
D.N. Zubarev jf(x,y)dx, {x: f(x,y)=O},
AMS 1980 Subject Classification: 81-XX, 82A 15 x is a bound variable. Replacing x by a number leads
to a meaningless expression, whereas by writing z
BOTI PERIODICITY THEOREM - A fundamental
everywhere instead of x one obtains an expression with
theorem in K-theory which, in its simplest form, states exactly the same meaning.
that for any (compact) space X there exists an isomor-
Bound variables always arise in applying to an
phism between the rings K(X)K(S2) and K(XXS2). expression e with free occurrences of a variable x an
More generally, if L is a complex vector bundle over X
operator variable x (see Free variable). In the resulting
and P(LEB 1) is the projectivization of LEB 1, then the
expression, all the occurrences of x in e that were previ-
ring K(P(LtB 1 is a K(X)-algebra with one generator
ously free become bound. We mention below certain
[H] and a unique relation ([H]-[I])([L][H]-[1])=O,
operators that are often used (next to the operators
where [E] is the image of a vector bundle E in K(X)
and H -( is the Hopf fibration over P (L EB 1). This fact
f ... dx and {x: . . . }), in which x is an operator
variable:
is equivalent to the existence of a Thorn isomorphism
V'x( ... ), 3x( ... ), that is, the universal and
in K-theory for complex vector bundles. In particular,
existential quantifiers;
P(ltBI)=XXS2. Bott's periodicity theorem was first
demonstrated by R. Bott [1] using Morse theory, and f. ..... ... dx, that is, a definite integral with respect
was then re-formulated in terms of K-theory [6]; an to x;
analogous theorem has also been demonstrated for real ~ " ' , that is, summation over x;
x
fibre bundles. Ax( ... ), that is, a function of x the value of which
Bott's periodicity theorem establishes the property of at x is .... Specific linguistic expressions can be sub-
the stable homotopy type of the unitary group U, con- stituted in place of the dots.
sisting in the fact that Q2 U,,-, U, where QX is the space In real (non-formalized) mathematical texts it is pos-
of loops on X, and "-' is weak homotopy equivalence, sible to have a non-unique use for one and the same
in particular 7Tj( U) = 7Tj + 2 (U) for i = 0, I, ... , where 7Ti expression; in this connection distinguishing a bound
is the i-th homotopy group. Similarly, for the orthogo- variable in a given expression depends on the context
nal group 0: and meaning of the expression. In formalized languages
there is a formal procedure for distinguishing free and
bound occurrences of variables.
References v.N. Grishin
[I] BOTT, R.: 'The stable homotopy of the classical groups', Ann.
of Math. (2) 70, no. 2 (1959), 313-337. AMS 1980 Subject Classification: 03-XX, 03005
[2] MILNOR,1.: Morse theory, Princeton Univ. Press, 1963.
[3] ATIYAH, M.F.: Ktheory: lectures, Benjamin, 1967. BOUND VECfOR - See Vector.
[4] HUSEMOLLER, D.: Fibre bundles, McGraw-Hili, 1966.
[5] MOORE, J.e.: 'On the periodicity theorem for complex vector AMS 1980 Subject Classification: 15A03
bundles', in Sem. H. Cartan, 1959-1960.
[6] ATIYAH, M.F. and BOIT, R.: 'On the periodicity theorem for
BOUNDARY - I) The boundary of a subspace A of a
complex vector bundles', Acta Math. 112 (1964),229-247.
A.F. Shchekut'ev given topological space X is the set of points such that
Editorial comments. every neighbourhood of any point of it contains both
References points from A and points from X\A. The accepted
[A1) BOTT, R.: Lectures on K(X), Benjamin, 1969. notations include aA, Fr A, Frx A.
[A2) KAROUBI, M.: K-theory, Springer, 1978. 2) A synonym for the border of a manifold, such as
AMS 1980 Subject Classification: 55N15 the border of a simplex.
A. V. Chernavskif
BOUND VAlUABLE, bound occurrence of a variable - AMS 1980 Subject Classification: 54AXX, 57-XX

442
BOUNDARY (IN THE THEORY OF UNIFORM ALGEBRAS)

BOUNDARY CONDmONS - Conditions imposed There are several known compactifications of a


on an (unknown) solution of a differential equation on simply-connected domain with the property of boun-
the boundary (or on a part of the boundary) of the dary correspondence under conformal mapping. His-
domain in which the solution is to be determined. torical precedence goes to the Caratheodory extension
Boundary conditions are usually formulated in terms of (see [1], and also [2]). It is the most intuitive and is
differential operators, though other types are also often used in the study of conformal and other map-
encountered. pings. The elements of the boundary thus obtained
A.P. Soldatov
were called prime ends by C. Caratheodory (see Limit
AMS 1980 Subject Classification: 34-01, 35-01 elements). A theory has been developed of boundary
correspondence under variable conformal mappings of
BOUNDARY COKRESPONDENCE, PRINCIPLE OF simply-connected domains (see [3]).
- A principle formulated in the following way. One says
References
that the principle of boundary correspondence holds [1] MYSHKlS, A.D. and SUVOROV, G.D.: 'Conformally-invariant
for a mapping f if ~e facts that f is a continuous mae: bicompact extensions of a plane simply-connected domain',
Soviet Math. Dokl. 14 (1973-1974), 1488-1491. (Dokl. Akad.
ping of the closure G of a domain G onto the clos~re D
Nauk SSSR 212 (1973), 822-824)
of a domain D and f is a homeomorphism of G \ G [2] CARATHEODORY, c.: 'Uber die Begrenzung einfach
onto 15 \ D imply that f is a topological mapping of G zusammenhangender Gebiete', Math. Ann. 73 (1913), 323-370.
[3] SUVOROV, G.D.: 'Prime ends of a sequence of plane regions
onto D. Thus, the principle of boundary correspon- converging to a nucleus', Amer. Math. Soc. Transl. Ser. 2 1
dence is in some sense converse to the boundary- (1955),67-93. (Mat. Sb. 33 (75) (1953), 73-100)
correspondence principle (cf. Boundary correspondence [4] MARKUSHEVICH, A.I.: Theory offunctions of a complex variable,
(under confonnal mapping)). 1-3, Chelsea, 1965-1977 (translated from the Russian).
[5] COLLINGWOOD, E.F. and LOHWATER, A.J.: The theory of clus-
If G and D are plane domains with Euclidean boun- ter sets, Cambridge Univ. Press, 1966.
daries homeomorphic to a circle and D is bounded, [6] SUVOROV, G.D.: Families ofplane topological mappings, Novo-
then the principle of boundary correspondence holds sibirsk, 1965 (in Russian).
for analytic functions f on G, i.e. f is a conformal map- [7] SUVOROV, G.D.: Metric theory ofprime ends and boundary pro-
perties of plane mappings with bounded Dirichlet integrals, Kiev,
ping of G onto D. In addition to that given above, vari- 1981 (in Russian).
ous other forms of the principle of boundary [8] IVANOV, O.V. and SUVOROV, G.D.: Complete lattices of
conformally-invariant compactifications of a domain, Kiev, 1982
correspondence are commonly used for conformal map-
(in Russian).
pings (see [1]). The principle of boundary correspon- B.P. Kufarev
dence has been verified for orientable mappings in Editorial comments. Standard English references on
Euclidean space (see [2]). boundary correspondence under conformal mapping and
prime ends are [A1] - [A3].
References
[I] LAVRENT'EV, M.A. and SHABAT, B.V.: Methoden der komplexen References
Funktionentheorie, Deutsch. Verlag Wissenschaft., 1967 [A1] AHLFORS, L.V.: Conformal invariants, McGraw-Hili, 1973.
(translated from the Russian). [A2] OHTSUXA, M.: Dirichlet problem, extremal length and prime
[2] KUDRYAVTSEV, L.D.: 'On differentiable mappings', Dokl. ends, van Nostrand Reinhold, 1970.
Akad. Nauk SSSR 95, no. 5 (1954), 921-923 (in Russian). [A3] POMMERENXE, c.: Univalent functions, Vandenhoeck and
[3] PINCHUX, S.I.: 'Holomorphic equivalence of certain classes of Ruprecht, 1975.
domains in C", Math. USSR-Sb. 111 (153), no. I (180), 67-94;
159. (Mat. Sb. 111 (153), no. 1 (1980), 67-94; 159) AMS 1980 Subject Classification: 30C35, 30040
B.P. Kufarev
BOUNDARY (IN THE THEORY OF UNIFORM
AMS 1980 Subject Classification: 30040, 54C05 ALGEBRAS) - A subset f of the space MA of maximal
ideals of a commutative Banach algebra A with an iden-
BOUNDARY CORRESPONDENCE (UNDER CON- tity over the field C of complex numbers, on which the
FORMAL MAPPING) - A property of a univalent con- moduli of the Gel'fand transforms a
of all elements
formal mapping f of a finitely-connected domain G a EA attain their maximum (cf. Gel'fand representa-
onto a domain D in the z-plane consisting of the fact tion). For example, one can set f=MA (the trivial
that f can be extended t? a hO!!leomorphism between boundary). Of interest are non-trivial boundaries with
certain compactifications G and D of G and D, respect- some sort of minimality property. There exists among
ively; that is, f induces a homeomorphism of the boun-
- -
daries G \ G and D \ D. For the ordinary (Euclidean)
the closed boundaries fCMA a minimal one aMA , that
is, a closed boundary such that aMA c f for every
boundaries aG and aD of G and D this property does closed boundary f; it is called the Shilov boundary. The
not always hold. For example, a conformal mapping of points of the Shilov boundary are characterized by the
a disc K induces a homeomorphism of aK and aD only property that for each neighbourhood V CMA of such
if aD is homeomorphic to a circle. a point ~ and every >0, there exists an element a EA

443
BOUNDARY (IN THE THEORY OF UNIFORM ALGEBRAS)

a a
for which max I I = I and I I < outside V. The form algebra constitutes a one-point Gleason part;
points ~EoMA constitute 'the most stable' part of the however, the converse is in general false.
set MA of maximal ideals: If B is a commutative The equality oMA =X =MA for uniform algebras is a
Banach algebra containing A as a sub algebra, then the simple necessary condition for the coincidence of A
maximal ideals (multiplicative functionals) correspond- with C(X). Without extra hypotheses this is not a suffi-
ing to such points can be extended to maximal ideals cient condition for the above coincidence even in the
(multiplicative functionals) of B, whereas for maximal case of algebras A =R(X) of uniform limits of rational
ideals not belonging to the boundary oMA such an functions on a compactum X in the plane.
extension is, in general, not possible. This situation is Let X be a metrizable compactum and let A be a
analogous to the stability of the boundary of the spec- uniform algebra on X. Then there exists a minimal
trum of a bounded linear operator on a Banach space. boundary among all the boundaries: ooMA. The closure
A typical example is the algebra A consisting of the of the minimal boundary is the same as oMA However,
functions continuous on the disc I A I .;;;; I that are ana- OOMA is, in general, not closed; an example is the
lytic for I A I < 1. In this case MA can be identified subalgebra of all analytic functions inside the disc
with the closed disc and oMA with its topological boun- I A I :s;;; I for which f(O) =f(1). The boundary OOMA is
dary; maximal ideals corresponding to interior points the same as the set of peak points with respect to A,
of the disc cannot be extended to maximal ideals of the that is, points ~EMA for which there exists an a EA
algebra of all functions continuous on the boundary in with I am I > I a(1/) I for all1/*~. On the other hand,
which A is naturally included (according to the max- a condition for a point ~ to belong to doMA that is for-
imum principle), while the maximal ideals correspond- mally considerably weaker is known. Namely, if the
ing to the boundary points can be extended. point ~EMA is such that for some O<c< I and d~ I
As in the case of algebras of analytic functions there there exists for each neighbourhood V of ~ an element
is a local maximum-modulus principle for general com- a in A for which a(~)= 1, max I a I =d and I a(1/) I.;;;;c
mutative Banach algebras: If V is an open subset of the for 1/ft V, then ~EooMA' The abstract Poisson formula
space M A , then has been strengthened in the following way: There
max{ I a(~) I : ~E V} = max{ I a(0 I : ~E(aMA n V) U aV} exists for any point ~EMA a representing measure con-
centrated on OOMA (doMA being, moreover, a Gil-set).
for all a EA, where V is the closure and dV is the topo- In this form it has been successfully applied in certain
logical boundary of V in M A Roughly speaking, this problems of approximation theory. The points
means that a local maximum point is necessarily a glo- ~EooMA are characterized by the property that for
bal maximum point (possibly of another element). these points, the measure is unique and is the same as
The notion of a boundary is used in the study of uni- the Dirac ~-measure, that is, the minimal boundary is a
form algebras, that is, closed sub algebras A of the alge- special case of the Choquet boundary.
bra C(X) of all continuous functions on a compactum For the algebra A =R(X) of uniform limits of
X that separate points and contain the constants. In rational functions on a compactum X in the plane, A
this situation OMA CXCMA. There exists for each and C(X) coincide if and only if X=MA and OOMA
point ~EMA a representing measure, concentrated on coincide. This is not true for arbitrary uniform alge-
the Shilov boundary, that is, a probability measure ~ bras: There exists a uniform algebra A distinct from
such that C(MA) for which MA is metrizable and for which each
point ~EMA is a peak point (that is, OOMA =MA)'
There also exists a uniform algebra for which all the
for all a EA (this holds for arbitrary commutative Gleason parts are trivial (singletons), but for which
Banach algebras). In the simplest case (described even the Shilov boundary is a proper part of the space
above) of the disc and the algebra of analytic functions, of maximal ideals.
the latter formula reduces to the Poisson formula. The One of the algebraic generalizations of the notion of
representing measure J1. is in general not unique. For the Shilov boundary is the following. Let olMA =MA
points belonging to the same Gleason part (see Algebra and let I be the closed ideal generated by a set of ele-
of functions), the measures J1. can be chosen absolutely ments al,'" ,an~1 EA, n;;;.2. The space MAil can be
continuous with respect to one another, which, under naturally identified with the set of common zeros of the
certain extra conditions of uniqueness type for the a
functions I, . . . ,an ~ I . The closure of the union of
representing measures, makes it possible to equip the the boundaries OMA I lover all ideals I generated by
Gleason parts of the space of maximal ideals with a n - 1 elements is denoted by dnMA . For example, for
one-dimensional analytic structure compatible with the the algebra of continuous functions in the polydisc
algebra. Every point of the Shilov boundary of a uni- I Al I:s;;; 1, I A2 I :s; ; 1 that are analytic in its interior,

444
BOUNDARY-LA YER THEORY

a]MA is the skeleton I A] I = I A2 1=1, a2M A is the Editorial comments. Although the concept of a boun-
topological boundary I A] I = 1, I A2 I ~ 1; I A] I ~ 1, dary layer has its origin in fluid dynamics, the recent
I A2 I = 1, and a3MA = MA These generalizations are mathematical development of this concept, as well as its
useful when proving theorems on multi-dimensional applications, are in the field of singular perturbations. For a
analytic structures in spaces of maximal ideals. partial review one may consult [A1].
Notions of boundaries (or functional boundaries) References
[A 1] EcKHAus, W.: 'Boundary layers in linear elliptic singular per-
similar to the ones described above are encountered in turbation problems', SIAM Review 14 (1972), 225-270.
the theory of analytic functions (the Bergman boun-
dary), probability theory (the Martin boundary) and in AMS 1980 Subject Classification: 76D10
a number of other branches of mathematics. In this
BOUNDARY-LAYER. THEORY - An asymptotic
connection the initial set is not necessarily assumed to
approximation of the solution of boundary value prob-
be an algebra or a ring.
lems for differential equations containing a small
References parameter in front of the highest derivative (singular
[1) BASENER, R.: 'A generalized Shiloy boundary and analytic
structure', Proc. A mer. Math. Soc. 47, no. I (1975), 98-104. problems) in subregions where there is a substantial
[2) BROWDER, A.: Introduction to function algebras, Benjamin, effect from the terms containing the highest derivatives
1969. on the solution. The boundary-layer phenomenon
[3) GAMELIN, T.W.: Uniform algebras, Prentice-Hall, 1969.
[4) GELFAND, I.M. [I.M. GEL'FAND), RAIKOV, D.A. [D.A. RAiKOV)
arises in narrow zones near the parts of the boundary
and SCHlLOW, G.E. [G.E. SHILOV): Kommutative Normierte on which there is a difference in the numbers of boun-
Ringe, Deutsch. Verlag Wissenschaft., 1964 (translated from dary conditions for the initial problem and the degen-
the Russian).
[5) GONCHAR, A.A.: 'The minimal boundary of the algebra A (E)"
erate one (with the small parameter taking the value
Izv. Akad. Nauk SSSR Ser. Mat. 27, no. 4 (1963),949-955 (in zero), as well as near the surfaces of discontinuity of
Russian). the solution of the degenerate problem.
[6) PHELPS, R.R.: Lectures on Choquet's theorem, v. Nostrand, The solution of the singular problem may be
1966. E.A, Gorin represented as the sum of two expansions. The outer

Editorial comments. Much of this article, in particular expansion is determined by the method of the small
the definition of boundary, is relevant and valid for general parameter with the part Bo of the boundary conditions
commutative Banach algebras. B = ~ 0 +B ]. The inner expansion decreases rapidly
AMS 1980 Subject Classification: 46J20 outSIde the boundary layer and is usually sought for as
polynomials in powers of (. To determine these, the dif-
ferential equations are transformed by variables that
BOUNDARY LAYER - A region of large values of
depend on ( and that stretch the subregions of the
the gradient of a function, in particular, in hydro-
boundary layer. The equations of the boundary layer
dynamics it is a region of the flow of a viscous liquid
(gas) the transversal thickness of which is small in com- are derived by equating to zero the coefficients at vari-
parsion with its longitudinal dimensions and which is ous powers of ( after substituting the polynomials into
the transformed equations. To these one adds condi-
produced at the surface of a solid or at the boundary
tions B]. The error in the outer expansion, after it has
between two liquid flows with different velocities, tem-
peratures or chemical compositions. A boundary layer been found, indicates the necessary change of variables.
is characterized by a sharp change in the velocity in the In solving complicated applied problems, the necessary
transversal direction (a shear layer) or a sharp change c~ange ~f variables can be clarified on the basis of phy-
SICal esttmates for the terms in the initial equations and
in the temperature (a thermal, or temperature, boundary
the simplifications corresponding to them. This change
layer), or else in the concentrations of the individual
should eliminate the highest derivatives in front of f.
chemical components (a diffusion, or concentration,
To solve the problem it is necessary to determine where
boundary layer). The concept of a boundary layer and
the boundary layers lie and how the conditions B are to
the term itself were introduced by L. Prandtl (1904) in
connection with the solution of a boundary value prob- be separated into Bo and B]. A notable feature of this
lem for non-linear partial differential equations in the solution is that hyperbolic equations for the outer
hydrodynamics of viscous liquids. The needs of aviation expansions and parabolic equations for the inner
have led to the development of a boundary-layer theory e~pansions may correspond to initially elliptic equa-
in aerohydrodynamics. In the mid-twentieth century, nons.
the mathematical boundary-layer theory developed, as In the theory of systems of ordinary differential
well as applications in the theory of heat transfer, diffu- equations
sion, processes in semi-conductors, etc. XI = f (x, y, t), YI = g(x, y, t), O';;;;;t';;;;; T, (1)
Yu.D. Shmyglevskif where x, f are m-dimensional and y, g are n-

445
BOUNDARY-LAYER THEORY

dimensional vector functions, an existence and unique- with this boundary condition on a certain part of S
ness theorem for the solution of the Cauchy problem (absence of a boundary layer) [6].
under the conditions x (0, ()=x o and yeO, ()=yO and For an elliptic second-order equation in a region D
under certain properties of j and g has been proved. with boundary S, using the example of two indepen-
One has also derived properties of the solution as (~O dent variables
(see [1]). In the case of a boundary value problem for t:(a (x, y)uxx +2b (x, y)uxy +c(x, y)Uyy +
(1) under the conditions
+d(x,y)ux +e(x,y)Uy +g(x, y)u]+ux -
x(O,) = xo, YI(O,) = yy, Y2(T, ) = y~,
-h(x,y)u = I(x,y), h~a2>0,
where the sum of the numbers of components of the
iterative processes have been constructed solving the
vectors Y I and Y2 is n, there exist, in general, boundary
problem with the condition u =0 on S, theorems have
layers in neighbourhoods of the ends of the segment
been proved on the structure of the expansion of u with
[0, T]. An algorithm has been constructed for finding
respect to t: and estimates have been made of the resi-
asymptotics of the solution for this problem. Under
dual term in this expansion [4]. Similar results have
certain properties of j and g it has been shown that a
been obtained for equations of higher orders.
solution exists and is unique, and it has been estimated
(see [3]). If a solution to the limit equation g=O is not A method has been devised [7] for combining asymp-
totic expansions for the equation
unique with respect to y, one can construct an internal
boundary layer (in a neighbourhood of 7',0<7'<1) that ilu -a(x, y)Uy = I(x,y)
separates regions with different solutions of the limit in a rectangle with u given on the boundary.
equation. An algorithm has been constructed for a par- Research on boundary-layer theory for non-linear
ticular type of integro-differential equation giving an partial differential equations is related mainly to
asymptotic expansion in ( for the problem with initial aerohydrodynamics and is based on the N avier - Stokes
conditions, and some features of the behaviour of the equations or generalizations of them. Practical require-
solution have been examined. ments have led to the development of the mathematical
In the case of linear ordinary differential equations theory and to methods of handling various problems.
(L+(M)x=j(t), O~t~l, where Land M are differen- Below only laminar flows are considered (see [8] - [10]).
tial operators, with boundary conditions B + B I, one The Navier-Stokes equations in the case of the
can distinguish the class of problems the solutions of hydrodynamics of planar (k =0) and axi-symmetric
which contain boundary layers, and the concept of reg- (k = 1) flows of an incompressible fluid with constant
ular degeneracy (the solution to the limit equation viscosity coefficient pare:
enables one to satisfy the conditions B 0, while the
asymptotic solution for the boundary layer enables one (Ir"U)~+('l/kV~ = 0, UT+~U~+V~~ = ~ilU-P~'} (2)
VT+UV~+VV~ - ~ilv P~,
to satisfy B 1) can been introduced (see [4]). An iterative
process has been constructed for the asymptotic where t:=Re- 1j2 , Re=wX /p is the Reynolds number,
representation of the solution, and estimates have been which is represented in terms of the characteristic
given for the residual terms in the expansions. values of the velocity w and the linear dimension X.
It has been shown (see [5]) in the theory of boundary The solutions are defined by the boundary conditions
layers of a general non-linear second-order ordinary on the closed boundary S of a region D, where on the
differential equation, subject to certain assumptions, solid contour r one has the conditions U=O,
that the solution of the first boundary value problem is v=vo(~, H(~ for T/=H(~), where u and v are the
made up of an external solution, a boundary layer and tangential and normal components to r of the vector
a residual term that, together with its first-order deriva- (u, v). The initial values of u, v and p are given on
tive, is of order t: on the segment. DUS.
Studies have been made on the behaviour of solu- For t: smail, in a first approximation the asymptotic
tions of boundary value problems of basic types for a solution is composed of the solution of (2) for =0
linear partial differential equation of the form with some of the conditions on S (only the condition
ilu + A (x, y)ux + H(x, Y)Uy + C(x, y)u = f(x, y), v=vo is imposed on f) and the solution of the
where D. is the Laplace operator, in a region D with boundary-layer equations. The equations for the
boundary S. Conditions on the functions A, B, C, and dynamic boundary layer are derived on the assumption
f, the boundary S, and the functions a and cp of the that the conditional thickness 8 of the boundary layer
points P on S appearing in the boundary condition and the value of v have orders 8"" X t:, V ""Wt:, and that
Un +a(P)=</i,P) have been given such that u in D US the terms on the left-hand sides of the latter equations
tends uniformly to the solution of the limit equation in (2) are of the order of the terms containing ~. Intro-

446
BOUNDARY-LAYER THEORY

duction of the variables t =-r, x =g,y =1/ / f. and dary layer behind a shock wave, and problems with a
V=v / f. leads, as f.~O, to the Prandtl equations (cf. variable temperature over the surface of the solid
Prandtl equation): around which the flow takes place have been solved.
(rku)x+(rkV)y = 0, Ut+uux+V~ = Uyy-Px. (3) In the boundary-layer theory for three-dimensional
flows, methods for obtaining a solution have been
Py = 0, O";;;;t,,;;;;T, O";;;;x,,;;;;Xo, O";;;;y<oo,
developed and cases in which the equations simplify
with the conditions have been studied. The boundary-layer equations for a
u\t=o = uo(x,y), v\t=o = vo(x,y), u\y=o = 0, sliding cylindrical wing of infinite span are analogous
to the equations for a two-dimensional boundary layer.
V\y=o = vo(t, x), Approximate solutions have been obtained for the
u _ W(t, x), px = - WWx - w" asy_oo, u \x=o = 0, problem of a boundary layer on a rotating cylindrical
where r is the distance from the symmetry axis for propeller blade and on a rotating cylinder in a skew
k = 1 and W (x) is a known function. These equations flow, as well as for the boundary-layer problem near
and conditions apply for any curvilinear contour with the line of intersection between two planes.
radius of curvature much larger than ~. In the latter These researches on boundary layers in aerohydro-
case, x and y are the coordinates along the contour and dynamics relate to a first approximation in boundary-
along the normal to it. layer theory. Higher approximations enable one to
If W is constant, the problem reduces to a boundary examine the interactions of boundary layers with the
value problem for an ordinary differential equation. external flow, and to make calculations for moderate
There are also other classes of analogous solutions. values of R [13].
Conditions under which solutions of boundary-layer The stability of boundary layers enables one to deter-
problems exist are known; one has investigated the mine the limits of applicability of the theory. There are
problem of uniqueness and stability of the solutions as studies [14] based on methods of small perturbations
well as how they result from the solutions for station- with periodic and local initial perturbations. In the case
ary cases [11]. Solutions have been constructed by the of two-dimensional flows, the analysis of three-
method of straight lines, and it has been shown that dimensional perturbations reduces in linear approxima-
they converge. tion, on the basis of Squire's theorem, to a two-
Boundary-layer equations for a compressible liquid dimensional analysis with an altered value of P. Non-
can be derived from the equations for the flow of a linear analysis applied to stability loss shows that longi-
viscous and heat-conducting gas; they are much more tudinal vortices occur.
complicated than (3). Their number is also larger. Physical generalizations of problems in the theory of
There is an integral transformation that simplifies these boundary layers (see [15] - [17]) are related to research
equations in the general case and reduces them to (3) on multi-phase flows, to the use of real equations of
when the Prandtl number Pr=cp / K= 1, where cp is state and transfer coefficients (a complication of the
the heat capacity of the gas at constant pressure and K equations), to the study of non-equilibrium flows with
is the coefficient of heat conductivity [12]. Several diffusion (an extension of the system of equations,
modifications of the transformation exist. In the general which then becomes of parabolic-hyperbolic type), to
case, the boundary-layer equations describe so-called taking into account ablation of the surface around
natural convective flows. If p is independent of the tem- which the flow takes place (a complication of the boun-
perature and if the Archimedean force is negligible, dary conditions and it becomes necessary to consider
then the energy equation splits off from the system of the thermal conduction in the solid), and also to con-
boundary layer equations and one speaks of forced sidering radiation transport (integro-differential equa-
convective flow. The energy equation determines the tions).
thermal boundary layer, whose thickness differs from ~. A further development of boundary-layer theory in
Boundary layers also arise in zones separating flows aerohydrodynamics has been obtained in the study of
with different characteristic velocities. Shock waves are flows that do not satisfy Prandtl's assumptions, where
also boundary layers. solutions are obtained as multi-layer asymptotic expan-
A distinct class of two-dimensional boundary-layer sions. The causes of the complexity in the structure of
problems is associated with flows in rotating axi- the solution are that there are additional small parame-
symmetric plates and bodies. ters in the boundary conditions (for example, because
Not only have methods been developed for solving the radius of curvature of the contour around which the
non-stationary problems, but also problems in which W flow takes place is small), there may be singular points,
is periodic, when there is stepwise motion from a state lines or surfaces in the first approximation of the
of rest, problems with accelerated motion, for the boun- theory, and bifurcation of the solution is possible.

447
BOUNDARY-LAYER THEORY

1bis class also contains flows around points of parameter', Uspekhi Mat. Hauk 12, no. 5 (1957), 3-122 (in Rus-
separation or connections of the boundary layers to the sian).
[4B] VISHIK, M.1. and LYUSTEllNIK, L.A.: 'Solutions to some prob-
contour around which the flow takes place and around lems on perturbations in the case of matrices and non-self-
the points of incidence of shock waves on the boundary adjoint differential equations', Uspekhi Mat. Hauk 15, no. 3
layers. The characteristic solutions have ([18] - [20]) a (1960),3-80 (in Russian).
[5] CoLE, J.: Perturbation methods in applied mathematics, Blais-
three-layer structure. The external solution defines the dell, 1968.
potential flow perturbed by the boundary layer and is [6] OLEiNIK, O.A.: 'on equations of elliptic type with a small
described by equations in perturbations. The middle parameter in front of the highest derivatives', Mat. Sh. 31, no.
1 (1952), 104-117 (in Russian).
layer of order X{ can be described by the equations for [7] !L'IN, A.M. and LEuKovA, E.F.: 'The method of matched
non-viscous vortex flows with Py =0, and it receives gas asymptotic expansions for the equation
from the main part of the preceding boundary layer. /;u-a(x,y)u,=f(x,y) in a rectangle', Mat. Sh. 96, no. 4
The equations for the third (thinnest) layer nearest to (1975), 568-583 (in Russian).
[8] KocHIN, N.E., KmEL', LA. and ROZE, N.V.: Theoretical
the wall can be derived on the assumption that its hydromechanics, 2, Moscow, 1%3 (in Russian).
length and thickness are, correspondingly, of order [9] LoiTSYANSKIi, L.G.: Laminar boundary layers, Moscow, 1962
X~/4 and X~/4. The following variables are intro- (in Russian).
(10) ScHLICHTING, H. VON: Grenzschicht-Theorie, Karlsruhe, 1951.
duced in the stationary case: [II] OLEINiK, O.A.: 'Mathematical problems of boundary-layer
x= ~-3/4, Y = 'I}(- 5 / 4 , theory', Uspekhi Mat. Haule 23, no. 3 (1968), 3-65 (in Russian).
[12] DoRODNITSYN, A.A.: 'Boundary layers in an incompressible
U = u(-I/4, V = v(-3/4, P = p(-1/2. gas', Prikl. Mat. i Mekh. 6, no. 6 (1942), 449-486 (in Russian).
[13] DYKE, M. VAN: Perturbation methods inftuid mechanics, Para-
1bis leads, as {~O, again to (3) in which u is replaced bolic Press, 1975.
by U and p by P. 1bis structure for the solution of a [14] BETcHov, R. and CluMiNALE, V.O.: Stability ofparallelftows,
Acad. Press, 1967.
problem is characteristic for a wide class of flows with
[15] Soo, S.: Fluid dynamics ofmultiphase systems, Blaisdell, 1971.
small perturbations. [16] DoRRANCE, W.H.: Viscous hypersonicftow, McGraw-Hill, 1962.
Many studies have been made on the dynamics of [17] KAyS, W.M.: Convective heat and mass transfer, McGraw-Hill,
flows for small {, where the pressure in the outer super- 1971.
[18] NEiLAND, V.YA.: 'On the theory of separation of laminar
sonic flow varies considerably over short distances boundary layers in supersonic flow', Izv. Akad. Haule SSSR
(M> 1, M = w / c, where w is the velocity of the gas Mekh. Zhidk. i Gaza, no. 4 (1969), 53-57 (in Russian).
and c is the velocity of sound). This includes problems [19] NEiLAND, V.YA.: 'Asymptotic problems in the theory of
viscous flows', Trudy TsAGI, no. 1529 (1974) (in Russian).
on calculating flows around contours of large local cur- [20] STEWARTSON, K.: 'Multistructured boundary layers on flat
vature and flow attachment to the surface of the body. plates and related bodies', Adv. App. Mech. 14 (1974), 145-239.
In these cases Py=l=O in the middle layer of the three- Yu.D. Shmyglevskii
layer scheme for the solution. Editorial comments. The article above presents a one-
The class of problems in which the perturbed solu- sided view of the subject and is in many aspects not up-to-
tion occupies a finite region has also been studied. date. Furthermore, very substantial contributions in the
These solutions occur when there is moderate or strong Western literature are neglected. Mathematically boundary
interaction of the boundary layer with an external layers arise in problems of singular perturbations and are
hypersonic flow (M~oo) or in supersonic flow around treated by methods of asymptotic analysis. In linear and
a body of finite length in which there are intense gas non-linear, ordinary and partial, differential equations much
more has been accomplished than described above in the
injections through the surface (vo >0). In these cases,
first part. Overviews that are partially complementary can be
the pressure on the outer boundary of the boundary
found in [A1] - [A3].
layer is determined from the solution to the complete
From the point of view of applications boundary-layer
problem. The perturbation from the trailing edge of the theory is described above as a branch of fluid- and gas-
body propagates upstream, which is caused by the dynamics, where it originated. However, boundary layers
non-uniqueness of the solution in a neighbourhood of and transition layers also occur in many other fields, such
the leading edge. as for example combustion, geophysics, free boundary
problems, or epidemology. Samples and further literature
References
can be found in [M], [A5).
(1) TnrnONOV, A.N.: 'Systems of differential equations containing
small parameters in front of the derivatives', Mat. Sb. 31, no. 3 Systems of (ordinary) differential equations of the type (1)
(1952), 575-586 (in Russian). above are often referred to as singular perturbations (cf.
(2) W ASOV, W.: Asymptotic expansions for ordinary differential also the editorial comments to Perturbation theory).
equations, Interscience, 1965. For an idea of the role of singular perturbations, boundary
[3] VASIL'EVA, A.B. and Buruzov, V.F.: AsymptotiC expansions of
layers and multiple time scales in control and system theory,
solutions to singularly perturbed equations, Moscow, 1973 (in
Russian). cf. [A6).
[4A] VISHIK, M.I. and LYUSTERNIK, L.A.: 'Regular degeneracy and References
boundary layers for linear differential equations with a small [A 1] LIONS, J.L.: Perturbation singu/ieres dans les problemes aux

448
BOUNDARY PROPERTIES OF ANALYTIC FUNCTIONS

limits et en contrOle optimal, Lecture Notes in Math., 323, advances in the first third of the twentieth century,
Springer, 1973.
[A2] EcmAus, W.: Asymptotic analysis of singular perturbations,
owing to the work of several scientists; it resumed its
North-Holland, 1979. rapid advance in the second half of that century,
[A3] CHANG, K. W. and HOWES, F.A.: Nonlinear singular perturba- accompanied by the appearance of new ideas and
tion phenomena: theory and application, Springer, 1984. methods, novel directions and objects of study. Its
[A4] BRAUNER, C.M., GAY, B. and MATHIEu, J. (EDS.): Singular
perturbations and boundary layer theory, Lecture Notes in development is closely connected with various fields of
Math., 594, Springer, 1977. mathematical analysis and mathematics in general, first
[A5] VERHULST, F. (ED.): Asymptotic analysis II, Lecture Notes in and foremost with probability theory, the theory of har-
Math., 985, Springer, 1983.
[A6] KOIWTOVIC, P. (ED.): Singular perturbations and time scales
monic functions, the theory of conformal mapping,
in modelling and control of dynamical systems, Univ. of Illi- boundary value problems of analytic function theory,
nois at Urbana-Champaign, 1980. potential theory, value-distribution theory, Riemann
surfaces, subhartnonic functions and function algebras.
AMS 1980 Subject Classification: 34E15, 35-01,
The theory of boundary properties of analytic functions
76010
is closely connected with various fields of application of
BOUNDARY (OF A MANIFOLD) - The subset of mathematics by way of boundary value problems.
the closure M n of an (open) n-dimensional real mani- Since the study of boundary properties is connected,
fold M n for which a neighbourhood of each point is in the first place, with the geometry of the boundary f
homeomorphic to some domain wn in the closed half- of the domain of definition D of an analytic function
space of Rn , the domain being open in R'!t- (but not in f (z) in one complex variable z, three main approaches
can be distinguished in the theory of boundary proper-
Rn). A point a EM n corresponding to a boundary point
ties of analytic functions.
of wn CR'!t-, i.e. to an intersection point of wn with
a) The study of the behaviour of f (z) in a neigh-
the boundary of R'!t-, is called a boundary point of Mn.
bourhood of an isolated boundary point aEf. The
A manifold having boundary points is known as a man-
most important case is that of an essential singular
ifold with boundary. A compact manifold without boun-
point a, which is dealt with in the theorems of
dary is known as a closed manifold. The set of all
SokhotskiI, Picard, Julia, and Iversen (cf. Sokhotskii
boundary points of M n is an (n -I)-dimensional mani-
theorem; Picard theorem; Julia theorem; Iversen
fold without boundary.
M.I. Voitsekhovskii theorem).
Editorial comments. b) The study of the behaviour of f (z) in the case
References
when the boundary f is an everywhere-discontinuous
[A1] HIRSCH, M.W.: Differential topology, Springer, 1976. set. V.V. Golubev's dissertation Single-valued analytic
functions with a perfect set of singular points (1916, cf.
AMS 1980 Subject Classification: 57-XX [1]) was of great importance in this connection.
c) The study of the behaviour of f(z) when the
BOUNDARY PROPERTIES OF ANALYTIC FUNC- domain D is bounded by a continuous closed curve f
TIONS - Properties of analytic functions that are and, in particular, by the unit circle.
displayed as the function approaches the boundary of Cases a) and c) are, in a sense, extreme cases, while
its domain of definition. case b) is intermediate. Case c), which is discussed
I t can be said that the study of boundary properties below, has been the subject of most intense study.
of analytic functions, understood in the widest sense of Let an analytic function f (z) be defined in a finite
the word, began with the Sokhotskii theorem and the simply-connected domain D of the complex z-plane
Picard theorem about the behaviour of analytic func- bounded by a rectifiable Jordan curve f. The following
tions in a neighbourhood of isolated essential singular problems are fundamental in the classical approach to
points (cf. Essential singular point), which were the study of boundary properties of analytic functions.
obtained in the second half of the nineteenth century. 1) The problem of the existence of boundary values, i.e.
The terms relevant to this approach to the study of the question under which conditions and in which sense
boundary properties of analytic functions - which is the boundary values of f (z) exist as the point z
now called the theory of prime ends and cluster sets approaches f. This problem, as well as the succeeding
(cf. Limit elements) - first appeared in a course given ones, can be formulated in a different manner as the
by P. Painleve in 1895. The dissertation of P. Fatou problem of identifying sufficiently extensive classes of
(1906) is the first systematic study of certain boundary analytic functions in D that have, in some sense, boun-
properties of analytic functions in a neighbourhood of dary values for sufficiently large sets of points on f.
the continuous boundary of their domain of definition. 2) The problem of boundary representation of f (z), i.e.
The theory of boundary properties made considerable the problem under which conditions and with what

449
BOUNDARY PROPERTIES OF ANALYTIC FUNCTIONS

kind of analytic apparatus the dependence of f(z) on II I II = II I II B = ~~E II (z) I


00

its boundary values on r can be expressed. Oearly, this


apparatus will be different for different classes of ana- on Hoo; the classes HP, I:s;;;.p:s;;;. + 00, have
a natural
lytic functions. structure of a vector space, and become Banach Hardy
3) The uniqueness problem, or the problem of the pro- spaces. If 0<p<1, it is only possible to introduce the
perties that a set E C r should have such that two ana- metric ppCf, g)= II f - g II~ on HP, which converts the
lytic functions of a given class coincide in D if their latter into a complete metric non-normable space. The
boundary values on E are identical. class of bounded analytic functions B =H OO is con-
The first result in the solution of the existence prob- tained in any class HP, P >0.
lem is the theorem of Fatou (1906): If an analytic func- The class N(D) of certain meromorphic functions
tion is bounded in the unit disc D={z: I z I <I}, f (z) in the unit disc D is said to be the class of func-
I f(z) I :s;;;.M, then radial boundary, or limit, values tions of bounded characteristic; it was introduced in
f(ei8)=limHl_of(reiB) exist almost everywhere with 1924 by R. Nevanlinna. The class N(D) can be
respect to the Lebesgue measure on the unit circle characterized as the set of meromorphic functions f (z)
r ={z: I z I = I}. It can be shown that, under these in D that can be represented as the ratio of two
conditions, not only radial, but also angular boundary bounded regular functions fl(z) and h(z) in D,
values, or boundary values along all non-tangential f(z)=fl(z)/ h(z).
paths, exist almost everywhere on r. This means that, All regular functions f(Z)EN(D) form a subclass
for almost-all points e iB Er, f(z) tends to a definite N*(D), andf(z)EN*(D) if and only if the condition
limit f(e iB ) as z tends to the point e iB while remaining 1 2"
sup -2 fln+ I/(rei<P) I dq, = C(f) < + 00 ' (3)

-+
within an arbitrary fixed angle O<r<:1 'IT 0

~(eiB,() = {I z I<I} n {I arg(eiB-z) 1<; where In+ x=Inx if Inx;;;;o.O and In+ x=O if Inx<O is
fulfilled. The class N* (D) contains all classes HP,
>0, of width 7T-2<7T, with apex at the point e iB , O<p:s;;;.+oo.
bisected by the radius drawn through the point e iB . The classes HP have the following generalization. Let
Fatou's theorem cannot, in a certain sense, be #,t) be a strictly-convex function for - oo<t< + 00, i.e.
improved upon; it was in fact shown by N.N. Luzin a non-negative, convex, non-decreasing function such
(1919) that for any set E Cr of measure zero on f that #,t)/t~+oo as t~+oo. The class H",(D) is
there exists a bounded analytic function f (z) not hav- then defined by the condition
ing radial limits on E. I 2"
The class of bounded analytic functions in a domain sup 1/1-1 { -2 jl/;{ln Il(re'<P) I)dq,
O<r<:1 'IT 0
. }
= C(f,I/I) < (2')
D is denoted by B(D) or HOO(D). Following the results
of Fatou, the next problem appeared to be the generali- < +00,
zation of his theorems to wider classes of functions. compare with condition (2), where 1/J(t)=eP1 .
One distinguishes between the following basic classes of The principal result obtained for the problem of the
functions in the unit disc D, which are related by existence of boundary values in the case of the unit disc
proper inclusions: D states that each meromorphic function f(z) of
A(D) C B(D) = Hoo(D) C HP(D) C N*(D) C N(D).(I) bounded characteristic in D has angular boundary
values f(e iB ) almost everywhere on f; these boundary
The class A (D) is the class of single-valued analytic
values are such that the function In If(e iB ) I is Lebes-
functions in D that are continuous in the closed
gue integrable on f. The following additional property
domain D U r=D. is displayed by the classes HP, O<p < + 00, or H",: The
The classes HP(D), for all positive numbers p, are function I f(e,B) IF or, correspondingly, #,In I f(e ifJ ) IY'
defined by the condition is Lebesgue integrable on f. For bounded functions
I 2" }I /P = C(f,p) f (z), I f (z) I :s;;;.M, one has instead of the above,
II I lip = O<r<:1
sup { -2 jl/(rei<P) IPdq,
'IT 0
< (2)
ess sup I f(e ifJ ) I ";;;'M, 0:s;;;.(J:s;;;.27T. Thus, condition (3) is
the widest sufficient condition on the average increase
< +00.
of an analytic function f (z), as I z I~ 1, that ensures
The proper inclusions H oo CHP, CHP, are valid for the existence of angular boundary values almost every-
any O<p I <P2 < + 00. The classes HP were first intro- where on f.
duced by G.H. Hardy (1915), and are often named It has been shown that condition (3) cannot be
Hardy classes. If I ~ < 00, one can introduce the norm substantially weakened. Thus, it was shown by A.
(2) on HP, and the norm Zygmund that for an arbitrary increasing function 1/J(t),

450
BOUNDARY PIWPERTIES OF ANALYTIC FUNCTIONS

I/I{t) / t~O as 0<t1+ 00, there exists an analytic func- it possible to extend this theorem to meromorphic func-
tion ! (z) in D such that tions of bounded characteristic (cf. Function of bounded
2w form). N.N. Luzin (1919) constructed, for any set E cf
sup f#)n+
0<,<:1 0
IJ(rei<l I)d</l < +00, of measure zero, an analytic function ! (z) such that
!(ei(J)=O everywhere on E as z~ei(J in an arbitrary
but which has no boundary values anywhere on f. manner, but !(z) is not identically equal to zero. The
Even if the maximum deepest and most general boundary uniqueness
M(r;f)=max{ I!(z) I: I z I =r} grows as slowly as theorems for meromorphic functions of general form
one pleases, there still exist analytic functions without were obtained in 1925 by Luzin and 1.1. Privalov (d.
radial boundary values. Uniqueness properties of analytic functions;
The boundary representation of functions ! (z) of Luzin - Privalov theorems).
class N(D), characterizing the functions of this class, Consider the case of an arbitrary plane domain D;
has the form for the sake of brevity, only simply-connected domains
D with a rectifiable boundary f will be discussed. The
(4)
conditions (2), (3) and (2') are equivalent to stipulating
eiO +z that the subharmonic functions I!(z) IP, In+ I!(z) I
xexp-2
1
." 0
fIn IJ(e'O) 1-o-dOX
2w

e' -z
.
and I/I{ln I!(z) I), respectively, have a harmonic
majorant in D. In such a form these conditions are fully
1 e +z iO
X exp -2
." 0
f-o-
2w

e' -z
d~(J),
suitable, and furnish a natural definition of the classes
HP, N* and H.jI in arbitrary domains. It is known that
where m is an integer, m =k if the point z =0 is a zero a rectifiable curve f has a definite tangent and normal
of multiplicity k and m = -k if z =0 is a pole of multi- at almost all of its points. The inclusions (1) remain
plicity k; A is a real number; valid, as does Fatou's theorem on the almost-
I a I a -z everywhere existence on f of angular boundary values
B I(z'a)
, JL = II--JL---JL-
00
_ (5) for the class N*. Here, as the bisectrix of the angular
JL=I aJL l-aJLz
domains ~(~, f) the normal to f at the point ~Ef is
is the Blaschke product taken over all the zeros a,,::f:O taken. The Riesz uniqueness theorem for the class N*
of !(z) inside D taking into account their multiplicity; is also applicable.
B 2(z; b v ) is the Blaschke product of type (5) taken over V.I. Smimov also introduced the frequently
all poles bv::f:O of !(z) in D; and <1>(8) is a singular employed classes EP, P >0, for the case of an arbitrary
function of bounded variation on [0,2'lT] with a deriva- domain D. These classes have the following definition:
tive that vanishes almost everywhere. In (4), the last !(z)EEP(D) if there exist a sequence of contours
integral is of Lebesgue - Stieltjes type, while the first is {fj} CD, fj~f, such that
of Lebesgue type. ~~~ [I J(z) IP I dz I = elf,p) < +00.
It was shown by M.M. Dzhrbashyan [10] that the
theory of meromorphic functions of bounded charac- The classes EP are especially convenient in the study of
teristic can be considerably extended. It is possible, in problems of representation of functions in the form of
fact, to introduce a family of classes of meromorphic a Cauchy integral.
functions N a depending on a continuous parameter a, Of major interest is the study of the boundary pro-
-1 <a< + 00, and the classes N a are characterized by perties of analytic functions realizing a conformal map-
representations yielding (4) when a=O. As a increases, ping. Let a function z =P(w) realize a conformal map-
the classes N a become larger, and No becomes identi- ping of the unit disc I w I < I onto a domain D of the
cal with Nevanlinna's class N. z-plane with a rectifiable boundary f. It has been
For analytic functions !(z)EN*(D) in the represen- shown, for example, that in such a case the derivative
tation (4) one must put B 2(z; bv)-l. For the func- p' (w) belongs to the Hardy class H I in the disc
tions !(z)EHP, O<p:r;;;;, + 00, or EH.jI, in the represen- I w I < 1, so that it is representable in the form (4) with
tation (4) one has B 2 (z; b v )=I, and <1>(8) is a non- B 2 =1 and a non-increasing singular function <1>(8).
increasing function of the indicated type. See also Cau- Smirnov pointed out the importance of the class S of
chy integral. such domains D for which this singular function
The first results in the uniqueness problem were <1>(8)_0. In 1937 M.V. Keldysh and M.A. Lavrent'ev
obtained in 1916 by the brothers F. and M. Riesz: If a constructed an example of a domain with a rectifiable
function !(z)EHOO has radial boundary values boundary that is not included in the Smirnov classes
!(eifJ)=O on a set E Cf of positive Lebesgue measure just mentioned; this renders the characterization of
on f, then !(z)_O in D. The representation (4) makes domains of Smimov type even more important.

451
BOUNDARY PROPERTIES OF ANALYTIC FUNCTIONS

Numerous workers also attempt to study boundary integral on the right-hand side is called the outer factor,
properties of analytic functions f (z) in several variables while the other factors on the right-hand side form together
Z=(Zb" . ,zn). Let D=Un={ZEC": I Zj I <I, the inner factor, the latter has absolute boundary values 1
j=I, ... ,n} be the unit polydisc, and let Tn={zEC": almost-everywhere. If the outer factor in (4) equals 1, f is
I Zj 1=1, j= 1, ... ,n} be its skeleton. The class called an inner function. Recent general references are
[A2], [A3].
N*(D) of analytic functions f(z) in D is defined by the
For smooth domains D in en, n~2, there is a strong ver-
condition sion of Fatou's theorem for HP(D) and N'(D): Boundary
SUI>
O<""-:!r
(In+ I!(rz) I dmiz) = Clf) < +00, values exist almost-everywhere (or even better than almost-
everywhere) if the boundary is approached over admissible
which is analogous to (3), while the classes HP(D) or approach regions, see [AS], [A7]. In contrast with the one-
H",(D) by the condition of the type (1./I(t)=e pt for the dimensional case, zero sets of HP-functions can be essen-
case HP(D), p >0): tially different for different values of p, see [11], [AS]. A
characterization of zero sets of functions of Nevanlinna
SUI> 1/;-1
O<r<::1
{ftf.{ln
f. I J(rz) I)dmn(Z)} = Clf,1/;) < +00, class on smooth strongly pseudo-convex domains was
obtained by G.M. Khenkin and, independently, by H. Skoda,
where mn is the normalized Haar measure on Tn, see [AS]. A.B. Alexandrov and E. L0W proved indepen-
mn(Tn)=l. Inclusions of the type B=HooCH",CN* dently the existence of non-constant inner functions for the
are preserved. Analytic functions f(Z)EN*(D) have unit-ball in en, see [A1], [A4]. In an other direction, HP-
r
'radial' boundary values (z) = limr ..... \-0 f (rz), Z E Tn ,
spaces have been introduced for the upper half-space in
Rn , [A6].

r
almost everywhere on Tn with respect to the Haar
measure mn; and In + I (z) I is summable on Tn with References
[A 1] ALEKSANDROV, A.B.: 'The existence of inner functions in the
respect to mn Sufficiently simple and general charac- ball', Math. USSR Sb. 46 (1983), 143-159.
teristics for boundary representations and uniqueness [A2] GARNETT, J.: Bounded analytic functions, Academic Press,
properties of functions f (z) in un for n > 1 have not 1981.
[A3] KOOSIS, P.: Introduction to Hp spaces, Cambridge Univ.
yet (1986) been found. Press, 1980.
Many boundary properties may be applied to various [A4] L0w, E.: 'A construction of inner functions on the unit ball in
generalizations of analytic functions, in particular to cP', Invent. Math. 67 (1982), 223-229.
abstract analytic functions f: D~X, which have values [AS] RUDIN, W.: Function theory in the unit ball in cn, Springer,
1980.
in, say, a separable locally convex topological space X [A6] STEIN, E.M.: Singular integrals and differentiability properties
over the field C. of functions, Princeton Univ. Press, 1970, Chapt. 7.
[A7] STEIN, E.M.: Boundary behaviour of holomorphic functions of
References several complex variables, Princeton Univ. Press, 1972.
[I] GoLUBEV, V.V.: Single-valued analytic functions. Automorphic
functions, Moscow, 1961 (in Russian). AMS 1980 Subject Classification: 30D40, 30055
[2] GoWZIN, G.M.: Geometric theory offunctions of a complex
variable, Amer. Math. Soc., 1969 (translated from the Russian).
[3] PIuwALOW, 1.1. [1.1. PIuvALov]: Randeigenschaften analytischer BoUNDARY VALUE PROBLEM, COMPLEX-
Funktionen, Deutsch. Verlag Wissenschaft., 1956 (translated VARIABLE METHODS - Methods for studying boun-
from the Russian).
[4] KHAVINSON, S.YA.: 'Analytic functions of bounded type', Itogi
dary value problems for partial differential equations in
Nauk. Mat. Anal. 1963 (1965), 5-80 (in Russian). which one uses representations of solutions in terms of
[5] NEVANLINNA, R.: Analytic functions, Springer, 1970 (translated analytic functions of a complex variable.
from the German). Given a second-order elliptic equation
[6] NOSHIRO, K.: Cluster sets, Springer, 1960.
[7] COLLINGWOOD, E.F. and LOHWATER, A.J.: The theory of clus-
~u +a(x,y)
au +b(x,y) ay
ax au +c(x,y)u = 0, (1)
ter sets, Cambridge Univ. Press, 1966.
[8] MAcLANE, G.R.: Asymptotic values of holomorphic functions,
Rice Univ. Studies, Math. Monographs, 49, Rice Univ., Hous- where a, b, c are analytic functions of the real variables
ton, 1963. x, yin some domain Do of the z-plane, Z =x + ry, con-
[9] LOHWATER, A.: 'The boundary behaviour of analytic func- sider the following boundary value problem: Find a
tions', Itogi Nauk. Mat. Anal. 10 (1973),99-259 (in Russian).
[10] DZHRBASHYAN, M.M.: Integral transforms and representation of
solution of equation (1), regular in a simply-connected
functions in a complex domain, Moscow, 1966 (in Russian). domain S CD o, satisfying the boundary condition

=
[II] RUDIN, W.: Function theory in polydiscs, Benjamin, 1969.
[ aJ+k u ] ]
[l2] KHENKIN, G.M. and CHIRKA, E.M.: 'Boundary properties of
holomorphic functions of several complex variables', J. Soviet
R(u) ~
O<;j+k<;m
[ a j k (aJ+ku
X
t) k + 1jk
ja- a
~
-X a
~
ja k = f(t)O)
Math. 5, no. 5 (1971), 612-687. (Itogi Nauk. i Tekhn. Sovrem.
Prohl. Mat. 4 (1975), 13-142) where ajk(t), f(t)ECa(aS), O<a< 1, and the 1jk are
E.D. Solomentsev linear operators mapping Ca(aS) into Ca(aS), with
Editorial comments. SokhotskiT's theorem is better Tm-k,k completely continuous.
known as the Casorati- Weierstrass theorem. In (4) the first This problem includes as special cases the well-

452
BOUNDARY VALUE PROBLEM, COMPLEX-VARIABLE METHODS

known classical boundary value problems of Dirichlet, where p is the increment of the function (1/ 2'IT) arga(t)
Neumann, Poincare, etc. when the contour as is described once in the positive
Using the formula for the general representation of sense. The index of the Dirichlet problem is zero.
solutions (see Differential equation, partial, complex- The homogeneous boundary value problem R(u)=O
variable methods), has a finite number k;;;:.O of linearly independent solu-
tions, where k ;;;:.,,; the inhomogeneous problem (2) has
u(x, y) = Re {G(Z' zo, z, Z)41(z)- a solution if and only if
/!(t)pit)dS =0, j=I, ... ,k,
-
Z a _ }
tP(t)-atG(t, zo, z, Z)dt , where Pi is a complete system of linearly independent
solutions of the associated homogeneous integral equa-
one reduces the problem to an equivalent boundary tion
value problem for analytic functions: k'(p) = A(to)P(to)-~ r B (t)1I(t)dt +T(p) = o.
'lT1 is t - to
m
Re ~ [ak(t)41(k)(t) + Tk(41(k))] = jet), (3) The boundary value problem (2) has a solution,
k=O
whatever the free term on the right, if and only if there
where ak(t) are given Holder-continuous functions, exist exactly " linearly independent solutions of the
tEaS, T m is a completely-continuous operator, and the associated homogeneous problem R(u)=O. Conse-
Tk (k = 0, ... ,m - 1) are linear operators. quently, if 1(>0 the homogeneous boundary value prob-
Suppose that the finite simply-connected domain Sis lem R(u)=O always has at least I( linearly independent
bounded by a closed Lyapunov contour (see Lyapunov solutions; if ,,<0 the inhomogeneous problem (2) is not
surfaces and curves) as and that the m-th derivative, solvable for an arbitrary free term on the right, but at
m ;;;:'0, of the function cI>(z) (the latter is holomorphic in least I" I solvability conditions must be satisfied.
S), restricted to as, is a function of class C a , 0<0:< 1. Necessary and sufficient conditions for the solvability
Then, assuming that the point z =0 is in S, one can of an inhomogeneous boundary value problem may be
express cI>(z) as follows: formulated in terms of the completeness of a certain
system of functions. The kernel and system of func-
4l(z) = !st1~:t + ie, ifm=O; tions may be constructed explicitly using the Riemann

[I-fr- [I-f]
function of equation (1) and the coefficients of the
<I>(z) = !s~t) 1ln dl + boundary conditions. For example, let {ud be some
complete system of solutions in the basic domain Do of
+ !s~t)dt+ie, if m;;;.I, equation (1), and let S CD o. Then a necessary and suf-
ficient condition for problem (2) to be solvable for any
where p.(t) is a real function of class C a(aS), 0<0:< 1, free right-hand side is that the system of functions
and c is a real constant; p.(t) and c are uniquely deter- {R (Uk)} be complete on the boundary.
mined by cI>(z). Very complete results have been obtained for the fol-
Substituting these expressions into the boundary con- lowing boundary value problem (the generalized
dition (3), one obtains a singular integral equation, Riemann- Hilbert problem): Find a solution of the
equivalent to problem (2), for the unknown function p.: equation

K(p.) = A(toMto)+ B(t.o) r.l!!l.J.!lL + T(p.) = j(to), ilzw+A(z)w+B(z)w = j(z), w=u+iv, (5)
'lT1 is I - to 2ilz = ax-iay,
toE as, where T is a completely-continuous operator. which is continuous in S + as and satisfies the boun-
A necessary and sufficient condition for the boun- dary condition
dary value problem (2) to be normally solvable is that
m
Re[X(z), w(z)]- au+fiv = ,(, ZEaS, (6)
aCt) = ~ikam_k(t) =1= 0, lEaS, m;;;'O. (4) where 0:, {3, y are real functions of class Ca(aS),
0<0:< 1, with 0:2 + {32 = 1. The domain S may be multi-
k=O

The Dirichlet problem (m =0) is always normally solv-


ply connected. A problem of this type may be reduced
able. (Henceforth it is assumed throughout that condi-
to an equivalent singular integral equation; this yields a
tion (4) is satisfied.)
full qualitative analysis of the boundary value problem
The index of the boundary value problem (2) is com- (6).
puted from the formula Suppose that the boundary as of S is the union of a
I( = 2(m+p), m;;;.l, finite number of simple closed curves as o, ... ,asm

453
BOUNDARY VALUE PROBLEM, COMPLEX-VARIABLE METHODS

satisfying the Lyapunov conditions. Since the forms of complex-variable methods.


I.N. Vekua
the equation and the boundary condition are preserved
under conformal mapping, it may be assumed without Editorial comments.
loss of generality that as o is the unit circle with centre References
[A1] VEKUA,I.N.: Generalized analytiC functions, Pergamon, 1962
at Z =0, the latter being a point of S, while
(translated from the Russian).
as J, . ,aSm are circles lying outside as o. [A2] VElmA, N.P.: Systems of singular integral equations and
The index of problem (6) is defmed as the integer n some boundary value problems, Moscow, 1970 (in Russian).
equal to the increment of (l /2w)arg[a(O+i,8(O] when [A3] VEKUA, I.N.: New methods for solving elliptic equations,
r
the point describes as once in the positive sense. The
North-Holland, 1968.

boundary condition can be reduced to the simpler form AMS 1980 Subject Classification: 35J25, 35G15,
Re[z-neic(Z)w(z)] = y, zeas,
30E25,45E05,35C15

where c(z)=Cj on asj , with co=O, while CJ, . . 'Cm BOUNDARY VALUE PROBLEM, ELLIPrIC EQUA-
are real constants, uniquely expressible in terms of a TIONS - The problem of finding a solution u, regular
and ,8. The index of the adjoint problem

l
in a domain D, to an elliptic equation
ilzw.-Aw.-Bw. = 0, zeS, n a2u n au
~ a;k--+ ~b;-+cu = j, (I)

Re{<a+I/l):!w.<,J} ~ 0, ,eas, J
;,k=O aX;aXk ;=0 ax;
(7) which satisfies certain additional conditions on the
boundary r of D. Here ajb bj, C and f are given func-
is calculated by the formula n' = - n + m - 1. tions on D.
Problem (6) has a solution if and only if
The classical boundary value problems are special
l(a+i{I)w.yds = 0, cases of the following problem: Find a solution to
equation (1), regular in a domain D and satisfying on r
where w. is an arbitrary solution of the adjoint prob-
lem.
du
a-+OO
dl
=g (2)

Let e and e' be the numbers of linearly independent where d / dl denotes differentiation in some direction,
solutions of the homogeneous problems (6) and (7), and a, b and g are given continuous functions on r
respectively. Then with I a I + I b I >0 everywhere on r (see [1]).
e-e' = n-n' = 2n+l-m. Putting a =0, b = 1, one obtains the Diricblet prob-
If n <0, the homogeneous problem (6) has no non- lem; with b =0, a = lone has a problem with oblique
trivial solutions. If n >m -1, it has exactly derivative (see Differential equation, partial, oblique
=
e 2n + 1 - m linearly independent solutions, while the derivatives), which becomes a Neumann problem if I is
inhomogeneous problem (6) is always solvable. If n <0, the direction of the conormal. If r = 1\ U f 2, where r I
the inhomogeneous problem (6) is solvable if and only and r 2 are disjoint open subsets of r, and fin f 2 is
if either empty or an (n - 2)-dimensional manifold, with
a = 1, b =0 on rio a =0, b = 1 on r 2 , one obtains a
l(a+i/J)w./fds = 0, j=I,2, ... ; e'=m-2n-l,
mixed problem.
Problem (2) has been studied for elliptic equations in
where w.j is a complete system of solutions of the
two independent variables (see [2]). Fairly complete
homogeneous problem (7). If m =0 and n =0, then
investigations have been made of the Dirichlet problem
e = 1 and all solutions of the homogeneous problem
for elliptic equations in any finite number of indepen-
problem (6) are given by
dent variables (see [1], [3], [4]) and the problem with
w(z) = ice""'(Z) , oblique derivative in case the direction f is not con-
where c is a real constant and Wo a continuous function tained in a tangent plane to r at any point of r. In
on s+as. that case the problem with oblique derivative is a
The above results completely characterize the prob- Fredholm problem and the solution is smooth to the
lem in the simply-connected (m =0) and multiply- same order as the field of directions f and the function
connected (n <0, n > m - 1) cases. The cases g (see [l]). The case in which flies in a tangent plane
O";;;n";;;m -1 require special examination; they have to r at certain points of r has been studied (see [3]).
also been worked out in considerable detail. The local properties of solutions to the problem with
Boundary value problems of the type of the Poincare oblique derivative have been investigated (see [5]). At
problem have also been studied for equation (5). points where the field flies in a tangent plane to r, the
For references see Differential equation, partial, solution of the problem is less smooth than I and g.

454
BOUNDARY VALUE PROBLEM. ELLIPTIC EQUATIONS

This has been used as a basis for investigating the tion u, regular in D and satisfying on the boundary f
problem in a generalized setting (see [7], [8)). of D the conditions
Consider the following boundary problem for har-
B)u = ct>j, )=1, ... ,m, (3)
monic functions regular in the unit ball ~ CR3 :
where BJ<x, D), j= 1, ... ,m, are differential operators
aux + bUy +cuz = g; satisfying the following complementarity condition.
let K be the set of points of the unit sphere S at which Let L' (x, a/ ax I, . . . ,a / aXn+l) be the principal
the function w = ax + by + cz vanishes. The vector field part of L, let B; be the principal part of Bj , n the nor-
Pea, b, c) lies in a tangent plane to S at the points of mal to f at a point x and >..*0 an arbitrary vector
K. Suppose in addition that K is the union of a finite parallel to f. Let Tt (>..) denote the roots of
number of disjoint curves; let K+ be the subset of K L' (x, >.. +m) with positive imaginary parts. The polyno-
consisting of those points at which grad w makes an mials B;(x, >"+m), j=l, ... ,m, as functions of T,
acute angie with the projection of the field P on S, and must be linearly independent modulo the polynomial
let K- be the remaining part of K. A generalized for- II;= I(T-Tt (>..)). In this case, too, the problem is nor-
mulation of the problem is obtained when the values of mally solvable. Violation of the complementarity condi-
u are also prescribed on K+, whereas on K- the solu- tion may entail an essential change in the nature of the
tion u is allowed to have integrable singularities. If problem (see [17]).
K- is empty, the solution to the generalized problem Problem (2) is a special case of problem (3). For
may be made arbitrarily smooth by increasing the problem (2) with a 1, the complementarity condition
smoothness of the additional data of the problem. Gen- is equivalent to the condition that there be no point on
erally speaking, a solution to the mixed problem on the the boundary of the domain at which the direction I
set fo=1\ ni\ has singularities (see [1D. In order to lies in a tangent plane to the boundary.
eliminate such singularities on fo, one must impose Another particular case of problem (3) is the boun-
additional conditions on the data (see [11)). dary value problem
A large category of boundary value problems is con-
a;u = ct>j, )=0, ... ,m-I,
stituted by what are known as problems with free an)
boundaries. In these problems one must find not only a which is an analogue, to some extent, of the Dirichlet
solution of equation (1), but also the domain in which problem for higher-order elliptic equations.
it is regular. The boundary f of the domain is The boundary value problem has been studied for
unknown, but two boundary conditions must be satis- the poly-harmonic equation t::.ku =0 when the boundary
fied on it. An example of this type of problem is the of the domain consists of manifolds of different dimen-
problem of wave motions of an ideal fluid: Find a har- sions (see [15)).
monic function u, regular in some domain D, where In investigations of boundary value problems for
part of the boundary, fl say, is known and the normal non-linear equations (e.g. the Dirichlet and Neumann
derivative au / an is given on f l ; the other part of the problems), much importance attaches to a priori esti-
boundary, f2' is unknown and on it one gives two mates, various fixed-point principles (see [17], [18)) and
boundary conditions: the generalization of Morse theory to the infinite-
au -_ 0,
a; 2 2
ux+uy+u z
2_
- q(x,y, z),
dimensional case (see [19)).
References
where q >0 is a given function. [1) MIRANDA, c.: Partial differential equations of elliptic type,
Springer, 1970 (translated from the Italian).
For harmonic functions of two independent vari-
[2) VEKUA, LN.: Generalized analytic functiOns, Pergamon, 1962
ables, one uses conformal mapping (see [12], [13], [14)). (translated from the Russian).
See also Differential equation, partial, free boundaries. (3) BITSADZE, A.V.: Boundary value problems for secondorder ellip-
The following problem has been investigated: Find a tic equations, North-Holland, 1968 (translated from the Rus-
sian).
harmonic function u, regular in a domain D and satis- (4) KELDYSH, M.V.: 'On the solvability and stability of the Diri-
fying the condition chlet problem', Uspekhi Mat. Nauk, no. 8 (1941), 171-231 (in
Russian).
1 gradu 12 = q, (5) HORMANDER, L.: 'Pseudo-differential operators and non-
where q>O is a given function, on the boundary f. elliptic boundary value problems', Ann. of Math 83, no. 1
(1966), 129-209.
There is a complete solution of this problem for har- (6) BORRELLI, R.L.: 'The singular, second order oblique derivative
monic functions of two independent variables (see [14)). problem', J. Math and Mech. 16, no. 1 (1966), 51-81.
Given an equation Lu = f, where L is an ope~tor of (7) GoROV, YU.V. and KONDRAT'EV, V.A.: 'The oblique deriva-
tive problem', Mat. Sb. 78, no. 1 (1969), 148-176 (in Russian).
order 2m, uniformly elliptic in the closure D of a (8) MAz'YA, V.G.: 'The degenerate problem with oblique deriva-
domain D, consider the problem of determining a solu- tive', Mat. Sb. 87, no. 3 (1972), 417-453 (in Russian).

455
BOUNDARY VALUE PROBLEM, ELLIPTIC EQUATIONS

(9) Y ANUSHAUSKAS, A.: Dokl. Akad Nauk SSSR 164, no. 4 assumed that the problem in question is well-posed. In
(1965), 753-755.
order to justify difference methods one must investigate
(10) VISHIK, M.I. and EsKIN, G.I.: 'Sobolev-Slobodinsky spaces of
variable order with weighted norm, and their applications to the well-posedness of the difference problem and its
mixed boundary value problems', Sibirsk. Mat. Zh. 9, no. 5 convergence as the grid is refined. A difference problem
(1968),973-997 (in Russian). is said to be well-posed if it has a unique stable solu-
(11) GIRAUD, G.: Ann. Soc. Math. Polon. 12 (1934), 35-54.
(12) LAVRENT'EV, M.A.: The variational method in boundary value tion, whatever the terms on the right-hand sides of the
problems for systems of equations of elliptic type, Moscow, 1962 equations. By stability of a difference scheme one
(in Russian). means that its solution depends continuously on the
(13) NEKRASOV, A.I.: 'Exact theory of waves of stationary type on
the surface of a heavy fluid', in Collected works, Vol. I, Mos-
right-hand side, uniformly with respect to the grid spac-
cow, 1961 (in Russian). ing.
(14) GAKHOV, F.D.: Boundary value problems, Pergamon, 1966 For example, suppose one wishes to solve the Diri-
(translated from the Russian).
chlet problem for the Poisson equation in the square
(15) SoBOLEV, S.L.: Mat. Sb. 2, no. 3 (1937), 465-499.
(16) AGMON, S., DoUGLIS, A. and NIRENBERG, L.: 'Estimates near G={O<Xl,X2<1} with boundary f:
the boundary for solutions of elliptic partial differential equa-
tions satisfying general boundary conditions I, II.', Comm.
Pure Appl. Math. 17 (1964), 35-92.
(17) ScHAUDER, J.: Math. Z. 33 (1931), 602-640.
(18) LERAY, J. and ScHAUDER, J.: Ann. Sci. Ecole Norm. Sup. Ser. 3 U(X\>X2) = p.(X\>X2), (X\>X2)Ef.

51 (1934), 45-78. The domain G is replaced by a square grid Gh with


(19) PAUlS, R.S.: 'Morse theory on Hilbert manifolds', Topology 2,
spacing h, i.e. by the set of points
no. 4 (1963), 299-340. AI v h k
. . anus aus as
~
Gh = {(X\i) , x~: xV) = ih, x~) =jh ;
Editorial comments.
i,j=l, ... ,N-l}, hN=l,
References
[A1) COURANT, R. and HILBERT, D.: Methods of mathematical with boundary
physics. Partial differential equations, 2, Interscience, 1965 N-j

(translated from the German). fh = U {(O, x~ U (1, x~ U (X\i) , 0) U (xV), I)},


i,)=1
[A2] GARABEDIAN, P.R.: Partial differential equations, Wiley, 1964.
[A3] FRIEDMAN, A.: Partial differential equations, Holt, Rinehart and the derivatives figuring in the equation by differ-
and Winston, 1969.
ence quotients
AMS 1980 Subject Classification: 35JXX (Ui+ 1,) - 2Ui,} +Ui-j,)
(AjU)i.) = ux\x"i,) = h2
BOUNDARY VALUE PROBLEM, NUMERICAL
METHODS FOR PARTIAL DIFFERENTIAL EQUA- (Ui,j+ 1 -2Ui,) + U1,)_I)
(A 2 u)i,) = Ux ,x2,i,) = h2
TIONS - Approximate methods of solution which yield
the solution to the problem in the form of a numerical where Ui,j = U(X\i) , xY). The difference scheme is
table. An exact construction of a solution (in terms of
(A"y )i,) = YX\X\.I,) +Yx,x"i,) = - f,,}, (1)
explicit formulas, series, etc.) to a boundary value prob-
lem is only rarely possible. The most prevalent of the i,j=I, ... ,N-l; Yi,) 1
f,
= /-Lip
methods of approximate solution are difference
methods (see [1]); they are applied in the most general where Yi,j is the solution.
problems and are readily computerized. The essential Problem (1) has a - unique - solution for any
idea of difference methods is to replace the original inhomogeneous term f and any boundary conditions Jl
domain of variation of the independent variables by a (see [2]). Moreover, the solution of the difference
discrete set of points, a grid, and to approximate the scheme (1) converges as h ---'?O to the solution of the ori-
derivatives figuring in the equation and the boundary ginal problem in such a way that the scheme is a
conditions by difference quotients at the grid points. As second-order approximation in the maximum norm, i.e.
a result of this procedure the original problem is
replaced by a system of finitely many algebraic equa-
tions (linear or non-linear); this system is known as a where M is a constant independent of h.
difference scheme. The solution of the difference The difference scheme (1) is a system of linear alge-
scheme is taken to be an approximate solution of the braic equations, characteristically with a large number
original problem. The accuracy of the approximation of equations (to be precise: (N -1)2 equations, with
depends on the method of approximation and on the N ---'? 00 as h ---'?O) and a large number of zeros in the
fineness of the grid, i.e. on how densely the grid points matrix of the system; moreover, the system is generally
fill the original domain. In the rest of this article atten- ill-conditioned (the ratio of the minimum eigen va)ue to
tion will be restricted to linear boundary value prob- the maximum one is of the same order of magnitude as
lems for partial differential equations, and it will be h 2, h ---'?O). Such systems of equations, which arise when

456
BOUNDARY VALUE PROBLEM, NUMERICAL METHODS FOR PARTIAL DIFFERENTIAL EQUATIONS

differential equations are replaced by difference equa- mines the stability and accuracy of the scheme. If
tions, may be solved using various efficient methods, o~0.5, the scheme is stable for any grid spacings 7 and
both direct and iterative. Direct methods yield an exact h (an absolutely-stable difference scheme). But if
solution of the difference equation after a finite number (1<0.5, the scheme is stable only if some restriction is
of arithmetic operations. This category includes various imposed on Y=7/ h 2 (a conditionally-stable difference
versions of the double-sweep method, including matrix scheme); for example, the explicit scheme 1=0) is
double-sweep, the decomposition method, the fast stable if y~ 1/4. If (1=0.5, the scheme is second-order
Fourier transform, and the method of representations accurate with respect to 7 and h; for other values of (1 it
by sums (see [1], [2], [3], [6]). A measure of the effi- is first-order accurate in 7 and second-order accurate in
ciency of direct methods is the order of magnitude of h. The difference problem (3) is solved 'in layers'. The
the number of operations as h ~O. Thus, solving prob- n-th layer is the set of all points of the grid Gh X w,. for
lem (1) by matrix double-sweep requires O(h -4) opera- some fixed n. The values of Y?,j on the zero-th layer
tions, whereas the fast Fourier transform method needs (n = 0) are known from the initial conditions. If the
- for the same problem - O(h -210g h -\) operations. values on the n-th layer are known, then the values on
The most widely used iterative methods for the solution the next layer Y;,j = y7,f \ are determined from the sys-
of difference problems are Richardson's method with tem of equations
Chebyshev interpolation points, the alternating-
triangular iterative method, and various alternating-
direction methods (see [2]). The efficiency of iterative
Yi'J-o~lll!}')i.j = fl.j, i,j=I, ... ,N-I, 1 (4)
YI.jl - 0,
r,
methods is measured by the order of magnitude of the
minimum number of iterations, no(), necessary to where
decrease the error of the initial approximation by a fac- fl.j = Y2j+(1-o)T(1l1!}')2 r
tor 1/ . For example, in solving problem (1) by The solution of problem (4) may be found by any of
Richardson's method, no() is of the order the methods for solving the stationary problem (1).
h -\ log(l / ), while the alternating-direction method However, there are more economical algorithms for the
with optimally-selected iteration parameters has solution of multi-dimensional non-stationary boundary
no()= O(logh -\ 10g(1 / . Iterative methods are more value problems, namely alternating-direction methods
universal and simpler to implement than direct (see [1] - [5]), which enables one to reduce the solution
methods, and consequently have been widely utilized to of a multi-dimensional problem to that of a sequence of
solve difference problems. one-dimensional problems. Thus, the heat equation
For example, consider solving the first boundary may be solved using the following alternating-direction
value problem for the heat equation: scheme:
au au au 2 2 n+I/2 n
-a
(
= -aXI
2 +-,
ax~
(>0, (Xj, X 2)EG Yi.j
0.57
-Y,.j = A n+I/2+A n
IY,.j zY,.j'
n+1 _ n+I/2
U(Xj,X2,0) = UO(XI,X2); (2) Y i.j Y i.j = A n+ I /2 + A n+ I
0.57 IY,.j zY,.j .

This scheme is absolutely stable, second-order accurate


To solve this problem, one prescribes a time grid with
and is solved by successive inversion of one-
spacing 7>0:
dimensional difference operators.
WT = {tn=n7: n=O, I, ... } Even when the difference problem is solved exactly,
and the grid Gh (see above) for the space variables. Let the solution may well be different - not only quantita-
u7,j = u(x\i), x~), tn). The derivative u; is approximated tively but also qualitatively - from the solution of the
by the quotient U n + 1 -un.
original differential problem. The discrepancy is
I.j I.j
Ul.I.j = particularly prominent in dealing with equations the
7
coefficients - or solutions - of which involve singu-
and the Laplacian Il by the difference operator Ilh . larities (thus, in computing discontinuous solutions of
The original equation (2) is replaced by a correspond- the equations of gas dynamics one usually encounters
ing difference scheme zones in which the discontinuities are strongly 'spread
ouC). Thus, direct approximation of a differential prob-
Yt. i.j = o( Ill!}' )7,/ I + (I - 0)( Ill!}' )~j ; lem by a difference problem, when the derivatives are
y?,j = uo(x\i),xY, i,j=I, ... ,N-I; (3) replaced - with a considerable degree of arbitrariness
Y2jl
f",xw,.
= o. - by difference quotients, does not always yield a
good difference scheme. Various principles have been
The parameter (1 appearing ill these equations deter- developed for the construction of difference schemes of

457
BOUNDARY VALUE PROBLEM, NUMERICAL METHODS FOR PARTIAL DIFFERENTIAL EQUATIONS

good quality. Among the successful methods are the lying in a given subset D of the space D(J, Rn) of func-
balance method and the method of variational- tions depending on t that are absolutely continuous on
functional approximation (see [1] - [3]). The difference J and that assume values in Rn:
schemes obtained in these methods correctly reflect the x() E D. (2)
integral laws of conservation which are valid for the
It is assumed that f(t, x) is a function defined on
original equations, and ensure that the relevant differ-
J X Rn with values in Rn and satisfying the
ence operators have fixed signs. In the theory of homo-
Caratheodory conditions; J is an interval on the real
geneous difference schemes (see [7]) one considers ques-
line R.
tions of constructing difference schemes for equations
1) The boundary value problem (1), (2) is said to be
with variable (including discontinuous) coefficients and
investigating their convergence.
linear if
jet, x) = A (t)x +b(t),
References where the functions A(t) and b(t) are summable on
[I] TncHONOV, A.N. and SAMARSKIi, A.A.: The equations of
mathematical physics, Moscow, 1977 (in Russian).
every compact interval in J and the set D is a linear
[2] SAMARSKIi, A.A. and MARCHUK, G.I.: The theory of difference manifold in D(J, Rn). In particular, one might have
schemes, Moscow, 1983 (in Russian).
[3] MARCHUK, G.I.: Methods of computational mathematics, Mos- J = [to. td,
cow, 1977 (in Russian).
[4] YANENKO, N.N.: Fractional step methods for solution of multi-
dimensional problems in mathematical physics, Novosibirsk,
1967 (in Russian).
D ~ {xo E D(J, R'), [!d<l>(t)]X(t) ~ j, 0

[5] D'YAKONOV, E.G.: Difference methods for solving boundary


value problems, 1-2, Moscow, 1971-1972 (in Russian). where cI>(t) is a function of bounded variation. A linear
[6] POLOZHll, G.N.: Numerical solution of two- and three-
boundary value problem gives rise to a linear operator
dimensional boundary value problems of mathematical physics
and functiOns of a discrete argument, Kiev, 1962 (in Russian). Lx(t) = x' -A (t)x, x(-)ED,
[7] TncHONOV, A.N. and SAMARSKIi, A.A.: 'On homogeneous
difference schemes', Zh. Vychisl. Mat. i Mat. Fiz. 1, no. 1 the eigen values of which are precisely those values of
(1961), 5-63 (in Russian).
A.V. Gulin the parameter A for which the homogeneous boundary
value problem
Editorial comments. In Western literature, boundary
value problems are usually called initial-boundary value x' -A (t)x = Ax, x(')ED,
problems if the time variable is involved (e.g. problem (2) of has non-trivial solutions. These non-trivial solutions are
this article). The literature on the numerical solution of par- the eigen functions of the operator L. If the inverse
tial differential equations is overwhelming; only a few good operator L -\ exists and has an integral representation
text books are listed below. References [A5] and [A8] only
deal with initial-boundary value problems. x(t) = L -1 b(t) j G(t, s )b(s) tis, t EJ,
J
References
[A1] FORSYTHE, G.E. and WASOW, W.R.: Finite difference methods
then G(t, s) is called a Green function.
for partial differential equations, Wiley, 1960. 2) Let J=(-oo, 00), letf(t, x) be almost-periodic in
[A2] GARABEDIAN, P.R.: Partial differential equations, Wiley, 1964. t uniformly in x on every compact subset of Rn and let
[A3] GLADWELL, I. and WAIT, R. (EDS.): A survey of numerical
D be the set of almost-periodic functions in t that are
methods for partial differential equations, Clarendon Press,
1979. absolutely continuous on J. Then problem (1), (2) is
[A4] MITCHELL, A.R. and GRIFFITHS, W.F.: The finite difference known as the problem of almost-periodic solutions.
method in partial differential equations, Wiley, 1980. 3) In control theory one considers boundary value
[A5] RICHTMYER, R.D. and MORTON, K.W.: Difference methods
for initial value problems, Wiley, 1967. problems with a functional parameter: a control. For
[A6] SAMARSKIT, A.A.: Theorie der Differenzverfahren, Akad. Ver- example, consider the equation
lagsgesellsch. Geest u. Portig K.-G., Leipzig, 1984.
[A7] SMITH, G.D.: Numerical solution of partial differential equa- ~~ = j(t, x, u), tEJ=[to, td, xERn, (3)
tions, Oxford Univ. Press, 1977.
[A8] YANENKO, N.N.: The method of fractional steps; the solution with set of admissible controls U and two sets
of problems of mathematical phYSICS in several variables,
Springer, 1971 (translated from the Russian).
Mo, M\ eRn. Let D be the set of absolutely continuous
functions in t such that X(tO)EMO, x(t\)EM\. The
AMS 1980 Subject Classification: 65MXX, 65NXX boundary value problem is to find a pair (xoo, uoO)
BOUNDARY VALUE PROBLEM, ORDINARY DIF-
such that UOOE U and the solution xo(t) of equation
FERENTIAL EQUATIONS - The problem of finding a
(3) at u=uo(t) satisfies the condition xoOED.
4) There is a wide range of diverse necessary and suf-
solution to an equation
ficient conditions for the existence and uniqueness of
~ = jet, x), tEJ, xERn, (1) solutions to various boundary value problems, and of

458
BOUNDARY VALUE PROBLEM, PARTIAL DIFFERENTIAL EQUATIONS

methods for constructing an approximate solution (see [A2] INCE, E.L.: Ordinary differential equations, Dover, reprint,
1956.
[4] - [7]). For example, consider the problem [A3] JACKSON, L.K.: 'Boundary value problems for ordinary dif-
ferential equations', in J.K. Hale (ed.): Studies in ordinary dif-
x' = A (t)x +f(t, x), }
ferential equations, Math. Assoc. Amer., 1977, pp. 93-127.
I, (4)
j[dq,(t)]x(t) = 0, AMS 1980 Subject Classification: 34BXX
10
in which
IIf(t, x) II ~a+bllxlla BOUNDARY VALUE PROBLEM, PARTIAL DIF-
FERENTIAL EQUATIONS - The problem of determin-
for certain constants a >0, b >0, a~O. Suppose that
ing in some region D with points x=(xJ, ... ,xn ) a
the homogeneous problem
solution u(x) to an equation

I,

x' = A(t)x, j[dq,(t)]x(t) = (5) (LuXx) = f(x), XED, (1)


10
which satisfies certain boundary conditions on the boun-
is regular, i.e. its only solution is the trivial one. Then dary S of D (or on a part of it):
problem (4) has at least one solution, provided either
a< 1, or a~ 1 and b is sufficiently small. It is fairly (Bu)(y) = </>(y), YES. (2)
complicated to determine whether problem (5) is regu- As a rule, the boundary conditions relate the boundary
lar. However, the linear (scalar) boundary value prob- values of the solution to its derivatives up to a certain
lem order, i.e. B is a differential operator. However, boun-
XU +q(t)x' +p(t)x = 0, x(to) = 0, x(t]) = 0, dary conditions of other types also occur.
Given a differential equation, the question whether a
for example, is regular if whenever I q(t) I :;;;;2m there
specific boundary value problem should be studied is
exists a k ER such that
frequently settled by the concept of what is known as
I,
well-posedness. Namely, a boundary value problem is
jfp(t)-k]+ dt < 2[F(k, m)-mJ,
10
well-posed if it is solvable, and if its solution is unique
and depends continuously on the data of the problem.
where
Different types of differential equations require dif-
_~ (t]-to)~ ferent well-posed boundary value problems; and con-
V k-m" cotg 2 ' versely, well-posed boundary value problems may
sometimes serve as a basis for the classification of types
m2 <b;;;;;m 2+ .,T2 ,
(t] -tof of differential equations.
F(k, m) = A boundary value problem is said to be linear if the
operators L and B are linear, and homogeneous if f and
cJ> in (l), (2) vanish. A linear boundary value problem
is said to be Noetherian if: a) the homogeneous prob-
lem has a finite number k of linearly independent solu-
tions; b) the inhomogeneous problem is solvable if and
References
[I] HARTMAN, P.: Ordinary differential equations, Birkhauser, 1982. only if f and cJ> satisfy I linearly independent ortho-
[2] KRAsNOSEL'sKIi, M.A., BURD, V.SH. and KOLESOV, Yu.S.: gonality conditions; and c) on the assumption that the
Nonlinear almost-periodic oscillations, Wiley, 1973 (translated problem is uniquely solvable, the solution depends con-
from the Russian).
[3] PONTRYAGIN, L.S., ET. AL.: The mathematical theory of optimal
tinuously on f and cJ>.
processes, Interscience, 1962 (translated from the Russian). If k = I, the problem is called a Fredholm problem.
[4] KRAsoVsKll, N.N.: Theory of control of motion. Linear systems, The difference k -I defines the index of the problem.
Moscow, 1968 (in Russian).
A broad range of boundary value problems for linear
[5] ZUBOV, V.l.: Lectures in control theory, Moscow, 1975 (in Rus-
sian). second-order differential equations
[6] KAMKE, E.: Differentialgleichungen: LOsungsmethoden und
LOsungen, I, Chelsea, reprint, 1971.
[7] KIGURADZE, I.T.: Some singular boundary value problems for
ordinary differential equations, Tbilis~ 1975 (in Russian).
= f(x)
Yu. V. Komlenko
E.L. Tonkov falls into the category of the Poincare problem. In this
Editorial comments. type of problem the boundary conditions are prescribed
References
on the entire boundary, which is assumed to be an
[A 1] BRAUN, M.: Differential equations and their applications, (n - 1)-dimensional manifold, and the boundary opera-
Springer, 1975. tor B in (2) has the form

459
BOUNDARY VALUE PROBLEM, PARTIAL DIFFERENTIAL EQUATIONS

n au equation of parabolic type, the heat equation


(Bu)(y) = ;~;(y)~+p(y)u = CP(Y), YES. (4)
Lu = a2 u -~ = 0,
The Poincare problem for bounded regions D with ax 2 at
sufficiently-smooth boundaries have been thoroughly the Dirichlet problem in the domain D bounded by the
investigated in the case of uniformly-elliptic operators straight lines x = + 1, t = + I amounts to the specifica-
L. tion of boundary values u on the intervals
On the assumption that the coefficients of the opera- {t=+l, -1";x";+I} and {x=+l, -1..;t";1}.
tors L and B in (3), (4) are sufficiently smooth, and Boundary value problems for equations of mixed
that the boundary of D is sufficiently smooth, the Poin-
care problem is Noetherian if n =2, >0, and is ar elliptic-hyperbolic type are formulated in a special

Fredholm problem if n > 2, r


>0 and the vector
manner.
Boundary value problems occupy an important posi-
p=(pJ, ... ,Pn) is not tangent to S. Investigations of tion in the theory of analytic functions. Let S be a
the Poincare problem in the two-dimensional case make piecewise-smooth curve in the plane, i.e. a union of a
extensive use of the theory of functions of a complex finite number of simple oriented arcs. The linear
variable (see Boundary value problem, complex-variable conjugation problem consists in determining a function
methods). </>(z), analytic outside S, having limit values </>(t), tES,
Boundary conditions for the general elliptic equation from both sides of S and satisfying the boundary con-
of order 2m dition
alai u
Lu = ~ a,,(x)-- = I(x), xED, (5) where G(t) and get) are given functions. The solution
1"1,.;2m ax a
to this problem can be specified in explicit form with
may be prescribed in terms of linear differential opera-
the help of representations of analytic functions by
tors
Cauchy-type integrals, subject to certain additional
alai u . assumptions concerning the functions G, g and the
Bju = ~ bja(Y)-a- = <l>j(Y), YES, l';;;;j';;;;m, (6)
lal";m, ax curve S. See Boundary value problems of analytic func-
of order mj <2m, with coefficients defined on the boun- tion theory.
dary S of D. Here a=(aj, ... ,an), a; is a non-negative Various methods are available for the investigation of
boundary value problems. The Schwarz alternating
integer, Ia 1= L7=ja; and
method, Poincare's related balayage method and the
alai alai
Perron method are based on an application of the max-
ax a ax~l ... a~: imum principle. A solution using integral equations is
If the operators Land Bj satisfy the so-called com- based on various integral representations of the solu-
plementarity conditions, then bounds for the derivatives tions. Among functional methods is the investigation of
of order 2m of the function u satisfying the boundary boundary value problems using a priori estimates. The
conditions can be formulated (relative to an appropri- theory of distributions is widely applied. In practical
ate norm) in terms of the norm of fin (5) and suitable applications, various finite-difference methods are quite
norms of the boundary functions </>j in (6). Boundary frequently used.
value problems of this type are said to be coercive.
Another type of boundary value problems are known References
[lJ BERS, L., JOHN, F. and SCHECHTER, M.: Partial differential
as mixed problems (cf. Mixed problem), in which dif- equations, Interscience, 1964.
ferent boundary conditions are prescribed on adjacent [2J BITSADZE, A.V.: Boundary value problems for secondorder ellip-
sections of the boundary. tic equations, North-Holland, 1968 (translated from the Rus-
sian).
Characteristic for boundary value problems of dif- [3J COURANT, R. and HILBERT, D.: Methods of mathematical phy-
ferential equations that are uniformly elliptic in D is sics. Partial differential equations, 2, Interscience, 1965
that the boundary conditions are prescribed on the (translated from the German).
[4J LADYZHENSKAYA, O.A.: Boundary value problems ofmathemati-
entire boundary. If the operator L in (3) is elliptic in cal physics, Moscow, 1973 (in Russian).
the interior of the region and parabolically degenerates [5] MIRANDA, c.: Partial differential equations of elliptic type,
on a section S 0 ~ S, then, depending on the type of Springer, 1970 (translated from the Italian).
[6] MUSKHELISHVILI, N.!.: Singular integral equations, Wolters-
degeneracy, So can be eliminated from the specification
Noordhoff, 1972 (translated from the Russian).
of the boundary conditions. See also Boundary value [7] TICHONOFF, A.N. [A.N. TIKHONOV] and SAMARSKIi, A.A.: Dif
problem, elliptic equations. ferentialgleichungen der mathematischen PhYSik, Deutsch. Ver-
If equation (3) is not elliptic, part of the boundary lag Wissenschaft., 1959 (translated from the Russian).
may usually be eliminated from the specification of the A.P. Soldatov
boundary conditions. For example, given the simplest Editorial comments. Useful additional references are

460
BOUNDARY VALUE PROBLEMS OF ANALYTIC FUNCTION THEORY

given below. mass distribution on r with a constant potential in the


References interior of D. This problem appears in electrostatics,
[A 1) COURANT, R. and HILBERT, D.: Methods of mathematical when it is required to determine an equilibrium charge
physics, 1-2, Interscience, 1953-1962 (translated from the distribution on a conductor r which has no effect any-
German).
[A2) FRIEDMAN, A.: Partial differential equations, Holt, 1969. where inside D.
[A3) GARABEDIAN, P.R.: Partial differential equations, Wiley, 1964. 6) The balayage method, which, in its simplest
[A4) HORMANDER, L.: Linear partial differential operators, presentation dating back to H. Poincare, amounts to
finding a 'swept-out' mass distribution on r the poten-
Springer, 1963.
[A5) LADYZHENSKAYA, O.A.: Linear and quasilinear elliptic equa-
tions, Acad. Press, 1968. tial of which in the complementary domain CD coin-
[A6) LIONS, 1.L. and MAGENES, E.: Non-homogenous boundary cides with the potential of given masses in the interior
value problems and applications, 1-3, Springer, 1972
of D. The two last problems are particularly important
(translated from the French).
[A7) PROTTER, M.H. and WEINBERGER, H.F.: Maximum principles in abstract potential theory (see Potential theory,
in differential equations, Prentice Hall, 1967. abstract). See also Bessel potential; Non-linear poten-
[A8) FORSYTHE, G.E. and WASOW, W.R.: Finite-difference methods tial; Riesz potential.
for partial differential equations, Wiley, 1960.
References
AMS 1980 Subject Classification: 35F15, 35F30, [1) GONTER, N.M.: Potential theory and its applications to basic
35G15,35G30 problems of mathematical phYSiCS, F. Ungar, New-York, 1967
(translated from the French).
[3) BRELOT, M.: Elements de la tMorie classique du potential, Sor-
BOUNDARY VALUE PROBLEMS IN POTENTIAL bonne Uniy. Centre Doc. Uniy., Paris, 1959.
THEORY - Fundamental problems in both classical [4) CONSTANTINESCU, C. and CORNEA, A.: Potential theory on har-
and abstract potential theory. The classical Newton and monic spaces, Springer, 1972.
E.D. Solomentsev
logarithmic potentials satisfy certain partial differential
equations of elliptic type: the Laplace equation in Editorial comments. Well-known additional references
are [A 1], [A2], [A3]. Boundary value problems for parabolic
regions free of the masses generating the potentials, and
equations can be considered in the same way as those in
the Poisson equation in regions occupied by the masses.
potential theory, cf. [A3]. The relation between boundary
It follows, therefore, that the boundary value problems
value problems and the theory of Markov processes is con-
of potential theory are primarily boundary value prob- sidered in [A3].
lems for elliptic equations and systems (cf. Boundary
References
value problem, eUiptic equations). [A 1) KELLOGG, O.D.: Foundations of potential theory, F. Ungar,
I) The Dirichlet problem, or first boundary value prob- 1929. Re-issue: Springer, 1967.
lem. Here the problem is to find a potential u(x) in [A2) HELMS, L.L.: Introduction to potential theory, Wiley (Intersci-
ence), 1969 (translated from the German).
some domain D, given its continuous restriction [A3) DOOB, 1.L.: Classical potential theory and its probabilistic
u(x) = f(x), x E r, to the boundary aD = r of the counterpart, Springer, 1983.
domain on the assumption that the mass distribution in
AMS 1980 Subject Classification: 31 A25, 31820
the interior of D is known. This is a fundamental prob-
lem in potential theory.
BOUNDARY VALUE PROBLEMS OF ANALYTIC
2) The Neumann problem, or second boundary value
FUNCflON THEORY - Problems of finding an analytic
problem. Here the problem is to find a potential in D,
function in a certain domain from a given relation
given the continuous restriction of its normal derivative
between the boundary values of its real and its ima-
~ = Ijl(x) ginary part. This problem was first posed by B.
an
Riemann in 1857 [l]. D. Hilbert [2] studied the boun-
to r.
dary value problem formulated as follows (the
3) The mixed problem, or third boundary value prob-
Riemann- Hilbert problem): To find the function
lem. Here the known data on r
constitute a linear
P(z) = u + iv that is analytic in a simply-connected
combination
domain S + bounded by a contour L and that is con-
au(x)
an +a(x)u(x) = Ijl(x), xEf. (*) tinuous in S + U L, from the boundary condition

4) The problem with oblique derivative arises when Re(a+ib)4> = au-bv = c, (1)
the normal derivative au(x) / an in condition (*) is where a, b and c are given real continuous functions on
replaced by the derivative au(x) / al with respect to an L. Hilbert initially reduced this problem to a singular
arbitrary direction l=l(x), x Er. integral equation in order to give an example of the
In addition to these general problems, the following application of such an equation.
specific problems have also arisen in potential theory. The problem (1) may be reduced to a successive solu-
5) The Robin problem. Here it is required to find a tion of two Dirichlet problems. A complete study of the

461
BOUNDARY VALUE PROBLEMS OF ANALYTIC FUNCTION THEORY

problem by this method may be found in [3]. Np. _A(to)lJ(to)+ [N(to,t)lJ(t)ds = (5)
The problem arrived at by H. Poincare [4] in
developing the mathematical theory of tide resembles = !(to)-c(J(to),
problem (1). Poincare's problem consists in determining
where p. is the unknown real-valued function of class H,
a harmonic function u (x, y) in a domain S + from the
c is an unknown real constant, and
following condition on the boundary L of this domain:
au au N(t t) = K(to, t) .
A(s)-a;;-+B(s)a.;-+C(s)u = !(s), (2) 0, t - to

where A (s), B (s), C (s) and f (s) are real functions The functions A(to), K(/o, I) and a(/o) are expressed in
given on L, s is the arc abscissa and n is the normal to terms of aj(/) and hilo, 1),j=O, ... ,m.
L. Let k and k' be the numbers of linearly independent
The generalized Riemann- Hilberl- Poincare problem solutions of the homogeneous integral equation N p. =0
is the following linear boundary value problem: To find corresponding to (5) and of the homogeneous integral
an analytic function I>(z) in S + from the boundary equation
condition Re{X<Il} = !(to), toES+, (3) N~ - A(to)p(to)+ jN(t, to)p(t)ds = 0, (6)
L
where A is an integro-differential operator defined by associated with it. The numbers k and k' are connected
the formula with the index " of the Riemann - Hilbert - Poincare
X<Il = j~ {alto)<Il(f) (to) + [hlt o, t)~(f)(t)d+ (4)
problem by the equality
K = k-k'.
where ao(to), ... ,am(/) are (usually complex-valued) Of special interest is the case when the problem is
functions of class H defined on L (i.e. satisfying a solvable whatever the right-hand side f (10). In order
Holder condition), f (I) is a given real-valued function for the Riemann - Hilbert - Poincare problem to be
of class H and hilo, I) are (usually complex-valued) solvable whatever the right-hand side f(to), a necessary
functions on L of the form and sufficient condition is k' =0 or k' = I, and in the
h(t t) = hJ(to, t) O";;;;a< 1, latter case the solution v(/) of equation (6) must satisfy
J 0, It-tola' the condition
j p(t)(J(t) ds =1= 0;
where hJ(to, I) are functions of class H in both vari- L

ables. The expression 1>(;)(/0) on the right-hand side of in both cases ,,;;;;.0 and the homogeneous problem
(4) is understood to mean the boundary value on L Re{;\I>} =0 has exactly K+ I linearly independent solu-
from inside the domain S + of the j-th order derivative tions. If a(/)=O, then the Riemann-Hilbert-Poincare
of I>(z). problem is solvable for any right-hand side if and only
A special case of the Riemann - Hilbert - Poincare if k' =0.
problem, in the case when m =0, hilo,/)=O, is the As regards the Riemann - Hilbert problem, the fol-
Riemann - Hilbert problem; Poincare's problem is also lowing statements are valid: I) If ,,;;;;'0, then the inho-
a special formulation of the same problem. Many mogeneous problem (1) is solvable whatever its right-
important boundary value problems - such as boun- hand side; and 2) if ,,< -2, then the problem has a
dary value problems for partial differential equations of solution if and only if
elliptic type with two independent variables - may be 2"
reduced to the Riemann - Hilbert - Poincare problem. je i (kh/2}<t>Q(<p)c(<p)d<p = 0, k=1, ... ,-K-1,
o
The Riemann - Hilbert - Poincare problem was also
where
posed for am (to)=i=0, 10 EL, and was solved by I.N.
Vekua [3].
Q(<P) = V 1 {1 2"
exp --4 jO(<PI)cotg-2-d<P1 ,
<PI - <p }
An important role in the theory of boundary value a 2 (<p) + b 2 (<P) 7T 0

problems is played by the concept of the index of Ihe


problem - an integer defined by the formula O(t)=arg[-t-':~~~]' a 2 +b 2 =1=O.
K = 2(m +n),
The Riemann - Hilbert problem is closely connected
where 27Tn is the increment of argam(t) under one com- with the so-called problem of linear conjugation. Let L
plete traversal of the contour L in the direction leaving be a smooth or a piecewise-smooth curve consisting of
the domain S + at the left. closed contours enclosing some domain S + of the com-
The Riemann - Hilbert - Poincare problem IS plex plane z = x + ry, which remains on the left during
reduced to a singular integral equation of the form traversal of L, and let the complement of S + U L in

462
BOUNDARY VALUE PROBLEMS OF ANALYTIC FUNCTION THEORY

the z-plane be denoted by S -. Let a function <I>(z) be Note that the various names given to various variants of
given, and let it be continuous in a neighbourhood of these problems are by no means fixed. Thus, what is called
the curve L, everywhere except perhaps on L itself. One the linear conjugation problem above is also often known as
says that the function <I>(z) is continuously extendable the Riemann-Hilbert problem [A9]. This version, especially
to a point tEL from the left (or from the right) if <I>(z) the matrix case where g(t), 111+ (t), 111- (t) are all (invertible)
matrix-valued functions, is of great importance in the theory
tends to a definite limit <1>+ (t) (or <1>- (t as z tends to
of completely-integrable systems. Indeed, consider an over-
t along an arbitrary path, while remaining to the left
determined system of linear partial differential equations (cf.
(or to the right) of L. [A6] for more detail)
The function <I>(z) is said to be piecewise analytic
with jump curve L if it is analytic in S + and S - and cf>x = ucf>, cf>t = vcf>, (A1)

is continuously extendable to any point tEL both from where u, v are to be thought of as rational functions in a
the left and from the right. complex parameter ;\. with coefficients depending on x, t but
The linear conjugation problem consists of determin- with the pole structure independent of x, t, e.g.
ing a piecewise-analytic function <I>(z) with jump curve U1 Un
U = Uo+--,
a1 -/\
+ ... +--,
a -/\
'
L, having finite order at infinity, from the boundary n

condition with the a, constants and the u, functions of x, t only. An


invertible matrix solution cf> of (A 1) exists if and only if the
corresponding u, v satisfy
where G(t) and get) are functions of class H given on
L. On the assumption that G(t):=O everywhere on L,
ut-vx+[u, v] = 0, (A2)

the integer a so-called Zakharov- Shabat system. Many integrable sys-


tems can be put in this form. Now let cf> solve (A 1) and take
a function g(;\.) on a contour r in the A-plane. Solve the
is called the index of the linear conjugation problem. (x, t)-family of matrix Riemann - Hilbert problems
If <I>(Z)=(<I>l, ... ,<I>n) is a piecewise-analytic vector, 111- =cf>Qc/> -1111+. Then 1/;= (111- )-1 cf> also solves (A 1) and this
G(t) is a square (n Xn)-matrix and g(t)=(gl' ... ,gn) leads to an action of the group of invertible matrix-valued
is a vector, and if also det G(t)==O, then the integer functions in ;\. on the space of solutions of (A2). This
1 method of obtaining a new solution ~=I/;Kcf>-1, ~=l/;tV1
K = 27T [argdetG(t)lL from an old one u, v and a function g(A) is k~o::vn as the
Zakharov- Shabat dressing method. (u, v)r->( U, v) is also
is called the total index of the linear conjugation problem.
sometimes known as a Riemann- Hilbert transformation. In
The concepts of the index and the total index play an
the case of Einstein's field equations (axisymmetric solu-
important role in the theory of the linear conjugation
tions) a similar technique goes by the names of
problem [5], [6], [7]. Hauser- Ernst or Kinners/ey- Chitre transformations, and in
The theory of one-dimensional singular integral that case (a subgroup of) the group involved is known as
equations of the form (5) was constructed on the basis the Geroch group [A 10]. The Riemann monodromy problem
of the theory of the linear conjugation problem. asks for n multi-valued functions yeA) =(Y1 (A), ... ,Yn(A
References regular everywhere but in ;\.=a1,.. ,an, oo=ao, such that
[I] RIEMANN, B.: Gesammelte mathematische Werke - Nachtrdge, analytic continuation around a contour containing exactly
Teubner, 1892-1902 (translated from the German). one of these points changes Y(Af into V;Y(A)T, i=O, ... ,n.
[2] HILBERT, D.: Grnndzuge einer allgemeinen Theorie der Iinearen This problem reduces to the Riemann- Hilbert problem by
Integralgleichungen, Chelsea, reprint, 1953.
[3] VEKUA, LN.: Trudy Tbil. Mat. Inst. Akad. Nauk GrnzSSR 11 taking a contour through ao, ... ,an and a suitable step
(1942), 109-139. function on it. The Riemann monodromy problem was
[4] POINCARE, H.: Lefons de mecanique celeste, 3, Paris, 1910. essentially solved by J. Plemelj [A11], GD. Birkhof [A12],
[5] MUSKHELISHVILI, N.I.: Singular integral equations, Wolters- and I.A. Lappo-Danilevsky [A13].
N oordhoff, 1972 (translated from the Russian).
[6] GAKHOV, F.D.: Boundary value problems, Pergamon, 1966 References
(translated from the Russian). [A1] BART, H., GOHBERG, Land KAASHOEK, M.A.: 'Fredholm
[7] KHvEDELIDZE, B.V.: Trudy Tbil. Mat. Inst. Akad. Nauk theory of Wiener-Hopf equations in terms of realization of
GruzSSR 23 (1956),3-158. their symbols', Integral Equations and Operator Theory 8
A. V Bitsadze (1985), 590-613.
Editorial comments. The problem discussed in the arti- [A2] BOUTET DE MONVEL, L., A.O. (EDS.): 'Mathematique et phy-
cle is also known as the barrier problem. For applications in sique', in Sem. ENS 1979-1982, Birkhauser, 1983.
mathematical physics, see [A6], [A?], [A9] , and the refer- [A3] CLANCEY, K. and GoHBERG, I.: 'Factorization of matrix
functions and singular integral operators', in Operator
ences given there. An important contribution to the theory
Theory: Advances and Applications, Vol. 3, Birkhauser,
(matrix case) was given in [A5]. Other relevant publications 1981.
are [A1], [A2], [A3], [A4] and [AS]. The method proposed in [A4] GoHBERG, I.e. and FELDMAN, LA.: Convolution equations
[A 1] employs the state space approach from systems and projection methods for their solution, Transl. Math.
theory. Monographs, 41, Amer. Math. Soc., 1974.

463
BOUNDARY VALUE PROBLEMS OF ANALYTIC FUNCTION THEORY

[A5] GoHBERG, I.e. and KRETN, M.G.: 'Systems of integral equa- dratic differentials (cf. Quadratic differential) in the
tions on a half line with kernels depending on the difference
of arguments', Amer. Math. Soc. Transl. (2) (1960),217-
solution of extremal problems in the theory of
287. (Uspekhi Mat. Nauk 13, no. 2 (80) (1958),3-72) univalent functions clear, since s(w) proves to be a
[A6] ZAKHAROV, V.E. and MANAKOV, S.V.: 'Soliton theory', in meromorphic function in many applications. In certain
J.M. Khalatnikov (ed.): Physics reviews, Vol. 1, Harwood cases it follows from the conditions of the problem that
Acad. Publ., 1979, pp. 133-190.
[A7] MEISTER, E.: Randwertaufgaben der Funktionentheorie,
the appropriate poles of s(w) belong to the boundary of
Teubner, 1983. the extremal domain, and it is shown by the fundamen-
[A8] RODIN, Y.L.: The Riemann boundary value problem on tal lemma of the method of boundary variation that the
Riemannian manifolds, Reidel, 1987.
[A9] CHUDNOVSKY, D. and CHuDNOVSKY, G., (EDS.): The
boundary of this domain belongs to the union of the
Riemann problem, complete integrability and arithmetic closures of the critical trajectories of the quadratic dif-
applications, Lecture Notes in Math., 925, Springer, 1982. ferential
[A 10] HOENSELAERS, e. and DIETz, W.: Solutions of Einstein's
equations: techniques and results, Lecture Notes in Phys.,
265, Springer, 1984. In a number of extremal problems, the fundamental
[A11] PLEMED, J.: Problems in the sense of Riemann and Klein, lemma not only yields qualitative results, but also gives
Interscience, 1 ~64. sufficient information for the determination of the
[A12A] BIRKHOFF, G.D.: 'Singular pOints of ordinary linear differen-
tial equations', Trans. Amer. Math. Soc. 10 (1909), 436-
boundary of the extremal domain, and hence for the
470. complete solution of the problem.
[A 128] BIRKHOFF, G.D.: 'A simplified treatment of the regular The following results were obtained by means of the
singular point', Trans. Amer. Math. Soc. 11 (1910), 199- method of boundary variation: Qualitative results in
202.
[A 13] LAPpo-DANILEVSKY, LA.: Memoires sur la tMorie des the coefficient problem for the class S; in the problem
systemes des equations differentielles lineaires, Chelsea, of the maximum of the n-th diameter in a family of
reprint, 1953. continua of a given capacity; the solution of a number
AMS 1980 Subject Classification: 30E25, 45EXX, of extremal problems of univalent conformal mappings
45F15 of doubly-connected domains; distortion theorems for
multiply-connected domains, which at the same prove
BOUNDARY VARIATION, METHOD OF - A existence theorems of univalent conformal mappings of
method of studying univalent functions (cf. Univalent a given multiply-connected domain onto canonical
function), based on the study of the variation of a func- domains, etc.
tion w = f(z) that is univalent in a domain of the z- References
plane, the variation of the function being determined [I) SCHIFFER, M.: 'A method of variation within the family of sim-
by the appropriate variations of the boundary of the ple functions', Proc. London Math. Soc., Ser. 244 (1938), 432-
449.
image of this domain. (2) ScHIFFER, M.: 'Some recent developments in the theory of con-
The fundamental lemma of the method of boundary formal mappings', in R. Courant (ed.): Dirichlet's principle.
variation. Let D be a domain in the w-plane and let Conformal mapping and minimal surfaces, Interscience, 1950.
(3) ScHIFFER, M.: 'Applications of variational methods in the
the complement!::. of D in the extended plane consist of theory of conformal mapping', in Calculus of variations and its
a number of continua. Let r be a continuum in !::. and applications, Proc. Symp. Appl. Math., Vol. 8, Arner. Math.
let there exist on r an analytic function s(w )=O such Soc. & McGraw-Hill, 1958, pp. 93-113. GV K ' .
.. uz mma
that for any point Wo E r and for any function F( w)
that can be represented as Editorial comments. The 'fundamental lemma of the
method of boundary variation' is also known as Schiffer's
AJp2
F(w) = w+A o+ _ _ +O(p3) (*) theorem.
W-Wo
References
and that is univalent in D, the inequality [A 1] DUREN, P.L.: Univalent functions, Springer, 1983, Chapt.1 O.

Re{AJs(wo)}+O(p);;;' 0 AMS 1980 Subject Classification: 30C45, 30C70


be valid, and suppose that the estimate of the residual
term in (*) is uniform in all closed subdomains of D. r BOUNDED OPERATOR - A mapping A of a topo-
will then be an analytic curve that may parametrically logical vector space X into a topological vector space Y
be represented by means of the function w=w(t) of the such that A (M) is a bounded subset in Y for any
real parameter t. This parameter may be so chosen that bounded subset M of X. Every operator A: X ~ Y, con-
r satisfies the differential equation tinuous on X, is a bounded operator. If A: X ~ Y is a
linear operator, then for A to be bounded it is suffi-
[c;; fS(W)+l = o. cient that there exists a neighbourhood U C X such that
A (U) is bounded in Y. Suppose that X and Yare
This result makes the important role played by qua- normed linear spaces and that the linear operator

464
BRACHISTOCHRONE

A : X ~ Y is bounded. Then is finite.


y = sup II Ax II < 00.
2) A bounded set in a topological vector space E
Ilxll ;;;] (over a field k) is a set B which is absorbed by every
This number is called the norm of the operator A and is neighbourhood U of zero (i.e. there exists an aEk such
denoted by I A II. Then that B Call).
M.l. Voitsekhovskii
II Ax II ~ II A I . II x II, AMS 1980 Subject Classification: 46AXX, 54E35
and IAI is the smallest constant C such that
II Ax I ~ C IxI BOUNDEDLY-COMPACf SET in a topological linear
space X - A set M with the property that the closure N
for any x EX. Conversely, if this inequality is satisfied,
then A is bounded. For linear operators mapping a of every bounded subset N CM is compact and is con-
normed space X into a normed space Y, the concepts tained in M (for a normed space in the strong (resp.
of boundedness and continuity are equivalent. This is weak) topology this is equivalent to the compactness
(resp. weak compactness) of the intersections of M with
not the case for arbitrary topological vector spaces X
balls). A convex closed set in a normed space is
and Y, but if X is bomological and Y is a locally con-
boundedly compact if and only if it is locally compact.
vex space, then the boundedness of a linear operator
Boundedly-compact sets have applications in the theory
A : X ~ Y implies its continuity. If H is a Hilbert space
of approximation in Banach spaces; they have the pro-
and A: H ~H is a bounded symmetric operator, then
perty that an element of best approximation exists. A
the quadratic form <Ax, x> is bounded on the unit
barrelled topological linear space which is boundedly
ball K, ={x: I x 11':;;;1}. The numbers
compact (in itself) in the weak (resp. strong) topology is
f3 = sup <Ax, x> and a = Ilxllinf <Ax, x> a reflexive (resp. Montel) space. A normed space which
Ilxll;;;] <;;]

are called the upper and lower bounds of the operator A. is boundedly compact is finite-dimensional.
The points a and f3 belongs to the spectrum of A, and References
[1) KLEE, V.L.: 'Convex bodies and periodic homeomorphisms in
the whole spectrum lies in the interval [a, f3]. Examples
Hilbert space', Trans. Amer. Math. Soc. 74 (1953), 10-43.
of bounded operators are: the projection operator (pro- (2) EDWARDS, R.E.: Functional analySiS: theory and applications,
jector) onto a complemented subspace of a Banach Holt, Rinehardt, Winston, 1965.
L.P. Vlasov
space, and an isometric operator acting on a Hilbert
space. AMS 1980 Subject Classification: 46AXX
If the space X and Y have the structure of a partially
ordered set, for example are vector lattices (cf. Vector BOURGET FUNCTION - The function In,k(Z) which
may be defined as a generalization of the integral
lattice), then a concept of order-boundedness of an
representation of the Bessel functions
operator can be introduced, besides the topological
boundedness considered above. An operator A: X ~ Y
is called order-bounded if A (M) is an order-bounded set
In,k(Z) = 2~)t-n-] [t++r exp { ~Z[t-+]}dt,
in Y for any order-bounded set M in X. Examples: an where n is an integer and k is a positive integer. The
isotone operator, i.e. an operator such that x '0' implies integration contour makes one counter-clockwise tum
Ax':;;;Ay. around the coordinate origin. In other words,
References 1 '1T
=-

[1) LYUsTERNIK, L.A. and SOBOLEY, V.I.: E/emente der Funk- In,k(Z) j(2 cos 81 cos(n8-z sm8) d8,
tiona/analysis, Akademie Verlag, 1955 (translated from the 7r 0
Russian).
(2) RUDIN, W.: Functiona/ analysis, McGraw-Hill, 1973. In,O(z)=.Jn(z) is a cylinder function of the first kind. So
[3) BIRKHOFF, G.: Lattice theory, Arner. Math. Soc., 1967. named after 1. Bourget [1], who studied the function
v.I. Sobolev with a view to various applications in astronomy.
Editorial comments.
References
References [1) BoURGET, J.: 'Memoire sur les nombres de Cauchy et leur
[A1) TAYLOR, A.E. and LAY, D.C.: Introduction to functional application a divers problemes de mecanique celeste', J. Math.
analysis, Wiley, 1980. Pures App/. (2) 6 (1861), 32-54.
[A2) ZAANEN, A.c.: Riesz spaces, 2, North-Holland, 1983. (2) WATSON, G.N.: A treatise on the theory of Besse/functions,
Cambridge Univ. Press, 1952, Chapt. 10.
AMS 1980 Subject Classification: 47 A05, 47855 v.I. Pagurova
AMS 1980 Subject Classification: 33A40
BOUNDED SET - I) A bounded set in a metric
space X (with metric p) is a set A whose diameter BRACIDSTOCHRONE - A curve of fastest descent.
8(A) = sup p(x, y) The problem of finding such a curve, which was posed
x,yEA

465
BRACHISTOCHRONE

by G. Galilei [1], is as follows: Out of all planar curves


connecting two given points A and B and lying in a
common vertical plane (B below A) to find the one
along which a material point moving from A to B
solely under the influence of gravity, would arrive at B
within the shortest possible time. The problem can be
reduced to finding a function y (x) that constitutes a cr,
minimum of the functional

J(y) = f -if
u
1 +y'2 dx,
2gy
Fig. 1. Fig. 2.

If this is the identity permutation, w is called a coloured


where a and b are the abscissas of points A and B. The (or pure) braid. The transposition (i i + 1) corresponds
brachistochrone is a cycloid with a horizontal base and to a simple braid (Ji (see Fig. 2).
with its cusp at the point A. On the set of all braids on n strings with fixed Po,
References P J, {aj}, {b;}, one introduces the equivalence relation
[I] GALILEI, G.: Unterredungen und mathematische Demonstra- defined by homeomorphisms h : II ~ II, where II is the
tionen aber zwei neue Wissenszweigen, die Mechanik und die region between Po and PI, which reduce to the identity
Fallgesetze betreffend, W. Engelmann, Leipzig, 1891 (translated
from the Italian and the Greek). on Po U PI; it may be assumed that h(Pt)=Pt . Braids
[2] LAVRENT'EV, M.A. and LVUSTERNIK, L.A.: A course in varia- a and {3 are equivalent if there exists a homeomorphism
tional calculus, Moscow-Leningrad, 1950 (in Russian). with the above properties such that h(a)={3.
L.P. Kuptsov The equivalence classes - which are still called
Editorial comments. The brachistochrone problem is braids - form the braid group B(n) with respect to the
usually ascribed to Johann Bernoulli, cf. [A1], pp. 350 ff. A operation defined as follows. Place a copy II' of the
treatment can be found in most textbooks on the calculus of domain II above another copy II", in such a way that
variations, cf. e.g. [A2]. P~ coincides with p'J, a;' with b;, and then compress
References II' U II" to half its 'height'. The images of the braids
[A1] ROUSE BALL, W.W.: A short account of the history of w' E II' and w" E II" produce a braid w' w", with. string Ii
mathematics, Dover, reprint, 1960.
[A2] WEINSTOCK, R.: Calculus of variations, Dover, reprint, 1974. obtained by extending f; with (~i' where k j E S'" (i). The
AMS 1980 Subject Classification: 49-XX, 51 M15
identity braid is the equivalence class containing the
braid with n parallel segments; the inverse w-I of a
BRAID THEORY - The branch of topology and alge- braid w is defined by reflection in the plane P I / 2 For
bra concerned with braids, the groups formed by their the condition ww- I = see Fig. 3.
equivalence classes and various generalizations of these
groups [1].
A braid on n strings is an object consisting of two
parallel planes Po and P I in three-dimensional space
R3 , containing two ordered sets of points
aJ, ... ,anEP O and b l , . . . ,bnEP I , and of n simple
non-intersecting arcs II, ... ,In' intersecting each paral-
lel plane Pt between Po and P I exactly once and join-
ing the points {a;} to {b;}, i = 1, ... ,n. It is assumed
that the a;'s lie on a straight line La in Po and the b;'s Fig. 3.
on a straight line Lb in P I parallel to La; moreover, bi
lies beneath aj for each i (see Fig. 1). Braids can be The mapping w~S'" defines an epimorphism of B(n)
represented in the projection on the plane passing onto the group S(n) of permutations of n elements, the
through La and L b; this projection can be brought into kernel of this epimorphism is the subgroup K(n) of all
general position in such a way that there are only fin- pure braids, so that one has an exact sequence
itely many double points, each two of which lie at dif- 1 ~ K(n) ~ B(n) ~ S(n) ~ 1.
ferent levels, and the intersections are transversal.
The braid group B(n) has two principal interpreta-
The string Ii of a braid w joins ai to bki and so
tions. The first, as a configuration space, is obtained by
defines a permutation
identifying the planes P t via vertical projection onto
S'" = [k\ ::: :n] . Po, under which the images of the points ail =/i Pt, n
considered as t varies from 0 to 1, form the trace of an

466
BRAID THEORY

isotopy $~ of the set U aj along Po; one has The braids OJ, 1~i ~n -1, are the generators of the
$r< U aj)= U aj. Consider the space of unordered group B(n), i.e. w=Ok, . . . 0k with m ,

sequences G(n) of n pairwise distinct points of the (Ji(Jj = (Jj(Ji if I i - j I > 1, }


plane; then each braid corresponds in one-to-one (Ji(Ji+1(Ji = (Ji+1(Ji(Ji-], l';;;i';;;n -2. (1)
fashion to a class of homotopy loops in this space, and
It turns out that (1) is a presentation of B(n) (see Fig.
one has an isomorphism
4).
{3: B(n) ~ 71"1 G(n).
There exists a splitting exact sequence (obtained from
For pure braids one has an analogously constructed the locally trivial fibration F(n )~F(n - 1) with fibre
isomorphism R2 \ (aI, ... ,an-I:
a: K(n) ~ 71"1 F(n),
1 ~Fn-1 ~K(n)~K(n-l)~ 1,
where F(n) is the space of ordered sequences of n dis-
tinct points of the plane, so that K(n) can be identified which leads to the normal series
with the subgroup corresponding to the covering K(n) = An :l ... :l A1 :l Ao = Fn- 1
p:F(n)~ G(n) = F(n)jS(n). with free fractors Aj / Aj _ I, where Aj has a 'com-
The second interpretation, as a homeotopy group, is ponent' Un - j isomorphic to K(n -i -1). Each element
obtained by extending the isotopy $~ to an isotopy $~ wEB(n) can be expressed uniquely in the form
of the plane Po that coinci~es with the identity outside
some disc, and such that $~ =id. For each t, two such where 71"", is a selected representative for S'" in B(n)
extensions differ by a homeomorphism which is the
identity at the points ail. A braid uniquely determines a
and Wi EAn - i + I n
Uj The reduction of a braid to this
form is known as its dressing. This solves the word
component of the space of homeomorphisms Y(n) of (identity) problem in B(n).
the plane which map the set U aj onto itself, and one A presentation of K(n) is as follows: generators (see
has an isomorphism Fig. 5):
'I: B(n) ~ 71"0 Y(n).

To each homeomorphism hEY corresponds an auto-


morphism of the free group of rank n : relations:
Fn = 71"1 (R2 \ U aj), defined up to an inner automor-
if r.;;;s<i<j
phism, which in turn yields a homomorphism
or i<r<s<j;
B(n)~OutFn =AutFn /InnFn The elements of the
image are called braid automorphisms of the free group. ArjAij = Arj if i =s; (2)

In particular, corresponding to the braid OJ one has an AijAsjAijAij = Asj if r=i<j<s;


automorphism
This presentation may be obtained as a presentation of
if }=I=i, i + 1 ({x;} is a set of generators of Fn). Any the kernel of the natural homomorphism into S(n) of
braid automorphism a possesses the following the abstract group B(n) defined by the present(!tion (1)
properties: ~th the aid of the Schreier system rr;=2Mjkj,
J~k/~n, where Mji=oj_1 ... OJ.
The centre of B(n) is the infinite cyclic group gen-
erated by the element (01 . . . 0n)n. The commutator
up to an inner automorphism (for the meaning of A j , group B'(n) coincides with B"(n) for n~5; B'(3) is
see below); these properties characterize braid automor- isomorphic to the free group of rank 2, and B' (4) is
phisms. isomorphic to semi-direct product of two such groups.
The quotient group modulo the commutator subgroup
is an infinite cyclic group, generated by the images of
OJ. There are no elements of finite order in B(n). The

group K(n) is mapped onto itself by endomorphisms


with non-Abelian image. In particular, K(n) B' (n) is n
a fully-characteristic subgroup of B(n), and also of
K(n) (see [15]).
Solving the conjugacy problem in B(n) is much more
Fig. 4. Fig. 5. complicated than solving the word problem. There is a

467
BRAID THEORY

unique Garside normal form of a braid, where ",=~mo, mapping (see [14]) establishes a homology equivalence
~=(Ol ... 0n-I)(OI ... 0n-2)' .. (102)01 is what is of the limit space G(oo) and (02S2)o (the subscript
known as a Garside element and 0 is a positive braid, indicates that one chooses the component of spheroids
i.e. a braid the representation of which in terms of the of degree 0). As to the unstable homology groups of
elements 0i has positive indices. With any braid '" one B(n), it has been proved [16] that they are finite, stabil-
can associate, using finitely many operations depending ize at height n and satisfy the recurrence relation
on i (conjugation with certain elements, choice of ele- H i B(2n+I)=H i B(2n). There is a description of the
ments of maximum degree, etc.), a certain set of words calculation of these groups [17].
~("'), from which one selects a word in normal form Applications and generalizations. I) A closed braid is a
~ + T with minimal T. This is a so-called upper form of link (an n-component knot) in R3 each component of
the braid "'. It turns out that two braids are conjugate which transversally cuts out half-planes bounded by the
if and only if they have a same upper form (see [7]). same straight line: the axis I of the closed braid (see
The Burau representation of the braid group B(n) in Fig. 6).
the group of matrices over the ring of polynomials in
one variable with integer coefficients is defined by the 1

~
correspondence
Ii-I 0 0 0
0 i-I I 0
b(w): 0i ~
0 1 0 0
0 0 0 In-k-I
Fig. 6.

where h is the identity matrix of order k. The matrix


b(",)- In is the reduced Alexander matrix (see Alex-
ander invariants) of the link obtained by closing the
braid '" (see below). For a pure braid one obtains the
full Alexander matrix from the analogous Gassner
matrix. The problem of whether these representations
are faithful is still (1982) unsolved (see [2]). Fig. 7.
The fact that the spaces F(n) and G(n) are aspherical
makes it possible to evaluate the homology of braid w
A braid '" generates a closed braid (the closure of w)
groups. in the following way. Consider a cylinder with bases on
The homology of K(n) (see [16]). Homologically, Po and PI, the interior of which contains w. Let this
K(n) coincides with the product of unions of circles in cylinder be deformed in R3 so that its elements become
which the number of circles runs from one through circles with centres on I, its bases coincide and each
n - 1. The homology ring is isomorphic to the exterior w
point ai coincides with bi' Then is the union of the
graded ring generated by the one-dimensional elements strings Ii. Conversely, every link in R3 can be
"'U="'Ji' l:s;;,i~j:s;;,n, with relations represented by a closed braid. To equivalent braids
WklWlm +WlmWmk +WmkWkl = O. correspond isotopic links and, moreover, conjugate
braids yield isotopic links. The converse is false, since a
As "'kl one can take the forms
link may be represented by braids with different
1 dzk-dz i numbers of strings. In addition, the braids Wan -I and
2'TTi Zk -ZI '
WO;~I are not conjugate in B(n), but they correspond
corresponding to passage along the diagonals Zk =z,. to isotopic links. If two closed braids are equivalent as
The homology of B(n) (see [8], [12]). The homomor- links, they can be derived from one another by a chain
phism B(n )~S(n) can be extended to an imbedding of elementary transformations of two types (see Fig. 7).
Sen )~O(n); the induced homomorphism of cohomol- These operations are interpreted in terms of presenta-
ogy spaces H* (O(n ~H* (B(n is epimorphic, i.e. the tions of the link group, thus yielding a reformulation of
cohomology spaces mod 2 of the group B(n) generated the isotopy problem for links as a question concerning
by the Stiefel- Whitney classes. the system of groups B(n). A presentation of the link
There is a natural mapping of G(n) into 0 2S2, the w
group of has the form
two-fold loop space of S2, i.e. the space of spheroids
S2~S2 (choose small discs about n points, then map VI, ... ,Yn: Yi =AiYkjAi- I },
these discs canonically with degree 1 into a sphere, where the relations are defined by braid automorphisms
mapping the entire complement onto a point). This b W Conversely, any such relation defines a braid.

468
BRANCH INDEX

2) If one cuts a surface of genus g with g non- where the number of factors on each side is equal to
intersecting cuts so as to obtain a sphere with 2g holes, mij (0, here corresponds to a Weyl chamber). It has
then the homeomorphisms of this sphere with holes been proved for these groups that X w and Y w are
that fix points on the edges of the holes define spaces of type K(17, I), and the conjugacy problem has
homeomorphisms of the surface which fix the cuts and been solved. The spaces X w appear in algebraic
are themselves defined up to isotopy by the elements of geometry as complements to the discriminant of versal
the group K(2g). This yields a representation of the deformations of rational singularities (see [12], [13]).
braid group in the homotopy group of the surface. References
Similarly one constructs a representation of B(2g). [1] ARTIN, E.: 'Theory of braids', Ann. of Math. 48 (1947), 643-
These representations are used in studying Heegaard 649.
[2] BIRMAN, l.S.: Braids, links and mapping class groups, Princeton
diagrams of three-dimensional manifolds (cf. Three- Univ. Press, 1974.
dimensional manifold). [3] BURAU, W.: 'Ueber Zopfinvarianten', Abh. Math. Sem Univ.
3) By identifying R2 with the complex plane CI and Hamburg 9 (1932), 117-124.
[4] MARKov, AA: 'Foundations of the algebraic theory of
associating with any unordered set of n points in the braids', Trudy Mat. Inst. Steklov. 16 (1945) (in Russian).
plane a polynomial of degree n having these points as [5] GASSNER, B.: 'On braid groups', Abh. Math. Sem Univ. Ham-
roots, one can identify G(n) with the space of polyno- burg 25 (1961), 10-22.
[6] FADELL, E. and NEUWIRTH, L.: 'Configuration spaces', Math.
mials with non-zero discriminants. This has made it Scand 10 (1962), 111-118.
possible to obtain several results concerning the non- [7] GARSIDE, F.A: 'The braid group and other groups', Quart. J.
representability of algebraic functions by superpositions Math. 20, no. 4 (1969), 235-254.
[8] FUKS, D.B.: 'Cohomology of braid groups mod 2', Funktional.
of functions in fewer variables (see [16]).
Anal. i Prilozhen. 4, no. 2 (1970), 62-73 (in Russian).
4) Configuration spaces for arbitrary spaces X are [9] ARNOL'D, V.I.: 'On cohomology classes of algebraic functions
defined in analogy with G(n) and F(n), with R2 that are preserved under Tschimhausen transformations',
replaced by X. The fundamental groups of these spaces, Funktional. Anal. i Prilozhen. 1 (1970), 84-85 (in Russian).
[10] GoRIN, E.A and LIN, V.YA.: 'Algebraic equations with con-
B(X) and K(X), are called the braid group of the space tinuous coefficients and some problems in the theory of
X and the pure braid group of the space X, respectively. braids', Mat. Sb. 78 (1969),579-610 (in Russian).
For a manifold M n of dimension exceeding 2, [11] ARNOL'D, V.I.: 'On certain topological invariants of algebraic
functions', Trudy Moskov. Mat. Obshch. 21 (1970), 27-46 (in
17IFn(X);::::::II~=117,M(i), and this group is of no Russian).
interest. For two-dimensional manifolds, one has a [12] BRIESKORN, E.: Matematika 18, no. 3 (1974), 46-59.
natural imbedding of B(n) and K(n) into Bn(M2) and [13] BRIESKORN, E. and SAITO, K.: 'Artin Gruppen und Coxeter
Gruppen', Invent. Math. 17 (1972), 245-271.
Kn(M2) induced by an imbedding R2 CM2. If M2 is [14] DELIGNE, P.: 'Les immeubles des groupes de tresses
neither a sphere nor a projective space, one obtains an generalises', Invent. Math. 17, no. 4 (1972), 273-302.
exact sequence (15] LIN, V.YA.: 'Representations of permutation braids', Uspekhi
Mat. Nauk. 29, no. 1 (1974), 173-174 (in Russian).
e n (16] ARNOL'D, V.I.: 'The ring of cohomology groups of crossed
1~ 7TI K(2) ~ 7TI Kn(M2) ~ II7T;Mfi) ; braids', Mat. Zametki 5, no. 2 (1969), 227-231 (in Russian).
;=1
[17] LIN, V.YA.: 'Artin braids and related groups and spaces', Itogi
for the sphere, the homomorphism e is an epimor- Nauk. i Tekhn. Algebra Topol. Geom. 17 (1979), 159-227 (in
phism, obtained by adding to (l) the single relation Russian). A .V. Ch ernavsk"II

AMS 1980 Subject Classification: 57M25


5) If p: X ~ Y is a k-sheeted covering, then p -I ex,
where ex is a loop in Y, is a loop in the configuration BRANCH INDEX - The sum V = ~(k - 1) of the
space X, and this defines a homomorphism orders of the branch points (cf. Branch point) of a com-
171 Y ~Bk(X) which strengthens the monodromy of the pact Riemann surface S, regarded as an n-sheeted cov-
covering and has applications in algebraic geometry. ering surface over the Riemann sphere, extended over
6) Let VC be the complexification of a real vector all finite and infinitely-distant branch points of S. The
space V and let W be a finite irreducible group gen- branch index is connected with the genus g and
erated by reflections acting in V (hence also in VC). number of sheets n of S by:
Let s, be generating reflections in the planes P, C V and V = 2(n+g-l).
let D be their union. Finally, let VC / D = Yw and let
See also Riemann surface.
X w be the quotient space. The groups 171 Y w and 171 X w
are called Brieskorn groups and constitute natural gen- References
[1] SPRINGER, G.: Introduction to Riemann surfaces, Addison-
eralizations of K(n) and B(n). If ord(s,sj) = m'j' then Wesley, 1957.
17 I X W has a presentation of the form E.D. Solomentsev
Editorial comments.

469
BRANCH INDEX

References Il(z] ; r) is again obtained, this is also true of all ele-


[A1] MUMFORD, D.: Algebraic geometry I. Complex projective ments of the branch (cf. Branch of an analytic function)
varieties, Springer, 1976.
of f (z) defined in V by the element Il(z] ; r). In such a
AMS 1980 Subject Classification: 14XX, 30F10 case a is a branch point of finite order k - I of this
branch. In a punctured neighbourhood V of a branch
BRANCH OF AN ANALYfIC FUNCflON - The point a of finite order this branch is represented by a
result of analytic continuation of a given element of an generalized Laurent series, or Puiseux series:
analytic function represented by a power series
00 (I)
n(a; r) = ~cv(z -at v= -00
v=O

with centre a and radius of convergence r >0 along all If a = 00 is an improper branch point of a finite
possible paths belonging to a given domain D of the order, then the branch of f (z) is representable in some
complex plane C, a ED. Thus, a branch of an analytic neighbourhood V' = {z: 1 z 1 > p} by an analogue of
function is defined by the element Il(a; r) and by the the series (I):
domain D. In calculations one usually employs only
(2)
single-valued, or regular, branches of analytic functions, v= -00

which exist not for all domains D belonging to the


domain of existence of the complete analytic function. The behaviour of the Riemann surface R of f (z) over
For instance, III the cut complex plane a branch point of finite order a is characterized by the
D =C\ {z =x: - oo";;;x,,;;;o} the multi-valued analytic fact that k sheets of the branch of f (z) defined by the
function w = Ln z has the regular branch element Il(z]; r) come together over a. At the same
time the behaviour of other branches of R over a may
w = Lnz = In I z I +iargz, I argz 1<'11', be altogether different.
which is the principal value of the logarithm, whereas If the series (l) or (2) contains only a finite number
in the annulus D = {z: 1< 1 z 1 <2} it is impossible to of non-zero coefficients bv with negative indices v, a is
isolate a regular branch of the analytic function an algebraic branch point or an algebraic singular point.
w=Lnz. Such a branch point of finite order is also characterized
References by the fact that as z~a in whatever manner, the values
[I] HURWITZ, A. and COURANT, R.: Vorlesungen iiber allgemeine of all elements of the branch defined by Il(z]; r) in V
Funktionentheorie und elliptische Funktionen, I, Chapt. 3; or V' tend to a definite finite or infinite limit.
2, Chapt. 4, Springer, 1964.
[2] MARKUSHEVICH, A.I.: Theory offunctions of a complex variable,
Example: f(z) = z ]1 k , where k> I is a natural
2, Chelsea, 1977 (translated from the Russian). number, a =0,00.
E.D. Solomentsev If the series (l) or (2) contain an infinite number of
AMS 1980 Subject Classification: 30A05, 30B40 non-zero coefficients bv with negative indices v, the
branch points of finite order a belong the class of
BRANCH POINT, singular point of multi-valued char- transcendental branch points.
acter - An isolated singular point a of an analytic func- Example: f(z)=exp(l/z)]lk, where k>1 is a
tion f (z) of one complex variable z such that the ana- natural number, a =0.
lytic continuation of an arbitrary function element of Finally, if it is impossible to return to the initial ele-
f (z) along a closed path which encircles a yields new ment after a finite number of turns, a is said to be a
elements of f (z). More exactly, a is said to be a branch logarithmic branch point or a branch point of infinite
point if there exist: I) an annulus order, and is also a transcendental branch point.
V={z: 0<1 z-a I<p} in whichf(z) can be analyti- Example: f(z)=Lnz, a =0,00.
cally extended along any path; 2) a point z] E V and Infinitely many sheets of the branch of f (z) defined
some function element of f (z) represented by a power by the element Il(z]; r) come together over a loga-
senes rithmic branch point.
x
n(zl;r) = ~cvCz-zd\ In the case of an analytic function of several complex
v =-=0
variablesf(z), z=(z], ... ,zn), n;;.2, a point a of the
with centre z] and radius of convergence r>O, the ana- space en or cpn is said to be a branch point of order m,
lytic continuation of which along the circle l";;;m";;; 00, if it is a branch point of order m of the,
1z -a 1= 1z] -a I, going around the path once in, generally many-sheeted, domain of holomorphy of f (z).
say, the positive direction, yields a new element Unlike in the case n = I, branch points, just like other
rI'(z] ; r') different from Il(z]; r). If, after a minimum singular points of analytic functions (d. Singular point),
number k > I of such rounds the initial element cannot be isolated if n ~2.

470
BRANCHING OF SOLUTIONS

References (I-Q)F(xo+u+v, A) = 0, }
[I] MARKUSHEVICH, A.I.: Theory offunctions of a complex variable,
2, Chelsea, Chapt. 8 (translated from the Russian). QF(xo +u +v, A) = 0,
[2] FUKS, B.A.: Theory of analytic functions of several complex vari
ables, Amer. Math. Soc., 1963 (translated from the Russian). where u =(/ - P)(x - xo), v =P(x - xo). The implicit
operator u = f (v, A) is found from the first equation of
E.D. Solomentsev
the system. Its substitution into the second equation of
AMS 1980 Subject Classification: 30A05, 30B40 the system yields the equation
QF(xo + j(v, A)+V, A) = 0,
BRANCIllNG OF SOLUTIONS, bifurcation of solu-
tions, of non-linear equations - A phenomenon that from which v may be found; it is said to be the branch-
causes a given solution of a non-linear equation to ing (bifurcation) equation. The complete solution of the
disappear completely or to become a number of solu- problem - to find in a sphere" v "<r of sufficiently
tions if minor changes in the parameters of the equa- small radius r all solutions v (A) of the branching equa-
tion are introduced. More precisely, let the non-linear tion that are continuous for I A-Ao I <p, where p is
equation sufficiently small - yields the complete solution of the
F(x, A) = (*) initial problem, since all its solutions can be
represented in the form
with parameter A (which need not be numerical) have
the solution Xo for a given value Ao of the parameter. X(A) = Xo +V(A)+JIV(A), Aj,
Then, if the values of A are close to Ao, equation (*) where v is a certain solution of the branching equation.
may have more than one solution X(A) close to xo. One Let F(x, A) be an analytic operator in Q. The choice
then speaks of branching (bifurcation) of the solution xo, of bases in the n-dimensional subspaces PEl =N(B)
and the pair (xo, Ao) is said to be a branching (bifurca- and QE 2 makes it possible to write the branching
tion) pOint of equation (*). equation in the form of the system
Example: The equation x 2 -A=O, where x and A are
2"j(~!, ... '~n,A) = 0, i=l, ... ,n,
complex variables, has the branching point
(xo, Ao)=(O, 0) because there exists a two-valued solu- where 2;, i = 1, ... ,n, is an analytic function at the
tion x = -..[i;, i.e. if A*O is small, the solution x =0 (for point (0, ... ,0, Ao), while all partial derivatives
A=O) branches out into two small non-trivial solutions. a.!l'; / a~j vanish at this point. This system can be stu-
The modem theory of branching of solutions is died using the theory of exclusion, the method of the
based on the ideas of A.M. LY3punov [1] and E. Newton diagram and other methods [3], [4], [5]. If
Schmidt [2] and has mostly been developed for non- n = 1, the complete analysis is effected by the method
linear equations in Banach spaces. of the Newton diagram. As regards the study of the
Let Eland E2 be complex Banach spaces, x EE I, let branching equation, i.e. of the original problem as well,
A be a complex variable, and let F(x, A) be a non-linear the following three cases are the only ones possible: a)
operator which, together with its Frechet derivative the problem has no solution; b) the problem has a fin-
FxCx, A), is continuous in a neighbourhood Q of the ite number of solutions, all of which can be represented
point (xo, Ao). Let F(x, A) map Q into a neighbourhood by converging series of integer or rational powers of the
of zero of the space E2 such that F(xo, Ao)=O, and difference A- Ao; or c) the problem has a finite number
suppose that FxCxo, Ao) B is a Fredholm operator. of families of solutions, each one of which depends on
It is required to find, in the sphere II x - Xo " <r, a finite number of small free parameters and, possibly,
where the radius r is sufficiently small, all the solutions on the finite number of solutions given in b).
of equation (*) that are continuous if I A- Ao I <p, For case b) to occur, it is sufficient for Xo to be an
where p is also sufficiently small. In other words, this is isolated solution of the equation F(x, Ao)=O. In case
a problem of local extension of the solution Xo in the b) it is expedient to look for solutions by the method of
parameter A. If the inverse operator B- 1 exists, the indefinite coefficients in the form
00
problem has a unique solution X(A), and X(Ao)=Xo. If, X(A) = xo+ ~Xk(A-Aot/P,
on the other hand, B- 1 does not exist, then the null k=!

space N (B) of B has dimension n;;;;'1. In such a case where Xk are the coefficients to be found, while the
the problem may be reduced to an analogous finite- possible values of p may be preliminarily found using
dimensional problem. Let P denote the projection (d. the branching equation. The substitution of such a
Projector) of E I on N (B) and let / - Q denote the pro- series in (*) yields a recurrent system for finding
jection of E2 onto the domain of values of B, where / x I , X 2, . . .. Problems of the type
is the identity operator. Equation (*) may be written as BXk = H (x I, ... ,Xk -I) are obtained, and each Xk is
the system determined up to n arbitrary constants, which follow

471
BRANCHING OF SOLUTIONS

from the stipulated solvability of the successive equa- and their further development', Russian Math. Surveys 17, no.
tions. All resulting series converge in some neighbour- 2 (1962), 1-60. (Uspekhi Mat. Nauk 17, no. 2 (1962), 13-75)
[5] KRAsNOSEL'sKIi, M.A., ET AL.: Approximate solution of operator
hood of Ao. An estimate from below of the radius of equations, Moscow, 1969 (in Russian).
the neighbourhood may be obtained by the construc- [6] AKHMEDOV, K.T.: 'The analytic method of
tion of majorants [6]. Nekrasov- Nazarov in non-linear analysis', Uspekhi Mat.
Nauk 12, no. 4 (1957), 135-153 (in Russian).
For case c) to occur, Xo must be a non-isolated solu-
[7] YUDOVICH, V.I.: 'Free convection and bifurcation', J. Appl.
tion of the equation F(x, Ao)=O. Here the application Math. Mech. 31 (1967), 103-114. (Priklad. Mat. Mekh. 31
of the method of indefinite coefficients may result in (1967), 101-111)
divergent series (formal solutions). If the problem is [8] LOGINov, B.V. and TRENOGIN, VA: 'On the application of
continuous groups in the theory of branching', Soviet Math.
invariant with respect to a continuous group of linear Dokl. 12, no. 2 (1971), 404-408. (Dokl. Akad Nauk SSSR 197,
operators in E I, the use of group considerations will no. 1 (1971
make it possible in several cases to reduce the number [9] AKHMEDOv, K.T.: 'The Cauchy problem for a class of non-
linear equations in function spaces', Dokl. Akad. Nauk SSSR
of the equ?tions and of the unknowns involved, and 115, no. 1 (1957), 9-12 (in Russian).
thus simplify the problem or even reduce it to case b) [IO] SIDOROV, N.A.: 'Branching of solutions of the Cauchy problem
[7], [8]. for a certain class of nonlinear integra-differential equations',
Differential equations N. Y. 3, no. 9 (1967), 830-834.
Equation (*) may also have solutions that are defined
(Differentsial'nye Uravneniya 3, no. 9 (1967
for A=Ao only. Such solutions are possible only if Xo is [11] 1'ER-K1uKoROV, A.M. and TRENOGIN, V.A.: 'Long wave solu-
a non-isolated solution of the equation F(x, Ao)=O; tions for quasi-linear elliptic equations in an unbounded strip',
they are found using the branching equation for A=Ao. Differential Equations N. Y. 3, no. 3 (1967), 496-508.
(Differentsial'nye Uravneniya 3, no. 3 (1967 VA 'T'
The determination of all its multi-parameter families of . . 1.renogm
solutions yields all the solutions of equation (*) with Editorial comments. For a classification of types of
A=Ao. bifurcation (branching) pOints see Singularities of differen-
If the spaces E I and E 2 are real, the branching tiable mappings. The technique of reducing solving equa-
equation is studied in the complex domain, after which tion (*) to a finite-dimensional problem, the bifurcation
real solutions are taken. Some of them may prove to be equation, is often called the Lyapunov-Schmidt method.
defined in half-neighbourhoods of Ao. References
This procedure is also partly applicable if F(x, A) is [A1] CHOW, S.-N. and HALE, J.K.: Methods of bifurcation theory,
a sufficiently smooth operator, B is a Noetherian opera- Springer, 1982.

tor and the parameter A is an element of another AMS 1980 Subject Classification: 47H15, 58E07,
Banach space E (branching points in E may be 58F14
replaced by lines and surfaces). It is also used in the
study of certain related problems: the problem of find- BRANCHING POINT (OF A MINIMAL SURFACE)
ing large solutions (equation (*) can have solutions - A singular point of a minimal surface in which the
X(A)~OO as A~Ao), the problem of branching of eigen first fundamental form of the surface vanishes; this
values and eigen elements of linear operators, etc. [3]. means, in fact, that such a branching point can exist on
The special case when a generalized minimal surface only. This singular point
owes its name to the fact that in a neighbourhood of it
E 1 = E 2 , F(x,;\) - x -~(x, ;\), ~(O,;\) - 0,
the structure of the generalized minimal surface resem-
was also studied by topological and variational bles that of the Riemann surface of the function w =zn,
methods and methods involving the use of cones in a n ~2, over the point z =0, i.e. there the generalized
Banach space. The concept of a bifurcation point is minimal surface has a many-sheeted orthogonal projec-
very important in this circle of problems. Problems tion onto some plane domain, in which the projection
regarding branching of solutions which do not belong of the branching point itself is an interior point with a
to this scheme are also encountered. They include, for unique pre-image. In a neighbourhood of a branching
example, degenerate differential equations [9], [10] and point (u =0, v =0) the coordinates (x,y, z) of the
problems involving long and solitary waves [11]. minimal surface can be represented in the form
References x+ry = awm+O(1 w Im+l),
[I] LYAPUNOV, A.M.: 'On equilibrium figures, etc.', in Complete
works, Vol. 4, Moscow, 1959 (in Russian). z = Re(bwm+n)+O(1 w Im+n+l), w = u+iv,
[2] ScHMIDT, E.: 'Zur Theorie der linearen und nichtlinearen
Integralgleichungen III', Math. Ann. 65 (1908),370-399. where a = c(l + i)==O and b==O are two complex con-
[3] VAlNBERG, M.M. and TRENOGIN, VA: Theory of branching of stants, 1m c = 0; m ~ 2, and n ~ 1 are integers named,
solutions of non-linear equations, Noordhoff, 1974 (translated respectively, the order and the index of the singular
from the Russian).
[4] VAlNBERG, M.M. and TRENOGIN, VA: 'The methods of point and u and v are intrinsic isothermal coordinates.
Lyapunov and Schmidt in the theory of non-linear equations The following theorem was deduced on the basis of

472
BRANCHING PROCESS

this representation: If the numbers m + n and m are of transition from state i to state j during a time inter-
coprime, then the minimal surface has (m -I)(m +n) val of length t. The principal analytical tools of branch-
different lines of self-intersection issuing from the ing processes are the generating functions (cf. Generat-
singular point in different directions. There is a relation ing function)
between the genus of a complete minimal surface, the 00

number of its branching points, and the index of its F(t;s) = ~P{p.(t)=nlp.(O)=1}sn. (2)
n=O
Gaussian mapping [1].
The equality
One distinguishes between two kinds of branching
points: false branching points and true (non-false) F(t +".; s) = F(t; F(".; s (3)
branching points. False branching points are singulari- follows from the branching condition. In branching
ties of the mapping which defines the surface that can processes with discrete time, t assumes non-negative
be got rid of by re-parametrization (e.g. if r=r(w) is a integral values, and it follows from (3) that F(t; s) is a
regular minimal surface, then the generalized minimal t-fold iteration of the function F(s)=F(I;s). Such a
surface r=r(w2) has a false branching point at w =0). process is sometimes called a Galton - Watson process.
A true branching point represents a real singularity of In a continuous-time branching process it is assumed
the surface itself, and has the following important pro- that t E [0, (0) and that the right-side derivative
perty: In a neighbourhood of a true branching point
the surface can be altered so that the new surface which aF(t; s)
at
I = /(s)
coincides with the original one outside the deformed 1=0

neighbourhood will have a smaller area than the origi- exists. It follows from (3) that F(t; s) satisfies the dif-
nal surface (this holds for surfaces in R3 but is not true ferential equation
in more-general settings: e.g. for the area-minimizing
surfaces in R4 ). The theory of generalized minimal sur- aF~/S) =/(F(t;s (4)
faces with a branching point served as a base of the
general theory of branched immersions, developed for a and the initial condition F(O;s)=s.
broad class of two-dimensional surfaces in R", n ~ 3, If A =F' (I) and a =/(1) are finite, the mathematical
[2]. expectation EJL(t) of the number of particles JL(t) (under
the condition JL(O) = I) is A I for a discrete-time branch-
References
[1) CHEN, Y.W.: 'Branch points, poles and planar points of ing process and eat for a continuous-time branching
minimal surfaces in R3', Ann. of Matk 49, no. 4 (1948), 790- process. Depending on the values of the parameters A
806. or a, branching processes are subdivided into subcritical
[2) NITZSCHE, J.C.C.: Vorlesungen iiber Minimalj1iichen, Springer,
1973.
(A < 1, a<O), critical (A = I, a =0) and supercritical
I.Kh. Sabitov (A> I, a>O). The main characteristic governed by this
AMS 1980 Subject Classification: 53A 10 classification is the behaviour of EJL(t) as t~oo.
In what follows, the trivial cases F(s)=s and
BRANCIUNG PROCESS - A stochastic process f(s) 0, when
describing a wide circle of phenomena connected with
the reproduction and transformation of given objects P{p.(t)=llp.(O)=l} - 1,
(e.g. of particles in physics, of molecules in chemistry, will not be discussed.
of some particular population in biology, etc.). The The probability of extinction is one in subcritical and
class of branching processes is singled-out by the fun- critical branching processes and is less than one in
damental assumption that the reproductions of indivi- supercritical processes. If EJL(t}ln JL(t)< 00, then, in the
dual particles are mutually independent. subcritica1 range, the probability P{JL(tO} of survival
A time-homogeneous branching process JL(t) with one of the process behaves asymptotically, as t~oo, as
type of particles is defined as a Markov process with a K EJL(t), where K is a positive constant. In the case of
countable number of states 0, I, ... , the transition critical branching processes with finite Ep.2(t), the
probabilities Pit) of which satisfy the additional asymptotic behaviour as t~oo is
branching condition:
2
Pit) = ~ PIj,(t) .. P1},(t). (1) P{p.(tO} ~ Op.(t) ,
}'+"'+},=}
where DJL(t)=B t in discrete-time branching process
The states 0, I, ... , of a branching process are inter- and DJL(t)=bt in continuous-time branching process
preted as numbers of particles. The probability PiJ(t) is and where B = F" (I), b =f" (1). A more detailed study
equal to the probability of the asymptotic behaviour of the distribution of JL(t)
P{p.(t+to)=j Ip.(to)=i} as t~oo shows that the conditional distribution law

473
BRANCHING PROCESS

S/(x) = p{ E{JL(t)11to} ~x lJL(tO} (5)


into n particles, and each such particle begins to exe-
cute an independent motion along Brownian trajec-
tories starting from its point of genesis. Let /Lx,t(A) be
converges weakly as t~oo towards a limit distribution
the number of particles in a set A at the moment of
Sex) if certain moments of p.(t) are finite. In subcritical
time t, that number having been produced by one parti-
branching processes the limit law S (x) is discrete, while
cle located at the point x E G at the initial moment of
in other cases it is absolutely continuous. Of special
time 0. The generating function
interest is the case of critical branching processes for
which the limit law S (x) is exponential: R(t, x, s( = Eexp [[InS(Y)J.!X.1 (dY )]
Sex) = l-e- x , x;;;'O. (6)
The distribution (6) is also a limit distribution for satisfies the quasi-linear parabolic equation
near-critical branching processes. More exactly, if one t:.H + feR) =0 (8)
considers the class of generating functions F (s) or f (s) with the initial condition H(O, x, s(=s(x) and the
with a bounded third derivative F'" (1), j''' (1) and
boundary condition H (t, x, s( Ix->aG =0, where
F"(lr~Bo>O,j"(1r~bo>O, then
axt
f1 = ~; = 1 a2 / is the Laplace operator, and
lim S/(x) = l-e- x , x;;;.O,
/-->00 f(s)=~"Pnsn'Pl = - ~n*lPn.
A-->I
It is assumed in general branching processes that the
where St(x) is defined by formula (5). The phenomena propagating particles can be characterized by certain
occurring in near-critical processes at t~oo are said to parameters, which may be interpreted as age, location
be transient. of the particle in space, type, size or energy of the par-
Another model of branching processes is the ticle, etc. Such processes are studied with the aid of
Bellman - Harris process, in which each particle has a generating functions or functionals, for which non-
random lifetime with distribution function G(t). At the linear differential and integral equations, generalizing
end of its lifetime, the particle leaves behind n daughter equations (4), (7) and (8), are deduced. One may give
particles with probability qn; the following general descriptions of such models of
00
branching processes. Let particles move, independently
~qn = 1.
n=O of each other, in accordance with Markov's law in some
The lifetimes and the number of daughter particles pro- phase space X. It is assumed that the random lifetime
duced by individual particles are independent. Consider of a particle is a Markov time (d. Markov moment)
a particle of age zero at the initial moment of time which depends on its trajectory. At the end of its life-
t =0; then the generating function F(t;s) of the time the particle generates new particles, which become
number of particles p.(t) at the moment of time t, given distributed over the phase space X in accordance with
by formula (2), satifies the non-linear integral equation some probability law. The new particles evolve in a
similar manner, independently of each other. In the
F(t;s) = jh(F(t-u;sdG(u)+s(l-G(t, (7) space of integer-valued measures determined by the
o number of particles present in subsets of X, the branch-
where 00 ing process is Markovian. However, branching
h(s) = ~q.sn. processes are often studied in simpler reduced spaces as
n=O
well. In such a case many of them become non-
If G(t) is a degenerate distribution function, the Markovian.
Bellman - Harris process is a discrete-time branching In most of the models discussed above the subdivi-
process; if G(t) is an exponential distribution function, sion of processes into subcritical, critical and supercriti-
one obtains a continuous-time branching process. In cal processes retains its meaning. Many properties of
the general case, the Bellman - Harris process is a non- simple branching processes described by equation (4)
Markov branching process. are also displayed by more complex systems. In partic-
A branching process may also be complicated by the ular, the limiting distribution of (5), for critical
dependence of the particles on their location in space. processes, usually turns out to be an exponential distri-
For instance, let each particle execute an independent
bution (6) (in an appropriate norm).
Brownian motion in an r-dimensional domain G with
Branching processes are used in calculations of vari-
an absorbing boundary aGo A particle located inside ous real processes - biological, genetic, physical,
the domain G will be transformed within time f1t~O chemical or technological. In real processes the condi-
with probability tion of independent motion of each particle is often not
Pn!:.t +o(!:.t), n#l, met, and the generated daughter particles may interact

474
RRA'.;CHI'\(i PROCESS. A(i F-I)"P"'.; I)"'.; I

with each other. This is true of many biological repro- At the end of its life the particle is transformed into k
duction processes, in processes of propagation of epi- daughter particles of age zero with a probability Pk(u)
demics (cf. Epidemic process), in bimolecular chemical if the transformation took place when the age attained
reactions, etc. However, the initial stages of such by the original particle was u. Let JL(t) be the number
processes may be calculated with the aid of suitably of particles at the moment of time t. The generating
chosen models of branching processes. This is done in function F(t;x) of the probability distribution of JL(t)
situations in which the number of active particles in the for a process beginning with one particle of age zero
space is relatively small; at such low concentrations satisfies the equation
their mutual collisions are practically non-existent, and
the state of the system evolves by the collisions between F(t;x) = jh(u, F(t-u;xdG(u)+x(l-G(t, (*)
o
active particles and the particles of the medium. In epi-
demic processes, for example, sick individuals may be where 00

considered as such 'active particles'. Phenomena related


h(u, x) = ~Pk(U)Xk.
k =0
to genetic mutations, for example, may be calculated Put
with the aid of branching processes. The branching pro-
cess with a finite number of particle types may serve as
a model in computing chain reactions; the branching 00 00

process with diffusion may serve as a model of neutron A = ja(u)dG(u), B = jb(u)dG(u).


processes in nuclear reactors. Phenomena occurring in o 0

showers of cosmic particles may also be studied with An age-dependent branching process is said to be sub-
the aid of branching processes. In telephony, the com- critical, critical or supercritical if A > 1, A = 1 and B >0,
putation of certain expectation systems may be reduced or A> 1, respectively. The behaviour of the process as
to branching process models. t ~ 00substantially depends on its criticality. Subcritical
See also Branching process with a random medium; and critical processes die out with probability one, i.e.
Branching process with immigration; Branching lim P{JJ.(t)=O} = 1.
processes, regularity of. t~oo

The following results have been obtained for these


References
[I] SEWASTJANOW, B.A. [B.A. SEVAST'YANOV]: Verzweigungspro- processes [1]: asymptotic formulas for the moments
zesse, Akad. Wissenschaft. DDR, 1974. JL(t), necessary and sufficient conditions of extinction,
[2] ATHREYA, K.B. and NEY, P.: Branchingprocesses, Springer, conditions of existence and uniqueness of a solution of
1972.
B.A. Sevast'yanov equation (*) and asymptotic formulas as t ~ 00 for
Editorial comments. Applications of branching process Q(t) = P{JJ.(tO}.
in biology may be found in [A2]. The survey article [A3]
gives an up-to-date account of stable population theory and The limit distributions have also been determined. In
balanced exponential growth, items that are also relevant in the critical case, as t ~ 00 :
biological applications. 00

j ua(u)dG(u)
References
[A1] ASMUSSEN, S. and HERING, H.: Branching processes, Q(t):::::: 2-"0_ _ __
Bt
Birkhauser, 1983.
[A2] JAGERS, P.: Branching processes with biological applications,
Wiley, 1975.
[A3] JAGERS, P. and NERMAN, 0.: 'The growth and composition of
p{ E(JJ.(t)12tO) <xlJJ.(tO}~ l-e- x , x>O.

branching populations', Adv. Appl. Probab. 16 (1984),221-


259. If h (u, x) is independent of u, the age-dependent
branching process is a ReUman - Harris process. The
AMS 1980 Subject Classification: 60GXX, 60J80, model just described has been generalized to include
60J85
processes with several types of particles, and also to
processes for which a particle may generate new parti-
BRANClDNG PROCESS, AGE-DEPENDENT - A
cles several times during its lifetime [2], [3].
model of a branching process in which the lifetime of a
particle is an arbitrary non-negative random variable, References
[I] SEWASTJANOW, B.A. [B.A. SEVAST'YANOV]: Verzweigungspro-
while the number of daughter particles depends on its zesse, Akad. Wissenschaft. DDR, 1974.
age at the moment of transformation. In the single-type [2] SEVAST'YANOV, B.A.: 'Age-dependent branching processes',
particle model each particle has a random duration of Theory Probab. Appl. 9, no. 4 (1964), 521-537. (Teor. Veroyat-
nosl. i Primenen. 9, no. 4 (1964), 577-594)
life T with distribution function [3] MODE, c.J.: Mullitype branching processes, Elsevier, 1971.
P{T";;;;t} = G(t). VP. Chistyakov

475
BRANCHING PROCESS, AGE-DEPENDENT

Editorial comments. Additional references can be found IILk (0) = 1; P.;(O)=O, i*k},
in the article Branching process.
and the theorems on limit distributions of the number
AMS 1980 Subject Classification: 60J80, 58F14 of particles [2], are analogous to the respective results
for processes involving single-type particles (cf. Branch-
BRANClDNG PROCFSS WITH A FINITE NUMBER
ing process). Asymptotic properties in the near-critical
OF PARTICLE TYPES - A model of a branching pro- case (t~oo, A~l) have been studied [3]. Processes
cess which is a special case of a Markov process with a with a decomposable matrix of mathematical expecta-
countable set of states. The state of the branching vec-
tions have also been investigated [4].
tor is described by the random process
References
IJ-{/) = (PI (I), ... ,p,,(t)), [1] KOLMOGOROV, A.N. and DMlTRIEv, N.A.: 'Branching stochas-
tic processes', Dokl. Akad. Nauk SSSR 56, no. 1 (1947), 5-8
the k-th component, J.Lk(t), of which shows that at time (in Russian).
t there are J.Lk(t) particles of type Tk . The principal pro- [2] SEWASTJANOW, B.A. [B.A. SEVAST'YANOV]: Verzweigungspro-
perty by which branching processes differ from Markov zesse, Akad. Wissenschaft. DDR, 1974.
[3] CuISTYAKOV, V.P.: 'On transition phenomena in branching
processes is that the particles existing at the moment t 1
stochastic processes with several types of particles', Theory Pro-
produce daughter particles at any subsequent moment bab. Appl. 17, no. 4 (1972), 631-639. (Teor. Veroyatnost. i
t 1 + t, t >0, in a mutually independent manner. The Primenen. 17, no. 4 (1972), 669-678)
generating functions [4] OGURA, Y.: 'Asymptotic behaviour of multitype
Galton -- Watson processes', J. Math. Kyoto Univ. 15 (1975),
Fk(/, Xl> . . ,xn) = 251-302.
v.P. Chistyakov
= E(x),(t) , ... ,x:;-(t) IILk (0) = I; IL;(O)=O, i*k) Editorial comments. Additional references can be found
satisfy the system of equations in the article Branching process.
Fk(/+T,XI>'" ,xn) = (*) AMS 1980 Subject Classification: 60J80, 60J85,
58F14
= Fk(/, FI(T, Xl> . ,xn), ... ,Fn(T, Xl> ,xn))
with initial conditions BRANClDNG PROCESS WITH A RANDOM
MEDIUM, branching process in a random medium - A
Fk(O,XI>'" ,xn) = Xb k=I, ... ,n.
time-inhomogeneous branching process in which the
The equations (*) are satisfied by discrete-time and inhomogeneity is random. Let g= {go, gl, ... } be a sta-
continuous-time processes. tionary sequence of random variables (the value of gl is
In the case of discrete time, the matrix of mathemati- interpreted as the state of the 'medium' at the moment
cal expectations of time t), and let to each possible state of the medium
A(/) = II Aij(t) II, g correspond a probability distribution (Pk(g)} of the
number of descendants of a single particle:
aF I
Aij(t) = ax; x,~ . . ~x.~1 Pk(~) ;;. 0, ~Pk(~) = I,
k=O
is the t-th power of the matrix A =A (1): A (t)=A I. If
A is indecomposable and aperiodic, it has a simple ~(s) = ~Pk(~)sk.
k=O
positive eigen value A which is larger than the moduli
of the other eigen values. In this case, as t -HX;, In order to construct a trajectory {J.L(O), J.L(1), ... } of a
branching process in a random medium the value of
J.L(O) = m and the trajectory g of the states of the
A;/t) = U;V}I/ +0(;\'),
where (u 1, . . . ,Un), (V 1, . . . ,Vn) are the right and left medium are fixed, and J.L(t + I) is determined for each
eigen vectors of A which correspond to A. Branching t =0, 1, ... , as a sum of J.L(t) independent random
processes with an indecomposable matrix A are said to variables with distribution (Pk(gl)}' Such a complica-
be subcritical if A< I, supercritical if A< 1 and critical if tion of the branching Galton - Watson process is
A= 1 and if at least one of the functions natural if, for example, the branching process in a ran-
F k (1, XJ, . . . ,xn ) is non-linear. The concept of critical- dom medium is regarded as a model of a biological
ity is defined in a similar manner for continuous-time population.
processes. The properties of branching processes in a random
The asymptotic properties of a branching process sig- medium are analogous to those of ordinary branching
nificantly depend on its criticality. Subcritical and criti- processes. For instance, the generating distribution
cal processes die out with probability one. The asymp- function of J.L(t), under the condition J.L(O)= 1, has the
totic formulas (as t~oo) for the probabilities, form
Qk(t) = P{ILI(t)+ ... +ILn(tO

476
BRANCHING PROCESS WITH IMMIGRATION

(for a branching Galton - Watson process, i.e. for independent Brownian motions. Each particle in G is
P{~t=O} = 1, the right-hand side of (*) equals the t-fold independently converted, within a time At, into n parti-
iteration of Fo(s)). Branching processes in a random cles with a probability of PnAt + 0 (At), n:#= 1, At ~O. Let
medium may be subcritical, critical and supercritical: the daughter particles begin their independent evolution
the 'criticality parameter' here (see [1]) is the variable from the point of their genesis. Let
00
p = EQ,ln~kPk(~) = EQ,lnF~(l)
k =0 n -:::0

(for ordinary branching processes, the 'criticality be the generating function of {Pn}, let PI = - ~n7'=IPn'
parameter' is the mathematical expectation of the and let /LxAA) be the number of particles in a set A C G
number of 'descendants' of a single particle). If p<O, at the moment t if there initially was one particle at the
the branching process in a random medium is said to point x EO G. The generating functional
be subcritical and, for the random variable
q(f) = limP{M(t)=OIM(O)=I, I}
H(t;x, s( = Eexp [[lnS(Y)/Lx.t(q)]
t~oo

which is the probability of extinction of the branchin~ satisfies the quasi-linear parabolic equation
process in a random medium for a given trajectory ~,
the relation ;=1
aax,I[ +
2
f(H) = aH
at
P{q(~=1} = 1
with the initial condition
is valid. There is also the analogue of the limit theorem
of the subcritical Galton - Watson brancIEng process: H(O, x, sO) = sex)
For almost all realizations of the sequence ~ the limits and the boundary condition
lim P{M(t)=klM(O)=I, M(tO, I} = pZ(I) H(t, x, sO) Ix~ac = O.
1->00

Denote by 0<'\1 <'\2 ~'\3 ~ . " the eigen values, and


exist and satisfy
let 1>1 (x 0 be the eigen function of the problem
00 _

~p~(~) = 1.
k=1
~
r f i2 +A</> _
- 0,
_
!f>(x) IHac - 0,
If p = 0, the branching process in a random medium is ;=1 ax;
said to be critical, and corresponding to ,\ I. As t ~ 00 the asymptotic relation
P{ q(~)= 1} = 1 E/Lx,(G) ~ K e(a-AIlI</>I(X)
and, for almost all realizations of ~, holds. For this reason the problem is said to be subcriti-
lim P{M(t)=k I M(O)= I, M(tO, ~} = O. cal if a <'\1, critical if a ='\1 and supercritical if a >'\1.
I~oo
If a ~'\I' a branching process with diffusion dies out
If p>O, the branching process in a random medium is with probability one, while if a>'\\o both the probabil-
said to be supercritical; in such a case ity of dying out and the probability of the event
P{ q(~)< 1} = I f1xAG)~oo as t~oo will in general be positive.

and, if certain additional conditions are met, there Depending on their criticality, branching processes with
exists for almost all ~ a non-negative random variable diffusion obey limit theorems analogous to those valid
w, for branching processes without diffusion.
W = lim M(t) EW = 1. References
hoo F~ (1) ... F~_l (I)'
[1] SEWASTJANOW, B.A. [B.A. SEVAST'YANOV]: Verzweigungspro.
zesse, Akad. Wissenschaft. DDR, 1974. BAS '
References . . evastyanov
[1] A THREVA, K.B. and NEV, P.: Branching processes, Springer,
1972.
Editorial comments. Additional references can be found
A.M. Zubkov in the article Branching process.
Editorial comments. Additional references can be found AMS 1980 Subject Classification: 60J60, 60J80,
in the article Branching process. 58F14
AMS 1980 Subject Classification: 60J80
BRANClDNG PROCESS WITH IMMIGRATION - A
BRANClDNG PROCESS WITH DIFFUSION - A model of a branching process (discrete-time or
model of a branching process in which the reproducing continuous-time, with one or several types of particles,
particles diffuse in some domain G. Let the domain G etc.) in which new particles may appear not only as the
be r-dimensional, with an absorbing boundary 8G, and products of division of the original particles, but also as
let the particles in the domain itself execute mutally a result of immigration from some 'external source'.

477
BRANCHING PROCESS WITH IMMIGRATION

For instance, let


Xt,i,Yt , 1=0,1, ... ; i=I,2, ... ,
lim
t~oo
p{ In(l In +p.(/,.;:
1
~x
} = x, 0,.;:,.;:1
"",x~.

be independent random variables with generating func-


References
tions 00 [l] ZUBKOV, A.M.: 'Life-like periods of a branching process with
F(s) = ~P{Xt,i=k}Sk, immigration', Theory Probab. Appl. 17, no. 1 (1972), 174-183.
k=Q (Teor. Veroyatnost. i Primenen. 17, no. 1 (1972), 179-188)
00 [2] PAKES, A.G.: 'Further results on the critical Galton-Watson
G(s) = ~P{Yt=k}Sk, process with immigration', J. Austral. Math. Soc. 13, no. 3
k=O
(1972),277-290.
respectively; the branching Galton-Watson process [3] FOSTER, J.H. and WlLUAMSON, J.A.: 'Limit theorems for the
Galton-Watson process with time-dependent immigration', Z.
with immigration may be defined by the relations
Wahrsch. Verw. Geb. 20 (1971),227-235.
J4(0)=0, where J4(t) is the number of particles and [4] SENETA, E.: 'An explicit limit theorem for the critical
Galton - Watson process with immigration', J. Roy. Statist.
p.(1 + 1) = Xt, 1 + ... + xt,fi(t) + Yt , 1 =0, 1, ....
Soc. Ser. B 32, no. 1 (1970), 149-152.
Here, the variable Xt, i is interpreted as the number of [5] SENETA, E.: 'On the supercritical Galton - Watson process with
daughter particles of the i-th particle of the t-th genera- immigration', Math. Biosci. 7 (1970), 9-14.
[6] FOSTER, J.H.: 'A limit theorem for a branching process with
tion, while the variable Y t is interpreted as the number state-dependent immigration', Ann. of Math. Statist. 42, no. 5
of the particles which have immigrated into the t-th (1971), 1773-1776.
A.M. Zubkov
generation. The generating functions
Editorial comments. Additional references may be found
Ht(s) = E{s/l!t) I p.(0)=0} in the article Branching process.
are given by the recurrence relations AMS 1980 Subject Classification: 60J80, 60J85,
Ho(s) = 1, Ht+l(S) = G(s)Ht(F(s. 60GXX
The Markov chain J4(t) corresponding to the
BRANCIUNG PROCESSES, REGULARITY OF - A
Galton - Watson branching process with immigration is
property of branching processes ensuring that the
recurrent if EXt,; < 1 and Eln(l + Yt ) < 00 or EXt,; = 1
number of particles at any moment of time is finite.
and B=DXt,;>2C=2EYt , and is transient if EXt,; = 1
The problem of the regularity of a branching process is
and B<2C or EXt,;> 1. For the Markov chain J4(t) to
be ergodic, i.e. for the limits usually reduced to the problem of uniqueness of the
solution of some differential or integral equation. For
lim P{p.(/)=k}
t~oo
= Pk instance, in a continuous-time branching process the
to exist and to satisfy differential equation
00 aF(/; s) = f(F(t; s
~Pk = 1, at
k=O
with the initial condition F(O; s)=s has a unique solu-
it is necessary and sufficient [3] that tion if and only if, for any >0, the integral
1
j I-G(s) ds < 00.
j
1
dx
o F(s)-s
l-J(x)
This condition is met, in particular, if EXt,; < 1 and
is divergent. In the branching Bellman - Harris process
Eln(l + Y t )< 00. If EXt,; = 1, B>O, C< 00, then [4]
the generating function F(t; s) of the number of parti-
lim p{1:l!!.!.l,,;;;x}
t~oo Bt
= I
r(2CB -\)
jXy2CB-I-le-Y dy, x;;;;'O.
0
cles is the solution of the non-linear integral equation

If A = EXt,; > 1 and Eln( 1+ Yt )< 00, then there exists F(t;s) = jh(F(t-u;sdG(u)+s(I-G(I, (*)
[5] a sequence of numbers Ct~O, Ct / Ct + I ~A, such that
where G(t) is the distribution function of the lifetimes
P {lim clp.(t) exists and is POSitive} = 1. of particles and h (t) is the generating function of the
I~CC
number of daughter particles ('direct descendants') of a
single particle. If, for given to, CI, C2>0 and an integer

In branching processes with immigration in which the
immigration takes place at p.(t) = only, i.e. n ~ 1, the inequalities
p.(t + 1) = X I. l + ... + XI./l!I) +80 ,/l!t) YI, t =0,1, ... , clt n ,,;;; G(t) ~ C21"
where 0ij is the Kronecker symbol, the following rela- are valid for all O,,;;;t,,;;;t o, the solution of equation (*) is
tion is valid if EXt,i=l, l<EX?;<oo and O<EYt<oo: unique if and only if the equation

478
BRAUER GROUP

(5) CLIFFORD. A.H. and PRESTON. G.B.: The algehraic theory oj


!.!/!..
n
= h(CP)-1 semigroups, 1-2, Amer. Math. Soc., 1961-1967.
dt L.N. Shevrin
with initial conditions
AMS 1980 Subject Classification: 20MXX
cf>(0) = I, cp(r)(o) = 0, r = I, ... , n -I,
BRAUER GROUP of a field k - The group of classes
has a unique solution
of finite-dimensional central simple algebras (cf. Cen-
o ,;;; cf>(t) ,,;;;;; 1.
tral simple algebra) over k with respect to the
For a branching process described by equation (*) to equivalence defined as follows. Two central simple k-
be regular, it is necessary and sufficient for the integral algebras A and B of finite rank are equivalent if there

f'o xl-l/n(l-h(l-x))l/n
exist positive integers m and n such that the tensor pro-
dx
ducts A kMm(k) and BkMn(k) are isomorphic k-
algebras (here Mr(k) is the algebra of square matrices
to diverge for any >0. of order rover k). The tensor multiplication of alge-
References bras induces an Abelian group structure on the set of
[1) SEWASTJANOW, B.A. [B.A. SEVAST'YANOV): Verzweigungspro- equivalence classes of finite-dimensional central simple
zesse, Mad. Wissenschaft. DDR, 1974. ,
B.A. Sevast yanov algebras. This group is also known as the Brauer
Editorial comments. Additional references can be found group of k and is denoted by Br k. The zero element of
in the article Branching process. this group is the class of full matrix algebras. while the
AMS 1980 Subject Classification: 60J60, 60GXX element inverse to the class of an algebra A is the class
of its opposite algebra. Each non-zero class contains.
BRANDT SEMI-GROUP - A semi-group with a zero up to isomorphism, exactly one division algebra over k
element in which to each non-zero element a there (i.e. a skew-field over k).
correspond uniquely determined elements e, f, a' ES Brauer groups were defined and studied in several
such that ea = af = a and a' a =f, and for any two non- publications by R. Brauer, E. Noether, A. Albert, H.
zero idempotents gl. g2 ES one has gl Sg2=1=O. The ele- Hasse and others. starting in the nineteen-twenties (see.
ments e and f in the definition will in fact be idempo- for example. [6]). The most complete results, including
tent and fa'=a'e=a' and aa'=e. Moreover, in a the computation of the Brauer group, were obtained for
Brandt semi-group each one of the conditions numher fields in connection with the development of
ac = bc=l=0, ca = cb=l=O, implies a = b, while the condi- class-field theory. The general form of the reciprocity
tions ab=l=O and bc=l=O imply abc =1=0. law is formulated in terms of Brauer groups.
The partial groupoid obtained by excluding the zero The Brauer group is zero for any separably-closed
element from a Brandt semi-group is said to be a field and any finite field. For the field of real numbers
Brandt groupoid. The concept of a Brandt semi-group the Brauer group is a cyclic group of order two and its
was introduced by H. Brandt in [I], and the concept of non-zero element is the class of the quaternion algebra.
a Brandt groupoid was introduced in the same publica- If k is the field of p-adic numbers or any locally com-
tion by implication. The concept of a Brandt groupoid pact field that is complete with respect to a discrete
is an abstraction of the system of normal ideals of valuation, then its Brauer group is isomorphic to Q / Z,
semi-simple linear algebras with respect to the so-called where Q is the additive group of rational numbers and
proper multiplication (d. [2], [3], Chapt. 6). The signi- Z is the additive group of integers. This fact is of
ficance of a Brandt semi-group in the theory of semi- importance in local class-field theory.
groups lies in the fact that Brandt semi-groups are just Let k be an algebraic number field of finite degree or
completely O-simple inversion semi-groups (d. Inversion a field of algebraic functions in one variable with a fin-
semi-group; Completely-simple semi-group). A semi- ite field of constants. Then there exists an exact
group will be a Brandt semi-group if and only if it is sequence of groups
isomorphic to a Rees semi-group of matrix type with a IOV 2:
unit sandwich matrix over a group with an added zero 0 ..... Brk ..... :LBrk, ..... Q/Z ..... 0,
element.
where v runs through all possible norms of the field k,
References
[1) BRANDT, H.: 'Ueber eine Verallgemeinerung des Gruppen- kv are the respective completions of k, and the
begriffes', Math. Ann. 96 (1927), 360-366. homomorphism inv is induced by the natural imbed-
[2) DEURING. M.: Algebren, Springer, 1935. dings k--'>kv' The image of an element from Brk in
(3) JACOBSON, N.: The theory oj rings, Amer. Math. Soc .. 1943.
(4) SUSHKEVICH, A.K.: The theory oJ generalized groups, Kharkov-
Br k v is called a local invariant, the homomorphism ~
Kiev, 1937 (in Russian). is the summation of local invariants. This fact is esta-

479
BRAUER GROUP

blished in global class-field theory. (1957),45-51 (in Russian), English summary.


If k is a field of algebraic functions in one variable [5] CHEBOTAREV, N.G.: Introduction to the theory of algebra... ,
Moscow-Leningrad, 1949 (in Russian).
over an algebraically closed field of constants, then its
[6] OEURING, M.: Algebren, Springer, 1935.
Brauer group is zero (Tsen's theorem). The case of an [7] GROTHENDIECK, A.: 'Le groupe de Brauer I, II, III', in Dix
arbitrary field of constants is treated in [4] and in [7]. exposes sur la cohomologie des schemas, North-Holland, 1968,
The Brauer group depends functorially on k, i.e. if K pp.46-188.
[8] MILNE, l.S.: Etale cohomology, Princeton Univ. Press, 1980.
is an extension of the field k, a homomorphism
V.A. Iskovskikh
Br k ~ Br K is defined. Its kernel, denoted by Br(K / k),
consists of classes of algebras splitting over K. Editorial comments, For the construction of cross pro-
The construction of cross products with the aid of ducts with the aid of factor systems see also Cross pro-
factor systems [5] results in a cohomological interpreta- duct; Extension of a group. The latter article contains the
tion of Brauer groups. For any normal extension K / k notion of a system of factors.
A recent result in the theory of Brauer groups is a
there exists an isomorphism
theorem of Merkuryev and Suslin [A1], which in its simplest
Br(K / k) ~ H 2 (K, K'), form asserts that Br k is generated by the classes of alge-
where H 2(K, K') is the second Galois cohomology bras that split over a cyclic extension of k, provided that k is
group with coefficients in the multiplicative group K' a field of characteristic zero containing all roots of unity.
The proof is based on the close relationship between the
of K. Moreover, the group Br k is isomorphic to
theory of Brauer groups and algebraic K-theory. An alge-
H2(f, t), where k is the separable closure of k. A
bra A over a commutative ring is an Azumaya algebra if it is
central simple algebra is assigned its class in the Brauer finitely generated and central over k and separable.
group by the coboundary operator
References
0: HI(K, PGL(n, K ~ H 2 (K, K') [A1] SUSLIN, A.: 'Plenary adress', in Proc. Internat. Congress of
Mathematicians Berkely, 1986, Forthcoming.
in the cohomology sequence corresponding to the exact
group sequence AMS 1980 Subject Classification: 13A20, 16A 16
1 ~ K' ~ GL(n, K) ~ PGL(n, K) ~ 1,
BRAUER - SEVERI VARIETY An algebraic
where GL(n, K) and PGL(n, K) are the linear and the
variety over a field k that, if considered over the alge-
projective matrix groups of order n X n. Here the set
braic closure k of k, becomes isomorphic to a projective
H1(K, PGL(n, K)) is interpreted as the set of k-
space,
isomorphism classes of central simple algebras of rank
The arithmetic properties of such varieties were stu-
n 2 over the field k which split over k, or as the set of
died in 1932 by F. Severi; F. Chiitelet subsequently
classes of k-isomorphic Brauer - Severi varieties of
discovered a connection between Brauer - Severi
dimension n -1, possessing a point that is rational over
varieties and central simple algebras (d, Central simple
K (d. Brauer-Severi variety).
algebra) over k and the Brauer group.
All Brauer groups are periodic groups. The order of
The simplest non-trivial example of a one-
any of its elements is a divisor of the number n, where
dimensional Brauer - Severi variety is the projective
n 2 is the rank of the skew-field representing this ele-
conic section Q:
ment.
The cohomological interpretation of the Brauer X5 +XT +x~ = 0
group makes it possible to consider it as the group of on the real projective plane p~, Over the field of com-
classes of ex~ensions of the Ga~!s group of the separ- plex numbers C this variety is isomorphic to the projec-
able closure k / k by the group k . tive line Pt. The set of all one-dimensional
A generalization of the concept of a Brauer group is Brauer - Severi varieties, considered up to isomorphism,
the Brauer - Grothendieck group, whose definition is is in a one-to-one correspondence with the set of pro-
analogous to that of the Brauer group, except that the jective non-degenerate conic sections (considered up to
central simple algebras are replaced by Azumaya alge- projective equivalence over k). which is in turn in a
bras [7]. one-to-one correspondence with the set of non-
References isomorphic generalized quaternion algebras over k. In
[1] CASSELS. J.W.S. and FROLICH, A. (EDS.): Algebraic number
the above example the conical section Q corresponds to
theon', Acad. Press, 1967.
[2] BOURBAKI. N.: Elements of mathematics. AIKebra: AlgebraiC the algebra of ordinary quaternions.
structures. Linear algebra. 1, Addison-Wesley. 1974, In the more-dimensional case, the set of classes of n-
Chapt.l;2 (translated from the French).
dimensional Brauer - Severi varieties (i.e.
[3] SERRE. J.-P.: Cohomologie Galoisienne, Springer, 1964.
[4] FADDEEV, O.K.: 'On the theory of algehras over fields of alge- Brauer- Severi vanehes distinguished up to k-
hraic functions in one variahle', Vestnik LeninKrad. Univ .. no. 7 isomorphism) may be identified with the Galois coho-

480
BRIOT - BOUQUET EQUATION

mology group HI(k, PGL(n + 1, k where BREAKING POINT, angle point - A singular point of
PGL(n + 1, k)
is the projective group of automorphisms a plane curve with the property that two branches of
of the projective space PI: [3), [4). This cohomology the curve end in it in such a way that each has a (one-
group describes the classes of k-isomorphic central sim- sided) tangent at the point different from the other. For
ple k-algebras of rank (n + 1)2 (i.e. forms of the matrix example, the origin is a breaking point of the curve
algebra Mn + 1(k)). The connection between y=x /(l+e 1 / x ) (see Fig.). The left and right deriva-
Brauer- Severi varieties and central simple algebras is tives are different at a breaking point.
more explicitly described as follows. To a k-algebra A
y
of rank r2 one associates the variety X of its left ideals
of rank r, which is defined as a closed subvariety of the
x
Grassmann manifold of all k-linear subspaces of dimen-
sion r in A. In certain cases the variety X may be
defined by norm equations - e.g. in the case of
quaternion algebras. The connection between
Brauer - Severi varieties and algebras is taken advan- A.B. Ivanov
tage of in the study of both [1), [4). AMS 1980 Subject Classification: 53A04
The most significant properties of Brauer- Severi
varieties are the following. A Brauer - Severi variety is BRIANCHON THEOREM - In any hexagon (Fig.)
k-isomorphic to a projective space PI: if and only if it circumscribed around a curve of the second order (a
has a point in the field k. All Brauer - Severi varieties Brianchon hexagon) the straight lines connecting the
have a point in some finite separable extension K of k opposite corners of the hexagon intersect at one point
[ 1). (Brian chon 's point). This theorem is dual to the Pascal
The Hasse principle applies to Brauer - Severi theorem, and was demonstrated by Ch.l. Brianchon in
varieties defined over an algebraic number field. 1806.
The field of rational functions k (X) on a
Brauer - Severi variety X is the splitting field of the
corresponding algebra A; moreover, an arbitrary exten-
sion K of k is the splitting field for A if and only if X
has a K-point [4).
In the context of the generalization of the concepts
of a central simple algebra and the Brauer group to A.B. Ivanov
include schemes, the Brauer- Severi varieties were gen- AMS 1980 Subject Classification: 51 N1 0
eralized to the concept of Brauer - Severi schemes [2).
Let 1: p--,>x be a morphism of schemes. A scheme P is BRIOT- BOUQUET EQUATION - An ordinary dif-
called a Brauer-Severi scheme if it is locally iso- ferential equation
morphic to a projective space P1 over X in the etale
topology of X. A scheme P over a scheme X is a (I)

Brauer-Severi scheme if and only if 1: P--,>X is a where m is a positive integer and the function 1 is ana-
finitely-presented proper flat morphism and if all of its lytic at x=y=O, h(O,O)=f:O, 1(0,0)=0. It was shown
geometrical fibres are isomorphic to projective spaces by C. Briot and T. Bouquet [1) that any equation of the
[2). type
a(z, w)w' = (3(z, w),
References
[I) CHA":ELET, F.: 'Variations sur un theme de H. Poincare', Ann.

where a(O, 0) = /3(0, 0) = and a and /3 are analytic at
the origin, can be reduced, by means of a special local
Sci. Ecole Norm. Sup. (3) 61 (1944), 249-300.
[2] GROTHENDIECK, A.: 'Le groupe de Brauer', in Dix exposes sur changes of the variables, to a finite number of equa-
la cohomologie des schemas, North-Holland, 1968, pp. 1-21. tions of type (1). Equation (1) always (except for the
[3] SERRE, J.-P.: Cohomologie Galoisienne, Springer. 1964.
case where m = 1 and h(O, 0) is a natural number) has
[4] ROGUEITE, P.: 'On the Galois cohomology of the projective
a unique solution in the form of a formal power series:
=
linear group and its applications to the construction of generic
splitting fields of algebras', Math. Ann. 150 (1963), 411-439.
Y = ~(x) ~IX+~2X2+ " ' , (2)
VA. Iskovskikh
which converges for sufficiently small I x I if m = 1,
Editorial comments. Thus a Brauer-Severi variety of and can diverge for all x=f:0 if m> 1. In (1), let
dimension n is a k / k-form of P~. f fo(x)+ fl(x)y,
AMS 1980 Subject Classification: 14M99, 14GXX then, for the series (2) to converge, it is necessary and

481
BRIOT- BOUQUET EQUATION

sufficient to meet m -I conditions concerning the coef- Boolean algebras are more commonly called Heyting alge-
ficients of the Taylor series of /0 and /1; all the coeffi- bras. Complete Heyting algebras (often called frames or
cients are included in these conditions, so that the locales) have been extensively studied on account of their
connections with topology: the lattice of open sets of any
existence or non-existence of an analytic solution
topological space is a locale, and locales can in some
y = ~(x) of equation (1) cannot be proved by any par-
respects be considered as generalized topological spaces.
tial sum of the Taylor series of / (cf. [2], [3]). For the
See [A1], [A2], [A3].
case of a general function / there are
(m -I)+(m -I) X 00 such conditions, [4]. Accordingly, References
[A1] FOURMAN, M.P. and SCOTT, D.S.: 'Sheaves and logic', in
the Briot - Bouquet equation is sometimes referred to Applications of sheaves, Lecture notes in Math., Vol. 753,
as equation (1) with m>l. Springer, 1979, pp. 302-401.
[A2] JOHNSTONE, P.T.: Stone spaces, Cambridge Univ. Press,
References 1983.
[I] BRIOT, C. and BOUQUET, T.: 'Recherches sur les proprietes des [A3] SIMMONS, H.: 'A framework for topology', in Logic colloquium
equations differentielles', J. Eco/e Poly technique 21, no. 36 '77, Studies in logic and foundations of math., Vol. 96,
(1856), \33-198. North-Holland, 1978, pp. 239-251.
[2] BIEBERBACH, L.: Theorie der gewohnlichen Differentia/-
g/eichungen auffunktionentheoretischer Grund/age dargestellt, AMS 1980 Subject Classification: 06D20, 03G10
Springer, 1965.
[3] BRYUNO, A.D.: 'Analytical form of differential equations.
Introduction', Trans. Moscow Math. Soc. 25 (1971), 134-15J.
BROUWER THEOREM - 1) Brouwer's fixed-point
(Trudy Moskov. Mat. Obshch. 25 (1971), 120-138) theorem: Under a continuous mapping j: S ~S of an
[4] MARTINET, J. and RAM IS, J.P.: 'Problemes de modules pour n-dimensional simplex into itself there exists at least
des equations differentielles du premier ordre', Pub/. Math.
one point XES such that j(x)=x; this theorem was
IHES 55 (1982), 63-164.
A.D. Bryuno proved by L.E.1. Brouwer [I]. An equivalent theorem
AMS 1980 Subject Classification: 34A30 had been proved by P.G. Bohl [2] at a somewhat earlier
date. Brouwer's theorem can be extended to continuous
BROUWER LATTICE, Brouwer structure, Brouwer mappings of closed convex bodies in an n-dimensional
algebra - A distributive lattice in which for each pair of topological vector space and is extensively employed in
elements a, b there exists an element, called the proofs of theorems on the existence of solutions of vari-
pseudo-difference (frequently denoted by a *- b), which is ous equations. Brouwer's theorem can be generalized to
the smallest element c possessing the property b +c;;;;'a. infinite-dimensional topological vector spaces.
An equivalent description of a Brouwer lattice is as a References
variety of universal algebras (cf. Universal algebra) with [I] BROUWER, L.E.].: 'Ueber eineindeutige. stetige Transforma-

three binary operations u,


nand *-, which satisfies
tionen von FUichen in sich'. Math. Ann. 69 (1910), 176-180.
[21 BOHL, P.: 'Ueber die Beweging eines mechanischen Systems in
certain axioms. The term 'Brouwer algebra' was intro- der Nahe einer Gleichgewicht>lage'. J. Reine Angew. Math. 127
duced in recognition of the connection between (1904), 179-276.
VI. Sobolev
Brouwer lattices and Brouwer's intuitionistic logic.
Instead of Brouwer lattices the so-called pseudo-Boolean Editorial comments. There are many different proofs of
the Brouwer fixed-point theorem. The shortest and concep-
algebras are often employed, the theory of which is
tually easiest, however, use algebraic topology.
dual to that of Brouwer lattices. Any Brollwer lattice
Completely-elementary proofs also exist. Cf. e.g. [A 1],
can be converted to a pseudo-Boolean alge) fa by the
Chapt. 4. In 1886, H. Poincare proved a fixed-point result
introduction of a new order (a~'b)=>(b~a), and of on continuous mappings f: P--,>P which is now known to
new unions and intersections according to the formulas be equivalent to the Brouwer fixed-point theorem, [A2].
(a U .h) = (a n h), (a n' h) = (a U h) There are effective ways to calculate (approximate) Brouwer
fixed pOints and these techniques are important in a multi-
and the operation of relative pseudo-complementation tude of applications including the calculation of econom'c
a=>b which corresponds to the pseudo-difference a*- h. equilibria, [A1]. The first such algorithm was proposed by H.
Conversely, any pseudo-Boolean algebra can be Scarf, [A3]. Such algorithms later developed in the so-called
regarded as a Brouwer lattice. The term 'Brouwer lat- homotopy or continuation methods for calculating zeros of
tice' is sometimes used to denote a pseudo-Boolean functions, cf. e.g. [A4], [A5].
algebra (see, for instance, [2]).
References
References [A1] ISrRATI'sCC. V.1. Fixedpointtheory. Reidel. 1981.
[IJ McKINSEY. 1.C.C. and TARSKI. A.: 'The algehra of topology'. [A2] POINCAR(. H.: 'Sur les courbes definies par les equations
Ann. oj Mmh. (2) 45. Ill). 1(1944). 141191. differentielles', J de Math. 2 (1886).
[21 BIRKIIOFI'. G.: LallieI' lhl'orr. Amer. Math. Soc:. 1967. [A3J S( ARI. II .. 'The approximation of fixed points of continuous
V.A. Yankov mappings', SIAM J Apo/. Math. 15 (1967),1328-1343.
[A4] KARAMADlAN, S. (ED.): Fixed pOints. Algorithms and applica-
Editorial comments. In Western literature pseudo- tions, Acad. Press, 1977.

4X2
BRUN SIEVE

[A5] ALLGOWER, E. and GEORG, K.: 'Simplicial and continuation the morphism V;' XB~BwB x,Y)l->xwy) is an iso-
methods for approximating fixed pOints and solutions to sys-
morphism of algebraic varieties. An even more precise
tems of equations', SIAM Rev. 22 (1980), 28-85.
formulation of the Bruhat decomposition will yield a
2) Brouwer's theorem on the in variance of domain:
cellular decomposition of the projective variety G / B.
Under any homeomorphic mapping of a subset A of a
Namely, if Xo is a fixed (with respect to the left shifts
Euclidean space En into a subset B of that space any
by elements from B) point of G / B (such a point
interior point of A (with respect to En) is mapped to
always exists, cf. Borel fixed-point theorem), G / B will
an interior point of B (with respect to En), and any
be the union of non-intersecting V-orbits of the type
non-interior point is mapped to a non-interior point. It
V(w(xo, WE' W (d. Algebraic group of transforma-
was proved by L.E.l. Brouwer [1].
tions), and the morphism V;' ~Vw(xo) (UI->u(w(xo))) is
References an isomorphism of algebraic varieties. All groups V;"
[I] BROUWER, L.E.1.: 'Ueber Abbildungen von Mannigfaltig-
keiten', Math. Ann. 71 (1912), 97-115. being varieties, are isomorphic to an affine space; if the
M.J. VoTtsekhovskiT ground field is the field of complex numbers, then each
Editorial comments. For a modern account of the of the above V-orbits is a cell in the sense of algebraic
Brouwer invariance-of-domain theorem cf. [A 1]. Chapt. 7, topology so that the homology of G / B can be calcu-
Sect. 3. The result is Important for the idea of the lated. The existence of a Bruhat decomposition for a
topological dimension (dim e = n). number of classical groups was established in 1956 by
References F. Bruhat, and was proved in the general case by C.
[A 1] DUGUNDJI. J.: Topology, Allyn and Bacon, 1966. Chevalley [3]. A. Borel and 1. Tits generalized the con-
AMS 1980 Subject Classification: 54H25, 55M20, struction of Bruhat decompositions to the groups Gk of
57A15 k-points of a k-defined algebraic group [2], the role of
Borel subgroups being played by minimal parabolic k-
BROWNIAN MOTION - The process of chaotic dis- subgroups, the role of the groups V by their unipotent
placements of small particles suspended in a liquid or radicals; the Weyl k-group W k or the relative Weyl
in a gas which is the result of collisions with the group was considered instead of W.
molecules of the medium. There exist several References
mathematical models of this motion [1]. The model of [I] BOREL, A.: Linear algebraic groups, Benjamin, 1969.
Brownian motion which is the most important one in [2] BOREL, A. and TITS. J.: 'Groupes reductifs'. Publ. Math. IHES
27 (1965), 55-150.
the theory of random processes is the so-called Wiener [3] CHEVALLEY, c.: Classification des groupes de Lie algebriques, 2,
process, and the concept of Brownian motion is in fact Paris. 1958.
VP. Platonov
often identified with this model.
References AMS 1980 Subject Classification: 20G05
[1] PAVLOV, V.P.: 'Brownian motion', in Large Soviet Encyclo-
pedia, Vol. 4 (in Russian). BRUN SIEVE - A sieve method in elementary
number theory, proposed by V. Brun [1]; it is an exten-
Editorial comments. See also Wiener measure.
sion of the sieve of Eratosthenes (d. Eratosthenes,
References sieve ot). The method of Brun's sieve may be described
[A 1] ITt>, K. and McKEA"', H.P .. JR.: Diffusion processes and their
sample paths. Springer. 1974, Chap!. 1; 2. as follows. Out of a sequence of natural numbers an:%;x
the numbers with small prime divisors are eliminated
AMS 1980 Subject Classification: 60J65, 58G32
('sieved out') leaving behind prime and almost-prime
numbers (cf. Almost-prime number) with only large
BRUHAT DECOMPOSITION - A representation of a
prime divisors. Let P(x) be the amount of these
connected algebraic reductive group G as the union of
numbers. It can be shown that P (x) is included
double cosets of a Borel subgroup, parametrized by the
between two sums with a relatively-small number of
Weyl group of G. More exactly, let B, B - be opposite
Borel subgroups of a reductive group G; let V, V - be summands, which may be estimated from above and
the respective unipotent parts of B, B - (d. Linear from below. Thus, it is possible to evaluate from above
algebraic group) and let W be the Weyl group of G. In the number of twins in a given interval. Brun's sieve is
what follows w denotes both an element of Wand its employed in additive number theory. Brun used his
representative in the normalizer of the torus B B- , n sieve to prove that all large even numbers N can be
represented in the form N = P I + P 2, where P I and P 2
since the construction presented below is independent
contain at most 9 prime factors.
of the representative chosen. The group
V;' = V n w V - w - I will then be considered for each References
[1] BRUN, V.: 'Le crible d'Eratosthene et Ie theoreme de Gold-
WE' W. The group G is then representable as the union bach', CR. Acad. Sci. Paris Ser. I Math. 168. no. 11 (1919).
of the non-intersecting double cosets BwB (w E' W), and 544-546.

483
BRUN SIEVE

[21 GEL'FOND, A.D. and LINNIK, Yu.V.: Elementary methods in the BUBNOV - GALERKIN ME1HOD - See Galerkin
analytic theory of numbers, M.LT., 1966 (translated from the
Russian). method.
[31 TROST, E.: Primzahlen, Birkhauser, 1953. AMS 1980 Subject Classification: 49015, 65N30
N.!. Klimov
Editorial comments.
BUDAN - FOURIER lHEOREM - The number of
References roots of an algebraic equation
[A11 HALBERSTAM. H. and RICHERT. H.E.: Sieve methods. Acad.
Press. 1974. !(x) =0
AMS 1980 Subject Classification: 10H30 comprised in an interval (a, h), a <h, is equal to or is
smaller, by an even number, than 7'=tl -t2, where tl
BRUN lHEOREM on prime twins - The series is the number of changes in sign in the series of deriva-
~ 1/ p is convergent if p runs through all (the first tives of the polynomial f (x) at the point a, i.e. in the
members of all) prime twins. This means that even if series
!(a), I (a), ... ,/")(a),
the number of prime twins is infinitely large, they are
still located in the natural sequence rather sparsely. while t 2 is the number of changes in sign in this series
This theorem was demonstrated by V. Brun [1]. The at the point h. Each multiple root is counted according
convergence of a similar series for generalized twins to its multiplicity. Established by F. Budan (1822) and
was proved at a later date. 1. Fourier (1820).

References References
[II BRUN. v.: 'La serie 1/5 + I /7+ ... ou les denominateurs [I) Encyclopaedia of elementary mathematics, Vol. 2 Algebra.
Moscow-Leningrad. 1951, p. 331 (in Russian).
sont 'nombres premiers jumeaux' et convergente ou finie'. Bull. o.A. lvanova
Sci. Math. (2) 43 (1919).100-104; 124-128.
[21 TROST. E.: Primzahlen, Birkhauser. 1953. Editorial comments. An application of the
N.l. Klimov Budan - Fourier theorem in numerical analysis may be
Editorial comments. found in [A1], where it is used in the interpolation by spline
References functions.
[A1] HALBERSTAM. H. and RICHERT. H.E.: Sieve methods, Acad. References
Press. 1974. [A1] BOOR, C. DE and SCHOENBERG, U.: 'Cardinal interpolation
and spline functions VIII. The Budan-Fourier theorem for
AMS 1980 Subject Classification: 10J15 splines and applications.', in K. Bohmer, G. Meinardus and
W. Schemp (eds.): Spline functions, Lec!. Notes in Math.,
BRUNN - MINKOWSKI lHEOREM - Let K 0 and Vol. 501, Springer, 1976.
K I be convex sets in an n-dimensional Euclidean [A2] HOUSEHOLDER, A.S.: 'Unique triangularization of a nonsym-
metric matrix', J. Assoc. Compo Mach. 5 (1958), 339-342.
space; let K", AE[O, 1], be the set of points which
divide segments with end points at any points of the AMS 1980 Subject Classification: 65H05
sets Ko and KI in the ratio A/(l-A) (a linear combi-
nation of Ko and K I ); and let V(A) be the n-th power BUFFON PROBLEM of the needle- A classical prob-
root of the volume of the set K". Then V(A) is a con- lem in the theory of geometric probabilities, which is
cave function of A, i.e. the inequality rightly considered to be the starting point in the
V(A[(l-p)+A2P) ;;;. (l-p)V(Ad+pV():) development of this theory. First posed by G. Buffon
in 1733 and reproduced together with its solution in [1].
is valid for all AI, A2, pE[O, 1]. The function V(A) is Buffon considered the following situation: A needle of
linear (and the inequality then becomes an equality) if
length 2r, where 2r<a, is thrown at random on a plane
and only if Ko and Klare homothetic. The
ruled by parallel lines at a distance a from one another.
Brunn - Minkowski theorem can be generalized to
What is the probability that the needle thus thrown will
linear combinations of several convex sets. I t is used to
come to rest across one of the lines? Clearly, the loca-
solve extremal and uniqueness problems. It was
tion of the needle is defined by the distance x from its
discovered by H. Brunn in 1887, and completed and
centre to the nearest straight line and by the acute
rendered more precise in 1897 by H. Minkowski.
angle () between the needle and the perpendicular to
References the line. The magnitude x lies between zero and a /2,
[II BUSEMAN. H.: Convex surfaces. Interscience. 1958.
while () lies between zero and 'TT / 2. It is assumed that
[2] HADWIGER. H.: Vorlesungen fiber Inhalt, Oberjlache und Isoper
imetrie, Springer. 1957. the point (x, () is uniformly distributed in the appropri-
M.I. Vottsekhovskit ate rectangle (this is equivalent to assuming that the
Editorial comments. random variables x and () are independent and are uni-
References formly distributed over (0, a /2) and (0. 'TT / 2.
Then
[A 1] LEICHTWEISS, K.: Konvexe Mengen, Springer, 1979. the sought probability is defined as the ratio between
AMS 1980 Subject Classification: 52A40, 52A20 the areas corresponding to the favourable and to all

484
BURKILL INTEGRAL

possible results, and is References


[I) BOUNJAKOWSKY, W. [V.YA. BUNYAKOVSKJi): 'Sur quelques
inegalites concernant les integrales aux differences finis', Mem.
p = --
1
~!!...
f rcosfJdfJ = -.
w/2

0
4r
aw
(*) A cad. Sci. St. Petersbourg (7) 1 (1859), 9.
v.I. Bityutskov
22 Editorial comments. In Western literature this inequality
In the past Buffon's problem served as the tool for the is often called the Cauchy inequality, or the
experimental check on the Bernoulli theorem. In fact, if Cauchy- Schwarz inequality. Its generalization to a function
the needle is thrown n times, and crosses one of the fin Lp and a function gin Lq,.1 / p+ 1 / q= 1, is called the
lines in m cases, the frequency m / n should, for large Holder inequality.
values of n, according to Bernoulli's theorem, be close Cauchy's algebraic inequality stated above holds for real
to the probability (*). This idea was used by many numbers ai, bi , i= 1, ... ,n. For complex numbers ai, bi ,
workers for the determination of the number '1T by ran- i = 1, ... ,n, it reads
dom trials [1], [2]. Buffon also considered other similar I alti; + ... +anbn 12 =
problems, in particular the problem of the needle cross- I a1 + ... +a~ I . I ~ + ... +&n I
ing the lines belonging to two mutually-perpendicular It has a generalization analogous to the Holder inequality.
systems which subdivide the plane into rectangles hav-
ing sides of lengths a and b respectively. The answer References
[A1) RUDIN, W.: Principles of mathematical analysis, McGraw-Hili,
given by Buffon to this problem is incorrect. The 1953.
correct solution
4r(a+b)-4r 2 AMS 1980 Subject Classification: 26010
wab
was found by P. Laplace in 1812. BURKILL INTEGRAL - A concept which was intro-
duced by J.e. Burkill [1] to determine surface areas.
References
[I) BUFFON, G.: Essai d'arithmetique morale. Supplementa The Burkill integral is introduced in its modem form
'/'Histoire Morale', 4, 1777. for the integration of a non-additive function F(l) over
[2) USPENSKY, J.V.: Introduction to mathematical probability, an n-dimensional segment (a block). Let R be a set that
McGraw-Hili, 1937.
[3) KENDALL, M.G. and MORAN, PAP.: Geometric probability, can be represented as a sum (union) of a finite number
Griffin, 1963. of segments (such a set is called a figure). Each
A. V. Prokhorov
representation R = U 1 k is called a subdivision C!f the
AMS 1980 Subject Classification: 60005 figure R. The upper and the lower Burkill integral of the
segment function F(l) over the figure R are, respec-
BUNDLE tively, the upper and lower limits of the sums L.kF(ld
Editorial comments. In the sense of a (two-dimensional) for all possible subdivisions as the maximum of the
family of objects the usual name is net. (See also Web of diameters of the segments involved in the subdivision
spheres.) tends to zero. If these integrals are equal, their common
See also Vector bundle; Fibre space; Fibration. value is the Burkill integral of F over R and is denoted
AMS 1980 Subject Classification: 55RXX 14F05, by f RF. If F is integrable over R, then F is integrable
32LXX, 57Q50, 57R22, 53AXX, 14C21, 14C20, on each figure R 1 cR. This enables one to introduce an
51N20 indefinite Burkill integral, which is an additive set func-
tion. If F is continuous, the indefinite Burkill integral is
BUNYAKOVSKII INEQUALITY - An equality in continuous as well.
mathematical analysis, established by V.Ya. The concept of the Burkill integral can be general-
BunyakovskiI [1] for square-integrable functions f and ized to include the case of a set function defined on
g: some class of subsets of an abstract measure space.
This class must meet a number of requirements; in par-
ticular, each set of the class must permit a subdivision
This inequality is analogous to Cauchy's algebraic ine- into sets also of this class that have a measure as small
quality as one pleases. The Burkill integral can then be defined
(a,b, + ... +anbnf .;;;
for any set in the class in analogy with the n-
.;;; (at + ... +a~)(bt + ... +b~). dimensional case, the respective limits being taken as
The BunyakovskiI inequality is also known as the the maximum of the measures of the constituent sets
Schwarz inequality; however, BunyakovskiI published tends to zero. The Burkill integral can be naturally gen-
his study as early as 1859, whereas in H.A. Schwarz' eralized to set functions with values in a commutative
work this inequality appeared as late as 1884 (without topological group. The Burkill integral is less general
any reference to the work of Bunyakovskll,,). than the subsequently introduced Kolmogorov integral,

485
BURKILL INTEGRAL

which is also known as the Burkill- Kolmogorov where


integral. Any function that is Burkin-integrable is also
Kolmogorov-integrable after a suitable ordering of the
R = -I-If [g(Z)-b ]m-I (t)g'(z)dt dz
m 27Ti a y g(t)-b g(t)-g(z)'
subdivisions. The converse statement is true only if cer-
tain additional conditions are satisfied. The Burkin Here y is a contour in the t-plane which encloses the
integral is used in constructing the Denjoy integral in r
points a and z, and is such that if is any point inside
different spaces. y, then the equation g(t)=gm has no roots on y or
The name of Burkin integral is also given to a inside yother than the simple root t = r.
number of generalizations of the Perron integral (AP-, The expansion (*) for the case b =0 was obtained by
CP-, SCP-integrals), which were also introduced by J.L. Lagrange [2].
Burkill. Instead of ordinary derivatives certain general- If the derivative g'(t) has a zero of order r-1 at the
ized derivatives are used in the definition of these point z = a, there is the following generalization of the
integrals. These Burkill integrals are used in the theory Biirmann - Lagrange series for the multi-valued inverse
of trigonometric series. function [3]:
F(w)= /(a)+

+1l~1~ {::,~II [I(Z) [g~Z~~b r] t~(J(W -h)"


References
[IA) BURKILL, J.C: 'Functions of intervals', Proc. London Math.
Soc. (2) 22 (1924), 275-310. r
[IB) BURKILL, J.C: 'The expression of area as an integral', Proc.
London Math. Soc. (2) 22 (1924),311-336.
Another generalization (see, for example. [4]) refers to
[2) SAKS, S.: Theory of the integral, Hafner, 1952 (translated from
the Polish). functions g(z) regular in an annulus; instead of the
[3) ROMANOVSKli, P.I.: 'l'Integrale de Denjoy en espaces series (*), one obtains a series with positive and nega-
abstractes', Mat. Sb. 9 (51), no. I (1941),67-120. tive powers of the difference w - h.
[4) BURKILL, J.C: 'Integrals and trigonometric series', Proc. Lon-
don Math. Soc. (3) I (1951), 46-57. References
VA. Skvortsov [I) BORMANN. H.: Mem. In.H. Nat. Sci, Arts. Sci. Math. Ph)'s. 2
AMS 1980 Subject Classification: 26A39, 28A 10, (1799). 13-17.
[2A) LAGRANGE, J.L.: Mem. Acad. R. Sci. et Be/les-Iellres Berlin 24
49F25 (1770).
[2B] LAGRANGE. J.L.: 'AdJitions au memoire sur la resolution des
BVRMANN- LAGRANGE SERIES, Lagrange series equations numcriques', in Oeuvres. Vol. 2. Georg Olms. 1973.
- A power series which offers a complete solution to the pp. 579-652.
[3] HURWITZ. A. and COCRA:-;T. R.: Vorlesungen iiher allgemeine
problem of local inversion of holomorphic functions. In Funkionenenrheorie und elliptische Funktionen. 1. Springer.
fact, let a function w = g(z) of the complex variable z 1968. Chapt. 7.
be regular in a neighbourhood of the point z = a. and [4] WHITTAKER. E.T. and WATSON. G.N,: A course oj'modern
anah sis. Cambridge U niv. Press. 1952.
let g(a)=b and g'(a)=f=O. Then there exists a regular [5] MARKUSHFVICll. A.1.: Theon' olIul1Clions oj' a complex variahle.
function z =h(w) in some neighbourhood of the point 1. Chelsea. reprint. 1977 (translated from the Russian).
w = h of the w-plane which is the inverse to g (.::) and is E. D. Solomentsev
such that h (b)=a. Moreover, if fez) is any regular Editorial comments. There is an exhaustive treatment of
function in a neighbourhood of the point .:: = a, then the Lagrange - Burmann theorem and series in [A 1].
the composite function F(w)= /[h (11')1 can be
References
expanded in a Burmann - Lagranfie series ) 1 a neigh- [A 1] HENRIe!. p, Applied and computational complex analysis. 1,
bourhood of the point w = h Wiley, 1974

F(w) = f\a)-l (*) AMS 1980 Subject Classification 30810

+
,,2:x
1111
-I ,r - 1 l" (-) l z - a
d" -
ldz" 1 I - ~(z)-h
1" 11
jJ (1\' -- h)".
BURNSIDE PROBLEM - 1) The Burnside problem
for finite groups: Do unsolvable finite groups of odd
The inverse of the function \\'=g(.::) is obtained hy set- order exist? Or, in another formulation: Ale all finite
ting/(.::) .::. simple non-Abelian groups of even order? This problem
The expansion (*) follows from Biirm{lnn's theorem is connected with the name of W. Burnside, who noted
[1]: Under the assumptions made ahove on the holo- in 1897 that all simple non-Abelian groups which were
morphic functions fi (z) and 1(.::). the latter function known at that time were of even order [1]. The problem
may he represented in a certain domain in the .::-planc was solved in 1962 by W. Feit and J.G. Thompson [2],
containing a in the form who showed that all finite groups of odd order are solv-
Ii z) -c Ii (/ ) -l able.

j "~llg(z)
Ii 1 n.
1 h]" I d"
I-dz-" -I II
1 r"
(z)
I z- (i l' 1 1
-~(_--)=h J II r R",.
References
[I] BUR~SIDF. W.: Theon' ojgroups oj' finite order. Camhridge
Univ. Press. 1897.

4~6
BURNSIDE PROBLEM

[2] FElT, W. and THOMPSON, J.G.: 'Stability of groups of odd In accordance with the idea which has been gradu-
order', Pacific J. Math. 13 (1964), 775-1029. VD M ally evolving ever since the middle of the nineteen-
.. azurov
thirties, the answer to the following question is of
Editorial comments. In [1], Burnside also showed that
importance in the theory of finite groups: Is the order
all groups of order p8 q b, where p, q are prime numbers and
a, b~O, are solvable.
of any finite group with d generators, obeying the iden-
tity relation xn = 1, be bounded from above by a
2) The Burnside problem for periodic groups: This
certain natural number bed, n), depending only on d
problem was posed by W. Burnside in 1902 [1): Is a
and n? This is the so-called restricted Burnside problem.
finitely-generated group, each element of which is of
It has been positively solved [13) for all prime
finite order, always finite (the unbounded Burnside prob-
exponents n =p. It has thus been proved that there
lem)? This problem may also be formulated as fol-
lows: Are all periodic groups locally finite (cf. Locally- exists a universal finite p-group B (d, p) of order bed, p)
finite group)? Burnside himself stressed an important whose quotient groups are isomorphic to all other finite
special case of this problem, when the orders of all p-groups with d generators and satisfying the relation
elements of the groups are bounded uniformly (the x P = 1. If B (d, p) is finite, one has the identity
bounded Burnside problem), i.e. for a certain natural B(d,p)=B(d,p). A comparison of the results in [10)
number n the identity xn = I is valid in the group. It is and [13) leads to the conclusion that if p is sufficiently
the bounded variant of the Burnside problem which has large, there exists a finitely-generated infinite simple p-
received the most attention. In other words, the object group of exponent p. It has been shown that
studied was the quotient group B(d, n)=F / P of a b (2, 5) = 534 . For p ~ 7 there are only a few estimates
free group F with d~2 generators by the smallest nor- from below for b (d, p), connected with corresponding
mal subgroup P containing the n-th powers F of all estimates c(d, p) for the nilpotency class of the group
elements fEF. The following results are known: B (d, p). It is known that c(2, p) cannot be a linear
B (d, 2) is an elementary Abelian group of order 2d; function of p. More importantly, c(d, p) increases
B (d, 3) is a finite group of order 3md , where without limit with d [14), [15). The question of the
existence of B(d, n) for n=pm, m>I, beginning from
n = 8 and 9 is still (1977) open. At the same time, the
existence of B (d, n) for all square-free n is a conse-
[1), [2); B(d, 4) is a finite group (of order 212 if d=2 quence of the results reported in [6) and [l3), and of
and of order 269 if d=3 [1), [3), [4)); B(d, 6) is a finite the theorem of the solvability of groups of odd-order
group of order 2s 31 , where s = 1+(d -I)3 md , t =mr and (cf. Burnside problem for finite groups), and of certain
r=I+(d-I)2d [5), [6). The negative solution to the facts concerning the classification of simple groups.
bounded Burnside problem was announced in 1959 [7). The original solutions [8) of the unbounded Burnside
The negative solution to the unbounded Burnside prob- problem and [l3) of the restricted Burnside problem are
lem was published in 1964 [8). Another construction of partly based on the theory of algebras - in the former
a periodic group which is not locally finite was subse- case on the criterion of infinite dimensionality of an
quently given [9). P.S. Novikov and S.1. Adyan [10) algebra, in the latter on an identity of Lie algebras (cf.
proved in 1968 that the group B(d, n), d~2, is infinite Lie algebra), which is an analogue of the identity x P = 1
for all odd n~4381 (a negative solution of the bounded in groups [16), [17). There are other Burnside-type
Burnside problem). It was subsequently shown that the problems than those mentioned above, and they receive
word problem and the conjugation problem for these considerable attention [8), [9).
values of d, n are solvable in B(d, n); B(d, n) cannot
be presented by means of a finite number of defining References
[I] BURNSIDE, W. ':)n an unsettled question in the theory of
relationships; all finite subgroups in B (d, n) are discontinuous groups', Quart. J. Pure Appl. Math. 33 (1902),
Abelian, and all Abelian subgroups are cyclic; B (d, n), 230-238.
d~n, do not meet the maximum or minimum condi- [2] LEVI, F.V. and W AERDEN, B.L. VAN DER: 'Ueber eine beson-
dere klasse von Gruppen', Abh. Math. Sem. Univ. Hamburg 9
tions for normal subgroups; the B(d, n), d>2, are iso- (1932), 154-158.
morphically imbeddable in the group B (2, n). For a [3] SANOV, LN.: 'Solution of the Burnside problem for exponent
study of the properties of the infinite groups B (d, n), 4', Uchen. Zap. Leningrad. Gos. Univ. Ser. Mat. 10 (\ 940),
166-\70 (in Russian).
based on the advanced methods of [10), see the mono- [4] BAYES, A.J., KAUTSKY, J. and WAMSLEY, T.W.: Proc. 2-nd
graph [11) in which, in particular, the bound on the Internat. Con! Theory of Groups, Canberra, 1973, pp. 82-89.
odd values of n indicated above is reduced to n ~665. [5] HALL, M.: 'Solution of the Burnside problem for exponent 6',
Proc. Nat. Acad. Sci. USA 43 (1957),751-753.
To give a final bound for the indices n to which [6] HALL, P. and HIGMAN, G.: 'On the p-Iength of p soluble
correspond finite or infinite groups B (d, n) is a difficult groups and reduction theorems for Burnside's problem', Proc.
problem. The values n = 5, 12, and n = 2m , m ~ 3, are London Math. Soc. (3) 6 (1956), 1-42.
especially interesting in this context. [7] NOVIKOV, P.S.: 'On periodic groups', A mer. Math. Soc. Transl.

487
BURNSIDE PROBLEM

45 (1965), 19-22 (in Russian). (Dokl. Akad. Nauk SSSR 127, A simple and geometrically obvious version of the nega-
no. 4 (1959), 749-752) tive solution to the bounded Burnside problem for odd
(8) GoLOD, E.S.: 'On nil algebras and residually-finite groups', numbers n > 1010 was given by AYu. Ol'shanskiT [A3]. Later
Izv. Akad. Nauk SSSR Ser. Mat. 28 (1964),273-276 (in Rus-
(cf. [A4]) he constructed for each sufficiently-large prime
sian).
(9) ALESHIN, S.V.: 'Finite automata and Burnside's problem for number p an infinite rrgroup in which all normal subgroups
periodic groups', Math. Notes 11, no. 3 (1972), 199-203. (Mat. have order p (Tarski's monster). This is the strongest form
Zametki 11, no. 3 (1972), 319-328) of a negative solution to Burnside's problem. Every1hing
[lOA) NOVIKOV, P.S. and ADYAN, S.I.: Izv. Akad. Nauk SSSR Ser. that has been done on the restricted Burnside problem is
Mat. 32, no. I (1968),212-244.
summarized in [A5] and the book [A6]. In [A6] a complete
[lOB) NOVIKOV, P.S. and ADYAN, S.I.: Izv. Akad. Nauk SSSR Ser.
Mat. 32, no. 3 (1968), 709-731. list of references is given, as well as results of computer
[II) ADYAN, S.I.: The Burnside problem and identities in groups, experiments of various scientists:
Springer, 1979 (translated from the Russian). 18(4,4) 1=2422 (cf. [A7]) ,
(12) SHMIDT, O.Yu.: Selected works on mathematics, Moscow, 1959,
pp. 298-300 (in Russian).
I 8(3,5) 1,,;;;52282 (cf. [A8]) ,
(13) KOSTRIKIN, A.I.: 'On Burnside's problem', Izv. Akad. Nauk I 8(2, 7) I > 76366 ,
SSSR Ser. Mat. 23 (1959),3-34 (in Russian). I 8(2,5) 1=534 (cf. [A9]).
(14) BACHMUTH, S.M., MUCHIZUKI, H.Y. and WALKUP, D.W.: 'A Further, [A10] may prove to be useful.
nonsolvable group of exponent 5', Bull. Amer. Math. Soc. 76,
no. 3 (1970), 638-640.
References
[A 1) GRIGORCHUK, R.I.: 'On Burnside's problem for periodic
(15) RAzMYSLOV, Yu.P.: 'On Lie algebras satisfying the Engel con-
groups', Funktsional. Anal. i Prilozhen. 14, no. 1 (1980), 53-
dition', Algebra and Logic 10, no. I (1971),21-29
54 (in Russian).
(Algebra i Logika 10, no. I (1971),33-44)
[A2) GRIGORCHUK, R.I.: 'Milnor's problem on the growth of
(16) MAGNUS, W.: 'Ueber Beziehungen zwischen hoheren Komrnu-
groups', Ookl. Akad. Nauk SSSR 271, no. 1 (1983), 30-33
tatoren', J. Reine Angew. Math. 177 (1937), 105-115.
(in Russian).
(17) HIGMAN, G.: 'Lie ring methods in the theory of finite nilpotent
[A3] OL'SHANSKIT, A.Yu.: 'On a theorem of Novikov-Adyan', Mat.
groups', in Proc. Internat. Congress Mathematicians Edinburgh,
Sb. 118, no. 2 (1982), 202-235 (in Russian).
1960, Cambridge Dniv. Press, 1960, pp. 307-312.
[A4] OL'SHANSKIT, A.Yu.: 'Groups of bounded period in which all
(18) KUROSH, A.G.: 'Ring-theoretical problems connected with
subgroups are of prime order', Algebra i Logika 21, no. 5
Burnside's problem for periodic groups', Izv. Akad. Nauk
(1982), 555-618 (in Russian).
SSSR Ser. Mat. 5 (1941),233-240 (in Russian). German
abstract. [A5] VAUGHAN-LEE, M.R.: 'The restricted Burnside problem', Bull.
A.l. Kostrikin London Math. Soc. 17 (1985), 113-133.
[A6] KOSTRIKIN, A.I.: Around Burnside, Springer. Forthcoming.
Editorial comments. A positive solution of the restricted [A7] ALFORD, HAVAS, G. and NEWMAN, M.F.: 'Groups of
Burnside problem for the case of exponent five was first exponent four', Notices Amer. Math. Soc. 22 (1975), A.301.
obtained in [.A.11]. In the period 1977 - 1986 a lot of [A8] HAVAS, G., NEWMAN, M.F. and VAUGHAN-LEE, M.R.: 'A nil-
additional results on the Burnside problem for periodic potent quotient algorithm for graded Lie rings', To appear.
groups were obtained. R.1. Grigorchuk [A1] proposed a [A9] HAVAS, Go, WALL, GoE. and WAMSLEY, loW.: 'The two-
generator restricted Burnside group of order five', Bull. Aus-
most-simple construction of finitely-generated infinite rr tral. [,lath. Soc. 10 (1974), 459-470.
groups. One of its versions leads, in particular (cf. [A2]), to [A 10] MENNICKE, l.L (ED.): 'Proc. Burnside workshop Bielefeld,
the construction of groups of intermediate growth, i.e. nei- 1977', Lec!. Notes Math., Vol. 806, Springer, 1980.
ther polynomial, nor exponential growth; it leads, moreover, [A 11] HIGMAN, G.: 'On finite groups of exponent five', Proc. Cam-
bridge Phil. Soc 52 (1956), 381-390.
to the construction of both such periodic groups as well as
of such torsion-free groups (solutions to Milnor's problem). AMS 1980 Subject Classification: 20F50

Você também pode gostar