Escolar Documentos
Profissional Documentos
Cultura Documentos
PART I
Editor
ROBERT S. COHEN, Boston University
VOLUME 177
HUGUESLEBLANC
Courtesy o/Virginia G. Leblanc
Edited by
MATHIEU MARION
University of Ottawa
and
ROBERT S. COHEN
Boston University
ISBN13: 9789401072045
eISBN13: 9789400915756
001: 10.1007/9789400915756
TABLE OF CONTENTS
ix
EDITORIAL PREFACE
LOGIC
On Axiomatizing Free Logic - And Inclusive
Logic in the Bargain
FRAN~OIS LEPAGE / Partial Propositional Logic
SERGE LAPIERRE / Generalized Quantifiers and Inferences
MARIE LA PALME REYES, JOHN MACNAMARA and GONZALO E.
REYES / A Category-Theoretic Approach to Aristotle's Term Logic,
with Special Reference to Syllogisms
JOACHIM LAMBEK / On the Nominalistic Interpretation of Natural
Languages
JEAN-PIERRE MARQUIS / If Not-True and Not Being True Are Not
Identical, Which One Is False?
DANIEL V ANDER VEKEN / A New Formulation of the Logic of
Propositions
YVON GAUTHIER / Internal Logic. A Radically Constructive Logic
for Mathematics and Physics
JUDY PELHAM / A Reconstruction of Russell's Substitution Theory
HUGUES LEBLANC /
1
23
41
57
69
79
95
107
123
PHILOSOPHY OF MATHEMATICS
MICHAEL HALLETT / Hilbert and
MATHIEU MARION / Kronecker's
Logic
'Safe Haven of Real Mathematics'
135
189
PHILOSOPHY OF PHYSICS
MARIO BUNGE / Hidden Variables, Separability, and Realism
STORRS McCALL / A Branched Interpretation of Quantum Mechanics
vii
217
229
243
viii
T ABLE OF CONTENTS
Philosophy
259
275
287
297
309
NAME INDEX
315
EDITORIAL PREFACE
By North-American standards, philosophy is not new in Quebec: the first mention of philosophy lectures given by a Jesuit in the College de Quebec (founded
1635) dates from 1665, and the oldest logic manuscript dates from 1679. In
English-speaking universities such as McGill (founded 1829), philosophy
began to be taught later, during the second half of the 19th century. The major
influence on English-speaking philosophers was, at least initially, that of
Scottish Empiricism. On the other hand, the strong influence of the Catholic
Church on French-Canadian society meant that the staff of the facultes of the
French-speaking universities consisted, until recently, almost entirely of
Thomist philosophers. There was accordingly little or no work in modem
Formal Logic and Philosophy of Science and precious few contacts between the
philosophical communities. In the late forties, Hugues Leblanc was a young
student wanting to learn Formal Logic. He could not find anyone in Quebec
to teach him and he went to study at Harvard University under the supervision
of W. V. Quine. His best friend Maurice L' Abbe had left, a year earlier, for
Princeton to study with Alonzo Church.
After receiving his Ph.D from Harvard in 1948, Leblanc started his professional career at Bryn Mawr College, where he stayed until 1967. He then went
to Temple University, where he taught until his retirement in 1992, serving as
Chair of the Department of Philosophy from 1973 until 1979. His achievements
as a logician include seminal contributions to the development of Free Logic, in
particular with the ground breaking paper, written jointly with Theodore
Hailperin, 'Nondesignating Singular Terms' (Philosophical Review 68 (1959),
pp. 239-43). After initial results by Bas van Fraassen, using supervaluation,
Hugues Leblanc and Richmond Thomason obtained completeness results in
'Completeness Theorems for Some Presupposition-Free Logic' (Fundamenta
Mathematicae 62 (1968), pp. 125-64). More recently, Leblanc also made
seminal contributions to Truth-Value Semantics (cf. his Truth-Value Semantics,
Amsterdam, North-Holland, 1976) and, inspired by appendices to Karl Popper's
Logic of Scientific Discovery, to Probability Semantics and Probability Theory,
in his paper 'Probabilistic Semantics for First-Order Logic' (ZeitschriJt for
mathematische Logik und Grundlagen der Mathematik 25 (1979), pp. 498509). In all, Leblanc has written more than one hundred scientific papers, the
more recent of them in collaboration with Peter Roeper (Australian National
University), and four books, he collaborated on two books and edited or coedited four. Many logic students will remember learning the subject from his
classic textbook, written with William A. Wisdom, Deductive Logic (3rd edn.,
Englewood Cliffs, Prentice Hall, 1993).
After a long and fruitful career in the United States, Hugues Leblanc is now
ix
EDITORIAL PREFACE
back in Quebec, where the philosophical milieu has changed beyond recognition since his student days. He came back to find studies in logic and in all
aspects of philosophy of science in a flourishing state. As a result of the revolution tranquille which took place among the French-speaking society in the
sixties, philosophy in Quebec opened up to external influences such as, initially,
phenomenology and Marxism and, increasingly in the past twenty years, AngloAmerican analytic philosophy. As a result, there is now a growing number of
French-speaking logicians and philosophers of science - although not all of
them work from the point of view of analytical philosophy. Conditions were set
for fruitful exchanges with the English-speaking philosophical community. (But
we should add here that the essential role of immigrants in the evolution of
the philosophical life in Quebec should not be overlooked. Contributors to the
present volumes come not only from other parts of Canada, but also from
Argentina, Australia, Belgium, Germany, Ireland, Switzerland, the United
Kingdom and the United States).
Such exchanges have led recently to the creation of research groups across
Quebec. These are now joined together under the name of Groupe de recherche
sur la representation, I' action et Ie langage or GRRAL. Our two volumes
of Quebec Studies in the Philosophy of Science comprise the first full-scale
collection of studies in the philosophy and history of science from French- and
English-speaking philosophers of Quebec to appear in English; they include in
particular most members of the GRRAL. As editors, we are happy to join the
contributors in dedicating these volumes to Hugues Leblanc, who is, among
philosophers, the first logicien quebecois.
In our first volume, which opens with a new essay on Free Logic by Hugues
Leblanc himself, we have collected together papers in logic, philosophy of
mathematics, philosophy of physics and in general philosophy and history of
science. This volume includes members of two of the research groups forming
the GRRAL, the group on Fondements de la logique et fondement du raisonnement (M. Hallett, F. Lepage, S. Lapierre, J.-P. Marquis, and, at the time of
writing, J. Pelham) and the group Actes du discours et grammaire universelle
(D. Vanderveken & Y. Gauthier).
The papers in the section on logic show the great variety of logical investigations work done across Quebec. Both Franyois Lepage and Serge Lapierre
present their results, respectively, on partial functions in type theory and conditional quantifiers, within their more global context. The following three papers
reflect the fundamental research in category theory which has been taking place
in Montreal's various departments of mathematics. Marie La Palme Reyes, John
Macnamara and Gonzalo Reyes argue in their paper for the replacement of the
standard Boolean, class interpretation of syllogistic by a category-theoretical
approach. Jim Lambek argues for an extension of his nominalistic interpretation
of the language of mathematics, developed in collaboration with Jocelyne
Couture and Phil Scott, to natural languages, and Jean-Pierre Marquis studies
EDITORIAL PREFACE
xi
the distinction between true, not-true and not being true within the perspective
of topos theory.
Further to his work on the logic of illocutionary forces, Daniel Vanderveken
presents in his paper a framework for a new logic of propositions. Yvon
Gauthier presents his own system of internal logic and system of finitist arithmetic, where the schema of complete induction is replacd by that of Fermat's
infinite descent. Finally, building on joint results with Alasdair Urquhart on
structured propositions, Judy Pelham presents in her paper a reconstruction of
Russell's substitution theory (circa 1905-6) that provides a resolution of the
paradoxes which originally caused Russell to abandon the theory in favour of
the ramified theory of types.
In the section on the philosophy of mathematics, Michael Hallett studies the
role of logic in Hilbert's approach to the foundations of mathematics, while
Mathieu Marion examines the relations between Kronecker's philosophy of
mathematics and the tradition of logical foundations. The section on philosophy
of physics comprises a paper by the distinguished philosopher of science Mario
Bunge, in which he argues that recent experiments which refuted hidden
variable theories were not a refutation of realism, a paper by Storrs McCall
presenting a new 'branched' interpretation of Quantum Mechanics and a paper
by Michel Blais on Chaos Theory.
The last section of the volume contains studies in general philosopy of
science and a study by the leading historian of science, William Shea. In his
paper, Paul Pietro sky presents an original conception of ceteris paribus laws
inspired by Ramsey's ideas about causation. Dave Davies argues that Putnam's
recent emphasis on conceptual relativity against metaphysical realism does
not vindicate critics of his model-theoretic argument, but actually clarifies
Putnam's objections to these very criticisms. Jean Leroux presents elements of
Helmholtz's epistemology which foreshadow modem forms of empiricism and
he argues for an anti-realist reading of his theory of science. Finally, Shea
studies the origin, in the development of new technologies from the medieval
ages onwards, of the mechanical philosophy which underlay modem science,
from Galileo to Newton.
We would like to thank Alain Voizard for his help in writing this preface, and
also the editor of the Brazilian Journal of Physics, formerly Revista Brasileira
de Fisica, for granting us permission to reprint Mario Bunge's essay 'Hidden
Variables, Separability and Realism' (volume especial os 70 anos de Mario
SchOnberg, 1984, pp. 150-168). We are especially grateful to Annie Kuipers for
her professional assistance on behalf of Kluwer Academic Publishers and for
her continued encouragement and patience.
Boston and Montreal
April 1995
HUGUES LEBLANC
Free Logic owes its name to its being free of two presuppositions of Standard
Logic, one to the effect that something exists, a patent truth but surely a factual
rather than a logical one, and the other to the effect that every singular term
designates something, a patent falsehood. l In Standard Logic with Identity,
it is this familiar law:
A.
(3X) (X
T),
T here any term you please, that most crisply encapsulates the two presuppositions. 2 In Standard Logic without Identity, it may be this little known
counterpart of it:
B.
(3X) (A == A(T/X)),
C.
(\iX) (\iY)A
though independent of the other axiom schemata for that logic, is provable it so happens - in the presence of those for '='.3 So, awkwardly, C must be
dropped as one converts Free Logic without Identity into Free Logic with
Identity. 4 But what I regret more is that (i) in the process of ax iomati zing
Free Logic with Identity, Bencivenga, Lambert, and others do not exploit
Leonard's Specification Law for Free Logic with Identity in [19], to wit:
D.
(3X) (X
and (ii) in the process of ax iomati zing Free Logic without Identity, they do
not exploit this Identityless counterpart of D in [10]:
E.
Editorial note: a list of lettered fonnulas in Section I is given at the end of this essay.
M. Marion and R. S. Cohen (etis.), Quebec Studies in the Philosophy of Science I, 1-22.
1995 Kluwer Academic Publishers.
HUGUES LEBLANC
(3X)(X = T)
T.
(V'X)A ~ A(T/X)
of D and that
H.
T=T
of F are the only two of the customary axiom schemata for Standard Logic
with Identity not among those for my partial logic, adding A to them obviously extends it to a full-fledged Standard Logic with Identity.6 The resulting
axiomatization of that logic may be new. In any event it is one in which,
interestingly enough, the Specification Law for Free Logic with Identity begets
G, the Specification Law for Standard Logic (with or without Identity).
In Section III I extend the partial logic of Section II to a full-fledged Free
Logic with Identity. I do this in two different - and shown equivalent - ways,
using in the first case H plus this generalization of A:
I.
Y a variable distinct from X, and in the second case H plus Lambert's generalization of G in [7]:
J.
Y this time a variable foreign to A and A(Y/X) the result of course of replacing
X everywhere in A by Y.
In Section IV I drop the one axiom schema in Section II that exibits '=',
and show the resulting partial logic to yield E as a theorem. So, enlisting B
as a substitute for A is sure to extend that logic to a full-fledged Standard Logic
without Identity. The resulting axiomatization may also be new, and this time
around it is the Specification Law for Free Logic without Identity that begets
G. I next extend the present partial logic to a full-fledged Free Logic without
Identity. Following the precedent set in Section III, I do this in two different
- and shown equivalent - ways, using in the first case this generalization of
B:
K.
Free Probability Theory (or, as more recent usage has it, a Free Probability
Logic), and I remind the reader that Free Probability Theory does provide
an alternative semantics for Free Logic, one in which probability functions
relativized to possibly empty sets of terms substitute for truth-value functions
relativized to such sets.
In Section VI, Lastly, I turn to what Quine called in [22] Inclusive Logic,
provide both an axiomatization of and a semantics for it, and study its relationship to Standard as well as to Free Logic. Used in the process will be
these two axiom schemata:
L.
(3X) (A
and
M.
(\tY) (3X) (A
-A).
Note: For brevity's sake I shall refer to the partial logic with Identity of
Section II as L., to the Standard Logic with Identity that L. extends to in
that very section as SL., to the Free Logic with Identity that L. extends to
in Section III as FL., to the partial logic without Identity of Section IV as
L, and to the Standard Logic without Identity and the Free Logic without
Identity that L extends to in that very Section as SL and FL, respectively.
As for Inclusive Logic, I shall refer to the one with Identity as IL. and to
the other as IL.
II
The primitive signs of L., SL., and FL. are the customary ones, to wit: (i)
countably many predicates, '=' one of them, of course, (ii) a denumerable
infinity of individual variables, say, 'X1" 'X2" . , 'xn', . . . , (iii) a denumerable infinity of singular or - for uniformity's sake - individual terms,
say, 't 1 ','t2 ', , 'tn', ... ,8 (iv) the three logical operators '-', '::J', and
'\t', (v) the two parentheses '(' and ,)" and (vi) the comma','. As for the statements of L., SL., and FL., they are also the customary ones, except for (\tX)A
counting as a statement only if A(T/X) - T once more any term you please
- counts itself as a statement. Identical quantifiers, as a result, cannot overlap,
hence the restriction placed three times in Section I on the variable y' 9
Parentheses will be dropped unless ambiguity threatens; and the four logical
operators '&', 'v', '=', and '3' - two of them already used in Section 1will be presumed to be defined in the customary manner.
Lastly, extending my use so far of '/' and introducing its cognate 'II',
suppose I and l'to be two individual variables, or two individual terms, or
one an individual variable and the other an individual term. A(I'II) will then
be the result of replacing I everywhere in A by I', and A(I'III) that of replacing
I at zero or more places in A by 1/.10
Following in this Fitch's example in [5], I identify the axioms of L. (hence,
those common to L., SL., FL., and ILJ recursively:
HUGUES LEBLANC
A ::J (B ::J A)
(A ::J (B ::J C)) ::J A ::J B) ::J (A ::J C))
A3.
A4.
AS.
A6.
('v'X)A~(XlT)
::J ('v'X)A;(XlT)
'v'X)A~(XlT)
::J ('v'X)A;(XlT))
and
after whichever of ('v'X)A~(XlT) and ('v'X) (A~ ::J A;) (XlT) (=('v'X) (A~(XlT)
::J A;(XlT))) occurs second in the third column will justify the presence of
('v'X)A;(XlT) in that column. The first of these lines is indeed an axiom of
L. of the sort A4, the second follows from the first and ('v'X) (A~ ::J A;) (XlT)
by means of MP, and ('v'X)A;(XlT) follows from the second and ('v'X)A~(XlT)
by means of MP also. But, X being by hypothesis foreign to A and hence to
An' ('v'X)A~(XlT) is the same as ('v'X)A(XlT). So, if 1-_ A, then 1-. ('v'X)A(XlT),
so long as X is foreign to A.
0
LEMMA 7. 1-. ('v'X)(A ::J B) ::J 3X)A ::J (3X)B).
Proof Suppose T new. 13
(1)
(Lemma 4)
(2)
(Go, (1))
(3)
(A4)
(4)
(5)
(A4)
(6)
1-. ('v'X)(A ::J B) ::J 'v'X)-B ::J ('v'X)-A) (Lemma 1, (4), (5))
(7)
(8)
(Lemma 4)
(Lemma 1, (6), (7)) 0
HUGUES LEBLANC
LEMMA 8. 1-. (V X) (A
Proof
B)
3X)A
(1)
1-. -B
(2)
1-. (3X)B
(3)
(AI)
(4)
(MP, (2), (3
(5)
(A2)
(6)
(MP, (4), (5
(7)
(Lemma 7)
(8)
(Lemma 1, (7), (6 0
VX)-B)
(AS, hypo on X)
(Lemma 5, (1
1-. A(T'/X)
(T' = T
(1)
1-. A(T'/X)
(T' = T
A(T/X
(2)
(3)
(Go, (1
(A4)
i.e.
(4)
A(T/X
(Lemma 2, A6)
(MP, (2), (3
(Lemma 8)
(6)
(Lemma 1, (4), (5
(7)
(Lemma 2, (6
(5)
(A6)
(Lemma 3, (1
(Lemma 4)
(MP, (2), (3
(Go, (4
(6)
(A4)
(7)
(MP, (5), (6
(8)
(AS)
(9)
(Lemma 1, (8), (7
f-. (3X)(X = T) :J T = T
(Lemma 5, (9
(10)
To obtain the Free Logic with Identity promised on page 2, (i) substitute
wherever appropriate 'FL.' for 'L.' in Section II, (ii) add to the six axiom
schemata on page 4 either this axiom schema, labelled I on page 2:
FA7.
(VY)(3X)(X
Y),
(VY)VX)A:J A(Y/X,
HUGUES LEBLANC
(2)
(3)
(4)
I- F_ (V'Y)3X)(X = Y) ::J
V'X)A ::J A(Y/X)
I- F_ (1) ::J V'Y)(3X)(X = Y) ::J
(V'Y) V'X)A ::J A(Y/X)
I- F_ (V'Y) (3X) (X = Y) ::J
(V'Y) V'X)A ::J A(Y/X
I- F_ (V'Y) V'X)A ::J A(Y/X
(Gn, Theorem 1)
(A4)
(MP, 0), (2
(MP, FA7, (3
This done, I go on to show that FA7' and FAS (not used in the foregoing proof)
deliver FA7. Two new lemmas are used in the course of the proof.
LEMMA 9. I- F_ (A ::J -B) ::J (B ::J -A).
LEMMA 10. I- F_ (V'Y) (A(Y/X) ::J (3X)A).
Proof. Suppose T new.
(1)
(2)
(3)
(4)
(5)
(Lemma 9)
(Gn, (1
(A4)
(MP, (2), (3
(MP, FA7', (4
(2)
(3)
(4)
(5)
(Lemma 10)
(A4)
(MP, (1), (2
(Gn, FAS)
(MP, (3), (4
So, given FAS, FA7 and FA7' are provably equivalent means of extending
the partial logic of Section II to a full-fledged Free Logic with Identity, a result
mentioned by Bencivenga in [2] and known to Lambert.
Shown in effect at the close of Section II was that statements of L_, and
hence of FL_, of the sort FAS are independent of the axioms of L_. Proof
that FA7 and FA7' are independent of the axioms of L_ and of the statements of FL_ of the sort FA8 calls for a bit more work.
:E1, :E2, ... , :En' ... being infinitely many non-empty sets of individual
terms of FL_, take a statement A of FL_ to evaluate to I on a truth-value
assignment a. (to the atomic statements of FL_) relative to the sequence
(:E 1, :E2, ... , :En' ... ) if the customary conditions are met when A is an
atomic statement, a negation, or a conditional; but, in the case that A is a
universal quantification (VX)B, X here the i-th individual variable of FL_
for some i or other from lon, take A to evaluate to 1 on a. relative to
(:E1' :E2, ... , :En' ... ) if, and only if, B(T/X) evaluates to 1 on a. relative to
that sequence for every term T in :Ej This done, consider the truth-value assignment a. that assigns 1 to every atomic statement of FL_ of the sort T =
T but the truth-value 0 to every other one, and a sequence (:E1' :E2, ... , :En'
... ) that is arbitrary except for :E1 and :E2 being {'t 1'} and {'t/}, respectively. Then all the axioms of L_ and statements of FL_ of the sort FA8 evaluate
to 1 on a. relative to (:E 1, :E2, ... , :En' ... ), as does the consequent of a
conditional of FL_ if that conditional and its antecedent themselves do.
Contrastingly, though, 't1 = t/ evaluates to 0 on a. relative to (:E 1, ~, ... ,
:En' ...), hence '(VX)-(x 1 = t2), evaluates to I on a. relative to that sequence
(this because 't1' is the only member of :E 1), hence '(3x 1) (X1 = t2), evaluates
to 0 on a. relative to (:E1' ~, ... , :En' ... ), and hence '(Vx2)(3x 1)(X 1 = x2)'
evaluates to 0 on a. relative to (:E1' :E2, ... , :En' ... ) (this because 't2' is
the only member of :E2). So, at least one statement of FL_ of the sort FA7
evaluates to 0 on a. relative to that sequence. So, at least one statement of
FL_ of the sort FA7 is independent of the axioms of L_ and of the statements
of FL_ of the sort FA8. And at least one statement of FL_ of the sort FA7',
to wit: 'CV'x2) Vx 1)F(x 1) ::J F(x2 will prove to be independent of the axioms
of L_ and of the statements of FL_ of the sort FA8 if 'F(t1)' is assigned 1
rather than 0 by the truth-value assignment 0.. 14
',
IV
To obtain the partial logic L promised on page 2, (i) substitute wherever appropriate 'L' for 'L_' in Section II, (ii) drop the predicate '=' on page 3 and, as
already indicated, axiom schema A6, (iii) for the reason given on page 1,
add this axiom schema, labelled C on that page:
FA6.
(VX)(VY)A::J (VY)(VX)A,
A.
10
HUGUES LEBLANC
And, to obtain the Free Logic FL also promised on page 2, (i) substitute
wherever appropriate 'FL' for 'L' in Section II, (ii) add besides FA6 either this
axiom schema, labelled K on page 2:
F7".
A is provable in FL
as
f-F A.
I first show that E is provable in L and hence that Free Logic without
Identity boasts a Specification Law that exactly parallels
(3X)(X
T)
VX)A
A(T/X
A(Y/X,
to wit:
(3X) (A == A(T/X
VX)A
A(T/X.
(A
B).
f- (A(T'/X) == A(T/X ~
(A(T'/X) ~ A(T/X
f- A(T'/X) :J A(T'/X) == A(T/X ~
A(T/X
f- (VX)(A ~ A == A(T/X :J ACT/X)))
f- (3) ~ VX)A ~
(VX) A == A(T/X ~ ACT/X)))
f- (VX)A ~ (VX) A == A(T/X ~
A(T/X
(6)
(7)
(8)
A(T/X
f- (3X) (A == A(T/X
VX)A ~ A(T/X
(Lemma 11)
(Lemma 2, (1
(Go, (2
(A4)
(MP, (3), (4
(Lemma 8)
(Lemma I, (5), (6
(Lemma 2, (7
11
(2)
(3)
(Go, Theorem 5)
(A4)
(MP, (1), (2))
r-F A == A.
(Lemma 10)
i.e.
(1)
(2)
(3)
(4)
(5)
(6)
(Lemma 10)
(A4)
(MP, (1), (2))
(Lemma 12)
(Go, (4))
(MP, (5), (3))
But FA7' is needed to obtain Lemma 10. So, FA7' yields FA7".
On page 2 I talked of A and B being provably equivalent given H (=
FAS). Indeed, enlist H as an extra axiom schema of L. Then B is provable
in L. from A, and A is provable in L. from (one special case of) B, as I proceed
12
HUGUES LEBLANC
to show. Four additional lemmas are needed to that effect, of which I prove
only the two involving '='.
LEMMA 13.
IJr_ A:::>
(B :::> C) and
(2)
(3)
(4)
r_ T' =
r _T'
LEMMA 15.
Proof.
(1)
(2)
T'
T :::> T
r_ T' =
r _T' = T
r _T = T'
:::> T = T')
:::> T = T')
(A6)
(Lemma 2, (1
(H)
=
T'
(MP, (3), (2
T :::J (A == A(T/T'.
(A6)
:::J (A(T/T') :::> (A(T/T' (T'/T (A6)
:::J (A :::J A(T/T'
i.e.
(2)
(3)
(4)
(A6)
(Lemmas 1 and 14, (2
(Lemma 13, (1), (3 0
(3)
(4)
(5)
(3X) (A == A(T/X)))
r _(3X) (X = T) :::> (3X) (A == A(T/X
r _(3X)(X = T)
r _(3X) (A == A(T/X)
(MP, (4), (3
LEMMA 16.
IJr_ B, then r_
(A == B) :::J A.
r_ T =
(H)
(Lemma 16, (1
(Gn, (2
(4)
(5)
(6)
(7)
13
T == T
T).15
v
Of the various semantic accounts of Free Logic, the truth-value one in [11]
is by far the simplest. Let ~ be a possibly empty set of terms of FL. and Ul;
be a unary function from the statements of FL. to 0 and 1. Then Ul; is said
to constitute an identity-normal truth-value function for FL= if it obeys the
following six constraints:
BI.
1 if ul;(A) = 0
0 otherwise
ul;(A ::J B) = 1 if ul;(A) = 0 or ul;(B) = 1
= 0 otherwise
ul;V'X)A) = 1 if~ =@ or ul;(A(T/X = 1 for every term T in ~
= 0 otherwise
ul;(T = T) = 1
If ul;(T = T') = 1, then ul;(A) = ul;(A(T'I/T, where A is atomic.
If one of T and T' belongs to ~ but the other one does not, then
ul;(T = T') = o.
=
=
B2.
B3.
B4.
B5.
B6.
E ~.
14
HUGUES LEBLANC
u}; for FL.. Proof can then be retrieved from [11] and Section 3 of [13]
that
I- F A if, and only if, A is logically true in the truth-value sense.
CI.
C2.
C3.
P};(-A) = 1 - P};(A)
C4.
P};(A)
P};(A)
CS.
P};(A & B)
C6.
C7.
P};(B & A)
cs.
C9.
CIO.
P};(T = T) = 1
If P};(T = T') = 1, then u};(A) = u};(A(T'IIT)), where A is atomic
If one of T and T' belongs to 1: but the other one does not, then
P};(T = T') = O.
Note as regards constraint C7 that Po(-(A & -A) & (\fX)B) = Po(-(A &
-A)). But P};(-(A & -A) & (\fX)B) is easily shown to equal p};\fX)B). So.
by C2, Po\fX)B) = 1, as expected. ls
These matters attended to, declare a statement A of FL. logically true in
the probability sense if P};(A) = 1 for every identity normal probability function
p}; for FL. Proof can then be retrieved from Section 4 of [13] that
I- F= A if, and only if, A is logically true in the probability sense.
Results analogous to the two just obtained hold of course for Free Logic
without Identity: write 'FL' everywhere for 'FL.', delete all occurrences of
the qualifier 'identity -normal', drop constraints B4-B6 on page 13 and con-
15
straints C8-CIO above, write 'f-F' in place of 'f-F-' in the two results in question,
and the trick is done. So, as axiomatized in this paper, Free Logic with and
without Identity is sound and complete in the probability as well as the truthvalue sense. 19
VI
IA9.
(3X)(A v -A)
VX)A
A(T/X)),
IAtO.
VX)A
A(Y/X))),
(VY)(3X)(A v -A)
(VY)VX)A
A(Y/X)),
16
HUGUES LEBLANC
i.e. FA 7'. So all the theorems of Free Logic with Identity are provable in
Inclusive Logic with Identity. But Free Logic with Identity was so axiomatized
as to make room for 0 as a domain. Hence so is Inclusive Logic with Identity.
Hence Inclusive Logic with Identity is so axiomatized here as to be exactly
like Standard Logic with Identity except for owning 0 as a domain.
The truth-value semantics of Section V is easily adjusted to suit Inclusive
Logic with Identity, as is the probability one: take 0 and the set {'t l ', 't/,
... , 'tn', ... } of all the terms of IL_ to be the only :E's to which the truthvalue functions and the probability functions there are relativized. 23 Proof that,
given the present axiomatization of and semantics for IL_, IL_ is sound and
complete in both the truth-value and the probability sense can be retrieved from
[10] and [13], but the retrieval is a bit laborious at places.
IL is readily gotten from IL_: (i) drop '=' of course, (ii) substitute Fine's
axiom schema FA6 for A6 and drop axiom schema FA8, and (iii) drop constraints B4-B6 on page 13 and constraints C8-ClO on page 14.
The relationship between Standard Logic, Inclusive Logic, and Free Logic
can be depicted as follows, '=' ignored from now on (and without prejudice)
to expedite matters:
SL
IL
Note for proof that (i) every theorem of FL is provable in IL, as we just
saw, but some statements of the sort IA9 are not provable in FL, as I shall
establish below, and (ii) every theorem of IL is obviously provable in SL,
but some statements of the sort (\tX)A:J A(T/X) are not provable in FL,
'(\txl)(F(x l) & -F(x l :J (F(t l) & -F(tl' being the most obvious case in point.
So, claims to the contrary notwithstanding, Free Logic is but a sublogic of
Inclusive Logic, and of course Inclusive Logic is but a sublogic of Standard
Logic. 24 25
So the only two items of business left concerning the present axiomatization of Inclusive Logic are showing that IA9 is independent of the axiom
schemata of FL and that IAlO is independent of the other axiom schemata
of IL.
17
does evaluate to 0 on u r .
As regards IAIO, consider the following 4-valued truth-value fuction for
IL due to Roeper:
u",(B)
~(A)
~(-A)
2/3
Ih
Ih
2/3
u",(A => B)
2h
1/3
2h
1/3
2h
1
2/3
1
1/3
Ih
1
Ih
2h
0
u",(A)
0
~(A(T/X
~VX)A)
2/3
1/3
2/3
2/3
2/3
It is easily verified that all the axioms of IL not of the sort IAIO evaluate to
1 under u r . Yet this statement of IL of the sort IAIO:
hence
18
HUGUES LEBLANC
hence
hence
hence
A.
B.
c.
(3X) (X = T)
(3X)(A == A(T/X
(V X) (VY)A ::) (VY) (VX)A
(= FA6)
D.
E.
(= Theorem 1)
(= Theorem 5)
F.
(3X)(X = T) ::) T = T
G.
H.
I.
T=T
(= FAS)
(VY) (3X) (X = Y)
(= FA7)
J.
(= FA7')
K.
(VY)(3X)(A == A(Y/X
(= FA7")
L.
(= IA9)
M.
(= IAtO)
Universite du Quebec
a Montreal
NOTES
I
Free Logic dates back to 1959, the year that saw the publication of [14], a paper by Leblanc
and Hailperin, and the publication of [6], a paper by Hintikka. The Free Logic in both cases is
one with Identity. Free Logic without Identity dates back to 1963, the year that saw the publication of [7], a paper by Lambert. As regards the first of the two presuppositions, recall Russell's
remark on p. 203 of [23]; "The primitive propositions in Principia Mathematica are such as to
allow the inference that at least one individual exists. But I now regard this as a defect in
logical purity."
2 A is an adaptation of an axiom of Tarski's in [24], where - free variables doing duty in
effect for terms - a variable other than X occurs in place of T.
3 The result is Fine's in [4].
4
This is explicitly done in [2], but should be done as well in [8].
s [19] predates [14] and [6], and influenced the writing of [14]. Leonard's Law appears as an
19
axiom schema in several axiomatizations of Free Logic with Identity and in all axiomatizations of what is known as the Logic of Existence. On the latter occasions,
(3X)(X
T)
is provable from
14
Bencivenga had already shown in (1) that FA7 is independent of AI-A6, FA8, and Leonard's
Specification Law for Free Logic with Identity (= Theorem I), hence in effect that the law in
question could not substitute for either of FA7 and FA7' in the foregoing axiomatizations of
FL_. He also showed there that this fascinating law:
('ltX)3Y)(Y
X) :J A) :J ('ltX)A,
is independent of AI-A6, FA8, and Leonard's Law. However, Bencivenga's Law - as I take
leave to call it - readily follows from A4, Lemma 2, and FA7. Note indeed that
('ltX)3Y)(Y
X) :J A) :J 'ltX)(3Y)(Y
X) :J ('ltX)A)
by A4, hence
('ltX)(3Y)(Y
X) :J 'ltX)3Y)(Y
X) :J A) :J ('ltX)A)
20
HUGUES LEBLANC
T=T'
and
A(T'/T)
is even closer than the foregoing results suggest. Note indeed that
A:5 A(T/T),
If A :5 A(TIT'), then A :5 A(T'IT),
and
If A :5 A(T/T') and A :5 A(T'IT"), then A - A(TIT").
So, like Identity, Substitutivity in the sense of '/' is an equivalence relation. Note also that as
(3X)(X - T) :::> T
21
Absent indeed from previous axiomatizations of IL_ and IL, the one in [15] among the
latter, is IAI0 in (iii) on page 15, an axiom schema recently shown by Roeper to be independent of those in [15]. Yet, no matter the truth-value function 0.1: or the probability one P1: for
IL_, statements of IL_ of the sort IAI0 all evaluate to 1 under 0.1: and P1: and hence are all
logically true.
23
That the statements of IL_ of the sort IAI0 all evaluate to I under the resulting functions
is obvious enough.
24
Studied in [20] and [15] are Inclusive Logics where (\iX)A, when vacuous, is provably
equivalent to A. Quine's test will suit them if vacuous quantifiers are deleted before a statement is subjected to the test. The axiom schemata of IL in [15] are AI-A3, AS, the converse
of AS, and this restricted version of A4:
n
REFERENCES
1. Bencivenga, E., 1978, 'A Semantics for a Weak Free Logic', Notre Dame Journal of Formal
Logic 19, 646-652.
2. Bencivenga, E., 1986, 'Free Logics', in Handbook of Philosophical Logic, vol. 3, D.
Reidel Publishing Company, Dordrecht, pp. 373-426.
3. Cocchiarella, N. B., 1966, 'A Logic of Actual and Possible Objects', The Journal of Symbolic
Logic 31, 689-690.
4. Fine, K., 1983, 'The Permutation Principle in Quantificational Logic', Journal of
Philosophical Logic 12, 31-37.
5. Fitch, F. B., 1948, 'Intuitionistic Modal Logic with Quantifiers', Portugaliae Mathematica
7,113-118.
6. Hintikka, J., 1959, 'Existential Presuppositions and Existential Commitments', The Journal
of Philosophy 56, 125-137.
7. Lambert, K., 1963, 'Existential Import Revisited', Notre Dame Journal of Formal Logic
4,288-292.
8. Lambert, K., 1991, 'The Nature of Free Logic', in Philosophical Applications of Free Logic,
Oxford University Press, New York - Oxford.
9. Leblanc, H., 1968, 'On Meyer and Lambert's Quantificational Calculus FQ', The Journal
of Symbolic Logic 33, 275-280.
22
HUGUES LEBLANC
10. Leblanc, H., 1971, 'Truth-Value Semantics for a Logic of Existence', Notre Dame Journal
of Formal Logic 12, 153-168.
11. Leblanc, H., 1976, Truth-Value Semantics, North-Holland Publishing Company, Amsterdam
New York Oxford.
12. Leblanc, H., 1982, 'Popper's 1955 Axiomatization of Absolute Probability', Pacific
Philosophical Quarterly 63, 133-145.
13. Leblanc, H., 1983, 'Alternatives to Standard First-Order Semantics', in Handbook of
Philosophical Logic, vol. 1, D. Reidel Publishing Company, Dordrecht, pp. 189-274.
14. Leblanc, H. and Hailperin, T., 1959, 'Nondesignating Singular Terms', The Philosophical
Review 68, 239-243.
15. Leblanc, H. and Meyer, R. K., 1969, 'Open Formulas and the Empty Domain', Archiv
fUr mathematische Logik und Grundlagensforschung 12, 78-84.
16. Leblanc, H. and Meyer, R. K., 1982, 'On Prefacing ('v'X)A ::J A(Y/X) with ('v'Y): A Free
Quantification Theory without Identity', in Existence, Truth, and Provability, State
University of New York Press, Albany, pp. 58-75. The paper there is an amended version
of the original, which had appeared in ZeitschriJt fUr mathematische Logik und Grundlagen
der Mathematik 12, 1971, pp. 153-168.
17. Leblanc, H. and Roeper, P. 1993, 'On Getting the Constraints on Popper's Probability
Functions Right', Philosophy of Science 60, 151-157.
18. Leblanc, H. and Thomason, R. H., 1968, 'Completeness Theorems for Some PresuppositionFree Logics', Fundamenta Mathematicae 62,125-164.
19. Leonard, H. S., 1956, 'The Logic of Existence', Philosophical Studies 7,49-64.
20. Mostowski, A., 1951, 'On the Rules of Proof in the Pure Functional Calculus', The Journal
of Symbolic Logic 16, 107-111.
21. Popper, K. R., 1959, The Logic of Scientific Discovery, Basic Books, Inc., New York.
22. Quine, W. V., 1954, 'Quantification and the Empty Domain', The Journal of Symbolic Logic
19,177-179.
23. Russell, 8., 1919, Introduction to Mathematical Philosophy, George Allen and Unwin, Ltd.,
London.
24. Tarski, A., 1965, 'A Simplified Formulation of Predicate Logic with Identity', Archiv fUr
mathematische Logik und Grundlagen der Mathematik 7, 61-79.
25. van Fraassen, 8. C., 1%6, 'Singular Terms, Truth-value Gaps, and Free Logic', The Journal
of Philosophy 67, 481-495.
FRAN<;ms LEPAGE
24
FRANyOIS LEPAGE
recent. Pavel Tichy [Tic82] not only formulated the project explicitly, but
gave motivations for such an enterprise. His motivations remain, in my opinion,
globally valid, and it is worth recalling them here.
The first motivation is that 'the logic underlying ordinary language - and
hence our conceptual scheme - is that of the simple type theory'. What is meant
here is without doubt, that type theory possesses sufficient resources for
expressing the inferential structures one finds in natural languages; structures whose complexity often exceeds the expressive capacity of first order
calculi. The second motivation is that it is necessary to enrich the classical
type theory from inside 'by dropping the totality assumption and treating partial
functions on a par with total ones'. In fact, 'the partial theory seems to
provide a medium which yields an analysis for any linguistic expression, and
affords a universal explication of logical entailment'.
We see that for Tichy the idea of introducing partial functions in type theory
is not presented as the adoption of a particular hypothesis but, on the contrary,
as the lifting of an arbitrary restriction, i.e., that which consists of using only
a subset of functions for semantic values - the subset of total functions,
which are defined for all the arguments of the right category. It is difficult
to determine whether TichY's motivation was primarily epistemic. What is
certain, at any rate, is that his motives can be interpreted in an epistemic sense.
Tichy did not completely realize his project, in part because he did not know
how to use the full power of the partial theory of types, lacking as he did a
fundamental technical notion to which I will come back later. Now we will
present the framework that will serve to introduce partial interpretations, i.e.,
the theory of propositional types.
THE THEORY OF PROPOSITIONAL TYPES
The partial theory of types results from generalizing the well-known simple
theory of types by authorizing the presence of partially defined entities. Let
us recall some elementary facts regarding type theory, in particular regarding
the theory of propositional types.
Simple type theory is a simplified version of Russell's ramified theory of
types. It was initially formulated by Chwistek and Ramsey, and the first full
presentation is that of Church [Chu40].
In its contemporary versions, the theory of types consists first of the construction of a hierarchy of functions in the following way.2
DEFINITION 1. The set of types T is the smallest set such that
(i)
(ii)
(iii)
(iii)
25
26
FRAN<;:OIS LEPAGE
DEFINITION 3. The set Trma of terms of type a. is the smallest set such
that
(i)
(ii)
(iii)
(iv)
Vara E Trma
if A, B E Trm a, then [A == B] E Trm,
A E Trma and x E Trmr" then AxA E Trmr.a.
if A E Trmar, and B E Trm a, then [AB] E Trmr,
We will call terms of type t statements. The logical constants are defined in
the following manner: 3
DEFINITION 4.
T = def[Ax,x == /...xx]
F = def[Ax,x == AxT]
-'" = defAx,[X == F]
1\'(11)
defAx,Ay,[A!,(II)[[fx]y] == Af[[fT]Tll
V xaA
def[AxaA == AxT]
aET
aET
such that /-L(xa) E Da. We write /-L(a/x) for the assignation that differ from /-L
at most by the fact that it assigns the value a to x
Finally, we define a valuation based on /-L.
DEFINITION 6. A valuation based on /-L is a function
V 11: U Trma ~ U Da
aET
aET
such that
(i)
V l1(xa ) = /-L(xa );
(ii)
(iii)
(iv)
27
[Aa == A]
H2 :
[[At ==
H3:
[[T /\ F] == F]
H4:
H5:
H6:
H7:
11 == A]
[j== gJ]
[[AxaABJ == A {Blx}] where B is free for x and A {Blx} is the formula
obtained from A by substituting B for each free occurrence of x.
['v'xa[[fa~xaJ]
==
[ga~xa]] :J
28
FRANyOIS LEPAGE
29
confuse our use of the notion 'undefined' with another use, that which consists
of calling an expression undefined if it is not well-formed, that is if it is not
a meaningful term. In this case one should rather speak of nonsense.
In the classical context, the expressions A, B, C would take total functions as values, so the value of [BC] would be a total function of the type of
the arguments of A and thus the value of [A[BC]] would be a total function
as well. What would happen if the values of these expressions, still being
of the right type, were partial? Specifically, what would happen if the value
of B were a partial function undefined for the argument which is the value
of C? The value of [BC] is not defined. In that case, [A[BC]] has no
value, that is, it is not possible to give a sense to the expression 'value of
[A[BC]]'.
PM,={O,l,<p}
(ii)
PMaf',
(PMa -7 PMf',)
where (PMa -7 PMf',) is the set of monotone functions of PMa in PMf',' the
monotony being relative to the following order:
(i)
(ii)
PMa, f(x) ~
PMa,
30
FRANC;OIS LEPAGE
and
(iii)
(iv)
PMa,
We can represent partial objects using the following graphic artifice. First of
all, since PM, forms a meet-semi-Iattice it is natural to represent it by
1
and a function f
PM" by
f(1)
f(O)
f(<p)
1
1
<p
<p
<p
0
<p
<p
1
<p
<p
<p
0
<p
<p
<p
<p
<p
<p
<p
Diagram 1.
Certain elements of the top of the hierarchy (which we will call partial
total functions) behave like the regular total functions (which we will henceforth call classical functions) at least when their arguments themselves behave
like classical functions. We can characterize them formally in the following
way.
31
for a.
for a.
=
=
t, and x
Pr,
and f
PM" x
E
PMaf3 , f
PTa' f(x)
PTf3.
(ii)
One easily verifies that '",,' is a relation of equivalence. One of the interesting properties of this notion is expressed by the following proposition.
PROPOSITION 11. For any f, g
v g exists.
Intuitively, partial functions of the same type can differ in two ways. The
first, classical, is that they can take incompatible values. They can also differ
in so far as they are compatible but more or less defined. This intuition can
be formalized in the following way.
s:
(ii)
for x, y
for f, g
PM" x :t:-* y iff x :t:- <p and y :t:- <p and x :t:- y;
g(y).
Now we can define a partial interpretation for the terms of the theory of propositional types.
DEFINITION 13. A partial value assignation Jl is a function
Jl: U Vara ~ U PMa
aET
aET
32
FRANyOIS LEPAGE
such that ~(xa) E PMa. As before, we write ~(a/x) for the assignation that
differs from ~ at the most by the fact that it assigns the value a to x
Finally, we define a partial valuation based on
~.
is a function
aeT
such that
(i)
vIl(xa) == ~(xa);
(ii)
(iii)
(iv)
Da
Clauses (i), (iii) et (iv) are identical to the corresponding clauses of Definition
6. Clause (ii) deserves a few remarks. The adoption of clause 6 (ii) is not
possible because in that case VIl([Aa == BJ) would not be monotone with regard
to V Il(A) and V Il(B). (For example, given AI and BI such that VIl(A) = <p and
ViB) = <p; according to 6 (ii) we would have V i[A == B]) = 1 but for C and
D such that VIl(C) = 1 and VIl(D) = 0 and VIl(A) $ VIl(C) and VIl(B) $ VIl(D),
and by monotony, V I'([C == D]) = 1. Therefore 1 = 0 which is not desirable,
to say the least.
Moreover, the relation '",,' between partial total functions, though weaker
than identity, is sufficient in virtue of Proposition 11: if two total functions
are different but equivalent, their supremum exists and belongs to the same
equivalence class. The fact that these functions take equivalent total values
for the same total arguments guarantees monotony.
We can show that the order on partial functions induces a similar order
on partial valuations. Thus we can speak of partial total valuations and
classical valuations. The existence of an epimorphism between partial total
functions and classical functions allows us to assimilate a classical valuation
to any element of the corresponding equivalence class of partial total valuations, because all statements receive exactly the same value (total, of course)
according to the classical valuation and according to all the partial total
valuations of the corresponding equivalence class. For a detailed proof, see
[Lrep92].
One easily verifies that (see Definition 4 and the definition of denotation),
Td = [AX,x == AXX]d = 1
pi = [AX,x == AxT]d = 0
33
o
represents propositional identity and that -,,/ = A.Xt[x == F]d will be the function
represented by the following diagram
f(1)(1) f(1)(O)
f(1)(q
f(O)(q
f(q(1) f(q(O)
f(q(q
Diagram 2.
A t(1I)
for f, namely
o
q>
Diagram 3.
34
FRAN<;OIS LEPAGE
PTa' V I1(a'x)(A) = 1
PMa , VlJ(a,x)(A) = 0
and
3xA d = [-,VX[-,A]]d
[-,[AXa[-,A] == AxT]]d
= 1 iff for some a E PMa , VlJ(alx)(A) = 1
= 0 iff for any a E PTa' Vl1(alx)(A) = 0
= <p otherwise.
=
In classical logic, in the sense meant there, the notions of validity and valid
inference are monolithic. A is a valid consequence of a set of statements r,
symbolically, r F A, if and only if for any valuation Jl, if for any B E r,
ViB) = I, then VI1(A) = 1. In a similar way, A is valid, symbolically, FA, iff
for any valuation Jl, V I1(A) = 1. An equivalent way of presenting things in
the classical context is to use the false rather than the true. Then we would
have A as a valid consequence of r if and only if, for any valuation Jl, if for
any B E r, VI1(B) = I, then VI1(A) O. In a similar way, A is valid iff ViA)
= 0 for no valuation Jl.
These two notions, as we would expect, do not coincide in partial logic
on account of the present of a third, the undefined. We can define two concepts
of validity.12
These two notions do not coincide; we can, however, easily show that [Lrep92].
PROPOSITION 17.
r FA
iff
j2.A.
What about the class of verifiably valid statements? This class is neither
empty nor identical to the class of classically valid statements. For example,
35
[Aa == Aa] is not verifiably valid but [AXaX == AXaX] is. The problem, one
will have guessed, is to give a system (complete if possible) for the class of
verifiably valid statements. The question is not yet resolved, and the major
difficulty rests in the absolute impossibility of constructing an expression Elf
of the object language, the interpretation of which would be [EA] is true if
and only if A is not defined and false otherwise. It is easy to convince oneself
that such a functor would not be monotone.
For the time being we must content ourselves with fragmentary results
[Lrep92]. First of all, if it is not possible to introduce in the object language
a functor [EA] the interpretation of which would be 'A is undefined', it is
possible nonetheless to introduce a functor ~(A), the interpretation of which,
roughly, is 'A is total'.
This is why we say 'roughly': ~(Aa) is true when A is total and undefined
otherwise. ~(Aa) is never false and in this way the monotony is preserved.
Such a functor is of great utility. In effect, it makes it possible to introduce
axioms of the form 'if A, B, ... , are total, then ... ' and permits the elaboration of a system of partial logic.
A DEDUCTIVE SYSTEM FOR PARTIAL PROPOSITIONAL LOGIC
The first stage consists in characterizing the subset of formulae of propositional calculus as verifiably valid tautologies. We proceed in the following
way.
DEFINITION 19. Let FP
smallest set such that
(i)
(ii)
Var, U {T, F} ~ FP
if A, B E FP, then -,A, [A
1\
v B], [A == B)
FP.
(iii)
(iv)
(v)
if A
E
C]
36
FRAN<;OIS LEPAGE
(vi)
(vii)
FP+ and
part.
Classically valid rules
Rl.
R2.
R3.
R4.
F ~ A.
[AX I [Ax"A,BI] ... Bn] ~ A,{BI ... Bjxl ... x n}, provided
that every B j is free for Xj in AY
A,[C,Ix,], [C == B,] ~ A[Blx], provided that B in free for x in A.
[[AliT] A [AF]] ~ V'x,[Ax].
R5.
R6.
~ V'Zar,V'var,[V'Xa[ZX ==
== [z == v]].
RIO.
A ~ ~(A).
Ril.
R12.
R13.
R14.
R15.
'Vxl ... V'xn~(Ar,) ~ ~(A.XI ... Ax"Ar,), where XI' ... ,Xn are distinct
variables of any type.
R16.
~ V'xa~(x).
One easily verifies that all these rules are verifiably valid.
37
Ai E H, or
Ai is such that => Ai is a rule or
R1 *
Ai = VX I
R2*
R3*
Ak,
Universite de Montreal
38
FRAN<;:OIS LEPAGE
NOTES
REFERENCES
[And63] Andrews, P. B., 1963, 'A Reduction of the Axioms for the Theory of Propositional
Types', Fundamenta MathematiclE LII, 345-350.
[Ben86] van Benthem, J., 1986, 'Partiality and Nonmonotonicity in Classical Logic', Logique
et Analyse 29, 225-247.
[Bla86] Blarney, S., 1986, 'Partial Logic', in Gabbay, D. and Guenthner, F. (eds.), Handbook
of Philosophical Logic Vol. Ill, Reidel, Dordrecht, pp. 1-70.
[Chu4O] Church, A., 1940, 'A Fonnulation of the Simple Theory of Types', The Journal of
Symbolic Logic 5, 56-68.
[Far90] Fanner, W. M., 1990, 'A Partial Functions Version of Church's Simple Theory of
Types', The Journal of Symbolic Logic 55(3), 1269-1271.
[Lep92]
[Lep84]
[Mon74]
[Muss88]
[Qui56]
[Sc073]
[Siu53]
[Tar23]
[Thi87]
[Tic82]
[Urq86)
39
SERGE LAPIERRE
INTRODUCTION
denotes
denotes
denotes
{X I: E: [[AJ] I: X},
{X I: E: [[AJ] n X"* 0},
{X I: E: I[[AJ] n XI > I[[AJ] - XI},
denotes
denotes
denotes
{(X, Y)
{(X, Y)
{(X, Y)
Such binary relations are called (local) binary generalized quantifiers. More
generally, a (global) n-ary generalized quantifier (n ~ 1) is a function Q
which assigns to every non-empty universe E an n-ary relation QE between
subsets of E. From now on we will stick with global binary generalized quantifiers and we shall simply call them quantifiers. In order to define or exhibit
a property of a quantifier Q operating on a universe E. we shall write "QEXY"
instead of "X, Y I: E and (X, Y) E QE". Moreover, for denoting a familiar quantifier having a simple determiner, we shall write the determiner itself in italic.
For instance:
allEXY iff X I: Y;
someEXY iff X n Y "* 0;
noEXY iff X n Y = 0;
all and someEXY iff X I: Y and X n Y
at least halfEXY iff IX n YI ~ IX - YI.
0;
42
SERGE LAPIERRE
the meaning of the determiner "many", we may set manyEXY if and only if
the proportion of Y-individuals in X is larger than the proportion of Y-individuals in the whole universe E; formally:
manyEXY iff
IX n YI/IXI
> IYI/IEI
Since the proportion IYIIIEI decreases when the size of E increases, many in
this sense is not a context-free quantifier. However, most of the natural
language quantifiers are context-free and even logical in the sense that only
the cardinalities IX - YI and IX n YI are relevant for deciding whether QEXY
or not. This is clearly the case for all, some, no, all and some, at least half,
most and many others, From now on, we shall restrict our attention to logical
quantifiers.
As relations, quantifiers have relational properties. For instance, all is
reflexive and transitive, some is symmetric, not all is connected. These
properties can be considered as inferential properties. For instance, symmetry
allows us to infer someYX from someXY. If we consider other conditions,
more inferential properties emerge. For instance, from allXY we may infer
all(X n Z)Y, where Z is any set, because all is downward-left monotonic
(i.e., if QXY and X' !;;;; X, then QX'Y).
The inferential properties of quantifiers can be studied from two opposite,
but complementary perspectives. First, there is the "inverse logic" perspective,
which consists of considering some specific inferential properties and finding
which quantifiers have those properties. The opposite view - the "direct logic"
perspective - consists of considering some specific quantifiers and studying
their inferential behaviours. This paper belongs essentially to the inverse logic,
since our purpose is to summarize some results in the analysis of the inferential properties of some quantifiers.
By its nature, our study depends on two things. First, it depends on a prior
choice of formalism, expressing more or less the relevant inferential properties. For instance, we may decide, as in Section 1, to study only simple
relational properties; in that case a formalism consisting only of atoms of
the form QXY is sufficient. But in order to express more specific properties
of quantifiers, such as the various forms of monotonicity, we need a formalism
allowing in addition Boolean set terms in the argument of the quantifier
relation. This more expressive formalism will be used in Sections 2 and 3 when
we study some "conditional quantifiers", that is to say quantifiers which behave
as conditional relations.
Besides a prior choice of formalism, another important decision concerns
the cardinality of the universes. Must we admit finite universes only, or infinite
universes as well? Some theoreticians would prefer to stick with finite universes. There is an empirical reason for this decision: natural language seems
to require finite models only, infinite ones arising only through philosophical or scientific considerations. But there is also a methodological reason
for the finiteness restriction: it simplifies results and proofs and one notes
that most of the results obtained under the finiteness assumption also hold
43
There are many different formalisms, having more or less expressing power,
which can be used for expressing the inferential properties of a given quantifier. In this section we shall stick with the minimal formalism L syll , which
consists only of elementary formulae of the form QXY, where Q is a quantifier symbol and X, Yare set variables. This formalism is rich enough for
expressing "pure syllogistic" patterns of inference such as:
=>QXX
=> QXY
QXY => QYX
QXY => QXX
QXY => QYY
QXX => QXY
QXY, QYZ => QXZ
reflexivity
universality
symmetry
quasi-reflexivity
weak-reflexivity
quasi-universality
transitivity
44
SERGE LAPIERRE
different elements are represented by two distinct sets (this follows from
reflexivity (x E F(x and the fact that there is no cluster in the resulting model).
Moreover, Qxy if and only if F(x) !: F(y). For, if Qxy, then for any z such
that Qzx, Qzy (transitivity), which means that F(x) !: F(y); conversely, if
F(x) !: F(y), then x E F(y) (by reflexivity), which means that Qxy.
one-one and that Qxy if and only if F(x) !: F(y) and F(x) n F(y) ::F- 0.
Since quasi-reflexivity and weak-reflexivity are both trivial consequences of
reflexivity, it follows from the two previous theorems that Tsyl/(all and some)
C Tsyl/(all).
THEOREM 3. Symmetry and quasi-universality axiomatize the pure syllogistic
theory of no completely.
Proof. We proceed as before, but now we represent the model by a set of
sets with the empty intersection relation by the function F(x) = {{x, y}: not
Qxy}. One easily verifiers that F is one-one. Moreover, Qxy if and only if
F(x) n F(y) = 0. For, if Qxy and x = y, then F(x) = F(y) = 0 (quasi-universality and symmetry) and so F(x) n F(y) = 0. If Qxy and x ::F- y, then any
set in F(x) is distinct from anyone in F(y), and thus F(x) n F(y) = 0.
Conversely, if F(x) n F(y) = 0, then there is no set {x, z} = {y, Z/} E
F(x) n F(y), and this implies Qxy (for, if not Qxy, then not Qxy (symmetry),
45
CONDITIONAL QUANTIFIERS
Its has been pointed out in van Benthem (1984) that there is a striking analogy
between the analysis of binary quantifiers and conditional sentences of the
form if A, (then) B, considered as expressing relations between antecedent
sets and consequent sets of situations. More precisely, given a "relevant"
non-empty universe E of situations, the functor "if" may be analysed as a determiner denoting a binary quantifier on E, say ifE' so that given any two terms
A, B having the extensions [[All, [[Bll ~ E, if A, (then) B is true if and only
if ifE [[All [[B]].
From the inverse logic perspective, several properties, expressing more or
less a priori intuitions of conditionality, have already been suggested. The
central ones may be formulated by the following patterns of inference:
CONS
Cl
C2
C3
R
BE
ifXY ~ ifX(Y n X)
ifXY => ifX(Y U Z)
ifX(Y n Z) => if(X n Y)Z
ifXY => if(X U Z) (Y U Z)
=> ifXX
Replacement of Boolean Equivalents
conservativity
confirmation
reflexivity
iff {X k Y, if X is finite
X - Y is finite, if X is infinite.
Each of these quantifiers determines, in an appropriate formalism, a conditional logic. Obviously, in order to capture most of the fundamental patterns
of conditional inference, especially the above basic M-principles, the minimal
formalism we need consists of conditional formulae of the form ifXY,
where X, Y are set variables or combinations of set variables with parentheses and the operations "n", "U" and "-", to which we give their usual
meanings.
Given the formalism described above, it remains to determine the range
of the admissible sizes of the universes. Many options are available here.
For instance, we may decide to consider either finite universes only, or only
infinite universes of a fixed cardinality. On the other hand, when considering
infinite universes, we may decide to be careful not to become entangled in
higher infinite cardinalities, and so to restrict the range of admissible universes
46
SERGE LAPIERRE
to denumerable ones. Combining these two options, here is a list of the logics
we may consider:
Quantifiers
Logics
Finite universes
all
C
Classical conditional logic
E
Exemplary conditional logic
all or some
at least half
QD Quasi democratic conditional logic
Denumerable universes
all but finitely many
N
Coo
all
all or some
Eoo
at least half
QDoo
Section 3 is about the first three logics. Section 4 is for the most part about
the others in relation with the formers. However, other possibilities, such as
the inferential behaviours of the quantifiers on at most denumerable universes, or on infinite universes of higher cardinalities, will be considered when
they seem relevant.
3.
The logics. C, E and QD and their mutual relationships have been studied in
detail in Lapierre (1991). Figure 1 summarizes these relationships. We will
give a quick proof of each indicated relation and specifications about each
logic involved in the relation.
C
MCEn QD
C'
C'
EUQDCC
QD
Fig. 1.
The logic C, the one of all, is the most inclusive logic in this figure.
Obviously, this logic corresponds, in our restricted formalism, to the logic
of (Ss)-strict implication. So it contains, besides all M-principles, the following
additional ones:
CNT
LM
TRN
CNJ
DSJ
contraposition
left-monotonicity
transitivity
conjunction
disjunction
47
However, it appears that all basic principles of M plus TRN are sufficient
for characterizing this logic.
THEOREM 5. All basic principles of M together with TRN axiomatize C
completely.
Proof To begin with, one notes that the following principles are straightforward consequences of M+ TRN (0 =defX n -X, X being any term):
PI.
P2.
P3.
P4.
if00
ifXO, ifYO ~ if(X U Y)O
ifXO ~ if(X n Z)O
ifXY :::> if(X n -Y)O
Now suppose that ifX1Y1 ... ifXnYn If ifXY in M+TRN. Since only finitely
many terms are involved here, there must be a finite Boolean algebra with
an additional binary relation Q interpreting if which verifiers every premise
and refutes the conclusion. (Note that though Boolean terms denote elements
of this Boolean algebra by means of some basic assignment from set variables,
these elements are not necessarily sets.) As usual, this Boolean algebra may
be represented isomorphic ally as a power set algebra and under this representation, Q becomes a binary relation between sets, having the following
properties (where X, Y, Z are any subsets of the set B of atoms of our Boolean
algebra):
PI'.
Q00
P2'.
P3'.
P4'.
Q*0
Q*X, Q*Y ~ Q*X U Y
Q*X ~ Q*X n Y
(from PI')
(from P2')
(from P3')
Let K = U {X: X E Q*}. Then Q*K by P2* and the fact that B is finite.
Now the mapping F = AX. X n -K is a Boolean homomorphism from our
power set algebra into another one over the smallest base set B n -K.
Moreover, Q* restricted to !fi>(B n -K) contains only the empty set; thus for
all X, Y ~ B n -K, Q*X n -Y if and only if X ~ Y. It remains to verify
that none of the previous relations is affected by this restriction.
Let X, Y ~ B and suppose that Q*X n -Yo Then Q*(X n -Y) n -K by
P3*, that is to say Q*F(X n -Y). Conversely, suppose that Q*F(X n -Y).
48
SERGE LAPIERRE
Since Q*K, we have that Q*K U F(X n -y), by P2*. But by Boolean identity,
K U F(X n -Y) = K U (X n -y), and thus Q*K U (X n -Y). So by
P3*, Q*(K U (X n -Y) n (X n -y), and therefore Q*X n -Y, by Boolean
identity.
We may now return to the relations pictured in Figure 1. First, the assertion
that E U QD C C contains the following non-immediate propositions.
E U QD ::F- C: this follows from the fact that neither all or some nor at least
half validates CNJ.
E ~ C: let ijX\Y\, ... , ijXnYnlijXY be any inference refuted by inclusion
ijXY,
ijlX,
ijlX,
iflX,
Z) cautious conjunction
conditional symmetry
transmissibility of necessity
conditional weakening of the
antecedent
However, it is easy to verify that all these principles, including DSJ, are
derivable from all basic M-principles plus CCNJ. Though that does not mean
that this set of principles axiomatize E completely, it is a very likely conjecture at this stage.
The assertion that E q;, QD and QD q;, E is established by the following
two facts. First, as we pointed out, DSJ is valid according to all or some.
49
However, OSJ is not valid according to at least half, witness the following
QD-counter-example:
Secondly, it has been established in van Benthem (1986) that at least half
validates the following principle (where X A Y abbreviates the symmetric
difference (X n -Y) u (Y n -X:
if(X A y)Y, if(Y A Z)Z => if(X A Z)Z.
PA
But PA is not valid according to all or some, as the following E-counterexample indicates:
In order to establish that M C E n QD, it is sufficient to show that M ::FQD, since both E and QD include M. This is quite simple since we
easily verify that the following principle is valid according to both all or
some and at least half without being derivable in Malone:
COSJ
if(X
conditional disjunction
Another way to see this is to consider the class of the quantifiers all but at
most n (for any n = I, 2, 3, ... ): all of them validate all basic M-principles,
but none validates COSJ.
4.
Now let us consider denumerable universes. It is not very surprising that the
logic of all as well than the logic of all or some do not change on these
universes. We give here the proofs of these identities, which will be useful
in some forthcoming demonstrations.
50
SERGE LAPIERRE
Incidentally, this latter result about the inferential behaviours of all and all
or some can be generalized to at most denumerable universes as well as to
infinite universes of higher cardinalities.
With at least half, matters change, as the following theorem indicates.
THEOREM 7. QD(o) C E.
Proof. DSJ is an E-principle which is not QDoo-valid, and thus QD oo "t:- E.
(To see that DSJ is not QDoo-valid, consider the QD-counter-example of Section
3, and add countably many new situations outside the three relevant sets.
The results is obviously a QDoo-counter-example.) However, every QDoo-valid
inference is E-valid too. Indeed, let ijX1Y1, ... , ijXnYnlijXY be any inference refuted in some E-model. For each 1 ~ i ~ n, select exactly one Xj n
Yj-situation (if there is any) and add countably many copies of this situation,
behaving in exactly the same way with regard to (non-}membership of the
relevant sets. Clearly this new model is denumerable. Moreover, for every
I ~ i ~ n, either there is no Xj n -Yj-situation, or there are countably many
Xj n Yj-situations, which means in both cases that there are no more Xj n
-Yj-situations than Xj n -Yj-situations. On the other hand, there are more
X n -Y-situations than X n -Y-situations, since there was no X n Y-situa-
51
tion at all, but at least one X-situation in the former model, which is still the
q;,
E (Section 3).
52
SERGE LAPIERRE
Now, note that the situation is different for at most denumerable universes,
since every QD-invalid inference is also an invalid inference according to at
least half on finite or denumerable universes.
The logic N (the one of all but finitely many) is interesting. It contains
CNJ, which distinguishes it from both E and QD. But unlike C, none of LM,
CNT and TRN is valid according to this logic. (Too see that LM, for instance,
is not an N-principle, consider the numerical model where Y = the set of
even numbers, X = Y U {I} and Z = {I}; clearly, all but finitely manyXY
but not not all but finitely many(X n Z)Y.) Thus, N seems to be a good candidate for a counterfactuallogic. Let us compare it with the Basic subjunctive
logic S of Burgess (1981), which is completely axiomatized by R, CNJ, DSJ
and the following two additional principles:
SIMP
CLM
simplification
cautious left-monotonicity
The following two results, due to van Benthem (1986), will be useful:
(i) S is precisely M+CNJ (all derivations are straightforward in both
ways);
(ii) S is the many-premise fragment, in our formalism, of the full counterfactuallogic of Lewis (1973). This means that that S is sound and complete
with respect to comparative similarity models.
Now, since all basic M-principles and CNJ are N-principles, it follows
from (i) that N ~ S. But N does not exactly coincide with S. Too see this,
consider the inference from ijl-J(, ifXY to ij(X n Z)Y. It is clearly N-valid,
since ifl-J( means here that there are only finitely many X-situations, and in
this case the inference from ifXY to if(X n Z)Y is validated by inclusion.
On the other hand, this inference is not S-valid, as indicates the following comparative similarity model (comparative similarity is distance and world 3 is the
vantage world):
z
y
3
53
As we see, if(X n Z)Y does not hold in world 3 (i.e., there is a "closest"
X n Z-world, namely world 1, which is not a Y-world). However, both ifl-,X
and ijXY hold in world 3 (in particular, there is only one "closest" E-world
from world 3, namely world 3, and this world is not an X-world). So, the inclusion of S in N is proper.
Where is the location of N in the scheme of Figure I? Here is the picture,
the proofs will follow.
('
EU QDCNCC
MCEnQD
('
QD
Fig. 2.
and thus N:F- S. But every inference which is C-invalid is N-invalid too. Indeed,
let ijXjYj, ... , ifXnY jifXY be any inference refuted by inclusion in some finite
model, and thus (Theorem 6) in some denumerable model. Then in this model
all premises are verified according to all but finitely many, because they are
verified by inclusion. On the other hand, if there are countably many X n
-Y-situations, then there are also countably many X-situations, and so the
conclusion is already refuted according to all but finitely many. If there are
only finitely many X n -Y-situations and that the set of X-situations is denumerable, add countably many copies of any X n -Y-situation, behaving in
exactly the same way with regard to (non-)membership of the relevant sets.
Again this procedure does not disturb any of the previous relations, but now
there are countably many X n -Y-situations - and thus we have an N-countermodel for the same inference.
The second non-immediate assertion to consider is E U QD C N. First,
E U QD :F- N, since CNJ is an N-principle but neither an E-principle nor a
QD-principle. It remains to establish that both E and QD are included in N.
In order to do this, we need this
THEOREM 11. Every inference ifXjY j, ... , ifXnYjifXY which is refuted
according to all but finitely many in some denumerable model where X n
- Y is denumerable is also refuted in some finite model according to inclusion (all).
Proof Let ijXjY j, ... , ifXnY jifXY be any inference refuted according
to all but finitely many in some denumerable model where X n -Y is denumerable. Consider the homomorphic sub-model consisting only of all nonl) Xj n -Yrsituations, behaving in exactly the same way with regard to
I
54
SERGE LAPIERRE
QD",
(ii)
QD
E",
E",
n N;
n N.
QD",
(iv)
QD
E
E
n N;
n N.
55
All these inclusions are proper. For, DSJ is both an E-principle and an Nprinciple which is neither QDoo-valid nor QD-valid, and thus QDIiJ #. EIiJ n
N = E n Nand QD #. EIiJ n N = E n N.
Many questions are still unanswered. For instance, where is the location
of QDIiJ in the scheme of Figure 2? Theorem 7 gives us a partial answer,
which motivates us to ask whether E n QD C QD oo . Along another line,
there is the issue of the inferential behaviours of our conditional quantifiers
in a more expressive formalism allowing logical combinations between conditional formulae, such as:
CEM
CV
ALT
ifXY V ifX -Y
ifXY, -,ifX -Z => if(X n Z)Y
ifX(Y U Z) => ifXY V ifXZ
A CATEGORY-THEORETIC APPROACH
TO ARISTOTLE'S TERM LOGIC,
WITH SPECIAL REFERENCE TO SYLLOGISMS
INTRODUCTION
When Aristotle invented logic, what he invented was a logic of terms. The
Stoics replaced Aristotle's term variables with propositional ones, and with
that propositional logic was born (see [16]). For a long time term logic and
propositional logic existed together. For example, William of Ockham [21]
devoted the first part of his Summa logicae to terms and the second part to
propositions. Perhaps it was Kant who was responsible for the emphasis on
propositional logic at the expense of term logic. For where Aristotle had
categories of objects and attributes, closely related to the grammatical categories of terms that normally denote them, Kant had categories of concepts.
Kant, however, derives categories of concepts from categories of judgments;
that is, from categories of propositions. With the move to categories of judgments, term logic in anything like Aristotle's sense drops from view. In this
Frege follows Kant and so does what is now called "classical logic". (These
remarks were inspired by a comment of F. W. Lawvere.)
In the exclusive pursuit of classical logic important logical problems
are neglected. Elsewhere we have studied problems of negation in natural
languages ("unhappy" versus "not happy") and problems of identity (the celebrated ship of Theseus) and we shall not repeat the discussion here (see [13]
and [14]). These problems show the special relevance of term logic for the
study of cognition; for the semantics of natural languages and cognition are
of a piece. Not that one can read the forms of cognitive operations
automatically off grammatical form, as perhaps some ordinary-language
philosophers may once have imagined; but that if a cognitive operation is
demanded for the interpretation of a non-technical expression of natural
language, then that operation must be one that is readily available to the human
mind. It seems that no operation can exist in the non-technical part of a natural
language that is not available to the untutored human mind. Since many of
these operations cannot be captured in classical logic with its set-theoretic
models but require the use of category theory, we propose the following parallel
Categorical logic
Calculus
Cognition
Dynamics
57
M. Marion and R. S. Cohen (eds.), Quebec Studies in the Philosophy of Science I, 57-68.
@ 1995 Kluwer Academic Publishers.
58
so categorical logic provides the language best adapted to expressing the theory
of cognition and the main mathematical tool for exploring cognitive structures and operations. (For this claim see [10].)
Among the examples which illustrate term logic, which is not in general
Boolean, we develop at length only the theory of syllogisms. The first section
discusses two problems about syllogisms. Section two introduces the standard
class interpretation and shows that it is unable to solve the two problems. In
section three we present a particularly central component of term logic, the
logic of kinds. In section four we apply this logic to syllogisms, delimiting
conditions under which syllogisms are valid. In section five we compare
the category-theoretic approach with the one that has for a considerable time
been offered as the standard approach, namely the class interpretation of
syllogisms. Some concluding remarks bring our paper to an end.
1.
59
"Red" in the first premiss is an adjective sorted by "fire engine" and as such
it picks out a set of fire engines (the red ones). In the second premiss, "reds"
is the plural form of the eN "red" which refers to the kind RED (consisting
of person belonging to a certain political group). It follows that the putative
inference, made plausible by an unmarked change in grammatical category
associated with the change in grammatical role, is spurious.
Even if we lay such obvious miscarriages of validity aside, we still have
a problem. Take the perfectly unexceptionable syllogism in Barbara
All dogs are animals
All animals are mortal
All dogs are mortal
What is the logical connection between "to be an animal" which in the
first premiss denotes a predicate of "dogs" and "animals" in the second premiss,
which denotes a kind in its own right, not a predicate of "dogs"? What explains
the obvious validity of such a syllogism?
Perhaps to avoid such difficulties some writers have placed restrictions
on the class of natural language expressions that can be substituted for
Aristotle's schematic letters. Lukasiewicz [17] rules out proper names and
adjectives. Yet as Mulhern [20] points out, Aristotle gives numerous examples
of syllogisms in which proper names and adjectives feature. Elsewhere we have
dealt with syllogisms in which proper names and adjectives in both subject
and predicate position occur, delimiting the circumstances under which they
are valid (see [12] and [15]). Here we shall leave these aside and handle the
difficulties in "normal" syllogisms only.
2.
We have indicated two problems for syllogisms. Let us see how the standard
class interpretation handles them. But first a word on what the class interpretation is. The idea of the class interpretation is to assign to the count
nouns and predicables of a syllogism non-empty subsets of a supposed universal kind THING. The members of this kind are sometimes conceived as bare
particulars, that is attribute-free supports for attributes. Thus "baby" is interpreted as the subset of things that have the property of being a baby; "person"
as the subset of things that have the property of being a person; and the relation
between BABY and PERSON is just one of set-theoretical inclusion. Similarly,
"big" is interpreted as the subset of things that have the property of being
big.
Neither of the problems pointed out above seems to arise in this interpretation. There seems to be no problem relating to the sorting of predicables,
since predicables are not sorted or rather, what amounts to the same, are sorted
by the unique sort "thing". The problem of switching between a predicable and
a sentence subject seems also to have disappeared, since both sentence subjects
60
61
Since proper names (PNs), count nouns (eNs) and predicables are interpreted
in relation to kinds, we begin with the logic of kinds. We do not here consider
syllogisms that include mass nouns (like "water") or abstract nouns (like
"beauty" or "justice"). Readers seeking more detail on the logic of kinds should
consult [11].
3.1. Situations and Kinds
62
3.2. Predicates
As we have seen, eNs such as "dog" and "animal" are interpreted as kinds
and relations between eNs of the form "a dog is an animal" as un underlying map between the corresponding kinds. However, we can be a bit more
explicit. We will see, by means of an example, that eNs themselves (in a given
"universe of discourse") constitute a preordered set and thus a category, }{,
the nominal category of the given universe. Its role is to act as a "blueprint"
to organize the eNs in such a way as to indicate the connection between the
kinds interpreting these eNs.
Assume that, like Aristotle, we are organizing and developing a given
subject, say Zoology. In Zoology we have eNs ("mammal", "whale",
"fish", "animal", etc.) and predicables ("having a heart", "breathing air", "being
a mammal") which combine with eNs to make significant sentences.
Furthermore, we have relations between eNs described by sentences of the
type "a dog is an animal", "a whale is a mammal", which we assume as
postulates. These postulates impose constraints on the interpretation of these
CNs, constraints which are partly conventional and partly empirical and are
subject to revision. Recall that for a long time people accepted that a whale
was a fish.
The CNs and their relations constitute a category }{ under the following
63
H:
We noted that the supposed universal kind THING is logically ill founded.
Partly to replace this notion we propose a new notion of ENTITY relative to
a system of bona fide kinds, a notion that is sensitive to ordinary practices
in the interpretation of natural language expressions.
For example, an airplane may transport fathers, mothers, husbands, wives,
women, stewardesses, crew members, etc. To calculate the number of seats
occupied, we do not simply add the number of fathers plus the number of
mothers plus the number of husbands plus the number of . . . , since a
particular woman, say, may also be a crew member. Similarly, a particular wife
may be a mother. We need to describe these identifications systematically.
We assume that all of these kinds are organized in a system of kinds. In our
example, a stewardess s is identified both with a woman u(s) and with a
crew member yes), where u and v are the underlying maps of the kind STEWARDESS into WOMAN and into CREW MEMBER, respectively, corresponding
to the expressions "a stewardess is a woman" and "a stewardess is a crew
member". This forces the identification of u(s) with yes), i. e., the identification of a certain woman with a crew member, even if there are no underlying
maps between WOMAN and CREW MEMBER. Similarly, a certain wife and
a certain mother could each be identified with the same woman, forcing their
identification. Notice that, whereas the first identification takes place by means
of a "backward" move, the other takes place by means of a "forward" move
(these terms referring to the direction of the movement along the underlying
maps in the system). Proceeding in this way, we obtain the required relation
of identification relative to our system of kinds. This relation turns out to be
an equivalence relation.
The ENTITY relative to our system of kinds is the set E of equivalent classes.
In our case this kind is the interpretation of "person (in the airplane)".
Forming the entity is a particular example of the notion of colimit of a
functor in category theory (see [12] for details). This shows that the operation of forming the colimit is one that is a perfectly natural one for the human
mind. At times the language may fail to contain a single lexical item to express
the colimit. Take bent-wood chairs, kitchen chairs, chairs, dining tables, card
tables, tables, etc. The lexical item that covers them all is "furniture". But
64
"furniture" is a mass noun and does not denote the colimit of the system.
Nevertheless, there is "article of furniture" that does. Other such expressions
are "head of cattle", "articles of clothing", and "items of information". Many
languages have a much more extensive set of such "markers" ("article", "head",
"item") than English.
3.5. Coincidence Relation
65
of a predicate being functorial is easy to grasp but not always easy to compute
with. For purposes of computation, the equivalent notion of an extensional
predicate is often more convenient. We say that a predicate <I> of a system of
kinds is extensional if and only if given two kinds in the system and one
member in each, either both members have the property <I> at U or both fail
to have the property <I> at U, provided that the members in question are coincident at U. The equivalence of these notions is proved in [12].
We mention that there already exists, in the literature the notion of a predicate being "inter-secting" or "absolute" (see [9]). This notion turns out to be
a very special case of a predicable being functorial/extensional (see [12]).
3.7. Grammatical Change
As we mentioned, some predicables derive from CNs and PNs and we need
to specify the semantical connection between the interpretation of CNs and
PNs on the one hand and that of derived predicables on the other.
Let us start with a system of kinds and assume that "puppy" is a CN in
the nominal category. The interpretation of the predicable "to be a puppy" is
obtained as follows: for each kind A we define a predicate TO BE A PUPPYA
by stipulating that for each a in A and any situation U, a has the property of
being a puppy at U iff there is a puppy which coincides with a at U. Notice
that a member of a kind in the system, a dog say, may have the property of
being a puppy at U, but may fail to have it at V.
The following is fundamental for validity of syllogisms:
PROPOSITION 3.7.1. The interpretation (All)IlEIXI of the predicable "to be
an A" derived from the CN A is an extensional predicate.
4.
INTERPRETATION OF SYLLOGISMS
We now specify the notion of validity for a syllogism. In the literature there
are two notions of validity, depending on how syllogisms are considered: either
in terms of axioms, with Lukasiewicz [17]; or in terms of rules of inference,
with Corcoran [4]. Since validity based on axioms implies validity based on
rules of inference, we will prove validity of syllogisms based on axioms.
Our interpretation makes essential use of the relation of "forcing" between
a situation U and a sentence 0 which we write U II. . 0. Intuitively this relation
expresses the idea that U is a truth-condition of 0 (as explained earlier in
the paper for sentences such as "John runs", see Section 3.2).
Since we do not plan to give a detailed treatment, we shall give one example:
Take the celebrated syllogism in Barbara
All Greeks are men
All men are mortal
All Greeks are mortal
66
We first notice that we here have two eNs: "Greek" and "man" and one
predicable: "to be mortal". The first eN occurs only in subject position in
the syllogism. The second eN changes its grammatical role from subject
position in the second premiss to predicate position in the first. On the other
hand, the predicable "to be mortal" is sorted by "man" in the second premiss
and by "Greek" in the conclusion. We have here an example of the two
problems discussed in the introduction.
Let j{ be a nominal category containing, among its objects, "Greek" and
"man" and let 1: j{ ~ X be an interpretation whose values are inhabited
kinds (i.e., kinds having at least one member). To simplify the language, we
let GREEK = I("Greek") and MAN = I("man").
Since both premisses and the conclusion are of the same form, our
first question is how to interpret "All A are '\If", where A is a eN and '\If is a
predicable?
We recall from Section 3.1 that the interpretation of sentences should be
a downward closed set of situations, i.e., the set of truth conditions should
be a downward closed set of situations. We say that U forces "All A are '\If"
or, equivalently, that U is a truth-condition of "All A are '\If" iff for every a
in A and every V::; u, V forces "a has the property '\If", i.e., a has '\If at V.
We define a syllogism to be valid if every truth-condition of both premisses is a truth-condition of the conclusion.
Let us check the validity of our syllogism. Let U be a situation which is
a truth-condition of each premiss and let g be a Greek. Using the first premiss
and the definition of the interpretation of a predicable derived from a eN, there
is a man m such that, at U, g coincides with m. Using the second premiss, m
has the property MORTAL MAN at U. We want to conclude that g has the property
MORTALGREEK at U. But here we get stuck, unless we know that the interpretation of the predicable "to be Greek" is extensional, in which case we
obtain the desired conclusion. But "to be Greek", being derived from the
noun "Greek", is extensional and the syllogism is valid.
This notion of forcing is just Kripke's forcing and is a particular case of
forcing in a topos [19].
As we mentioned before, Aristotle also considered "deviant syllogisms",
namely syllogisms with predicables in subject position. We shall leave them
aside. The interested reader is advised to consult [12] for details. Aristotle also
considered syllogisms in which PNs occur. For a study of these, the reader
is referred to [15].
5. THE CLASS INTERPRETATION REVISITED
Our work should not be taken as refuting the class interpretation, but as
delimiting its validity, as we show in this section. In fact our theory yields
the class interpretation under rather special circumstances.
The idea is simple enough. Assume that we have a system of inhabited kinds
and a predicable. As an example take the system involving DOG, POODLE,
67
etc. and the predicable "male". We define MALE, the interpretation of the
predicable "male" as a predicate of ENTITY (which we assume to be the
kind ANIMAL) by stipulating that an animal a is male at U iff there is a member
of a kind in the system (for instance a dog) that is identified with a and is
male at U. Of course, kinds themselves give rise to extensional predicates
(as shown in Section 3.7) and we obtain the class interpretation for terms
that are "homogeneous with respect to their possible positions as subjects
and predicates" as Lukasiewicz requires.
Although this definition does not require the interpretation of the predicable
to be functorial, several pseudo syllogisms would be validated if this restriction were not imposed, as we pointed out in the introduction.
CONCLUDING REMARKS
68
5.
6.
7.
8.
9.
10.
II.
12.
13.
14.
15.
16.
17.
18.
19.
20.
21.
22.
23.
J. LAMBEK
INTRODUCTION
In spite of the regrettable divorce of mathematics from philosophy, I am convinced that these two disciplines can learn from one another. In collaboration
with Phil Scott (1986) and Jocelyne Couture (1991), and again in (1993), I
have argued in favour of a nominalistic interpretation of mathematics, suggesting that the world of mathematics may be constructed from the language
of mathematics. Although this view has by no means been accepted by the vast
majority of mathematicians, if they care about foundations at all, it was
intended to minimize the conflict between different traditional philosophies
of mathematics.
I would like to argue here that nominalistic interpretations can be extended
from the language of mathematics to natural languages. This idea is certainly
not new; it has been held for centuries by various philosophers, starting with
Philo of Alexandria, and more recently by linguists and anthropologists; but
I hope that at least some of my arguments have not been seen before. I shall
invoke some linguistic insights, which I arrived at while attempting to write
a mathematically sound production grammar for English, an ongoing project
which many never see completion. My claim that the structure of our world
is greatly influenced by the structure of our language will be illustrated by
episodes in the early history of mathematics, before its lamented divorce
from philosophy (see Anglin and Lambek, 1995). Unfortunately, I do not
possess a technical knowledge of classical Greek and my speculations will
be based on the assumption that there are significant parallels between ancient
Greek and modem English grammars.
2.
MATHEMATICS
Let me begin with the question: what is the language of mathematics about?
Most people would say that numerical expressions such as 1 + 1 are about
* This research was supported by the Social Sciences and Humanities Research Council of
Canada.
69
M. Marlon (lIf{,[ R. S. Cohen (eds.), Quebec Studies in the Philosophy of Science I, 69-78.
1995 Kluwer Academic Publishers.
70
J. LAMBEK
71
NATURAL LANGUAGES
It would seem that the nouns uncle, glass and water denote entities which exist
quite independently of our language, and even the verb form drank denotes
a past action of some extralinguistic status. A little reflection, however, will
show that the way these entities are structured is entirely language dependent, even though some of them may be made out of molecules or, according
to the latest scientific theory, out of quarks and electrons, and others may
consist of events in space-time.
Let us begin by looking at the noun uncle. This denotes a binary relation,
as in "A is an uncle of B", meaning that "A is a male sibling of a parent of
B or the male spouse of a sibling of a parent of B". To simplify the discussion, let us forget about unclehood by marriage and concentrate on the first
part of the definition: A is a male sibling of a parent, say C, of B. Nothing
is being said about the sex of C; however, in many languages it is necessary
to specify whether C is male or female, as was the case in medieval English.
In some languages, e.g. Hindi (Bhargava and Lambek, 1983), when C is male
we must also specify whether A is older or younger than C. In such a language
72
J. LAMBEK
there are three words where the single word uncle will do in modern English;
the world has been partitioned more finely than in our language.
On the other hand, there are languages in which no distinction is made
between a mother's brother and her father. In fact, our word uncle is derived
from Latin avunculus, meaning "little (maternal) grandfather". Similarly, our
word nephew is derived from a word meaning "grandson". Even in the world
of Hindi, some partitions are less fine than in ours; for example Hindi does
not distinguish between siblings and cousins.
This is not to say that we cannot translate between Hindi and English. Thus,
Hindi mama can be translated into English "maternal uncle" and English
brother can be translated into a compound Hindi phrase which excludes cousinhood. The point is rather that sometimes Hindi sees a simple structure
where English sees a compound one, and sometimes it is the other way
round.
Next, let us look at the word drank, expressing the past tense of the verb
drink. At first sight, drink seems to refer to the process of consuming a liquid,
whereas we usually eat a solid. But even British English and American English
partition the world differently here. Americans eat soup and drink tea, while
the British drink soup and eat tea, at least when it consists of cucumber sandwiches.
On the other hand, British and American scientists agree that glass is a
liquid, contrary to common sense. Yet, when we drink a glass of water, we
do not consume the glass at all, the glass is merely a vessel for containing
or measuring the substance water. Glasses can be counted, but water must
be measured, a distinction we shall take up in the next section.
The fact that drank is the past tense of drink refers to the passage of time.
In English such reference is obligatory, whereas in some languages, e.g. in
Chinese, it is not necessary to refer to the time when the drinking took place
at all. According Parmenides, the flow of time belongs to the world as it
appears to us humans; to the "goddess" past and future are all one. Presumably,
the goddess would feel more comfortable with Chinese than with English.
In a later section we shall consider the fact that the verb drink denotes a process
and not a state or a causal action.
4.
NOUNS
Not counting names and pronouns, which might be marginally included under
the umbrella of nounhood, English has three kinds of nouns, namely
count nouns:
mass nouns:
plural nouns:
pants, ....
The plural nouns, of course, include also the plurals of count nouns: houses,
oxen, peas, deliveries, .... One way to tell mass nouns from count nouns
73
is that they don't have a plural; another is that they don't allow an indefinite
article. Notice also
74
J. LAMBEK
and even claimed that all things are numbers, meaning natural or rational
numbers, thus embracing a reductionist philosophy, reducing all science to
mathematics. Not surprisingly, this extremist position faced criticism even in
his life time. According to one anecdote, Pythagoras was challenged thus:
"If everything is number, what about friendship?" Pythagoras was said to
have replied with a pun: "A friend is to me as 284 is to 220." This pair of
numbers was called "amicable", a technical term meaning that each is the
sum of the proper divisors of the other.
A more serious challenge came later, when his disciples discovered that
{2 is not a number in the sense of Pythagoras; translated into modem terminology, that it is irrational. This discovery brought about a victory of those
who championed measuring over counting, of geometry over arithmetic. It thus
came about that geometry dominated Greek mathematical thinking.
Actually, Eudoxos, a pupil of Plato's, discovered that numbers, meaning
rational numbers, would do after all. He pointed out that two real numbers,
namely ratios of geometric quantities, could be compared by comparing the
sets of rational numbers below each of them. He thus anticipated the modem
definition of real numbers by Dedekind.
5.
VERBS
Leaving aside auxiliary verbs for the moment, we also find three classes of
verbs in English. Before discussing these, let us look at the corresponding
situation in Latin (Lambek, 1979), where verbs belonging to three distinct
classes are usually linked semantically and morphologically in triples.
A typical example is provided by the triple amo, amavi, amor. Although
these words are usually regarded as parts of the same verb, we may also
think of them as three separate verbs, sharing the same root and having an
element of meaning in common. Each of these three verbs is equipped with
a complete 5 by 6 conjugation matrix, consisting of 30 forms made up from
5 simple tenses and 6 persons. One usually refers to amavi as the perfect
and amor as the passive of amo. (There is also a perfect passive, but this
has to be expressed as a composite made up from the past participle and the
auxiliary verb sum.) Not all triples are complete; intransitive verbs lack a
passive and a few verbs such as memini and nascor possess only the second
or third component respectively.
In English, there is a similar, though less obvious trichotomy. A typical
example is the triple: know, learn, teach. Here we have no common root, yet
three related meanings. We say that know refers to a state, learn to a process
and teach to a causal action. The analogy with Latin is not precise. We may
think of learn as kind of passive of teach and know as the perfect of learn;
yet is seems more natural to take know as basic and to regard teach and learn
as its causative and process forms respectively. (There are languages, e.g.
Hebrew, where a causative may be formed by a systematic morphological
variation, as is the case for the passive in Latin).
75
process
action
be
be dead
sit
have
know
become
die
sit down
get
learn
make
kill
seat
give
teach
is quite systematic. For example, I have a cold or I get a cold may be transformed into you give me a cold or you give a cold to me, while similarly I
know English or I learn English may be transformed into you teach me English
or you teach English to me.
We may think of learn and teach as being derived in meaning from know.
Thus,
learn
teach A
=
=
get to know,
make A know.
While teach incorporates the notion of causation, make expresses pure causation. The verb make allows one to transform any state verb or process verb
into an expression denoting causal action. It is not the only verb to do this,
76
J. LAMBEK
let, help and have perform a similar function, though with different emphasis.
When we say
A makes, lets, helps or has B come,
One suspects that there is here a kind of subconscious belief or folk philosophy, according to which the act of seeing brings about the event which is
seen. In fact, the Greek philosopher Empedocles, renowned for having proved
experimentally that air is a substance, proclaimed that light emanates from
the eye. More recently, the belief that perception is a kind of causation was
crucial to the philosophy of Berkeley.
I will skip discussing the auxiliary verbs be and have, although it could
be argued that the former plays a prominent role in the philosophy of
Heidegger. But I will say a few words about the modal verbs:
shall, will, can, may, must,
The modal verbs differ from other verbs not only syntactically, by the contexts
in which they appear, but also morphologically, by the fact that the third person
singular does not end in s. Semantically, the modal verbs carry all that English
is capable of saying about the future.
In Latin and many other languages, tenses impose a linear order on time;
but English seems to assume that, while the past is determined, the future is
not: there is one past but many possible futures. This view is reflected in
the recent philosophy of Storrs McCall. In Victorian times, if one wanted to
express the future, one had to make a choice between shall, which carried
a sense of obligation, and will, which carried a suggestion of desire. Not
surprisingly, in our more permissive times, shall is disappearing. There is a
popular attempt to restore a linear future by the introduction of a new modal
verb gonna.
If time is linear, there can be no causality, as was pointed out by Hume.
Physics seems to be in two minds about this question. While the notion of
causality plays no explicit role in the description of nature by mathematical
77
equations, classical physics allows for a number of possible futures, from which
one is selected by the principle of least action. Quantum mechanics achieves
the same end by invoking the collapse of the wave function.
6.
CONCLUSION
78
J. LAMBEK
NOTES
I
This relation, usually called "synonymity", is not as easily made precise as the relation of
provable equality in mathematics. Roughly speaking, one would wish to call strings rand t:.
of English words synonymous if any sentence containing r is known to imply and be implied
by the corresponding sentence containing t:.. There are problems with this attempted definition,
which may conceivably be overcome by putting restrictions on the contexts in which r appears.
Anyway, the notion of synonymity will play no explicit role in this discussion.
2 In some languages, mass nouns predominate and require a so-called "classifier" before being
modified by a numeral. For example, in Indonesian one must say five tail {of] cow, much as
we may say five head of cattle.
3 Many linguists insist that state verbs do not normally admit a continuous form, as in *he is
knowing her. This is so for know (except in India), but not for sit; thus our class of state verbs
is larger than the usual one. Zeno Vendler, in an important article, uses "state" in this narrower
sense. In place of our trichotomy, he speaks of four "time schemata": states, activities, accomplishments and achievements.
REFERENCES
Anglin, W. S. and Lambek, J., 1995, The Heritage of Thales, Springer-Verlag, New York.
Bhargava, M. and Lambek, J., 1983, 'A Production Grammar for Hindi Kinship Terminology',
Theoretical Linguistics 10, 227-245.
Couture, J. and Lambek, J., 1991, 'Philosophical Reflections on the Foundations of Mathematics' ,
Erkenntnis 34, 187-209.
Gillon, B. S., 1992, 'Towards a Common Semantics for English Count and Mass Nouns',
Linguistics and Philosophy 15, 597-639.
Lambek, J., 1979, 'A Mathematician Looks at Latin Conjugation', Theoretical Linguistics 6,
221-234.
Lambek, J., 1993, 'Are the Traditional Philosophies of Mathematics Really Incompatible?', Math.
Intelligencer 15, 5~2.
Lambek, J. and Scott, P. J., 1986, Introduction to Higher Order Categorical Logic, Cambridge
U. Press, Cambridge.
Lawvere, F. W., 1972, Introduction to Toposes. Algebraic Geometry and Logic, Springer LNM
274, 1-12.
Macnamara, J., 1991, 'Linguistic Relativity Revisited', in R. L. Cooper and B. Spolsky (eds.),
The Influence of Language on Culture and Thought, Mouton de Gruyter, pp. 45-60.
McCall, S., 1994, A Model of the Universe, Clarendon Press, Oxford.
Vendler, Z., 1957, 'Verbs and Times', The Philosophical Review 66, 143-160.
JEAN-PIERRE MARQUIS
You've got to
Accentuate the positive,
Eliminate the negative,
Latch on the affirmative,
Don't mess with Mr. In-Between.
(Arlen and Mercer, 1944)
In classical logic, truth and falsity are highly symmetric and this symmetry
is captured by the negation operator which "transforms" truth into falsehood
and vice-versa. Moreover, the negation operation represented in the semantics by a unary operator on the two-element Boolean algebra is lifted to the
higher-order operation of complementation on sets. However, if we abandon
bivalence, then symmetry is problematic. For one thing, we are forced to
distinguish between not-true, which is a truth-value, from not being true, the
complement of the singleton set consisting of the truth, which is not the false
and not even a truth-value. In classical logic, these two notions collapse into
one. Once they are distinguished, their relationships have to be settled and
in particular the links between complementation and negation have to be
clarified. The purpose of this paper is to explore some of these relationships
in a specific context, namely topos theory.
After a short historical survey on the question in the context of many-valued
logic, we will move to a more general framework where the principle of
bivalence is commonly false and therefore without further ado dismissed: topos
theory. Moreover, in a topos, the passage from an object to the object of its
subobjects is well regimented and thus in particular the passage from the structure of truth values to the structure of collections of truth values is also
well-regimented. Thus, it provides an interesting case study of the relationships one obtains between truth, not-true, not being true, falsehood, not being
false, etc. a topic which goes back to Aristotle. 1 We should immediately point
out that our purpose is not to establish new and profound results in topos theory.
Rather, our goal is simply to point to some interesting asymmetries which arise
in this case and these asymmetries can be exhibited directly in a simple manner,
without any complicated calculations. The main point of this paper, if there
is one, is conceptual, not technical.
Topos theory is particularly appealing for additional reasons: (1) it is wellknown that a topos is "the same" as a (intuitionistic) type theory strong enough
to develop a large part of classical mathematics; furthermore the language
contains terms for truth and falsity, which are however propositional types
and not predicates; (2) every topos contains a subobject classifier which is,
79
M. Marion andR. S. Cohen (eds.), Quebec Studies in the Philosophy ojScience 1,79-94.
1995 Kluwer Academic Publishers.
80
JEAN-PIERRE MARQUIS
-,p
-p
T
F
N
F
T
F
T
T
The first negation is called the "internal negation" and the second is called
the "external negation". Both are truth-functions and simply disagree on their
value for p = N. The external negation can be interpreted as being the characteristic function of the set "not being true", whereas the internal negation
can be thought of as the characteristic function of "not-true". However, in
the latter case, there is another map which could also qualify as being the characteristic map of "not-true": it differs from the internal negation in that it sends
N to F instead of sending it to N. Thus, one and the same subset of our set
of truth-values can have two distinct characteristic maps and hence there is
no bijection between the subsets of our truth structure and the characteristic
functions.
This is typical of the many-valued approaches: the structure of subsets of
the truth structure is never considered. Indeed, one has the feeling that the
classical Boolean structure is available, in other words that all maps from
the structure to itself are available, and thus all subsets of the truth-structure
are equally important and accessible and that none play any crucial role. This
81
is rather odd. If the truth structure has to be a truth structure, then there
should be some kind of natural restrictions on the maps allowed. These restrictions should then be reflected in the structure of subsets which will probably
not be a Boolean algebra. This is crucial, for it is then conceivable, for instance,
that the collection "not being true" look different from the set theoretical
complement of the singleton {true}.
As far as we know, the only attempt which has been made to incorporate
such considerations into a many-valued framework were those made by fuzzy
logicians. 3
In fuzzy logic, there is some restriction imposed on the intervals of truth
values allowed. It is not the case that any (fuzzy) subset is a truth-"value".
However, it seems that this restriction is arbitrary and is simply a matter of
convenience, for "if we allowed any fuzzy subset of [0, 1] to be a truthvalue of FL [a particular fuzzy logic], then the truth-value set of FL would
be much too rich and much too difficult to manipulate" (Bellman & Zadeh,
1977, 107). In order to avoid this difficulty, they pick a countable subset of
intervals of [0, 1] in a way which seems entirely ad hoc and indicates that
the structure of these intervals can be as one pleases. Needless to say, this is
entirely unsatisfactory. We ought to have intrinsic reasons to choose certain
intervals as truth-values and these intervals should have a definite structure,
uniform in some way for all truth-structures. 4 This is precisely what happens
in topos theory. Indeed, by definition, topos theory specifies what the structure of subobjects of any objects of the topos should be, in particular, the
structure of subobjects of the subobject classifier, the truth-structure, is given.
Thus the interval of truth values are not ad hoc and reflect genuine constraints of the surrounding "universe". We believe that any approach to
many-valued truth-structure has to have such specifications. 5
TRUTH-VALUES, CONNECTEDNESS AND SEPARATION
82
JEAN-PIERRE MARQUIS
The term 'separation' can even be used in its standard topological sense.
Recall that a topological space X is separated if there exists two open sets
A and B of X such that A U B = X and A n B = 0. A space X is said to
be connected if there does not exist a separation of X. Obviously, every set
with the discrete topology is separable in any way one wants. Thus any extension together with its complement yields a separation. However, it is enough
that X be totally disconnected, that is, that its only connected subsets are
one-point sets. Indeed, from the topological point of view, one of the fundamental property of the semantics of classical logic is that the underlying
universe is totally disconnected.
Two different types of evidences can be given at this point to support this
claim. The first type comes from general duality results in categorical logic
and the second type comes from considerations in topos theory.
Let us consider first the case of propositional logic. It is well-known that
a theory in classical propositional logic can be transformed into a Boolean
algebra via the Lindenbaum-Tarski construction. Now, the Stone duality
theorem asserts that the category of Boolean algebras with boolean homomorphisms is equivalent to the category of totally disconnected compact
Hausdorff spaces, called Stone spaces. The important point here is that the
property of being totally disconnected plays a crucial role. When one abandons
it and considers the category of compact Hausdorff spaces, then one obtains
an equivalence with the category of C*-algebras.
However, it is possible, but highly non-trivial, to lift the duality between
boolean algebras and Stone spaces to the (enriched) category of Boolean pretoposes, which captures the "algebras" (categories) generated by first-order
theories and the (enriched) category of ultragroupoids. Unfortunately, at this
level, the topological set-up cannot be lifted as easily. However, it is interesting to note that the category playing the role of the category of Stone spaces
is a groupoid. 6 A groupoid is in a sense totally disconnected. To see how, let
us tum to the second type of evidence.
The argument for this second type of evidence provides at best indications that there is something to be understood better. First, we define what a
totally disconnected category is. A category C is totally disconnected if for
any two distinct objects X, Y of C, there is no morphism between X and Y,
i.e. Hom(X, Y) = 0. Hence, a totally disconnect category is simply a disjoint
union of monoids. Thus every discrete category is totally disconnected but
there are totally disconnected categories which are not discrete. We now need
the following totally trivial observations:
LEMMA. Any groupoid C is equivalent to a totally disconnected category
D.
Proof The proof follows immediately from the fact that any category is
equivalent to one of its skeletons and that the skeleton of a groupoid is totally
disconnected (and not in general discrete), since it is a disjoint union of nonisomorphic groups.
83
For our purposes, it is not necessary to go deeply into topos theory. OUf goal
is not to establish general results about toposes. We want to use some toposes
in order to display curious asymmetries and try to make some general conceptual remarks from them.
DEFINITION. A topos E is a category with all finite limits equipped with a
function P which assigns to each object B of E an object PB of E, and for
each object A of E, an isomorphism
Sub(B x A) == Hom(A, PB),
where Sub(-) is a functor from E to the category of sets which associate to
each object A its lattice of subobjects.
The important point here is that what the second condition guarantees is
precisely the existence of a function expressing internally what is the structure of subobjects of an object of the topos. Thus a category is a topos whenever
84
JEAN-PIERRE MARQUIS
is defined
Negation is then used to obtain the complement of a subobject by composing a characteristic map with the negation map and then pulling back.
Thus one lifts the external negation, the propositional operator, to the inside
and apply it to predicates, that is, subobjects, as it is done in set theory. So
can we apply this procedure to the predicate "not being true"? Intuitively, it
should yield a subobject of n, the subobject classifier, which contains everything but the whole truth. Thus, what we are looking for is an endomorphism
of n such that it sends everything different from truth to the truth such that
when we pull back along the truth we get the required subobject. Again, in
a set-theoretical framework, this is trivial and traditionally in many-valued
logic, it is not even a question. However, in the framework of toposes, the
question is far from being trivial, as we will now try to show.
First, let us show that in some toposes, truth and falsehood are not on a
par. To affirm something, one can always use the truth, that is, pull back
along truth. However, one cannot in general affirm that something is false,
which should be done by pulling back along the false. The latter operation
85
cannot always be performed in a topos. Thus in general, one can only deny
that something is true. We will illustrate this by displaying the situation in what
is probably the simplest example, the tapas of pointed loops.
In this tapas, usually denoted Ser->, the objects are described by the following "schema": f: X ~ Y, where f is simply a set function and X and Y
denote collections. Thus an object in this tapas is in a sense a function, or a
rule. The collections X and Y are not by themselves objects of the tapas. There
are two intuitive and heuristic ways of thinking about these objects. The
first way is to conceive of an object f: X ~ Y as a functional data base,
where X and Y are lists of objects of (possibly) different types. The second
illustration is more geometric. An element of X is a loop without a base
point. Thus the elements of X are homeomorphic to the circle SI. The elements
of Y are simply points. What the function f does is to transform the loops
into "pointed" loops, that is, loops with a base point. From this point of view,
the individuals are of two kinds: either bare points or "decorated" points,
that is, a collection of loops fixed to a base point. Thus, the elements of our
sets are not bare individuals anymore, but rather individuals possibly satisfying
a certain type of properties, more specifically, a rule-type property. An object
of the tapas can be represented thus:
x
An arbitrary element of an object of this tapas is really a quotient of pointed
loops, i.e. it is homeomorphic to a quotient of pointed circles. (This follows
from the fact that any functor of this category, thus any object of the topos,
is a colimit of representable functions.)
Since a tapas is a category, we have to describe the morphism, the "links"
between the objects. Given two objects f: X ~ Y and g: U ~ V, a morphism
from f to g is a pair (hi' h2 ) of functions hi: X ~ U and h2 : Y ~ V such
that hi = gh l . In our illustration, what this equality guarantees is that loops
are sent to loops, points are sent to points and these maps are consistent with
one another. From now on we will denote the objects of this tapas by X, Y,
Z, . . . and morphisms f, g, h, . . . .
The terminal object of this tapas, denoted by 1, consists of a unique pointed
loop. A point of an object X is given by a morphism from the terminal object
1 into X. By definition of a morphism, such a map picks out a pointed loop
in X. Thus, if X does not contain any loop, X has no points even though it
is not empty.
Let us now turn to the subobject classifier. Recall that from the axioms
of tapas theory, it can be shown that for a universe of objects to qualify as a
topos, it has to have an object Q and a chosen subobject, the true or T, which
allows us to classify the subobjects of an object of our universe. This means
86
JEAN-PIERRE MARQUIS
x
It is clearly a legitimate subobject of X.
Hence we can now define a morphism from X to Q such that Y is the inverse
image of T.
87
It is now very easy to show that we cannot always pull back along the
morphism 1..: 1 ---7 Q, which would amount to the affirmation that something
is false. Simply take X and Y as above. One immediately see that the following
square cannot be completed:
) 1
1 11
x
We now face the obvious question: which are the toposes for which the truth
and the false are on a par, or rather in which affirmation and denial are on
the same footing, e.g., the topos of sets? to obtain the answer, we first need
the following
LEMMA. In any topos E, there is an object, which we call QOP, which satisfies the basic property of the subobject classifier with the difference that the
pullbacks are taken along 1..: 1 ---7 QOP instead of T: 1 ---7 Q.
Proof. The following diagram is a particular case of the basic definition:
Hom(A, P(l
Usually, we let <1> be the identity map. But, as Barr & Wells (1985) point out,
we could be perverse and let <1> of an element of the power object be its complement. By doing so, we obtain what we call Qap. Qap has the following
basic properties: 1.. = <pT and T = <pl.., where <p is an anti-isomorphism,
which are obvious from the definition and immediately imply the claim.
The last isomorphism of the proof is an anti-isomorphism. In some toposes,
the two structures are also isomorphic and in these cases, truth and falsehood are on a par.
PROPOSITION. Let E be a topos. Thenfor each monomorphism Y ---7 X, there
is a unique map a.y : X ---7 Q such that
l~
<Xy
is a pullback
if and only if Q
==
Qap.
88
JEAN-PIERRE MARQUIS
Proof. Suppose that 0 == ooP. Thus QOP is also a subobject classifier. Clearly,
T: 1 ~ oop corresponds to ..1: 1 ~ Q. But since Oop is a subobject classifier, the proposition is true for T: 1 ~ QOP and therefore it holds for ..1: 1
~ O. Now suppose that for each monomorphism Y ~ X, there is a unique
map cry: X ~ 0 satisfying the appropriate condition. This is equivalent to
say that T: 1 ~ oop is also a subobject classifier and therefore that Q == OOP,
as required.
Notice that this includes more than the toposes in which the subobject
classifier has a boolean structure. It includes toposes for which the subobject classifier has a biheyting structure, that is both a heyting and a coheyting
algebra.
In the topos of pointed loops, we can pick out "not being false". Indeed,
consider the object Nf
G)Nt
89
tion of the not being true whereas no such separation exists for the not
false.
Finally, we can also separate the subobject of elements which are almost
in a given subobject of Y of X. Simply pull along the morphism t: 1 ~ n,
which, as its name indicates, picks out the loop t. As in the last case, its
characteristic map Xt: n ~ n is not a topology, since it sends T to t. Now,
the interesting thing is that none of the characteristic maps we have defined
above could reasonably qualify as a negation operator, since none of them send
T to 1- and 1- to T.
It is impossible to define a morphism n ~ n such that it sends T to 1and t and 1- to T. The reason should now be obvious: it is precisely because
the not being false is connected and the fact that, by definition, morphisms
send connected subobjects to connected subobjects. Thus it is impossible to
separate T and t. Notice also that in this topos, the subobject classifier is
separable into the false and the not being false.
However, a different complement operation can be constructed in this topos
by pulling an arbitrary map X: X ~ n along not being true thus:
-y ----.-Nt
---_.... n
90
JEAN-PIERRE MARQUIS
asserts that the value of a certain property is not equal to a given value, one
is almost certain to be correct. Bunge (1981) gives the example of a prudent
person who is asked to give the age of a friend and answers "John is not 30
years old". In this kind of circumstances, it is much easier to be correct with
a negative sentence than with an affirmation.
(4) It is easy to prove that Y U - Y = X and that -(Y n - Y) = X, thus
this negation "satisfies" the law of excluded middle and the law of (non)contradiction. Thus, - Y can be thought as the closest we can get to a logical
contradictory in this context, whereas --,Y should be thought as a logical
contrary, since Y and --,Y are mutually exclusive but need not be mutually
exhaustive.
(5) In fact, it can be shown that in every topos of presheaves, the lattice
of subobjects of an object is a biheyting algebra, hence that there is such an
operation of complementation.
These latter considerations suggest that it might always be possible to define
the subobject "Not being true" and therefore this new negation corresponding
to the coheyting complement. However, this is to forget the richness and variety
of toposes. We will not exhibit a different topos in which we have a complex
truth-structure but in which it is impossible to define that subobject "not
being true".9 An object of this topos is a set X with an endofunction f: X -7
X such that f2 = f. Thus an object of that topos can be depicted in the following manner:
An arrow between two points indicates that they are related by f and that
this relation goes in the direction of the arrow, i.e. x -7 y means that y =
f(x). Informally, the objects of this topos are collections of elements which
"evolve" in one step and then are stable (with the possibility of of elements
evolving into one another in a loop, which is not represented above).
A subobject S of an object (X, f) in this topos is a subset S closed under
the endofunction restricted to it. The construction of the subobject classifier
is now immediate. It can be depicted thus:
91
and is in this sense bivalent. However, it is not boolean. This is one of the
simplest examples of a non-boolean bivalent topos.
Moreover, we cannot consider the subobject of elements for which something is not true. Indeed, it is impossible to find a subobject of Q which
represents not being true, though there is a subobject representing the not being
false.
It is in fact easy to generate infinitely many toposes with this property.
Simply let the objects of a topos to be pairs (X, f) such that fn = f, for some
n E N. When n = I, this gives us the topos of evolutive sets.
The moral is clear. For toposes of presheaves, we have two operations of
complementation, one of which can always be transferred to the whole proposition, or to use a pictorial language, moved inside out and outside in, whereas
the other cannot in general be moved outside, that is become a propositional
operator. This is a formal representation of the situation where "this man is
unhappy" is not equivalent to "this man is not happy" or "it is not true that
this man is happy". It is then natural to ask under what conditions the complementation can be brought down and be related to a negation operator.
This is equivalent to inquiring whether the subobjects Nf and Nt can be
defined internally, which, in turn, is equivalent to asking whether certain
morphism exist in Hom(Q, Q). This task is more intricate than it appears at
first, and in fact we are at present incapable of providing a complete and
satisfactory general answer. Consider, for instance, the case of a simple boolean
topos: Set2 , where 2 is a two points set. It is easy to show that the subobject
classifier of this topos is boolean, has four truth-values, hence is not bivalent,
and that it has 16 subobjects. We would thus expect that some of these 16
subobjects represent predicates such as "neither true nor false", "not being
true", "not being false", etc. However, none of the predicates are represented
by the subobjects. It takes a while to convince oneself that this is as it must
be: indeed, in a boolean topos, we are dealing with classical logic, and hence
not-true and not being true should be identical, as indeed they are in this
case, no matter how rich the lattice of subobjects of the subobject classifier
is.
This example brings us to the following observation: we have seen above
an example of a topos which is bivalent but not boolean and the latest example
provides a very simple example of a boolean topos which is not bivalent. It
is well-known that a well-pointed topos is both two-valued and boolean. This
is conceptually very significant. A topos is said to be well-pointed if the
terminal object 1 generates the topos, which means that whenever f ~ g:
A -7 B, there is an arrow u: 1 -7 A such that fu ~ guo This, in fact, tells a
lot about the structure of the objects of the topos. Indeed, in can be shown
that to be well-pointed is equivalent to the claim that every object A of the
topos is isomorphic to the coproduct of 1 indexed by Hom(l, A). In other
words, every object is made of the disjoint union of a given basic individuals, the terminal object, and thus can be decomposed "at will", just as in Sets.
It is in the general case that a deep asymmetry lies. It is always possible
92
JEAN-PIERRE MARQUIS
Until the advent of topos theory, many-valued logics were developed along
two complementary lines: the arithmetic and the algebraic. 1o In the arithmetic framework, the truth structure is thought of as a collection of points with
a linear structure. The algebraic framework is obtained by replacing the linear
ordering by an appropriate partial ordering. In both cases, negation is a specific
endomorphism of the truth structure. However, in both cases, it seems to be
assumed that the predicate "not being true" can be represented by the settheoretical complement of the singleton set {T} and thus, whereas negation
is a morphism of a specific type, complementation is a purely set-theoretical
operation.
In the topos theoretical framework, we have for the first time genuinely geometric truth structures. This has profound conceptual implications. For instance,
the notion of a truth-value is not as straightforward as in the case of arithmetic or algebraic structures. Indeed, the latter structures are constructed
from sets of truth-values, whereas the subobject classifier of a topos is an object
with a specific universal property which determines its internal structure. There
are always at least two points in a subobject classifier, the truth and the false.
However, there are toposes in which these are the only points of the subobject classifier, for instance the topos of evolutive sets, even though the
subobject classifier contains more "parts". These other "truth-values" are in
fact subobjects of n, not points of n. Hence, such a topos is two-valued.
What is queer is that despite the fact that it is two-valued, not being false is
not the same as true in this topos and the internal logic of such a topos is
not classical. As we have said above, for not being false to be the same as true,
the topos has to be boolean, which is totally independent of the number of
truth-values. Thus, it is a structural property and not a combinatorial one.
In a many-valued truth-structure, we recover immediately a traditional
logical distinction between negation as a term operator and negation as a propositional operator. The standard generalization of negation as a endomorphism
of the subobject classifier captures only the propositional operator. When we
consider the complementation operations not being true, not being false and
the like, we try to model the term operator. These two models should be
93
harmonized in the semantics and it should be possible to reflect the distinction in the syntax. The first thing we have to clarify is the way these two
operations should be linked and formalized. We have tried to take the first steps
in this direction in this paper and we hope that it will elicit further explorations
in the near future.
ACKNOWLEDGEMENTS
94
JEAN-PIERRE MARQUIS
DANIEL VANDERVEKEN
In the philosophy of language and mind, the abstract entities called propositions have a double nature. On one hand, they are units of sense of a
fundamental logical type which are expressed by the use of sentences. All
propositions represent states of affairs and are true or false depending on
how things are in the actual world. On the other hand, propositions are also
the contents of conceptual thoughts that we, human beings, have in mind
whenever we think, speak or write. As ordinary language philosophers have
shown, the primary units of meaning in the use and comprehension of language
are speech acts such as assertions, promises and requests which consist of
an illocutionary force F with a propositional content P. Moreover, many of
our mental states are attitudes like beliefs, intentions and desires which consist
of a psychological mode m with a propositional content P. Like illocutionary
acts, such attitudes are conceptual thoughts whose contents represent states
of affairs.
As Frege already noticed, the two constitutive aspects of propositions are
not logically independent. Indeed, force, sense and denotation are the three
essential components of sentence meaning in language. Thus, every proposition which is the sense of a sentence in a possible context of utterance is
also the content of the illocutionary act that this sentence could be used to
perform literally in this context. For example, the proposition which is
expressed by the sentence "John will help me" in a context of utterance is
also what the speaker of that context would mean to assert if he were using
literally that declarative sentence in that context. Unfortunately, the philosophical logics of sense and denotation which are currently used today are
incompatible with the philosophical analysis of conceptual thoughts. Indeed,
standard modal, epistemic and intensional logics are based on Carnap's
definition of the logical type of propositions which reduces propositions to
their truth conditions. From a philosophical point of view, strict equivalence
(the property of having the same truth conditions) is not a sufficient criterion of propositional identity. Indeed there are many speech acts with the
same force and strictly equivalent propositional contents which are not performed in the same contexts. Similarly, there are many different attitudes
with the same psychological mode and strictly equivalent propositional
contents. Thus the assertion (or belief) that 2 ~ 4 is different from the asseris irrational.
tion (or belief) that
The main purpose of this paper is to formulate a new logic of propositions that is compatible with the philosophical analysis of conceptual thoughts.
Incidentally, the fact that propositions are expressible in the performance of
speech acts imposes on the logic of propositions many conditions of adequacy
"2
95
M. Marion andR. S. Cohen (eds.), Quebec Studies in the Philosophy of Science /,95-105.
1995 Kluwer Academic Publishers.
96
If all the propositions which are senses of sentences are also possible contents
of illocutionary acts, then any adequate law of propositional identity must
satisfy the following criterion of substitutivity salva felicitate within the scope
of illocutionary forces. Two propositions PI and P 2 are identical only if, for
every force F, the speech acts of the forms F(P I ) and F(P 2 ) have the same
conditions of success: it is not possible for a speaker to perform speech act
F(P I ) in a context of utterance without eo ipso performing the act F(P 2 )
in the same context. Now, it is quite obvious that the ability to perform (or
understand) an illocutionary act F(P) contains the ability to apprehend its
conditions of satisfaction and to understand the truth conditions of its propositional content. Thus to understand an assertion is to understand under which
conditions that assertion is true. Similarly, to understand a promise is to understand under which conditions that promise is kept. Consequently, an adequate
analysis of propositions must theoretically relate the logical aspects concerning
the truth conditions of propositions to the cognitive aspects concerning the
determination of these truth conditions in the use and comprehension of
language.
In my analysis, I will account for the following facts:
(1) From a cognitive point of view, propositions are complex senses provided
with a structure of constituents.
97
First, we must identify the various propositional constituents of that proposition, i.e., the sense of the definite description and the properties expressed
by the two verb phrases.
Second, we must also understand how these propositional constituents are
logically related in terms of predication in the two atomic propositions
out of which this proposition is composed. We must understand that the two
properties of speaking English and German are predicated of the president of Russia.
Finally, we must also understand how the truth conditions of the complete
proposition are determined from the truth possibilities of its atomic propositions. In this example, we must understand that the proposition in question
is true if and only if the first predication is true and the second false.
From a cognitive point of view, propositions are then much more structured
entities than simple functions from the set of possible worlds into the set of
truth values. In order that two propositions be identical, they must not only
have the same truth conditions. They must also be composed out of the same
atomic propositions. Moreover their truth conditions must be determined in the
same effective way from their atomic propositions.
(2) All propositions are general propositions whose constituents are senses and
not objects.
As Frege pointed out, we cannot refer to objects in an act of thought without
subsuming these objects under senses. We can only have in mind concepts
of an individual and refer indirectly to it through such concepts. Now, if one
admits both the indispensable role of concepts in reference and the hypothesis that every proposition is the possible content of an act of thought, then
one must also recognize like Church the absolute need of senses in logic.
All propositional constituents must then be attributes or concepts of objects
of reference.
(4) Human beings are not perfectionally rational in their use and comprehension of language.
Clearly, we are often inconsistent. We often assert (or believe) propositions
whose truth is impossible. Furthermore, our illocutionary and psychological
commitments are not as strong as they should be from the logical point of view.
98
DANIEL VANDERVEKEN
Thus, we often fail to make valid theoretical inferences in the use of language.
For example, we often assert (or believe) a proposition without asserting (or
believing) other propositions which are a logical consequence of the first.
Finally, we are not omniscient and we do not know all necessary truths.
Such imperfections of human thought impose constraints on propositional
logic: First, that logic must formulate a finer criterion of speaker consistency
than logical possibility. second, propositional logic must also define a nonclassical relation of implication which is stronger than Lewis' strict implication. (In modal logic, a proposition P strictly implies another proposition Q
when all the truth conditions of Q are also truth conditions of P.) Contrary
to what Hintikka wrongly assumes in his epistemic logic, the set of propositional contents of attitudes is not closed under strict implication. Indeed,
elementary rules of natural deduction like the rule of introduction of disjunction
do not generate psychological or illocutionary commitments. Thus, we can
believe that John is French without eo ipso believing that John is French or
Tupi.
(5) Even if human beings are not perfectly consistent in their use of language,
they are not totally irrational.
As is the case for the truth conditions of propositions, the success conditions
of speech acts are often logically related. Thus, certain illocutionary acts are
not simultaneously performable. For example, no one can ever mean to assert
that Cicero is and is not Roman. Furthermore, certain illocutionary acts have
more success conditions than others: it is not possible for a speaker to perform
these speech acts in an utterance without eo ipso performing the other speech
acts. Thus an assertion that 0 ~ 2 ~ 3 is an assertion that 2 ~ 3.
We need then to discover in formal semantics which stronger logical relation
of implication must exist between two propositions PI and P 2 in order that
the illocutionary acts of the form F(P I ) commit the speaker to the corresponding illocutionary acts of the form F(P 2 ). To my knowledge, most (if
not all) existing logics are incompatible with an adequate analysis of illocutionary or psychological commitment. First, classical modal, intensional and
epistemic logics which are based on Carnap's definition of propositions predict
far too many illocutionary and psychological commitments. On the contrary,
hyperintensional logic does not predict enough commitments. On one hand,
hyperintensional logic makes the right move in admitting that the contents
of attitudes are more complex entities than simple truth conditions. But it makes
a mistake in identifying these contents with structured senses called hyperintensions whose nature is too sensitive to syntactic features like the order
of predication which are not always relevant. Thus hyperintensional logic
distinguishes identical attitudes like the belief that the morning star is the
evening star and the belief that the evening star is the morning star. For their
contents are not intensionally isomorphic.
99
100
DANIEL VANDERVEKEN
of its atomic propositions, there always exists a unique set of truth value assignments to atomic propositions under which proposition P is true. For example,
the proposition that Paul is in France or in Belgium is true under all truth value
assignments that associate the true with at least one of its two atomic propositions. Formally, the notion of truth can then be defined as follows: a
propositions P is true in a possible world w if and only the set of truth value
assignments under which P is true contains at least one assignment which associates with each atomic proposition of P the actual truth value of that atomic
proposition in the world w.
Clearly, my criterion of propositional identity is stronger than that of Carnap
and Montague who require only strict equivalence. Indeed, strictly equivalent propositions with different propositional constituents or different atomic
propositions are distinguished in my logic. Moreover, contrary to Parry's logic,
my criterion of propositional identity requires more than the identity of content
and strict equivalence. Indeed strictly equivalent propositions with the same
content are distinguished in my logic when their truth conditions are determined by the application of different truth functions. Consider the impossible
proposition (2) that naive set theory is consistent and the contradictory proposition (3) that naive set theory is consistent and inconsistent. These two
propositions are different. For we all know a priori by virtue of linguistic competence that the contradictory proposition (3) is impossible while we have to
learn that proposition (2) is necessary false. Unfortunately, such propositions
are identified in Parry's logic. However, in my analysis, these two propositions
are not true under the same set of truth value assignments to atomic propositions. For the truth function that serves to determine the truth conditions
of proposition (2) is the identity function. Thus the first proposition is true
under all truth value assignments that associate the true with its single atomic
proposition. On the other hand, the truth function that serves to determine
the truth conditions of the contradictory proposition (3) is the constant function
that associates the false with each truth value. There is not a single truth
value assignment to atomic propositions under which the second proposition
is true.
On the basis of my new analysis of propositions, one can define recursively as follows the set of all propositions.
The simplest possible propositions are the elementary propositions which
are composed out of a single atomic proposition and which are true
under all truth value assignments that associate the true with that atomic
proposition.
All other propositions are complex propositions that can be obtained
from the elementary propositions by a finite number of applications of
operations. The truth functional operations are of course the simplest logical
operations on propositions. Indeed, the content of a complex proposition
that is the result of the application of a truth function is just the union of
the contents of its arguments. The truth functions do not affect the content.
They only rearrange the ways in which the truth conditions of the new
101
102
(Ap > Bp) means that all the atomic propositions of Bp are also atomic
propositions of Ap.
-
(i)
(ii)
(iii)
IIcll E (uy,
IIRnll E (2 un y where 2 = {O, I}
IIRic l ... cn)1I = ({IIR.II, IIcdl, ... , IIc.II}, {j/j
... , IIc nllv)) = I}).
103
Notation. Let idl(X) and id2(X) represent respectively the first and the second
terms of an ordered pair X. In the model
id l (IlAaID is the set of propositional constituents of the atomic proposition expressed by Aa while id 2(IIAa1D
is the set of possible worlds where that proposition is true.
we,
(iv)
(v)
(vi)
we,
On the basis of these assignments of semantic values, one can now define
as follows the concept of a true sentence of L:
-- A sentence of the form A = B is true in a world i E I under the model M
if and only if IIAII = IIBII.
-- A sentence of the form t(Ap) is true in a possible world i if and only if
there exists at least one f E id2(IIAp1D such that for all U a E idl(IIApID,
f(u a ) = 1 if and only if i E idlua).
-- A sentence of the form ,At is true in a possible world i if and only if the
sentence At is not true in world i under M.
-- A sentence of the form (At & B t) is true in a possible world i if and only
if the sentences At and B t are true in this world.
-- Finally, a sentence D At is true in a world i under M if and only if the
sentence At is true in all possible worlds under M.
As in model theory, a sentence A of L is logically true or valid (symbolically I=A) if and only if this sentence is true in all possible worlds of all
possible interpretations of L.
V.
= R~(dl'
104
Cn
= (R~(dl'
=
... , dn )))
C;
d l) V
(Ba)
::>
(t(Aa)
::>
t(Ba
-,t(Ap)
Bp
T(Bp)
Axiom schema 12: Aa) = (Ba ~ T(Ap ::> A;), where A; differs at most
from Ap by the fact that some occurrences of the term (Aa) in Ap are replaced
by occurrences of the term (Ba).
Axiom schema 13: -,TA~) V . . . V (A:, for any A~, ... , A: E La.
Axiom schema 14: -,T(-,(A~) V .. V -,(A:, for any A~, ... , A: E La.
Axiom schema 15: TA~) V .. V (A:) V -,(B~) V . . . V -,(B:' ~ (A~) =
(B~ V .. V A!) = (B:' V .. V A:) = (B~ V . . . V A:) = B:'),
for any A!, ... , A:, B!, ... , B:' E La.
These axiom schemas express well known valid laws for tautologies.
The two rules of inference of my axiomatic system PC are the rules of modus
ponens and of necessitation.
VI.
1. The relation of inclusion of content is reflexive and transitive. A proposition has the same content in all contexts.
I=Ap > Ap
HAp> Bp)
HAp> Bp)
~
~
H:
Bp).
4.
5.
6.
7.
8.
9.
105
In the first case, it is possible to have in mind the first proposition P without
also apprehending the second one Q. And in the second case, we do not
necessarily know a priori in virtue of linguistic competence that proposition P pas more truth conditions than proposition Q.
The rules of elimination of conjunction, disjunction and material implication generate strong implication.
4.1. F (Ap & Bp) I-( Ap and F (Ap & Bp) I-( Bp
4.2. F Ap I-( C p) & (Bp I-( C p =::} (Ap v Bp) I-( C p
4.3. F (Ap & (Ap =::} Bp) I-( Bp
4.4. The failure of the law of elimination of negation I' (Ap & -,Ap)
I-(
Bp. Indeed, the content of Bp can be new.
Only the rules of introduction which preserve content inclusion generate
strong implication.
5.1. The failure of the law of introduction of disjunction jib Ap I-( (Ap V Bp).
5.2. On the contrary, the laws of introduction of negation and of
conjunction are valid.
A proposition strongly implies all and only the tautologies whose content
is included in its content.
A theorem of finiteness for strong implication. A proposition only strongly
implies a finite number of other propositions.
The relation of strong implication is decidable. This confirms the thesis that
strong implication is cognitively realized by virtue of competence.
In simple illocutionary logic, there is a law of minimal rationality of
speakers: If a proposition P strongly implies proposition Q, a speech act
of the form F(P) with a primitive force has more success conditions than
the corresponding speech act F(Q) when Q satisfies the propositional content
conditions of F each time P satisfies these conditions. And the speech acts
F(P) and F(-,Q) are incompatible.
Universite du Quebec
a Trois-Rivieres
REFERENCES
YVON GAUTHIER
INTERNAL LOGIC.
A RADICALLY CONSTRUCTIVE LOGIC
FOR MATHEMATICS AND PHYSICS
By the terms 'internal logic' I mean what Hilbert, Weyl and Brouwer have
all called, although from different points of view, inhaltliche Logik to contrast
it with formal logic. It has been sometimes rendered in English by 'contentual';
it could also be considered as an equivalent to the 'intrinsic' logic H. Weyl
has defined in (Weyl, 1968. p. 705)
Each field of knowledge, when it crystallizes into a formal theory, seems to carry with it its
intrinsic logic which is part of the formalized symbolic system and this logic will, generally
speaking, differ in different fields.
Weyl is dealing here with quantum logic and he says that it constitutes an
integral part of the formalism, or as Hilbert would say it, the analytical apparatus, in this case, of Quantum Mechanics. Internal logic denotes the set of
logico-mathematical structures of a given scientific theory. That logic is constructive in the sense that it is not independent of the construction of a particular
theory. Arithmetic is then taken, in the spectrum of mathematical theories,
as the original building block of mathematics and in line with Kronecker's idea
of arithmetical foundations for the whole edifice of mathematics, logic is
seen as an extension of arithmetic. Arithmetical logic is a local logic, that is
it is a theory of local notions, local negation (complementation) and local implication (see Gauthier, 1985). Beyond arithmetical logic, other logics correspond
to various levels in the mathematical hierarchy, from (finite) set theory to
topology and topoi theory - topos a creation of algebraic geometry, is a
generalisation of the notion of topological space, it has an internal logic
which is constructive (intuitionistic) and it has become one of the main objects
of study in category theory.
Topology, more than geometry, deals also with local notions and this idea
has been extended to the formalism of Hilbert space to provide Quantum
Mechanics with a constructive quantum logic (and a local observer to apply
it, see Gauthier, 1983). The concept of local interaction is the heart of measurement theory and the generalisation to cosmology requires a local logic
in the absence of the global unification often dreamed of b.ut yet to be constructed. A phenomenon like non-commutativity (in gauge theories and in field
107
M. Marion and R. S. Cohen (eds.), Quebec Studies in the Philosophy ojScience 1,107-122.
1995 Kluwer Academic Publishers.
108
YVON GAUTHIER
theories) seems to reflect the inner logic of interaction, the local character
of processes (strings and knots in symplectic theory).
As is obvious from the preceding, constructive internal logic is of a nonFregean variety and Boolean laws are confined to finite symmetric situations,
while processes that are not finite (i.e. that are not sets) must exhibit strict local
behaviour; consequently Cantorian set theory and set-theoretical model theory
cannot serve as foundational background for a programme that aims at giving
specific insights into the workings of mathematical and physical theory. The
logic that I delineate in the following draws upon number theory in order to
extract the internal content of an arithmetical logic which may be thought of
as the starting point of the constructivist approach in a radical foundational
enterprise.
1.
INTERNAL LOGIC
109
1.1. Vocabulary
Our first-order language L(T) for our first-order theory T has an effinite supply
of atomic symbols: (1) letters (capital and small) for formulas (and sentences), A, B, C, ... , P, Q, R, ... together with their punctuation signs, points,
commas, parentheses brackets, etc., (2) letters for variables XI' x2, , Xn'
(3) predicate letters pj, (4) functions letters fj - when f is o-ary, we consider
it as a constant, (5) the connectives /\, V, -', ~, (6) ... the quantifiers V, 3
and ~. The terms consist exclusively of: (1) variables, (2) sequences composed
of terms and functions letters, e.g. fj t l, ... , tn - for the terms t l, ... , tn.
Formulas or wffs consist exclusively of: (1) atomic formulas composed of terms
and predicate letters, e.g., pj(t l, ... , tn) for the terms t l, ... , tn; (2) any
wff consisting of formulas composed of connectives and quantifiers.
Remarks: Sentences are closed formulas, i.e., formulas are 'open' sentences where variables occur free, that is, are not quantified upon. An instance
A(tl' ... , tn/XI' ... , xn) of a formula A is the result of substitution terms t
for the free occurrences of a variable x.
1.2. Sequents
I adopt (and adapt) the standard formulation of the sequent calculus LK (Girard,
1987). A sequent is an expression r f- ll. where r and ll. are finite sequences
of formulas;
is the antecedent, e.g., AI /\ ... /\ An and ll. the succedent,
e.g., B I, ... , Bm with the interpretation (AI /\ ... /\ An) ~ (B I V . . . V
Bm)
1.2.1. Axioms
The system of L L (Local Logic) has the axiom
Axiom 1
110
YVON GAUTHIER
L4
r, A I- II
r, A /\ B Ir
II
I- A, II
I- A V B, II
I- A, II
I- B, II
I- A /\ B, II
1 1/\
/\r
L3
r,
Disjunction
r IV
L6
r,
L3
A I- II
r,
r,
r,
B I- II
A /\ B I- II
I- B, II
I- A V B, II
1 2/\
r 2V
B I- II 1 V
A V B I- II
Remark: Since the logic is local, conjunction and disjunction are assumed
to be locally (individually) provable, in particular disjunction has the disjunction property of intuitionistic logic: if I-A V B is provable, it means that
either I-A or I-B is provable - for conjunction, I-A and I-B are provable.
Negation being local, the minimal derivation of negation can be written
L7
r,
A I- II
r -,
I- -,A, II
Negation
Lg
r
r,
I- A, II 1-,
-,A I- II
Remark: one can introduce or eliminate negation (to the right or to the
left), if one has reason to do so, i.e. has found a contradiction. Double negation
cannot be eliminated, as we shall see later.
L9
r,
Implication
A I- B, II
I- A -7 B, II
r-7
LIO - - - - - - - - - ---,
r,
111
INTERNAL LOGIC
-7
Remark: Notice that since we do not have a -7 -,-,a (no more than -,-,a
a in general), implication is also local, being intimately tied with negation.
Universal quantification
Lll
r I- A, A
r I- '7xA, A
r'7(*)
r, A(t) I- A 1'7(**)
r, '7xAx I- A
LI2
Remark: Since '7 applies only to finite domains, domains, it does not differ
from the intuitionistic (or classical) finite quantifier - of course '7 as well as
3 and :E below are subject to the usual restrictions on variables: (*) means
that x is not free in r, A and (**) means that the substitute t is an arbitrary
term of L.
Existential quantification
I- At, A r3(**)
I- 3xA(x), A
LI4
r, A I- A
r, 3xA I- A
13(*)
r
r
I- A(xn ), A r:E(*)
I- :ExA, A
LI6
r, A(x n) I- A
r, :ExnA(xn ) I-
l:E(*)
A
abIes) not identified with those in A; only if there are the same, can :ExAx
be eliminated, that is to say that the left rule is only there for the sake of
symmetry.
112
YVON GAUTHIER
I- A, A r, A I- A
rl-A
for the symbol of absurdity ..1, which amounts to a structural weakening (or
addition), while classical negation requires also
L
7'
r,
r
-,A I- ..1
I- -,-,A r-,
Axiom 1
AI-A
and yields negation directly; it defines otherness or the exterior of the domain
of assertions. Semantics will describe that separation of worlds. In the
meantime, we take both autothesis and antithesis as axioms and delete L7
and Lg which are derived rules. The syntax remains even, despite the fact
that the two worlds (domains) are essentially uneven. Self-duality is only an
epiphenomenon.
2.
SEMANTICS
113
INTERNAL LOGIC
!\
=>
-,(3:D 3A 3H f-oA
H)
V=
Dn
Dn-m
(1 < m < n)
Dn(n-I)
Dn-n
<PM: form
(0,1)
<pM(A)[n]
(2)
<PM(-,A)[n]
(3)
<PM(A
!\
B)[n x m]
1, iff A
DMo and B
(4)
<PM(A
B)[n + m]
1, iff A
DMo or B
1, iff A
=
DMo
1, iff -,A
E~O
E
DMo
DMo
114
YVON GAUTHIER
loe
(5)
<i>M(A -7 B) [nrn]
(6)
<i>M(3xA) [n + m + 1 ...]
(7)
<i>M(V'xA) [n
(8)
<i>M(:ExA) [n
X
X
1, iff A
m
m
X
X
1]
DMo implies B
1, iff I: An
1, iff II An
I ... ]
DMo
DMo
DMo
1, iff II An ...
D Mo .
b = InX - a) U b) = c
where a, b, c are open subsets of a topological space X and In the interior c is the greatest element different from a. X - a is the difference or relative
complementation. Negation interpreted as arithmetic difference should be carefully distinguished from subtraction, when one remembers that the subtraction
sign is not a negation sign!
If implication can be seen as a continuous curve only in a non-standard
model - the topological interpretation - we could interpret it arithmetically
as a Cauchy product of power series, since a continuous function can be
represented arithmetically by a power series
for c n = aobn + a 1b n_1 + ... + anbo, that is, the Cauchy diagonal or product which
does not lead out of the realm of natural numbers unlike Cantor's diagonal
INTERNAL LOGIC
115
FERMAT ARITHMETIC
116
YVON GAUTHIER
ad absurdum obey the excluded third principle via double negation for infinite
sets and are then rejected by intuitionist (Brouwerian) standards. No such reprobation affects infinite descent and I shall try to give some foundational
legitimation for infinite descent. Poincare has insisted that infinite descent
(which he calls "recurrence") is not equivalent to complete induction (Poincare,
1906).
What I call Fermat arithmetic is arithmetic with the schema of complete
induction replaced by the schema of infinite descent. Fermat arithmetic has
thus only one limit-ordinal, that is an ordinal without an immediate predecessor, namely 0, Peano arithmetic (as Heyting arithmetic) has at least two
such ordinals, 0 and ro, and Cantor (or Gentzen) arithmetic admits of an arbitrary number of limit-ordinals - at least up to
Eo.
n ....OO
Fermat arithmetic is in this sense the only arithmetic that is not set-theoretic,
Le., it does not need an infinite ordinal- even Skolem (or Herbrand or GOdel)
recursive arithmetic cannot do without.
The schema of complete induction
'V'x(['V'y(y < x)Ay
Ax]
'V'xAx)
f\
'V'x(Ax
ASx)]
'V'xAx)
f\
'V'y(Xy
XSy)]
'V'yXy;
A(t, x
A(cr, x)]
'V'crA(cr, x
where the x's are free variables. Transfinite induction means complete induction up to lim (ro) = Eo in the following hierarchy:
ro = lim (0, 1, 2, ... n)
ro2 = lim (ro + n)
ro 2 = lim (ro n)
roO) = lim (ron)
roof'> = lim (rown )
eo = lim (roO) O)t).
INTERNAL LOGIC
117
Each member of the hierarchy has a last term n, since the hierarchy is based
upon the normal form that Cantor has given for any ordinal
S=
where 131 > 132 > 13m and m, n l , n 2, ... , nn are finite. This is part of Cantor's
second number class; this class contains the constructive ordinals of Kleene
and Church, and indeed it is denumerable (and recursively enumerable) up
to Eo in the set-theoretic sense because of the terms n, but it is not effectively enumerable in a strict sense, since one can find an ordinal such that
an *- a for all n: one takes a = lim an and one has an < a.
Takeuti (1975) has attempted a justification of transfinite induction by using
infinite descent in the form of strictly decreasing sequences
(WUn).
---7
X ---7
-,A(y)
can be obtained from the principle of complete induction putting -,A for A
in the schema of transfinite induction. While Buss (1986) introduces the '1'LMIN axioms with the schema
3xAx
---7
118
YVON GAUTHIER
--7
which means that the sequence is continuing on indefinitely, or rather 'effinitely'. We have the following schema:
[A(a)
A(a)
Dn
I-
Dn-m]
A(a)
Dj
(0 < m < n)
(0 ~ i < m)
I-:Ex A(x)
which corresponds to rule LIS.
This principle of descent does not need a universal quantifier, only an
'effinite' quantifier for finite or rather indefinite descent; effinite still means
potentially infinite, indefinite sequences or Brouwer's 'infinitely proceeding
sequences'. To such effinite sequences, one could assign an 'unlimited' natural
number, as in (Nelson, 1986), while finite natural numbers are assigned to finite
initial segments (sets) of those sequences.
Since infinite descent is impossible - any descending sequence of positive
integers must stop at 0, the prepositional bound of the sequence of natural
numbers - one can add the following conclusion to our descent schema
:Ex{[Ax " 3y(y < x)Ay]
--7
--7
:Ex-,Ax
which means that the property (or set of properties) postulated for the infinite
descent is false for all natural numbers 'effinitely'. The effinite quantifier of
the conclusion does not reach beyond what is contained in the premisses, it
is an indefinite ascent of the sequence of natural numbers.
INTERNAL LOGIC
119
asserts that the ratio of the number of primes in a large set x to xllog tends
to the limit 1 as x tends to infinity, that is
lim
1t(x)
xllog x
=1
has been proven by elementary means (by Selberg and Erdos), long after it
has been proven by analytical methods; the same holds for Dirichlet's theorem
on the infinity of primes in any arithmetical progression ax + b for a and b
relatively prime, i.e. (a, b) = 1. Since Euclid's theorem, like the fundamental
theorem of arithmetic on the unique representability of integers by a product
of primes, needs only constructive methods for its proof, it is the concept of
infinity which is at stake here. My contention is again that the concept is
dispensable and that one can eliminate it or paraphrase it as Brouwer did by
referring to 'infinitely proceeding sequences' (or, as I call them, 'effinite'
sequences). It is really an effinite process which is at work in those proofs;
Aristotle said in his Physics 203b, that the infinite is that which cannot be
crossed (dolEs(n1'toC;) - it may be worth noticing that Gentzen spoke rather
of a potential crossing or running through ("ein potentielles Durchlaufen")
of the infinite in his justification of transfinite induction. If the infinite cannot
be crossed, is the thought-experiment of a potential crossing in itself justifiable? In any case, the actual wording of Euclid's theorem is: "Prime numbers
are more numerous than any definite quantity (of prime numbers)", which is
proposition 20 of the book IX of the Elements (see Davenport, 1968). It suffices
for the proof to suppose that the sequence
PI ... Pk
PI x P2 X ... X Pk + 1
Do
I-
[A(a) E Dnl
A(a) E Dn+l
~7-----------------------------
I- :::Ex A(x)
120
YVON GAUTHIER
~(a)
A(a)
f-
Dn
A(a)
Dn_ 1
Dn-(n_l)
D n_n
~8--------------------------
f- :Ex -,A(x)
Here :Ex A(x) means 30 and :Ex -,A(x) means -,30. L17 and LI8 are analogues
of LIS and L7 We can then formalise Euclid's proof in the following way:
LEMMA. Any composite number is divisible by a prime number. In symbols
:Ex Comp x)
3zPrim z)
xJz.
1\
-,3zPrim z)
1\
xJz
:ExComp x)
-,-,3zprim z)
1\
xJz
1\
Prim 0)
~ t
< 0).
Proof Note the :Eo can stand either 'for 0 effinitely' or for 'there is an
effinite sequence 0'. We take as given the following prime:
3zn[zn $ pk! + 1 1\ (Prim zn
1\
Zn > p]
Dn
defined above and show that all t's differ from it; we have to show that:
121
INTERNAL LOGIC
Suppose that t = pk!, and Zn < pk! + 1. The only case of interest is t = p;
but Zn > p, thus t < zn. The fact that t is finite has been gotten by infinite descent
and the statement of the theorem is obtained by effinite induction
[Prim t /\ Prim 0) ~ t < 0) E Dn)]
Prim t /\ Prim 0) ~ t < 0) E Do f- Prim t /\ Prim 0)
f-V't::oPrim t /\ Prim 0)
t < 0) E Dn+1
t < 0)
CONCLUDING REMARKS
122
YVON GAUTHIER
JUDY PELHAM
A RECONSTRUCTION OF
RUSSELL'S SUBSTITUTION THEORY
In "Russellian Propositions" (Pelham, 1994), Alasdair Urquhart and I elaborate a theory of structured propositions that closely parallels Russell's
substitution theory of 1905-08. The present paper elaborates how our reconstruction of the substitution theory models Russell's philosophical ideas about
the structure of propositions as well as his axioms for substitution in writings
dating from the period around December 1905.
I.
124
JUDY PELHAM
(Zero)
true will pick out those matrices, sic, such that no substitution for c in s
yields a true proposition. Thus the matrix (Zero)/(pla) plays the role of the
cardinal number zero in STCR. In Russell's words:
According to this definition, 0 is a relation between a proposition and an entity, namely the
relation that, whatever we may substitute for the entity in the proposition, the result is always
false. (Russell, 1973, p. 175)
The relation between the proposition and the entity that Russell speaks of
here is a logical construct, a "matrix" as he calls it. That is, relations do not
exist as independent abstract objects, but zero is a relation which is specifiable in logical terms using the notions of a proposition and an entity. As
such this sort of relation is an instance of a logical fiction.
As Russell understands logic to be completely general, the variables of quantification, and so the substitution operation, are taken to apply to all entities
whatsoever. This means that although a may not occur in p, the substitution
sentence, p;,!q, must be true or false. Russell stipulates that if a does not
occur in p, then the result of any substitution for a in p is p. Russell reveals
this tacit stipulation with a definition of 'a ex p' in STCR (Russell, 1973,
p. 169), which says a is not a constituent of p if every substitution for a in
125
The elimination of predicates in this way is an attempt to resolve the contradiction concerning classes and predicates without developing a typed theory.
Russell says explicitly in STCR that propositions are to be taken as fundamental and not relations or predicates:
But in symbolic logic, it is best to start with propositions as our data; what is prior to propositions is not yet, so far as I know, amenable to symbolic treatment, ... (Russell, 1973, p. 175)
= y. =:(p,
12.1
12.14
q, r, a): p -!q.p -
!r.~.q~r
126
JUDY PELHAM
AXIOMS 3
12.2
x
1-: (3q).p -!q
a
=r
x
1-. P -Ix
p
x
12.211 1-. P -!p
x
12.212 1-: a
12.22
x
1-: -(y).a in ('\jf!y).:J.('\jf!a) -!('\jf!x)
12.24
a'
1-: -(x, y).a in <j>!(x, y).:J.{(y).<j>!(a, y)} - !{(y).<j>!(a', y)}
a
These axioms specify properties of substitution. Axioms 12.2 and 12.201 say
that there is a unique value for each substitution operation. Axioms 12.24
and 12.241 say that being a constituent of a proposition is a transitive and
reflexive notion. Those remaining say that with respect to wholes and parts,
substitution behaves in a way that preserves "logical" structure. These axioms
show that Russell did not hold that propositions with the same truth value were
substitutable for one another. They make it clear that Russell had a more
complex notion of propositions than simply the bearers of truth-values. He
thought of propositions as structured objects which seem to involve predicates,
according to axioms 12.22 and 12.25. However it is odd that Russell employs
a variable which seems to range over predicates, for after all predicates are
not supposed to be entities on the substitution theory. If they are not primitives of the theory, it is puzzling that they are mentioned in stating the
fundamental axioms. The answer to this puzzle I believe lies in the following
passage from STCR:
It should be observed that the relations identified with dual matrices are (approximately) relations in extension. If we say' x begat y', the word begat expresses the same relation in intension
as was expressed in extension by our matrix pl(a,b). The drawback to relations in intension,
from the standpoint of symbolic logic, is that not all propositional functions of two variables
correspond to relations in intension, just as not all propositional functions of one variable
correspond to predicates .... Relations in intension are of the utmost importance to philosophy and philosophical logic, since they are essential to complexity, and thence to propositions,
127
and thence to the possibility of truth or falsehood. But in symbolic logic, it is best to start with
propositions as our data; .... (Russell, 1973, pp. 174-5)
THE RECONSTRUCTION
did not observe the distinctions between language and metalanguage, and
semantics and syntax. In failing to observe the latter distinction Russell
is firm in his belief that the goal of logical analysis is the elucidation of
abstract structures; he simply assumed that a correct symbolism would
mirror the structure of the objects being described. 4 For the purposes of the
reconstruction we follow Russell's lack of such a distinction. The reconstruction develops a semantic structure which exhibits many features of a
formal language.
Elements of the Reconstruction
128
JUDY PELHAM
variables to range over individuals and propositions. The reconstruction introduces a category of individuals which are logically simple, that is, which
have no logical constituents. This category seems implicit in Russell's work.
The reconstruction also employs 1: as a metavariable ranging over predicates
and S as a metavariable ranging over variables, which Russell, without the
notion of a metalanguage, did not do. The connectives, like the predicates,
are functions mapping the set of all entities to propositions, which clearly
agrees with Russell's practice. Quantifiers are a special proposition-forming
operator, since they form propositions from things which are neither individuals nor propositions. The propositional function variables, <I>!x, used in
Russell's presentation of the substitution axioms, are construed as metavariabIes from the point of view of the reconstruction.
Russell understood propositions as complexes built up from individuals and
intensional predicates, and the goal of the reconstruction is to specify how
propositions are built up. For Russell the only intensional predicate which logic
requires is substitution, and identity can be defined in terms of the substitution operation. However, the reconstruction gives truth-conditions for the
substitution sentences in terms of the identity of the structure involved, so it
seems natural to employ identity and substitution as primitive.
Propositional Forms and Propositions
I introduce the notion of a propositional form as an intermediary in the construction of propositions. The set of propositional forms R(I), is recursively
defined by the following clauses:
(1) A variable standing alone is a propositional form.
(2) If 1: is a k-place predicate, and 0.1' . . . , ak are propositional forms
or members of I, then 1:(0.1, , a k) is a propositional form.
(3) If a is a propositional form or a member of I, then -a is a propositional form.
(4) If a, 13 are propositional forms or members of I, then a :J 13 is a
propositional form.
(5) If a is a propositional form and Sis a variable, then (S)a is a propositional form.
(6) Only formulae resulting from repeated applications of the preceding
four rules are propositional forms.
R(I) is the smallest set of objects which fulfils these conditions. Each propositional form may be thought of as an ordered sequence which consists of
the main operator, followed by each of its arguments in order, and each of these
arguments is followed by each of its arguments and so on until the end as in
Polish notation. It is helpful to simply diagram a propositional form as an
inverted tree, thinking of the predicates, connectives and quantifiers as nodes
with one or more branches, and individuals, variables, and propositions as
leaves.
129
Using the notion of the formation tree of a propositional form, I give the
definition of the identity of two propositional forms:
Two propositional forms, a and ~ are identical, iff they have the
same formation tree, or if the formation tree of one is obtained from
the formation tree of the other by alphabetic change of bound
variables.
In this way distinct variables make distinct propositional forms, except in
the case of the variables of quantification. Thus while Fx =F- Fy, (x)Fy = (y)Fy.
Propositions are those propositional forms which do not contain a free
variable. An adequate definition of proposition requires that the notion of a
constituent of a propositional form have a precise meaning. The notion of a
constituent of a propositional form parallels the inductive definition of propositional form already given. The constituents of a propositional form are
determined by the following clauses:
(1) Every propositional form is a constituent of itself.
(2) a l , ... , a k are constituents of the propositional form :E(a l , , ak)'
(3) a is a constituent of -a.
(4) a and ~ are constituents of a ::J ~.
(5) a is a constituent of (S)a.
(6) If r is a constituent of 11 and 11 is a constituent of W, then r is a constituent of W, with the following exception:
No bound occurrence of a variable is a constituent of any propositional form.
130
JUDY PELHAM
If 'Y does not meet these conditions, or if 'Y is not identical to any constituent
of a, then a~ is a.
The construction presented in the last three pages gives us a semantic
structure which is three-tiered, in the sense that it contains three different kinds
of objects. The first kind is entities, and entities are possible values of the unrestricted variable of quantification; ontologically these have the status of things
in the world. They include individuals and propositions. The second kind is
propositional forms. Propositional forms are not entities, but they may be
constituents of propositions, or the patterns which propositions instantiate.
In their simplest form they are variables. The third kind is predicates, which
are the nodes which are used in building up propositional forms. Predicates
are not entities, and they are not considered constituents of propositions or
propositional forms. Propositional forms and entities are both capable of
having and of being constituents in other logical structures. Predicates neither
have constituents, nor are they capable of being constituents to any other
complex.
This three-tiered structure of the reconstruction agrees with many of the
things which Russell says in his manuscript about the nature of propositions
and their constituents. Consider the following passage from the substitution
manuscript of Dec. 1905:
An expression containing x is called a dependent variable; thus "x is a man" is a dependent
variable; so is "x - x" or "x ~ x" or any other expression of which x is a constituent. Such an
expression will be called <p!x. Here <p!x by itself is not supposed to have any meaning, so long
as x is not determined. [bold mine] (Russell, 1905b)
What I have called propositional forms, Russell thought of as dependent variables. The determining property of dependent variables or propositional forms
is that each is capable of containing real variables as constituents. Russell
believed that the variable is a constituent of the propositional function, however
propositional functions themselves do not have any meaning. Russell continues:
<p is not a constituent of <p!a or <p!x, and when <p occurs in an expression, we cannot put (<p)
before the expression to obtain an analogue of (x). <p!x. A statement of the form "For any value
of <p, so-and-so is true" is meaningless. (Russell, 1905b)
Here Russell is speaking about what the reconstruction calls predicates. Both
predicates and propositional forms are not entities, but propositional forms
yield a proposition under certain circumstances, and predicates do not. Further,
predicates are not capable of having constituents, as propositional forms are.
Russell's own manuscripts suggest a three-tiered hierarchy of elements.
Russell's manuscript also agrees with the reconstruction in that Russell does
not think of a bound variable as a constituent of a universally quantified
sentence, as the following theorem, taken from the Dec. 1905 manuscript
shows:
1-: x in {(y).$!y}.
=.(y).x in $!y
(Russell, 1905b)
131
(l
and
(l
in
(l
V ~
(l
(l
is either false or
132
JUDY PELHAM
133
It was not published until 1973. It was received by the London Mathematical Society on
April 24, 1906, but Russell withdrew it before it could be published.
3 In the manuscript Russell writes p;!x for * 12.211 but it seems clear from the rest of the
paper that this is merely a slip on Russell's part.
4
For discussion of how Russell's understanding of logic in The Principles of Mathematics
precludes these distinctions, see Pelham (1993).
5 It is a liar sentence in this respect, and soon after the discovery of this paradox Russell
began work on a manuscript called "The Paradox of the Liar".
2
REFERENCES
Herzberger, Hans, 1982, 'Notes on Naive Semantics', Journal of Philosophical Logic 11: 61-102.
Kripke, Saul A., 1975, 'Outline of a Theory of Truth', Journal of Philosophy 72, 690-716.
Moschovakis, Yiannis N., 1980, Descriptive Set Theory. Studies in Logic and the Foundation
of Mathematics Series. North Holland, Amsterdam.
Pelham, Judy, 1993, 'Russell's Early Philosophy of Logic', Russell and Analytic Philosophy,
University of Toronto Press, Toronto.
Pelham, Judy and Alasdair Urquhart, 1994, 'Russellian Propositions', Logic, Methodology, and
Philosophy of Science IX, Elsevier.
Russell, Bertrand, 1903, The Principles of Mathematics, Cambridge University Press, Cambridge.
Russell, Bertrand, 1905a, 'On Denoting', Mind 14 (Oct): 479-493. Reprinted in Russell (1973)
Russell, Bertrand, 1905b, 'On Substitution' (Russell Archives 220.010940). Unpublished manuscript in Russell Archives, McMaster University, Hamilton, Canada. Dated Dec. 22, 1905
in Russell's hand.
Russell, Bertrand, 1906, 'On the Substitutional Theory of Classes and Relations'. First published in Russell (1973).
Russell, Bertrand, 1907, Letter to Ralph Hawtrey, dated Jan. 22,1907. Copy in Russell Archives
(REC.ACQ 394).
Russell, Bertrand, 1973, Essays in Analysis. Edited by Douglas Lackey. George Braziller, New
York.
MICHAEL HALLETT
The logical systems presented in the books by Hilbert and Ackermann (1928,
1938) and in Hilbert and Bernays (1934/39) are not too far removed from
modem, axiomatic systems, those, for instance, to be found in Kleene 1952,
Church 1956, or Mendelson 1964. What Hilbert et oZ. give is, at root, a
system of (many-sorted) first-order logic, suited for the deductive purposes
of all mathematical theories, and therefore (of necessity) adding no genuine
content to any theory. What we have, in fact, is systems which are minimal
when compared to those of Whitehead and Russell or Frege, a Zogica utens
as opposed to a Zogica magna, to echo van Heijenoort's distinction. Moreover,
Hilbert and Ackermann (and then Hilbert and Bernays) state clearly what
are now regarded as basic questions concerning consistency, completeness
and decidability. Thus, in short, whatever the similarities with systems earlier
than those of Hilbert, what we see in many respects is the first modem
presentation of logic.
Hilbert's involvement with, and treatment of, logic is complex, and cannot
be treated here in any full sense, even when technical developments (such
as the e-calculus) are left to one side. Nevertheless, I want to try to shed
some light on the emergence of certain of the elements mentioned above,
and to convey a general sense of the structure and role that logic plays in
Hilbert's overall approach to the foundations of mathematics. This will constitute a modest contribution to a fuller understanding of the position of modem
logic.
I will address two main issues. The first is the emergence of the view of
logic as minimal. I will suggest that there are two central things which pushed
Hilbert in this direction, his attitude to the primitives of a mathematical system,
and his novel approach to the question of mathematical existence. Both of these
views stem from the rejection of the thesis that axioms are truths. It should
be noted that both were initially developed independently of the logical
antinomies, but these certainly served to reinforce the doctrine that a logical
calculus should be minimal, and we will examine part of the reason for this. 2
The second thing I wish to examine in more detail is Hilbert's rather
complex attitude to second-order quantification. This, too, is connected to
the desire to keep the logical resources quite separate from the existential
commitments of mathematical theories, and is again intimately connected to
Hilbert's view of mathematical existence. But it is also connected to two things
stressed by Hilbert from the very early stages of his work on foundational
135
M. Marion and R. S. Cohen (eds.), Quebec Studies in the Philosophy of Science I, 135-187.
1995 Kluwer Academic Publishers.
136
MICHAEL HALLETT
matters, the role of symbols in mathematical theories, and what Hilbert and
Bernays later call the finite Einstellung. This is important, for it is then quite
natural to see logic for Hilbert as the main link between the two most important elements of his foundational work, the stress on the axiomatic method,
articulated in the period 1899-1905, although iterated throughout, and the
Hilbert programme of the 1920s, built as it is around what we call finitism.
Interestingly, Hilbert did not begin to develop a system of logic until 1905,
a development which was continued following rather different lines from
around 1917, thus a considerable time after he had set out the main aims of
his axiomatic approach. To be more explicit, Hilbert adapted logic (taken
from various sources) so that it conforms to certain features which he saw
as essential to the aims of the axiomatic method, and in this it is the finite
Einstellung which is the dominant feature. One should be careful to distinguish
between the finite Einstellung and finitary reasoning. The two are closely
connected, although I will not discuss finitary reasoning as such in this paper.
Hilbert intended all of developed mathematics to be finitary in a certain
sense, but he considered that only a certain restricted kind of arithmetic is
confined to methods of proof that can be called finitary. I will, however, say
a little at the end of the paper about where the need for finitary reasoning
arises.
The first two sections of the paper will, therefore, examine Hilbert's conception of the axiomatic method, his treatment of primitives, and why the
consistency question emerges in a natural way. This has clear implications concerning the scope of logic, and also for the nature of quantification. The third
section.will give a preliminary examination of the role of the antinomies and
of part of Hilbert's attitude to second-order logic. The last section will also
look very briefly at the finite Einstellung and finitary reasoning.
1.
Hilbert's view of the axiomatic method is not straightforward. In the first place,
he is clear that an axiomatic system does not have its origin ex nihilo. It begins,
rather, with an assemblage of what he calls 'facts' [Tatsachen], more or less
established; the task then is to investigate what follows from what, and what
is dispensable and what essential. For instance, in his lectures of 1905, he says:
If one has a certain body of facts [Thatsachenmaterial] at one's disposal, a body that may be
composed of certain propositions [Satze], even doubtful ones, conjectures, etc, then one singles
out a series of these statements, separates them off, and puts them together into a system of its
own [einem eigenen System], into the axiom system. This system is then seen as the foundation, and one seeks to derive from it all the material presented through logical combinations
according to the known logical laws"
In Hilbert's view, the axiom system achieves with this a certain independence from the Tatsachenmaterial, as is made clear in the following passage
from some lectures of 1922-23. Hilbert is reported as saying:
137
The service of ax iomatics is to have stressed a separation into the things of thought [die
gedanklichen Dinge] of the [axiomatic] framework and the real things of the actual world, and
then to have carried this out. s
To emphasise what is really new here, and for brevity, I cite Bernays writing
in 1922. Bernays, of course, is talking directly about Hilbert's axiomatisation of geometry, but Hilbert's view is quite general. First, Bernays begins with
a short account of how axiom systems were viewed before Hilbert. Bernays
writes:
According to the standard view, the nature of the axiomatic method, i.e., the method of
developing a science purely logically from axioms and definitions, is the following: One starts
from a few basic principles, of whose truth one is convinced, places these at the beginning as
axioms, and, with the aid of logical deductive procedures, derives from them theorems, theorems
whose truth is therefore just as certain as that of the axioms, simply because they follow
logically from these axioms. This view draws attention to the epistemological character of the
axioms. 6
Note that, according to this view, all the axioms and theorems are simply truths,
with the axioms assigned a special role among the truths, presumably (as
Bernays says) for epistemological reasons, reasons which ought to be given
in a full statement of the axiom system. But for Hilbert, axioms are no longer
truths. Bernays goes on:
... according to this view [i.e., Hilbert's] the axioms are not judgements of which it can be
said that they are true or false. Only in connection with the axiom system as a whole do they
have a sense. And even the entire axiom system does not constitute the expression of a truth.
Rather the logical structure of axiomatic geometry is, in Hilbert's sense, purely hypothetical,
just like abstract group theory: If anywhere in reality there are three systems of objects [Bemays
is referring to the three kinds of primitive in Hilbert's geometry, points, lines and planes] which
are mutually related so that the axioms are fulfilled (Le., that under a suitable correspondence
between the names and the objects and relations the axioms become true assertions), then all
the theorems of the system are also correct for these objects and relations. The axiom system
itself does not express a state of affairs [bringt nicht eine Tatsiichlichkeit zum Ausdruck), but
rather represents a possible form of a system of connections, a system which is to be investigated according to its internal properties [innere Eigenscha/ten].7
Note, too, that Hilbert, as reported by Bernays here, relies on a notion similar
in flavour to what is now called logical consequence, and this accurately
reflects formulations of Hilbert which go back at least as far as 1894. 8 Note
also that we have here something similar to the notion of 'true in an interpretation', which also goes back to 1894, and which is mentioned in the
correspondence with Frege. Above all, though, we should note that what
seems to be expressed by an axiom system is not a set of truths, and even
the separation from the 'facts' is stressed; rather what we have is a network
of logical interconnections between propositions such that if the basic terms
are interpreted by some realm of things in such a way that the axioms do
express truths about them, then the theorems must hold of them as well.
It should not be thought that Hilbert's concerns were utterly different from
those of others who promoted something like an axiomatic view at around
138
MICHAEL HALLETT
the same time, even those, like Frege and Pasch, who held something akin
to the first view of the role of axiom at is ation that Bernays sketches. For
example, Frege states clearly in the Grundlagen (1884, 2) that one of the
purposes of giving a proof is to show how the particular truth proved depends
on other truths. 9 Indeed, this is intrinsic to Frege's project in the Grundlagen,
for part of the point is to argue for a conception of the analyticity of a sentence
which is based, not, as with Kant, on its inner form, but rather on its deductive linkages. This expresses much of the spirit of Hilbert's investigations of
geometry. Secondly, Hilbert shares with Frege (as well as with Bolzano,
Dedekind and Cantor) the desire to show that mathematics is autonomous,
not dependent on 'foreign elements' or on appeals to intuition, at least in its
deductive development. This is connected to a third point. The main similarity between Hilbert and others in the late nineteenth-century concerned with
axiomatic development is the pursuit of rigour and precision, to be achieved
by explicit statements (laws or axioms) presented at the outset and which
must not be supplemented during the course of the theory's development. 1o
To this end, both Hilbert and Frege insist, as do Pasch and others, that the
development of the theory must proceed by means of purely logical (and,
indeed, 'gapless ') deduction. ll
However, the similarities mentioned here, although important, are limited,
especially in the conception of the role ascribed to logic. The best way to
approach one of the central differences is to look more closely at the matter
of precision.
It is often said that the primary purpose of logic is to facilitate, indeed
encourage, precision. In a sense, this is right, but it is important to be clear
about how to interpret this. For instance, if one wants to avoid 'foreign
elements', it is necessary to state what can (or cannot) be appealed to in a
proof, and this involves first a declaration of what primitives are to be allowed,
and of what propositions to admit (as axioms) governing these. For the particular deductive exercise in question, one might be content with such
specification, and indeed this was very much Hilbert's view. But one might
also wish to go further than this. Russell provides a prime example of this more
ambitious view. From the very beginning of his work on the foundations of
mathematical theories, Russell was concerned with isolating the right primitives. These Russell characteristically called 'indefinables', meaning not just
indefinable in the theory that one is trying to expose, but absolutely indefinable. For instance, take the following passage from Russell's 1899 reply to
Poincare:
Both [terms, 'distance' and 'straight line'] belong, so to speak, to the geometric alphabet. They
can serve to define other terms, but they are themselves indefinable. Consequently, any proposition, whatever it may be, in which these notions figure is either an axiom or a theorem, and
not a pure definition of the word. When I say: the straight line is determined by two points, I
assume that straight line and point are terms which are already known and understood, and I
express a judgement about how they are related, a judgement which is either true or false, and
which is in no case arbitrary.12
139
Two things are clear here. First, there is the maintenance of the view that
the statements involving the primitives must all be true or false. But alongside this there is the view that the primitives must be ultimate. In his essay
on Leibniz, Russell says that
.. the business of philosophy is just the discovery of those simple notions, and those primitive axioms, upon which any calculus or science must be based .... An idea which can be defined,
or a proposition which can be proved, is only of subordinate philosophical interest. The emphasis
should be laid on the indefinables and indemonstrables, ... 13
This makes the search for primitives the primary philosophical activity, and
knowledge of these the primary philosophical knowledge. And here Russell
says that
.. no method is available save intuition. 14
But it seems that this intuition does not yield articulable knowledge of the
primitives over and above that embodied in the axioms, for, if it did, this could
then supplement the latter, or even supplant some of the axioms. As Russell
says:
.. the meaning of the fundamental terms cannot be defined, but only suggested. If the suggestion does not evoke in the reader the right idea, nothing can be done. IS
And later he says in the Principles (1903) that any discussion of indefinabIes is
.. the endeavour to see clearly, and to make others see clearly, the entities concerned, in
order that the mind may have that kind of acquaintance with them which it has with redness
or the taste of a pineapple. 16
This concern with the nature of the primitives is not just a feature of
Russell's early examination of geometry, but it continues throughout his work
on the foundations of mathematics. From the time of the Principles onwards
it becomes the issue of whether to select classes, propositional functions or
propositions as the basic elements, one of the central considerations in this
(aside from the avoidance of contradiction) being how such choices fit in
with the demand that the basic principles adopted should be principles of logic.
Even after the Principia, in the theory of logical atomism, Russell is concerned
to find primitives on which our knowledge of the physical world is based. This
is all part of what Russell would have considered logical precision, since,
for Russell, the first task of philosophy requires a rendition of statements in
their ultimate 'logical' form as opposed to their 'surface' (and often misleading)
grammatical form; only this renders the correct content of a proposition.
Thus, he held that only sentences written in a 'logically perfect language' (such
as that of Principia) express genuine propositions. Thus part of what isolating the primitives achieves is precision about what the correct logical form
of our assertions is. 17 Note that this project requires finding the right language
in which everything can be expressed.
140
MICHAEL HALLETT
But this is not at all what logical precision means for Hilbert. For him, there
is no absolute logical form in this sense. Of course, logical form depends on
what is taken as primitive, and, then on what is assumed about the primitives in the axioms. But not only might there be different systems of axioms
based on the same primitives, but the same set of theorems might be presented with different primitives, and what is primitive in one system may
not be primitive in another. If follows that, for Hilbert, there is no unique
way of stating the facts, for there is not one privileged type of primitive,
although there might well be powerful considerations in favour of some ways
of shaping an axiom system and against others. 18 Logical precision for Hilbert,
then, is first and foremost a matter of establishing the logical interconnections between propositions within a system once one has settled on the
primitives. And for Hilbert it is not just the positive deductive links that
must be revealed, but also the absence of such links, and also the links between
the propositions of a theory and those of other theories, in so far as it is possible
to do this. 19 Thus, formalisation in the sense of stating the primitives is actually
only a pre-condition for precision, for it is simply to cast the terms and
propositions in such a way that it is possible to investigate their deductive interconnections. This is surely closely related to what Dedekind says in 3 of
his essay on the irrational numbers (1872) when he stresses the need for a
notion of continuity which can be used in a system of deduction:
Nothing is achieved by vague talk of 'uninterrupted connection in the smallest parts'. It is
surely a matter of specifying a precise characteristic of continuity which can be used as the
basis for actual deductions. 20
The extent to which what has been said about Russell's concern with
primitives also applies to Frege is not certain. For one thing, Frege gives the
strong impression in the Grundlagen (68, footnote) that there could be ways
of defining number other than through the use of extensions, that there is
therefore nothing essential about this. Nevertheless, it is clear that Frege's
intention is to give explicit definitions of the numbers, thus to determine the
reference of the 'primitives' of arithmetic and of analysis in terms of logical
notions which are genuinely primitive. The analogy which Frege uses in the
Begriffsschrift about microscopes is also instructive. Frege makes it clear
that he thinks of a formalised, logical language as being simply an extension of natural language that enables one to reveal the logical structure that
is there but (by implication) hidden. Logic, then, extends the power of natural
language (for some purposes) just as the microscope extends the power of sight
for some purposes:
I believe I can give the clearest indication of the relationship between my Begriffsschrift and
ordinary language [Sprache des Lebensl if I compare it with that of the microscope to the eye.
Because of the range of its applicability and its transportability, by means of which it can be
shaped to fit the most diverse circumstances, the latter has an enormous superiority over the
microscope. But considered as an optical instrument, it exhibits great imperfections, which
ordinarily go unremarked because of its intimate connection with intellectual life. However, as
soon as scientific [wissenschaftlichel purposes pose demands concerning the sharpness of
141
differentiation, then the eye reveals itself as inadequate. The microscope, on the other hand, is
precisely suited for such purposes, but for the same reason unusable for all others.
The Begrifsschrift is just a scientific tool, invented for determinate scientific [wissenschaftliche] purposes, which should not be condemned because it is not suited for others. 21
This suggests that looking more carefully with the new 'visual' aid will reveal
the correct structure of propositions (say, those of arithmetic).22 The desire
to discern the 'correct structure' might well be simply the desire, not to discern
an 'ultimate' form, but rather simply to reveal a form suitable for carrying
out deductions in a gapless way, as Dedekind suggests. However, what Frege
says in his 1906 (p. 301) about the position of genuine primitives tends to
suggest otherwise; these primitives cannot be defined, they can only be accompanied (sometimes) by 'elucidations'. (Note the similarity with Russell.) These
elucidations, Frege says, cannot play any role in proofs because they 'lack
the requisite precision' (Frege to Hilbert, 27.xii.1899) and 'and no inference
is based on them' (1906, p. 301). Frege's insistence that one must establish
the truth of axioms before one develops the system (an insistence which
emerges very strongly from the letter of 27.xii.1899) then suggests (i) that
there is a unique content to these axioms, and (ii) that the logical interrelations shown by the development of the system do not reveal anything about
this content.
I have suggested how Hilbert's view differs from this, but it is important
to pin down the source of the difference. Part of it stems from Hilbert's position
on the status of axioms. If it is assumed that the axioms are truths laid down
prior to the development of the system, then presumably they are truths about
the (relations between) the primitives which appear in them, and enough
must be said about these to indicate at least why these propositions are true.
This, in tum, indicates that we must work in an ambient language in which
it is possible to fix the primitives. For Frege, and for Russell much of the
time, this was a universal logic. However, if the assumption about the axioms
as truths is dropped, then there is no necessity to say anything about the
primitives prior to the development of the theory. Thus, in particular, there
is no necessity (as regards the primitives) for a strong ambient logic. Hilbert
does not accept that there need ever be absolutely indefinable primitives,
and holds that it would be extremely restricting to regard certain concepts as
absolutely primitive, for many of the most significant developments in mathematics and science are based precisely on giving definitions of 'indefinables'
in a modified or completely new systemY Individual systems operate with
primitives that must be regarded always as relatively indefinable, and these are
governed solely by the axioms set out in the system in question.
It is possible to see in this a generalisation of Dedekind's approach to the
extensions of the various number systems, and what he called new 'creations'
in general. Dedekind is not concerned (at least, not primarily) to give definitions of the elements of the successive fundamental number systems, but rather
first to specify laws which govern them, and then also to state general principles as to how the laws for an extended system emerge from laws for the
142
MICHAEL HALLETT
system that it extends. For instance, Dedekind states that completely general
laws are to be maintained, while restrictions are lifted on laws of limited
generality. (A prime example is the shift from the natural numbers to the
integers, where substraction, an operation of limited scope in the natural
numbers, is turned into a completely general operation like addition.)
According to Dedekind, there should be something canonical (and thus nonarbitrary) about this, and he makes a serious attempt to describe, in a general
way, how such principles are arrived at. 24 At the root of Dedekind's procedure is the insistence that the objects of the new theories are genuinely
primitive in the sense that they are assumed to have no properties other than
those which are specified in the principles laid down.
This procedure is clearest of all in the case (described above) of going
from the positive whole numbers to the integers; the modern way of describing
this would be to say that what Dedekind effects is the construction of the
free ring over the integral domain of positive integers. But it is also clear
from the way that Dedekind treats the reals and the natural numbers. In the
case of the reals, he does not define them as Dedekind cuts in the rationals.
Rather, it is simply shown that the cuts have the continuity property that
Dedekind seeks. The reals themselves are then introduced as primitive objects
which correspond to the cuts in the right way. The natural numbers, too, are
taken to be primitive things, things 'abstracted' from some 'simply infinite
system'. It is not that any error is committed by defining the objects in any
of the well-known ways, but rather that, if this is done, the essential structure will be burdened by inessentials, and possibly its relations to other
structures obscured. This is made quite clear in a letter to Weber, written in
January 1888. Dedekind mentions both the possibility of defining the irrationals as cuts, and then also refers to a suggestion made to him by Weber
that one could (as Russell later did) define the cardinal number of a set M
as the set of all N equinumerous to M. Dedekind comments as follows:
If one wishes to pursue your method - and I would strongly recommend carrying it through -
then I would still advise that by the name number [Zahl, Anzahl] is understood not the class itself,
but rather something new which corresponds to the class, something which the intellect creates .
. . . The rational numbers also generate cuts, but I certainly do not wish to pretend that the rational
numbers are identical with the cuts that they produce. And even after the introduction of the
irrational numbers one will often speak of the cut phenomena in such terms, and ascribe to
these cuts attributes which would sound very peculiar [seltsam] if applied to the corresponding
numbers themselves. Something similar holds with respect to the definition of cardinal number
as a class. One wishes to say a great deal about such a class (for example, that it is a system
of infinitely many elements, namely all similar systems), a weight one would be very unhappy
hanging around the neck of the number itself. Does anyone bother to remember that the number
four is a system of infinitely many elements? (Against this, everyone will always be conscious
of the fact that 4 is the child of the number 3 and the mother of the number 5.) For the same
reasons, I have always thought Kummer's creation of the ideal numbers thoroughly justified,
if only it is carried through the requisite rigour. 25
In effect, this is to say that the new irrational numbers are primitive objects
on the same type level, and thus of the same kind, as the rationals, and those
143
of the integers, and so on. Thus, for Dedekind part of the desire to avoid
constructive definitions is to avoid type inflation. Hence, in ceasing to have
the 'irrelevant' properties of the cuts themselves, an irrational number possesses only those properties ascribed by the principles of being a field and
being completely ordered. Things are numbers just in virtue of their satisfying the right properties, and this automatically admits the possibility of there
being different sorts of object satisfying the properties. In regard to the natural
numbers, Dedekind puts the position in a particularly strong way:
73. Definition [Erkliirung]. When, in considering a simply infinite system N ordered by a
mapping <p, one completely overlooks the particular nature [Beschaffenheit] of the elements,
when one only keeps in mind (festhiiltl their differentiability and takes into account only
those relations which they have in virtue of the ordering mapping <p, then these elements are
to be called natural numbers or ordinal numbers or even just numbers, and the basic element
1 is called the base number of the number series N. In so far as this frees the elements from
all other content (abstraction), one is justified in calling these numbers a free creation of the
human intellect. 26
Thus, in Dedekind what one sees is something quite different from Frege's and
Russell's analytical investigation; a halt is called to analysis even when it is
known (or strongly suspected) that the analysis could be carried further. The
reduction, if that is what is, is minimal and not maximal; more importantly,
the nature of the objects is specified by the fundamental principles involved,
and not by the ambient logic.
Hilbert's axiomatic method generalises Dedekind's conception of mathematical theories, the major difference being that, in Hilbert's work, there is
no suggestion of 'creating' elements, or of 'freeing' or 'abstracting' them. 27
There is just the claim that the primitives of a theory will have all and only
the properties attributed to them by the principles given (the axioms). Hilbert's
paper from 1900 on the axiomatisation of real number is an example:
In the theory of the number concept, the axiomatic method takes the following form:
We think of a system of things; we call them numbers and denote them by a, b, c, ....
We think of these numbers in certain mutual relations, whose precise and complete description
is obtained through the following axioms. . . .28
Thus the numbers are indeed to have the same elementariness that Dedekind
seems to suggest; they become, in Hilbert's words, a 'system of their own'
(*1905b, pp. 11_2}.29 The word 'complete' is important in this context, not
because of any allusions to the later problems of completeness, but rather
because it forms part of a clear statement that all we know (and need to
know) about the irrational numbers stems from the axioms. There is no need
to 'free' objects from 'extraneous' properties. Things that fulfill the axioms,
like Dedekind cuts, points of space or equivalence classes of Cauchy sequences,
will often have 'extraneous' properties. But, nevertheless, in doing real analysis
one is interested only in the properties that the axioms lay down. It is these,
and only these, which tell us what the primitives are like, more precisely
how they are related. As Hilbert says:
144
MICHAEL HALLETT
It is quite self-evident, that every theory is only a framework or schema of concepts together with
the necessary relations to one another, and the basic elements can be thought in an arbitrary
way.30
Hilbert is also quite clear that he thinks that this latter is a positive advantage of a theory.31
Dedekind achieves this only partially, for his account still depends to some
extent on the individual nature of the objects concerned. For instance, in the
case of the natural numbers there has to be some specific simply infinite system
from which to abstract (or to which to apply the free algebra construction),
whence the need for the infamous Theorem 66 of Dedekind 1888. In the case
of the irrationals, it is shown that some objects (the cuts in the irrationals)
satisfy the desired properties, thus in short (as we would now put it) that
there is a model of the axioms. But for Dedekind this is not just an interesting metamathematical observation, one made once the system of real
numbers has been set out, since Dedekind actually introduces the primitive
irrational numbers by reference to the cutS.32 Thus, to prove anything about
these numbers, one first has to prove that the corresponding fact is true of
the cuts.
Hilbert's position, on the contrary, puts the emphasis on the axiom system
alone (together with the theorems the axioms give rise to), and dispenses
entirely with external explanation for the primitive elements. It is precisely
this, I think, that Hilbert alludes to in the remark in a letter to Frege that 'point'
will mean something different in different systems of geometry.33 There is
no prior 'pure' notion of 'point' taken on its own that the axiom system
attempts to capture, and which we can say has a reference. All there is is a
wealth of imprecise, intuitive ideas (thus, either not fully articulated, and so
not suited for proper deductive development, or too restrictive), and other more
or less satisfactory axiom systems. This is also, I think, the substance behind
Hilbert's remark to Frege that 'love, law and chimney sweep' would form a
Euclidean geometry provided it could be shown that the axioms (in the right
interpretation) are true of them. 34 The example is instructive, for it shows us
that the primitives can be drawn from anywhere, and in other contexts may
well have structure, thus, not be primitives in this context - chimney sweeps
and Dedekind cuts clearly have a great deal of structure. But from the point
of view of the axioms, that structure is of no importance; but since there is
no mention of it, there is no need to strip it away.
Along with the aim of slimming down axiom systems, there is the parallel
aim of cutting to a minimum the assumptions used in proofs. Much of the
point of this is stated clearly by Bernays in the passage quoted early in the
section. Suppose it is shown that P can be proved from just the axioms
AI' A 2 , , An; and now suppose it is shown subsequently that some quite
unexpected interpretation can be given to the AI, A 2 , , An> for instance
by interpreting the basic entities, not as reals, but as certain kinds of function.
Then it follows without further ado that a version of P must hold for these
145
objects as well. The use of such techniques, a commonplace now, was one
of the reasons why Hilbert's work had such a powerful effect.
We are back, very firmly, to the view stated in the passage from Bernays
quoted at the outset, namely that the axioms can be differently interpreted,
something that Hilbert stresses many times, and something which he takes
to be a beneficial (and liberating) feature of the axiomatic presentation of
theories. Dedekind, it is clear, also appreciated this, hence his desire to prove
categoricity results. But his frequent appeal to "creation" might be seen as
hankering after the uniqueness that constructive definitions guarantee.
Lastly, this makes it quite clear why Hilbert is so insistent (e.g., in his 1900a)
that what he propounds is quite different from the 'genetic method' of
explaining, for instance, the genesis of the number systems in terms of the construction of new elements from existing ones. One problem with this is the one
which worried Dedekind, namely that, as well as the 'right' properties, the
objects constructed will also have properties that are not intrinsic to the structure being characterised. But there is a further worry, namely that the
construction itself requires theoretical treatment, and it may not be a trivial
matter to spell out precisely what such construction entails. This was especially so before the codification of set theoretic principles, a fact which
the antinomies turned into a serious drawback. The rejection of constructive
definitions to some extent dispenses with the need for such theoretical
explanation. 35
2.
146
MICHAEL HALLETT
Indeed, if the axiom system does yield a 'complete and unique [eindeutig]
concept', then according to Hilbert's characterisation of completeness as
Post-completeness, the 'purest nonsense' which arises will be a contradiction. Even if one does hold to a notion of truth for axiom systems, what
this passage points out is the danger of appealing to the truth of an isolated
proposition.
Note again that the problem of consistency arises quite independently of the
discovery of antinomies.
The consistency question already emerges with Dedekind's position, that is,
where mathematical objects are regarded as 'created' according to certain principles (axioms) and on the basis of nothing else. (Thus: are the creations
even 'possible'? See Frege 1903, 139 and 143, English translation in Geach
and Black (eds.) 1966, pp. 174 and 178.) It is quite clear from his correspondence that Dedekind himself was perfectly aware of the requirement of
proving consistency, both for the system of irrational numbers and also for
the system of natural numbers. For instance, the letter of Keferstein from
1890 states quite clearly that the very point of Theorem 66, which attempts
to prove the existence of an infinite set, is to show that there is no 'inner
contradiction in the concept' of a simply infinite system, thus, in effect, the
demonstration of consistency by the provision of a mode1. 39 Curiously, Hilbert
147
and Bernays (1934, p. 15) say that Frege was the first to realise the need to
prove the consistency of arithmetic, by carrying out an existence proof on
the basis of logic alone. In fact, what Frege attempted to show was that the
truths of arithmetic are truths about objects of a certain specific kind, logical
objects. (Hence Frege's sceptical question as to whether consistency is the only
criterion we need to satisfy in a creation: Frege 1903, 144, English translation in Geach and Black (eds.) 1966, p. 178.) That this then demonstrates
consistency is just an automatic consequence, as Frege points out to Hilbert
in the letter of 27.xii.1899. Nevertheless, Frege did think, with Dedekind,
that the provision of a 'model' of the axioms is the only way to prove their
consistency.4O
Although Dedekind's strategy of showing that there is a model of the axioms
(or principles) is weaker than Frege's attempt to show directly that the major
assumptions are true, it still approaches the consistency problem as a problem
of existence, for it tries to show that there are objects about which the axioms
express truths. While Hilbert's axiomatic method builds on Dedekind's
approach to mathematical theories, it does not follow Dedekind in the matter
of consistency, models and existence.
First, Hilbert separates the existence question into two, quite distinct, components. The first can be called the question of internal existence. This simply
amounts to proving within a system ~ a statement of the form 3xA, even by
non-constructive means. 41 Existence in this sense is clearly relative to the axiom
system. The second component is what might be called the question of external
existence (to distinguish it from the above). And Hilbert proposes an answer
to this latter question which is quite different from either Frege's or Dedekind's.
In his letter to Frege of 29.xii.1899, Hilbert puts forward the following thesis
about consistency and existence:
You write: "[I call axioms propositions] .... From the truth of the axioms it follows that they
do not contradict one another". I was very interested to read exactly this, since for as long as
I have been thinking, writing and lecturing on these things, I have asserted exactly the opposite:
If the arbitrarily chosen axioms do not contradict each other with all their consequences, then
they are true and the things defined by the axioms exist. That for me is the criterion of truth
and existence. 42
Hilbert makes it clear (for instance, in his 1900a) that when we give an
axiom system we make an assumption that there are such things as the axioms
say there are. A proof of consistency for the axiom system is then meant as
a justification of this assumption. We might say that, for this view, a proof
148
MICHAEL HALLETT
of consistency will make the internal existence question the only meaningful
one. (The connections to certain well-known later views on existence are
obvious.) In another way of putting it: a proof of consistency is meant as a
replacement for the procedure of modelling which Dedekind exemplifies, which
before Hilbert had been taken to be the only way to show consistency. Again,
it is important to underline the fact that the assertions governing internal
existence in the domain all stem (on this view) from the axioms: nothing
more can be said about this than is sanctioned by the axioms. The external
existence assumptions playa very important philosophical role, as will become
clear later on.44
It should also be stated that Hilbert's position entails something similar to
the type reduction that Dedekind advocated, for it must follow that there are
not, in general, different kinds of (internal) existence. A little after the passage
from 1900b just quoted, Hilbert goes on:
The conception of the continuum, or equally the concept of the system of all functions, exists
then in precisely the same sense as does the system of rational whole numbers or that of the
higher Cantorian number-classes and powers.45
In short, very familiar and well-accepted notions are on the same footing as
abstruse and controversial ones, providing consistency can be established for
the theory that presents them. In the best cases there would also be some
sort of 'natural' integration with previous theories, either through deductive
relations, laws partially shared, or even through various different kinds of
full or partial modelling of the sort used in Hilbert's 1899. Integration was a
very important element in Dedekind's criteria for the extension of theories, and
was also stressed in Cantor's arguments that the transfinite numbers are indeed
numbers on just the same footing as the natural or irrational numbers, and
for Cantor this was clearly a criterion meant to be applied to genuinely new
theories in place of that of constructing a full model. Such integration, when
taken over by axiomatic theories, is the axiomatic correlate of the 'genesis'
exhibited by constructive definitions.
This is intimately connected with Hilbert's attitude to ideal elements, and
it is in fact the very foundation of it. According to Hilbert, an element (say
...J-i) is indeed of a different status (ideal) when first added to the system of
reals. But once one has given a set of laws for the integration of this element
into the previous system, then what were previously ideal elements exist in
just the same sense as do the other elements, providing one can demonstrate
the consistency of the new, expanded system. Hilbert says the following in
his letter of 29.xii.1899 to Frege:
The proposition "Every equation has a root" is true or the existence of these roots is proven,
as soon as one adds the axiom "Every equation has a root" to the other arithmetical axioms
without, in so doing, making it possible to infer a contradiction. 46
And Hilbert makes it quite clear in some lectures from 1919 that the designation 'ideal element' is only a relative one:
149
The tenninology of ideal elements thus properly speaking only has its justification from the point
of view of the system we start out from. In the new system we do not at all distinguish between
actual and ideal elements. 47
Thus, as with Cantor and Dedekind, so with Hilbert - the question of existence boils down to that of the acceptability of the new, more inclusive theory. 48
At first sight, this looks a little odd. Consistency is a logical property of
the entire axiomatic system, for it expresses a (rather crude) fact about the
nature of the deductive relations between the propositions. In the earlier way
of regarding axiomatic systems that Hilbert rejects, what exists or not cannot
be a property of the system, for this is surely something determined by the way
the (abstract) world is, independently of the properties of the systems we
use as a means of expression. The consistency of the system would simply
be a by-product of its expressing truths, and thus subordinate to matters of
truth and existence. For Hilbert, however, it is the other way around: consistency is taken to be a sufficient condition for the truth of the axioms and
thus for the existence of what it is that they postulate.
But it should be clear that Hilbert's position is not so strange, given the
attitude he takes towards the primitive objects of a system. If we know nothing
about the primitives of a system other than what is given through the axioms,
then it makes no sense to ask in addition whether the axioms are expressing
truths about these things. To speak vaguely, if it is the axiom system that
'constitutes' or 'defines' these things, then their existence is surely afunction
of that system, a point underlined by Hilbert's claim that concepts (like 'point')
change as the system changes. 49
There are, of course, many reasons why Hilbert's claim that consistency
implies existence turned out to be questionable, but this is not something which
can be analysed here. 50 Instead, I will look at what it implies about the role
of logic.
In view of what Bernays says in the passage quoted at the beginning, and
what Hilbert himself says in earlier lectures (e.g., those from 1894), it is important to raise the question of whether, when he speaks of 'deduction', Hilbert
could really have meant the notion of logical consequence. After all, he does
use the term 'consequence' frequently. This in tum would mean that when
he talks about 'consistency' Hilbert actually means something like satisfiability, hence actually underlines, rather than repudiates, the need to produce
models of theories. Moreover, Hilbert does sometimes state categoricity as a
desirable condition on the adequacy of an axiom system, even if it is not
given as a necessary requirement. However, the categoricity condition would
be rather empty if the axiom system concerned has no models whatsoever.
More importantly, Hilbert uses precisely his notion of logical consequence
in his investigations of independence among the various axioms of geometry.
For what is shown in demonstrating the independence of an axiom A is that
there is an interpretation which makes all of the other axioms true, and which
nevertheless makes A false. Without further ado, Hilbert says that this shows
150
MICHAEL HALLETT
that A 'cannot be inferred logically from the remaining axioms' .51 Lastly, as
we stressed, Hilbert makes an assumption when setting up an axiom system
that there are such things with these properties, an assumption which has to
be justified with a consistency proof. It seems natural to think, therefore,
that a consistency proof is just a demonstration that there are such things,
for what better justification could there be? (Categoricity, when appropriate,
would then be a demonstration of uniqueness of that model up to isomorphism.)
In this case, there would be no interesting distinction between Hilbert's position
and that of Dedekind.
In some sense, it seems obvious that Hilbert cannot mean these things,
for active mathematics has always been concerned with proof, with the construction of a sequence of propositions constituting a chain of reasoning leading
from the assumptions to the theorems. But it is nevertheless important to see
why Hilbert does not mean these things, especially given the important place
of models of theories in Hilbert's work on geometry. Recognition of these
reasons will serve to underline the connection between Hilbert's view of the
function of logic and his finitism.
Firstly, in the passage from Bernays quoted at the beginning which stresses
the notion of consequence, he (Bemays) talks of 'logically deductive procedures'. At the very least, this indicates, as does Hilbert's use of logical
consequence in connection with his proofs of independence in his work on
geometry, that Hilbert and Bemays slipped easily back and forth between
the notions of deduction and of consequence, presumably believing them to
amount to the same thing. 52
Secondly, Hilbert states quite clearly in his 1900b what is to count as a
solution to a problem:
... I mean above all that it is possible to present the correctness of the answer in a finite
number of inferences on the basis of a finite number of assumptions, which are themselves present
in the statement of the problem, and which can always be precisely formulated. This demand
for logical deduction by means of a finite number of inferences is nothing other than the
demand for rigour in the carrying out of proofs. 53
151
(See 1900b, p. 265, or 1935, p. 300.) This is made even clearer by 1905, where
Hilbert describes the notion of consistency as that of being unable to deduce
both <\> and...,<\> by means of 'logical operations' (*1905b), and speaks also
of deduction as effected by 'logical combinations' of propositions. And his
work of 1905 (recorded in 1905a, *1905b and *1905c) was partly an attempt
to give an axiom system which is simultaneously an axiomatisation of propositionallogic, arithmetic and the notion of infinite set. This was, presumably,
Hilbert's first attempt at giving 'a modification of the known methods of inference' . In developing a system of propositional logic in *1905 b (into the details
of which we cannot go here), Hilbert sketches the first of many normal form
results, one of which is the basis of the proof of the Completeness Theorem
for propositional logic given in *1917-18. If one asks whether a 'correct'
proposition P follows from a given finite set of axioms AI' A z, A 3, , An'
then Hilbert says that there are only 'finitely many proof possibilities', and
so:
With this, we have solved, in the present and most primitive case, the old problem that every
correct result must be rendered by a finite proof This problem, properly speaking, was the starting
point of all my investigations in this area, and its solution in the most general case, thus the
proof that in mathematics there can be no "ignoramibus", must remain the final goal. 54
Given this, it is not surprising that Hilbert discusses consistency and inconsistency in his *1905b in a syntactic way. Inconsistency would allow us to
prove far too much from a system, says Hilbert. He notes that philosophers
in general have not taken sufficient notice of the antinomies and the effect
they have on what was taken to be logic, and he says that this is a mistake,
since:
... from any contradiction, no matter how far removed, we can prove the falsehood of every
correct statement [e.g., as Hilbert says, 2 = 2]. Hence, we could say that one contradiction in
the whole realm of our knowledge [Wissen] acts like a spark in the gunpowder barrel and destroys
everything. Therefore, every science [Wissenschaft] must have an interest in dealing with a
contradiction, no matter how far removed. 55
152
MICHAEL HALLETT
talks, not of a relation of logical consequence, but rather of the 'finite and
closed system of axioms for the real numbers' (l900a). For instance, in 1900a,
Hilbert writes:
Under the conception described above [Le., the axiomatic conception], the doubts which have
been raised against the existence of the totality of all real numbers (and against the existence
of infinite sets generally) lose all justification; for by the set of real numbers we do not have
to imagine, say the totality of all possible laws according to which the progression of elements
in a fundamental sequence can be specified, but rather - as just described - a system of things
whose mutual relations are given by the finite and closed system of axioms I-IV, and about which
new statements are valid only if one can derive them from the axioms by means of a finite number
of logical inferences. 56
This passages reveal very clearly that Hilbert did not think that modelling
was primary, for they state quite explicitly that it is the possession of the
finite (or 'finite and closed') axiom system which enables us to avoid considering whether there is a 'totality' (i.e., something independent of the axiom
system) with the properties desired. To see this, we have to remember three
important factors in the background to Hilbert's remarks.
The first is the significant opposition both to the full theory of real numbers
(for instance, the theory of Dedekind cuts), to geometry (with its use of
'arithmetical' continuity principles), and to Cantor's transfinite numbers, all
of which Hilbert sought to defend. Indeed, he remarks in a passage from his
1900b which has already been cited (above, p. 145) that a proof of consistency for the system of real numbers would defuse 'all objections which have
hitherto been raised against the existence of the totality of all real numbers',
and this remark would make no sense if it just meant consistency in the sense
of providing a model for that system, for the main objection was precisely
to such completed infinite totalities. To underline this, Hilbert makes it clear
in the passages quoted above, that it is the finite nature of the axiom systems
involved together with the finiteness of proof (and, of course, a consistency
proof for the system) that makes the concepts involved unexceptionable, even
when these systems are apparently of a radically infinitistic nature. The clearest
statement of this appears in Hilbert's unpublished notebooks:
Through the Archimedean and my completeness axioms [for Euclidean geometry or the reals
respectively], the ordinary continuity axiom is divided into two completely different components.
Moreover, with my completeness axiom, not one infinite process is demanded, but we have
only a finite number of finite axioms, just as Kronecker demands. 58
153
The second thing that must be remembered is exactly what the explicit
construction of the system of real numbers from the system of rational numbers
entails, namely, in one form or another, the concept of transfinite set, and
Hilbert was well aware that no consistent axiomatic presentation of this was
then available. Part of the goal of the axiomatic method and the stress on
consistency was to make all questions of mathematical existence (even in
transfinite set theory) as uncontentious as that for the most basic systems,
say that for natural numbers. To appeal to explicit construction would not,
therefore, be an effective way of appeasing the critics of the theories of real
number.
There is a more general point here about the provision of interpretations
for theories. When one provides a model of a theory r, what one actually
does is to show that certain objects of another theory :E can be used as an interpretation of r. This depends on existence proofs carried out within :E, thus
on objects which are shown to exist by the theory :E in the internal sense.
But, of course, this says nothing about the consistency of :E itself. To be even
more explicit, there is reliance on another theory A (which might be :E itself)
which can describe both :E and r, and in which the construction takes place,
thus where the translation function exists. If mathematics is to aspire to selfcontainment and self-sufficiency, then it is surely incumbent on it to be able
to describe mathematically what takes place with such constructions, thus to
be specific about the theories in which these constructions can be carried
out. If the sole account of consistency available is what we now call satisfiability, then this suggests that there must be some uniform (and well accepted)
theory in which this can be done. But what are such theories? In 1900, the only
candidate for such a universal theory was Frege's, and this was demonstrably
inconsistent.
The third thing to be kept in mind is specific to the problems with Cantor's
transfinite set theory. In setting out a theory of the transfinite, what could
Cantor possibly have given as a model, and out of what body of accepted
material could this have been created? What is required instead is presumably something rather like what Hilbert later called the method of ideal
elements. Hilbert recognised that it is often necessary to make radical extensions to mathematical theories by adjoining new "objects", with no prior
guarantee that this can be done consistently, and for which consistency then
has to be proved. Of course, one might think this should be proved by explicit
construction wherever possible, as in the cases of Dedekind cuts or the Gauss
complex numbers or the von Staudt construction of points at infinity as equivalence classes of parallel lines. But if we do not possess a theory which we
think is universal for the provision of models, then it is far too restrictive to
insist that it be possible to model the new principles by constructions based
on elements already present. Cantor's work furnishes the prime example. What
this starts from is the desire to add transfinite ordinals to point set theory
without any prospect that the result can be modelled. Focusing on syntactic
consistency is one suggestion as to how this problem, and this apparent restric-
154
MICHAEL HALLETT
tion, can be circumvented. The primary goal is then to set out clear axioms,
and to show how the resulting system extends (or is related to in some
other way) older, established systems. The next goal is to achieve a proof of
consistency, by modelling if possible, by some other means if the prospects
for modelling look hopeless. More or less the same considerations arise
when mathematicians want to proceed in some area of research by adding a
hypothesis whose status is not yet clear. The example Hilbert often cites is
analytic number theory and the Riemann Hypothesis, an example of "an
axiomatic postulation of an as yet unproved proposition' (* 1920a, p. 35).
And while insistence on the exhibition of a model as a uniform condition
is difficult enough, appeal to logical consequence as the fundamental positive
notion is even more problematic, for while Hilbert had a clear sense of what
it means to give an interpretation as early as his work on geometry in the 1890s,
and which is set out clearly in quite general terms in *1917-18, it was not clear
what it meant to quantify over all interpretations. For instance, the 1917-18
lectures (p. 131; see also Hilbert and Ackermann 1928, p. 45) state that an
interpretation is always based on one or several restricted 'species of objects'.
And, as we will see later, a domain of quantification for Hilbert (thUS a 'species
of object') must always be associated with an axiom system governing the
behaviour of the objects in the domain. What then is the domain of all domains,
thus what are the axioms governing all interpretations?59 Moreover, referring
to all interpretations which make the axioms true would thus not have made
much sense for anyone wanting to defend mathematical theories with infinitary content against those suspicious of such notions. This is presumably
why, in his 1929, Hilbert says that taking categoricity as the notion of completeness 'does not satisfy the demands of finite rigour' (1929, p. 139, 1930,
p. 6). What is really important about Hilbert's use of interpretations is just
the acceptance that it is part of the nature of the mathematical enterprise that
theories often have different, and surprising, interpretations, indeed in domains
with which we are quite familiar. Thus, we can show the independence of a
proposition from others by means of interpretations; but logical consequence
cannot be a replacement for deducibility. Hilbert might well have thought
that 'consequence' and 'derived proposition' are co-extensive or at the very
least that the latter entails the former. But the point remains that this had not
been shown, nor had the question been made precise, and hence that he could
not operate with the former in any official version of his account of the
nature of mathematics.
What emerges from these two sections about the scope and position of logic
in Hilbert's system?
First, there was the negative point. Logic for Hilbert cannot have the
function of a linguistic 'microscope' that it has for Frege and for Russell,
for there is no stock of ultimate primitives from which all mathematical propositions in their final, correct form are composed.
Second, the last section made it clear that Hilbert's theory of the axiomatic
155
taken in our thought satisfy these axioms. These are the simplest of all laws of thought. Most
of the time, one does not state them as such, since thought, with respect to such organisation,
is not conscious of any sequence. Only in writing down and in expression is such a sequence
necessary.61
156
MICHAEL HALLETT
But what should such an articulation look like? The history of this is
involved, not least because of Hilbert's reaction to the antinomies, which
itself is complex and various. What I will concentrate on here is just one
effect of the antinomies, namely in reinforcing the tendencies that have already
been identified in this essay.
3.
In his 1905 lectures, Hilbert suggests first that he thinks the primary difficulties behind the paradoxes reveal a problem with the notion of totality
[Gesammtheit]. For instance:
One recognises immediately that the contradictions rely on collecting together [Zusammenfassung] certain totalities to a set, collections which seem not to be permitted. But nothing is
achieved simply by saying this, since all thought depends on such collection. The problem
here is rather that of distinguishing the permissible collections from the impermissible. 63
And later on, Hilbert draws a connection between this and the logical notion
of 'all':
... [T]he most difficult concept is the concept 'all' or 'every', since through its use all the
contradictions known to us arise, at least if one applies it in the traditional ways .... 65
At various places in these lectures, Hilbert is also reported as saying that the
root of the difficulties arise with the ('unscrupulous') use of 'all' in the
'traditional' or 'old' way in the formation of concepts and, through them,
classes.
From Hilbert's point of view c. 1905, it seems fairly straightforward to
say how the 'all' of universal quantification ought to be dealt with, for this
is signalled in his attitude to the system of real numbers of 1900 and is confirmed by his later reaction to the difficulty seen by Weyl in the use of
impredicative definitions. The legitimacy of using object-quantifiers in an unrestricted way simply reduces to the question of whether there exists a domain
of objects over which the quantifiers range, what Hilbert and Bernays later call
a 'fertige Gesammtheit', a 'ready made' or 'given' totality which endows
the quantified statements with a perfectly good sense. 66 The kind of existence involved here is just existence in the second, external sense; the
quantification relates to a given, fixed domain, and the existence of this domain
only makes sense in the context of an axiom system supported by a consistency proof. (The axiom system, to use the terminology Hilbert sometimes
employed, is an 'implicit definition' of the concept or domain in question,
be it Euclidean geometry, or complete ordered field, or the universe of sets.)
Something like this is implied by what Hilbert says about the existence of
157
the real numbers in his 1900a and 1900b, thus goes back to the beginnings
of Hilbert's concern with consistency. In explaining the axiomatic method at
the beginning of 1900a, Hilbert says (his example is again geometry):
Here, one begins with the assumption of the existence of all the elements, i.e. one puts at the
outset three systems of things, namely points, lines and planes, and then establishes relationships between these elements through certain axioms .... 67
Then having set up an axiom system for the reals, which he is convinced is
consistent, Hilbert concludes that this defuses 'the objections raised against
the existence of the totality [lnbegrif.f] of all real numbers and against infinite
sets of any sort'. In his 1900b he says that a consistent axiomatisation of the
real number system will show that 'the concept of the continuum exists in
the same sense as the system of rational numbers' .68 It should also be noted
that this procedure is at the core of what Bernays calls 'restricted Platonism' ,69
and it is also what is behind Hilbert's designation of the quantifiers in the 1920s
as 'transfinite elements'.
In effect, one can see in this doctrine of Hilbert's the same principle that
is behind one of Cantor's arguments for the necessity of actual infinites in
mathematics, namely that whenever it appears as if there is only a potential
infinite, the use of unbounded quantifiers shows that there is in fact an actually
infinite domain which constitutes the universe over which the quantifiers
vary. Hilbert adopts a version of this principle, namely: whenever we use
quantifiers, we must be assuming the existence of afertige Gesammtheit over
which the quantified variables vary. Hilbert's doctrine of external existence
adds to this the important modification that this existence assumption, in
tum, must mean that the use of the quantifiers is accompanied by axioms
governing the domain of quantification. It is these axioms, and these alone,
that should govern the existence proofs for objects inside the domain.
In this regard, Hilbert's early general statements concerning set theory
are confusing. He ends his 1900a by saying that an existence proof of the
kind he believes he has given for the reals cannot be given for the Cantorian
system:
If we wanted to present a proof in the same way for the existence of a totality of all powers
(or Cantorian alephs), then we would not succeed. Indeed, the totality of all powers does not
exist, or - to use Cantor's way of expressing it - the system of all powers is an inconsistent
(or unfinished [nichifertige]) set. 70
And even in 1905 (as we have seen) he says that the 'totality of all sets' is
'impossible' .
Hilbert's reference to Cantor is explained as follows. In 1897, through
conversation and two subsequent letters ,71 Hilbert had learnt about Cantor's
distinction between 'consistent' and 'inconsistent' (or fertige and nichtfertige) Mengen or totalities. Assuming that a set is just any 'well-defined'
collection, thus a collection for which it is clear whether or not any object
belongs to it, then Cantor's distinction (given most explicitly in the second
158
MICHAEL HALLETT
letter mentioned in the previous note) can be stated in the following way: a
fertige Menge is one such that 'all its elements can be thought of as being
together' or that it can be 'thought of as constituting a composite thing'.72
The nichtfertigen Mengen are for Cantor' Absolute' infinites, and are therefore not subject to the normal mathematical operations which can be performed
onfertigen Mengen. Given this, Cantor argues that the set of all alephs cannot
be afertige Menge, since if it were it would be numerable and have an aleph
as cardinal number, and the well-known contradiction would follow. Cantor
uses this as the basis of a reductio, arguing that if there were a power which
is not an aleph, then all the alephs could be injected into this power, and
there would then be afertige Menge of all alephs, which is impossible.?3 Hilbert
seems to acknowledge Cantor's conclusion in the first sentence of the passage
quoted.
Hilbert substitutes Cantor's vague 'can be thought of as existing together
without contradiction' with the precise 'can be consistently axiomatised',
which, as we have noted, makes the concept of existence relative to the
axiom system concerned. Given this, the correct modification of the conclusion of Cantor's argument would be that there can be no consistent
axiomatisation of sets and cardinalities which allows the collection of all alephs
as afertige Menge, i.e., as a thing which is subject to all the same mathematical
operations as normal sets, including the alephs themselves, thus as a thing
inside the domain. The argument would not show that the totality of all alephs
'does not exist' (thus externally), as Hilbert in fact says, only (as Hilbert
also says) that it is a nichtfertige Menge, thus does not exist internally.
Hilbert's statement in 1900b does not exhibit quite the same confusion,
but the statement is equally misleading:
The concept of the continuum or of the concept of all functions exists in just the same way as,
say, the system of whole rational numbers or even of the higher Cantorian number-classes and
powers. For I am convinced that the latter can be shown to exist in the sense I have indicated
just as clearly as the continuum can be shown to exist. This is in opposition to the system of
all powers absolutely, since for this it can be shown that a consistent system of axioms in my
sense cannot be set up. Consequently, therefore, the system of all powers is not a mathematically existing concept, according to the way I use these terms. 14
Here, Hilbert states a conjecture that the totality of all alephs can be shown
to exist in the same sense that he thinks the totality of all real numbers
exists, namely that it is possible to give a consistent axiomatisation for this
totality. But Hilbert seems to ignore Cantor's assertion that the totality of all
powers is identical to the totality of all alephs, for he says in effect that there
must always be a contradiction in the assumption that there is a totality of
all powers.
It is quite possible that Hilbert's lack of endorsement in the second statement of Cantor's proof that all powers are alephs reflects a subsequent doubt
that Cantor's proof is adequate. Indeed, it is quite clear from the section of
the same paper dealing with the continuum problem (Problem 1, discussed two
pages earlier) that Hilbert does not regard the well-ordering theorem as proved.
159
But in this case, what Hilbert ought to have stated is: (l) his conjecture that
we can give a consistent axiomatisation of the system of all alephs, and thus
that this totality exists in the external sense of existence (thus, in the same
sense as the totality of reals can be said to exist); (2) that Cantor has conjectured not just that every set can be well-ordered but also that all powers
are alephs, a conjecture for which as yet there is no adequate proof; and that
(3) it can be shown that there is nofertige Menge of all alephs (or all powers),
which is the same as saying that there is no Menge of all alephs inside the
totality of all alephs or all powers. Thus, in sum, the natural position for Hilbert
should have been that there exists (external sense) a finished domain of all
the Cantorian powers and number-classes (alephs) (each of which exists in
the internal sense), but that this domain itself does not exist in the internal
sense as afertige Menge.
What I call Hilbert's 'natural position' is close to that adopted towards
set theory by Zermelo in his 1908. Zermelo's axiom system begins with the
statement:
1. Set theory has to do with a "domain" \B of objects, which we will call simply "things", and
among which are the sets. 75
To show that a set exists, says Zermelo, is to show that it belongs to the domain
~, which involves just the internal sense of existence based on a proof from
the axioms of a statement of the form 3xA, where ~ is taken the domain of
quantification. Zermelo then shows quite straightforwardly that there is no
set inside the domain ~ which contains all the objects of the domain ~, for
he simply turns the Russell-Zermelo paradox into a reductio which shows
that no set can contain all sets. Then:
From this theorem, it follows that not all things of the domain \B can belong to one and the
same set; i.e., the domain \B is not itself a set. 76
Thus, the issue becomes not whether there is a fertige Gesammtheit of all
sets (this is just Zermelo's domain ~, or some of it), but rather whether we
can prove that there is an object in the domain (thus, capable of being a member
of other objects) which has this property.77 The demonstration of the existence of m itself is a separate issue, which for Hilbert would be left to a
proof of the consistency of the axiom system. 78
This points up the worry much more precisely, and makes it easier to understand Hilbert's vacillation with respect to sets, not just in 1900 but even in
1905. In the first place, in 1900 there was no axiomatisation of set theory which
does what Zermelo's promised to do, thus no means of specifying set theory
which does not rely on the universal conversion of any collection into a set
(or the association of any concept with a unique object). The danger that Hilbert
sees is clearly that the existence of the domain of all the number-classes
would automatically entail the existence of a set of all these classes, and
thus contradiction. Indeed, the possibility of this unrestrained conversion is
160
MICHAEL HALLETT
what Hilbert is concerned with in the passages from *1905b quoted above, and
Hilbert's major criticism of Frege's system is precisely that it allows a general
form of concept-to-object conversion which leads to proofs of the existence
of too many objects. 79
What poses the problem, and what therefore introduces much of the hesitancy, is that Hilbert believed that 'all thought depends on ... collection' of
objects to a set. Thus, given the way that Hilbert associates logic with the principles that we actually use in mathematical thought, there is a strong temptation
to take this as a logical law, a 'Law of Thought' .80 Indeed, a version of this
view persists in the discussion in Hilbert and Bernays 1934 where it is stated
that fairly free operation of this sort of collection is fundamental to mathematics, as for instance, in the use of the fertige Gesammtheit of all natural
numbers as a set which can then act as the basis for further constructions. 81
Indeed, the view survives not only in the assertion in *1905b that a successful axiomatisation of the concept of natural number shows this latter to be a
'thing', but also indirectly in the view from 1926 (and earlier) that adding
the quantifiers is an instance of the method of adding ideal (or transfinite)
elements.
Given this, Zermelo's axiomatisation of set theory clearly addresses part
of the problem, for what it does quite consciously is replace wholesale collection-to-set conversion by piecemeal set construction principles, and aims
to do this in a way which is mathematically sufficient, that is, is such that
all those conversions which basic mathematical activity requires can be carried
out, but which proscribes the collections responsible for the paradoxes. In other
words, mathematical set construction is no longer the business of logic; if
Zermelo's axioms are the only principles available, it appears that the inference from the existence of the domain to the existence of a set of all sets is
blocked. 82
This makes it clear that Zermelo's theory of set existence not only puts a
good deal of stress on proving the consistency of the axiom system (thus,
on showing the existence of the underlying domain of quantification in
Hilbert's sense), but also on the analysis of the principles of reasoning themselves ('the logical foundations' of the disciplines in question: *1905b, p. 191),
for it shows that one natural 'Law of Thought' actually has no place in the
logical framework, whatever the temptation to adopt it.
But there is a further problem, for principles akin to Frege's Law V are
not the only problematic ways of specifying objects. Hilbert poses two questions about the theory of real numbers in his 1905 lectures. The first is: Why
is the totality of all sets an impermissible totality, while that of all real numbers
is not? The Zermelo system attempts the beginnings of an answer to this
question. The second question is quite different and is provoked by arguments like that of Richard 1905, to which Hilbert devotes a good deal of
attention, both in his 1905 lectures and later. One way of specifying a real
number is to give a 'law' (Hilbert's term) for a binary expansion. Given this,
Hilbert asks:
161
What is a genuine law, e.g., for a sequence of numerals in the binary expansion for an irrational number? When is a mathematical problem correctly and clearly posed, so that we must
demand the possibility of a clear answer? Why is it, e.g., a clear question whether Mascheroni's
constant 2[2 is rational or irrational, unlike the question of whether there is a totality of binary
expansions expressed in a finite number of words?83
The latter, as Hilbert shows, using essentially the same argument as Richard
(see Hilbert *1905b, pp. 201-3, *1905e, pp. 128-30), gives rise to contradiction by an application of Cantor's diagonal method, 'one of the most
beautiful proofs in set theory', as Hilbert calls the latter (*1905b, p. 196;
*1905e, p. 125). Hilbert sees a large part of the perniciousness of the contradictions, not only in the fact that they undermine, indeed trivialise, whole
theories, but also because they undermine particular, central and 'beautiful'
proofs, like Cantor's diagonal proof. 84
While Zermelo's axiom system begins to tackle the first question, it is not
clear that it addresses the second. One of Zermelo's axioms (Axiom III, the
Aussonderungsaxiom) is designed to deal with properties in a quite general
way, at least properties that are 'definite', as Zermelo calls them. Zermelo's
characterisation is as follows:
4. A question or statement @ is called "definite" when it can be decided in a non-arbitrary
way by the basic relations of the domain, using the axioms and the generally valid logical
laws, whether the question or statement is valid or invalid. In the same way, a "class statement" @(x), in which the variable term x ranges through all individuals of a class Sf, is called
"definite" when the statement is definite for every individual of the class Sf. 85
Zermelo says clearly what the two 'basic relations' of the domain are, namely
membership (the 'E'), and the identity relation. But nothing is said about quantification over properties, and the reference to classes and "class statements"
is enough to raise at least a suspicion of circularity, and consequently to raise
the spectre of the type of definition of objects present in antinomies such as
Richard's. This shows that, unless we operate with a precisely circumscribed
(and inflexible) descriptive apparatus when discussing uncountable collections,
then we will fall into contradiction, certainly if the notion of 'definable' is
allowed to appear in a definition. And it is this sort of thing which Zermelo
does not clearly forbid.
Zermelo himself is insistent that the definitions which appear in these
antinomies are excluded by his notion of 'definite', just as the Aussonderungsaxiom itself excludes direct construction of 'sets' of the type used in Russell's
antinomy. His argument, however, amounts to nothing more than the rather
uncertain claim that such notions as 'definable in a finite number of words'
simply cannot be 'definite' because this must be decided by the 'basic relations of the domain' (and the 'generally valid laws of logic,).86 Others saw
the matter differently, not least Weyl in his 1910. 87 Weyl's analysis was clearly
of profound importance to Hilbert.
In his 1910, Weyl puts forward the position that all definitions in mathematics should be direct definitions, and not 'implicit definitions' through axiom
systems, and that set theory (in something like the form presented by Zermelo)
162
MICHAEL HALLETT
ought to be the sole source of these direct definitions. The reasons he advances
for the choice of set theory as the universal provider of material are, crudely,
that the involvement of set theory is, at some level inevitable (certainly if
one favours direct definition), and that it is the 'discipline that stands closest
to logic'. Thus:
So set theory appears to us today, in logical respects, as the proper foundation of mathematical science, and we will have to make a halt with set theory if we wish to formulate principles of definition which are not only sufficient for elementary geometry, but also for the
whole of mathematics. 88
What is important about this proposal in the present context are not Weyl's
reasons for it, but rather the way that it deals with the Richard antinomy,
since, for Weyl's proposal to be in any way feasible, set theory as originally
presented by Zermelo must be reformulated so as to avoid any suggestion of
antinomies like Richard's. What Weyl suggests is the use in set theory of a
circumscribed notion of concept or relation formed from the basic relations
(Zermelo's 'E' and '=') by finite iteration of certain recognised operations.
What precisely these are is left a little unclear by Weyl in 1910, but it is
made explicit in 1917 (pp. 4-8); they are, in effect, just the logical operations -', 1\, V, V and 3 (treated as quantifiers over the objects). Thus, the
proposal is to replace Zermelo's 'definite property' with (in effect) first-order
predicate in the language of set theory. There is no possibility of reproducing
Richard's paradox, since the defining description employed to get the contradiction can only be reproduced if one can form a predicate which involves
quantification over, or reference to, predicates or propositions (or 'definitions')
as well as the objects. This simply cannot be done if Weyl's proposal is
followed.
Weyl makes three points (1910, p. 300) by way of justification for
addressing Richard's antinomy in this way.
First, rather in the spirit of Hilbert, Weyl says that it is a mistake to think
that the objects themselves are of primary importance. What matters, rather,
are what he calls the 'concepts' or 'relations'. It follows from this that to
address the Richard paradox properly means focusing attention on the way
concepts or relations are defined, for a proper account of concept formation
will tell us at the same time which definite descriptions of objects are, and
which are not, legitimate, and thus will answer Hilbert's question from 1905
as to what are the proper laws.
Second, we cannot say generally what concepts and relations are without
first saying what are the fundamental concepts and relations on the basis of
which they will be defined. There are always just a finite number of these basic
relations and concepts, and the way they behave is laid down through the
axioms of the discipline in question. In a sense, this is just a variant of the
Hilbert view that the nature of the domain (and thus, in particular, quantification) is governed by the axioms given. For example, one might view the
axiomatisation of Euclidean geometry either as an axiomatisation of points,
163
164
MICHAEL HALLETT
and to which we stand in an immediate, direct relation. And this is an assumption common to all use of language. The stress on signs is something Hilbert
shared with Frege. The use of signs, says Frege, is essential in order to free
ourselves from the 'sway of the inner world of ideas'. As Frege goes on:
Since the concept is something unintuitable, then it requires an intuitable representative in
order to appear [erscheinen] to us. In this way, the sensible makes accessible to us the world
of the non-sensible. 89
But what Hilbert noticed is that it is the use of signs in this way that constitutes the finitisation of all mathematics.
Given all this, it is then not surprising to find that the stress on signs is
at the root of Hilbert's conception of logic. Indeed, since logic is taken to
be a codification or an abstract reflection of the Laws of Thought, of the
way that minds actually reason, it is of the essence to be clear what it is ultimately that minds reason with, and this he thinks is the sign in combination,
the things we can 'write down and experiment on with chalk and pen'. In
his *1905b, this takes the form of
... an Axiam afThought, or as one might say, an Axiam afthe Existence afan Intelligence, which
can be formulated approximately as follows: I have the capacity to think of things and to denote
them through simple signs (a, b; ... , X, Y, ... ; ... ) in such a fully characteristic way that I
can always unequivocally recognise them again. My thinking operates with these things in this
designation [Bezeichnung] in a certain way according to determinate laws, and I am capable of
learning these laws through self-observation, and of describing them completely.90
165
This exerts further pressure in the direction of a minimal logic, for logic
described in this way gives the form of a descriptive apparatus while remaining
neutral as to content. The specific mathematical assumptions of the individual theories themselves should be kept separate from this framework.
What this amounts to, for Hilbert, is that the presentation of a mathematical theory splits into two distinctive elements, the constructive element,
which deals with the constructive generation of the formulas, and what might
be called the existential element, which deals with the axioms and therefore
with the existence of the domain(s) of quantification. Both terms are apposite.
Weyl's procedure is a perfectly constructive one, indeed, this is his term from
1910. Each concept (for Weyl) or each formula (for Hilbert) is itself a finite
object constructed in a thoroughly finite way by iteration from the finite number
of primitive concepts or formulas taken as starting point using only the four
or five principles of combination (Weyl's 'definitional principles'). In the
lecture notes *1920a (in remarks added by hand), Hilbert actually assimi1ates the genetic method to the constructive, for this, too, is based on a form
of generation of objects not previously constructed. He also calls the axiomatic
method of specifying objects the existential method of definition, which makes
the term 'existential' here appropriate. 93
This finally puts Zermelo's set theory (and the set theoretic assumptions
used in presentations of arithmetic) in a modern light, divorced from any
suggestion that it should be part of logic. It represents a departure from
Hilbert's position of 1905, where (in both 1905a and *1905b) he attempts to
axiomatise logic, arithmetic and the notion of 'infinite set' simultaneously.
Building on Weyl's suggestion, logic, indeed, becomes minimal, a calculus
for reasoning on the one hand (as Hilbert had recognised from the beginning), and a neutral tool for forming meaningful expressions on the other,
without any explicit content of its own.
Given the rigid separation of the two notions of constructive and existential, it is now possible to say more precisely what Hilbert thinks goes awry
in Richard's paradox. The account is a fairly straightforward application of
something like the vicious circle principle, and goes roughly like this. The
objects used to give definitions of sets or real numbers are the formulas, which
are themselves constructively presented, formed from symbols in the now
standard way, a given formula being constructed at some finite initial stage
in the generation of the domain of all formulas. This domain is certainly a
definite domain in the strong sense that it is a decidable question whether
an object belongs to it or not.
Suppose now we permit second-order quantification as one of the operations of formula formation. The central question for Hilbert is what this
quantification is quantification over. Now, says Hilbert,
... there is a difficulty.
We have to ask ourselves the question, what does it mean when we say "There is a predicate P"? In axiomatic set theory, the "there is" always refers to the domain \B we take to be
there at the foundation. In logic, we could indeed think of the predicates as collected together
166
MICHAEL HALLETT
to a domain. But this domain of predicates cannot he considered as something given from the
beginning; rather, it must be formed through logical operations. Only through the rules of
logical construction is the predicate-domain subsequently determined.
And now it becomes obvious that, in the rules of the logical construction of predicates,
reference to the domain of predicates cannot be permitted. Otherwise there would indeed be a
circulus vitiosuS. 94
In other words, since the domain of predicates is given only through the
constructive generation process, it would be illegitimate in the course of
this generation to refer (via unrestricted quantification) to the totality of all
predicates.
This does not mean, of course, that we should disallow second-order quantification under any circumstances. For example, we might want to think of
predicates as referring to properties or propositions or subsets, and then it is
natural to make general assertions about these involving quantification over
them as well as over the underlying objects. (There will, of course, be an
extra clause governing the second-order quantifiers in the principles for the
generation of formulas.) But given Hilbert's strictures about quantification,
such a procedure must be supported by axioms specifying the domain of the
second-order quantification. This is implicitly what we do anyway when we
use second-order quantifiers, for instance when we say, in interpreting the
formulas, what is the domain D(2) that the second-order variables range over
given that the first-order variables are to range over the domain D(l). (Note that
we normally rely on some specific set theory to assure us that this D(2) exists.)
But such quantification is not quantification over the formulas themselves;
indeed, it would be utterly inappropriate for Hilbert to take it as such, and
for two reasons. The first is that formulas in Hilbert's later conception of
logic are just a formal characterisation of what it is that the mathematician
actually deals with, thus they play the role in the logical framework that the
diagrams and formulas play in the informal framework. This suggests that
formulas are not something we make existential assumptions about. We actually
have them, or rather construct them in operating the mathematical theory.
This leads to the second reason, namely that we actually have no need of
any axiomatic characterisation, for the generating principles for formulas
capture directly the objects concerned, and indeed their constructive nature,
for they reflect the restricted number of elementary ways in which we actually
form new formulas from existing ones. But since there are no existential
assumptions here, it is simply not appropriate to have quantifiers over the
predicates while specifying a predicate, and straightforwardly unnecessary.
Consequently, a formula object with a quantifier over formula objects would
make indirect reference to objects that are only generated at some subsequent stage in the construction of formulas. It is precisely this that is the
mistake in the pseudo-formula used to 'define' the number that gives the
Richard paradox.
Hilbert is particularly clear about this in the lecture manuscript *1917-18.
Having extended the logical calculus to that of second-order, and then devel-
167
oped versions of the known paradoxes in this extension, he says that the trouble
must lie in assumptions that were made in the extended calculus, the ordinary
(first-order) calculus being provably consistent. He says:
In the original functional calculus, we took a system or several systems (species) as given
from the beginning, and by referring to these totalities of objects, the operation with the
variables . . . was given a significance. The extension of the calculus now consisted in
regarding statements, predicates and relations as types of object, and, according to this, allowing
symbolic expressions whose logical significance demands reference to the totality of statements resp. functions.
This procedure is in fact dubious in the following way. Those expressions which obtain
their content through reference to the totality of statements resp. functions are then themselves
counted among the statements resp. functions, while on the other hand, before we can refer to
the totality of statements or functions the statements resp. functions must be considered as
determined from the beginning. Here there is a kind of logical circle, and we have grounds for
the assumption that this circle is the cause of the paradoxes. 95
168
MICHAEL HALLETT
tions, although what concerns us most here are more the logical objections
to Russell's ramified theory, more specifically to the Axiom of Reducibility,
objections closely related to the issue of quantification which we have just
elaborated, and which therefore help to elucidate this.
Suppose one develops a view of sets as properties, something which Hilbert
explores in his *1917-18 notes and in Hilbert and Ackermann 1928. The appropriate formula U(x) serving to define a union set will be higher-order, namely
3P[P(x) 1\ <I>(P)]. A constructive attitude towards sets will be reflected here
in a constructive attitude towards the defining properties; in this case, the
complex predicate U(x) must stand for a different kind of object from any
of the predicates over which one has quantified, for it makes reference to a
domain not available in the construction of any of the objects quantified
over, and given only by the constructive presentation of the predicates (sets).
In this case, to take the standard example, the least upper bound of a set (a
union) will not be an object of the same kind as the elements in the set, since
its defining predicate is of a different order from that of its members. This,
for Hilbert, would be an unacceptable basis on which to pursue analysis, and
is sufficient reason to abandon the constructive attitude towards sets or their
defining properties. Russell's method of overcoming the difficulty is to invoke
the Axiom of Reducibility, which assumes that there is an (unspecified) propositional function which is extensionally equivalent to the complex predicate
U(x), and which contains no quantification over higher-order entities, and therefore can stand for the same kind of object as the members of the set. But to
use quantification here, says Hilbert, is also tantamount to the abandonment
of the constructive view, thus abandonment of the view that one is dealing with
predicates (sets) in the original constructive sense, thus with expressions built
up in our language of orders from the base formulas. Russell's invocation of
Reducibility, says Hilbert, is an assumption of theory, namely an assumption
that there is a domain of objects of some sort (say, concepts or properties or
sets), a domain which has to be specified by laying down axioms. As Hilbert
puts it:
In doing this RUSSELL returns from the constructive to the axiomatic viewpoint. 98
But it was only the constructive view that led to the difficulty in the first place.
If it is to be abandoned, then it can surely be done by a much simpler system
of set theory than that Russell presents.
4.
CONCLUSION: FINITISM
In this last section, I want to look briefly at some of the respects in which
this development of logic is consonant with Hilbert's finite Einstellung, and
indeed in such a way as to point in the direction of Hilbert's programme.
The first thing to emphasise is that what drives Hilbert's reaction to
Kronecker's treatment of the foundations of mathematics is the attempt to
explain the transfinite mathematics of Cantor, Dedekind, and that which it
169
170
MICHAEL HALLETT
171
What is explicitly stressed in this passage is the operation with symbols, and
not so much their use in representing mathematical content to the mind; indeed,
it is claimed that we are able to verify things about elements of this formal
language by operating on them (in the ways Hilbert states) because of their
discrete, finite nature. Means of verification in mathematics must always be
constrained by the way the system is given; for example, proof in set theory
is constrained both by the formal framework (rules of inference) and by
the system of axioms. However, the objects of a (or any) formal language
are not given by an axiom system in such a language; so the constraints on
verification here are different. It is in their very nature that they are constructively (recursively) presented, so the means of verification must be
determined (and constrained) solely by the constructive (recursive) definitions.
Note that this must be the same for all mathematical languages, and therefore for the formal representation of all mathematical languages, and therefore
for the formal representation of all mathematical theories. Let us call the
kind of reasoning involved here finitary reasoning. Thus, what we see is that
finitary reasoning (whatever the correct description of it) is determined by
Hilbert's way of explaining all mathematical theories as finitist.
Hilbert's 'finitary conjecture' now has two parts. The first part states that
this finitary reasoning is precisely the same as that which is required to carry
out proofs in a certain primitive arithmetic. The second part of the conjecture is that such reasoning (and therefore such arithmetical reasoning) can
be used to establish the consistency of all theories. About this second part,
I will say nothing here; but I do wish to say a little more about the first
part.
That operation with a logical calculus is not just 'combinatorial', but
depends explicitly on arithmetical knowledge was stressed by Poincare in
his 1905, specifically that any operation with a logical calculus must involve
what Poincare calls the 'principle of recurrence', by which he seems to mean
ordinary mathematical induction. (Poincare's point is repeated by Weyl in
his 1910, where he sees it as a difficulty for his view of the language of set
theory that it presupposes a concept of number, or of 'finite repetition', while
the concept of number itself is supposedly first explained in set theory.)103
The connection between operation with the symbol system and some sort
of arithmetic (made precise by the analysis of the structure of the logical
calculus given by Godel and Turing) was clear to Hilbert from the beginning. In his 1900b, Hilbert refers to a 'rapid, unconscious not definitive but
certain' combinatoral feel for signs. 104 Note also the claim in the passage
from Hilbert *1910 quoted above that operation with signs is 'analogous to'
the operations of arithmetic.
But what exactly is 'the concept of number' presupposed by the operation with a logical calculus? A concept (such as that of natural number or
real number), where quantification over a domain (or domains) is used as
part of its specification is taken by Hilbert to be something 'defined' by an
axiom system, and this is what would be involved if indeed Poincare were
172
MICHAEL HALLETT
right. What Hilbert wants to show, however, is that what is really involved
is only a much more restricted 'concept of number', a primitive notion which
does not depend on quantification or an axiom system and which mirrors the
constructive development of any of the formal languages, in short a concept
of number which is presented by a similar collection of recursive rules. The
corresponding notion of derivation for these numbers would then be determined by the recursive structure of the number-objects, but would not involve
the principle of induction.
Correspondingly, what Hilbert develops is a theory of numbers as numbersigns (which makes the analogy with the formulas even tighter), a development
that is begun in the lectures *1917-18 and continued in various works in the
1920s. (For a partial description of this, see section 4 of Hallett 1994.) As is
stressed in Hilbert 1922 (p. 164, also p. 164 of Hilbert 1935), there are no
axioms, by which Hilbert means that there is no quantification and (as he
stresses in 1926) the sign-arithmetic is not developed in a logical language
(as full arithmetic must be). There are just recursive rules governing the formation of the signs and the elementary operations (such as 'addition').
Consequently, as with the formulas, the appropriate notion of demonstration
is not that of formal proof within a logical calculus, but rather proof based
on recursion following the various clauses of the definition. What this means
is that the signs are actually treated themselves as objects just as the formulas
of a language are, which would not be the case with a full, formal axiom system
for numbers. The correct way to think of it is that the number-signs might
be a way of giving an interpretation of the formal language for arithmetic,
but this language, by itself, is strictly speaking meaningless. lo5
The point, presumably, is that the analogy between this arithmetic of signs
and the system of formulas in a logical language is now meant to be transparent. Hilbert states it in his 1922-23 lectures, Wissen und mathematisches
Denken:
The invocation of mathematical methods in investigating the logical language is not artificial,
but entirely appropriate and even inevitable. For the role of the language in the expression of
the logical connections between thoughts corresponds to the sign language in calculation. In
following a logical passage of thought with the help of this logical language, we carry out at
the same time a calculation, in which manifold logically elementary processes are put together
according to practised rules. It is even self-evident that, when we exclude the accidental features
[zufiilligen MomenteJ in the derivation of words, then a form of mathematical sign language
arises. 106
And then, as Hilbert says, the restriction 'to the essential' will make it possible
... to frame the rules of the grammar in such a surveyable way that logical inference can be
carried through automatically by calculation according to simple, determined rules. 101
But the crucial thing about Hilbert's intention is summed up in this passage
from 1922:
When we develop number-theory in this way, there are no axioms and no contradictions of
any sort are possible. We simply have concrete signs as objects; we operate with them, and
173
we make contentual statements about them. And in regard in particular to the proof just given
that a + b = b + a, I would like to stress that this proof is merely a procedure that rests solely
on the construction and decomposition of number-signs and that it is essentially different from
the principle that plays such a prominent role in higher arithmetic, namely, the principle of
complete induction or of inference form n to n + 1. This principle is rather, as we shall see, a
formal principle which goes well beyond this and which belongs to a higher level; a principle
which needs proof and which is capable of proof. 108
(This passage immediately follows the other passage from *1905h quoted at
the beginning of this section.) What is interesting about the statement is
the remark that the judgement involving universal quantification and the particular numerical judgement are of quite different kinds, not necessarily
connected, since these refer to 'different axiom systems'. This contains more
than an echo of Hilbert's statement to Frege that 'point' will mean different
things in different axiom systems. But more interesting is the fact that the
difference which is pointed out here corresponds in the later view to the
differences between the elementary sign-arithmetic and full arithmetic. The
most significant change is that, as we have seen, Hilbert adopts the view (in
his *1920a) that if a domain of objects is constructively presented through a
few recursive rules, there is no need for an explicit axiom system, and hence
no need for a formal system. Hence the axiomatic system of arithmetic
presented in a full logical calculus is not a simple extension of the system
of numeral-objects given, at least, it is not without further interpretation. 110
174
MICHAEL HALLETT
An early version of this paper was read at a session of the Boston Colloquium for the
Philosophy of Science in November 1993. I wish to thank the organisers of the Colloquium,
particularly Jaakko Hintikka and Fred Tauber, for their invitation. I also wish to thank George
Boolos, Emily Carson, William Demopoulos, William Ewald, Richard Heck, Mosbe Machover,
Mihaly Makkai, Ulrich Majer, John Mayberry, Stephen Menn and Wilfried Sieg for useful discussions on this and related work, and Mathieu Marion and Robert Cohen for their patience.
The Niedersachsische Staats- und Universitatsbibliothek and the Mathematisches Institut of the
Georg-August Universitat, Gottingen kindly granted permission to quote from various unpublished lecture notes and manuscripts of Hilbert. The support of the Alexander von Humboldt
Stiftung, the Deutsche Forschungsgemeinschaft, the Social Sciences and Humanities Research
Canada of Canada and the FCAR of Quebec is gratefully acknowledged.
Unless otherwise stated, the translations below are my own, although I have tried to give additional references to published translations wherever possible.
I
See van Heijenoort 1979.
2
We should not, however, make the mistake of thinking either that Hilbert's attitude to the
antinomies is simple, or that he underestimated their importance, as is clear from the following
passage from a report of his 1905 lectures:
The paradoxes which we got to know in the above [insertion in Hilbert's own hand: and which
are just a precise mathematical version of the Kantian antinomies] show only too well that
an examination of and a new approach [Neuauffiihrung] to the foundations of mathematics
and logic are absolutely necessary. (Hilbert *1905b, p. 215.)
3
A fuller discussion can be found in Hallett 1994.
4
Hilbert * J905b, pp. 11-12. Helmholtz uses the word 'Tatsache' in the title of his 1868, and
in his 1890 Klein refers to the geometrical facts to be captured by an axiom system. Hilbert refers
to the basis of Tatsachen repeatedly leading up to 1899 and after. I suspect that Hilbert's use
of the term is somewhat different, in that he means that, whatever the status of the propositions examined, and whatever reasons one might have for examining them, they are for the
time what one starts from.
175
5
6
176
MICHAEL HALLETT
108 All geometrical reasoning is, in the last resort, circular: if we start by assuming points,
they can only be defined by the lines or planes which relate them; and if we start by
assuming lines and planes, they can only be defined by the points through which they pass.
This is an inevitable circle, ... It is, therefore, somewhat arbitrary to start either with points
or with lines, as the eminently projective principle of duality mathematically illustrates;
... (Russell 1897, p. 120.)
This implies (as is explicit in Hilbert's statements of the axiomatic method) that one can only
hope to know exactly what the nature of points or lines or planes is after the axioms have
been given and the system developed. However, the last remark suggests that Russell might
here be stating a view peculiar to projective geometry, and that in the realm of metrical geometry,
intuition will be sufficiently refined to isolate the correct starting point. Whatever Russell's
attitude in 1897, there is certainly a passage in his 1919 (p. 59) where Russell states explicitly, and quite generally, much the same attitude as Hilbert, thus stressing the logical
interconnections and playing down the importance of primitives.
21
Frege 1879, Vorwort, p. IX, English translation, p. 6.
22
Compare this to what Russell says on the need for a special philosophical language in his
reply to Strawson, Russell 1957, pp. 123-4.
23
This, after all, is part of the point of Hilbert's idea of Tieferlegung der Fundamente ('driving
the foundations deeper'). See Hilbert 1918.
24
See, above all, Dedekind 1854. Dedekind also discusses here cases other than those of the
extension of the number systems.
25
Dedekind, letter to Weber, 24.i.l888, in Dedekind 1932, item LXVI, pp. 489-90.
26
Dedekind 1888, 73.
27
Weyl, in effect, makes just this same point about how the axiomatic method generalises
Dedekind's procedure. He writes:
The method of implicit definition consists, not in explaining the sense [Sinn] of each individual concept on the basis of others taken to be known. Rather it consists only of setting
up a system of propositions or axioms in which these concepts are involved. This method
of implicit definitions has very often been employed in mathematics. It has the advantage
that the most important properties of the concepts can be set out immediately at the beginning, whereas by setting down a proper [i.e., explicit] definition, these properties might appear
only as very distant consequences of the definitions. (Weyl 1910, p. 301.)
28
Hilbert 1900a, p. 181.
29
The full passage is quoted above, at the beginning of the section.
30
Hilbert to Frege, 29.xiL1899, in Frege 1976 or Frege 1980. This idea, too, goes back to Hilbert
*1894.
31
See the letter mentioned in the previous note; the English translation omits the relevant
remark.
32
See Dedekind 1872, IV, p. 13, p. 15 of the English translation. See also Dummett's highly
illuminating discussions of Dedekind in his 1991, especially Ch. 5 and pp. 247-51.
33
See the letter of Hilbert to Frege of 29.xii.1899 in Frege 1976 (or Frege 1980). We should
note in passing that GOdel and others deny this with respect to the addition of axioms to
standard set theory.
34
See Hilbert's letter to Frege of 29.xii.l899, in Frege 1976 or Frege 1980. The remark continues the quote referred to in n. 30. Hilbert made a similar remark in 1891, according to
Blumenthal 1935 (pp. 402-3), although this time about 'table', 'chair' and 'beer mug'.
35
We should note that extra theories might be needed for a precise formulation of certain of
the axioms. This was surely the case with Hilbert's own Completeness Axiom for the real numbers
or for Euclidean geometry, although we cannot go into this here.
36
The completeness criterion seems, on the surface, somewhat weaker than completeness as
it is now understood. Hilbert sometimes states it thus: The system is complete when it has 'all
the facts presented to us as logical consequences' (* 1905b, p. 12). His later formulations of
completeness are a good deal more precise, and apparently much stronger. For example, in Hilbert
1929 and 1930 two formulations of the completeness of I: are its Post-completeness (I: is Post-
177
complete if whenever cr is unprovable from E, then E U {cr} inconsistent: see Post 1921), and
that if cr is consistent with E, then it is actually provable in E. (Hilbert's statement of Postcompleteness pre-dates Post; it is used in the formulation of the property of completeness of
the propositional calculus in his 1917-18 lectures, which Hilbert then proves. See *1917-18,
pp. 152-3.)
Hilbert also sometimes lays weight on the categorical nature of the axioms, pointing out
that his own axiom system for the real numbers is categorical. See, for instance, Hilbert *1905b,
p. 21. In one of his notebooks, Hilbert even says that categoricity of an axiom system shows
that the concept 'defined' by the system is uniquely defined: see Hilbert, Notebook 3, Cod.
Ms. 600, III, p. 131. The passage is undated. Hilbert regarded categoricity as a version of completeness; see his 1929 (p. 139) and 1930 (p. 6). See also Wey11944, pp. 155-6. But it is important
to recognise why Hilbert prefers (syntactic) completeness to categoricity. See above, p. 152.
37
Hilbert often points out this consequence of inconsistency; see, for instance, *1905b,
p. 217, or *1917, p. 138, or *1917-18, p. 218.
38
Hilbert to Frege, 29.xii.1899, in Frege 1976, English translation in 1980.
39
See the letter to Keferstein, 27.ii.1890, point (7), p. 275 of Dedekind 1890, p. 101 of the
English translation. See also the letter to Lipschitz of 27.vi.l876, in Dedekind 1932, item LXV,
p. 477, or Scharlau (ed.) 1986, pp. 77-8.
40
See Frege 1903, 143, English translation in Geach and Black (eds.) 1966, p. 178. Even Weyl,
as late as his 1910, is of the opinion that modelling is the only way of showing consistency.
For instance, after discussing the advantages of the axiomatic method, he says this:
But the implicit definition through axioms is nevertheless only something provisional, since
one can only depend on it in case the axioms are free from contradiction, in other words,
in case a system of explicitly defined concepts can be set up which satisfies them. A good
example of what has been said here is offered by the treatment which Lebesgue has given
of the concept of integral in Ch. IV of his "Le~ons sur I'integration" (Paris 1904). Lebesgue
makes precisely the same distinction between explicit and implicit definitions, which he
distinguishes as "constructive" and "descriptive" respectively. (Weyl 1910, pp. 301-02 of
the reprint.)
It is worth noting that the condition of categoricity on axiom systems is an attempt to go as
far as possible to satisfy the stronger demands of the Frege approach while remaining within
the Hilbert framework, i.e., while still allowing that there need not be unique referents for the
primitive terms. Dedekind certainly wanted to prove categoricity for his systems. His letter to
Keferstein of 27.ii.189O states this concern explicitly for the natural number system. It is somewhat
more involved in the case of the real number system. Part of the question is settled by Dedekind's
Proposition IV from his 1872, V, which shows that the system of cuts in the rationals is complete
in the sense that one gets no new numbers by considering cuts in the ordered system of cuts in
the rationals. (This is mirrored by Cantor in 1 of his 1872, where he argues that for any
Cauchy sequence of reals there must be an equi-convergent sequence of rationals.) What categoricity then depends on is the isomorphism of any two rational fields.
41
The use of what would now be called non-constructive existence proofs was a hallmark of
Hilbert's work from very early on, the most famous example being the proof of the Hilbert
Basis Theorem which goes back to 1890.
42
Hilbert to Frege, 29.xii.l899, in Frege 1976; English translation in Frege 1980. Hilbert's view
is also clear in his 1900a, p. 184, and 1900b, pp. 265-6 (1935, pp. 300-1).
43
Hilbert 1900b, p. 266 (1935, p. 301).
44
Bernays (1935) calls the position based on such external existence assumptions a 'restricted
platonism', and somewhat the same position is put forward by Bernays in 1922a. It is also clearly
set out in some undated notes in Bernays's hand in Hilbert's NachlaB, Cod. Ms. 685, 3, Blatter
13-20. The main difference in Bernay's later statement is that Bernays at this point recognises
quite clearly that such assumptions can be weaker or stronger, and that they have what we
now call a 'consistency strength', something which was quite unclear before GOdel's and
Gentzen's work. Hilbert's work is also intimately tied to his views on quantification: see below,
pp. 154-8. Note also the various connections to Cantor.
178
MICHAEL HALLETT
46
179
existence of the domain of quantification and consistency proofs. The adjective 'fertig' is
clearly borrowed from Cantor; see p. 155.
67
Hilbert 1900a, p. lSI.
68
Hilbert 1900b, p. 266, p. 301 of Hilbert 1935. The whole passage is cited below.
69
See n. 44.
70
Hilbert 1900a, p. lS4.
71
See the letters of 26.ix.lS97 and 2.x.IS97, reprinted in Purkert and Ilgauds 1987, pp. 224-7.
See also Cantor's correspondence with Dedekind from IS99, excerpted in Cantor 1932 (English
translation in van Heijenoort (ed.) 1967).
72
Cantor asserts in the second letter to Hilbert mentioned in the previous note that this distinction is what lies behind his definition of set given at the beginning of his 1895.
73
The proof appears to rely on a step-by-step choice argument, not referred to in these letters,
but hinted at in the letters to Dedekind. See Hallett 1984, pp. 169-70.
74
Hilbert 1900b, p. 266, p. 301 of Hilbert 1935. The passage comes in a discussion of Hilbert's
Problem 2: 'The consistency of the arithmetical axioms'.
75
Zermelo 1908, p. 262, p. 201 of the English translation.
76
Zermelo 1908, p. 265, English translation, p. 203. Thus, he was in effect making use of
the inconsistencies in much the same way that Cantor had.
77
Zermelo himself states that the Aussonderungsaxiom 'forms a certain replacement for the
general and untenable definition of set given in the Introduction', a definition, in effect, incorporating the Comprehension Principle. See Zermelo 1908, pp. 263-4, English translation,
p.202.
78
Zermelo himself does not go this far; all he says is that he does not elucidate in his paper
the further questions as to 'the origin and domain of validity' of his principles, and that:
I have not even been able to show the 'consistency" of my axioms, something which it is
quite essential to do; ... (Zermelo 1908, p. 262, English translation, pp. 200--01.)
79
Hilbert's two clearest statements of this criticism are in his 1905a, p. 175 (p. 130 of the
English translation), and 1922, p. 162 (p. 162 of the reprint).
80
In *1905b (p. 191), Hilbert describes the main problem of the foundations of mathematics
as being:
... to recognise and describe completely the driving force [Getriebe] of correct mathematical thought, and thereby of logical thought generally.
Note the mechanical analogy in this. Hilbert also says that in carrying out the 'clear and exact
construction of the logical foundation' for mathematics,
... one will and must obtain a complete description of mathematical thought, and thereby
theoretical thought. (Hilbert * J905b, p. 250.)
81
See Hilbert and Bernays 1934, p. 15 and then p. 39. The drive to ever more inclusive
totalities makes clearer one of the reasons why an axiomatisation of set theory adequate for
the development of the whole of basic mathematics would have been thought desirable.
82
See, in particular, the first paragraph of Zermelo 1908, p. 261, p. 200 of the English translation. It should be noted that Zermelo's system still falls short of showing what Hilbert claims
in 1900, for in Zermelo's treatment, among other shortcomings, there is no way of reproducing
the full Cantorian theory of ordinals and alephs. This had to await von Neumann's work. See
Hallett 1984, chs. 7 and S, and also n. 100, below.
83
Hilbert *1905b, p. 215. The mention of 'Mascheroni's constant' 2{2 is important. Hilbert's
Seventh Problem in 1900b is: To show, for an algebraic number a and an algebraic irrational
exponent 13, that a~ is always transcendental or at least irrational. He gives the above number,
and e" (in the form t 2i ), as specific examples of the problem. (See Hilbert 1900b,
p. 274, Hilbert 1935, p. 308.) The fact that such problems as this, or that of whether there are
infinitely many twin primes, were unsolved at the time is used by Hilbert in *1905b to give
examples of 'non-algorithmic' binary expansions in which the presence of a 1 or a 0 depends
on the outcome of their solution. (See *1905b, p. 199, *1905b, or pp. 127-S.) Hilbert regards
these as perfectly good specifications of expansions, even though we cannot calculate the numbers
involved, and this is (presumably) based on his faith in the solubility of every well-posed
180
MICHAEL HALLETT
mathematical question. He gives a strong statement of this conviction in 1900b, pp. 261-2 (Hilbert
1935, p. 298), and this is repeated on pp. 193 and 249 of * 1905b. (This is another interesting
example of admitting metamathematical formulations directly into directly into ordinary
mathematics, Hilbert's Completeness Axiom being the first, a practice which has become
extremely important, particularly in set theory, since GOde1's Incompleteness Theorem.) On
the other hand, Hilbert does not regard specifications of binary expansions given by throws of
a die as proper specifications (see *1905b, pp. 199-200, *1905c, p. 128).
Hilbert's Seventh Problem was solved by a result proved by Gel'fond and Schneider independently in 1934, one of whose forms states: if u, ~ are algebraic, with u ~ 0, 1 and ~
irrational, then uP is transcendental. (See Tijdeman 1976, pp. 242-43.) In 1929, Gel'fond proved
a special case of the result which shows that 2r::2 must be transcendental, and in 1930 Siegel
extended the proof to cover 2n. Siegel himself recalls having heard Hilbert say in a lecture in
1920 that no one in the room would live to see this problem solved! See Reid 1970, p. 164.
Hilbert's use of the term 'Mascheroni's constant' is somewhat mysterious. As far as I can
tell, 'Mascheroni's constant' is another term for the very different Euler constant. Indeed,
Hilbert himself refers to the 'Euler-Mascheroni constant' when speaking of Euler's constant in
his 1900b (p. 261, p. 297 of Hilbert 1935).
S4
Since Hilbert makes no reference here to Richard, it is hard to say whether he knew of the
Richard paradox through Richard 1905, or an account of it, or whether he had made this discovery separately. (There is no mention of the paradox in Hilbert 1905a.) After 1905, Hilbert
always refers to this antinomy as Richard's without saying that he had discovered it independently, and even the talk in 1905 of 'laws' which define real numbers, and the assertion that a
reasonable condition on laws is that they be specifiable in a finite number of words (see * 1905b,
p. 200, *1905c, p. 128) is very reminiscent of Poincare's position in 1905 and 1906. The most
important thing to note, however, is that Hilbert seems to have taken the Richard antinomy
very seriously, at least up until about 1917. There are important discussions of it, not just in
Hilbert *1905b, but also in *1917 (pp. 129-32), and *1920a, pp. 2-5, not to mention Weyl
1910, 1917 and 1919.
S5
Zermelo 1908, p. 263, p. 201 of the English translation.
S6
See Zermelo 1908, p. 264, p. 202 of the English translation.
S7
See Weyl 1910, and also Weyl 1917. See also Skolem 1923.
ss Weyl 1910, p. 302 of the reprint.
S9
Frege 1882a, pp. 48-50. The need to investigate the 'associated ideas' is later played down,
especially in the Grundlagen (1884 J. In Frege 1880-81, there is another remarkable similarity
with Hilbert's later position:
... this [artificial language] differs from ordinary language [Wortsprache] in yet a further
way, in that it is designed for the eye and not the ear. Of course, ordinary writing is so designed
also, but this is simply an image [Abbi/dung] of ordinary, spoken language [Wortsprache],
and in this sense it is does not come any closer to the Begriffsschrift than this latter does.
Indeed, it is even further removed, since it is composed of signs that derive from signs and
not from content [Sachen]. (Frege 1880-81, pp. 13-14.)
90
Hilbert *1905b, p. 219. Virtually the same passage is to be found in the other Ausarbeitung
of these lectures stemming from Max Born (*1905c, p. 143). In his 1905a (p. 176; p. 131 of
the English translation), Hilbert hints at much the same thing, for there he begins his presentation of his logical theory:
Let an object of our thought be called a thought-object [Gedankending] or, briefly, a thing
[Ding] and let it be denoted by a sign.
91
Hilbert *1910, p. 159. In the last section, we will return briefly to the remark 'analogous
to arithmetic'.
92
Hilbert *1920b, pp. 46-7.
93
See *1920a, p. 11.
94
Hilbert *1920a, p. 31.
95
Hilbert *1917-18, pp. 218-20.
96
See Hilbert *1920b, pp. 10--12, or *1924-25, pp. 108. What Hilbert, in effect, does in forming
181
these paradoxes is to invoke the danger of what have been called 'indefinitely extensible' collections, or what Russell in 1906 called 'self-reproductive classes'.
97
More correctly, we would have to assume that the formulas denote propositions or propositional functions or even sets, but with the assumption that these objects exhibit the same
complexity as their defining formulas.
98
Hilbert *1920a, p. 32.
99
Hilbert *1905b, pp. 274-5.
100
Of course, we now take it for granted that, with a system like Zermelo's set theory (or
ZF), the constructive approach to the underlying language is not too restrictive, in other words,
that the constructive attitude to formulas is not an undue restriction on the Separation or
Replacement Axioms in particular, and that we can prove the existence of all the sets we want
to show exist. This is by no means obvious without demonstration, and showing this was one
of the prime concerns of early set theory. Much of the development of set theory in the 1920s,
though, pursues the 'axiomatic' approach to the higher-order notions that Hilbert suggests is
necessary once quantification over them is applied, thus with various attempts to axiomatise
Zermelo's notion of 'definite property', and then to show that this does not weaken the theory,
Le., that all the essential results still follow, and that it is possible to develop Cantor's theory
of transfinite numbers within such frameworks. This is true of all of Fraenkel, von Neumann
and Zermelo himself. Skolem seems to have been the major exception in the early stages. See
Skolem 1930, and Hallett 1984, 7.4, 8.2. We should note that even Bernays's axiomatisation of set theory in the late 1930s and 1940s attempts to show, in effect, how first -order predicates
can be axiomatised (as classes) alongside sets, the key assumption being that any predicate which
does not involve quantification over classes (predicates!) determines a class. The connection with
the attitude of Hilbert to predicates cannot be accidental.
101
See also Weyl 1910, p. 304. One might consider these statements of Hilbert and Weyl to
be premonitions of the Skolem paradox. Something Dedekind says might also be seen as an
early recognition of this:
Whether the symbolic language [Zeichensprache] will suffice in order to denote all the new
objects that are to be created individually is really of no importance. It is always sufficient
to signify all the individuals needed for a (limited) investigation. (Letter to Weber, 24.i.1888,
in Dedekind 1932, item LXVI, p. 490.)
102
Hilbert 1926, pp. 170-1; p. 376 of the English translation. See also the passages cited on
p. 187 of Hallett 1994. The passage quoted here is expanded from Hilbert 1922, p. 163
(p. 163 of the reprint), and is then repeated in somewhat abbreviated form in Hilbert 1928, pp.
65-6 (pp. 464-5 of the English translation), and very abbreviated form in Hilbert 1929, p. 140.
Neither the 1922 passage, nor that from 1928 nor 1929, contain the express reference to Kant.
103
See Weyl 1910, p. 304.
104
Hilbert 1900b, p. 260, p. 296 of Hilbert 1935.
105
See Hallett 1994, pp. 184-5.
106
Hilbert *1922-23b, p. 130 (p. 93 of the new typescript). See also pp. 128-30 (resp.
pp.92-94).
101
Hilbert *1921-22, p. 76.
108
Hilbert 1922, p. 164, p. 164 of Hilbert 1935.
109
Hilbert *1905b, pp. 275....fJ.
110
Hilbert asserts this explicitly about the sign-arithmetic in his 1922, p. 164 (also p. 164 of
Hilbert 1935).
REFERENCES
Benacerraf, Paul and Putnam, Hilary (eds.) 1964: Philosophy of mathematics: selected readings.
First edition. Oxford: Basil Blackwell.
Benacerraf, Paul and Putnam, Hilary (eds.) 1983: Philosophy of mathematics: selected readings.
Second edition. Cambridge: Cambridge University Press.
182
MICHAEL HALLETT
Bernays, Paul 1922a: 'Dber Hilberts Gedanken zur Grundlegung der Arithmetik', Jahresbericht
der deutschen Mathematiker-Vereinigung, 31, 10-19.
Bernays, Paul 1922b: 'Die Bedeutung Hilberts fiir die Philosophie der Mathematik', Die
Naturwissenschaften, 10, 93-9.
Bernays, Paul 1935: 'Sur Ie platonisme dans les mathematiques', L' enseignement mathematique, 34, 52-69. English translation by Charles Parsons in Benacerraf and Putnam (eds.)
1964,274-86, and 1983, 258-71. German translation by Peter Bernath in Bernays 1976,
62-78.
Bernays, Paul 1950: 'Mathematische Existenz und Widerspruchsfreiheit' in Etudes de Philosophie
des Sciences, Neuchatel: Editions de Griffon, 1950, 11-25. Republished in Bernays 1976,
92-106.
Bernays, Paul *1956: 'Are pure mathematics between logic and geometry?', lecture delivered
at Columbia on May 10th, 1956, unpublished, Bernays Nachlass (Hs 973: 18), Wissenschaftshistorische Sammlung, ETH-Bibliothek, Ziirich.
Bernays, Paul 1976: Abhandlungen zur Philosophie der Mathematik. Darmstadt: Wissenschaftliche Buchgesellschaft.
Blumenthal, Otto 1935: 'Lebensgeschichte', in Hilbert 1935, 388-429.
Browder, Felix (ed.) 1976: Mathematical developments arising from Hilbert problems.
Proceedings of symposia in pure mathematics, volume 28, parts 1 and 2. Providence, Rhode
Island: American Mathematical Society.
Cantor, Georg 1872: 'Dber die Ausdehnung eines Satzes aus der Theorie der trigonometrischen
Reihen', Mathematische Annalen, S, 123-32. Reprinted in Cantor 1932, 92-102.
Cantor, Georg 1883: 'Dber unendliche lineare Punctmannigfaltigkeiten, 5', Mathematische
Annalen, 21, 545-91. Reprinted in Cantor 1932, 165-209; English translation in Ewald
(ed.) 1996.
Cantor, Georg 1895: 'Beitriige zur Begriindung der transfiniten Mengenlehre, 1', Mathematische
Annalen, 46, 481-512. Reprinted in Cantor 1932, 282-311. English translation in Georg
Cantor: Contributions to the founding of the theory of transfinite numbers, with an
historical introduction and notes by P. E. B. Jourdain, La Salle, I11inois: Open Court
Publishing Company, 1915, republished by Dover Publications, New York, 1955.
Cantor, Georg 1897: 'Beitriige zur Begriindung der transfiniten Mengenlehre, II', Mathematische
Annalen, 49, 207-246. Reprinted in Cantor 1932,312-56. English translation in Georg Cantor:
Contributions to the founding of the theory of transfinite numbers, with an historical introduction and notes by P. E. B. Jourdain, La Salle, Illinois: Open Court Publishing Company,
1915, republished by Dover Publications, New York, 1955.
Cantor, Georg 1932: Gesammelte Abhandlungen mathematischen und philosophischen Inhalts.
Edited by Ernst Zermelo. Berlin: Julius Springer Verlag.
Church, Alonzo 1956: Introduction to mathematical logic. Volume 1. Princeton: Princeton
University Press.
Cohen, Robert and Elkana, Yehuda (eds.) 1977: Hermann von Helmholtz: Epistemological
Writings. Dordrecht, Holland: D. Reidel Publishing Co. [Boston Studies in the Philosophy
of Science, vol. 37] Translation of Helmholtz 1921.
Dedekind, Richard 1854: 'Dber die Einfiihrung neuer Funktionen in der Mathematik:
Habilitationsvortrag, gehalten im Hause des Prof. Hoeck, in Gegenwart von Hoeck, Gauss,
Weber, Waitz, 30 Juni 1854' in Dedekind 1932, item LX, 428-38. English translation in
Ewald (ed.) 1996.
Dedekind, Richard 1872: Stetigkeit und irrationale Zahlen. Braunschweig: Vieweg und Sohn.
Latest reprint, 1965. Also reprinted in Dedekind 1932, 315-332. English translation in
Dedekind 1901, with a revision of this translation in Ewald (ed.) 1996.
Dedekind, Richard 1888: Was sind undwas sol/en die Zahlen? Braunschweig: Vieweg und Sohn.
Latest reprint, 1969. Also reprinted in Dedekind 1932, 335-91. English translation in Dedekind
1901, with a revision of this translation in Ewald (ed.) 1996.
Dedekind, Richard 1890: 'Brief an Keferstein, vom 27ten Februar, 1890', in Sinaceur
1974 (together with a French translation), 271-8. English translation by Hao Wang and Stefan
183
184
MICHAEL HALLETT
Heijenoort, Jean van 1979: 'Absolutism and relativism in logic' in van Heijenoort 1985,75-83.
Previously unpublished.
Heijenoort, Jean van 1985: Selected essays. Naples: Bibliopolis.
Heijenoort, Jean van (ed.) 1967: From Frege to Godel: a source book in mathematical logic.
Cambridge, Mass.: Harvard University Press.
Helmholtz, Hermann von 1868: 'Dber die Tatsachen, die der Geometrie zum Grunde liegen',
Nachrichten von der koniglichen Gesellschaft der Wissenschaften zu Gottingen, 1868,
193-221. Reprinted in Helmholtz 1921.
Helmholtz, Hermann von 1921: Schriften zur Erkenntnistheorie, edited by Paul Hertz
and Moritz Schlick. Berlin: Julius Springer. English translation, Cohen and Elkana (eds.)
1977.
Hilbert, David *1894: Die Grundlagen der Geometrie. Sommersemester 1894: Hilberts eigene
Manuskript. Niedersiichsische Staats- und Universitiitsbibliothek, Gottingen. Cod. Ms. 541.
Hilbert, David 1899: 'Grundlagen der Geometrie', in Festschrift zur Feier der Enthiillung des
Gauss-Weber-Denkmals in Gottingen, 1899. Leipzig: B. G. Teubner.
Hilbert, David 1900a: 'Dber den Zahlbegriff', lahresbericht der deutschen MathematikerVereinigung, 8, 180-4. English translation in Ewald (ed.) 1996.
Hilbert, David 1900b: 'Mathematische Probleme', Nachrichten von der kOniglichen Gesellschaft
der Wissenschaften zu Gottingen, mathematisch-physikalische Klasse, 1900,253-96. English
translation in Ewald (ed.) 1996.
Hilbert, David 1905a: 'Dber die Grundlagen der Logik und Arithmetik' in Krazer (ed.) 1905,
174-85. English translation in van Heijenoort (ed.) 1967, 129-38.
Hilbert, David *1905b: Logische Principien des mathematischen Denkens. Sommersemester 1905:
Ausgearbeitet von Ernst Hellinger. Mathematisches Institut, Georg-August Universitiit,
Gottingen. 277 pages, handwritten.
Hilbert, David *1905c: Logische Principien des mathematischen Denkens. Sommersemester 1905:
Ausgearbeitet von Max Born. Niedersiichsiche Staats- und Universitiitsbibliothek, Gottingen,
Gottingen: Cod. Ms. Hilbert, 558a. 188 pages, handwritten.
Hilbert, David *1908: Prinzipien der Mathematik. Sommersemester 1908: Ausarbeitung
unbekannt. Mathematisches Institut, Georg-August Universitiit, Gottingen. 206 pages, handwritten.
Hilbert, David *1910: Elemente und PrinzipienJragen der Mathematik. Sommersemester 1910:
Ausgearbeitet von Richard Courant. Mathematisches Institut, Georg-August Universitiit,
Gottingen. 163 pages, handwritten.
Hilbert, David *1917: Mengenlehre. Sommersemester 1917: Ausgearbeitet von Margarethe Loeb.
Mathematisches Institut, Georg-August Universitiit, Gottingen. 166 + IV pages, typewritten.
Hilbert, David *1917-18: Prinzipien der Mathematik. Wintersemester 1917-18: Ausgearbeitet
von Paul Bernays. Mathematisches Institut, Georg-August Universitiit, Gottingen. vii + 246
pages, typewritten.
Hilbert, David 1918: 'Axiomatisches Denken', Mathematische Annalen, 78, 405-15. Reprinted
in Hilbert 1935, pp. 146-56. English translation in Ewald (ed.) 1995, volume 2.
Hilbert, David *1919: Natur und mathematisches Erkennen. Herbstsemester 1919: Ausgearbeitet
von Paul Bernays. Mathematisches Institut, Georg-August Universitiit, Gottingen. Vi + 165
pages, typewritten. Retyped and published in a limited edition by the Mathematisches Institut,
Gottingen, 1989, 117 pages.
Hilbert, David *1920a: Probleme der mathematischen Logik. Sommersemester 1920:
Ausgearbeitet von N. [sic] Schonfinkel and Paul Bernays. Mathematisches Institut, GeorgAugust Unviersitiit, Gottingen. i + 46 pages, typewritten.
Hilbert, David *1920b: Logik-Kalkiil. Wintersemester 1920-21: Ausgearbeitet von Paul Bernays.
Mathematisches Institut, Georg-August Universitiit, Gottingen. 62 pages, typewritten.
Hilbert, David *1921-22: Die Grundlagen der Mathematik. Wintersemester 1920-21:
Ausgearbeitet von Paul Bernays. Mathematisches Institut, Georg-August Universitiit,
Gottingen. 100 + 9 + 38 pages, typewritten.
Hilbert, David 1922: 'Neubegriindung der Mathematik. Erste Mitteilung', Abhandlungen aus dem
185
186
MICHAEL HALLETT
Mendelson, Elliot 1964: Introduction to mathematical logic. New York: D. Van Nostrand
Company. (Third edition, Monterey, California: Wadsworth and Brooks, 1987.)
Patzig, Giinther (ed.) 1986: Gottlob Frege: Funktion, BegrifJ, Bedeutung. Sechste Auflage.
Gottingen: Vandenhoeck und Ruprecht.
Poincare, Henri 1905: 'Les matMmatiques et la logique', Revue de metaphysique et de morale,
13, 815-35.
Poincare, Henri 1906: 'Les matMmatiques et la logique', Revue de metaphysique et de morale,
14,294-317. Reprinted with some changes in Poincare 1908, 192-214.
Poincare, Henri 1908: Science et methode. Paris: Ernest Flammarion.
Post, Emil 1921: 'Introduction to a general theory of elementary propositions', American journal
of mathematics, 43, 163-85. Reprinted in van Heijenoort (ed.) 1967, 264-83.
Purkert, Walter and I\gauds, Hans Joachim 1987: Georg Cantor 1845-1918. Basel, Boston and
Stuttgart: Birkhauser.
Rang, Bernhard (ed.) 1979: Edmund Husserl: Aufsiitze und Rezensionen (1890-1910).
Husserliana: Edmund Husserls Gesammelte Werke, Band XXIl. Den Haag: Martinus Nijhoff.
Reid, Constance 1970: Hilbert. New York, Heidelberg, Berlin: Springer-Verlag.
Richard, Jules 1905: 'Les principes de mathematiques et Ie probleme des ensembles', Revue
generale des sciences pures et appliquees, 16, 541. English translation by Jean van Heijenoort
as 'The principles of mathematics and the problem of sets' in van Heijenoort (ed.) 1967,
142-9.
Russell, B. A. W. 1897: An essay on the foundations of geometry. Cambridge: Cambridge
University Press. Republished by Dover Publications, Inc. in 1937, with a new introduction
by Morris Kline.
Russell, B. A. W. 1899: 'Sur les axiomes de la geometrie', Revue de metaphysique et de morale,
7,685-707.
Russell, B. A. W. 1900: A critical exposition of the philosophy of Leibniz. London: George
Allen and Unwin. Second edition, 1937.
Russell, B. A. W. 1901: 'Recent work in the philosophy of mathematics', The international
monthly, 1901, reprinted as 'Mathematics and the metaphysicians' in Russell 1917.
Russell, B. A. W. 1903: The principles of mathematics. Volume 1. Cambridge: Cambridge
University Press. Second edition published as The principles of mathematics, with a new introduction, by George Allen and Unwin, London, 1937.
Russell, B. A. W. 1906: 'On some difficulties in the theory of transfinite numbers and order
types', Proceedings of the London mathematical society, 4 (second series), 29-53. Reprinted
in Russell 1973, 135-64.
Russell, B. A. W. 1912: The problems of philosophy. The Home University Library of Modem
Knowledge. Oxford: Oxford University Press.
Russell, B. A. W. 1917: Mysticism and Logic. London: George Allen and Unwin.
Russell, B. A. W. 1918-19: 'Lectures on logical atomism', The Monist, 28 (1918), 495-527,
29 (1919), 33-63, 190-222,345-80. Reprinted in Russell 1956, 177-281. Also reissued as
The philosophy of logical atomism, edited and introduced by D. F. Pears, Open Court
Publishing Co., LaSalle, Illinois, 1985.
Russell, B. A. W. 1919: Introduction to mathematical philosophy. London: George Allen and
Unwin.
Russell, B. A. W. 1956: Logic and Knowledge. Edited by R. C. Marsh. London: George Allen
and Unwin.
Russell, B. A. W. 1957: 'Mr. Strawson on referring', Mind, 66, 385-9. Reprinted in Russell 1973 ,
120-6. Page numbers in the text refer to this reprinting.
Russell, B. A. W. 1973: Essays in analysis. Edited by Douglas Lackey. London: George Allen
and Unwin.
Scharlau, Winfried (ed.) 1986: Rudolf Lipschitz: Briefwechsel mit Cantor, Dedekind, Helmholtz,
Kronecker, Weierstrass und anderen. Braunschweig: Vieweg und Sohn.
Schlick, Moritz 1918: Allgemeine Erkenntnislehre. Berlin: Julius Springer.
187
SchrOder, Ernst 1890-1905: Vorlesungen iiber die Algebra der Logik (exacter Logik). 3 Bande.
Leipzig: B. G. Teubner.
Sinaceur, M.-A. 1974: 'L'infini et les nombres. Commentaires de R. Dedekind a Zahlen.
La correspondence avec Keferstein', Revue d'histoire des sciences, 27, 251-78.
Skolem, Thoralf 1923: 'Einige Bermerkungen zur axiomatischen Mengenlehre'. Reprinted in
Skolem 1970, 137-52. English translation in van Heijenoort (ed.) 1967, 290-301.
Skolem, Thoralf 1930: 'Einige Bemerkungen zu der Abhandlung von Ernst Zermelo "Dber die
Definitheit in der Axiomatic" " Fundmenta mathematicae, 15, 337-41. Reprinted in Skolem
1970, 275-9.
Skolem, Thoralf 1970: Selected papers in logic. Edited by Jens Erik Fenstad. Oslo:
Universitetsforlaget.
Tarski, Alfred 1936: 'Dber den Begriff der logischen Foigerung', Actes du Congres International
de Philosophie Scientijique, 7, 1-11. English translation by J. H. Woodger in Tarski 1956,
409-420.
Tarski, Alfred 1956: Logic, semantics, metamathematics: papers from 1923-1938, translated
by J. H. Woodger. Oxford: Clarendon Press. Second, corrected edition, with a new introduction by John Corcoran, published by Hackett Publications, Indianoplis, 1983.
Tijdeman, R. 1976: 'Hilbert's seventh problem: on the Gel'fond - Baker method and its applications' in Browder (ed.), 241-68.
Toepell, Michael-Markus 1986: Uber die Entstehung von David Hi/berts "Grundlagen der
Geometrie". (Studien zur Wissenschafts- Sozial- und Bildungsgeschichte, Band 2.) Gottingen:
Vandenhoeck and Ruprecht.
Weyl, Hermann 1910: 'Dber die Definition der mathematischen Grundbegriffe', Mathematisch-naturwissenschaJtliche Blatter, 7, 93-5, 109-13. Reprinted in Weyl 1968, volume
1, pp. 298-304.
Weyl, Hermann 1917: Das Kontinuum. Reprinted by Chelsea Publishing Co., New York.
Weyl, Hermann 1919: 'Der circulus vitiosus in der heutigen Begriindung der Analysis',
lahresbericht der deutschen Mathematiker-Vereinigung, 28, 85-92. Reprinted in Wey11968,
volume 2, 43-50.
Weyl Hermann 1944: 'David Hilbert and his mathematical work', Bulletin of the American
mathematical society, 50, 612-54. Reprinted in Wey11968, volume 4, 130-72; page numbers
in the text refer to this reprinting. Also partially reprinted in Reid 1970, 245-83.
Weyl, Hermann 1968: Gesammelte Abhandlungen. Volumes 1-4. Berlin, Heidelberg, New
York: Springer-Verlag.
Whitehead, A. N. and Russell, B. A. W. 1910-13: Principia mathematica. Volumes 1-3.
Cambridge: Cambridge University Press.
Willard, Dallas (ed.) 1994: Edmund Husserl: early writings in the philosophy of logic and
mathematics. Dordrecht: Kluwer Academic Publishers.
Zermelo, Ernst 1908: 'Untersuchungen iiber die Grundlagen der Mengenlehre, 1', Mathematische
Annalen, 65, 261-81. English translation in van Heijenoort (ed.) 1967, 199-215.
Zermelo, Ernst 1930: 'Uber Grenzzahlen und Mengenbereiche', Fundamenta mathematicae,
16, 29-47. English translation in Ewald (ed.) 1996.
MATHIEU MARION
Moe; ltv<!)
wvtrj.J
Heraclitus, Fr. 60
The mathematical legacy of Kronecker is impressive. The list of mathematicians who took up his problems includes Adolph Hurwitz, David Hilbert,
Kurt Hensel, Julius Konig, Ernst Steinitz, Erich Hecke, Helmut Hasse, Carl
Ludwig Siegel, Hermann Weyl and, recently, Andre Weil, Robert Langlands,
Harold Edwards. But, although nobody has ever doubted the brilliance of
Kronecker's results and insights, it is relatively safe to say with Edwards
that, more than a century after the publication of his complete works, "there
are important passages in Kronecker's work that no one, ever has fully understood, other than Kronecker himself" (1987, p. 29). This situation is
the result of a lack of interest caused by disdain for Kronecker's well-known
foundational stance and for his style of mathematics. Indeed, Kronecker's
mathematical practice features an insistence on providing algorithms which
is alien to the abstract, axiomatic approach which has been so dominant in
the past hundred years, in particular in Bourbaki's treatises. From this point
of view, algebra consists in the study of (highly) abstract notions - the structures-meres - such as groups, rings, field, and so forth. Dedekind presented
and refined his theory of ideals in successive versions of his Supplement XI
to Lejeune-Dirichlet's Vorlesungen aber Zahlentheorie (1879). Adopted by
Hilbert in his Zahlbericht and studied carefully by Emmy Noether and her
students, Dedekind's ideals played a crucial role in the development of modem
algebraic number theory.
Kronecker (1882) had an altogether different approach from Dedekind.
He considered in his theory of algebraic functions polynomials of the form:
where the coefficients ao, aI' ... , an are, in the old terminology, 'rational
integral functions' of a variable x, i.e.
M. Marion and R. S. Cohen (eds.), Quebec Studies in the Philosophy o/Science I, 189-215.
1995 Kluwer Academic Publishers.
190
MATHIEU MARION
where aQ, ai' ... , an are rational function of x, in Kronecker's theory the
variable x and the coefficients Ck, I can only be rational. These limitations proved
intolerable to followers of Riemann (i.e. not only Dedekind and Weber, but
also Weierstrass, Hilbert and others) who worked on the theory of functions
of a real or complex variable.
Dedekind (and Weber) assumed the field of complex numbers C as the
ground field. Since the Fundamental Theorem of Algebra asserts that every
equation f(x) = 0 in Q has a root a in C, Dedekind worked with the subfield
Q(a). Hermann Weyl described this approach in these words:
This algebraic number field is cut out from the continuum [C]. The standpoint thus described
is that of analysis. (1940, p. 10)
Kronecker, however, wanted to make the arithmetic of algebraic numbers independent of the theory of complex numbers and he avoided C by using, for
example, congruences modulo ~ + 1 instead of the imaginary unit i = ~.
From this viewpoint, the expression ~ is only a "symbol", while the equality
~ = -1, which defines ~, is said to be "real" and
+ 1),
It is in this context that the modem schools of philosophy of mathematics formalism in particular - took shape. This historical fact is of great importance
and should not be overlooked.
To most philosophers of mathematics Kronecker is also a villain, because
he committed the unpardonable sin of opposing Cantor's pioneering work in
set theory.4 Moreover, Kronecker is accused of having made vicious personal
attacks on Cantor. I believe, however, that Kronecker's reputation as a klein
191
no completed infinities,
(3)
192
MATHIEU MARION
The first requisite is best understood by look at the role played by his
allgemeine Arithmetik - this expression refers to the arithmetic of polynomials
with rational or integer coefficients. 1O The motto according to which "natural
numbers were created by God, everything else is the work of men" reflects
Kronecker's conviction that number theory and (what we now call) algebraic
geometry can be constructed form Gattungen of quantities algebraic over
natural fields (natiirliche Rationalitiitsbereiche), i.e., the fields of quotients
of the ring of polynomials Z[XI' X 2, , xn] in some indeterminates XI' x 2,
, Xn with integral coefficients. The Gattungsbereiche are constructed by
adjoining to a natural field one root of a polynomial f(x) irreducible over
the field. It seems that Kronecker realized that these Bereiche are all that he
needed. II Andre Weil expressed essentially the same idea while commenting
on Kronecker's Grundziige:
.. in the final analysis, every statement we can make can be thought of as a theorem in algebraic geometry over an absolutely algebraic-ground field, i.e., either over a finite field or over
an algebraic number-field of finite degree. While this realization, of course, cannot in any way
detract from the methodological importance of arbitrary ground-fields as one of the chief tools
of modern algebraic geometers, it give us some insight into the deep meaning of Kronecker's
view, according to which the absolutely algebraic fields are the natural ground fields of algebraic geometry, at any rate as long as purely algebraic methods (as distinct from analytical or
topological methods) are being used. (1950, p. 444)
The second of these requisites was spelled out in a footnote to his paper
'Uber einige Anwendungen der Modulsysteme auf elementare algebraische
Fragen':
It seems to me that these considerations are in opposition to the introduction of Dedekind's
concepts of 'module', 'ideal', etc.; as well as to the introduction of various new concepts,
which have been used in many recent attempts (first of all by Heine) to grasp and to give foundations to the concept of 'irrational' in all its generality. The general concept of an infinite series,
e.g. one which increases according to definite powers of variables, is in my opinion permissible only with the reservation that, in every special case, certain assumptions must be shown
to hold, on the basis of the arithmetical laws of construction of the terms (or coefficients),
. . , which allow the use of the series as finite expressions and thus make it really unnecessary to go beyond the concept of a finite series. (1886a, p. 156)
193
functions whose zero values cut out intervals in the real line in which such
numbers can be isolated from their Galois conjugates, and for Kronecker, to
talk here of Existenz is simply to talk of possible calculations with these
intervals (1887a, p. 272): there is no need to introduce new entities the way
Dedekind did. In this respect, it is important to understand that Kronecker
did not limit himself to the use of elementary or algebraic tools (such as Sturm's
Theorem) and calculations sometimes also involved analytic tools: it is the
ability to calculate which was his prime concern.
In these remarks, I have already partly dealt with the third requisite, which
was, according to Hensel's quotation, the distinguishing feature of Kronecker's
mathematical practice. Kronecker required that proofs of existence contain a
method to find, in a finite number of steps, an arbitrarily good approximation to the number whose existence was proven, and to have held that a
definition, in number theory or algebra, is acceptable only if it could be checked
in a finite number of steps whether any given number falls under it or not.
To give an example, Kronecker rejected the usual definition of irreducibility
according to which if a polynomial f(x) has a rational factor, it is reducible,
otherwise it isn't, because
... [itl is devoid of a sure foundation until a method is given by means of which it can be decided
whether a given function is irreducible or not by the definition. (1882, pp. 256-257)
A similar remark is to be found in Julius Molk's study' Sur une notion qui
comprend celie de la divisibilite et sur theorie generale de l' elimination':
Definitions should be algebraic and not just logical. It is not sufficient to say: 'an object is or
isn't'. One must explain what is meant by being and not-being, in the particular domain in
which we are moving. Only then are we making a step forward. If we define, for example, an
irreducible function as a function which is not reducible, i.e. decomposable in other functions
of a precise nature, we are not giving an algebraic definition, but a simple logical truth. In
order for us to be able to give this definition in Algebra it must be preceded by a method by
which we can obtain, in a finite number of rational operations, the factors of a reducible function.
Only this method gives to the words reducible and irreducible and algebraic meaning. (1885,
p. 8)
194
MATHIEU MARION
and he held that any improvement in this direction was an important problem,
through which our knowledge of an essential point could be extended". Indeed
Lagrange's interpolation formula and the like are essential for generalizations, not just extraneous information.
This feature of Kronecker's practice was entirely alien to Dedekind. His
ideals came with no specific means of computation; this was not acceptable
to Kronecker. Dedekind's reaction to the remark by Kronecker just quoted was
published in a footnote to Was sind und was sollen die Zahlen?, two years later.
The footnote occurred just after the introduction of a "system" or set S as "completely determined when with respect to every thing it is determined whether
it is an element of S or not" (1888, 2). Dedekind was objecting to (3),
inasmuch as it implies that all concepts have to be decidable; according to him,
that a given object falls under a given concept or not is determined independently of our knowledge:
In what manner this determination is brought about, and whether we know a way of deciding
upon it, is a matter of indifference for all that follows; general laws to be developed in no way
depend upon it; they hold under all circumstances. I mention this expressly because Kronecker
not long ago ... has endeavoured to impose certain limitations upon the free formation of concepts
in mathematics which I do not believe to be justified ... (1888 2, note a)
Although the insistence on (3) is the same, Wittgenstein's motives were different. As a philosopher, he had no interest in pursuing work in foundations
of mathematics. At any rate, he knew little about it. But in the course of
their activity philosophers often emphasize aspects of mathematics which
are relevant to their discussion. It seems to me that Wittgenstein sought to
195
But, apart from the case where the domain is finite, the "ideal table" must
contain an infinity of elements, and
... one cannot see how it could be effectively realized. There is no need for this since, in
order to study in a precise manner the arithmetical dependence of y and of x, it is not necessary to encompass it all in one look; it suffices to obtain at any moment, in a rigorous fashion,
those of the elements which are needed.
This is the case when the ideal table is, so to speak, condensed in a computational procedure from which one obtains effectively the value of y corresponding to each value of x in the
domain (x). (1909, p. 20)
196
MATHIEU MARION
We can see here that the rejection of the notion of an arbitrary function is
linked with requisite (3), i.e. with the emphasis on algorithms. As an example
of the opposite standpoint, I shall quote Philip Jourdain's objection (to
Pringsheim's original article):
This implies that the function must be defined by at most an enumerable aggregate of specifications. However, such a restriction, which would reduce the cardinal number of all functions
to be considered from [2"1 to [Nol, and would in general, exclude integrable functions, is certainly not necessary for the theorems on the upper and lower limits of a function, and, in any
case, is a practical necessity irrelevant to our contemplation of functions sub specie aeternitatis. (1905, pp. 185-186n.)
The rejection of the notion of arbitrary function is one of the main features
of Wittgenstein's philosophy of mathematics. 20
I would like to compare now the different philosophical approaches behind
Dedekind's and Kronecker's algebraic number theories. There is a fascinating
similarity in the way Dedekind conceived his three fundamental notions of
ideals, cuts and chains. 21 In order to avoid having to produce irreducible factors,
for a given arbitrary algebraic field Q(a), he introduced his ideals as (possibly
infinite) sets of integers of Q(a) behaving as if they were divisible by some
common irreducible factor. In Stetigkeit und irrationale Zahlen, Dedekind
introduced cuts (Schnitte) as any division of the rational numbers into two
classes such that any number in the first class is less than any number in the
second. For classes Al and A2 , the cut is denoted by (AI' A2) (1872, pp. 12-13).
According to this conception, it suffices to talk of irrational numbers, such
197
n,
as
in terms of sets of rational numbers behaving as if separated off by
numbers AI and A2.22 In Was sind und was sol/en die Zahlen?, Dedekind did
not define a particular class as the natural numbers, but a class of "simply
infinite systems"; anyone of which being able to serve as the subject of arithmetic, if we abstract from their specific properties (1888, 73).23 A class N
is a simply infinite system if there is a one-to-one function <p mapping N
into itself, an object a which is not a value of <p for an argument in N, and
N is the intersection of all classes containing a and <p(x) if it contains x i.e. if N contains a and the chain (Kette) of a (1888, 37 & 44). This leads
to the logicist definition of natural numbers as the set of all classes which
behave as if possessing just as many members as that class.
In all three cases, Dedekind was able to offer a most general (and nonalgorithmic) definition. The root of this common approach is, to my mind, a
general methodological requirement for the introduction and axiomatization of
new abstract notions. This is the requirement that the introduction of new
concepts should not be marred by explicit, arbitrary representations:
My efforts in number theory have been directed towards basing the work not on arbitrary
representations or expressions but on simple foundational concepts, and thereby - although the
comparison may sound a bit grandiose - to achieve in number theory something analogous to
what Riemann achieved in function theory, in which connection I cannot suppress the passing
remark that Riemann's principles are not being adhered to in a significant way by most writers
- for example, even in the newest works on elliptic functions. Almost always they mar the
purity of the theory by unnecessarily bringing in forms of representation which should be
results, not tools, of the theory. (1932, p. 477)
where a o, ai' a 2 , , a'A._1 are integers. (See Kummer, 1851, for example.)
Unique factorization into primes was not valid in this field, and Kummer
had to introduce "ideal prime factors", as he called them, in order to reestablish it. With the theory of cylotomic fields with prime A, Kummer was able
to prove Fermat's Last Theorem, i.e.
198
MATHIEU MARION
for prime exponents. But his interest was higher reprocity laws. He succeeded (in 1859) in generalizing his theory to extensions of these cyclotomic
field obtained by adjoining a Ath root, in order to prove his general reciprocity law. 24
Dedekind was dissatisfied with Kummer's theory for many reasons: first,
he complained that Kummer never explained what an ideal prime factor is,
but only defined such ideal prime factors by providing a procedure for determining the multiplicity with which a given cyclotomic integer is divisible
by them. 25 Secondly, in cyclotomic fields the factorization of a rational prime
p in the number field can be determined from the factorization of a cyclotomic polynomial modulo p, but Kummer's approach didn't lend itself to a
generalization, because of ramification, and this is why Dedekind abandoned
it. Dedekind saw in this "arbitrary representation" the source of Kummer's
mistakes: 26
Kummer has not defined ideal numbers themselves, but only divisibility by these numbers. If
a number a possesses a given property A, always consisting in the fact that a satisfies one or
many congruences, he says that a is divisible by a determined ideal number, corresponding to
the property A. Although this introduction of new numbers is perfectly legitimate, it is to be
feared that, as a result of the mode of expression one has chosen, according to which one
speaks of ideal numbers and their products, and by the presumed analogy with rational numbers,
one would be drawn to hasty conclusions and thus to insufficient proofs; this pitfall was indeed
not always entirely avoided. (1887, p. 268)
199
a2,
J,
where Ill' 112, . . . , are elements of K. The product r of two ideals I and t}
of the field k is the smallest ideal which contains the product a~ of each a
of I and each ~ of t}. So I and t} are factors of r if it is the product of I and
t}. (A prime ideal r = It) if only if r = I or r = t}.) For example, 3 and 5
are replaced by the ideals [3] = (3, 6, 9, ... ) and [5] = (5, 10, 15, ... ), i.e.
by the set of all numbers divisible by, respectively, 3 and 5. The number 15
is replaced by the ideal [15] = (15, 30, 45, ... ), which contains all the products
3m X 5n. These are in [3] and [5], so [15] = [3] x [5]. From this definition,
Dedekind was eventually able to prove that every ideal has a unique representation as the product of prime ideals. We can see here that Dedekind's theory
of ideals consis.ts in manipulations of infinite sets. Although these are relatively harmless, Dedekind opened the door to Cantorian set theory?7
200
MA THIEU MARION
Requisite (3): in order to avoid the requisite about the existence of individual real numbers, which he could not satisfy, Dedekind was forced to tum
attention away to the existence of their complete system (denkbar vol/stiindigstes Grossen-Gebiet), i.e. to move, so to speak, from local to global
questions. 28 This procedure is even more transparent in Was sind und was sol/en
die Zahlen?, where Dedekind starts with an informal introduction of the natural
numbers by means of the notions of system, 1 and successor, which leads
directly to the definition of a simply infinite system. Dedekind then proves
the existence of a simply infinite system (1888, 66) and, further, that it has
a simply infinite subsystem (1888, 72). Dedekind thought that these "logical"
proofs of the existence of these systems had secured the consistency of these
notions, as seen from his letter to Keferstein:
After the essential nature of the simply infinite system, whose abstract type is the number sequence
N, had been recognized in my analysis (articles 71 and 73), the question arose: does such a system
exist at all in the realm of our ideas? Without a logical proof of existence it would always
remain doubtful whether the notion of such a system might not perhaps contain internal contradictions. Hence the need for such proofs (articles 66 and 72 of my essay). (1890, p. 101)
201
nomial in XI' X2' , x n ), Hilbert proved that any modular system M has a
basis such that any polynomial F of M can be written as
F=AIFI +A 2F 2 + +AJm
where F I , F 2 , , F m' is a finite set of polynomials in M and AI' A 2 ,
Am' are suitable polynomials, not necessarily in M. This result is often cited
as a decisive victory of the new abstract methods over cumbersome old ones.
For example, E. T. Bell wrote in The Development of Mathematics that "the
[algorithmic] method was incapable of revealing the general underlying principle of which Gordan's theorems are but special manifestations" (1940, pp.
211-212).31 But it is seldom said, as Hilbert himself pointed out (1890, p. 150),
that the result was inspired by Kronecker's ideas about homogeneous polynomials. Poincare is the exception here; he wrote in his report for the Bolyai
Prize on the works of Hilbert that
This is a consequence of the fundamental notion of [modular system] introduced by Kronecker.
This means, in Kronecker's language, that the divisors common to any [modular system], even
where they are infinite in number, are submultiples of one of them which is their greatest common
divisor ... (1911, p. 754)32
202
MATHIEU MARION
ated the algebraic extensions K of a given algebraic number field k such that
all the divisors in k become principal divisors in K. (Kronecker called these
extensions: "zu assoziierenden Gattungen".) His investigations of modular and
elliptic functions with 'singular moduli' led Kronecker to the formulation of
his "liebster lugendtraum" (1880, p. 453). Class field theory was developed
further by students of Hilbert, Furtwfulgler, Takagi, Artin and, later, Chevalley
and Hasse,35 with Teiji Takagi finally proving the lugendtraum in 1920.
Nowhere is the relevance of Kronecker's work to contemporary mathematics
as obvious as in the case of the theory of elliptic functions with complex
multiplication. Building on earlier work of Gauss, Abel and Jacobi, Kronecker
initiated number-theoretic investigations of complex multiplication. For
example, the main result of 'Zur Theorie der elliptischen Funktionen XI'
(1886b, p. 164) is, in the words of Serge Vladut, "a profound theorem of the
arithmetic of modular functions, which can also be used as a technical basis
for complex multiplication theory" (1991, p. 75). From this result, Kronecker
obtained his 'fundamental relation' (1886b, p. 164), of which he was to give
three proofs, and the so-called 'Kronecker congruence relation' (1886b, p. 177).
Already as a young man, Kronecker saw very far: he found out that Kummer's
cyclotomic fields are the only Abelian extensions of the rational numberfield and conjectured two fundamental theorems: the Kronecker-Weber
theorem (1853, p. 101), which says that every Abelian extension of the rationals is a subfield of the cyclotomic field Q(s), and his "liebsten lugendtraum",
i.e., the dream of constructing all these Abelian extensions by using special
values of analytic functions. 36 Later, Hilbert tried to generalize these results
and this led him to the formulation of his 12th problem (1902, pp. 18-20).37
It is usually assumed that Kronecker only had in mind that the division of
elliptic functions with complex multiplication provides all Abelian extensions, i.e. the completeness theorem. But, according to Weil, Kronecker was
even more ambitious: in parallel to Eisenstein's extension of Kummer's work
on cyclotomy to the lemniscate, Kronecker wanted to extend Kummer's work
on the factors of class-numbers of cyclotomic fields and their p-adic properties to all imaginary quadratic fields and their Abelian extensions (Weil,
1976, p. 88).
I would like to make two remarks. First, it is a striking fact that there is
no equivalent line of inquiry in Dedekind's work. Secondly, the interest of
results such as the Kronecker-Weber theorem or the lugendtraum lies in the
intimate relation between algebraic extension and special values of purely
analytical functions. These results would be of no interest if one had constructed the extensions with non-constructive methods or if the functions
were shown to have an algebraic origin. Moreover, these results can be interpreted both ways, i.e. as generating number fields 'analytically' or as defining
continuous quantities 'algebraically' .38 This clearly shows that talking of a
"putsch" (Hilbert, 1922, p. 160) or of an attempt at abolishing analysis (Bell,
1940, p. 257), in relation to Kronecker's work, is just incredibly misleading.
Why should Kronecker bother obtaining brilliant analytic results such as,
203
say, his 'limit formulas'?39 It is easy to jump to the conclusion that there
was no coherence between Kronecker's mathematical practice and his scattered philosophical remarks and that his own practice was contradicting his
foundational position. But we would do better by simply getting rid of the
caricature of Kronecker as a klein Despot or as a Verbotsdiktator.
To come back to the development of the theory of elliptic and modular functions, Serge Vladut has shown in the third part of his book Kronecker's
Jugendtraum and Modular Functions (1991) that a good deal of the modem
theory arithmetic of modular functions of one variable is derived from
Kronecker's work, i.e. results, such as the congruence relations, obtained in
papers such as 'Zur Theorie der elliptischen Funktionen XI' (1886b). One clear
example of this Kroneckerian heritage is the Eichler-Shimura congruence
relation, whose prototype is Kronecker's main theorem of (1886b) mentioned
above (Vladut, 1991, p. 76). The multidimensional generalization of the theory
of complex multiplication, started by Hecke and developed by Deuring, Weil,
Shimura and Taniyama40 and the theory of Shimura varieties are also among
modem developments owing to the work of Kronecker. Finally, it is impossible to avoid mentioning here the so-called 'Langlands philosophy' or
'Langlands Programme', 41 which consists in a series of conjectures about the
connections (characterized by L-functions) between finite-dimensional
representations of the Galois group of a local or a global field with infinitedimensional complex representations of linear groups over this field. Langlands
has discussed the Kroneckerian heritage of his work in 'Some Contemporary Problems with Origins in the Jugendtraum' (1976).
Hilbert's rhetorical excesses against Kronecker were somewhat excusable
in early XXth century. The use of set-theoretic tools led to great advances in
analysis and to the development of whole disciplines such as point-set topology.
Even in algebraic number theory, Kronecker's approach was felt not to have
produced results - the notable exception being Hensel's work on p-adic
numbers, whose importance was at any rate not immediately understood while Dedekind's ideal-theoretic approach led to early successes, starting
with the proof of the Riemann-Roch Theorem by Dedekind and Weber (1882).
But this rhetoric was already becoming somewhat misleading by the midcentury, as is recorded by Helmut Hasse, in the preface to the first edition
of Number Theory (1949):
It seemed at first that the ideal-theoretic approach was superior to the divisor-theoretic, not
only because it led to its goal more rapidly and with less effort, but also because of its usefulness in more advanced number theoretic research. For Hilbert and, after him, Furtwangler and
Takagi succeeded in constructing on this foundation the imposing structure of class field theory,
including the general reciprocity law for algebraic numbers, whereas on Hensel's side no such
progress was recorded. More recently however, it turned out, first in the theory of quadratic
forms and then especially in the theory of hypercomplex numbers (algebras), not only that the
divisor-theoretic or valuation-theoretic approach is capable of expressing the arithmetic structural laws more simply and naturally, by making it possible to carryover the well-known
connection between local and global relations from function theory to arithmetic, but also that
the true significance of class field theory and the general reciprocity law of algebraic numbers
204
MATHIEU MARION
are revealed only through this approach. Thus, the scales now tip in favour of the divisortheoretic approach. (1949, p. vi)42
Hasse's statement is of interest; it records that the tide has turned. This is
an important change in XXth century mathematics, seldom acknowledged
by philosophers. Often, contemporary philosophers take their lead from predecessors, whose opinions reflect a past state of affairs in mathematics. Thus,
although such important changes do not necessarily invalidate the conclusions of past philosophers, much of contemporary philosophy of mathematics
runs the risk of becoming irrelevant to the preoccupations of mathematicians.
One is reminded here of Jean Dieudonne's words, when he complained about
philosophers "who believed that they spoke to today's mathematics, while they
considered in fact mathematics of the day before yesterday" (1981, p. 27).
Kronecker's foundational ambition, as he expressed it in 'Uber den Zahlbegriff', was to achieve an "arithmetization" (Arithmetisierung) of algebra and
analysis, i.e., to found them "solely on the concept of number understood in
its narrowest sense" (l887a, pp. 252-253). It was indeed his belief that we
could "base all of pure mathematics on the theory of whole numbers"
(Meschowski, 1967, p. 238). But most mathematicians (and philosophers)
believe, as Cantor did, that an arithmetization of analysis could not be "effected
without the use of the actual infinite in some form" (Meschowski, 1967,
p. 250). Kronecker's programme of a strict arithmetization, with his requisites (1)-(3), is not, however, a prima facie unrealizable dream, as shown by
Reverse Mathematics. The latter was devised by Harvey Friedman and his
collaborators in order to answer the question: "Which set existence axioms
are needed to prove the theorems of ordinary mathematics?" What is meant
here by ordinary mathematics is "mainstream or non-set-theoretic mathematics,
i.e. mathematics as it was before the abstract set theorists got hold of it"
(Simpson, 1985, p. 461). As a starting point, theorems are formalized within
a system of second order arithmetic called Z2 (or, sometimes, rr~-CAo). A hierarchy of subsystems of Z2 is obtained by restricting set existence axioms;
restricting, for example, the comprehension scheme to arithmetical formulas
(the resulting system being called ACA o). The basic, weakest system considered in this hierarchy is RCAo (recursive comprehension axiom), from which
the remaining systems are built up. Although it is quite weak, it is strong
enough to develop basic results about continuous functions of a real variable
and countable algebraic structures. The task of Reverse Mathematics is to
find out which parts of 'ordinary mathematics' can be developed within
which of these subsystems. The principal phenomenon uncovered is this:
Very often, if a theorem of ordinary mathematics is proved from the weakest possible set existence axioms, it will be possible to "reverse" the theorem by proving that it is equivalent to
those axioms over a weak base theory. (Simpson, 1985, p. 467)
205
(In modem terms, Lagrange resolvents are called Gaussian sums.) These words
should come as no surprise, considering the importance of the requisite (3)
for Kronecker. To my mind, Jourdain's "contemplation sub specie aeternitatis"
of the concept of function, mentioned above, is a perfect example of the
theorizing that Kronecker wanted to avoid in his work. It could be argued
that Cantorian set theory cannot be dissociated from a Platonist or realist
philosophy, i.e. that it is essential to Cantor that, in parallel to our intensional understanding of a concept such as that of natural numbers, its extension
is already to exist in its totality - e.g. that the infinite table is somewhat already
written down, irrespective of our ability to read it over. While Cantor has
devoted much space to this philosophical question, Kronecker wanted nothing
of the sort, no "speculation".
Reading Kronecker's letter to Cantor, one cannot but be reminded of
Wittgenstein,46 who thought that:
It is a strange mistake of some mathematicians to believe that something inside mathematics
might be dropped because of a critique of the foundations. Some mathematicians have the right
206
MATHIEU MARION
instinct: once we have calculated something it cannot drop out and disappear! And in fact,
what is caused to disappear by such a critique are names and allusions that occur in the calculus.
(1979, p. 149)
207
208
MATHIEU MARION
to Kronecker' reluctance to have it published. This is a supposedly telling fact: "The Beitrag,
in signaling the lengths to which its adversaries would carry their attempts to defeat it, also
reflected the revolutionary ideas with which Cantor had begun to work" (Dauben, 1979,
p.66).
5 Contrary to Cantor, there is precious little verbal abuse in Kronecker's writings; evidence upon
which this reputation rests is mostly hearsay (see for example the letter from Mittag-Leffler to
Hermite in Dugac (1973, pp. 16lf. or invention. For example, the well-known accusation
that Cantor was a "corruptor of the youth" (ein Verderber der lugend) was never made by
Kronecker: it is Schoenflies (1927, p. 2), a follower of Cantor, who assumed that Kronecker
was of that opinion. (Many stories about Kronecker are simply not backed by any hard evidence.
One such story is that of Kronecker's comment to Lindemann, on his proof of the transcendence of 1t: "Of what value is your beautiful proof, since irrational numbers do not exist?"
(Bell, 1940, p. 253, Dauben, 1979, p. 69). As far as I know, there is no primary source for
this implausible story.) I do not wish to enter a sterile debate on Kronecker's personality: as
philosophers, we ought not to pay too much attention to the (largely successful) attempts to smear
Kronecker's reputation - mathematicians don't, as we shall see. (In fact, we could tum around
the quotation of Dauben and say: the lengths to which adversaries of Kronecker's ideas carried
their attempts to suppress them (for more than a hundred years!) also reflect their revolutionary
nature.) Dedekind himself did not share Cantor's negative opinion of Kronecker. Compare also
Dauben's remarks, in the footnote above, with (Edwards, 1989, pp. 71f.) and with Edwards'
discussion of the relations between Kronecker and Dedekind in (1980, pp. 368-372).
6
At any rate, there are some very useful studies. An early introduction to Kronecker's algebraic number theory is Hermann Weyl's Algebraic Theory of Numbers (1940). In Divisor
Theory (1990), Harold Edwards develops a theory of divisors on the basis of Kronecker's
Grundziige (1882), and both Andre Weil's Elliptic Functions according to Eisenstein and
Kronecker (1976), and Serge Vladut's Kronecker's lugendtraum and Modular functions (1991)
concentrate on his work on modular and elliptic functions. Edwards has also published a series
of articles (1980, 1987, 1989, 1992a, b), expounding Kronecker's foundational standpoint and
rectifying common misconceptions about it.
7
The mathematics involved here are quite difficult, especially for a non-specialist, and it is
extremely difficult to present them in a concise manner in such a restricted space. I apologize
for the shortcomings of my presentation and hope to have at least given enough references to
the readers for them to be able find the relevant information.
S
All translations from German or French are mine, except when an already available translation is listed in the bibliography. In that case, the references are to the translation.
9 The last point is very interesting, if we think in terms of Methodenreinheit: it is usually
assumed, in particular by Hilbert, that the use of impure methods was simply a shorter, simpler
way to obtain results. Steinitz's proof of the fundamental theorem of algebra (1910), based on
the so-called Kronecker-Steinitz construction, is a good example to the contrary. In 'The Modem
Algebraic Method', Hasse remarked that "The objection that this construction is formal and yields
nothing new in terms of content is without merit. To my mind, this procedure is conceptually
richer and less burdened with formal computations than the so-called algebraic proofs of the
so-called fundamental theorem of algebra" (1930, p. 19), the latter being in fact a theorem of
analysis. It should be said, however, that 'impure' proofs have, in Hasse's words, a greater and surely non-negligible - Beziehungsrechtum, which is lost when a purer proof is given. I
hasten to add also that the expression 'Kronecker-Steinitz' is somewhat misleading. It is true
that Steinitz - who was student of Hensel - built on Kronecker's ideas but, on the other hand,
he did not respect requisite (2), stated below: his result on the uniqueness of an algebraic
closure uses the axiom of choice. Compare also Steinitz's remarks on Zermelo's Auswahlprinzip:
"Many mathematicians are still opposed to the axiom of choice. With the increasing recognition that there are questions in mathematics which cannot be decided without this axiom, the
opposition to it must increasingly disappear. On the other hand, it seems expedient in the
interest of the purity of method to avoid the said axiom when the nature of the question
does not require its application. I have endeavoured to make these limits conspicuous" (1910,
209
pp. 170-171). If however, Kronecker's approach to algebraic number theory is followed faithfully, there is simply no need to discuss uniqueness. See Weyl (1940. p. 12).
\0
On the allgemeine Arithmetik, see Kronecker's remarks at the beginning of 'Ein Fundamentalsatz der allgemeinen Arithmetik' (1887b, p. 212) and at the end of 'Uber den Zahlbegriff'
(1887a, pp. 273-274), Hensel's remarks in his introduction to Kronecker's Vorlesungen iiber
Mathematik (Kronecker, 1901, p.v), and the comments in Edwards (1992b, pp. 132-133).
II
To give only one example, in his 1886 paper' Zur Theorie der elliptischen Funktionen XI',
Kronecker used the modular function p - K + 11K (with K being the Legendre modulus), and
the Rationalitiitsbereich corresponded to Q(p) and the ground ring to Z[pl, so that the divisor
theory concerned rings which are integral over Z[p].
12
Kronecker also wrote to Cantor, in 1884, that "the new conceptual constructions are at
least not necessary" (Meschowski, 1967, p. 238). Edwards also insisted on this point (l992b,
p. 138).
13
See also Edwards (1990, pp. vi-vii & 24) and Vladut (1991, p. 45).
14
For more details on Wittgenstein on the foundations of mathematics, see Marion (forthcoming
(a.
15
In modem terms, Dirichlet proved that the Fourier series converges pointwise to the function
for any periodic function with period 21t that is piecewise monotone and has at most a finite
number of discontinuities, at each of which the function has a left-hand limit and a right-hand
limit and a value that is the average of these two limits.
16
This interest in Riemann's work is quite explicit in the introduction to their paper (1882,
pp. 238-241). See also Dedekind's letter to Cantor, dated February 17, 1882, where he says
that in constrast to Kronecker, his work with Weber was motivated by Riemann's notion of a
"point" (Dugac, 1976, p. 252).
11
There are many reports that he stated his views forcefully in his lectures, exasperating
Weierstrass (for example, see Dugac, 1973, pp. 161-162).
18
In the following, I shall quote from Molk's translation (1909): although it is presented as
written by Pringsheim, it is almost twice the size of the original text, and contains many developments not to be found in the latter, including the first two passages quoted below.
19
For Pringsheim's original statement, see his (1899, pp. 9f.).
20
For example, it helps us in explaining his objections to Ramsey's notion of 'functions in
extension'. There is a telling reference to Dirichlet' notion of an arbitrary function occuring in
a passage where Wittgenstein criticizes Ramsey's 'functions in extension' in Philosophical
Grammar (1974, p. 315). On this topic, see Marion (forthcoming (b.
21
On the unity of Dedekindian mathematics, see Kneebone (1963, p. 154), Dugac (1976,
pp. 141-142) and Edwards (1983, p. 13).
22
Strictly speaking, Dedekind insisted that an irrational number is "created by the human mind"
and "corresponds to" or "produces" a cut (1872, p. 15). Others, such as Weber, suggested that
irrational numbers should be taken to be the cuts. For Dedekind's objections to this suggestion, see his letter to Weber, January 24, 1888 (1932, vol. 3, p. 489).
23
This "freeing the elements from every other content" is a justification, according to Dedekind,
for calling also the natural numbers of "free creation of the human mind" (1888, 73).
24
In this very paper, Kummer announced the publication of "a work of Mr. Kronecker which
will appear soon, in which the theory of the most general complex numbers (... ) is developed
fully and with great simplicity" (Kummer, 1859, p. 737). Both Dedekind's and Kronecker's divisor
theories were conceived as generalizations, but, while Dedekind's theory extended Kummer's
theory to algebraic functions of one variable, Kronecker's extended it to algebraic function in
general. (It is very striking to think that Kronecker's algebraic number theory was intended as
a generalization of Kummer's theory from cyclotomic fields to all algebraic extensions of
natural fields.) In order to do so, Kronecker consciously developed his general theory without
having mind factorization into primes as the ultimate goal, as Dedekind did (Kronecker, 1882,
p. 309f.). (See, however, the result about 'prime forms' in (1882, p. 352).) But Kronecker published his results in his Grundziige, in 1882, that is eleven years after the second version of
Dedekind's Supplement Xl.
210
MATHIEU MARION
25
This objection sounds "curious" to modem number theoreticians. For example, see Mazur
(1977, p. 982). In Divisor Theory, Edwards also adopted the Kummer-Kronecker approach: "The
contemporary style of mathematics trains mathematicians to ask "What is a divisor?" and to want
the answer to be framed in terms of set theory. Those trained in this tradition will want to
think of a divisor as an equivalence class of polynomials, when the equivalence of polynomials is the property of representing the same divisor. I believe, however, that instead of asking
what a divisor is one should ask what is does. It divides things" (1990, p. 19). Philosophers should
reflect on the difference between the algorithmical approach of Kummer and Kronecker
and the set-theoretic attitude of Dedekind taken for granted by most of them. For Kummer's
divisibility tests, see Edwards (1980, p. 338 or 1992a, p. 60).
26
On common mistake at the time but avoided by Kummer was precisely to assume the validity
of unique factorization. See Edwards (1975).
27
These completed infinities are rather harmless compared to those involved in the theory of
irrationals as cuts, but it has been said that, because of these essential uses, Dedekind's Supplement
Xl is one of the origins of set theory, alongside Dirichlet's concept of an arbitrary function.
See Dugac (1976, p. 29)
28
See Dedekind's letter to Lipschitz, July 27, 1876 (1932, vol. 3, p. 477). For a similar presentation of Dedekind's viewpoint, see Sieg (1990, pp. 263f.).
29
Even Poincare wrote, in his obituary of Weierstrass, that Kronecker obtained results only
when he forgot that "he was a philosopher, and by voluntarily abandoning his principles, which
were condemned in advance to sterility" (Poincare, 1899, p. 17). It should be said that this remark
is based, however, on a misunderstanding of Kronecker's position (and, worse, it is all too
often quoted out of context). The principle in question is not (1) - Poincare ascribes it also to
Weierstrass - but something like: 'natural numbers being the foundations of everything, they
must be in evidence everywhere'. It is hard to see how one could ascribe such a principle to
Kronecker. See Edwards (1989, pp. 72-7).
30
On this topic, see Gauthier (1994).
31
Hilbert could then prove that for any system of forms every (rational integral) invariants
could be expressed as the linear combination of a finite set of them (1890, pp. 208f.) The proof
was criticized for its shortcomings by Paul Gordan, who supposedly cried out: "This is not
mathematics, it is theology!" (Bell, 1940, p. 403). While recognizing that the proof was correct,
he felt there was a gap in its execution, because Hilbert "limited himself to the proof of the
existence of that system of invariants, and renounced any discussion of their properties", such
as the upper bound to their number (1893, p. 132). Hilbert recognized in a subsequent paper
that his proof "provides no way of constructing such a system of invariants by means of a
finite number of processes which can be surveyed before the start of the computation so that,
for example, an upper bound for the number of the invariants of the system or for their degrees
in the coefficients of the ground form can be given" (1893, p. 268).
32
Perhaps it is worth mentioning in this context Dirichlet's 'other principle', expressed by
Hermann Minskowski: "to tackle problems with a minimum of blind calculations and a maximum
of insightful thoughts" (1905, pp. 460-461). Surely, Hilbert's results, as opposed to Gordan's,
is the perfect example of a result satisfying this principle. But there is no reason to associate
here Kronecker's work with Gordan's. On the contrary, Kronecker's papers contain a maximum
of thoughts so insightful that they were not understood by his contemporaries.
33
For Dedekind's notes on Kronecker's paper, see Edwards, Neumann & Puckert (1982).
34
It has been argued that the approach using the Prague Theorem is preferable to Dedekind's
fourth and last version of the Supplement Xl (Edwards, 1980, p. 352). But it brings the treatment of algebraic number theory closer to Kronecker's and it violates Dedekind 's methodological
requirement. This may explain why Dedekind objected to uses of his own theorem. See Dedekind
(1895) for Dedekind's negative reaction to Hurwitz's paper and Edwards (1980, sec. 13) for a
discussion of these matters.
35
It is worth mentioning here Herbrand's (posthumous) contributions to class field theory, such
as a classic result on local fields in (1931) and his contributions to the theory of number fields
211
of infinite degree in (1932) and (1933). Herbrand introduced a (now common) method based
on the notions of 'inductive limit' and 'projective limit' in connection with generalisations of
splitting groups and inenia groups when the fields k and K :::J k (K being the Galois extension
of the number field k ) are of infinite degree on Q. Claude Chevalley followed Herbrand in
extending the theory of class fields to the number fields of infinite degree. After introducing
his notion of idele, Chevalley went on to rework class field theory, getting rid of analysis by
giving algebraic proofs of all its main theorems (1940). For example, Chevalley gave an arithmetical proof of the so-called (in old terminology) 'first fundamental inequality', while Takagi
previously used Dirichlet's L-series. See Tate (1967, pp. 180f.). This is another example of
Methodenreinheit. On the other hand, Chevalley made room, following Anin, for the local-global
principle in class field theory.
36
For an adequate statement of the theorems see the chapter by W.J. & F. Ellison in Abrege
d'histoire des mathematiques 1700-1900 (Dieudonne, 1986, pp. 188-189).
37
Again, for an adequate statement, see Dieudonne (1986, p. 189). Hilbert's student Erich Hecke
made the first breakthroughs on the problem. It should be noted, however, that Hecke made ample
use of analytic continuation, which was never mentioned by Kronecker, presumably for philosophical reasons (Weil, 1976, p. 55).
38
lowe this imponant point to David Reed.
39
On Kronecker's first and second limit formulas, see Lang (1987, chaps. 20-22).
40
See the classic Shimura & Taniyama (1961).
41
For an introduction to the 'Langlands programme', see Gelban (1984) or Arthur (1981).
42
See also Hasse (1967, p. 266n.) and Weil (1950, p. 90) for a similar conclusion. In his preface
to his book Algebraic Theory of Numbers, in 1940, Weyl wrote: "I have axiomatized Kronecker's
approach to the problem of divisibility, which has recently been completely neglected; [... ]
The ultimate verdict may be that the one outstanding way for any deeper penetration into the
subject is the Kummer-Hensel p-adic theory" (1940, p. i). We are now in a better position to
evaluate the role of p-adic numbers, and it is so obvious that Weyl's prediction was entirely
correct that it is not even wonh arguing for.
43
For details about the programme and the results, see Simpson (1985, 1988).
44
Steinitz (1910) was the first to establish the existence and uniqueness of an algebraic
closure for a given field, a theorem which is considered by Hasse as the Fundamental theorem
of Algebra, since the original (Gaussian) Fundamental Theorem cannot be proved without analytic
tools and its validity is limited to the field of complex numbers (Hasse, 1926-27, pp. 247-248).
See Steinitz (1910, p. 33) and the editorial note 43 by Baer and Hasse (1910, p. 153). Bart
van der Waerden's Modern Algebra contains a description of Steinitz's existence and uniqueness theorems for countable fields (1931, pp. 193-196). Compare the treatment by Birkhoff &
MacLane (1941, pp. 1I4f. & 393).
45
Even in the countable case, the axiom of choice is needed for the proof of the uniqueness
theorem. See van der Waerden (1931, vol. I, p. 195). See notes 9 and 44 for other remarks
about Steinitz (1910).
46
One is also reminded of Poincare's answer to Russell's inquietude: when Russell stated
that until "a complete solution of our difficulties [the paradoxes] ... is found we cannot be
sure how much mathematics it will leave intact" (1906, p. 53), Poincare replied that "only
Cantorism and Logistic are called into question; real mathematics, ... , will continue to develop
according to their own principles" (1906, p. 307).
47
This much was shown by Frey and Ribet. See Barry Mazur's 'Number Theory as Gadfly'
(1991). On the recent spectacular attempt at a proof of the conjecture by Andrew Wiles, which
has been temporarily withdrawn since, see Ribet's repon in Notices of the American Mathematical
Society (1993).
48
Research for this paper was made possible by Post-doctoral Fellowships from the Social
sciences and Humanities Research Council of Canada and from the Fonds pour la Formation
de Chercheurs et l' Aide ala Recherche (Quebec), while 1 was successively a Research Associate
at the Center for the Philosophy and History of Science at Boston University and a Post-
212
MATHIEU MARION
Doctoral Fellow at the Universite de Montreal. I would like to thank Harold Edwards, Daniel
Isaacson, Yvon Gauthier, Angus Macintyre and David Reed for their comments on earlier versions
of this paper which led to substantial improvements.
REFERENCES
Arakelov, S. J., 1974a, 'Intersection Theory of Divisors on an Arithmetic Surface', Math. USSR
Izvestija, 1167-1180.
Arakelov, S. J., 1974b, 'Theory of Intersections on the Arithmetic Surface', Proceedings
of the International Congress of Mathematicians. Vancouver, 1974, vol. 1, 405408.
Arthur, J., 1981, 'Automorphic Representations and Number Theory', Canadian Mathematical
Society/Societe Canadienne de mathematiques. Conference Proceedings, Volume 1: 1980
Seminar on Harmonic Analysis, 3-51.
Bell, E. T., 1940, The Development of Mathematics, McGraw-Hi11, New York.
Birkhoff, G. and MacLane, S., 1941, A Survey of Modern Algebra, Macmi11an, New York.
Cantor, G., 1878, 'Ein Beitrag zur Mannigfaltigkeitslehre', }ournalfUr die reine und angewandte
Mathematik 77, 242-258; reprint in E. Zermelo (ed.): 1966, Gesammelte Abhandlungen,
reprint, Hildesheim, Georg Olms, 119-133.
Cassels, J. W. S. and Frohlich, J. (eds.), 1967, Algebraic Number Theory, Academic Press,
London/New York.
Chevalley, c., 1940, 'La theorie du corps de classes', Annals of Mathematics 41, 394--417.
Couturat, L., 1986, De l'infini mathematique, reprint: 1980, Blanchard, Paris.
Dauben, J., 1979, Georg Cantor. His Mathematics and Philosophy of the Infinite, Harvard
University Press, Cambridge Mass.
Dedekind, R., 1872, Stetigkeit und irrationale Zahlen, Vieweg & Sohn, Brunswick; English translation: 'Continuity and Irrational Numbers', in Dedekind (1963), 1-27.
Dedekind, R., 1877, 'Sur la tMorie des nombres entiers algebriques', in Dedekind (1932), vol.
3,262-296.
Dedekind, R., 1888, Was sind und was sollen die Zahlen?, Vieweg & Sohn, Brunswick, second
edition: 1893, third edition: 1911; English translation, 'The Nature and Meaning of Numbers',
in Dedekind (1963), 29-115.
Dedekind, R., 1890, 'Letter to Keferstein', in J. van Heijenoort (ed.): 1967, From Frege to Godel.
A Sourcebook in Mathematical Logic, 1879-1931, Harvard University Press, Cambridge
Mass., 98-103.
Dedekind, R., 1892, 'Uber einen arithmetischen Satz von GauS', in Dedekind (1932), vol. 2,
28-39.
Dedekind, R., 1895, 'Uber die Begriindung der Idealtheorie', in Dedekind (1932), vol. 2, 5058.
Dedekind, R., 1932, Gesammelte mathematische Werke, 3 vols., R. Fricke, E. Noether, and O.
Oystein (eds.), Vieweg & Sohn, Brunswick.
Dedekind, R., 1963, Essays on the Theory of Numbers, reprint, Dover, New York.
Dedekind, R. and Weber, H., 1882, 'Theorie der algebraischen Funktionen einer Veriindlichen',
in Dedekind (1932), vol. 1,238-349.
Dieudonne, J., 1981, 'La philosophie des matMmatiques de Bourbaki', in Choix d' oeuvres
mathematiques, Hermann, Paris, vol. 1, 27-39.
Dieudonne, J. (ed.), 1986, Abrege d'histoire des mathematiques 1700-1900, Hermann, Paris.
Dugac, P., 1973, 'Elements d'analyse de Karl Weierstrass', Archives for History of Exact Sciences
10, 41-176.
Dugac, P., 1976, Richard Dedekind et lesfondements des mathematiques, Librairie philosophique
J. V rin, Paris.
Edwards, H. M., 1975, 'The Background of Kummer's Proof of Fermat's Last Theorem for
Regular Primes', Archive for History of Exact Sciences 14, 219-236.
213
Edwards, H. M., 1980, 'The Genesis of Ideal Theory', Archive for History of Exact Sciences
23, 322-378.
Edwards, H. M., 1983, 'Dedekind's Invention of Ideals', Bulletin of the London Mathematical
Society 15, 8-17.
Edwards, H. M., 1987, 'An Appreciation of Kronecker', The Mathematical Intelligencer 9(1),
28-35.
Edwards, H. M., 1989, 'Kronecker's Views on the Foundations of Mathematics', in Rowe, D.
E. and J. McCleary (eds.) The History of Modern Mathematics. Volume I: Ideas and their
Reception, Academic Press, London, 67-77.
Edwards, H. M., 1990, Divisor Theory, Birkhiiuser, Boston-Basel-Berlin.
Edwards, H. M., 1992a, 'Mathematical Ideas, Ideals, and Ideology', The Mathematical
Intelligencer 14(2), 6-19.
Edwards, H. M., 1992b, 'Kronecker's Arithmetical Theory of Algebraic Quantities', Jahresbericht
der deutschen Mathematiker-Vereinigung 94, 130-139.
Edwards, H., Neumann, 0., and Purkert, W., 1982, 'Dedekinds 'Bunte Bemerkungen' zu
Kroneckers 'Grundzuge' " Archive for History of Exact Sciences 27, 49-85.
Faltings, G., 1983, 'Endlichkeitssiitze fiir abelsche Varietiiten iiber Zahlkorpern', Inventiones
Mathematicae 73, 349-366.
Faltings, G., 1984, 'Calculus on Arithmetic Surfaces', Annals of Mathematics 119, 387-424.
Friedman, H. M., Simpson, S. G. and Smith, R. L., 1983, 'Countable Algebra and Set Existence
Axioms', Annals of Pure and Applied Logic 25, 141-181.
Gauthier, Y., 1994, 'Hilbert and the Internal Logic of Mathematics', Synthese 101,1-14.
Gelbart, S., 1984, 'An elementary Introduction to the Langlands Program', Bulletin of the
American Mathematical Society 10, 177-219.
Gillet H. and Soule, C., 1992, 'An Arithmetic Riemann-Roch Theorem', Inventiones
Mathematicae 110, 473-543.
Gordan, P., 1893, 'Uber einem Satz von Hilbert', Mathematische Annalen 42, 132-142.
Hasse, H., 1926-27, Hohere Algebra I, II, Walter de Gruyter, Berlin, second edition: 1933,
third edition: 1951; English translation: 1954, Higher Algebra, Frederick Ungar, New York.
Hasse, H., 1930, 'Die moderne algebraische Methode', Jahresbericht der deutschen
Mathematiker-Vereinigung 39, 22-34; English translation: 1986, 'The Modern Algebraic
Method', The Mathematical Intelligencer 8(2), 18-23.
Hasse, H., 1949, Zahlentheorie, Springer, Berlin; English translation: 1980, Number Theory,
Springer, Berlin.
Hasse, H., 1967, 'History of Class Field Theory', in Cassels & Frohlich (1967), 266279.
Herbrand, J., 1931, 'Sur la tbeorie des groupes de decomposition, d'inertie et de ramification',
Journal de mathematiques pures et appliquees 10, 481-498.
Herbrand, J., 1932, 'Tbeorie arithmetique des corps de nombres de degre infini. I. Extensions
algebriques finies de corps infinis', Mathematische Annalen 106, 473-501.
Herbrand, J., 1933, 'Tbeorie arithmetique des corps de nombres de degre infini. II. Extensions
algebriques de degre infini', Mathematische Annalen 107, 699-717.
Hilbert, D., 1890, 'Uber die Theorie der algebraischen Formen', Mathematische Annalen 36,
473-534, reprint in Hilbert (1935), vol. 2, 199-257; English translation: 'On the theory of
Algebraic Forms', in Hilbert (1978), 143-224.
Hilbert, D., 1893, 'Uber die vollen Invariantensysteme', Mathematische Annalen 42, 313-373,
reprint in Hilbert (1935), vol. 2, 287-344; English translation: 'On the Complete systems
of Invariants', in Hilbert (1978), 225-302.
Hilbert, D., 1897, 'Die Theorie der algebraischen Zahlkorper', Jahresbericht der deutschen
Mathematiker-Vereinigung 4,175-546; reprint in Hilbert (1935), vol. 1,63-363.
Hilbert, D., 1900, 'Mathematische Probleme', Gottinger Nachrichten, 253-297; reprint in Hilbert
(1935), vol. 3, 290-329; English translation: 1902, 'Mathematical Problems' Bulletin of the
American Mathematical Society 8,437-479; reprint in 1976, Proceedings of Symposia in Pure
Mathematics 28, 1-34.
214
MA TRIEU MARION
Hilbert, D., 1922, 'Neubegriindung der Mathematik (Erste Mitteilung)', reprint in Hilbert (1935),
vol. 3, 157-177.
Hilbert, D., 1931, 'Die Grundlegung der elementaren Zahlenlehre', Mathematische Annalen
104, 485-494.
Hilbert, D., 1935, Gesammelte Abhandlungen, Springer, Berlin.
Hilbert, D., 1978, Hilbert's Invariant Theory Papers, M. Ackerman, R. Hermann (eds.), Math
Sci Press, Brookline Mass.
Hurwitz, A., 1895, 'Uber einen Fundamentalsatz der arithmetischen Theorie der algebraischen
GraBen', in: 1932, Mathematische Werke, Birkhiiuser, Basel, vol. 2, 198-207.
Jourdain, P. E. B., 1905, 'On the General Theory of Functions', Journal for reine und angewandte Mathematik 128, 169-210.
Kirk, G. S., Raven, J. E., and Schofield, M., 1983, The Presocratic Philosophers, second
edition, Cambridge University Press, Cambridge.
Kneebone, G. T., 1963, Mathematical Logic and the Foundations of Mathematics. An Introductory
Survey, Van Nostrand, Toronto/New York/Princeton.
Konig, J., 1903, Einleitung in die allgemeine Theorie der algebraischen Grossen, Teubner,
Leipzig.
Kronecker, L., 1853, 'Uber die algebraisch auflosbaren Gleichungen', in Kronecker (1899),
vol. 4, 27-37.
Kronecker, L., 1880, 'Auszug aus einem Briefe von L. Kronecker and R. Dedekind', in Kronecker
(1899), vol. 5, 453-457.
Kronecker, L., 1882, 'Grundziige einer arithmetischen Theorie der algebraischen Grossen', in
Kronecker (1899), vol. 2, 287-296.
Kronecker, L., 1886a, 'Uber einige Anwedungen der Modulsysteme auf elementare algebraische
Fragen', in Kronecker (1899), vol. 3, 147-208.
Kronecker, L., 1886b, 'Zur Theorie der Elliptischen Funktionen XI', in Kronecker (1899), vol.
4,389-470.
Kronecker, L., 1887a, 'Uber den Zahlbegriff', in Kronecker (1899), vol. 3,251-274.
Kronecker, L., 1887b, 'Einer Fundamentalsatz der allgemeinen Arithmetik', in Kronecker (1899),
vol. 3, 211-240.
Kronecker, L., 1899, Werke, 5 vols., K. Hensel (ed.), Teubner, Leipzig.
Kronecker, L. 1901, Vorlesungen iiber Zahlentheorie, K. Hensel (ed.), Teubner, Leipzig.
Kummer, E. E., 1851, 'Memoire sur la theorie des nombres complexes composes de racines de
l'unite et de nombres entiers', Journal de mathematiques pures et appliqU/?es 15, 377-498,
reprint in Kummer (1975), vol. 1,363-484.
Kummer, E. E., 1859, 'Uber die allgemeinen Reciprocitiitsgesetze den Resten und Nichtresten
der Potenzen, deren Grad eine Primzahl ist', Mathematische Abhandlungen der Koniglichen
Akademie der Wissenschaften zu Berlin, 19-159; reprint in Kummer (1975), vol. 1,699-838.
Kummer, E. E., 1975, Collected Papers, 2 vols., A. Weil (ed.), Springer, Berlin.
Lang, S., 1987, Elliptic Functions, second edition, Springer, Berlin.
Lang, S., 1988, Introduction to Arakelov Theory, Springer, Berlin.
Langlands, R. P., 1976, 'Some Contemporary Problems with Origins in the Jugendtraum',
Proceedings of Symposia in Mathematics 28, 401-418.
Lejeune-Dirichlet, P. G., 1837, 'Uber die Darstellung ganz wilkiirlicher Funktionen durch Sinusund Cosinusreihen', in P. G. Lejeune-Dirichlet (1969), Werke, L. Kronecker (ed.), reprint,
Chelsea, New York, 1969, vol 1, 135-160.
Lejeune-Dirichlet, P. G., 1879, Vorlesungen iiber Zahlentheorie, R. Dedekind (ed.), Vieweg &
Sohn, Brunswick.
Macaulay, F. S., 1916, The Algebraic Theory of Modular Systems, Cambridge University Press,
Cambridge.
Marion, M., Forthcoming (a), Wittgenstein, Finitism, and the Foundations of Mathematics, Oxford
University Press, Oxford.
Marion, M., Forthcoming (b), 'Wittgenstein and Ramsey on Identity', to appear in 1. Hintikka
215
MARIO BUNGE
* First published in Revista Brasileira de Fisica, volume especial os 70 anos de Mario Schonberg,
pp. 150-168 (1984).
217
M. Marion and R. S. Cohen (etis.), Quebec Studies in the Philosophy of Science I, 217-227.
1995 Kluwer Academic Publishers.
218
MARIO BUNGE
219
220
MARIO BUNGE
P(x, y) 2= P(x, z)
+ P(y, z).
On the other hand any hidden variables theory predicts the exact reversal of
the inequality sign:
HV
P(x, y) $ P(x, z)
+ P(y, z).
This is one of Bell's inequalities. Bell himself, and others, derived several other
inequalities that hold in all local hidden variables theories, and that involve
exclusively measurable quantities such as coincidence counter rates. (See
Clauser and Shimony [12] for a masterly review.)
Thanks to this theoretical work a number of ingenious high precision
measurements were designed. Some of them, involving photons, have been
performed. The latest version is that of Aspect [2, 3]. An atom emits two
photons in opposite directions at the same time. Their linear polarizations
are measured by polarizers oriented in directions that are adjustable at will.
The photons coming out of the polarizers activate photomultipliers the coincidence rate of which is monitored by a counter. The result is that the
polarization states of the two photons are highly correlated. That is, the state
of polarization measured by one of the polarizers depends upon that measured
by the other polarizer and conversely (Aspect [2]). Essentially the same result
obtains if the polarizers are rotated during the flight of the photons and before
221
they strike the polarizers, to prevent any communication between them through
signals propagating at a finite speed [3].
Although there is consensus that Bell's inequalities, and with them all of
the hidden variables theories, have been experimentally refuted, it is not clear
what is at issue: philosophical realism, determinism, the EPR dogma that all
properties have sharp values at all times, or the EPR hypothesis that distant
things behave independently of one another.
Most authors - e.g. Clauser and Shimony [12], d'Espagnat [13], Aspect
et al. [2] - claim that realism is the casualty, without however clarifying
what they mean by "realism", and ostensibly conflating philosophical realism
with what we have called classicism or the EPR dogma. Yet it should be
clear that the very design and performance of measurements presuppose the
reality (independent existence) of the entire measurement system, hence of
every component of it (object, apparatus, and experimenter). So, philosophical realism is definitely not at stake: the downfall of Bell's inequalities has
not refuted the principle that the physical world manages to exist without
the help of those who attempt to know it. Nor is determinism at issue, since
some hidden variables theories entailing Bell-type inequalities contain random
variables. So, only the EPR dogma and the separability (locality) hypotheses
are at issue. As a matter of fact this is the conclusion Clauser and Shimony
[12] reach after their careful analysis: Hidden variables (sharpness) & Locality
(separability) => Bel/'s inequalities. Hence, the experimental refutation of
the latter refutes either separability or the EPR dogma - or both. We shall argue
in a while that the or is inclusive, for the EPR dogma entails separability
(locality). But before doing this we must take another glance at the Aspect
experiment.
The experiment poses two problems. One is to ascertain whether the photons
emitted by the source possess all their properties, in particular polarization,
or acquire them when interacting with the polarizers. The second problem is
to account for the correlations at a distance, or EPR correlations, between
the results obtained with one of the analysers and those obtained with the other
- or, equivalently, to explain the coincidences registered by the coincidence
counter.
Let us start with the first problem. According to both classical and hidden
variables theories, the fragments resulting from the process at the source
have sharp properties (eigenvalues) from start to finish: the analysers only
exhibit what is already there. On the other hand, according to quantum theory
the entities leaving the source are in a superposition of polarization states. This
superposition collapses (reduces) to eigenstates through interaction of the
system with the analysers. In other words, the analysers do not just measure
an existing polarization but also produce it (In Pauli's terminology, they are
instruments of the second kind.) Clearly, this explanation will satisfy only
those who believe that accurate measurement is accompanied by a projection of the state function - even though one need not believe the dogmas
that this projection is subjective, instantaneous, and unrelated to the smooth
222
MARIO BUNGE
processes described by the Schr6dinger equation. (See Cini [14], Bunge and
Kalnay [15].)
The second problem is this: if only one of the analyzers is read, we may
infer the result obtained with the other analyzer without looking at it. (This
cannot be done with the apparatus designed by Aspect et al. [2, 3], which
registers only coincidences. The equivalent operation here is to alter the angle
between the analyzers and note the change in coincidence rate. The experiment shows that the coincidence rate attains its maximum when the two
analyzers are parallel.) This result is nicely explained by any hidden variables theory jointly with the theorem of conservation of the total spin. Indeed,
the first quantum object would arrive at the analyzer possessing a definite
spin value, say 1. so that the second must have spin -1, since the total spin
of the system at the source was O. The trouble with this explanation is that
it takes hidden variables for granted. Since experiment has refuted hidden variables theories we must try to explain the same results with the help of quantum
theory.
If the two quantum objects leaving the source were to become independent after a while - as they should according to classical physics and to EPR
- the result obtained with one of the polarizers should be independent of the
result obtained with the other. Thus it should be possible to observe, say, the
left quantum object with its spin pointing in the x-direction, and the right
one with its spin pointing in the z-direction. But this is not what quantum theory
predicts: according to this theory there is total spin conservation, so that if
the left polarizer projects the spin onto the positive x-axis, the right polarizer will project it onto the negative x-axis. In other words, there is a strong
correlation at a distance between the polarizations of the two former components of the original system. This correlation is correctly predicted by quantum
theory, which treats the "two" quantum objects as constituting a single entity.
Indeed, only the state function for the entire system satisfies the Schr6dinger
equation. (In other words, the superposition principle is not optional but mandatory in this case.)
The quantum-theoretical explanation of distant correlation is often found
unsatisfactory or even mysterious because it involves the counter-intuitive
("paradoxical") hypothesis that a complex system may continue to be one even
after its components have moved far apart. In fact all known classical forces
fall off quickly with increasing distance. For that reason a host of more or
less unorthodox explanations of the EPR distant correlation have been
proposed. Among them we note those in terms of hidden variables and a
peculiar quantum potential that does not occur in the hamiltonian but derives
from the state function of the system (Bohm and Hiley [16]); the occurrence
of actions at a distance - and even psychokinesis. We reject all three explanations for the following reasons.The first occurs in a hidden variables theory,
hence one containing some of Bell's false inequalities. The second violates
special relativity and electrodynamics (both classical and quantal), which
involve the principle of nearby action (or locality in the field-theoretic sense).
223
And the third violates the principle of conservation of energy, since every
message must ride on a physical process - unless, of course, one is prepared
to believe in the existence of non-physical things.
We need not look for any special mechanism explaining the distant or
EPR correlations, any more than we need to explain the Lorentz "contractions"
and "dilatations" in terms of mechanisms (Paty [17]). Quantum mechanics
accounts for distant correlations as follows. If two quantum objects are initially independent from one another, then the state function of the aggregate
they constitute is correctly represented by the product of their individual
stage functions. So, separability or "locality" obtains in this case just as in classical physics. But if two quantum objects have initially been part of a system,
i.e., if they have interacted strongly at the beginning, then the state function
of the whole cannot be so factorized even after the components have moved
far apart. In other words, the state of each component is determined not only
by the local conditions (i.e. the state of the immediate surroundings of the
spatially separated quantum objects), but also by their still belonging to a
system.
So, although physical separation entails spatial separation, the converse is
false. Quantum theory is then non-local in this peculiar sense or, as we prefer
to say, it is systemic in that it does not incorporate the classical principle
that widely separated things cannot belong to the same system. (Note again
the ambiguity of the word "local". In field physics, whether classical or quantal,
"locality" means that nearby action, or non-action-at-a-distance, obtains: all
actions are assumed to proceed de proche en proche. Quantum theory is local
in this traditional sense of the word.)
We submit that the non-separability or (non-"locality") inherent in quantum
theory is a consequence of the superposition principle together with the
Schrodinger equation. In fact, once a system is in a state consisting of an
inextricable merger (not just either a sum or a product but a sum of products)
of the states of its components, the Schrodinger equation guarantees that the
system will evolve in such a way that its state function will retain that structural property even though the relative contribution of every individual
eigenstate is likely to change in the course of time. (In the case of the EPR
experiment, the total system I + 2 is in some such state as 2- 112 (U1V 1 + U 2V 2)
or 2- 112 (U1V 1 - U 2V 2 ). If the u's and v's are spin eigenstates, the preceding
state functions are distance-independent: the components continue to be entangled no matter how far apart they move.)
The EPR paradox is such only if one presuppose the classical principle
that the behavior of things that are far apart cannot be correlated. (See Einstein
[18] and Bell [11] for statements and discussions of this classical principle
of "locality" or separability.) The physicist who takes this principle for granted
is bound to puzzle over how it could be possible for two things to continue
interacting after they have stopped being components of system. Quantum
theory solves the "mystery" much in the same way as a detective solves all
alleged murder cases by proving that there was no murder to begin with:
224
MARIO BUNGE
that the system was not dismantled. that the distant components are still part
of the original system, that the "wave" function has not split into two parts
but continues to cover the entire system.
What the quantum theorist may puzzle over is a different question, namely:
How, that being the case. could one effectively dismantle the system, and
thus factorize its state function. We submit that the answer to this query is:
The original system becomes dismantled only when at least one of its original
components gets integrated into another system - e.g. when it is captured or
absorbed by an atom.
Philosophers have been quick to exploit the downfall of Bell's inequalities. Thus Fine [19] has interpreted it as a refutation of determinism. and
van Fraassen [20] as a refutation of philosophical realism. We have seen that
it is neither. In fact. some hidden variables theories, such as stochastic quantum
mechanics. are non-deterministic in that they contain random variables. (See
additional criticism by Nordin [21] and Eberhard [22].) As for philosophical
realism, it would indeed be dead if, as van Fraassen contends. it involved
the hypothesis that there is a causal mechanism underlying every correlation, since no such mechanism is known to exist in the case of the EPR
distant correlations. But such a version of philosophical realism is a straw man.
Philosophical realism happens to be an epistemological, not an ontological.
view, and it boils down to the thesis that nature exists even if it is neither
perceived nor conceived. (If in doubt consult any philosophical dictionary.)
Moreover, philosophical realism is compatible with alternative ontologies, in
particular neodeterminism, which acknowledges non-causal (in particular probabilistic) types of determination (Bunge [23]).
If anything, the experimental refutation of Bell's inequalities, like any
other experiment, has confirmed philosophical realism once more, since every
well designed experiment involves a clear distinction between knowing subject
(in particular experimenter), apparatus, and object of knowledge. (See Wheeler
[24], Rohrlich [25], and Bunge [26].) Only "realism" in the idiosyncratic sense
of EPR has been refuted along with Bell's inequalities - but that was only a
classicist dogma. Actually this dogma went down the moment quantum theory
succeeded in solving the problems that classical physics had left unsolved.
Since then we have learned that reality is smudged rather than neat: that
every quantum object possesses some properties that, far from having sharp
values all the time, have distributions of values; and that, unlike rigid bodies
but rather like fluids and fields, quantum objects are extremely sensitive to
their environments, whether natural or artificial. This discovery did not alter
nature and, in particular, it did not enslave nature to the supposedly omnipotent Observer. It only taught us that nature is composed not only of classical
objects but also of quantum and semiquantum objects. It altered our representation of the world, not the world itself; it was a scientific revolution, not
a cosmic cataclysm.
Hidden variables theorizing remained outside the mainstream of physics
until Bell and others succeeded in deriving formulas capable of being subjected
225
to rather direct crucial experimental tests. Then it came briefly to the fore,
and it may soon become little more than a historical curiosity. However, the
failure of hidden variables theories has taught us a valuable lesson: that, like
other revolutionary theories before, quantum theory has got to be understood
in its own terms. (See Bunge [8, 9].) Besides, hidden variable theorists have
rendered a valuable service to the scientific and philosophical communities.
Firstly, they have exhibited some of the inconsistencies and obscurities of
the Copenhagen orthodoxy. (Unfortunately they have often confused the issues
- e.g. the problem of objectivity with that of determinism: see Bunge [27].)
Secondly, they have helped others remove those inconsistencies and obscurities by reinterpreting the standard mathematical formalism of quantum theory
in a strictly physical, hence realistic, fashion. Thirdly, they suggested the
very experiments that were to refute classicism.
Now that the hidden variables episode seems to be closed for good we
can look forward to theories even less intuitive than quantum theory. One
wonders whether Einstein was right in believing that God (sive natura) is
not malicious.
ACKNOWLEDGEMENTS
I thank Andres Kitlnay (Secci6n Fisica, IVIC, Caracas) and Maria E. Burgos
(Depto. de Fisica, Universidad de Merida, Merida) for many long and enlightening discussions on EPR and related matters.
APPENDIX. THE EPR "PARADOX"
Let us recall and examine the original EPR "paradox" [I]. Consider a complex system, such
as a molecule, that breaks up into two components that fly off in opposite directions. Call P =
PI + P2 the total linear momentum of the system - a quantity that is conserved - and x - XI X2 the relative distance between its components. Since PI + P2 commutes with XI - X 2, both are
definite (have sharp values) and can be measured simultaneously with as much accuracy as the
state of the art will allow. At all times after break-up, the system is said to be in a state represented by
where 3 is Dirac's delta, and (XI + x2)/2 is the position coordinate of the center of mass of the
system.
By measuring XI we can infer (without measurements) the exact position of component 2,
given by X2 - XI - a. It would seem that component 2 "knows" what its place is even though
no information about XI has reached it. Alternatively, if PI is measured we can infer the exact
momentum of component P2' given by P2 = P - PI' In both cases it seems clear that the property
of the component that is being inferred (not measured directly) exists without measurement which contradicts the Copenhagen doctrine. Moreover, the information on the position (or the
momentum) of component 2 is not included in the state function (I). Therefore, according to
EPR, quantum mechanics is incomplete.
I do not profess to understand the criticisms leveled by Bohr [28, 29] - except that they
presuppose the very operationist interpretation that Einstein and his coworkers were criticizing.
It seems to me, on the other hand, that the EPR paper can be criticized as follows.
226
MARIO BUNGE
Firstly, the "wave" function (1) does not describe correctly the system in question because
it is not a possible eigenfunction of any operator. The
delta must be replaced with a well-behaved (bounded and square integrable) function sharply
peaked at XI - X2 = a, symmetric about a, and with a nonvanishing width (standard deviation)
!J.x. But this amounts to saying that the relative distance X - XI - X2 has an objective spread or
indeterminacy !J.x. Hence, measuring XI exactly does not allow us to infer X2 with certainty but
only to within !J.x. Therefore Heisenberg's inequalities are not violated. (Dirac [30] conjectured that the infinite length of the state vectors corresponding to sharp position eigenvalues
"is connected with their unrealizability". As a matter of fact nobody has measured a sharp position
value: all one can do is to locate a quantum object within a small though nonvanishing region
of space of, say, vi in linear dimension.)
Secondly, although XI - X2 and PI + P2 commute, in practice they cannot be measured accurately at the same time. In fact, measuring XI - X2 involves localizing component I at XI and
component 2 at X2' But by so doing both PI and P2 get smudged according to Heisenberg's inequalities. Consequently the total momentum PI + P2 is not conserved while measuring XI - X2' The
classical explanation (adopted by Bohr [29]) is that there are exchanges of momentum PI and
P2 between the quantum objects and the measuring instrument. The quantum-theoretical explanation is that the quantum objects are thrown into superpositions of momentum states.
In our view either of the above objections disposes of the EPR claim that quantum mechanics
is incomplete. On the other hand neither touches on the real crux of the matter, namely the
inseparability inherent in quantum theory - which Schrooinger [31] regarded as "the characteristic trait of quantum mechanics".
McGill University
REFERENCES
1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.
15.
16.
17.
18.
19.
20.
21.
22.
23.
Einstein, A., Podolsky, B., and Rosen, N., 1935, Phys. Rev. 47, 777.
Aspect, A., Grangier, P., and Roger, G., 1981, Phys. Rev. Lett. 47, 460.
Aspect, A., Dalibard, J., and Roger, G., 1982, Phys. Rev. Lett. 49, 1804.
Bohm, D., 1957, Causality and Chance in Modern Physics, Routledge & Kegan Paul,
London.
Bohm, D., 1952, Phys. Rev. 85, 166-180.
de la Pena-Auerbach, L., 1969, J. Math. Phys. 10, 1620.
Claverie, P. and Diner, S., 1977, Intern. J. Quantum Chemistry XX, Suppl. 1,41.
Bunge, M., 1967, Foundations of Physics, Springer, New York.
Bunge, M., 1973, Philosophy of Physics, Reidel, Dordrecht. French translation 1975,
Philosophie de la physique, Seuil, Paris.
Bell, J., 1964, Physics I, 195.
Bell, J., 1966, Rev. Mod. Phys. 38, 447.
Clauser J. F. and Shimony, A., 1978, Rep. Prog. Phys. 41, 1881.
d'Espagnat, B., 1979, Sci. Amer. 241(5), 158.
Cini, M., 1983, Nuovo Cim. 73B, 27.
Bunge, M. and Kalnay, A., 1983, Nuovo Cim. 77, I.
Bohm, D. and Hiley, B., 1975, Founds. Phys. 5, 93.
Paty, M., Centre de Rechrches Nucleaires de Strasbourg HE, 81-10.
Einstein, A., 1948, Dialectica 2, 320.
Fine, A., 1982, Phys. Rev. Lett. 48, 291.
van Fraassen, B., 1982, Synthese 52,25.
Nordin, I., 1979, Synthese 42,71.
Eberhard, P. H., 1982, Phys. Rev. Lett. 49, 1474.
Bunge, M., 1959, Causality, Harvard University Press, Cambridge. Rev. ed.: New York,
Dover, 1979.
227
24. Wheeler, J. A., in R. G. Jahn, ed., The Role of Consciousness in the Physical World,
Westview Press, Boulder, Colo.
25. Rohrlich, F., 1983, Science 221, 1251.
26. Bunge, M., 1983, Treatise on Basic Philosophy 6, Understanding the World, Reidel, Boston.
27. Bunge, M., 1979, Lecture Notes in Phys. 100,204.
28. Bohr, N., 1935, Phys. Rev. 48,696.
29. Bohr, N., 1949, in P. A. Schilpp, ed., Albert Einstein: Philosopher-Scientist, The Library
of Living Philosophers, Evanston, Ill.
30. Dirac, P. A. M., The Principles of Quantum Mechanics, 4th ed., Clarendon Press, Oxford.
31. Schrooinger, E., 1935, Proc. Cambridge Phil. Soc. 31, 555.
STORRS McCALL
In his 1935 axiomatization of quantum mechanics, von Neumann postulated two very different types of change which quantum systems undergo:
Schrodinger evolution and state vector reduction. Since that time, many
attempts have been made to reconstruct or to reaxiomatize quantum theory
so as to eliminate reduction or collapse and retain unitary evolution as the
sole principle of change.
In this paper an interpretation of quantum mechanics, called the "branched"
interpretation, is proposed which (i) accommodates both of von Neumann's
evolutionary processes, (ii) provides a physical model of state vector reduction, and (iii) is non-local in a precise sense to be explained below. The
branched interpretation is discussed in chapter 4 of McCall (1994).
The theory which the branched interpretation resembles most closely is
Everett's, but at the same time there are deep and fundamental differences
between the two. As Everett presented it, his interpretation did not envisage
a literal spatio-temporal splitting of the world on measurement, but a branching
only of the observer's state, with the result that each element of the superposition of states which describes the observer finds itself on a separate
branch (Everett, 1957, p. 146). Although Healey (1984) cautions against it, the
interpretation is generally understood today as involving full-blooded spatiotemporal branching. DeWitt describes the theory as the many-worlds or
"many-universes" interpretation, and speaks of the world as "constantly splitting into a stupendous number of branches" (1972, p. 178).
The branched interpretation of this paper differs from the many-worlds interpretation in the following respects.
(1) In the many-worlds interpretation there are multiple present states of the
world, reSUlting from past measurement interactions. In the branched interpretation, the present and past are unique, and only the future is branched.
(2) In the many-worlds interpretation new branches are continually being
created by measurement-like interactions (DeWitt, 1970, p. 161), whereas
in the branched interpretation all branches have existed since the creation
of the universe. The flow of time is objectively modelled in the branched
interpretation as the progressive vanishing of all branches but one at the first
branch point to reveal a unique past. In this interpretation, the universe has
the form of a tree which loses branches to reveal a progressively lengthening
trunk.
(3) The many-worlds interpretation has difficulty interpreting probabilities of quantum events. In the branched interpretation the probability of a future
event is given by the proportion of branches on which the event in question
occurs. Since branch selection is random, probability = proportionality.
229
M. Marion andR. S. Cohen (elis.), Quebec Studies in the Philosophy of Science 1.229-241.
1995 Kluwer Academic Publishers.
230
STORRS McCALL
(4) The branched interpretation accounts for quantum entanglement, and for
the distant correlations of measurements performed at different locations
upon an entangled system, by the fact that two different measurement outcomes
may occur on the same branch, or on a fixed proportion of branches, no
matter how far apart the measurements may have been made. Consequently,
the joint probability of two distant events may not equal the product of the
probabilities of the events taken separately, and in that case it may appear as
if the occurrence of one event influences the occurrence of the other. This nonlocal character of the interpretation arises from the fact that all splitting of
branches is along space-like hyperplanes.
(5) The many-worlds interpretation was specifically devised to eliminate
the need for von Neumann's projection postulate. It will appear to each
observer on each branch that the measured system has "jumped" into an eigenstate of the measured observable, but in fact all branches corresponding to
the initial superposition are equally actual and no jump has taken place (Everett,
1957, p. 146). In the branched interpretation on the other hand, one and only
one branch is selected as actual and state vector reduction occurs when branch
attrition takes place within a specific kind of branched structure. This is the
physical feature which models "collapse".
In what follows these points of difference between the branched and the
many-worlds interpretation will be enlarged upon and made explicit. Before
starting, it might be asked in what sense the word "interpretation" is being
used. To this day, no agreement on a correct interpretation of quantum
mechanics exists. In particular, the principle of superposition poses severe
difficulties in this regard. Everett attempted to explain why observers are never
in fuzzy superpositions of observational states by postulating that every element
of such a superposition is actualized on a separate branch. Those who, following von Neumann, attempt to avoid superpositions of observational states
by positing "reduction" or "collapse", still have the problem of interpreting
"collapse" itself, i.e., specifying what kind of process it is. The aim of the
branched interpretation is to provide what Einstein sought for and was never
prepared to abandon, namely "the idea of direct representation of physical
reality in space and time" (Einstein, 1940, quoted in Fine, 1986, p. 97). In
its branching space-time structure the interpretation gives one possible answer
(c. Healey, 1989, p. 6 and Hughes, 1989, p. 296) to the question, "What
would the world have to be like, in order for quantum mechanics to be
true?"
(1) Shape of the Model. Roughly and broadly, the many-worlds interpretation in its branching space-time version envisages a model in the shape of
a bush, with no distinguished branch and with multiple present moments,
each containing perfect or slightly imperfect copies of oneself and other
existing individuals. In the branched interpretation by contrast there is only
one present and one past, and all branching lies in the future. The model is
tree-like, not bush-like. Every path through the tree is a complete four dimensional space-time manifold, and all branching is along space-like hyperplanes
231
232
STORRS McCALL
branch on which I do not observe its decay. The concept of probability plays
no role here, since what will happen in the future is certain. (C. Albert and
Loewer, 1988, p. 201.) In the branched interpretation on the other hand
the notion of probability is central, and the interpretation provides not
only a physical meaning, but in addition a precise value, to all quantum
probabilities.
In the branched interpretation the probability of any future event, including
quantum events, is given by the proportion of branches on which the event
in question is located. For example, suppose that a vertically-polarized photon
is entering a two-channel polarizer which is inclined at an angle <I> to the
vertical. This polarizer has two exit channels, <1>+ and cI>-, and quantum theory
tells us that the probabilities of the photon exiting in the <1>+ and cI>- channels
are cos 2 <I> and sin2 <I> respectively.' In the branched model there are two kinds
of branch above the space-time point A where the photon enters the polarizer, and the proportionality of <1>+ branches to cI>- branches is cos 2 <I> to sin2 <I>
(see Figure 1).
Fig. 1.
There is a problem here, as was first pointed out to the author by Bas van
Fraassen, and the problem stems from the fact that in general cos 2 <I> and
sin 2 <I> will not be rational numbers. If they always took rational values nlm
then finite branching at point A would serve to represent these values, but if
cos 2 <I> is irrational then infinite branching at A would seem to be required,
and Cantor showed long ago that there are no fixed proportionalities in infinite
sets. (Consider for example, what the proportion is of even numbers in the
set of natural numbers.) It would appear that if the set of branches at A were
infinite, no means would exist of designating exactly cos2 <I> of them as <1>+
branches.
There is however a way in which an infinite set of branches can contain
subsets of fixed proportionality corresponding to any real number between 0
and 1, despite what Cantor says. In the branched interpretation there is at
point A a highly ramified branched structure of a certain specific kind. This
structure is of temporal height At, where I1t can be as small as we please.
The branched structure, which we shall call a "decenary tree", takes the following form. At time t, when the photon first enters the polarizer, the decenary
tree branches in 10, and each branch branches in 10 at each of the subse-
233
234
STORRS McCALL
only very few possible future events to become actual. Where pee) is the
probability for e to become actual, the value of pee) is given by branch
proportionality.
(4) Non-Locality. Suppose, in a version of the EPR experiment which
measures the plane polarization of two photons in a single state, that the
left measuring device is a two-channel horizontal-vertical polarizer, while
on the right there is a similar polarizer set at an angle <I> to the vertical (see
Figure 2).
Fig. 2.
In this experimental set-up there are four different possible joint outcomes,
namely (v, <1>+), (v, <j)-), (h, <1>+), (h, <j)-), with probabilities 112 sin2 <1>, 112 cos 2 <1>,
112 cos 2 <I> and 112 sin 2 <I> respectively. In the branched interpretation there
will be four different kinds of branch, and the probabilities of the joint
outcomes on left and right are given by the proportionality of the respective
branch sets. Denoting the event consisting of the left photon entering its
polarizer by A, and the corresponding event for the right photon by B, the
branched model for the frame of reference in which A and B are simultaneous takes the form as shown in Figure 3.
,v
Fig. 3.
Except for the case where <I> = 45, which yields p(v, <1>+) = p(v, <j)-) = p(h,<I>+)
p(h, <j)-) = 114, it is apparent on inspection of this diagram that the outcomes
of the measurements at A and B cannot be statistically independent. The condition for the two outcomes v on the left and <1>+ on the right to be independent
is that p(v, <1>+) = p(v) x p(<I>+). In the terminology of Fine (1980, p. 536),
the two outcomes are independent if the probability of the joint outcome
(v, <1>+) is "factorizable" into the product of the probability of v with the
probability of <1>+. But except where <I> = 45 this equality fails. For example
if <I> = 30, p(v, <1>+) = 112 sin 2 <I> = 118 and p(v, <j)-) 112 cos 2 <I> = 3/8. Then
235
p(v) = p(V, <1>+) + p(v, <1>-) = 118 + 3/8 = 112; p(<I>+) = p(v, <1>+) + p(h, <1>+) =
118 + 3/8 = 112; and p(v) x p(<I>+) = 114 "# p(v, <1>+). The two measurements at A and B are space-like separated, but their outcomes are not
independent.
The existence of a statistical correlation between the separate measurements
performed at A and B would not be puzzling if a common-cause explanation
or a local hidden variables explanation of the correlation could be given. But
Bell's theorem rules this out. A non-local hidden variables explanation would
presumably involve the transfer of information between the two polarizers at
faster-than-light velocities, and such transfers contradict relativity theory. At
present we seem to be left with no explanation at all of the distant correlations revealed by the EPR experiment.
One possible reaction to this situation would be simply to accept the correlations as brute fact, and forego any attempt to provide an explanation of
them. This course of action is proposed in Fine (1989). However, an explanation of the non-local reciprocal influence exhibited by the measurements
performed at A and B can be extracted from the branched interpretation. The
kind of explanation that the interpretation provides emerges most clearly if
we consider a frame of reference in which the left measurement of the EPR
experiment precedes the right measurement.
Assume that the left polarizer is positioned close to the source on earth,
and the right polarizer on the moon. The left photon passes through its polarizer and is measured v. Here it is natural to take a frame of reference in
which the measurement event A on earth occurs before B. Given that the left
photon has been observed to be vertically polarized, any subsequent measurement on the right photon will reveal it to be polarized horizontally. If it
is passed through a <I> polarizer it will have probability sin 2 <I> of being
measured <1>+. If it is passed through an HV polarizer it will invariably be
measured h. In every conceivable respect, the right photon behaves exactly
as if it had the property of being horizontally polarized. The question is,
when did it acquire this property? Consider the following three questions,
derived from Mermin (1985, pp. 46--47):
(i) Did the right photon have the property of being horizontally polarized
prior to the measurement performed on the left photon? The answer would
seem to be no. If for example the left photon had not been measured v, but
had emerged from its polarizer in the h channel, then the right photon would
not have had the property of being horizontally polarized, but would instead
have been vertically polarized. If at the last moment the left polarizer is
removed, so that no measurement at all is performed on the left photon, then
tests on the right photon reveal no consistent angle of polarization. The photon
passes + or - through polarizers at any angle with equal frequency. All the
evidence suggests that the right photon did not have the property of being
horizontally polarized before the left measurement was performed.
(ii) Did the right photon have the property of horizontal polarization after
the measurement on earth? Yes. No matter what tests are performed on the
236
STORRS McCALL
right photon after the left measurement, it will behave like a horizontally polarized photon.
(iii) If the right photon did not possess the property of being horizontally
polarized before the left measurement, but did possess it afterward, how did
it acquire the property, given that it was then travelling towards the moon?
Alternatively, how did it receive instructions (as it apparently did) about how
to behave when it encountered the right polarizer?
In the branched interpretation, question (iii) receives an answer that is
both simple and natural. It is this. The measurement performed on the left
photon has two possible outcomes, represented on the branched model by
two sets of branches. On one of these sets the left photon is measured as
being vertically polarized, and on all the branches which belong to this set
the right photon behaves like a horizontally polarized photon. For example,
its probability of passing $+ on the moon is sin2 $. But on the other set,
where the left photon is horizontally polarized, the right photon behaves
as if it were vertically polarized. Its probability of passing $+ is cos 2 $, not
sin 2 $. In a frame of reference in which the measurement event A precedes B
the branched model takes the form as shown in Figure 4.
"2
""
s
Fig. 4.
At the instant the left photon passes through its polarizer, one and only
one of the branches which divide along the hyperplane containing A is selected,
and the rest vanish. If the surviving branch is a v-branch, the right photon
behaves exactly as if it were horizontally polarized - on that branch, in fact,
it is horizontally polarized. If the surviving branch is an h-branch, then the
right photon is vertically polarized. Branch attrition ensures that one and
only one branch is selected, and hence that one and only one of these joint
outcomes is realized. Through the mechanism of branch selection, a measurement performed on earth can instantaneously bestow a property on a photon
which is travelling to the moon.
Suppose now that the experimental arrangement on earth contains an ultrasonic switch, as in the latest Aspect version of the EPR experiment, so that
237
at the last moment the left photon can be diverted to a $ polarizer instead
of an HV polarizer. Here again, if the left photon were measured $+, the
property of being polarized in the $- direction would be instantaneously
bestowed on the right photon, and the proportion of branches on which it
was measured $+ on the moon would be zero. The branched diagram for this
experimental arrangement would be a little more complicated in that there
would be four types of branch for the left outcomes, (v, h, $+ and t- instead
of just v and h), but the principle would be the same.
The branched interpretation, then, can explain how it is that a measurement performed on earth can instantaneously affect the outcome of a
measurement performed on the moon, without there being anything which
"travels" from one location to the other. Since the two measurement events
A and B are space-like separated, there will also be a frame of reference in
which the measurement on the moon precedes that on earth. A branched model
constructed according to this coordinate frame will show how the moon measurement can instantaneously affect the earth measurement. The three different
models, in which A is respectively simultaneous with, earlier than, or later than
B, are all relativistic variants of one another - i.e. all pictures of one and the
same set of events in different coordinate frames. The fact of there being equivalent frame-dependent models of the same underlying spatio-temporal reality
is what provides an explanation of the unique reciprocal influence exerted
on one another by the left and right measurements in the EPR experiment. 1
In their (1978, p. 1883), Clauser and Shimony state that the experimental
violation of Bell's inequality would appear to entail that in physics either
the thesis of realism or that of locality must be abandoned. They remark that
"Either choice will dramatically change our concepts of reality and of spacetime". In the branched interpretation it is locality, not realism, that is sacrificed.
The branched model is completely realistic, and the fact that two or more
distant events can be linked by their co-presence (or co-absence) on branches
which separate along space-like hyperplanes means that non-locality is one
of the model's central features.
(5) Reduction of the State-Vector. In the EPR experiment with two photons,
measurement of the angle of polarization of one photon instantaneously affects
measurement of the other. Thus if the outcome of the first measurement on
earth is v, the probability of the outcome $+ on the moon is sin2 $, whereas
if the first outcome had been h, the probability of the second outcome would
have been different. This is, for friends of "collapse", a paradigm case of
state vector reduction. At the instant at which the first measurement is made,
the state vector is reduced (relative of course to a coordinate frame, which
determines which measurement is "first"). This instantaneous collapse is
modelled in the branched interpretation by branch attrition. Since future
branches vanish instantaneously along the entire length of the hyperplane at
which they intersect the actual world, branch attrition, i.e. state vector reduction, is necessarily a non-local phenomenon. In the branched model, it is
both non-local and relativistic.
238
STORRS McCALL
U-Type
U-Type
-w
w -w w -w
w -w
R-Type
Fig. 5.
239
240
STORRS McCALL
p. 380) will it be faced with an R-type stack. There is, in sum, no "democracy of bases" in the branched interpretation, but at each point every physical
system is faced by a specific U-type or R-type prism stack for each dynamical variable which characterizes it.
Finally, the branched interpretation differs from the many-worlds interpretation in making Heisenberg's "transition from potentiality to actuality" one
of its central features. (Heisenberg, 1958, pp. 54-55.). The multiplicity of future
branches represents what is potential, the single trunk represents what is actual,
and branch attrition represents the transition. (C. the "selection process"
referred to in Stapp, 1991). In connection with the idea of the passage from
potentiality to actuality, Howard Stein cautions us that such an idea, although
attractive in itself, "should be entertained only with the reservation that we
do not yet know of any clear case that can be characterized as such a passage"
(Stein, 1982, p. 576). It is suggested in this paper that the vanishing of branches
in the branched interpretation is the "clear case" to which Stein alludes.
McGill University
NOTE
I
In the same way, the branched interpretation accounts for correlations in the spin measurements on entangled three-particle systems in the GHZ thought-experiment (see Greenberger,
Home, Shimony and Zeilinger, 1990, Mermin, 1990, and Clifton, Redhead and Butterfield, 1991).
Suppose that a system of three spin-I12 particles is emitted in the state 'If = I1..J2(11, I, I) 1-1, -I, -I, where I or -I denotes spin-up or spin-down in the direction of propagation, i.e.
along the z-axis of each particle. If measurements are made of the x-component of spin of each
particle at different locations,then quantum theory predicts that the product of such spin measurements will always equal -1. As a consequence, if one particle is observed to be spin-up,
then this measurement will constrain the x-spins of the other two particles to be different, whereas
if it is observed to be spin-down, then the x-spins of the other particles will be the same. The
answer to the question, how can a measurement of one particle's spin instantaneously affect
the outcomes of measurements performed on the others, is the same as in the EPR experiment.
In the branched interpretation there are no branches with the joint outcomes (1, I, 1), (1, -1 , -I),
(-1, I, -1) or (-I, -I, 1) for measurements of the x-component of spin of the three particles.
The lack of branches of these four kinds, combined with branch attrition, is what produces the
instantaneous non-local influence of one measurement upon the others. For example, since
there are no (I, -I, -1) branches, if the first particle is measured 1, then if the second particle
is measured -1, the third particle will perforce be measured 1.
REFERENCES
Albert, D. and Loewer, B., 1988, 'Interpreting the Many-Worlds Interpretation', Synthese 77,
195-213.
Albert, D. and Loewer, B., 1990, 'Wanted Dead or Alive: Two Attempts to Solve SchrOdinger's
Paradox', PSA 1990, 1,277-285.
Clifton, R. K., Redhead, M. L. G., and Butterfield, J. N., 1991, 'Generalization of the GreenbergerHome-Zeilinger Algebraic Proof of Nonlocality', Foundations of PhYSics 21, 149-184.
Clauser, J. F. and Shimony, A., 1978, 'Bell's Theorem: Experimental Tests and Implications',
Reports on Progress in Physics 44, 1881-1927.
241
MICHEL J. BLAIS
INTRODUCTION
Given such a deterministic outlook on the physical universe, some philosophers have argued that human freedom can exist only if humans are not
themselves (or at least, if their spirits or souls or minds are not) subject to
the same laws that govern other material bodies of the universe. This view,
known as incompatibilism, requires that human beings be in some way able
to act for other reasons than purely physical, determinist ones; that freedom
is incompatible with determinism. Incompatibilism does not require that there
be uncaused events, only that voluntarily caused events not be subject to
physical law. And so, a distinction is often drawn between event-causation and
agent-causation. 2
The opposite view - compatibilism, as its name implies - considers freedom
to be compatible with determinism. David Hume is a typical compatibilist
243
M. Marion and R. S. Cohen (eds.), Quebec Studies in the Philosophy oj Science I, 243-258.
1995 Kluwer Academic Publishers.
244
MICHEL J. BLAIS
philosopher; rather than deal with the ontological reality behind the terms,
he dissected various uses of them in order to show that in one, legitimate,
sense, we are quite free and our actions are not deterministically necessitated, while in another, illegitimate, sense, we are not and cannot be free.
The legitimate sense of 'free' that Hume recognizes is that usage whereby
one is said to be free if one's action is guided by nothing else but internal
reasons. As long as one's actions are not externally coerced, one can correctly be said to be free, even if the internal motivations themselves are
determined. 3 To act freely in Hume's sense means only to act without the influence of any external deterministic cause, and does not require the absolute
absence of deterministic cause. On the contrary, some form of determinism
is required in Hume's view, for otherwise one cannot be said to have done
an act, in the sense of being responsible for it. It is thus possible to be both
free and determined, according to Hume; indeed, the latter is required for
the former to be possible.
For an example of an incompatibilist viewpoint, one can turn to a contemporary figure, Peter van Inwagen, whose construal of freedom is such
that the free individual must be able in some way to act in independence of
physically determined law or to change the course of physically determined
events. For the incompatibilist, Hume's analysis of freedom is tantamount to
saying that it is an illusion, for a Humean free act, although free from external
coercion, is nonetheless internally determined and so not really free from
the incompatibilist's point of view. 4
DETERMINISM
Both the compatibilist and the incompatibilist require that the universe be determined in the Laplacian sense. Both need determinism, but for different reasons.
Hume needs it for the agent to be responsible. Van Inwagen needs it as an
essential feature that differentiates the free agent from the rest of the physically determined world. Both regard the world as determined, and determinism
in both cases is construed as necessarily implying predictability. Hume's determinism, of course, is grounded a posteriori on experienced constant
conjunction, and he refuses to countenance any ontological reality behind
this perceived conjunction.
In van Inwagen's well-known proposal, determinism is analyzed as the conjunction of two theses:
(a) For every instant of time, there is a proposition that expresses the state of the world at that
instant.
(b) If A and B are any propositions that express the state of the world at some instants, then
the conjunction of A with the laws of physics entails B.5
But the laws of physics (the natural laws of the universe) do not include as
a subset the voluntaristic laws governing the behavior of rational agents; otherwise, one could not claim that rational agents are free, since determinism as
245
defined above excludes any deviation from physical law, and this conception of physical law implies strict (albeit theoretical only, in many cases)
predictability. Also, if a rational agent can exercise his or her will and bring
it about that some proposition P is not true, then P can't be a law of physics,
by definition of 'law of physics,' for laws of physics are stipulated to be
necessarily true - in some non-logical sense of the word - and no agent is
supposed to be able to render a law of nature false. Van Inwagen construes
a law of nature ontologically rather than epistemically; although our knowledge of laws of nature may be imperfect and subject to subsequent revision,
a genuine law of nature is immutable, else it simply is not a law at all. Also,
his view of determinism is quite Laplacian; where Po is the proposition that
expresses the state of the world at To, P is the proposition that expresses the
state of the world at T, and L is the conjunction into a single proposition of
all laws of physics, "[i]f determinism is true, then the conjunction of Po and
L entails p.,,6
Just what does determinism entail? Laplacian determinism implies perfect
reversibility and predictability such that, given the state of a system at time
To and the causal physical laws in question, the state of the system at any
later time T+n (or any earlier time Ln) could be effectively calculated. Any
deviation from calculated results should be attributed to ever refinable
measurement errors, it being assumed that the world is ontologically and
thus rigidly determined. Obviously a small error in determining the initial
conditions can lead to a corresponding error in the resulting calculation; but
the resulting error is supposed to always be roughly commensurate to the initial
error and, more importantly, initial measurement error is supposed to be at least
theoretically eliminable, thereby eliminating the resulting error (see Figure 6
(left), below). Such a deterministic system is considered to be so constrained
by the initial conditions and the inflexible rules governing its evolution that,
were it possible to reverse the flow of time, the system would return to its
exact starting point. With the advent of thermodynamics and the recognition
of the inexorable flow of entropy, reversibility in many deterministic systems
is no longer considered a realistic supposition. But predictability remains to
this day securely linked to determinism, as one can readily see in van Inwagen's
aforementioned proposal.
DETERMINISTIC CHAOS
Traditionally, determinism and chaos have been set in diametrical opposition, for determinism seemed to necessarily imply predictability, whereas chaos
seemed to imply the exact opposite: impossibility of prediction of future
states (and impossibility of retrodiction of past states). Also, the absence of
predictability usually signaled the possible presence of chaotic (random)
fluctuations totally ungoverned by law. The "new science of chaos" has contributed much to the deeper understanding of determinism, of chaos and of
predictability. 7
246
MICHEL 1. BLAIS
Some types of unpredictability are due to random motions that are in effect
unknowable and thus unpredictable: such is the case for Brownian motion
where the progress of a microscopic mote is the result of innumerable collisions on the atomic level; not only is it in practice impossible to take into
account each and every molecule that collides with the mote, it is in principle impossible to do so, because of quantum indeterminacy.8 However,
quantum indeterminacy only sets in when the measurement is made; a quantum
system is always perfectly deterministic and always evolves in a deterministic manner.9 Macro-scale unpredictability, such as the spatial path of a rapidly
deflating untethered balloon, has nothing to do with quantum effects, and
although the balloon and a pitched baseball obey the same physical laws,
yet the baseball is relatively predictable in flight (witness the batting averages
of some batting aces), whereas the balloon is not. The difference between
the two classical physics tells us, depends upon the turbulent flow of the
balloon and the laminar flow of the baseball.
Regular, laminar flow, exemplified by the rush of air around a spinning
baseball or over the wing of a speeding aircraft, is quite predictable and understandable because such flows are calculable through differential equations;
turbulent flows such as the flight of the balloon through the air or the unorderly
crash of water through a rapids are much more difficult to understand and to
predict, since they escape the neat formalisms of the calculus. It was thought
that the unruliness of turbulent systems was directly linked to their basically
chaotic origins, and that chaos was basically unpredictable. But the new science
of chaos has changed the meanings of all these terms.
A rock poised on the crest of a hill is sensitive to initial conditions: a
very small push one way or the other will cause it to wind up in different
valleys. However, this type of sensitivity is restricted to the initial conditions; after the rock has started down one or the other side of the hill, its
behavior is largely controlled by well-known laws of physics, and small deviations in its course down the hillside will cause commensurately small
modifications of its final resting place at the bottom. A chaotic system,
however, is sensitive not only to initial conditions but to conditions at every
point of its evolution.
Another example lO is that of the exact prediction of the motion of a billiard
ball; assuming no loss of energy, if the gravitational effect of only one electron
at the edge of the galaxy is neglected in the calculation of the ball's motion
among the other balls on the table, the predicted motion is off after only one
minute. Such a large scale uncertainty stems from the curvature of the balls
that amplify small accumulated differences in impact points into large deviations in trajectories. This high sensitivity to small influences is one of the
characteristics of chaos.
The uncertainties inherent in quantum phenomena can always be accommodated by a Laplacian determinist because, although initial measurements
are subject to Heisenberg's quantum indeterminacy, it is still possible to predict
in the long run what the behavior of a given quantum system will be on
247
average: the probabilities in a sense converge to specific values. But the uncertainties inherent in chaotic phenomena are not so limited and do not permit
such a global convergence, for small influences give rise to very large
inaccuracies in the predicted outcome.
STRANGE AND NORMAL ATTRACTORS
~r
,..J
~
Damped
pendUlum
n
.- .velocity
velocity
..:....
"""0
'-----""
Pendulum's orbit
in state space
Undamped
pendulum
Pendulum's orbit
in state space
Fig. 1. Representation of the state space of a pendulum: damped [left] and undamped [right].
248
MICHEL 1. BLAIS
C
A simple system's two
basins 01 attraction (A & C)
F
A more complex system's four
basins 01 attraction (D. F, H & J)
Fig. 2. A system's basins of attraction can be pictured as valleys (troughs) separated by crests
(barrie~), where the depth of a trough is an indication of the "attractiveness" of the basin and
the height of a barrier that of the relative "difficulty" of leaving the basin; a ball rolling up
and down the inclines will be trapped in a trough unless it has the energy to carry it over a barrier.
Although trough C [left] is lower than trough A, the system cannot reach it unless sufficient
energy is on hand to get it over the barrier B. In more complex systems [right], some troughs
may be visited only rarely, (for example, F) but, once reached, may capture the system for a
lengthy period, keeping it from returning to an inherently more stable trough (for instance, J).
249
the motion along the short axis. Higher dimensional attractors have also been
studied, but their representations require a correspondingly higher number of
state space dimensions; in order to make them "visible," one may choose to
sample the motion at fixed intervals, in effect "intercepting" the orbit with
an orthogonal plane that then records a scattering of points representing the
state of the system at regular intervals. 11
In Figure 3, the resulting Poincare section through a torus (an orthogonal
planar cut) has the shape of a circle which is traced out by a large number
of intersections with the Poincare plane of the orbit of the attractor in its
phase space. Once the circle is completed, no new information is gained by
examining the section, for the attractor cycles through the same orbit forever.
A Poincare section through the orbit of a strange attractor, however, displays
a pattern that never repeats itself and that shows more and more self-similar
detail as one magnifies it (see Figure 4).
Fig. 3. Torus-shaped attractor [left] and Poincare section [right] through the torus, showing
the resulting set of intersecting points on the Poincare section. Successive blow-ups of a portion
of the Poincare section show no more detail than the original.
Fig. 4. Strange attractor [left] and associated Poincare section [right] through the attractor. Successive blow-ups of a portion of the Poincare section show self-similar but never
identical detail.
Until 1963, fixed points, limit cycles and tori were the only known attractors, and they were all "normal.,,12 In that year, Edward Lorenz l3 published
results showing for the first time how relatively simple systems could display
behavior exhibiting the structure of a "chaotic" or "strange" attractor. In such
an attractor, microscopic perturbations are amplified to affect macroscopic
behavior in such a way that
250
MICHEL J. BLAIS
[t]wo orbits with nearby initial conditions diverge exponentially fast and so stay close together
for only a short time. The situation is qualitatively different for non-chaotic attractors. For
these. nearby orbits stay close to one another. small errors remain bounded and the behavior is
predictable. 14
1- - ~
r.=~~ - ~
Fig. 5. Baker's transformation. The starting square is flattened and stretched, then cut in half;
the resulting halves are superposed, and the transformation reiterated. Two proximate points
can be rapidly separated by this type of transformation.
251
There are only three kinds of attractor: equilibrium (fixed point); periodic
(limit cycle) and quasi-periodic (torus); and chaotic. 15 Periodic and quasiperiodic motion, even if very complex, is predictable; chaotic motion is not.
Also, strange attractors require a state space of three or more dimensions.
DETERMINISM WITHOUT PREDICTABILITY
Strange or chaotic attractors are characterized by two basic laws: they are
extremely sensitive to initial conditions, and nearby trajectories rapidly diverge
(see Figure 6). In normal, non-chaotic attractors, small deviations in the
initial conditions will result in corresponding relatively small deviations of
orbits after a given time; in strange attractors, very slight differences in initial
conditions rapidly cause arbitrarily large deviations in the trajectories after
even a short time. In normal attractors, narrowing down the measurement errors
for the initial conditions will correspondingly narrow down the errors in the
calculated outcomes; but in strange attractors no amount of initial precision
will permit a corresponding precision in the calculated outcome.
Fig. 6. In a non-chaotic system [left] two nearby trajectories, arbitrarily close together in state
space at the outset, diverge relatively little as they evolve. In a chaotic system, however, [right]
two nearby trajectories diverge quite rapidly. Chaotic systems are very sensitive to slight
differences in initial conditions, whereas non-chaotic ones are not.
Normal attractors (such as those associated with laminar flows) are theoretically predictable, because it is possible to extract all the information about
them, once and for all. Strange attractors (such as those associated with turbulent flows) are not theoretically predictable, because they are information
sources: they continuously create information at each folding or bifurcation.
The seeming randomness is the result of the shuffling or folding process.
252
MICHEL J. BLAIS
Because they create information, chaotic attractors reveal ever more detail
as one scrutinizes more finely: this is their self-similar fractal l ? nature.
In a normal, non-chaotic, attractor, small uncertainties give a certain amount
of information that is preserved with time; this is the sense in which they
are predictable, because these systems are not overly sensitive to measurement
errors. Because their trajectories settle down in a finite time either to a fixed
point or an endlessly repeating limit cycle, no information is lost and none
is created. But in a chaotic attractor, initial information is removed and replaced
by new information; instead of mixing up the dough (in the Baker's transformation), the attractor mixes up the state space.
The stretching and folding operation of a chaotic attractor systematically removes the initial information and replaces it with new infonnation: the stretch makes small-scale uncertainties larger,
the fold brings widely separated trajectories together and erases large-scale infonnation. Thus
chaotic attractors act as a kind of pump bringing microscopic fluctuations up to a macroscopic
expression. [... tJhere is simply no causal connection between past and future. 's
The very existence of deterministic chaos thus breaks the traditional link
between determinism and predictability, because the only way to calculate a
future event for these systems is to actually step through each and every
individual calculation until finally reaching the point sought at the desired
future time. In other words, to find out the future state of a strange attractor,
it is necessary to calculate its whole history, one step at a time. In the case
of normal attractors, it is necessary to perform one calculation only, where
the desired future time is substituted into the equation.
253
In a perfectly determined Laplacian world, in which agents' actions are completely governed by physical laws, what appear to be genuine choices for an
agent are in fact only imagined options because they are physically unrealizable; although the agent may think that two options A and B are really open,
strict determinism in fact forces the result of the choice. In a world that is
not perfectly determined, in which agents would be genuinely able to alter
the course of at least some events, even if an agent has chosen option A, it
is theoretically possible to have chosen option B; the agent can thus imagine
a reversal of time's arrow, permitting a return to the moment of decision where
option B is chosen instead of option A.
Obviously, not many people believe that time can actually be reversed.
But the undetermined agent's inherent freedom, unconstrained by physical law,
should permit the choice of option B if the agent were again placed in exactly
the same circumstances once again (see Figure 7 [right]). The determined agent,
however, must necessarily choose option A all over again were time reversed,
because option B was never real and thus was never really open (see Figure
7 [left]).
_10_ _ _---;11/
............
------.------------ B
Fig. 7. An agent's time line [left] encounters a bifurcation at t" where two options are open:
A and B; the agent chooses, and the time line consequently continues along line A; option B
is unrealized. According to some views of freedom of choice, were it possible [right] to reverse
the flow of time at t2 , an agent could return to t, and choose option B instead.
254
MICHEL J. BLAIS
255
Recent research has shown deterministic chaos to exist not only in inert
physical and chemical systems but also in biological systems as well. Of
particular interest is the demonstration that electroencephalograms of the
human brain and electrocardiograms of the human heart can be modeled by
deterministic chaos. In both cases, the chaotic regimes characterized by
epilepsy and by fibrillation can be effectively modeled by deterministic
chaos. Ocean dynamics, sunspots, animal population dynamics, blood pressure
variations and hormonal concentrations are other examples of systems that
are being better understood for being modeled on deterministic chaos.z Without
going so far as to assume that the human mind is purely biological, it is still
fascinating to discover that human bodies respond to chaos.
The most important characteristic of deterministic chaos is the impossibility
of predicting behavior beyond the short term into the far future, for the very
reason that chaotic systems, being extremely sensitive to initial conditions,
generate information as they evolve. It is thus impossible at time to to predict
what exactly will happen at a later time ~ without actually calculating each
intermediate state. However, and this is the paradoxical second most important characteristic, each of these intervening states is as perfectly determined
as a Laplacian could wish. Thus, even though an agent is perfectly determined,
if the agent is governed even partly by deterministic but chaotic laws, then
the agent's behavior may be inherently unpredictable.
The possibilities that are considered open to the agent at the moment of
choice can be quite real, and quite determined, but the outcome of the choice
can also be quite unpredictable. Even though an option is determined, it is
not true that the very same option would or even could be chosen, were time
reversed. This is because subtle differences in initial conditions can bring about
large differences in final behavior, on the one hand, and it is impossible to
really be sure that a system has been brought back to its exact prior starting
point, on the other. Figllre 8 shows the difference between a normal [left]
and a chaotic [right] interpretation of a time-reversal in a determined world.
The left-hand portion of Figure 8 shows the assumption that freedom should
permit the choice of a different option B at tl were time reversed - small uncertainties in the initial conditions give rise to at most small variations in the
resulting effects; this, in essence, is a combination of Figure 6 [left] with Figure
7 [right]. The right-hand portion of Figure 8 shows the assumption that freedom
would not necessarily permit the same repetition - small uncertainties that
can never be completely eliminated can give rise to very large variations in
the resulting effects; this combines Figure 6 [right] with Figure 7 [right].
There is thus from the standpoint of deterministic chaos some truth to the
claim that the free agent could have done otherwise; this however doesn't entail
that the agent wasn't determined. It just means that, all other things being
just about equal, very slight differences could have led to a different trajectory in state space. In Figure 8 [left] the imagined return to the exact prior
256
MICHEL J. BLAIS
Fig. 8. Normal depiction [left] of the possibility of making a different choice were timereversal possible; small discrepancies would not fundamentally change the chosen option, B.
Chaotic depiction [right] of the same possibility; small discrepancies can give rise to a quite
different option, C.
257
attract the agent. Whether the basin was freely chosen by the agent, or physically coerced by natural causes, chaos theory cannot decide. All one can
say is that in similar past circumstances, this particular agent (or any agent,
in general) followed different time-lines.
Informal modeling of human behavior along the lines of deterministic chaos
doesn't of course solve the free-will problem. Probably nothing will, if only
because humans feel that deliberation makes a difference, and nothing seems
able to reduce this feeling to any form of determinism. But deterministic chaos
sheds new light on the free-will problem by showing that determinism doesn't
necessarily entail predictability and, conversely, that unpredictability doesn't
necessarily entail freedom from determinism either.
So, if chaos doesn't set you free, it at least allows you to eat your deterministic cake and have your non-predictability, too.
University of Sherbrooke
NOTES
I
"All events, even those which on account of their insignificance do not seem to follow the
great laws of nature, are a result of it just as necessarily as the revolutions of the sun. In ignorance of the ties which unite such events to the entire system of the universe, they have been
made to depend upon final causes or upon hazard, according as they occur and are repeated
with regularity, or appear without regard to order; but these imaginary causes have gradually
receded with the widening bounds of knowledge and disappear entirely before sound philosophy, which sees in them only the expression of our ignorance of the true causes. [... ] We
ought then to regard the present state of the universe as the effect of its anterior state and as
the cause of the one which is to follow. Given for one instant an intelligence which could
comprehend all the forces by which nature is animated and the respective situation of the
beings who compose it - an intelligence sufficiently vast to submit these data to analysis - it
would embrace in the same formula the movements of the greatest bodies of the universe and
those of the lightest atom; for it, nothing would be uncertain and the future, as the past, would
be present to its eyes." Pierre Simon, Marquis de Laplace: A Philosophical Essay on Probabilities.
Dover Publications, Inc., New York, 1915 (translated from the 6th edition by F. W. Turscott
and F. L. Emory), pp. 3-4.
2 In this regard, see for example: Antony Flew and Godfrey Vesley: Agency and Necessity. Basil
Blackwell, Great Debates in Philosophy, Oxford & N.Y., 1987.
3 See Section VIII entitled "Of Liberty and Necessity" in David Hume: An Enquiry Concerning
Human Understanding. 3rd edition, edited by L. A. Selbye-Bigge & P. H. Nidditch, Oxford,
Clarendon Press (1748) (1902) 1975. Hume's definition of freedom: "By liberty, then, we can
only mean a power of acting or not acting, according to the determinations of the will."
[po 95]. His assimilation of volition to natural causation: "But being once convinced that we know
nothing farther of causation of any kind than merely the constant conjunction of objects, and
the consequent inference of the mind from one to another, and finding that these two circumstances are universally allowed to have place in voluntary actions; we may be more easily led
to own the same necessity common to all causes." [po 92].
4
See the following section for more details on van Inwagen's incompatibilism.
5 Peter van Inwagen: "Incompatibility of Free Will and Determinism." Philosophical Studies
27, no. 3 (March 1975, pp. 185-199), p. 186.
6
[van Inwagen, 1975, p. 191].
7 See Henri Poincare: Science et methode. Flammarion, Paris, [1908]1947. "A very small cause,
258
MICHEL J. BLAIS
which we do not notice, determines a considerable effect that we cannot but see, and then
we say that the effect is due to chance. If we were to exactly know the laws of nature and the
situation of the universe at the initial instant, we could exactly predict the situation of the same
universe at an ulterior instant. But, even if the laws of nature no longer held any secrets for
us, we could only know the initial situation but approximately. If that permitted us to predict
the ulterior situation with the same approximation, that is all we require, we say that the phenomenon has been predicted, that it is governed by laws; but it is not always thus, it may
happen that small differences in the initial conditions generate very large ones in the final phenomena; a small error in the former will produce an enormous one in the latter. Prediction becomes
impossible and we have a fortuitous phenomenon." [pp. 68-9 - translated from the original
French]
In the 1960's, Lorenz sparked what has become an explosion of theory and of practical
applications of deterministic chaos; see Edward N. Lorenz: "Deterministic Nonperiodic Flow."
Journal of Atmospheric Science 20, 1963, pp. 130--141. Many excellent introductory texts exist
on the rapidly burgeoning subject of deterministic chaos. See for example James Gleick: Chaos
- Making a New Science. Viking, New York, 1987.
The following sketch of deterministic chaos follows in part: James P. Crutchfield et al.:
"Chaos." Scientific American 255, no. 6 (December 1986, pp. 46-57).
S the interested reader may wish to consult Jesse Hobbs: "Chaos and Indeterminism." Canadian
Journal of Philosophy 21, no. 2 (June 1991, pp. 141-164), in which the relationship between
chaos and quantum indeterminacy is explored. In the present article, this particular aspect of
chaotic behavior will not be pursued.
9
On this subject, see for example Richard P. Feynman: QED - The Strange Theory of Light
and Matter. Princeton University Press, Princeton, N.J., 1985.
10
Also given by [Crutchfield et al., 1986, pp. 48-9].
II
This is the method pioneered by Poincare.
12
For the history of chaos theory, see for example the introduction to: Hao Bai-Lin (ed.): Chaos.
World Scientific, Singapore, 1984.
13
See [Lorenz, 1963].
14
[Crutchfield et al., 1986, p. 51].
15
for a detailed discussion of various types of chaotic attractors, see for example: J. M. T.
Thompson and H. B. Stewart: Nonlinear Dynamics and Chaos. John Wiley and Sons, Toronto
& N.Y., 1987.
16
Robert Shaw: "Strange Attractors, Chaotic Behavior, and Information Flow." Zeitschrift
fiir Naturforschung 36a (1981, pp. 80--112), pp. 106.
11
See Benoit Mandelbrot: The Fractal Geometry of Nature. Freeman, N.Y., 1977.
18
[Crutchfield et al., 1986, p. 53].
19
Robert M. May: "Simple Mathematical Models with Very Complicated Dynamics." In [BaiLin, 1984, pp. 149-157].
20
See for example Hermann Haken and Arne Wunderlin: "Le Chaos deterministe." La
Recherche 21, no. 225 (October 1990, pp. 1248-1255), p. 1255. See also the numerous examples
cited in: Dahan Dalmedico, A. et al. (eds): Chaos et dererminisme. Points/Sciences, Seuil,
Paris, 1992.
PAUL M. PIETROSKI
This idea has fallen on hard times. But rumors of its death have been exaggerated, in my view, by the mistaken view that laws are exceptionless universal
generalizations. We can construe Ramsey's proposal in terms of a more relaxed
(and empirically motivated) conception of law. l The resulting account of
causation is worth taking seriously. It also shows that (plausible) coveringlaw and probability-raising conceptions of causation are not so different after
all; and while neither conception leads to a simple theory, I think we have
to live with complexity here.
1.
where x and y range over nomologically possible events, 'F' and 'G' are
(perhaps complex) predicates, t it a time, and E is an interval. An ordered event
pair (e j , ej ) instantiates (1), iff: e j satisfies 'F' at t, and ej satisfies 'G' at
some time between t and t + E. Instead of appealing to nonactual events to
handle accidental generalizations, one might require "projectible" predicates.
But I do not share Goodman's (1979) ontological concerns; and I think we
have some sense of nomological possibility independent of our commitment
to any given generalization. In any case, I will be appealing only to actual
events, or events easily made actual by performing experiments.
The key feature of strict laws is their universality: whenever the antecedent
is satisfied, the consequent must be satisfied. So any instance of [Fxt &
-,3y(Gyt+E)] is inconsistent with (1). I assume that we are excluding, for
present purposes, generalizations that are true by virtue of logic or meaning.
Still further requirements - e.g., that antecedents not be otiose, and the absence
of the word 'cause' - would be needed to define a notion of causal law without
circularity. But this is not my goal, although I offer a tentative proposal
259
M. Marion and R. S. Cohen (eds.), Quebec Studies in the Philosophy of Science I, 259-273.
1995 Kluwer Academic Publishers.
260
PAUL M. PIETROSKI
below. So henceforth, let 'law' be short for 'causal law' . That said, we might
render Ramsey's proposal as follows:
e j causes ej , if there is an ordered n-tuple of events (e l e 2, en).
such that e j = e l ej = en. and for each ek , (ek ehl ) instantiates a strict
law.
This thesis is unsatisfactory; though not because events can have a common
cause, or because of causal asymmetries. The rising of one barometer does
not cause the rising of another. But neither is there a strict law here. If Dudley
tinkers with his barometer on a fair day, his barometer may rise while others
remain constant. Similarly. a few experiments will show that no strict law determines the height of a flagpole as a function of (i) the length of a nearby pattern
of illumination on the ground, and (ii) the position of the sun: paint a long
streak on the ground near a short pole (or absent any pole at all) on a cloudy
day; erect a pole on a steep hill; build a (dark) wall near the pole; etc. More
restricted shadow-to-flagpole generalizations may avoid particular counterexamples. But no finitely statable version will be exceptionless. 2
Of course. such "defenses" of the strict covering law account serve to
illustrate its basic inadequacy: there are few, if any, strict laws. Even if there
are no counterexamples to a thesis that some condition suffices for causation. such a thesis will be unexplanatory if the condition is never satisfied. And
it is not a strict law barometers rise when atmospheric pressure rises.
Barometers can malfunction; and various things can interfere with the "normal"
effects of atmospheric pressure on operational barometers. Similarly. an
indefinite number of (possibly conspiring) interfering factors might result in
a shadow that is too long (or short). according to some flagpole-to-shadow
generalization. Nor is this difficulty confined to homey examples.
It is not a strict law that if organisms possessing a (heritable) property P
are better able to survive and reproduce than organisms possessing an alternative property P*, then the proportion of organisms in the popUlation having
P will increase. As Sober (1984) notes, genetic drift, pleiotropy, cataclysmic
events. etc., can work against the "fittest". Joseph (1980) and Cartwright (1983)
have argued that, absent a unified field theory, the same point applies to
physics. To take Joseph's example: Coulomb's Law tells us that a point charge
produces (the relativistically invariant analog of) a spherically symmetric electromagnetic fields; but if there is a dense mass in the vicinity, the quanta
of the electromagnetic field will respond to the gravitational influence. thus
distorting their spherically symmetric distribution. Cartwright offers similar
examples.
One might reply that laws do not make claims about the behavior of objects,
not even the trajectories of fundamental particles. This claim has some plausibility in physics. since the "fundamental" laws are often interpreted as saying
that whenever certain initial conditions are met, a field is produced; where a
field is held to be an entity in its own right quite apart from the objects upon
which its acts. Creary (1981), for example. seems to hold that fields cause
261
the behavior of particles, and that only a compositional "Law of Total Force"
would make claims about such behavior. While this suggestion is plausible
a priori, Joseph notes that nature has thus far been recalcitrant. We cannot
just "put the parts of physics together," because they do not form "a single
consistent theory demonstrably possessing a physical model (779-80)."
It seems less likely that the laws of other sciences can be construed a la
Creary. Natural selection is not any kind of field that causes changes in gene
pools. And once we consider psychological or economic generalizations, it
becomes implausible to say that (in general) laws should not be interpreted
as entailing claims about the behavior of objects in the relevant domains.
Moreover, even if a unified field theory is imminent, there is no reason to
expect parallel unifications of biology or other special sciences. The "force"
of natural selection, for example, can be swamped by nonbiological cataclysmic
events (e.g., earthquakes or meteor strikes). In Davidson's (1970) terms, special
science generalizations are "heteronomic," and this would seem to make
them irremediably nonstrict. For similar reasons, it is Panglossian to expect
a "Law of Total Force" stated in the vocabulary of the special sciences. But
without composition laws, appeals to forces cannot be translated into laws
that particular events instantiate.
I take these considerations to motivate the claim that laws have implicit
ceteris paribus (henceforth, cp) clauses; though without some explanation of
their theoretical role, appeal to such clauses merely relabels the problem at
hand. In discussing the relevance of these issues for deductive-nomological
models of explanation, Hempel (1988) notes that the idea of a cp clause "is
itself vague and elusive .... What other things, and equal to what?" Schiffer
(1991) wants to know which proposition (if any) is expressed by a putative
cp-Iaw. For on pain of rendering
(2)
cp{V'x[Fxt
3y(Gyt+e)]}
as a trivial tautology, the cp-clause cannot mean, "Barring cases of [Fxt &
-,3y(Gyt + e))." And as Schiffer's real worry is that (2) is meaningless, it is
no help to say that 'cp{0}' means 0, cp (Cf. 'All mimsy were the borogoves.' means that the borogoves were all mimsy.) Let us call instances of
[Fxt & 3y(Gyt + E)) and [Fxt & -,3y(Gyt + E)), respectively, "Normal" and
"Abnormal" instances of (2). If there are cp-Iaws, they are compatible with
some nomologically possible Abnormal instances, else they would be strict.
This tells us something about what doesn't make a cp-Iaw false - viz., the mere
fact that it faces an Abnormal instance; though we still need an informative
claim about what does make a putative cp-Iaw false. But if we can say under
what conditions (2) would be false, then (2) must be meaningful.
Joseph (1980) considers interpreting laws as holding "ceteris absentibus,"
by which he means that (say) Coulomb's law would describe the trajectories of relevant quanta in situations where all non-electromagnetic forces
were absent. But he rejects this idea for good reason: the subjunctives will
often be contralegals of a radical sort. Given objects with mass and charge,
262
PAUL M. PIETROSKI
'tx (lA
---7
[Fxt
---7
3y(Gyt + E)]}
263
more fully developed in Pietroski and Rey (1995), and applied to other domains
in Pietroski (1993, 1994); and I show how this account can be deployed in
the service of a covering-law conception of causation.
2.
264
PAUL M. PIETROSKI
265
constraint to be iterative, just as in the last paragraph. Intuitively, a nonaccidentally normal instance is one that occurs when any potentially interfering
factors are themselves interfered with. For in the presence of "undefeated"
interference, we expect Abnormal instances of cp-Iaws; but (as with Dudley's
swan) the consequent of a cp-Iaw can be satisfied when the antecedent is
satisfied, even if the latter does not explain the former.
The task for the covering (cp) law theorist, as I see it, is to make explicit
the kinds of constraints on explanation I am gesturing at. Given such constraints, it may be that the necessary condition on cp-Iaws proposed above
can also serve as a sufficient condition. Following Ramsey, the idea would
be that events exhibit a good deal of regularity of succession - though not
so much regularity that we can state strict laws. A putative cp-Iaw would serve
to set a standard for what counts as a Normal case; and the standard would
be correct, if every Abnormal case "left over" can be handled - where there
are constraints on what counts as "handling" an Abnormal instance. But the
present task is to show how there can be non-trivial cp-Iaws; so (substantial) necessary conditions are more important than sufficient conditions. And
the crucial claim is that we quantify over Abnormal instances of a putative
cp-Iaw, holding that each must be explicable, instead of trying to state in
advance a condition (in the form of an extra antecedent) that covers every
possible Abnormal case.
If this is correct, events can instantiate cp-Iaws even when other things
are not perfectly equal. Differential fitness can lead to evolution in the direction of the fitter trait in the presence of some counteracting drift, pleiotropy,
etc. So many Normal instances of Darwin's principle would not instantiate
an indicative conditional whose antecedent is, 'If there is no drift or pleiotropy
or . . .'. Of course, given a generalization that purports to state the precise
quantitative effects of selection on a gene pool, the problem arises again.
But qualitative laws are often what we want; and we settle for them when
they are all we are likely to get. More importantly, while it may be a tautology that A leads to B unless it doesn't, the same is not true of the claim that
A leads to B unless there is an explanation for why it doesn't. In particular,
(4)
266
PAUL M. PIETROSKI
If there tum out to be no strict laws - i.e., if all (causal) laws are cp-Iaws then it would be tempting to say that causal laws are those laws that have
nomologically possible Abnormal instances, thus excluding logical and analytic
truths by virtue of the fact that any such generalizations will be exceptionless. The motto would be, "No causation without exceptions." I find this idea
attractive, although I won't press it here. The current proposal says nothing
about events covered by no law. But given the relaxed notion of law (which
I will relax still further) there may be enough laws to go around; though
many laws may be scientifically uninteresting. Suppose, however, that there
is an event e*, such that for no ek is (ek , e*) a Normal instance of a law. It
does not follow that the proposal is false, or even that we should look for
an alternative sufficient condition with wider explanatory scope. For I don't
see why we have to assume that every event has a cause. Perhaps "No uncaused
events" slogans express an ideal: strive to find laws that cover all events.
Moreover, if the current rendering of Ramsey's proposal has the consequence
that there are uncaused events, perhaps we can allow that some events "just
happen;" where this just means that they cannot be located in a pattern of nomic
regUlarity. But I propose to set aside such questions. For the most serious worry
about the current proposal, I suspect, will be that the maneuvering used to
defend appeal to cp-laws is unnecessary. One might hope that we can appeal
to probabilistic laws instead of cp-Iaws, and that we can appeal directly to
probability-raising instead of laws in our account of causation, thereby getting
a theoretically simpler account. But I think that such simplicity is not to be
had.
3.
1. For
cp{\lx[Fxt -7 3y(Gyt+e)]}
as: if F, then N%(G); and if F, then cp[G]. (The account of cp-Iaws offered
in section two, recall, renders the consequent of such laws disjunctive in form.)
We cannot dispense with cp-clauses in favor of probability operators, because
these two ways of modifying the traditional (strict) form of laws scratch different itches. We need probability operators because the world is not
deterministic; whereas we need cp-clauses because the world is complex and
full of interacting factors. Consider again Coulomb's law, which faces
Abnormal instances given nearby dense masses. We cannot make a strict statement of this law true just by adding a probability operator. For we cannot
determine the probability of there being a nearby dense mass; and there is
267
268
PAUL M. PIETROSKI
such a population, the chance of getting heart disease given that one smokes
may be lower than the incidence of heart disease in the whole population.
This would not show that smoking is a prophylactic against heart disease in
the population. Indeed, smoking would still be a cause of heart disease. But
the simple theory above offers no explanation of this fact.
A now standard response to this problem is to partition the population, in
this case into exercisers and non-exercisers. Eells and Sober (1983), for
example, suggest that smoking causes heart disease, as long as the chance
of getting heart disease given that one smokes is higher than the incidence
of heart disease in each relevant sub-population. This complicates matters
significantly, since there will be a relevant partition for every factor (other than
the putative cause in question) that is causally relevant to the effect in question.
In the context of the present example, this would mean considering partitions for diet, genetic predispositions to heart disease, exercise, etc. The appeal
to causal relevance will render the resulting theory unsuitable as a reduction
of the causal to the non-causal. But I am not hoping for such a reduction.
And while it is difficult in practice to apply such a theory to cases, it might
still be true that
e j causes ej, if there are types T m and Tn such that
eJT m], ej[Tn], and in each causally relevant partition,
P(TjTm) > P(Tn)
Dupre (1984, 1993) calls this requirement "contextual unanimity" and points
out that it is unduly strong. 8 If scientists (employed by the tobacco industry)
discover a rare physiological condition P, such that those with P are less
likely to get lung cancer if they smoke, this would not count as a discovery
that smoking does not cause lung cancer. But Dupre thinks that if having P
were the "rule rather than the exception," we could conclude that "smoking
was a prophylactic against lung cancer;" and this conclusion would not be
refuted by the fact that "those abnormal and unfortunate individuals who lacked
this physiological advantage were actually more likely to get lung cancer if
they smoked (1984, p. 172)." According to Eells and Sober, if smoking raises
the probability of (lung) cancer in some causally relevant partitions, but lowers
it in others, then there is "no such thing as the causal role of smoking" with
respect to cancer "in the population as a whole" (p. 37, their emphasis). And
indeed, smoking does not have a uniform causal role in the population Dupre
describes. But a contextual unanimity theory leaves us with no explanation
of the fact that smoking causes cancer in many of those who lack P.
If one restricts the population from, say, North Americans to North
Americans without P, Dupre could offer a similar example for the restricted
(and rather arbitrary) population. And repeated restrictions threaten to limit
(eventually to zero) the population size, thus threatening the very idea of probability raising. More importantly, as Dupre (1984, p. 174) suggests, imposing
the same condition on every causally relevant partition is the analog in the
269
270
PAUL M. PIETROSKI
time because "that's just the way it goes sometimes." In deterministic cases,
it seems perfectly reasonable to ask for explanations of why causes fails to
produce their effects; and at least often, it is reasonable to ask why indeterministic causes fail to raise the probability of their effects. In Dupre's own
example, we are inclined to ask if having condition P explains why smoking
reduces the chance of cancer in those with P.
Dupre rejects Cartwright's suggestion, because he thinks it can be a mere
matter of chance that a cause fails to raise the probability of its effect. If the
relation between smoking and cancer is indeterministic, an individual smoker
can simply be "lucky" and not get cancer. So, Dupre asks, couldn't a group
of smokers (say those with the condition P) be "second-order lucky" in that
their chance of getting cancer wasn't raised, without there being any reason
for this? If so, I agree that this would not show that smoking wasn't a cause
of cancer. And like Dupre, I don't think we have evidence against the claim
that the world is "chancy" in just this way. Nor should we assume that all
such probabilistic truths must be manifestations of underlying (competing)
causal capacities. So it may be too much to demand an explanation whenever
a cause fails to raise the probability of its effect. But again, "average effect"
proposals will miss something by never requiring such an explanation. This
is, I hope, enough to suggest that an adequate probability-raising theory is
unlikely to be simple. Moreover, I think we can get between Cartwright and
Dupre by completing our relaxation of the notion of law.
I have argued that we should allow for laws with: cp-clauses, probability
operators, and both. But modifying the strict form in two ways presents the
possibility of a scope difference. Recall that 'if F, then cp[N%(G)]' says: given
F, either (i) there is an N% chance that G will obtain, or (ii) there is an
explanation for why (i) is false. We can think of a cp-probabilistic law as
the result of probabilizing and then idealizing. If we idealized and then probabilized, we would get a law that says: given initial conditions, there is an
N% chance that either (i) the consequent condition will obtain, or (ii) there
is an explanation for why (i) is false. A "probabilistic-cp" law of the form
'If F, then N%(cp[G])' would allow for a percentage of cases in which there
is no explanation for why initial conditions do not lead to consequent conditions. But not every case in which (say) smoking does not lead to cancer would
be written off as a mere matter of chance. We could distinguish scenarios in
which some smokers fail to get cancer because they have condition P and
scenarios in which smokers with condition P just get lucky.9
The resulting covering-law account does not reject probability-raising conceptions of causation. For it appeals to a range of law forms. In so doing, it
appeals to probability operators and the notion of quantifying over (and
explaining) Abnormal instances. Moreover, suppose that, as I have suggested,
a plausible probability-raising theory will quantify over partitions, and appeal
to explanations of why causes fail to raise the probability of effects in some
partitions. Then in some sense, the two approaches converge. But Dupre's case
provides reason for preserving the covering law conception. For I think his
271
case is one in which smoking causes cancer in some members of the population, while it inhibits cancer in others; and if there are such cases, we
cannot cash out the notion of causation solely in terms of probabilities and
(non ad hoc) populations. For smoking cannot raise and lower the chance of
cancer in a population.
The covering law theorist, however, can describe Dupre's population as one
governed by a (probabilistic-cp) law to the effect that smoking leads to cancer
(perhaps with condition P as an interfering factor), and another law to the
effect that smoking inhibits cancer in those with condition P. For nonstrict laws
will be nonmonotonic. CP, falling bodies accelerate at 32 ft.lsec 2 - Abnormal
instances of this generalization to be explained by citing, inter alia, friction.
But it does not follow that cp, bodies affected by friction accelerate at 32
ft.lsec 2 Indeed, if we can specify the (normal) effects of friction (in a given
environment), we can say that cp, bodies fall at some other rate R; though
de re, as it were, each body affected by friction would still be such that it accelerates at 32 ft.lsec 2 , Cp.lO Dupre thinks that whether smoking causes or inhibits
cancer depends on whether P is the rule or the exception in the popUlation,
where this is a statistical matter (given a fair sample). I think what the "rules"
(and hence the exceptions) are is a matter of which laws hold, where this is
not just a matter of probability. So a smoker getting cancer can be the rule
or the exception, depending on the law in question.
By way of conclusion, let me offer the following familiar observation: In
saying that one event causes another, we commit ourselves to something
regarding similar events. Strict laws represent too strong a commitment. But
this is because laws need not be strict, not because the notion of a covering
law cannot help us give substance to the slogan, "Same cause, same effect."
A more relaxed notion of law makes the relevant commitments rather more
open-ended and easier to satisfy, without making them vacuous. This rendition of Ramsey's proposal may not provide the rigorous and tidy picture of
the world's causal order that some covering-law theorists hoped for. But I don't
think we need the rigor. And I see no reason to think the world is tidy.ll
McGill University
NOTES
I will not, however, defend Ramsey's claim that causal laws do not express propositions.
Perhaps, by definition, the shadow of x is caused by x. But flagpoles may cause a disturbed
Dudley to elongate the next comparative darkness he sees; or there might be a large refracting
lens between a pole and its shadow. And to say the shadow of x must be caused by x in "the
normal way" is to grant that there are no strict laws involving shadows.
3 Cartwright must distinguish among false sentences - e.g., F = Gmm' /d 2 vs. F = Gmm'. If
we use 'FE' to label the false but explanatory (by virtue of expressing a capacity) generalizations, then some nomic generalizations will have the form: FE{'v'x[Fxt ~ 3y(Gyt+e)]}. Arguably,
the difference between this proposal and appeal to cp-laws is notational. Cartwright would not
subscribe to a covering-law account of causation. But that is another matter.
1
272
PAUL M. PIETROSKI
But I do not stipulate that we have an explanation only if the putative cp-Iaw is true. That
would make the proposal viciously circular. See, e.g., van Fraassen (1980) for discussion of
the role of contrast classes and presuppositions in explanation.
S Or to take a familiar case, while there is an explanation of why Mercury orbits the sun as it
does, this does not explain the Abnormal instance of Newton's laws. The anomaly of Uranus'
orbit, however, was explicable by citing the presence of Neptune.
6
Only if atmospheric pressure is rising, will (4) be free from inexplicable Abnormal instances;
and building this condition into the antecedent would make it otiose. Similarly, there will be
no "shadow-to-flagpole" cp-Iaws. The presence of a "shadow-like" streak on the ground, or a
refracting lens between the pole and its shadow (see note 3 above) will not explain the relevant
Abnormal instances. (Again, my goal is not to say why such appeals would not explain why
the flagpole was "too short," much less to do so without appealing to causation.)
1 Suppose we have laws of the form 'if F, then cp[N%(G)], and 'if K, then cp[M%(G)]'. If
K obtains, it seems that an instance of F&G can be an accidentally Normal instance of the
first law, even if there is no undefeated interference with respect to either law. For
intuitively, there is an M% chance that K caused G and a (IOO-N)% chance that F did not
cause G. I am not entirely sure that we should take such intuitions at face value. But if we do,
we can also speak of the chance that an event is overdetermined, and the probability that we have
an accidentally Normal instance of a cp-probabilistic law.
8 Skyrms (1980) aruges that a cause need only raise the probability of its effect in some background condition and not lower the probability of its effect in any. But Dupre's point is unaffected,
since this still requires unanimity with respect to not lowering probability.
9 Are there cp-probabilistic-cp laws, and so on? I take the question to be an empirical one.
But at some point, it will become impossible to gather evidence for iteratively hedged laws.
10
See Pietroski (1993), pp. 504-5. The Ideal Gas Law and van der Waal's equation provide
a similar example; see Pietroski and Rey (1995) for further discussion. The prevalence of an
interfering condition (like friction) is irrelevant to whether it can explain Abnormal instances.
And cp-Iaws will not be transitive for the same reason they are not monotonic.
II
My thanks to FCAR and SSHRCC for financial support, and to Susan Dwyer for helpful
discussion.
4
REFERENCES
Cartwright, N., 1983, How the Laws of Physics Lie, Clarendon, Oxford.
Cartwright, N., 1989, Nature's Capacities and Their Measurement, Clarendon, Oxford.
Creary, L., 1981, 'Causal Explanation and the Reality of Natural Component Forces', Pacific
Philosophical Quarterly 62, 148-157.
Davidson, D., 1970, 'Mental Events' , in Essays on Actions and Events, Clarendon, Oxford (1980).
Dicke, Peebles et al., 1965, 'Cosmic Black Box Radiation', Astrophysical Journal 142, 414419.
Dupre, J., 1984, 'Probabilistic Causality Emancipated', Midwest Studies in Philosophy 9, 169175.
Dupre, J., 1993, The Disorder of Things, Harvard, Cambridge.
Eells, E., 1987, 'Probabilistic Causality: Reply to John Dupre', Philosophy of Science 53,
52-64.
Eells, E. and Sober, E., 1983, 'Probabilistic Causality and the Question of Transitivity',
Philosophy of Science SO, 35-57.
Goodman, N., 1979, Fact, Fiction, and Forecast, Harvard University Press, Cambridge.
Hanson, N., 1958, Patterns of Discovery, University Press, Cambridge.
Hart, H. and Honore, A., 1959, Causation in the Law, Clarendon, Oxford.
Hempel, C., 1988, 'Provisoes: A Problem Concerning the Inferential Function of Scientific
Theories', Erkenntnis 28, 147-164.
Joseph, G., 1980, 'The Many Sciences and the One World', Journal of Philosophy 77, 773-790.
273
Laymon, R., 1985, 'Idealization and the Testing of Theories by Experimentation', in P. Achinstein
and O. Hannaway (eds.), Observation, Experiment, and Hypothesis in Modern Physical
Science, MIT Press, Cambridge.
Penzias, A. and Wilson, R., 1965, 'A Measurement of Excess Antenna Temperature at 4080 mcfs',
Astrophysical Journal 142, 419-421.
Pietroski, P., 1993, 'Prima Facie Obligations, Ceteris Paribus Laws in Moral Theory', Ethics 103,
489-515.
Pietroski, P., 1994, 'Mental Causation for Dualists', Mind and Language 9, 336-66.
Pietroski, P. and Rey, G., 1995, 'When Other Things Aren't Equal: Saving Ceteris Paribus
Laws from Vacuity', British Journal for the Philosophy of Science 46, 81-110
Ramsey, F., 1929, 'General Propositions and Causality', in H. Mellor (ed.), Philosophical Papers,
Cambridge University Press, Cambridge (1990).
Schiffer, S., 1991, 'Ceteris Paribus Laws', Mind 100,1-18.
Sober, E., 1984, The Nature of Selection, MIT, Cambridge.
Skynns, B., 1980, Causal Necessity, New Haven, Yale.
van Fraassen, B., 1980, The Scientific Image, Clarendon, Oxford.
DA VID DAVIES
276
DA VID DAVIES
as this model is to serve for all theories, the notions employed in characterising the model cannot receive their interpretation from within any particular
theory. (Putnam, 1976b, p. 125)
In his earlier writings, Putnam himself embraced a form of Realism, a
position he termed "sophisticated realism" (see especially Putnam, 1976a):
on such a view, language users construct representations of THE WORLD, and
the contribution of language use to the successful prosecution of our practical goals justifies the hypothesis that these representations refer to and are
approximately true of THE WORLD. 'Internal realism', which Putnam now
embraces, "employs a similar picture within a theory" (Putnam, 1976b, p. 125).
'The world' of which language users can be said to construct representations
is the world as it is conceived through one or more of our various representations: it is to objects so represented that our terms refer. For the internal
realist, the 'causal interactions' with the world through which knowledge is
acquired are themselves "part of the subject-matter of the representation"
that is our scientific account of knowledge.
The model-theoretic argument challenges the externalist thesis that truth
is radically non-epistemic - that even an 'Ideal Theory' satisfying the highest
standards of rational assertibility might still be false. Putnam argues that the
externalist is not entitled to something that is a precondition for the possible
falsity of such a theory. The precondition is that there be a unique, determinate, and pre-theoretic relation of reference between terms in a language and
entities in THE WORLD whereby states of THE WORLD confer determinate truth-values on sentences containing such terms, and might therefore
confer the determinate truth-value 'false' on some sentence comprised by an
'Ideal Theory'. The MTA, which purports to establish that no such relation
of reference is available to the Realist, is formulated from within the externalist perspective and proceeds roughly as follows.
Assume the externalist conception of THE WORLD, and of truth as reference-mediated correspondence to THE WORLD. Then, given an Ideal Theory
(IT) satisfying all operational and theoretical constraints and of an appropriate cardinality:
(a) There is a mapping, and hence a 'reference' relation, from the terms
of the language of IT onto elements, or sets of elements, in THE WORLD such
that THE WORLD satisfies IT - call this mapping SAT;
(b) Nothing in our use of the language can serve to exclude this mapping
as an 'unintended' interpretation, since IT, and hence SAT, satisfies all operational and theoretical constraints (ex hypothesi), and since SAT also satisfies
our general intention to refer in such a way that our beliefs come out (for
the most part) true;
(c) Furthermore, no other constraints available to the externalist can exclude
SAT as an 'unintended' interpretation; so
(d) If truth consists in a reference-mediated correspondence between a theory
and THE WORLD, then IT cannot fail to be true.
The foregoing sketches the presentation of the MTA in Putnam, 1976b.
277
In Putnam, 1980, the point is made in a slightly different way. It is argued that,
given a set of operational constraints formulated in terms of one vocabulary
- what is, relatively speaking, the 'observational' vocabulary - we cannot
single out a unique 'interpretation-independent' reference for the terms in
another vocabulary - relatively speaking, the 'theoretical' vocabulary - by
means of the operational constraints in conjunction with a plausible set of
theoretical constraints. Putnam establishes this result for various choices of
'observational' and 'theoretical' vocabularies. For example, we may take the
'theoretical' vocabulary to comprise terms that purport to refer to 'unobservables' postulated in scientific theories, where the corresponding 'observational'
vocabulary contains terms that refer to things, events, and properties "observable with the human sensorium". Alternatively, we may take the 'observational'
vocabulary to comprise only terms that refer to present sense-data, and include
all terms that purport to refer to past and future sense-data in the 'theoretical' vocabulary. In either case, it is claimed, we can run what is essentially
the argument analysed above [(a)-(d)] to establish that terms in the 'theoretical' vocabulary fail to receive a unique and determinate interpretation and
must therefore be viewed as "formal constructs variously interpreted in various
models" (Putnam, 1980, p. 475).
What I shall term the 'canonical' response to the MTA consists in granting
- albeit, perhaps, for the sake of argument - steps (a) and (b) above, but balking
at step (c). The externalist maintains that, in the words of David Lewis, "there
must be some additional constraint on reference: some constraint that might,
if we are unlucky in our theorising, eliminate all the allegedly intended interpretations that make the theory come true" (Lewis, 1984, p. 224). More
specifically, the canonical response consists in some variation on the following
line of reasoning:
(1) Putnam assumes that the only plausible constraints on reference available to the Realist are those imposed by the acceptance of an uninterpreted,
or partially interpreted, set of sentences (see, e.g., Lewis, 1984, p. 225;
Goldman, 1986, p. 155; Demopoulos, 1982, p. 138); but
(2) This assumption is incorrect: our use of language in our more general
commerce with each other and with the world (THE WORLD) furnishes
additional constraints on reference, and the realist can appeal to such constraints in explaining how a unique, determinate, and pre-theoretic reference
relation is possible, and why SAT-like interpretations are unacceptable.
The general thrust of the canonical response is well expressed in the following remarks by Ian Hacking:
All [Putnam] has shown is that you cannot succeed in reference by stating a set of truths expressed
in first-order logic ... Language is embedded in a wide range of doings in the world ... Language
is more than talking ... Cherries are for eating, cats, perhaps, for stroking. Once speech
becomes embedded in action, talk of Lowenheim and Skolem seems scholastic ... We can do
nothing with very large numbers except talk about them. With cats we relate in ways other
than speech ... Assuring reference is not primarily a matter of uttering truths, but of interacting with the world ... (Hacking, 1983, pp. 104-8; see also Blackburn, 1984, p. 301)
278
DA VID DAVIES
As to the precise mechanism whereby our "interacting with the world" generates a unique and determinate relation of reference between language and
THE WORLD, most exponents of the canonical response have taken the
relevant constraints to consist in the existence of appropriate causal relationships between terms and their referents (see, e.g., Glymour, 1982, p. 177;
Devitt, 1983, p. 298; Brueckner, 1984). An alternative approach championed
by David Lewis takes the desired additional constraints to be furnished by
the 'structure' of THE WORLD itself (Lewis, 1984, p. 227; see also Merrill,
1980).
That the use of natural language occurs in the context of a more general
intercourse with non-linguistic reality is obvious. No less obvious, it would
seen, is that this feature of language use might yield constraints upon interpretation. The contention, implicit or explicit in the canonical response, that
Putnam ignores such possible resources for the Realist cause therefore invites
the question of how he could have overlooked something so transparent,
especially in the light of his own earlier criticisms of the 'descriptivist' programme in the philosophy of language. It is the recognition of this anomaly
confronting the canonical response that motivates Carsten Hansen's "Lockean"
reading of the model-theoretic argument. According to Hansen, Putnam
assumes that any adequate Realist response to the model-theoretic argument
must satisfy the condition that "we have knowledge of, or access to (mental)
representations only", where the latter "are in a significant respect like Lockean
'ideas' - they are mental entities of a sort that form a 'veil' between us and
the external world." (Hansen, 1987, p. 92)
Given this "Lockean epistemological premiss", Putnam's offhand dismissal
of Realist appeals to "causal connections" and "non-linguistic facts" is easily
understood, for "it will be question-begging to assume the possibility of having
knowledge of the external, non-representational objects in terms of which
the language is to be interpreted" (Hansen, 1987, p. 93). But, if the modeltheoretic argument does presuppose such a Lockean premiss, then, Hansen
argues, it has little force against Realism. For the Realist is not committed
to any such premiss, nor is the latter plausible on independent grounds. Thus
she can answer Putnam by simply rejecting the "Locke an" epistemology and
offering the sort of 'causal' story about reference beloved of (most) devotees
of the canonical response.
3. The aim of the MTA is to undermine Realism while leaving internal realism
unscathed. 1 The Realist contends that truth is a matter of correspondence to
THE WORLD, grounded in a determinate relation of reference. Putnam challenges the Realist to provide an account of how such a relation of reference
is established. But - and this point is crucial - an adequate Realist response
to this challenge must advert to constraints on reference that allow for the
possibility, at least in principle, that, given the extensions determined for the
terms in a language L, even an epistemically Ideal theory in L could be false.
This point is not undermined by the contention (see Hansen, 1987, and Devitt,
279
280
DAVID DAVIES
Indeed, internal realism can also incorporate the sort of 'causal' account of
the relationship between representations and what they represent that writers
like Devitt and Glymour have urged as a counter to the MTA. For internal
realism preserves the empirical content of Putnam's earlier "sophisticated
realism", now interpreted within a theory, and this includes precisely the kind
of scientific picture of linguistic representation advocated by Putnam's Realist
opponents. In 'Models and Reality', Putnam rejects the suggestion that a causal
theory of reference might enable the Realist to evade the force of the MTA,
and then immediately adds the following remarks:
This is not to say that the construction of such a theory would be worthless as philosophy or
as natural science. The program of cognitive psychology already alluded to, the program of
describing our brains as computers which construct an "internal representation of the environment", seems to require that mentalese utterances be, in some cases at least, describable as the
causal products of devices in the brain and nervous system which "transduce" information from
the environment, and such a description might well be what the causal theorists are looking
281
for. And the program of realism in the philosophy of science - of empirical realism, not metaphysical realism - is to show that scientific theories can be regarded as better and better
representations of an objective world with which we are interacting; and if such a view is to
be part of science itself, as empirical realists contend it should be, then the interactions with
the world by means of which this representation is formed must themselves be part of the
subject matter of the representation. But the problem as to how the whole representation, including
the empirical theory of knowledge that is a part of it, can determinately refer is not a problem
that can be solved by developing more and better empirical theory. (Putnam, 1980,p. 479)
It should be apparent from the foregoing that, in presenting the MTA against
externalism, Putnam doesn't neglect the role of "non-linguistic facts" and
our interactions with the world in securing reference. Rather, he takes such
things to be elements in the sort of internalist picture he endorses. But such
resources cannot aid the cause of externalism, for they fail to determine reference-classes that are 'mind-independent' in the required sense. It is because
the Realist requires such reference-classes that she needs to avail herself of
constraints that go beyond those furnished by our causal, or quasi-causal, interactions with things in our everyday and scientific practice. She has traditionally
relied upon an appeal to 'operational and theoretical contraints', but the
'Skolemisation' argument of 'Models and Reality' purports to show that
such an appeal fails to secure determine reference to classes in THE WORLD,
whatever kinds of entities such classes are taken to comprise - 'theoretical'
entities in science, middle-sized physical objects, or past and future sense-data.
The problem is not the 'Lockean' one of determinately referring to extra-mental
entities given that we have immediate referential access only to our own
"ideas"; it is, rather, the problem of securing determinate reference to any class
of entities that satisfies the Realist's notion of 'mind-independence'. One way
of setting up this problem is indeed in terms of a 'Lockean' epistemology:
assume that we have immediate referential access to our own inner states,
and try to secure determinate reference to external objects that are 'mindindependent' in the Realist's sense. But the problem can be set up equally
well with a Kantian veil rather than a Lockean one: assume that experience
provides us with immediate referential access to spatio-temporal objects,
and try to establish the possibility of determinately referring to noumenal
objects.
The foregoing distinction between Lockean and Kantian veils requires, of
course, that we resist a 'phenomenalist' reading of Kant's talk of spatiotemporal objects as "appearances". I follow, here, Henry Allison's influential
construal of Kant's Transcendental Idealism (Allison, 1983). Allison stresses
the distinction between transcendental and empirical senses of 'mind-dependence', 'appearance', and 'thing-in-itself'. Only in the empirical sense does
the 'mind-dependence' of an entity entail that it is a mental representation
in the Lockean sense; taken transcendentally, the 'mind-dependence' of an
object consists in its being considered qua subject to what Allison terms "epistemic conditions", conditions necessary for the representation of an object
or an objective state of affairs. For beings like ourselves who possess a discursive intellect - who know by bringing intuitions under concepts - something
282
DAVID DAVIES
283
to cut up the "noumenal dough", and to 'causal relations' between these classes
and terms in our language. It will involve, in other words, what Putnam has
termed a "Ready-Made World" (Putnam, 1983, pp. 205 fO.
The 'fortified' version of the canonical response appeals not merely to additional constraints on interpretation deriving from the use of language, but
also to pre-existing structure in THE WORLD. It is because THE WORLD
itself contains the kind of 'mind-independent' reference classes that the Realist
requires that our use of language, whereby terms are associated with members
of those classes, furnishes the additional constraints on reference that can
rule out SAT interpretations of an Ideal Theory. I shall now suggest that the
Kantian reading of Putnam's argument sketched above illuminates Putnam's
reasons for rejecting such a strategy.2
Consider the following kind of counter which Putnam offers to the contention that reference is a causal relation in THE WORLD, and to the further
contention that what he terms 'conceptual relativity' - the relativity of facts
about 'what there is' to choice of conceptual system (see below) - is simply
a matter of there being different ways of carving the 'noumenal dough'. In
"Why There Isn't a Ready-Made World" (Putnam, 1983, pp. 205-228), Putnam
argues that reference cannot be a relation in THE WORLD because it is a
notion whose extension depends upon what we are willing to count as reference in our attempts to understand and explain the linguistic and non-linguistic
behaviour of others: reference is
a flexible, interest-relative notion: what we count as referring to something depends on background knowledge and our willingness to be charitable in interpretation. To read a relation
so deeply human and so pervasively intentional into the world and to call the resulting metaphysical picture satisfactory ... is absurd. (Putnam, 1983, p. 225)
In a similar vein, Putnam (1985, 1988) has recently argued that Realism cannot
account for the phenomenon of conceptual relativity, because any attempt to
appeal to different ways of cutting the 'noumenal dough' falls foul of the
question, of what does the 'dough' itself consist? To offer any answer to this
question is to privilege one among the many categorial frameworks countenanced by the thesis of conceptual relativity.
But why is it not open to the Realist to reply that, while we may not be
able to tell which of our categorial frameworks corresponds to the way in which
THE WORLD divides itself into kinds, this doesn't count against the Realist
contention that there is "a way THE WORLD is"? Why, in other words, is
the damage sustained by Realism in the face of Putnamian arguments for
'conceptual relativity' any more than epistemological damage? (See, again,
Lewis, 1984, for this charge). Exactly parallel considerations apply to the
arguments over reference. Granted that our practices of ascribing referents
are permeated by our explanatory interests, and granted that these features
cannot with any plausibility be imputed to the Realist's WORLD, why should
this reflect upon the nature of reference itself, the thing we attempt to correctly
characterise through such practices?
284
DA VID DAVIES
285
286
DA VID DAVIES
NOTES
I
Hansen claims that his concern is to defend what he terms "minimal realism", and that
Putnam's characterisation of Realism incorporates certain substantive assumptions that neither
belong to nor are implied by minimal realism. But the central tenets of minimal realism, ad
identified by Hansen, seem identical to those of the Realism sketched above - namely, a commitment to "the notion of a mind-independent world and to a non-epistemic notion of truth"
(Hansen, 1987, p. 78).
2 It should be noted that both components of the fortified canonical strategy are necessary if
it is to have a chance of succeeding where its unfortified counter-part failed. As Gregory Currie
(1982) has pointed out, no additional purchase on THE WORLD is obtained by merely asserting
(as Lewis does) that we intend our terms to pick out "elite classes"; for any such talk can be
added to our Ideal Theory, and a SAT interpretation can be given for the IT so augmented.
REFERENCES
Allison, H., 1983, Kant's Transcendental Idealism: An Interpretation and Defense, Yale
University Press, New Haven.
Blackburn, S., 1984, Spreading the Word, Clarendon Press, Oxford.
Brueckner, A., 1984, 'Putnam's Model-Theoretic Argument Against Metaphysical Realism',
Analysis 44, 134-140.
Currie, G., 1982, 'A Note on Realism', Philosophy of Science 49,263-267.
Demopoulos, W., 1982, 'The Rejection of Truth-Conditional Semantics by Dummett and Putnam',
Philosophical Topics 13.1, 135-153.
Devitt, M., 1983, 'Realism and the Renegade Putnam', Nous XVII, 291-301.
Glymour, C., 1982, 'Conceptual Scheming', Synthese 51, 169-180.
Goldman, A., 1986, Epistemology and Cognition, Harvard University Press, Cambridge.
Hacking, I., 1983, Representing and Intervening, Cambridge University Press, Cambridge.
Hansen, C., 1987, 'Putnam's Indeterminacy Argument: The Skolemisation of Absolutely Everything', Philosophical Studies 51, 77-99.
Lewis, D., 1984, 'Putnam's Paradox', Australasian Journal of Philosophy 62, 221-236.
Merrill, G. H., 1980, 'The Model-Theoretic Argument Against Realism', Philosophy of Science
47,69-81.
Putnam, H., 1976a, 'The Locke Lectures: Meaning and Knowledge', in Putnam, 1978, pp.
7-80.
Putnam, H., 1976b, 'Realism and Reason', in Putnam, 1978, pp. 123-140.
Putnam, H., 1978, Meaning and the Moral Sciences, Routledge and Kegan Paul, London.
Putnam, H., 1980, 'Models and Reality', Journal of Symbolic Logic XLV, 464-482.
Putnam, H., 1981, Reason, Truth, and History, Cambridge University Press, Cambridge.
Putnam, H., 1983, Realism and Reason: Philosophical Papers Volume III, Cambridge University
Press, Cambridge.
Putnam, H., 1985, The Many Faces of Realism, Open Court, LaSalle.
Putnam, H., 1988, Representation and Reality, M.I.T. Press, Cambridge.
Putnam, H., 1990, Realism with a Human Face, Harvard University Press, Cambridge.
Quine, Willard Van Orman, 1960, Word and Object, M.I.T. Press, Cambridge.
JEAN LEROUX
II. Helmholtz's century has occasionally been characterized as having witnessed in physics the refutation of Kantian philosophy of science, while
it has indeed been marked by multiple attempted transformations of the
latter. 2 Many physicists, Helmholtz among them, felt the need to emend the
Kantianism in which they had been trained. If German physicists of the first
half of the nineteenth century had been more than willing to discuss philosophical topics, they had also been unanimous in their hostility towards the
Naturphilosophie associated with the then prevailing idealism in German
Universities. 3 Helmholtz himself deplored this unhappy relation which, under
the influence of what he called the "philosophy of identity" of Schelling and
Hegel, had come to exist between philosophy and natural science. Talking
about the generation of physicists that preceded him, Helmholtz remarked
that Hegel's philosophy of nature "seemed, at least to natural philosophers,
absolutely meaningless. Of all the distinguished scientists who were his contemporaries, not one was found to stand up for his ideas".4
On the other hand, Helmholtz claimed to have remained a faithful Kantian
throughout his career, considering that what he had altered in Kant's philosophy of geometry was of secondary importance compared to what had been
Kant's fundamental results. 5 By this, he meant the demonstration of the transcendental character of the principle of causality (as a principle of the
intelligibility of natural phenomena). "The law of causality", writes Helmholtz,
"is in reality a transcendental law, a law which is given a priori. It is impossible to prove it by experience, for [... J even the most elementary levels of
experience are impossible [... J without the law of causality,,;6 nor can it be
287
M. Marion and R. S. Cohen (eds.), Quebec Studies in the Philosophy of Science I, 287-296.
1995 Kluwer Academic Publishers.
288
JEAN LEROUX
289
290
JEAN LEROUX
Helmholtz inaugurated a picture theory that (in Kantian spirit) explains the
possibility of Newtonian mechanics as a description of movements of matter
in space. It is noteworthy that Helmholtz's theory of science bases knowledge of the external world on a single fundamental process of symbol formation
(the "linguistic tum"?) and not, as was the case with Kant, on two mutually
exclusive processes of intuition and intellection.
III. This semiotic conception of sensations is at the center of a complex position
regarding scientific realism. 14 Contrary to the opinion that Helmholtz always
remained a consistent realist, although his realm faltered for a moment in
his later Introduction to the Lectures on Theoretical Physics (1894), I want
to contend that Helmholtz had from the start a more nuanced standpoint on
the question.
IlIa. Already in the Memoir on the ConservatiOn of Force,ls which was his
first article on physics, Helmholtz took up considerations that run counter to
the realist thesis. One important aspect of this anti-realist standpoint concerns
as it were the notion of force in mechanics.
The Memoir starts with general considerations concerning the aim of
science. "The task of the physical sciences", says Helmholtz, "is to discover
laws so that natural phenomena can be traced back to, and deduced from,
general principles" .16 The search for such laws is the task of experimental
physics, while theoretical physics tries to find the causes of natural phenomena
on the basis of their observable effects. We are justified in and indeed compelled to this search for causes by the fundamental principle according to which
all changes in nature must have a sufficient cause. These causes that we
291
attribute to phenomena are variable or invariable; in the latter case, the same
principle that makes us ask for the cause of changes will make us ask anew
for the cause of this variation, so that only the discovery of constant causes
will achieve complete comprehension of natural phenomena. 17
Helmholtz goes on to say that in order to achieve this task, we have recourse
to two abstractions: force and matter. These abstractions are correlative in
the sense that they only have definite meanings in relation to and in combination with each other. A pure force, for instance, would be contradictory
and self-refuting, since it would correspond to a law describing change where
there is nothing to undergo change. In an appendix to the Memoir, Helmholtz
notes that this impossibility to conceive of force and matter without one another
follows simply from the fact that a force always presupposes certain conditions under which it is realized. "A force separated from matter would be
the objectification of a law, which lacked the conditions for its realization".18
To talk about forces in general terms without giving the conditions under which
they are realized is meaningless. This principle motivates the ontological
distrust regarding the diverse forces posited by mechanics, as it also underlies the descriptivist standpoint in mechanics of which Kirchhoff was the
most influential proponent. 19 If he doesn't completely adhere to Kirchhoff's
elimination from physics of the search for causes and causal explanations,
Helmholtz, on the other hand, subordinates the notions of force and cause
to that of law. We only speak of causes and forces, he explains, when we
recognize them as independent of our will. What is independent of our will
in tum, can only be recognized when we are able to recognize the uniform
effects of our will. Hence, lawlikeliness is the essential presupposition of
what has the character of reality.
In short, the Helmholtz of the Memoir acknowledges the reality of causes
because he judges that the category of causality is necessary to our comprehension of natural phenomena. However, his realism about causality is a critical
realism since our knowledge of causes always remains hypothetical and subject
to revision. As for causes themselves, they are real insofar as they are effective (Effectivity - Wirklichkeit - being the German word for reality). If it is
true that Helmholtz, by the time of Memoir, considered central forces as objective, this reality of forces as causes of movement nevertheless reduced to
the reality of laws. This brand of realism constitutes metaphysical realism
and not scientific realism, since it is the realm of laws, and not laws themselves, that, for Helmholtz, is devoid of hypothetical character.
IIIb. Helmholtz's scepticism about forces is accentuated in his Introduction
to the Lectures on Theoretical Physics, where he draws close to the descriptivist view alluded to above. Calling upon the tradition of Faraday, Maxwell
and Kelvin, and using Kirchhoff's own words, he states that the diverse laws
of movement constitute only a description of observable events, while adding
that a complete and simplest possible description of natural phenomena cannot
do without dynamical forces. Helmholtz states that he prefers to talk about
292
JEAN LEROUX
"laws of forces" and not "forces", since, according to the descriptivist view,
the notion of force, qua substance, has no factual content. "In a factual sense,
to be sure, we express nothing more and nothing less by this abstraction than
what is contained in the mere description of the phenomena.,,2o This instrumentalist standpoint, which only sees the necessity of the notion of force in
the context of prediction and in view of the economy of theoretical description, foreshadows indeed the logical empiricists' discussion of "theoretical
terms" or "auxiliary expressions" and their attempted syntactical and semantical elimination. I refer here of course to the use that Carnap and Hempel have
made of W. Craig's and F. P. Ramsey's formal results on definability within
first-order axiomatic systems. The idea that a force, that is, a capacity to
produce effects, cannot be conceived of independently of the effects it
produces, finds it definitive formulation in Helmholtz's thesis that a force,
separated from its conditions of realization, has no factual content. Logical
Empiricism was later to take over this notion of factual content and develop
it as a metatheoretical notion generalizable to entire theories. The factual
content of a physical or empirical theory was either defined in syntactical terms
as the class of all observational logical consequences of the theory's axioms,
or in semantical terms as the class of all observational substructures of the
models of the theory. It is interesting to note that already for Helmholtz, the
notion of factual content or empirical content was a semantic notion: Helmholtz
used the term "factual meaning".
HIc. The Introduction to the Lectures brings forward new considerations
enabling Helmboltz to present more systematically his views on the ontological status of forces. The systematic character of his standpoint lies in the
fact that it is now fully integrated into a theory of language. In the first Section
of the Lectures, Helmholtz explains that we are naturally inclined to use the
substantive mode of expression and, instead of speaking of "laws of forces"
- that term being closer to the factual meaning - we prefer to speak substantially of forces acting in a specific way. "As long as its meaning was not
completely clear", writes Helmholtz, "this mode of expression caused many
errors. More specifically, the abstract substantive force was thought to denote
something actually existent, and some believed that they were entitled to make
statements about the real properties of forces. [...] Of this hypothetical substantive, for that is what we must consider a force to be, we know nothing
except that it lies in its nature to produce a specific effect".21
We knew that forces were idealizations: we now know that they are no
more and no less than the substantivization of laws of force, and this
substantivization leads us to adopt the realist stance. This explanation of
realism as originating in the use of language (to adopt the substantive
mode as "preferred mode of expression") and as "leading to statements about
the real properties of forces that, however, either are tautologies or have only
an apparently real content", prefigures Carnap's distinction between the
material mode of speech (inhaltliche Redeweise) and the formal mode of
293
294
JEAN LEROUX
295
books. These were faint echoes of what they said in private letters, reviews, addresses, periodicals, and elsewhere" (p. 27).
4
'Uber das Verhiiltnis der Naturwissenschaften zur Gesamtheit der Wissenschaft' (1862),
p. 126 in Kahl, Selected Writings, trans. as 'The Relation of the Natural Sciences to Science in
General', pp. 122-143.
5 However, what Helmholtz had emended could barely be considered of secondary importance: it had to do with the epistemological status of the axioms of geometry. Appealing to
previous work done by Beltrami and reaching to results on n-dimensional manifolds that Riemann
had obtained just before, Helmholtz showed that physical geometry is based on certain physical
facts and constitutes as such an empirical science. Eventually, Helmholtz admitted that space
could be considered as an a priori form of intuition; but the question as to whether this space
is Euclidian or not (a question that is involved in the choice of axioms of geometry) remained
for him an empirical question.
6 'Die Tatsachen in der Wahrnehmung' (1878), p. 390 in Kahl, Selected Writings, trans. as
'The Facts of Perception', pp. 366-408.
7
Handbuck der physiologischen Optik, III (1867), p. 33 in J. P. Southhall ed., Helmholtz's
Treatise on Physiological Optics, III. New York: Dover, 1962.
8 "What we unquestionably can find as a fact, without any hypothetical element whatsoever,
is the lawful regularity of the phenomena. From the very first, in the case where we perceive
stationary objects distributed before us in space, this perception involves the recognition of a
uniform or lawlike connection between our movements and the sensations which result from
them. Thus even the most elementary ideas contain a mental element and occur in accordance
with the laws of thought" ('The Facts of Perception', p. 386 in Kahl, Selected Writings).
9 'The Facts of Perceptions', p. 372 in Kahl, Selected Writings.
JO
Helmholtz talks about "unconscious conclusions" that are the product of inductive judgements. This is tied with Helmholtz's associational account of the processes underlying perception.
With the benefit of hindsight, one would judge Helmholtz's conception of logic, especially his
associative account of inference, as psychologistic. As Hatfield (The Natural and the Normative,
pp. 207) remarks: "Retrospectively describing his {Helmholtz} account of the inductive inferences of perception, he asserted that they depend not merely on the laws of association but
also on the Kantian 'law of causality' ".
II
Cf. 'Uber das Ziel und die Fortschritte der Wissenschaft' (1869), trans. as 'The Aim and
Progress of Physical Science', pp. 223-245 in Kahl, Selected Writings: "From these [Miiller's
law of the specific energy of the senses] and related facts we are led to the very important
conclusion that our sensations are, insofar as their quality is concerned, only signs of external
objects, not images with any sort of resemblance to them. An image must be similar in some
respect to an object. [... ]. For a sign, it is sufficient that it appear whenever that which it
signifies makes an appearance, the correspondence between them being restricted to their
appearing simultaneity. The correspondence existing between our sensations and the objects
producing them is precisely of this kind. Sensations are signs which we have learned to decipher.
They form a language given to us by our physical make-up, a language by which external
objects speak to us. It is, however, a language, which, like our mother tongue, we can learn
only by practice and experience" (pp. 241-242).
12
Die Prinzipien der Mechanik (1894), pp. 1-2 in The Principles of Mechanics, trans. by D.
E. Jones and J. T. Walley. New York: Dover, 1956.
13
This translation differs somewhat from Kahl's (p. 388 in Selected Writings). Cf. H. von
Helmholtz, Die Tatsachen in der WahrnehmunglZiihlen und Messen. Darmstadt: Wissenschaftliche Buchgesellschaft, 1959, p. 45.
14
Helmholtz's stand on the question is surely more complex than the above quoted main
contention from Kahl would lead one to believe, for Kahl feels the need to add: "Sometimes,
especially in the Introduction to the Lectures on Theoretical Physics, his [Helmholtz] scientific realism faltered for a moment and he favored the position that to use abstract substantives
such as force was only a far,;on de parler. It was convenient and consistent with the natural,
common-sense realism of everyday life; but a formulation, for example, of a law of motion
296
JEAN LEROUX
using the tenn force ("the force of gravity") was, he maintained at these times, empirically no
more than a description of motion" (Selected Writings, p. xx).
15
'Uber die Erhaltung der Kraft' (1847), pp. 12-75 in Wissenschftliche Abhandlungen,
I(Leipzig: Barth, 1882}, also reproduced as monograph, Uber die Erhaltung der Kraft. Weinheim:
Physik-Verlag, 1883, trans. as 'The Conservation of Force: A Physical Memoir', pp. 3-55 in
Kahl, Selected Writings.
16
Memoir, p. 3 in Kahl, Selected Writings.
17
One of the major point of the memoir was indeed to insist on the necessity of reducing all
natural phenomena to invariant forces of attraction and repulsion, to central movement forces,
the intensities of which are dependent only upon the mutual distance of material bodies.
IS
Memoir, Appendix I, pp. 49-50. in Kahl, Selected Writings.
19
Cf. Gustav Kirchhoff, Vorlesungen iiber mathematische Physik: Mechanik. 2nd. ed., Leipzig,
1877. Preface: "We usually define Mechanics as the science of forces, and forces, as causes which
produce movements or have the capacity to produce movements. [... ]. However, [this definition] is beset with the obscurity inherent in the concepts of cause and capacity. Given the rigor
of demonstrations otherwise obtained in Mechanics, it has appeared desirable to me to remove
such obscurities, even if this proves to be only feasible at the cost of restricting its assigned
task. Hence I give to Mechanics the task to describe the movements that occur in nature, in a
complete and simplest possible way. By this, I mean that Mechanics has soley to detennine
what are the phenomena that occur, and not to search for their causes." (My translation)
20
Einleitung zu den Vorlesungen iiber theoretische Physik [Koenig & Runge (eds.). Leipzig:
Barth, 1903], p. 521 in Kahl, Selected Writings (trans. as 'Introduction to the Lectures on
Theoretical Physics', pp. 513-529).
21
Ibid., pp. 521-4.
22
Cf. Rudolf Carnap, 'On the Character of Philosophical Problems', Philosophy of Science I
(1934) 5-19.
23
E. Nagel, The Structure of Science. 2nd ed., Indianapolis: Hackett, 1979, p. 152.
24
'The Facts of Perception, Appendix 1', in Kahl, Selected Writings, p. 398.
25
"Undoubtedly the realistic hypothesis is the simplest that can be fonnulated. [...]. We cannot
assert that it is necessarily true, for opposed to it there is always the possibility of other irrefutable
idealistic hypotheses. [... ]. It is always well to keep this in mind in order not to infer from
the facts more than can be rightly inferred from them. The various idealistic and realistic interpretations are metaphysical hypotheses which, as long as they are recognized as such, are
scientifically completely justified. They may become dangerous, however, if they are presented
as dogmas or as alleged necessites of thought" (,The Facts of Perception', p. 385-6 in Kahl,
Selected Writings).
26
The Treatise on Physiological Optics, which closes on a chapter dealing with "The
Foundations of the Empirical Theory" is eloquent in that respect: "I acknowledge that we are still
far from a real scientific comprehension of psychic phenomena. We may agree with the idealistic philosophers [Spiritualisten] and take the ground that it is absolutely impossible to
comprehend them, or we may take precisely the contrary view of the materialistic school,
according as we are inclined toward one speculation or the other. The natural philosopher must
stick to the facts and try to find out their laws; and he has no means of deciding between these
two kinds of speculation, because materialism, it should be remembered, is just as much a
metaphysical speculation or hypothesis as idealism, and therefore it has no right to decide
about matters of fact in natural philosophy except on a basis of facts. [... ] It is safer, in my
opinion, to connect the phenomena of vision with other processes that are certainly present
and actually effective [... ] instead of trying to base these phenomena on perfectly unknown
hypotheses [... ] which have been invented for the purpose and have no analogy of any sort"
(p. 532). On Helmholtz's methodological empiricism (in relation with his attack on nativistic
theories of vision), see Hatfield, The Natural and the Normative, Chapter 5, Section 2.
27
Ibid., p. 19.
WILLIAM R. SHEA
INTRODUCTION
The power of water was used in several earlier civilizations but it was nowhere
exploited so widely as in medieval Latin Christendom. In England in 1086
there were 5,624 water-mills south of the Trent and Severn. Given the population at the time, this is about one to every fifty households, enough to make
a profound difference in lifestyle. In the eleventh century, the tide-mill was
297
M. Marion and R. S. Cohen (eds.), Quebec Studies in the Philosophy of Science 1,297-307.
1995 All rillhts reserved.
298
WILLIAM R. SHEA
299
300
WILLIAM R. SHEA
301
is to possess?".10 These texts were often quoted in the sixteenth century, for
instance by writers like Henricus Cornelius Agrippa, who published his De
incertitudine et vanitate omnium scientiarum et artium in 1530. The Ovidian
theme is also echoed in the two greatest epic poems in English: Edmund
Spencer's Faerie Queene (1595) and John Milton's Paradise Lost (1667).
Spencer laments the day when mining began:
Then gan a cursed hand the quiet wombe
Of his great Grandmother with steele to wound,
And the hid treasures in her sacred tombe
With Sacrilege to dig ... 11
Milton describes "bands of pioners with Spade and Pickaxe" who, led by
Mammon,
Ransacked the Center, and with impious hands
Rifled the bowels of their mother Earth
For Treasures better hid. Soon had his crew
Opened into the Hill a spacious wound
And Diged out ribbs of Gold l2
302
WILLIAM R. SHEA
for most of the illustrators, they rose to the challenge and produced pictures
that are both pleasing to the eye and accurate in the rendering of technical
details. The importance of illustrations in conveying a precise idea of nature
and functions of the parts of a machine is obvious, and it is surprising that
these three great books should have appeared so late. The delay in publishing
sketches of mechanical inventions may have something to do with a world that
knew little of patents or copyrights, and in which inventors had no interest
in publicizing their work for others to plagiarize.
Closely related to the growing interest in machines was the increased
number of automata based or developed from the models found in antiquity.13 The singing birds of Philo and Hero had been powered by compressed
air or steam. An important innovation of the sixteenth century that made
possible the reproduction of sound within a self-contained unit was the
revolving pinned barrel or cylinder. The action of pins or pegs attached to
the circumference of the cylinder or barrel at right angles to the axis could
be transmitted some distance by means of simple levers as the cylinder
revolved. If these levers were placed in contact with valves of organ pipes,
the pipes would sound for as long as the pins continued to make contact
with the levers. This invention made possible the completely mechanical
performance of automatic sounding instruments. One of the earliest applications of this idea can be found in an organ clock that was presented as a gift
from Queen Elizabeth to the Sultan of Turkey in 1599.
The fascination with machinery is illustrated in the writings of Renaissance
engineers such as Leonardo da Vinci (1452-1519) and Francesco di Giorgio
Martini (1439-1501). By the time Montaigne went on an extended tour of
Switzerland, Southern Germany and Italy in 1580-1581, it had become fashionable to be on the look-out for technological innovations, especially if they
had entertainment value. In his Journal de voyage, Montaigne notes practical devices for hoisting and distributing water, and he particularly admired
the fine display of fountains at the Villa d'Este in Tivoli. He comments enthusiastically on the hydraulic organs that played music to the accompaniment
of the fall of water and devices that imitated the sound of trumpets. He relates
how birds began to sing and how, when an owl appeared on a rock, the birdsong ceased abruptly. He does not seem to have realised that this sequence was
borrowed from Hero of Alexandria's Pneumatics which Federico Commandino
had made available in a Latin translation in 1575. The rest of Europe sought
to emulate Italian achievements and Henri IV borrowed from the Grandduke
Ferdinand I (1551-1609) the services of Tommaso Francini and his brother,
Alessandro, to design the water-works at Saint-Germain-en-Laye. Their creations were to inspire Descartes who either saw them personally or read about
them in Salomon de Caus' illustrated La raison des forces mouvantes avec
diverses machines tant utiles que plaisantes ausquelles sont adjoints plusieurs
desseings de grotes et fontaines (Frankfurt, 1615).
303
The Italian artisans, much more given to reading than their ancestors, were
readier to look for theoretical explanations and to expect improvement from
a knowledge of mathematics. To build his cupola in Florence Brunelleschi
needed the guidance of theory, and the guidance of theory presupposed faith
in theory. Renaissance projects were impressive and needed master-builders
but it is important to remember that megaprojects, as such, do not lead to a
knowledge of the laws of nature. The Egyptian builders of the pyramids left
no treatises on statics and no writings on the resistance of materials. The
Renaissance books on machines stress how important it is to apply theoretical considerations to machines as opposed to merely putting them together
by rule of thumb. Craft knowledge had been craft secret. Vannoccio Biringuccio
inveighed against this and called for openness in science. The medieval
craftsman suffered from lack of familiarity with any theoretical notions and
from a narrow, ingrained specialization that there was only one way of doing
things rightly. Biringuccio realized that an analysis of the action of machines
is essential to the realization of their full potential.
The spokesmen of this new school of thought were numerous but none
was more eloquent than the Lord Chancellor, Francis Bacon. In the opening
lines of the Novum Organum he wrote:
I. Man, as the minister and interpreter of nature, does and understands as much as his observations on the order of nature, either with regard to things or the mind, permit him, and neither
knows nor is capable of more.
II. The unassisted hand and the understanding left to itself possess but little power. Effects
are produced by the means of instruments and helps, which the understanding requires no less
than the hand; and as instruments either promote or regulate the motion of the hand, so those
that are applied to the mind prompt or protect the understanding.
III. Knowledge and human power are synonymous. I.
Bacon stressed that knowledge was power and that nature was not only
to be modified but improved. In a significant passage of his Advancement of
Learning, he declares,
But if my judgment be of any weight, the use of history mechanical is of all others the most
radical and fundamental towards natural philosophy; such natural philosophy as shall not vanish
in the fume of subtile, sublime, or delectable speCUlation, but such as shall be operative to the
endowment and benefit of man's life: for it will not only minister and suggest for the present
many ingenious practices in all trades, ... but further, it will give a more true and real illumination concerning causes and axioms than is, hitherto attained. IS
BACONIAN IDEALS
Knowledge confers power and for the Lord Chancellor power was to be used
for the establishment of the kingdom of man over nature. This utilitarianism
has been illuminated in recent years from a rather unexpected source, the
hermetic tradition. The intent to manipulate nature for useful ends was a goal
304
WILLIAM R. SHEA
305
organism, to intervene in nature was to act upon entities endowed with their
own vital forces. Now it is only a matter of winding up, regulating and oiling
the clockwork mechanism. 20
NOVEL TY AND PROGRESS
The mechanical philosophy denied that the world was an organism and as a
consequence repudiated any attempt to explain change as an endeavour to
realize forms not yet existing. The quest for tendencies or final causes was
denounced as misguided and true knowledge was identified with the discovery
of the mechanical structure that produced a specific kind of motion. This vastly
enlarged man's possibility of acting on nature because the laws that governed
the world as a whole ceased to be different from those with which he was
familiar in machines.
The fascination with clockwork figures lay in the demonstration that the
actions of living creatures could be mimicked by the simplest movements of
springs, cords and levers. There is no doubt of their influence on Descartes'
understanding of animals. In his Treatise on Man, he refers to water-powered
automata to explain the operations of the human body which, he claims, is
nothing but a machine:
Indeed, one may compare the nerves of the machine I am describing with the pipes in the
works of these fountains, its muscles and tendons with the various devices and springs which
serve to set them in motion, its animal spirits with the water which drives them, the heart with
the source of the water, and the cavities of the brain with the storage tanks. Moreover, breathing
and other such activities which are normal and natural to this machine, and which depend on
the flow of the spirits, are like the movements of a clock or mill, which the normal flow of
water can render continuous. 21
306
WILLIAM R. SHEA
ical philosophy would not have been given the enthusiastic welcome it
received.
McGill University
NOTES
A. Rupert Hall, 'The Changing Technical Act', Technology and Culture 3 (1962), p. 513.
The nature of the contribution of technology to the rise of the mechanical philosophy remains
a moot question and forms a subsection of the ongoing debate on the cultural antecedents of
the scientific revolution of the seventeenth century. Leonardo Olschki in Geschichte der
neusprachlichen wissenschaftlichen Literatur, 3 vols. (Leipzig 1919-1922, Halle, 1927), and
Edgar Zilsel in 'The Sociological Roots of Science', American Journal of Sociology 47 (1942),
pp. 544-562, claimed that artisans, because of their respect for handiwork, played an important role in the development of a more empirical viewpoint. This view is generally considered
exaggerated although it has been revised and defended with qualifications by more recent scholars
such as Paolo Rossi in I filosofi e Ie macchine (Milan, 1962), A. Rupert Hall, 'The Scholar
and the Craftsman in the Scientific Revolution', in M. Clagett (ed.), Critical Problems in the
History of Science, Madison, 1959, pp. 3-23, and 'The Changing Technical Act', Technology
and Culture 3 (1962), pp. 501-515. More general reassessments are to be found in Alexander
Keller, 'Has Science Created Technology', Minerva XXII (1984), pp. 160-182 and Richard S.
Westfall, 'Robert Hooke, Mechanical Technology and Scientific Investigation', in John G. Burke
(ed.), The Uses of Science in the Age of Newton, Berkeley, 1983, pp. 85-110.
3
Works that carry sections on the period with which we are concerned include Abbott Payson
Usher, A History of Mechanical Inventions, revised edition, Cambridge, MA, 1954; Charles
Singer, E. J. Holmyard, A. Rupert Hall, and Trevor I. Williams (eds.), A History of Technology,
especially vol. II, The Mediterranean Civilization and the Middle Ages A.D. 1500 and vol.
III, From the Renaissance to the Industrial Revolution I500-f. 1700, Oxford, 1957; T. K.
Derry and Trevor I. Williams, A Short History of Technology, Oxford, 1960; U. Forti, Storia
della tecnica italiana, Florence, 1940; George Basalla, The Evolution of Technology, Cambridge,
1988; Samuel Lilley, Men, Machines and History: the Story of Tools and Machines in Relation
to Social Progress, New York, 1966.
4
I quote from John Dryden's seventeenth-century translation of Plutarch's Lives revised by
Arthur Hugh Clough. New York: Modern Library, no date, being a reprint of Clough's edition
of 1864. The passages are from The Life of Marcellus XIV, 4, and XVII, 3-4.
5 Ibid.
6 Quoted in Ivor Thomas (editor and translator), Greek Mathematical Works, 2 vols. Loeb
Classical Library. Cambridge, MA, 1941, reprinted 1981, vol. 2, p. 615.
7 Historia Naturalis, book XXXIII, 1. Pliny, Natural History, translated by H. Rackham (Loeb
Classical Library). Cambridge, MA, 1968, vol. 9, p. 3.
8 Historica Naturalis, book XXXIII, 1, ibid., p. 5.
9 Metamorphoses, book I, lines 137-150.
10
Naturales Questiones, book V, 15.3, in Seneca, Naturales Questiones, translated by Thomas
A. Corcoran (Loeb Classical Library). Cambridge, MA, 1972, vol. 2, p. 103.
II
Edmund Spencer, The Faerie Queene, book II, canto 7, stanza 17.
12
Paradise Lost, book I, lines 686-690.
13
The role of automata is analysed by Derek J. De Solla Price in 'Automata and the Origins
of Mechanism and Mechanistic Philosophy', Technology and Culture 5 (1964), pp. 9-23, and
Silvio A. Bedini, 'The Role of Automata in the History of Technology', Technology and Culture
5 (1964), pp. 24-42. On the influence of Hero, see the excellent survey by Marie Boas, 'Hero's
Pneumatica: A Study of its Transmission and Influence', Isis 40 (1949), pp. 38-48.
14
Novum Organum, book I, aphorisms 1-3, in The Works of Francis Bacon, edited by James
I
307
Spedding, Robert Leslie Ellis and Douglas Denon Heath, 13 vols. London, 1857-1874, vol. I,
p. 157.
15
Francis Bacon, Advancement of Learning (Everyman's Library). New York, 1965, pp. 72-73.
16
Ibid., vol. III, p. 156.
17
Ibid., p. 159.
18
The Philosophical Works of Descartes, translated by I. Cottingham, R. Stoothoff and D.
Murdoch, 2 vols. Cambridge, MA, 1985, vol. I, pp. 142-143.
19
Ibid., p. 142.
20
The significance of the evolution of technology and the mechanical philosophy is discussed
by J. A. Bennett, 'The Mechanic's Philosophy and the Mechanical Philosophy', History of Science
XXIV (1986), pp. 1-28. Other important studies include Andre Leroi-Gourhan, Evolution et
technique, Paris, 1971; Gilbert Simondon, Due mode d'existence des objets techniques, Paris,
1958; P. M. Schuhl, Machinisme et philosophie, Paris, 1947. An excellent survey is Marie
Boas' 'The Establishment of the Mechanical Philosophy', Osiris 10 (1952), pp. 412-541.
21
The Philosophical Works of Descartes, vol. I, pp. 100-101.
22
Letter to the Marquis of Newcastle, 23 November 1646, Oeuvres de Descartes, edited by
C. Adam and P. Tannery. Paris, 1897-1913, reprinted 1956-1973, vol. IV, p. 575.
23
Robert Boyle, The Origin of Forms and Qualities According tgo the Corpuscular Philosophy
in Selected Philosophical Papers of Robert Boyle, edited by M. A. Stewart, Indianapolis, 1991,
p. 71, A Free Enquiry into the Vulgarly Received Notion of Nature, Section II, ibid., pp. 190-191.
24
See, for instance, An Essay Containing a Requisite Digression, Concerning Those That Would
Exclude the Deity from Intermeddling with Matter, ibid., pp. 160, 170, 174.
2S
The Narratio Prima was reprinted by Kepler as an appendix to his Mysterium Cosmographicum in 1596. I quote from Edward Rosen's translation in Three Copernican Treatises, 3rd
edition. New York, 1971, p. 137.
310
311
312
books include Names for Things (1982) and A Border Dispute (1986). With
G. E. Reyes he has edited The Logical Foundations of Cognition (1994).
MATHIEU MARION, born in Montreal (1962), studied philosophy at the
Universite de Montreal (B.A., 1984 and M.A. 1986) and at New College,
Oxford (D. PhiL, 1992). He was a Visiting Scholar at the Department of
Logic & Metaphysics of the University of St. Andrews in 1991-92, a Research
Associate at the Center for the Philosophy and History of Science at Boston
University in 1992-93 and a Post-Doctoral Fellow at the Universite de
Montreal in 1993-94. He is currently Assistant Professor at the University
of Ottawa. His fields of interest are Wittgenstein, Philosophy of Mathematics
and Philosophical Logic. He has published a number of articles and he is
the author of Wittgenstein, Finitism, and the Foundations of Mathematics
(forthcoming).
JEAN-PIERRE MARQUIS, born in Quebec City (1961), studied philosophy
at McGill (Ph.D. 1988) and did post-doctoral research in Paris with the
REHSEIS team and then spent a year at the Center for Philosophy of Science
in Pittsburgh as a Research Fellow. His interests are Philosophy of Logic,
Philosophy of Mathematics, Philosophy of Science and General Epistemology.
He has published articles in topics in Philosophy of Science and Philosophy
of Logic and Mathematics.
STORRS McCALL, born in Montreal in 1930, studied philosophy at McGill
and at Oxford, where he obtained a B. Phil. and a D. Phil. He taught at
McGill from 1955 to 1963, then over the next 11 years spent 6 years at the
University of Pittsburgh and 5 years at Makerere University in Uganda, where
he started courses of formal instruction in Philosophy in 1965, and where he
was later joined by Andre Gombay and Ian Hacking. In 1974--75 he worked
on defining the concept of quality of life (QOL) in the Department of
Environment in Ottawa, following which he returned to McGill. He is the editor
of Polish Logic 1920-1939, and is the author of Aristotle's Modal Syllogisms
(1963) and A Model of the Universe (1994).
JUDITH PELHAM was born in Montreal (1961), although her family's roots
are dug into Nova Scotia soil. She completed her B.A. at Dalhousie University,
her M.A. at Simon Fraser University and her Ph.D. at the University of Toronto.
She is interested in Russell's early work in Philosophy and Logic, and in
resolutions to the paradox, as well as Philosophical Logic generally. She is
at present Assistant Professor of Philosophy at Concordia University.
PAUL M. PIETROSKI, born in the U.S. (1964), studied History at Rutgers
University (1986), and received his Ph.D. in Philosophy from the Massachussetts Institute of Technology (1990). He is an Assistant Professor of
Philosophy at McGill University. His primary interests are in the Philosophy
313
of Mind and Language, and in particular, the structure of intentional explanation. He is currently working on a manuscript, Idealization and Intentional
Explanation, which draws together a number of articles dealing with mental
content, mental causation, the semantics of opaque constructions, and the
role of ceteris paribus clauses in various explanatory enterprises.
GONZALO E. REYES, born in Santiago, Chile (1937), studied Electrical
Engineering and Mathematics at the University of Chile (Santiago), and Logic
and Methodology of Sciences at the University of California (Berkeley). He
holds the degrees of Licenciado en Matematicas (University of Chile, Santiago,
1961) and Ph.D. in Logic and Methodology of Sciences (University of
California, Berkeley, 1967). He is Professor of Mathematics at the Universite
de Montreal. He has been Visiting Professor in several Universities including
Louvain-Ia-Neuve (Belgium), Lille (France), Sydney (Australia) and the
Catholic University of Chile (Santiago). His current interests include applications of Category Theory to Modal Logic and to Semantics of Natural
Languages. He has co-authored with M. Makkai First-Order Categorical Logic
and with I. Moerdijk Model for Smooth Infinitesimal Analysis. He has co-edited
with J. Macnamara The Logical Foundations of Cognition and has published
many articles.
WILLIAM R. SHEA, born in the province of Quebec (1937), holds a Ph.D.
in Philosophy from the University of Cambridge. He is Professor of History
and Philosophy of Science at McGill University. He is a former Fellow of
Harvard University and the Institute for Advanced Studies in Berlin. He was
Directeur d'etudes at the Ecole des Hautes Etudes en Sciences Sociales de Paris
and a Visiting Professor at the University of Rome. He is a Fellow in the Royal
Society of Canada and in 1993 he was awarded the Koyre Medal, the most
prestigious award of the International Academy of the History of Science.
His main interest is the History and Philosophy of the Scientific Revolution.
He is the author of Galileo's Intellectual Revolution and The Magic of Numbers
and Motion, The Scientific Career of Rene Descartes. He has edited several
books including Reason, Experiment and Mysticism in the Scientific Revolution; Nature, Experiment and the Sciences, Creativity in the Arts and Sciences;
and Nature Mathematized.
DANIEL VANDERVEKEN, born in Belgium (1949), received a PhD in
Philosophy at the Universite Catholique de Louvain in 1974. He was First
Researcher at that University (1970-71) and at the Fonds National Beige de
la Recherche Scientifique (1971-75). Later, he has been Post-Doctoral
Research Fellow at the University of California at Berkeley, thanks to the
NATO Scientific Committee (1975-76) and to the Institute of Humanities of
Berkeley (1976-77). He is Professor and Director of the Research Group in
Analytic Philosophy at the Universite du Quebec a Trois-Rivieres. His primary
research interests are in Logic and Philosophy of Language, Speech Act theory
314
and Universal Grammar. He was the co-author with John Searle of Foundations
of Illocutionary Logic (1985). He is also the author of Meaning and Speech
Acts, Volume 1: Principles of Language Use, Volume 2: Formal Semantics
of Success and Satisfaction (1990-91).
NAME INDEX
Bach, E. 67
Bacon, F. 303, 304, 306n
Baer, R. 211n
Bai-Lin, H. 258n
Barnes, I. 67
Barr, M. 87, 93n
Barwise, I. 41, 55, 62, 67
Basalla, G. 306n
Bedini, S.A. 306n
Bell, E.T. 201, 202, 208n, 21On, 212
Bell, I.L. 93n, 94n, 217, 219-224, 226n, 235,
237
Bellmam, R.E. SI, 93n
Beltrami, E. 295n
Benacerraf, P. 181, 182
Bencivenga, E. 1,8, 19n-21n, 21
Bennet, I.A. 307n
Berkeley, G. 76
Bernays, P. 135, 136-13S, 144, 145, 147,
149, 150, 156, 157, 160, 175n, 177n179n, ISln, 182, 185
Besson, I. 301
Bhargava, M. 71, 78
Biringuccio, V. 300, 301, 303
Birkhoff, G. 211n, 212
Bismut, I.-M. 207
Black, M. 146, 147, 177n, 183
Blackburn, S. 277, 286
Blais, M. xi, 309
Campbell, K. 231
Cantor,G. 114, 116,117,138,148,149,152,
153, 157-159, 161, 168, 177n-179n,
181n, 182, 190, 191,205, 207n-209n,
212, 232
Carnap,R.95,9S, 100, 105, 175n, 292,296n
Carson, E. 174n
Cartwright, N. 260, 262, 269, 270, 271n,
272
Cassels, J. W. S. 212
Cassirer, E. 294n
Cauchy, A. 114, 143, 177n
Caus, S. de 302
Chevalley, C. 202, 211n, 212
Church, A. ix, 24, 38, 97, 105, 117, 135,
IS2
Chwistek, L. 24
Cini, M. 222, 226n
Clauser, I.F. 220, 221, 226n, 237, 241
Claverie, P. 21S, 226n
315
316
NAME INDEX
NAME INDEX
Girard, J.-Y. 109, 122
Gleick, J. 258n
Glymour, C. 278, 280, 286
Gooei, K. 43, 70, 114, 116, 171, 177n, 178n,
180n, 183
Goldblatt, R.I. 93n
Goldman, A. 277, 286
Gombay, A. 312
Goodman, N. 259, 272
Gordan, P. 200, 201, 210n, 213
Grangier, P. 226n
Greenberger, D.M. 24On, 241
Grothendieck, A. 121, 207
Grzegorczyk, A. 28, 39
Gupta, A.K. 60, 68
Hacking, I. 277, 279, 284, 286, 312
Hailperin, T. ix, 18n
Haken, H. 258n
Hall, R.A. 297, 306n
Hallett, M. x, xi, 172, 174n, 175n, 177n,
179n, 181n, 183, 310
Hansen, e. 278, 285, 286n, 286
Hanson, N.R. 263,272
Hart, H. 263, 272
Hasse, H. 189, 202-204, 208n, 211n, 212
Hatfield, G. 294n-296n
Hausdorff, F. 82
Hawtrey, R. 131
Healey, R. 229, 230, 241
Heck, R. 174
Hecke, E. 189, 203, 211n
Hegel, G.W.F. 75, 287
Heidegger, M. 76
Heine, H. 192
Heisenberg, W. 226, 240, 241, 246
Hellman, G. 239, 241
Helmholtz, H.L.F. von 287-293, 294n, 295n
Hempel, C. 261, 272, 292
Henkin, L. 25, 27, 28, 37, 38n, 39
Hensel, K. 189, 191, 193, 195,203, 209n,
211n
Heraclitus 75, 189
Herbrand, J. 108, 109, 116, 122, 21On, 211n,
213
Hermite, e. 208n
Hero (of Alexandria) 299, 302, 306n
Hertz, H. 289
Heyting, A. 70, 116
Hilbert, D. xi, 107, Ill, 115, 135-138, 140,
141,143-174, 175n-181n, 184, 189-191,
194, 198,200, 201-203, 205, 208n, 210n,
213, 214, 309
Hiley, B. 222, 226n
317
318
NAME INDEX
Lambert, K. 1, 2, 8, 18n, 21
Lang, S. 207, 211n, 214
Langlands, R.P. 189, 203, 206, 214
Lapierre, S. x, 39, 46, 55, 310
Laplace, P.S. Marquis de 243, 248, 252,
257n
Lasker, E. 207n
Lawvere, F.W. 57,71,78,88, 94n
Laymon, R. 262, 273
Lebesgue, H. 177n
Leblanc, H. ix, x, 18n, 21, 22,311
Legendre, A.M. 209n
Leibniz, G.W. 139, 194
Lejeune-Dirichlet, P.-G. see Dirichlet
Lenin (V.1. Oulianov) 294n
Leonard, H.S. 1,7, 18n, 19n, 22
Lepage, F. x, 39, 311
Leroi-Gourhan, A. 307n
Leroux, J. xi, 311
Lesniewski, S. 28
Lewis, D.K. 52, 55, 98, 101,277,278,282,
283, 286n, 286
Lilley, S. 306n
Lindeman, C.L.F. 208n
Lindenbaum, A. 71, 82
Lindstrom, P. 41, 55
Lipschitz, R. 210n
Loewer, B. 231, 232, 239, 240
Lorentz, H.A. 223, 231
Lorenz, E.N. 249, 250, 258n
Lowenheim, L. 178n, 277
Lukasiewicz, J. 59, 65, 67, 68, 94n
Macaulay, F.S. 207n, 214
Machover, M. 174n
Macintyre, A. 212n
MacLane, S. 68, 93n, 94n, 21ln, 212
Macnamara, J. x, 68, 77, 78,311,313
Majer, U. 174n
Makkai, M. 68, 93n, 174n, 313
Mandelbrot, B. 258n
Marion, M. xi, 174n, 209n, 214, 215, 312
Marquis, J.-P. x, 312
Mascheroni, L. 161, 179n, 180n
Mawanda, M.M. 93n
Maxwell, J.e. 291
May, R.M. 258n
Mayberry, J. 174n
Mazur, B. 209n, 211n, 215
McCall, S. xi, 76, 78, 229, 241, 312
McCommach, R. 294n
McLarty, e. 93n
McMullin, E. 241
Mendelson, E. 135, 186
Menn, S. 174n
Mermin, D. 235, 240n, 241
Merrill, G.H. 278, 286
Meschowski, H. 204, 205, 209n, 215
Milton, J. 301
Minkowski, H. 210n
Mittag-Leffler, G. 208n
Moerdijk, I. 68, 93n, 94n, 313
Molk, J. 193, 195, 196, 207n, 209n, 215
Montague, R. 28, 38n., 39, 41, 55, 100, 105
Montaigne, M.E. de 302
Mordell, L.J. 206
Moschovakis, Y.N. 131, 132
Mostowski, A. 41,55
Mulhern, M. 58, 59, 68
Miiller, J. 288, 295n
Musken, R., 38n, 39
Nagel, E. 293, 296n
Nelson, E. 117, 118, 122
Neumann, O. 213
Newton, I. xi, 262, 272n, 297
Noether, E. 189, 194
Nordin, I. 224, 226n
Ockham, W. of 57,68,77
Olschki, L. 306n
Ovid 300
Pappus 300
Parmenides 72, 75
Parry, W.T. 100, 101, 105
Pasch, M. 138
Pastino, L. 21n
Paty, M. 223, 226n
Patzig, G. 186
Pauli, W. 221
Peano, G. 115, 116
Pearle, P. 239, 241
Peebles, P. 272
Pelham, J. x, xi, 123, 131, 132n, 132, 312
Penzias, A. 262, 273
Perry, J. 62, 67
Philo (of Alexandria) 69
Philo (of Byzentum) 299, 302
Pietroski, P.M. xi, 263, 272n, 273, 312
Pitts, A. 93n
Plato 74, 109, 299
Pliny 300
Plutarch 299, 306n
Podolsky, B. 217, 226n
Poincare, H. 116,122,138,171, 175n, 180n,
186, 201, 206, 210n, 211n, 215, 249,
257n
NAME INDEX
Popper, K.P. ix, 14,224
Post, E. 146, 151, 177n, 186
Pringsheim, A. 195, 209n, 215
Purkert, W. 179n, 186, 2IOn, 213
Putnam, H. xi, 181, 182,275-285, 286n, 286,
310
Pythagoras 73, 74
Quine, W.v. ix, 3, 15, 21n, 22, 27, 38n, 39,
77, 175n, 285, 286
Ramelli, A. 301
Ramsey, F.P. 24, 209n, 259, 260, 263,
265-267, 271, 273, 292
Rang, B. 186
Rasiowa, H. 94n
Raven, J.E. 207n, 214
Redhead, M.L.G. 24On, 240
Reed, D. 211n, 212n
Reid, C. 180n, 186
Rescher, N. 89, 94n
Rey, G. 263, 272n
Reyes, G.E. x, 68, 93, 313
Rheticus, J. 305
Ribet, K. 211n
Richard, J. 160-162, 165-167, 180n, 186
Riemann, B. 154, 190, 195, 197, 198, 203,
207, 209n, 295n
Rimini, A. 239, 241
Roch, G. 203, 207
Rodabaugh, S.E. 93n
Roeper, P. ix, 17, 21n
Roger, G. 226n
Rohrlich, F. 224, 226n
Rosen, M. 121, 122
Rosen, N. 217, 226n
Rossi, P. 306n
Russell, B. xi, 18n, 22, 24, 96, 123-131,
132n, 132, 135, 138-143, 154, 155, 168,
175n, 176n, 181n, 186, 194, 211n, 215,
312
Sapir, E. 77
Scharlau, W. 177n, 186
Schelling, F.W.J. von 287
Schiffer, S. 261, 273
Schlick, M. 178n, 186
Schneider, T. 180n
Schoenflies, A. 208n, 215
Schofield, M. 207n, 214
SchOnberg, M. xi, 217n
SchrOder, E. 155, 186
SchrOdinger, E. 222, 223, 226, 227n, 229,
238, 239
319
320
NAME INDEX
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
15.
16.
17.
18.
19.
20.
21.
22.
23.
24.
25.
26.
27.
28.
R.S. Cohen and M.W. Wartofsky (eds.): Methodological and Historical Essays
in the Natural and Social Sciences. Proceedings of the Boston Colloquium for
the Philosophy of Science, 1969/72, Part II. [Synthese Library 60] 1974
ISBN 90-277-0392-2; Pb 90-277-0378-7
R.S. Cohen, 1.1. Stache1 and M.W. Wartofsky (eds.): For Dirk Struik.
Scientific, Historical and Political Essays in Honor of Dirk I. Struik. [Synthese
Library 61] 1974
ISBN 90-277-0393-0; Pb 90-277-0379-5
N. Geschwind: Selected Papers on Language and the Brains. [Synthese Library
68] 1974
ISBN 90-277-0262-4; Pb 90-277-0263-2
B.G. Kuznetsov: Reason and Being. Translated from Russian. Edited by C.R.
Fawcett and R.S. Cohen. 1987
ISBN 90-277-2181-5
P. Mittelstaedt: Philosophical Problems of Modern Physics. Translated from
the revised 4th German edition by W. Riemer and edited by R.S. Cohen.
[Synthese Library 95] 1976
ISBN 90-277-0285-3; Pb 90-277-0506-2
H. Mehlberg: Time, Causality, and the Quantum Theory. Studies in the
Philosophy of Science. Vol. I: Essay on the Causal Theory of Time. Vol. II:
Time in a Quantized Universe. Translated from French. Edited by R.S. Cohen.
1980
Vol. I: ISBN 90-277-0721-9; Pb 90-277-1074-0
Vol. II: ISBN 90-277-1075-9; Pb 90-277-1076-7
K.F. Schaffner and R.S. Cohen (eds.): PSA 1972. Proceedings of the 3rd
Biennial Meeting of the Philosophy of Science Association (Lansing,
Michigan, Fall 1972). [Synthese Library 64] 1974
ISBN 90-277-0408-2; Pb 90-277-0409-0
R.S. Cohen and J.J. Stachel (eds.): Selected Papers of Leon Rosenfeld.
[Synthese Library 100] 1979
ISBN 90-277-0651-4; Pb 90-277-0652-2
M. Capek (ed.): The Concepts of Space and Time. Their Structure and Their
Development. [Synthese Library 74] 1976
ISBN 90-277-0355-8; Pb 90-277-0375-2
M. Grene: The Understanding of Nature. Essays in the Philosophy of Biology.
[Synthese Library 66] 1974
ISBN 90-277-0462-7; Pb 90-277-0463-5
D. Ihde: Technics and Praxis. A Philosophy of Technology. [Synthese Library
130] 1979
ISBN 90-277-0953-X; Pb 90-277-0954-8
l Hintikka and U. Remes: The Method of Analysis. Its Geometrical Origin and
Its General Significance. [Synthese Library 75] 1974
ISBN 90-277-0532-1; Pb 90-277-0543-7
lE. Murdoch and E.D. Sylla (eds.): The Cultural Context of Medieval
Learning. Proceedings of the First International Colloquium on Philosophy,
Science, and Theology in the Middle Ages, 1973. [Synthese Library 76] 1975
ISBN 90-277-0560-7; Pb 90-277-0587-9
M. Grene and E. Mendelsohn (eds.): Topics in the Philosophy of Biology.
[Synthese Library 84] 1976
ISBN 90-277-0595-X; Pb 90-277-0596-8
I. Agassi: Science in Flux. [Synthese Library 80] 1975
ISBN 90-277-0584-4; Pb 90-277-0612-3
33.
34.
35.
36.
37.
38.
39.
40.
41.
42.
43.
44.
45.
46.
J.J. Wiatr (ed.): Polish Essays in the Methodology of the Social Sciences.
[Synthese Library 131] 1979
ISBN 90-277-0723-5; Pb 90-277-0956-4
P. Janich: Protophysics of Time. Constructive Foundation and History of Time
ISBN 90-277-0724-3
Measurement. Translated from German. 1985
R.S. Cohen and M.W. Wartofsky (eds.): Language, Logic, and Method. 1983
ISBN 90-277-0725-1
R.S. Cohen, C.A. Hooker, A.C. Michalos and lW. van Evra (eds.): PSA 1974.
Proceedings of the 4th Biennial Meeting of the Philosophy of Science
Association. [Synthese Library 101] 1976
ISBN 90-277-0647-6; Pb 90-277-0648-4
G. Holton and W.A. Blanpied (eds.): Science and Its Public. The Changing
Relationship. [Synthese Library 96] 1976
ISBN 90-277-0657-3; Pb 90-277-0658-1
M.D. Grmek, R.S. Cohen and G. Cimino (eds.): On Scientific Discovery. The
1977 Erice Lectures. 1981
ISBN 90-277-1122-4; Pb 90-277-1123-2
S. Amsterdamski: Between Experience and Metaphysics. Philosophical
Problems of the Evolution of Science. Translated from Polish. [Synthese
Library 77] 1975
ISBN 90-277-0568-2; Pb 90-277-0580-1
M. Markovic and G. Petrovic (eds.): Praxis. Yugoslav Essays in the Philosophy
and Methodology of the Social Sciences. [Synthese Library 134] 1979
ISBN 90-277-0727-8; Pb 90-277-0968-8
H. von Helmholtz: Epistemological Writings. The Paul Hertz / Moritz Schlick
Centenary Edition of 1921. Translated from German by M.F. Lowe. Edited
with an Introduction and Bibliography by R.S. Cohen and Y. Elkana. [Synthese
Library 79] 1977
ISBN 90-277-0290-X; Pb 90-277-0582-8
R.M. Martin: Pragmatics, Truth and Language. 1979
ISBN 90-277-0992-0; Pb 90-277-0993-9
R.S. Cohen, P.K. Feyerabend and M.W. Wartofsky (eds.): Essays in Memory of
Imre Lakatos. [Synthese Library 99] 1976
ISBN 90-277-0654-9; Pb 90-277-0655-7
Not published.
Not published.
H.R. Maturana and F.J. Varela: Autopoiesis and Cognition. The Realization of
the Living. With a Preface to' Autopoiesis' by S. Beer. 1980
ISBN 90-277-1015-5; Pb 90-277-1016-3
A. Kasher (ed.): Language in Focus: Foundations, Methods and Systems.
Essays in Memory ofYehoshua Bar-Hillel. [Synthese Library 89] 1976
ISBN 90-277-0644-1; Pb 90-277-0645-X
T.D. Thao: Investigations into the Origin of Language and Consciousness.
1984
ISBN 90-277-0827-4
Not published.
P.L. Kapitza: Experiment, Theory, Practice. Articles and Addresses. Edited by
R.S. Cohen. 1980
ISBN 90-277-1061-9; Pb 90-277-1062-7
51.
52.
53.
54.
55.
56.
57.
58.
59.
60.
61.
62.
63.
64.
65.
66.
67.
M.L. Dalla Chiara (ed.): Italian Studies in the Philosophy of Science. 1981
ISBN 90-277-0735-9; Pb 90-277-1073-2
M.W. Wartofsky: Models. Representation and the Scientific Understanding.
[Synthese Library 129] 1979
ISBN 90-277-0736-7; Pb 90-277-0947-5
T.D. Thao: Phenomenology and Dialectical Materialism. Edited by R.S.
Cohen. 1986
ISBN 90-277-0737-5
Y. Fried and J. Agassi: Paranoia. A Study in Diagnosis. [Synthese Library
102] 1976
ISBN 90-277-0704-9; Pb 90-277-0705-7
K.H. Wolff: Surrender and Cath. Experience and Inquiry Today. [Synthese
ISBN 90-277-0758-8; Pb 90-277-0765-0
Library 105] 1976
K. Kosik: Dialectics of the Concrete. A Study on Problems of Man and World.
1976
ISBN 90-277-0761-8; Pb 90-277-0764-2
N. Goodman: The Structure ofAppearance. [Synthese Library 107] 1977
ISBN 90-277-0773-1; Pb 90-277-0774-X
H.A. Simon: Models of Discovery and Other Topics in the Methods of Science.
ISBN 90-277-0812-6; Pb 90-277-0858-4
[Synthese Library 114] 1977
M. Lazerowitz: The Language of Philosophy. Freud and Wittgenstein.
ISBN 90-277-0826-6; Pb 90-277-0862-2
[Synthese Library 117] 1977
T. Nickles (ed.): Scientific Discovery, Logic, and Rationality. 1980
ISBN 90-277-1069-4; Pb 90-277-1070-8
J. Margolis: Persons and Mind. The Prospects of Nonreductive Materialism.
ISBN 90-277-0854-1; Pb 90-277-0863-0
[Synthese Library 121] 1978
G. Radnitzky and G. Andersson (eds.): Progress and Rationality in Science.
[Synthese Library 125] 1978
ISBN 90-277-0921-1; Pb 90-277-0922-X
G. Radnitzky and G. Andersson (eds.): The Structure and Development of
Science. [Synthese Library 136] 1979 ISBN 90-277-0994-7; Pb 90-277-0995-5
T. Nickles (ed.): Scientific Discovery. Case Studies. 1980
ISBN 90-277-1092-9; Pb 90-277-1093-7
M.A. Finocchiaro: Galileo and the Art of Reasoning. Rhetorical Foundation of
Logic and Scientific Method. 1980
ISBN 90-277-1094-5; Pb 90-277-1095-3
W.A. Wallace: Prelude to Galileo. Essays on Medieval and 16th-Century
Sources of Galileo's Thought. 1981 ISBN 90-277-1215-8; Pb 90-277-1216-6
F. Rapp: Analytical Philosophy of Technology. Translated from German. 1981
ISBN 90-277-1221-2; Pb 90-277-1222-0
R.S. Cohen and M.W. Wartofsky (eds.): Hegel and the Sciences. 1984
ISBN 90-277-0726-X
J. Agassi: Science and Society. Studies in the Sociology of Science. 1981
ISBN 90-277-1244-1; Pb 90-277-1245-X
L. Tondl: Problems of Semantics. A Contribution to the Analysis of the
Language of Science. Translated from Czech. 1981
ISBN 90-277-0148-2; Pb 90-277-0316-7
J. Agassi and R.S. Cohen (eds.): Scientific Philosophy Today. Essays in Honor
of Mario Bunge. 1982
ISBN 90-277-1262-X; Pb 90-277-1263-8
Also of interest:
R.S. Cohen and M.W. Wartofsky (eds.): A Portrait of Twenty-Five Years Boston
Colloquia for the Philosophy of Science, 1960-1985. 1985
ISBN Pb 90-277-1971-3
Previous volumes are still available.
KLUWER ACADEMIC PUBLISHERS - DORDRECHT / BOSTON / LONDON